Westreich, Daniel; Lessler, Justin; Funk, Michele Jonsson
2010-08-01
Propensity scores for the analysis of observational data are typically estimated using logistic regression. Our objective in this review was to assess machine learning alternatives to logistic regression, which may accomplish the same goals but with fewer assumptions or greater accuracy. We identified alternative methods for propensity score estimation and/or classification from the public health, biostatistics, discrete mathematics, and computer science literature, and evaluated these algorithms for applicability to the problem of propensity score estimation, potential advantages over logistic regression, and ease of use. We identified four techniques as alternatives to logistic regression: neural networks, support vector machines, decision trees (classification and regression trees [CART]), and meta-classifiers (in particular, boosting). Although the assumptions of logistic regression are well understood, those assumptions are frequently ignored. All four alternatives have advantages and disadvantages compared with logistic regression. Boosting (meta-classifiers) and, to a lesser extent, decision trees (particularly CART), appear to be most promising for use in the context of propensity score analysis, but extensive simulation studies are needed to establish their utility in practice. Copyright (c) 2010 Elsevier Inc. All rights reserved.
Predicting U.S. Army Reserve Unit Manning Using Market Demographics
2015-06-01
develops linear regression , classification tree, and logistic regression models to determine the ability of the location to support manning requirements... logistic regression model delivers predictive results that allow decision-makers to identify locations with a high probability of meeting unit...manning requirements. The recommendation of this thesis is that the USAR implement the logistic regression model. 14. SUBJECT TERMS U.S
No rationale for 1 variable per 10 events criterion for binary logistic regression analysis.
van Smeden, Maarten; de Groot, Joris A H; Moons, Karel G M; Collins, Gary S; Altman, Douglas G; Eijkemans, Marinus J C; Reitsma, Johannes B
2016-11-24
Ten events per variable (EPV) is a widely advocated minimal criterion for sample size considerations in logistic regression analysis. Of three previous simulation studies that examined this minimal EPV criterion only one supports the use of a minimum of 10 EPV. In this paper, we examine the reasons for substantial differences between these extensive simulation studies. The current study uses Monte Carlo simulations to evaluate small sample bias, coverage of confidence intervals and mean square error of logit coefficients. Logistic regression models fitted by maximum likelihood and a modified estimation procedure, known as Firth's correction, are compared. The results show that besides EPV, the problems associated with low EPV depend on other factors such as the total sample size. It is also demonstrated that simulation results can be dominated by even a few simulated data sets for which the prediction of the outcome by the covariates is perfect ('separation'). We reveal that different approaches for identifying and handling separation leads to substantially different simulation results. We further show that Firth's correction can be used to improve the accuracy of regression coefficients and alleviate the problems associated with separation. The current evidence supporting EPV rules for binary logistic regression is weak. Given our findings, there is an urgent need for new research to provide guidance for supporting sample size considerations for binary logistic regression analysis.
Westreich, Daniel; Lessler, Justin; Funk, Michele Jonsson
2010-01-01
Summary Objective Propensity scores for the analysis of observational data are typically estimated using logistic regression. Our objective in this Review was to assess machine learning alternatives to logistic regression which may accomplish the same goals but with fewer assumptions or greater accuracy. Study Design and Setting We identified alternative methods for propensity score estimation and/or classification from the public health, biostatistics, discrete mathematics, and computer science literature, and evaluated these algorithms for applicability to the problem of propensity score estimation, potential advantages over logistic regression, and ease of use. Results We identified four techniques as alternatives to logistic regression: neural networks, support vector machines, decision trees (CART), and meta-classifiers (in particular, boosting). Conclusion While the assumptions of logistic regression are well understood, those assumptions are frequently ignored. All four alternatives have advantages and disadvantages compared with logistic regression. Boosting (meta-classifiers) and to a lesser extent decision trees (particularly CART) appear to be most promising for use in the context of propensity score analysis, but extensive simulation studies are needed to establish their utility in practice. PMID:20630332
Held, Elizabeth; Cape, Joshua; Tintle, Nathan
2016-01-01
Machine learning methods continue to show promise in the analysis of data from genetic association studies because of the high number of variables relative to the number of observations. However, few best practices exist for the application of these methods. We extend a recently proposed supervised machine learning approach for predicting disease risk by genotypes to be able to incorporate gene expression data and rare variants. We then apply 2 different versions of the approach (radial and linear support vector machines) to simulated data from Genetic Analysis Workshop 19 and compare performance to logistic regression. Method performance was not radically different across the 3 methods, although the linear support vector machine tended to show small gains in predictive ability relative to a radial support vector machine and logistic regression. Importantly, as the number of genes in the models was increased, even when those genes contained causal rare variants, model predictive ability showed a statistically significant decrease in performance for both the radial support vector machine and logistic regression. The linear support vector machine showed more robust performance to the inclusion of additional genes. Further work is needed to evaluate machine learning approaches on larger samples and to evaluate the relative improvement in model prediction from the incorporation of gene expression data.
ERIC Educational Resources Information Center
West, Lindsey M.; Davis, Telsie A.; Thompson, Martie P.; Kaslow, Nadine J.
2011-01-01
Protective factors for fostering reasons for living were examined among low-income, suicidal, African American women. Bivariate logistic regressions revealed that higher levels of optimism, spiritual well-being, and family social support predicted reasons for living. Multivariate logistic regressions indicated that spiritual well-being showed…
Stylianou, Neophytos; Akbarov, Artur; Kontopantelis, Evangelos; Buchan, Iain; Dunn, Ken W
2015-08-01
Predicting mortality from burn injury has traditionally employed logistic regression models. Alternative machine learning methods have been introduced in some areas of clinical prediction as the necessary software and computational facilities have become accessible. Here we compare logistic regression and machine learning predictions of mortality from burn. An established logistic mortality model was compared to machine learning methods (artificial neural network, support vector machine, random forests and naïve Bayes) using a population-based (England & Wales) case-cohort registry. Predictive evaluation used: area under the receiver operating characteristic curve; sensitivity; specificity; positive predictive value and Youden's index. All methods had comparable discriminatory abilities, similar sensitivities, specificities and positive predictive values. Although some machine learning methods performed marginally better than logistic regression the differences were seldom statistically significant and clinically insubstantial. Random forests were marginally better for high positive predictive value and reasonable sensitivity. Neural networks yielded slightly better prediction overall. Logistic regression gives an optimal mix of performance and interpretability. The established logistic regression model of burn mortality performs well against more complex alternatives. Clinical prediction with a small set of strong, stable, independent predictors is unlikely to gain much from machine learning outside specialist research contexts. Copyright © 2015 Elsevier Ltd and ISBI. All rights reserved.
NASA Astrophysics Data System (ADS)
Pradhan, Biswajeet
2010-05-01
This paper presents the results of the cross-validation of a multivariate logistic regression model using remote sensing data and GIS for landslide hazard analysis on the Penang, Cameron, and Selangor areas in Malaysia. Landslide locations in the study areas were identified by interpreting aerial photographs and satellite images, supported by field surveys. SPOT 5 and Landsat TM satellite imagery were used to map landcover and vegetation index, respectively. Maps of topography, soil type, lineaments and land cover were constructed from the spatial datasets. Ten factors which influence landslide occurrence, i.e., slope, aspect, curvature, distance from drainage, lithology, distance from lineaments, soil type, landcover, rainfall precipitation, and normalized difference vegetation index (ndvi), were extracted from the spatial database and the logistic regression coefficient of each factor was computed. Then the landslide hazard was analysed using the multivariate logistic regression coefficients derived not only from the data for the respective area but also using the logistic regression coefficients calculated from each of the other two areas (nine hazard maps in all) as a cross-validation of the model. For verification of the model, the results of the analyses were then compared with the field-verified landslide locations. Among the three cases of the application of logistic regression coefficient in the same study area, the case of Selangor based on the Selangor logistic regression coefficients showed the highest accuracy (94%), where as Penang based on the Penang coefficients showed the lowest accuracy (86%). Similarly, among the six cases from the cross application of logistic regression coefficient in other two areas, the case of Selangor based on logistic coefficient of Cameron showed highest (90%) prediction accuracy where as the case of Penang based on the Selangor logistic regression coefficients showed the lowest accuracy (79%). Qualitatively, the cross application model yields reasonable results which can be used for preliminary landslide hazard mapping.
Neural network modeling for surgical decisions on traumatic brain injury patients.
Li, Y C; Liu, L; Chiu, W T; Jian, W S
2000-01-01
Computerized medical decision support systems have been a major research topic in recent years. Intelligent computer programs were implemented to aid physicians and other medical professionals in making difficult medical decisions. This report compares three different mathematical models for building a traumatic brain injury (TBI) medical decision support system (MDSS). These models were developed based on a large TBI patient database. This MDSS accepts a set of patient data such as the types of skull fracture, Glasgow Coma Scale (GCS), episode of convulsion and return the chance that a neurosurgeon would recommend an open-skull surgery for this patient. The three mathematical models described in this report including a logistic regression model, a multi-layer perceptron (MLP) neural network and a radial-basis-function (RBF) neural network. From the 12,640 patients selected from the database. A randomly drawn 9480 cases were used as the training group to develop/train our models. The other 3160 cases were in the validation group which we used to evaluate the performance of these models. We used sensitivity, specificity, areas under receiver-operating characteristics (ROC) curve and calibration curves as the indicator of how accurate these models are in predicting a neurosurgeon's decision on open-skull surgery. The results showed that, assuming equal importance of sensitivity and specificity, the logistic regression model had a (sensitivity, specificity) of (73%, 68%), compared to (80%, 80%) from the RBF model and (88%, 80%) from the MLP model. The resultant areas under ROC curve for logistic regression, RBF and MLP neural networks are 0.761, 0.880 and 0.897, respectively (P < 0.05). Among these models, the logistic regression has noticeably poorer calibration. This study demonstrated the feasibility of applying neural networks as the mechanism for TBI decision support systems based on clinical databases. The results also suggest that neural networks may be a better solution for complex, non-linear medical decision support systems than conventional statistical techniques such as logistic regression.
Zlotnik, Alexander; Alfaro, Miguel Cuchí; Pérez, María Carmen Pérez; Gallardo-Antolín, Ascensión; Martínez, Juan Manuel Montero
2016-05-01
The usage of decision support tools in emergency departments, based on predictive models, capable of estimating the probability of admission for patients in the emergency department may give nursing staff the possibility of allocating resources in advance. We present a methodology for developing and building one such system for a large specialized care hospital using a logistic regression and an artificial neural network model using nine routinely collected variables available right at the end of the triage process.A database of 255.668 triaged nonobstetric emergency department presentations from the Ramon y Cajal University Hospital of Madrid, from January 2011 to December 2012, was used to develop and test the models, with 66% of the data used for derivation and 34% for validation, with an ordered nonrandom partition. On the validation dataset areas under the receiver operating characteristic curve were 0.8568 (95% confidence interval, 0.8508-0.8583) for the logistic regression model and 0.8575 (95% confidence interval, 0.8540-0. 8610) for the artificial neural network model. χ Values for Hosmer-Lemeshow fixed "deciles of risk" were 65.32 for the logistic regression model and 17.28 for the artificial neural network model. A nomogram was generated upon the logistic regression model and an automated software decision support system with a Web interface was built based on the artificial neural network model.
Chen, Chau-Kuang; Bruce, Michelle; Tyler, Lauren; Brown, Claudine; Garrett, Angelica; Goggins, Susan; Lewis-Polite, Brandy; Weriwoh, Mirabel L; Juarez, Paul D.; Hood, Darryl B.; Skelton, Tyler
2014-01-01
The goal of this study was to analyze a 54-item instrument for assessment of perception of exposure to environmental contaminants within the context of the built environment, or exposome. This exposome was defined in five domains to include 1) home and hobby, 2) school, 3) community, 4) occupation, and 5) exposure history. Interviews were conducted with child-bearing-age minority women at Metro Nashville General Hospital at Meharry Medical College. Data were analyzed utilizing DTReg software for Support Vector Machine (SVM) modeling followed by an SPSS package for a logistic regression model. The target (outcome) variable of interest was respondent's residence by ZIP code. The results demonstrate that the rank order of important variables with respect to SVM modeling versus traditional logistic regression models is almost identical. This is the first study documenting that SVM analysis has discriminate power for determination of higher-ordered spatial relationships on an environmental exposure history questionnaire. PMID:23395953
Chen, Chau-Kuang; Bruce, Michelle; Tyler, Lauren; Brown, Claudine; Garrett, Angelica; Goggins, Susan; Lewis-Polite, Brandy; Weriwoh, Mirabel L; Juarez, Paul D; Hood, Darryl B; Skelton, Tyler
2013-02-01
The goal of this study was to analyze a 54-item instrument for assessment of perception of exposure to environmental contaminants within the context of the built environment, or exposome. This exposome was defined in five domains to include 1) home and hobby, 2) school, 3) community, 4) occupation, and 5) exposure history. Interviews were conducted with child-bearing-age minority women at Metro Nashville General Hospital at Meharry Medical College. Data were analyzed utilizing DTReg software for Support Vector Machine (SVM) modeling followed by an SPSS package for a logistic regression model. The target (outcome) variable of interest was respondent's residence by ZIP code. The results demonstrate that the rank order of important variables with respect to SVM modeling versus traditional logistic regression models is almost identical. This is the first study documenting that SVM analysis has discriminate power for determination of higher-ordered spatial relationships on an environmental exposure history questionnaire.
Sebire, Simon J; Haase, Anne M; Montgomery, Alan A; McNeill, Jade; Jago, Russ
2014-05-01
The current study investigated cross-sectional associations between maternal and paternal logistic and modeling physical activity support and the self-efficacy, self-esteem, and physical activity intentions of 11- to 12-year-old girls. 210 girls reported perceptions of maternal and paternal logistic and modeling support and their self-efficacy, self-esteem and intention to be physically active. Data were analyzed using multivariable regression models. Maternal logistic support was positively associated with participants' self-esteem, physical activity self-efficacy, and intention to be active. Maternal modeling was positively associated with self-efficacy. Paternal modeling was positively associated with self-esteem and self-efficacy but there was no evidence that paternal logistic support was associated with the psychosocial variables. Activity-related parenting practices were associated with psychosocial correlates of physical activity among adolescent girls. Logistic support from mothers, rather than modeling support or paternal support may be a particularly important target when designing interventions aimed at preventing the age-related decline in physical activity among girls.
Li, Ji; Gray, B.R.; Bates, D.M.
2008-01-01
Partitioning the variance of a response by design levels is challenging for binomial and other discrete outcomes. Goldstein (2003) proposed four definitions for variance partitioning coefficients (VPC) under a two-level logistic regression model. In this study, we explicitly derived formulae for multi-level logistic regression model and subsequently studied the distributional properties of the calculated VPCs. Using simulations and a vegetation dataset, we demonstrated associations between different VPC definitions, the importance of methods for estimating VPCs (by comparing VPC obtained using Laplace and penalized quasilikehood methods), and bivariate dependence between VPCs calculated at different levels. Such an empirical study lends an immediate support to wider applications of VPC in scientific data analysis.
Goo, Yeong-Jia James; Shen, Zone-De
2014-01-01
As the fraudulent financial statement of an enterprise is increasingly serious with each passing day, establishing a valid forecasting fraudulent financial statement model of an enterprise has become an important question for academic research and financial practice. After screening the important variables using the stepwise regression, the study also matches the logistic regression, support vector machine, and decision tree to construct the classification models to make a comparison. The study adopts financial and nonfinancial variables to assist in establishment of the forecasting fraudulent financial statement model. Research objects are the companies to which the fraudulent and nonfraudulent financial statement happened between years 1998 to 2012. The findings are that financial and nonfinancial information are effectively used to distinguish the fraudulent financial statement, and decision tree C5.0 has the best classification effect 85.71%. PMID:25302338
Chen, Suduan; Goo, Yeong-Jia James; Shen, Zone-De
2014-01-01
As the fraudulent financial statement of an enterprise is increasingly serious with each passing day, establishing a valid forecasting fraudulent financial statement model of an enterprise has become an important question for academic research and financial practice. After screening the important variables using the stepwise regression, the study also matches the logistic regression, support vector machine, and decision tree to construct the classification models to make a comparison. The study adopts financial and nonfinancial variables to assist in establishment of the forecasting fraudulent financial statement model. Research objects are the companies to which the fraudulent and nonfraudulent financial statement happened between years 1998 to 2012. The findings are that financial and nonfinancial information are effectively used to distinguish the fraudulent financial statement, and decision tree C5.0 has the best classification effect 85.71%.
Wang, Shuang; Jiang, Xiaoqian; Wu, Yuan; Cui, Lijuan; Cheng, Samuel; Ohno-Machado, Lucila
2013-01-01
We developed an EXpectation Propagation LOgistic REgRession (EXPLORER) model for distributed privacy-preserving online learning. The proposed framework provides a high level guarantee for protecting sensitive information, since the information exchanged between the server and the client is the encrypted posterior distribution of coefficients. Through experimental results, EXPLORER shows the same performance (e.g., discrimination, calibration, feature selection etc.) as the traditional frequentist Logistic Regression model, but provides more flexibility in model updating. That is, EXPLORER can be updated one point at a time rather than having to retrain the entire data set when new observations are recorded. The proposed EXPLORER supports asynchronized communication, which relieves the participants from coordinating with one another, and prevents service breakdown from the absence of participants or interrupted communications. PMID:23562651
Classifying machinery condition using oil samples and binary logistic regression
NASA Astrophysics Data System (ADS)
Phillips, J.; Cripps, E.; Lau, John W.; Hodkiewicz, M. R.
2015-08-01
The era of big data has resulted in an explosion of condition monitoring information. The result is an increasing motivation to automate the costly and time consuming human elements involved in the classification of machine health. When working with industry it is important to build an understanding and hence some trust in the classification scheme for those who use the analysis to initiate maintenance tasks. Typically "black box" approaches such as artificial neural networks (ANN) and support vector machines (SVM) can be difficult to provide ease of interpretability. In contrast, this paper argues that logistic regression offers easy interpretability to industry experts, providing insight to the drivers of the human classification process and to the ramifications of potential misclassification. Of course, accuracy is of foremost importance in any automated classification scheme, so we also provide a comparative study based on predictive performance of logistic regression, ANN and SVM. A real world oil analysis data set from engines on mining trucks is presented and using cross-validation we demonstrate that logistic regression out-performs the ANN and SVM approaches in terms of prediction for healthy/not healthy engines.
Role of social support in adolescent suicidal ideation and suicide attempts.
Miller, Adam Bryant; Esposito-Smythers, Christianne; Leichtweis, Richard N
2015-03-01
The present study examined the relative contributions of perceptions of social support from parents, close friends, and school on current suicidal ideation (SI) and suicide attempt (SA) history in a clinical sample of adolescents. Participants were 143 adolescents (64% female; 81% white; range, 12-18 years; M = 15.38; standard deviation = 1.43) admitted to a partial hospitalization program. Data were collected with well-validated assessments and a structured clinical interview. Main and interactive effects of perceptions of social support on SI were tested with linear regression. Main and interactive effects of social support on the odds of SA were tested with logistic regression. Results from the linear regression analysis revealed that perceptions of lower school support independently predicted greater severity of SI, accounting for parent and close friend support. Further, the relationship between lower perceived school support and SI was the strongest among those who perceived lower versus higher parental support. Results from the logistic regression analysis revealed that perceptions of lower parental support independently predicted SA history, accounting for school and close friend support. Further, those who perceived lower support from school and close friends reported the greatest odds of an SA history. Results address a significant gap in the social support and suicide literature by demonstrating that perceptions of parent and school support are relatively more important than peer support in understanding suicidal thoughts and history of suicidal behavior. Results suggest that improving social support across these domains may be important in suicide prevention efforts. Copyright © 2015 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.
New machine-learning algorithms for prediction of Parkinson's disease
NASA Astrophysics Data System (ADS)
Mandal, Indrajit; Sairam, N.
2014-03-01
This article presents an enhanced prediction accuracy of diagnosis of Parkinson's disease (PD) to prevent the delay and misdiagnosis of patients using the proposed robust inference system. New machine-learning methods are proposed and performance comparisons are based on specificity, sensitivity, accuracy and other measurable parameters. The robust methods of treating Parkinson's disease (PD) includes sparse multinomial logistic regression, rotation forest ensemble with support vector machines and principal components analysis, artificial neural networks, boosting methods. A new ensemble method comprising of the Bayesian network optimised by Tabu search algorithm as classifier and Haar wavelets as projection filter is used for relevant feature selection and ranking. The highest accuracy obtained by linear logistic regression and sparse multinomial logistic regression is 100% and sensitivity, specificity of 0.983 and 0.996, respectively. All the experiments are conducted over 95% and 99% confidence levels and establish the results with corrected t-tests. This work shows a high degree of advancement in software reliability and quality of the computer-aided diagnosis system and experimentally shows best results with supportive statistical inference.
Zhu, K; Lou, Z; Zhou, J; Ballester, N; Kong, N; Parikh, P
2015-01-01
This article is part of the Focus Theme of Methods of Information in Medicine on "Big Data and Analytics in Healthcare". Hospital readmissions raise healthcare costs and cause significant distress to providers and patients. It is, therefore, of great interest to healthcare organizations to predict what patients are at risk to be readmitted to their hospitals. However, current logistic regression based risk prediction models have limited prediction power when applied to hospital administrative data. Meanwhile, although decision trees and random forests have been applied, they tend to be too complex to understand among the hospital practitioners. Explore the use of conditional logistic regression to increase the prediction accuracy. We analyzed an HCUP statewide inpatient discharge record dataset, which includes patient demographics, clinical and care utilization data from California. We extracted records of heart failure Medicare beneficiaries who had inpatient experience during an 11-month period. We corrected the data imbalance issue with under-sampling. In our study, we first applied standard logistic regression and decision tree to obtain influential variables and derive practically meaning decision rules. We then stratified the original data set accordingly and applied logistic regression on each data stratum. We further explored the effect of interacting variables in the logistic regression modeling. We conducted cross validation to assess the overall prediction performance of conditional logistic regression (CLR) and compared it with standard classification models. The developed CLR models outperformed several standard classification models (e.g., straightforward logistic regression, stepwise logistic regression, random forest, support vector machine). For example, the best CLR model improved the classification accuracy by nearly 20% over the straightforward logistic regression model. Furthermore, the developed CLR models tend to achieve better sensitivity of more than 10% over the standard classification models, which can be translated to correct labeling of additional 400 - 500 readmissions for heart failure patients in the state of California over a year. Lastly, several key predictor identified from the HCUP data include the disposition location from discharge, the number of chronic conditions, and the number of acute procedures. It would be beneficial to apply simple decision rules obtained from the decision tree in an ad-hoc manner to guide the cohort stratification. It could be potentially beneficial to explore the effect of pairwise interactions between influential predictors when building the logistic regression models for different data strata. Judicious use of the ad-hoc CLR models developed offers insights into future development of prediction models for hospital readmissions, which can lead to better intuition in identifying high-risk patients and developing effective post-discharge care strategies. Lastly, this paper is expected to raise the awareness of collecting data on additional markers and developing necessary database infrastructure for larger-scale exploratory studies on readmission risk prediction.
NASA Astrophysics Data System (ADS)
Kneringer, Philipp; Dietz, Sebastian; Mayr, Georg J.; Zeileis, Achim
2017-04-01
Low-visibility conditions have a large impact on aviation safety and economic efficiency of airports and airlines. To support decision makers, we develop a statistical probabilistic nowcasting tool for the occurrence of capacity-reducing operations related to low visibility. The probabilities of four different low visibility classes are predicted with an ordered logistic regression model based on time series of meteorological point measurements. Potential predictor variables for the statistical models are visibility, humidity, temperature and wind measurements at several measurement sites. A stepwise variable selection method indicates that visibility and humidity measurements are the most important model inputs. The forecasts are tested with a 30 minute forecast interval up to two hours, which is a sufficient time span for tactical planning at Vienna Airport. The ordered logistic regression models outperform persistence and are competitive with human forecasters.
Wang, Shuang; Jiang, Xiaoqian; Wu, Yuan; Cui, Lijuan; Cheng, Samuel; Ohno-Machado, Lucila
2013-06-01
We developed an EXpectation Propagation LOgistic REgRession (EXPLORER) model for distributed privacy-preserving online learning. The proposed framework provides a high level guarantee for protecting sensitive information, since the information exchanged between the server and the client is the encrypted posterior distribution of coefficients. Through experimental results, EXPLORER shows the same performance (e.g., discrimination, calibration, feature selection, etc.) as the traditional frequentist logistic regression model, but provides more flexibility in model updating. That is, EXPLORER can be updated one point at a time rather than having to retrain the entire data set when new observations are recorded. The proposed EXPLORER supports asynchronized communication, which relieves the participants from coordinating with one another, and prevents service breakdown from the absence of participants or interrupted communications. Copyright © 2013 Elsevier Inc. All rights reserved.
Ommen, Oliver; Thuem, Sonja; Pfaff, Holger; Janssen, Christian
2011-06-01
Empirical studies have confirmed that a trusting physician-patient interaction promotes patient satisfaction, adherence to treatment and improved health outcomes. The objective of this analysis was to investigate the relationship between social support, shared decision-making and inpatient's trust in physicians in a hospital setting. A written questionnaire was completed by 2,197 patients who were treated in the year 2000 in six hospitals in Germany. Logistic regression was performed with a dichotomized index for patient's trust in physicians. The logistic regression model identified significant relationships (p < 0.05) in terms of emotional support (standardized effect coefficient [sc], 3.65), informational support (sc, 1.70), shared decision-making (sc, 1.40), age (sc, 1.14), socioeconomic status (sc, 1.15) and gender (sc, 1.15). We found no significant relationship between 'tendency to excuse' and trust. The last regression model accounted for 49.1% of Nagelkerke's R-square. Insufficient physician communication skills can lead to extensive negative effects on the trust of patients in their physicians. Thus, it becomes clear that medical support requires not only biomedical, but also psychosocial skills.
Gender differences in social support and leisure-time physical activity.
Oliveira, Aldair J; Lopes, Claudia S; Rostila, Mikael; Werneck, Guilherme Loureiro; Griep, Rosane Härter; Leon, Antônio Carlos Monteiro Ponce de; Faerstein, Eduardo
2014-08-01
To identify gender differences in social support dimensions' effect on adults' leisure-time physical activity maintenance, type, and time. Longitudinal study of 1,278 non-faculty public employees at a university in Rio de Janeiro, RJ, Southeastern Brazil. Physical activity was evaluated using a dichotomous question with a two-week reference period, and further questions concerning leisure-time physical activity type (individual or group) and time spent on the activity. Social support was measured with the Medical Outcomes Study Social Support Scale. For the analysis, logistic regression models were adjusted separately by gender. A multinomial logistic regression showed an association between material support and individual activities among women (OR = 2.76; 95%CI 1.2;6.5). Affective support was associated with time spent on leisure-time physical activity only among men (OR = 1.80; 95%CI 1.1;3.2). All dimensions of social support that were examined influenced either the type of, or the time spent on, leisure-time physical activity. In some social support dimensions, the associations detected varied by gender. Future studies should attempt to elucidate the mechanisms involved in these gender differences.
Hsieh, Chung-Ho; Lu, Ruey-Hwa; Lee, Nai-Hsin; Chiu, Wen-Ta; Hsu, Min-Huei; Li, Yu-Chuan Jack
2011-01-01
Diagnosing acute appendicitis clinically is still difficult. We developed random forests, support vector machines, and artificial neural network models to diagnose acute appendicitis. Between January 2006 and December 2008, patients who had a consultation session with surgeons for suspected acute appendicitis were enrolled. Seventy-five percent of the data set was used to construct models including random forest, support vector machines, artificial neural networks, and logistic regression. Twenty-five percent of the data set was withheld to evaluate model performance. The area under the receiver operating characteristic curve (AUC) was used to evaluate performance, which was compared with that of the Alvarado score. Data from a total of 180 patients were collected, 135 used for training and 45 for testing. The mean age of patients was 39.4 years (range, 16-85). Final diagnosis revealed 115 patients with and 65 without appendicitis. The AUC of random forest, support vector machines, artificial neural networks, logistic regression, and Alvarado was 0.98, 0.96, 0.91, 0.87, and 0.77, respectively. The sensitivity, specificity, positive, and negative predictive values of random forest were 94%, 100%, 100%, and 87%, respectively. Random forest performed better than artificial neural networks, logistic regression, and Alvarado. We demonstrated that random forest can predict acute appendicitis with good accuracy and, deployed appropriately, can be an effective tool in clinical decision making. Copyright © 2011 Mosby, Inc. All rights reserved.
Hill, Andrew; Loh, Po-Ru; Bharadwaj, Ragu B.; Pons, Pascal; Shang, Jingbo; Guinan, Eva; Lakhani, Karim; Kilty, Iain
2017-01-01
Abstract Background: The association of differing genotypes with disease-related phenotypic traits offers great potential to both help identify new therapeutic targets and support stratification of patients who would gain the greatest benefit from specific drug classes. Development of low-cost genotyping and sequencing has made collecting large-scale genotyping data routine in population and therapeutic intervention studies. In addition, a range of new technologies is being used to capture numerous new and complex phenotypic descriptors. As a result, genotype and phenotype datasets have grown exponentially. Genome-wide association studies associate genotypes and phenotypes using methods such as logistic regression. As existing tools for association analysis limit the efficiency by which value can be extracted from increasing volumes of data, there is a pressing need for new software tools that can accelerate association analyses on large genotype-phenotype datasets. Results: Using open innovation (OI) and contest-based crowdsourcing, the logistic regression analysis in a leading, community-standard genetics software package (PLINK 1.07) was substantially accelerated. OI allowed us to do this in <6 months by providing rapid access to highly skilled programmers with specialized, difficult-to-find skill sets. Through a crowd-based contest a combination of computational, numeric, and algorithmic approaches was identified that accelerated the logistic regression in PLINK 1.07 by 18- to 45-fold. Combining contest-derived logistic regression code with coarse-grained parallelization, multithreading, and associated changes to data initialization code further developed through distributed innovation, we achieved an end-to-end speedup of 591-fold for a data set size of 6678 subjects by 645 863 variants, compared to PLINK 1.07's logistic regression. This represents a reduction in run time from 4.8 hours to 29 seconds. Accelerated logistic regression code developed in this project has been incorporated into the PLINK2 project. Conclusions: Using iterative competition-based OI, we have developed a new, faster implementation of logistic regression for genome-wide association studies analysis. We present lessons learned and recommendations on running a successful OI process for bioinformatics. PMID:28327993
Hill, Andrew; Loh, Po-Ru; Bharadwaj, Ragu B; Pons, Pascal; Shang, Jingbo; Guinan, Eva; Lakhani, Karim; Kilty, Iain; Jelinsky, Scott A
2017-05-01
The association of differing genotypes with disease-related phenotypic traits offers great potential to both help identify new therapeutic targets and support stratification of patients who would gain the greatest benefit from specific drug classes. Development of low-cost genotyping and sequencing has made collecting large-scale genotyping data routine in population and therapeutic intervention studies. In addition, a range of new technologies is being used to capture numerous new and complex phenotypic descriptors. As a result, genotype and phenotype datasets have grown exponentially. Genome-wide association studies associate genotypes and phenotypes using methods such as logistic regression. As existing tools for association analysis limit the efficiency by which value can be extracted from increasing volumes of data, there is a pressing need for new software tools that can accelerate association analyses on large genotype-phenotype datasets. Using open innovation (OI) and contest-based crowdsourcing, the logistic regression analysis in a leading, community-standard genetics software package (PLINK 1.07) was substantially accelerated. OI allowed us to do this in <6 months by providing rapid access to highly skilled programmers with specialized, difficult-to-find skill sets. Through a crowd-based contest a combination of computational, numeric, and algorithmic approaches was identified that accelerated the logistic regression in PLINK 1.07 by 18- to 45-fold. Combining contest-derived logistic regression code with coarse-grained parallelization, multithreading, and associated changes to data initialization code further developed through distributed innovation, we achieved an end-to-end speedup of 591-fold for a data set size of 6678 subjects by 645 863 variants, compared to PLINK 1.07's logistic regression. This represents a reduction in run time from 4.8 hours to 29 seconds. Accelerated logistic regression code developed in this project has been incorporated into the PLINK2 project. Using iterative competition-based OI, we have developed a new, faster implementation of logistic regression for genome-wide association studies analysis. We present lessons learned and recommendations on running a successful OI process for bioinformatics. © The Author 2017. Published by Oxford University Press.
Suzuki, Taku; Iwamoto, Takuji; Shizu, Kanae; Suzuki, Katsuji; Yamada, Harumoto; Sato, Kazuki
2017-05-01
This retrospective study was designed to investigate prognostic factors for postoperative outcomes for cubital tunnel syndrome (CubTS) using multiple logistic regression analysis with a large number of patients. Eighty-three patients with CubTS who underwent surgeries were enrolled. The following potential prognostic factors for disease severity were selected according to previous reports: sex, age, type of surgery, disease duration, body mass index, cervical lesion, presence of diabetes mellitus, Workers' Compensation status, preoperative severity, and preoperative electrodiagnostic testing. Postoperative severity of disease was assessed 2 years after surgery by Messina's criteria which is an outcome measure specifically for CubTS. Bivariate analysis was performed to select candidate prognostic factors for multiple linear regression analyses. Multiple logistic regression analysis was conducted to identify the association between postoperative severity and selected prognostic factors. Both bivariate and multiple linear regression analysis revealed only preoperative severity as an independent risk factor for poor prognosis, while other factors did not show any significant association. Although conflicting results exist regarding prognosis of CubTS, this study supports evidence from previous studies and concludes early surgical intervention portends the most favorable prognosis. Copyright © 2017 The Japanese Orthopaedic Association. Published by Elsevier B.V. All rights reserved.
Gender differences in social support and leisure-time physical activity
Oliveira, Aldair J; Lopes, Claudia S; Rostila, Mikael; Werneck, Guilherme Loureiro; Griep, Rosane Härter; de Leon, Antônio Carlos Monteiro Ponce; Faerstein, Eduardo
2014-01-01
OBJECTIVE To identify gender differences in social support dimensions’ effect on adults’ leisure-time physical activity maintenance, type, and time. METHODS Longitudinal study of 1,278 non-faculty public employees at a university in Rio de Janeiro, RJ, Southeastern Brazil. Physical activity was evaluated using a dichotomous question with a two-week reference period, and further questions concerning leisure-time physical activity type (individual or group) and time spent on the activity. Social support was measured with the Medical Outcomes Study Social Support Scale. For the analysis, logistic regression models were adjusted separately by gender. RESULTS A multinomial logistic regression showed an association between material support and individual activities among women (OR = 2.76; 95%CI 1.2;6.5). Affective support was associated with time spent on leisure-time physical activity only among men (OR = 1.80; 95%CI 1.1;3.2). CONCLUSIONS All dimensions of social support that were examined influenced either the type of, or the time spent on, leisure-time physical activity. In some social support dimensions, the associations detected varied by gender. Future studies should attempt to elucidate the mechanisms involved in these gender differences. PMID:25210819
NASA Astrophysics Data System (ADS)
Ariffin, Syaiba Balqish; Midi, Habshah
2014-06-01
This article is concerned with the performance of logistic ridge regression estimation technique in the presence of multicollinearity and high leverage points. In logistic regression, multicollinearity exists among predictors and in the information matrix. The maximum likelihood estimator suffers a huge setback in the presence of multicollinearity which cause regression estimates to have unduly large standard errors. To remedy this problem, a logistic ridge regression estimator is put forward. It is evident that the logistic ridge regression estimator outperforms the maximum likelihood approach for handling multicollinearity. The effect of high leverage points are then investigated on the performance of the logistic ridge regression estimator through real data set and simulation study. The findings signify that logistic ridge regression estimator fails to provide better parameter estimates in the presence of both high leverage points and multicollinearity.
Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation
Song, Yongsoo; Wang, Shuang; Xia, Yuhou; Jiang, Xiaoqian
2018-01-01
Background Learning a model without accessing raw data has been an intriguing idea to security and machine learning researchers for years. In an ideal setting, we want to encrypt sensitive data to store them on a commercial cloud and run certain analyses without ever decrypting the data to preserve privacy. Homomorphic encryption technique is a promising candidate for secure data outsourcing, but it is a very challenging task to support real-world machine learning tasks. Existing frameworks can only handle simplified cases with low-degree polynomials such as linear means classifier and linear discriminative analysis. Objective The goal of this study is to provide a practical support to the mainstream learning models (eg, logistic regression). Methods We adapted a novel homomorphic encryption scheme optimized for real numbers computation. We devised (1) the least squares approximation of the logistic function for accuracy and efficiency (ie, reduce computation cost) and (2) new packing and parallelization techniques. Results Using real-world datasets, we evaluated the performance of our model and demonstrated its feasibility in speed and memory consumption. For example, it took approximately 116 minutes to obtain the training model from the homomorphically encrypted Edinburgh dataset. In addition, it gives fairly accurate predictions on the testing dataset. Conclusions We present the first homomorphically encrypted logistic regression outsourcing model based on the critical observation that the precision loss of classification models is sufficiently small so that the decision plan stays still. PMID:29666041
An ultra low power feature extraction and classification system for wearable seizure detection.
Page, Adam; Pramod Tim Oates, Siddharth; Mohsenin, Tinoosh
2015-01-01
In this paper we explore the use of a variety of machine learning algorithms for designing a reliable and low-power, multi-channel EEG feature extractor and classifier for predicting seizures from electroencephalographic data (scalp EEG). Different machine learning classifiers including k-nearest neighbor, support vector machines, naïve Bayes, logistic regression, and neural networks are explored with the goal of maximizing detection accuracy while minimizing power, area, and latency. The input to each machine learning classifier is a 198 feature vector containing 9 features for each of the 22 EEG channels obtained over 1-second windows. All classifiers were able to obtain F1 scores over 80% and onset sensitivity of 100% when tested on 10 patients. Among five different classifiers that were explored, logistic regression (LR) proved to have minimum hardware complexity while providing average F-1 score of 91%. Both ASIC and FPGA implementations of logistic regression are presented and show the smallest area, power consumption, and the lowest latency when compared to the previous work.
Support vector machines classifiers of physical activities in preschoolers
USDA-ARS?s Scientific Manuscript database
The goal of this study is to develop, test, and compare multinomial logistic regression (MLR) and support vector machines (SVM) in classifying preschool-aged children physical activity data acquired from an accelerometer. In this study, 69 children aged 3-5 years old were asked to participate in a s...
Sample size determination for logistic regression on a logit-normal distribution.
Kim, Seongho; Heath, Elisabeth; Heilbrun, Lance
2017-06-01
Although the sample size for simple logistic regression can be readily determined using currently available methods, the sample size calculation for multiple logistic regression requires some additional information, such as the coefficient of determination ([Formula: see text]) of a covariate of interest with other covariates, which is often unavailable in practice. The response variable of logistic regression follows a logit-normal distribution which can be generated from a logistic transformation of a normal distribution. Using this property of logistic regression, we propose new methods of determining the sample size for simple and multiple logistic regressions using a normal transformation of outcome measures. Simulation studies and a motivating example show several advantages of the proposed methods over the existing methods: (i) no need for [Formula: see text] for multiple logistic regression, (ii) available interim or group-sequential designs, and (iii) much smaller required sample size.
Staley, James R; Jones, Edmund; Kaptoge, Stephen; Butterworth, Adam S; Sweeting, Michael J; Wood, Angela M; Howson, Joanna M M
2017-06-01
Logistic regression is often used instead of Cox regression to analyse genome-wide association studies (GWAS) of single-nucleotide polymorphisms (SNPs) and disease outcomes with cohort and case-cohort designs, as it is less computationally expensive. Although Cox and logistic regression models have been compared previously in cohort studies, this work does not completely cover the GWAS setting nor extend to the case-cohort study design. Here, we evaluated Cox and logistic regression applied to cohort and case-cohort genetic association studies using simulated data and genetic data from the EPIC-CVD study. In the cohort setting, there was a modest improvement in power to detect SNP-disease associations using Cox regression compared with logistic regression, which increased as the disease incidence increased. In contrast, logistic regression had more power than (Prentice weighted) Cox regression in the case-cohort setting. Logistic regression yielded inflated effect estimates (assuming the hazard ratio is the underlying measure of association) for both study designs, especially for SNPs with greater effect on disease. Given logistic regression is substantially more computationally efficient than Cox regression in both settings, we propose a two-step approach to GWAS in cohort and case-cohort studies. First to analyse all SNPs with logistic regression to identify associated variants below a pre-defined P-value threshold, and second to fit Cox regression (appropriately weighted in case-cohort studies) to those identified SNPs to ensure accurate estimation of association with disease.
The crux of the method: assumptions in ordinary least squares and logistic regression.
Long, Rebecca G
2008-10-01
Logistic regression has increasingly become the tool of choice when analyzing data with a binary dependent variable. While resources relating to the technique are widely available, clear discussions of why logistic regression should be used in place of ordinary least squares regression are difficult to find. The current paper compares and contrasts the assumptions of ordinary least squares with those of logistic regression and explains why logistic regression's looser assumptions make it adept at handling violations of the more important assumptions in ordinary least squares.
Steganalysis using logistic regression
NASA Astrophysics Data System (ADS)
Lubenko, Ivans; Ker, Andrew D.
2011-02-01
We advocate Logistic Regression (LR) as an alternative to the Support Vector Machine (SVM) classifiers commonly used in steganalysis. LR offers more information than traditional SVM methods - it estimates class probabilities as well as providing a simple classification - and can be adapted more easily and efficiently for multiclass problems. Like SVM, LR can be kernelised for nonlinear classification, and it shows comparable classification accuracy to SVM methods. This work is a case study, comparing accuracy and speed of SVM and LR classifiers in detection of LSB Matching and other related spatial-domain image steganography, through the state-of-art 686-dimensional SPAM feature set, in three image sets.
Ardoino, Ilaria; Lanzoni, Monica; Marano, Giuseppe; Boracchi, Patrizia; Sagrini, Elisabetta; Gianstefani, Alice; Piscaglia, Fabio; Biganzoli, Elia M
2017-04-01
The interpretation of regression models results can often benefit from the generation of nomograms, 'user friendly' graphical devices especially useful for assisting the decision-making processes. However, in the case of multinomial regression models, whenever categorical responses with more than two classes are involved, nomograms cannot be drawn in the conventional way. Such a difficulty in managing and interpreting the outcome could often result in a limitation of the use of multinomial regression in decision-making support. In the present paper, we illustrate the derivation of a non-conventional nomogram for multinomial regression models, intended to overcome this issue. Although it may appear less straightforward at first sight, the proposed methodology allows an easy interpretation of the results of multinomial regression models and makes them more accessible for clinicians and general practitioners too. Development of prediction model based on multinomial logistic regression and of the pertinent graphical tool is illustrated by means of an example involving the prediction of the extent of liver fibrosis in hepatitis C patients by routinely available markers.
Harris, Katherine M.; Koenig, Harold G.; Han, Xiaotong; Sullivan, Greer; Mattox, Rhonda; Tang, Lingqi
2009-01-01
Objective The negative association between religiosity (religious beliefs and church attendance) and the likelihood of substance use disorders is well established, but the mechanism(s) remain poorly understood. We investigated whether this association was mediated by social support or mental health status. Method We utilized cross-sectional data from the 2002 National Survey on Drug Use and Health (n = 36,370). We first used logistic regression to regress any alcohol use in the past year on sociodemographic and religiosity variables. Then, among individuals who drank in the past year, we regressed past year alcohol abuse/dependence on sociodemographic and religiosity variables. To investigate whether social support mediated the association between religiosity and alcohol use and alcohol abuse/dependence we repeated the above models, adding the social support variables. To the extent that these added predictors modified the magnitude of the effect of the religiosity variables, we interpreted social support as a possible mediator. We also formally tested for mediation using path analysis. We investigated the possible mediating role of mental health status analogously. Parallel sets of analyses were conducted for any drug use, and drug abuse/dependence among those using any drugs as the dependent variables. Results The addition of social support and mental health status variables to logistic regression models had little effect on the magnitude of the religiosity coefficients in any of the models. While some of the tests of mediation were significant in the path analyses, the results were not always in the expected direction, and the magnitude of the effects was small. Conclusions The association between religiosity and decreased likelihood of a substance use disorder does not appear to be substantively mediated by either social support or mental health status. PMID:19714282
Using Dominance Analysis to Determine Predictor Importance in Logistic Regression
ERIC Educational Resources Information Center
Azen, Razia; Traxel, Nicole
2009-01-01
This article proposes an extension of dominance analysis that allows researchers to determine the relative importance of predictors in logistic regression models. Criteria for choosing logistic regression R[superscript 2] analogues were determined and measures were selected that can be used to perform dominance analysis in logistic regression. A…
"I'm Not Supporting His Kids": Nonresident Fathers' Contributions Given Mothers' New Fertility
ERIC Educational Resources Information Center
Meyer, Daniel R.; Cancian, Maria
2012-01-01
The authors examined whether nonresident fathers provide informal support to their children and whether support stops if their ex-partner goes on to have a child with a new man. A logistic regression analysis of longitudinal survey and administrative data for 434 women who received welfare in Wisconsin showed that fathers are less likely to…
Applying Kaplan-Meier to Item Response Data
ERIC Educational Resources Information Center
McNeish, Daniel
2018-01-01
Some IRT models can be equivalently modeled in alternative frameworks such as logistic regression. Logistic regression can also model time-to-event data, which concerns the probability of an event occurring over time. Using the relation between time-to-event models and logistic regression and the relation between logistic regression and IRT, this…
Profiles of Supportive Alumni: Donors, Volunteers, and Those Who "Do It All"
ERIC Educational Resources Information Center
Weerts, David J.; Ronca, Justin M.
2007-01-01
In the competitive marketplace of higher education, college and university alumni are increasingly called on to support their institutions in multiple ways: political advocacy, volunteerism, and charitable giving. Drawing on alumni survey data gathered from a large research extensive university, we employ a multinomial logistic regression model to…
ERIC Educational Resources Information Center
Fiebig, Jennifer Nepper; Braid, Barbara L.; Ross, Patricia A.; Tom, Matthew A.; Prinzo, Cara
2010-01-01
A multiple logistic regression model was used to determine the associations between the role of acculturation, perception of educational barriers, need for family kin support, vocational planning, and expectations for attaining future vocational goals against the demographic variables (gender, age, being the oldest child, the first to attend…
ERIC Educational Resources Information Center
Schaller, James; Yang, Nancy K.
2005-01-01
Differences in rates of case closure, case service cost, hours worked per week, and weekly wage between customers with autism closed successfully in competitive employment and supported employment were found using the Rehabilitation Service Administration national database of 2001. Using logistic regression, customer demographic variables related…
Unconditional or Conditional Logistic Regression Model for Age-Matched Case-Control Data?
Kuo, Chia-Ling; Duan, Yinghui; Grady, James
2018-01-01
Matching on demographic variables is commonly used in case-control studies to adjust for confounding at the design stage. There is a presumption that matched data need to be analyzed by matched methods. Conditional logistic regression has become a standard for matched case-control data to tackle the sparse data problem. The sparse data problem, however, may not be a concern for loose-matching data when the matching between cases and controls is not unique, and one case can be matched to other controls without substantially changing the association. Data matched on a few demographic variables are clearly loose-matching data, and we hypothesize that unconditional logistic regression is a proper method to perform. To address the hypothesis, we compare unconditional and conditional logistic regression models by precision in estimates and hypothesis testing using simulated matched case-control data. Our results support our hypothesis; however, the unconditional model is not as robust as the conditional model to the matching distortion that the matching process not only makes cases and controls similar for matching variables but also for the exposure status. When the study design involves other complex features or the computational burden is high, matching in loose-matching data can be ignored for negligible loss in testing and estimation if the distributions of matching variables are not extremely different between cases and controls.
Unconditional or Conditional Logistic Regression Model for Age-Matched Case–Control Data?
Kuo, Chia-Ling; Duan, Yinghui; Grady, James
2018-01-01
Matching on demographic variables is commonly used in case–control studies to adjust for confounding at the design stage. There is a presumption that matched data need to be analyzed by matched methods. Conditional logistic regression has become a standard for matched case–control data to tackle the sparse data problem. The sparse data problem, however, may not be a concern for loose-matching data when the matching between cases and controls is not unique, and one case can be matched to other controls without substantially changing the association. Data matched on a few demographic variables are clearly loose-matching data, and we hypothesize that unconditional logistic regression is a proper method to perform. To address the hypothesis, we compare unconditional and conditional logistic regression models by precision in estimates and hypothesis testing using simulated matched case–control data. Our results support our hypothesis; however, the unconditional model is not as robust as the conditional model to the matching distortion that the matching process not only makes cases and controls similar for matching variables but also for the exposure status. When the study design involves other complex features or the computational burden is high, matching in loose-matching data can be ignored for negligible loss in testing and estimation if the distributions of matching variables are not extremely different between cases and controls. PMID:29552553
Coping Styles in Heart Failure Patients with Depressive Symptoms
Trivedi, Ranak B.; Blumenthal, James A.; O'Connor, Christopher; Adams, Kirkwood; Hinderliter, Alan; Sueta-Dupree, Carla; Johnson, Kristy; Sherwood, Andrew
2009-01-01
Objective Elevated depressive symptoms have been linked to poorer prognosis in heart failure (HF) patients. Our objective was to identify coping styles associated with depressive symptoms in HF patients. Methods 222 stable HF patients (32.75% female, 45.4% non-Hispanic Black) completed multiple questionnaires. Beck Depression Inventory (BDI) assessed depressive symptoms, Life Orientation Test (LOT-R) assessed optimism, ENRICHD Social Support Inventory (ESSI) and Perceived Social Support Scale (PSSS) assessed social support, and COPE assessed coping styles. Linear regression analyses were employed to assess the association of coping styles with continuous BDI scores. Logistic regression analyses were performed using BDI scores dichotomized into BDI<10 versus BDI≥10, to identify coping styles accompanying clinically significant depressive symptoms. Results In linear regression models, higher BDI scores were associated with lower scores on the acceptance (β=-.14), humor (β=-.15), planning (β=-.15), and emotional support (β=-.14) subscales of the COPE, and higher scores on the behavioral disengagement (β=.41), denial (β=.33), venting (β=.25), and mental disengagement (β=.22) subscales. Higher PSSS and ESSI scores were associated with lower BDI scores (β=-.32 and -.25, respectively). Higher LOT-R scores were associated with higher BDI scores (β=.39, p<.001). In logistical regression models, BDI≥10 was associated with greater likelihood of behavioral disengagement (OR=1.3), denial (OR=1.2), mental disengagement (OR=1.3), venting (OR=1.2), and pessimism (OR=1.2), and lower perceived social support measured by PSSS (OR=.92) and ESSI (OR=.92). Conclusion Depressive symptoms in HF patients are associated with avoidant coping, lower perceived social support, and pessimism. Results raise the possibility that interventions designed to improve coping may reduce depressive symptoms. PMID:19773027
Coping styles in heart failure patients with depressive symptoms.
Trivedi, Ranak B; Blumenthal, James A; O'Connor, Christopher; Adams, Kirkwood; Hinderliter, Alan; Dupree, Carla; Johnson, Kristy; Sherwood, Andrew
2009-10-01
Elevated depressive symptoms have been linked to poorer prognosis in heart failure (HF) patients. Our objective was to identify coping styles associated with depressive symptoms in HF patients. A total of 222 stable HF patients (32.75% female, 45.4% non-Hispanic black) completed multiple questionnaires. Beck Depression Inventory (BDI) assessed depressive symptoms, Life Orientation Test (LOT-R) assessed optimism, ENRICHD Social Support Inventory (ESSI) and Perceived Social Support Scale (PSSS) assessed social support, and COPE assessed coping styles. Linear regression analyses were employed to assess the association of coping styles with continuous BDI scores. Logistic regression analyses were performed using BDI scores dichotomized into BDI<10 vs. BDI> or =10, to identify coping styles accompanying clinically significant depressive symptoms. In linear regression models, higher BDI scores were associated with lower scores on the acceptance (beta=-.14), humor (beta=-.15), planning (beta=-.15), and emotional support (beta=-.14) subscales of the COPE, and higher scores on the behavioral disengagement (beta=.41), denial (beta=.33), venting (beta=.25), and mental disengagement (beta=.22) subscales. Higher PSSS and ESSI scores were associated with lower BDI scores (beta=-.32 and -.25, respectively). Higher LOT-R scores were associated with higher BDI scores (beta=.39, P<.001). In logistical regression models, BDI> or =10 was associated with greater likelihood of behavioral disengagement (OR=1.3), denial (OR=1.2), mental disengagement (OR=1.3), venting (OR=1.2), and pessimism (OR=1.2), and lower perceived social support measured by PSSS (OR=.92) and ESSI (OR=.92). Depressive symptoms in HF patients are associated with avoidant coping, lower perceived social support, and pessimism. Results raise the possibility that interventions designed to improve coping may reduce depressive symptoms.
Secure Logistic Regression Based on Homomorphic Encryption: Design and Evaluation.
Kim, Miran; Song, Yongsoo; Wang, Shuang; Xia, Yuhou; Jiang, Xiaoqian
2018-04-17
Learning a model without accessing raw data has been an intriguing idea to security and machine learning researchers for years. In an ideal setting, we want to encrypt sensitive data to store them on a commercial cloud and run certain analyses without ever decrypting the data to preserve privacy. Homomorphic encryption technique is a promising candidate for secure data outsourcing, but it is a very challenging task to support real-world machine learning tasks. Existing frameworks can only handle simplified cases with low-degree polynomials such as linear means classifier and linear discriminative analysis. The goal of this study is to provide a practical support to the mainstream learning models (eg, logistic regression). We adapted a novel homomorphic encryption scheme optimized for real numbers computation. We devised (1) the least squares approximation of the logistic function for accuracy and efficiency (ie, reduce computation cost) and (2) new packing and parallelization techniques. Using real-world datasets, we evaluated the performance of our model and demonstrated its feasibility in speed and memory consumption. For example, it took approximately 116 minutes to obtain the training model from the homomorphically encrypted Edinburgh dataset. In addition, it gives fairly accurate predictions on the testing dataset. We present the first homomorphically encrypted logistic regression outsourcing model based on the critical observation that the precision loss of classification models is sufficiently small so that the decision plan stays still. ©Miran Kim, Yongsoo Song, Shuang Wang, Yuhou Xia, Xiaoqian Jiang. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 17.04.2018.
Zarb, Francis; McEntee, Mark F; Rainford, Louise
2015-06-01
To evaluate visual grading characteristics (VGC) and ordinal regression analysis during head CT optimisation as a potential alternative to visual grading assessment (VGA), traditionally employed to score anatomical visualisation. Patient images (n = 66) were obtained using current and optimised imaging protocols from two CT suites: a 16-slice scanner at the national Maltese centre for trauma and a 64-slice scanner in a private centre. Local resident radiologists (n = 6) performed VGA followed by VGC and ordinal regression analysis. VGC alone indicated that optimised protocols had similar image quality as current protocols. Ordinal logistic regression analysis provided an in-depth evaluation, criterion by criterion allowing the selective implementation of the protocols. The local radiology review panel supported the implementation of optimised protocols for brain CT examinations (including trauma) in one centre, achieving radiation dose reductions ranging from 24 % to 36 %. In the second centre a 29 % reduction in radiation dose was achieved for follow-up cases. The combined use of VGC and ordinal logistic regression analysis led to clinical decisions being taken on the implementation of the optimised protocols. This improved method of image quality analysis provided the evidence to support imaging protocol optimisation, resulting in significant radiation dose savings. • There is need for scientifically based image quality evaluation during CT optimisation. • VGC and ordinal regression analysis in combination led to better informed clinical decisions. • VGC and ordinal regression analysis led to dose reductions without compromising diagnostic efficacy.
NASA Astrophysics Data System (ADS)
Dokuchaev, P. M.; Meshalkina, J. L.; Yaroslavtsev, A. M.
2018-01-01
Comparative analysis of soils geospatial modeling using multinomial logistic regression, decision trees, random forest, regression trees and support vector machines algorithms was conducted. The visual interpretation of the digital maps obtained and their comparison with the existing map, as well as the quantitative assessment of the individual soil groups detection overall accuracy and of the models kappa showed that multiple logistic regression, support vector method, and random forest models application with spatial prediction of the conditional soil groups distribution can be reliably used for mapping of the study area. It has shown the most accurate detection for sod-podzolics soils (Phaeozems Albic) lightly eroded and moderately eroded soils. In second place, according to the mean overall accuracy of the prediction, there are sod-podzolics soils - non-eroded and warp one, as well as sod-gley soils (Umbrisols Gleyic) and alluvial soils (Fluvisols Dystric, Umbric). Heavy eroded sod-podzolics and gray forest soils (Phaeozems Albic) were detected by methods of automatic classification worst of all.
Lanfredi, Mariangela; Candini, Valentina; Buizza, Chiara; Ferrari, Clarissa; Boero, Maria E; Giobbio, Gian M; Goldschmidt, Nicoletta; Greppo, Stefania; Iozzino, Laura; Maggi, Paolo; Melegari, Anna; Pasqualetti, Patrizio; Rossi, Giuseppe; de Girolamo, Giovanni
2014-05-15
Quality of life (QOL) has been considered an important outcome measure in psychiatric research and determinants of QOL have been widely investigated. We aimed at detecting predictors of QOL at baseline and at testing the longitudinal interrelations of the baseline predictors with QOL scores at a 1-year follow-up in a sample of patients living in Residential Facilities (RFs). Logistic regression models were adopted to evaluate the association between WHOQoL-Bref scores and potential determinants of QOL. In addition, all variables significantly associated with QOL domains in the final logistic regression model were included by using the Structural Equation Modeling (SEM). We included 139 patients with a diagnosis of schizophrenia spectrum. In the final logistic regression model level of activity, social support, age, service satisfaction, spiritual well-being and symptoms' severity were identified as predictors of QOL scores at baseline. Longitudinal analyses carried out by SEM showed that 40% of QOL follow-up variability was explained by QOL at baseline, and significant indirect effects toward QOL at follow-up were found for satisfaction with services and for social support. Rehabilitation plans for people with schizophrenia living in RFs should also consider mediators of change in subjective QOL such as satisfaction with mental health services. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Early warnings for suicide attempt among Chinese rural population.
Lyu, Juncheng; Wang, Yingying; Shi, Hong; Zhang, Jie
2018-06-05
This study was to explore the main influencing factors of attempted suicide and establish an early warning model, so as to put forward prevention strategies for attempted suicide. Data came from a large-scale case-control epidemiological survey. A sample of 659 serious suicide attempters was randomly recruited from 13 rural counties in China. Each case was matched by a community control for gender, age, and residence location. Face to face interviews were conducted for all the cases and controls with the same structured questionnaire. Univariate logistic regression was applied to screen the factors and multivariate logistic regression was used to excavate the predictors. There were no statistical differences between suicide attempters and the community controls in gender, age, and residence location. The Cronbach`s coefficients for all the scales used were above 0.675. The multivariate logistic regressions have revealed 12 statistically significant variables predicting attempted suicide, including less education, family history of suicide, poor health, mental problem, aspiration strain, hopelessness, impulsivity, depression, negative life events. On the other hand, social support, coping skills, and healthy community protected the rural residents from suicide attempt. The excavated warning predictors are significant clinical meaning for the clinical psychiatrist. Crisis intervention strategies in rural China should be informed by the findings from this research. Education, social support, healthy community, and strain reduction are all measures to decrease the likelihood of crises. Copyright © 2018. Published by Elsevier B.V.
NASA Astrophysics Data System (ADS)
Lin, Yingzhi; Deng, Xiangzheng; Li, Xing; Ma, Enjun
2014-12-01
Spatially explicit simulation of land use change is the basis for estimating the effects of land use and cover change on energy fluxes, ecology and the environment. At the pixel level, logistic regression is one of the most common approaches used in spatially explicit land use allocation models to determine the relationship between land use and its causal factors in driving land use change, and thereby to evaluate land use suitability. However, these models have a drawback in that they do not determine/allocate land use based on the direct relationship between land use change and its driving factors. Consequently, a multinomial logistic regression method was introduced to address this flaw, and thereby, judge the suitability of a type of land use in any given pixel in a case study area of the Jiangxi Province, China. A comparison of the two regression methods indicated that the proportion of correctly allocated pixels using multinomial logistic regression was 92.98%, which was 8.47% higher than that obtained using logistic regression. Paired t-test results also showed that pixels were more clearly distinguished by multinomial logistic regression than by logistic regression. In conclusion, multinomial logistic regression is a more efficient and accurate method for the spatial allocation of land use changes. The application of this method in future land use change studies may improve the accuracy of predicting the effects of land use and cover change on energy fluxes, ecology, and environment.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yahya, Noorazrul, E-mail: noorazrul.yahya@research.uwa.edu.au; Ebert, Martin A.; Bulsara, Max
Purpose: Given the paucity of available data concerning radiotherapy-induced urinary toxicity, it is important to ensure derivation of the most robust models with superior predictive performance. This work explores multiple statistical-learning strategies for prediction of urinary symptoms following external beam radiotherapy of the prostate. Methods: The performance of logistic regression, elastic-net, support-vector machine, random forest, neural network, and multivariate adaptive regression splines (MARS) to predict urinary symptoms was analyzed using data from 754 participants accrued by TROG03.04-RADAR. Predictive features included dose-surface data, comorbidities, and medication-intake. Four symptoms were analyzed: dysuria, haematuria, incontinence, and frequency, each with three definitions (grade ≥more » 1, grade ≥ 2 and longitudinal) with event rate between 2.3% and 76.1%. Repeated cross-validations producing matched models were implemented. A synthetic minority oversampling technique was utilized in endpoints with rare events. Parameter optimization was performed on the training data. Area under the receiver operating characteristic curve (AUROC) was used to compare performance using sample size to detect differences of ≥0.05 at the 95% confidence level. Results: Logistic regression, elastic-net, random forest, MARS, and support-vector machine were the highest-performing statistical-learning strategies in 3, 3, 3, 2, and 1 endpoints, respectively. Logistic regression, MARS, elastic-net, random forest, neural network, and support-vector machine were the best, or were not significantly worse than the best, in 7, 7, 5, 5, 3, and 1 endpoints. The best-performing statistical model was for dysuria grade ≥ 1 with AUROC ± standard deviation of 0.649 ± 0.074 using MARS. For longitudinal frequency and dysuria grade ≥ 1, all strategies produced AUROC>0.6 while all haematuria endpoints and longitudinal incontinence models produced AUROC<0.6. Conclusions: Logistic regression and MARS were most likely to be the best-performing strategy for the prediction of urinary symptoms with elastic-net and random forest producing competitive results. The predictive power of the models was modest and endpoint-dependent. New features, including spatial dose maps, may be necessary to achieve better models.« less
Bingham, P; Verlander, N Q; Cheal, M J
2004-09-01
This paper examines why Snow's contention that cholera was principally spread by water was not accepted in the 1850s by the medical elite. The consequence of rejection was that hundreds in the UK continued to die. Logistic regression was used to re-analyse data, first published in 1852 by William Farr, consisting of the 1849 mortality rate from cholera and eight potential explanatory variables for the 38 registration districts of London. Logistic regression does not support Farr's original conclusion that a district's elevation above high water was the most important explanatory variable. Elevation above high water, water supply and poor rate each have an independent significant effect on district cholera mortality rate, but in terms of size of effect, it can be argued that water supply most strongly 'invited' further consideration. The science of epidemiology, that Farr helped to found, has continued to advance. Had logistic regression been available to Farr, its application to his 1852 data set would have changed his conclusion.
Mocellin, Simone; Ambrosi, Alessandro; Montesco, Maria Cristina; Foletto, Mirto; Zavagno, Giorgio; Nitti, Donato; Lise, Mario; Rossi, Carlo Riccardo
2006-08-01
Currently, approximately 80% of melanoma patients undergoing sentinel node biopsy (SNB) have negative sentinel lymph nodes (SLNs), and no prediction system is reliable enough to be implemented in the clinical setting to reduce the number of SNB procedures. In this study, the predictive power of support vector machine (SVM)-based statistical analysis was tested. The clinical records of 246 patients who underwent SNB at our institution were used for this analysis. The following clinicopathologic variables were considered: the patient's age and sex and the tumor's histological subtype, Breslow thickness, Clark level, ulceration, mitotic index, lymphocyte infiltration, regression, angiolymphatic invasion, microsatellitosis, and growth phase. The results of SVM-based prediction of SLN status were compared with those achieved with logistic regression. The SLN positivity rate was 22% (52 of 234). When the accuracy was > or = 80%, the negative predictive value, positive predictive value, specificity, and sensitivity were 98%, 54%, 94%, and 77% and 82%, 41%, 69%, and 93% by using SVM and logistic regression, respectively. Moreover, SVM and logistic regression were associated with a diagnostic error and an SNB percentage reduction of (1) 1% and 60% and (2) 15% and 73%, respectively. The results from this pilot study suggest that SVM-based prediction of SLN status might be evaluated as a prognostic method to avoid the SNB procedure in 60% of patients currently eligible, with a very low error rate. If validated in larger series, this strategy would lead to obvious advantages in terms of both patient quality of life and costs for the health care system.
Parenting styles, parenting practices, and physical activity in 10- to 11-year olds.
Jago, Russell; Davison, Kirsten K; Brockman, Rowan; Page, Angie S; Thompson, Janice L; Fox, Kenneth R
2011-01-01
The objective of this study was to determine whether parenting styles and practices are associated with children's physical activity. Cross-sectional survey of seven hundred ninety-two 10- to 11-year-old UK children in Bristol (UK) in 2008-2009 was conducted. Accelerometer-assessed physical activity and mean minutes of moderate-to-vigorous physical activity (mean MVPA) and mean counts per minute (mean CPM) were obtained. Maternal parenting style and physical activity parenting practices were self-reported. In regression analyses, permissive parenting was associated with higher mean MVPA among girls (+6.0 min/day, p<0.001) and greater mean CPM (+98.9 accelerometer counts/min, p=0.014) among boys when compared to children with authoritative parents. Maternal logistic support was associated with mean CPM for girls (+36.2 counts/min, p=0.001), while paternal logistic support was associated with boys' mean MVPA (+4.0 min/day, p=0.049) and mean CPM (+55.7 counts/min, p=0.014). Maternal permissive parenting was associated with higher levels of physical activity than authoritative parenting, but associations differed by child gender and type of physical activity. Maternal logistic support was associated with girls' physical activity, while paternal logistic support was associated with boys' physical activity. Health professionals could encourage parents to increase logistic support for their children's physical activity. Copyright © 2010 Elsevier Inc. All rights reserved.
Standards for Standardized Logistic Regression Coefficients
ERIC Educational Resources Information Center
Menard, Scott
2011-01-01
Standardized coefficients in logistic regression analysis have the same utility as standardized coefficients in linear regression analysis. Although there has been no consensus on the best way to construct standardized logistic regression coefficients, there is now sufficient evidence to suggest a single best approach to the construction of a…
Schörgendorfer, Angela; Branscum, Adam J; Hanson, Timothy E
2013-06-01
Logistic regression is a popular tool for risk analysis in medical and population health science. With continuous response data, it is common to create a dichotomous outcome for logistic regression analysis by specifying a threshold for positivity. Fitting a linear regression to the nondichotomized response variable assuming a logistic sampling model for the data has been empirically shown to yield more efficient estimates of odds ratios than ordinary logistic regression of the dichotomized endpoint. We illustrate that risk inference is not robust to departures from the parametric logistic distribution. Moreover, the model assumption of proportional odds is generally not satisfied when the condition of a logistic distribution for the data is violated, leading to biased inference from a parametric logistic analysis. We develop novel Bayesian semiparametric methodology for testing goodness of fit of parametric logistic regression with continuous measurement data. The testing procedures hold for any cutoff threshold and our approach simultaneously provides the ability to perform semiparametric risk estimation. Bayes factors are calculated using the Savage-Dickey ratio for testing the null hypothesis of logistic regression versus a semiparametric generalization. We propose a fully Bayesian and a computationally efficient empirical Bayesian approach to testing, and we present methods for semiparametric estimation of risks, relative risks, and odds ratios when parametric logistic regression fails. Theoretical results establish the consistency of the empirical Bayes test. Results from simulated data show that the proposed approach provides accurate inference irrespective of whether parametric assumptions hold or not. Evaluation of risk factors for obesity shows that different inferences are derived from an analysis of a real data set when deviations from a logistic distribution are permissible in a flexible semiparametric framework. © 2013, The International Biometric Society.
Classification of sodium MRI data of cartilage using machine learning.
Madelin, Guillaume; Poidevin, Frederick; Makrymallis, Antonios; Regatte, Ravinder R
2015-11-01
To assess the possible utility of machine learning for classifying subjects with and subjects without osteoarthritis using sodium magnetic resonance imaging data. Theory: Support vector machine, k-nearest neighbors, naïve Bayes, discriminant analysis, linear regression, logistic regression, neural networks, decision tree, and tree bagging were tested. Sodium magnetic resonance imaging with and without fluid suppression by inversion recovery was acquired on the knee cartilage of 19 controls and 28 osteoarthritis patients. Sodium concentrations were measured in regions of interests in the knee for both acquisitions. Mean (MEAN) and standard deviation (STD) of these concentrations were measured in each regions of interest, and the minimum, maximum, and mean of these two measurements were calculated over all regions of interests for each subject. The resulting 12 variables per subject were used as predictors for classification. Either Min [STD] alone, or in combination with Mean [MEAN] or Min [MEAN], all from fluid suppressed data, were the best predictors with an accuracy >74%, mainly with linear logistic regression and linear support vector machine. Other good classifiers include discriminant analysis, linear regression, and naïve Bayes. Machine learning is a promising technique for classifying osteoarthritis patients and controls from sodium magnetic resonance imaging data. © 2014 Wiley Periodicals, Inc.
Robust mislabel logistic regression without modeling mislabel probabilities.
Hung, Hung; Jou, Zhi-Yu; Huang, Su-Yun
2018-03-01
Logistic regression is among the most widely used statistical methods for linear discriminant analysis. In many applications, we only observe possibly mislabeled responses. Fitting a conventional logistic regression can then lead to biased estimation. One common resolution is to fit a mislabel logistic regression model, which takes into consideration of mislabeled responses. Another common method is to adopt a robust M-estimation by down-weighting suspected instances. In this work, we propose a new robust mislabel logistic regression based on γ-divergence. Our proposal possesses two advantageous features: (1) It does not need to model the mislabel probabilities. (2) The minimum γ-divergence estimation leads to a weighted estimating equation without the need to include any bias correction term, that is, it is automatically bias-corrected. These features make the proposed γ-logistic regression more robust in model fitting and more intuitive for model interpretation through a simple weighting scheme. Our method is also easy to implement, and two types of algorithms are included. Simulation studies and the Pima data application are presented to demonstrate the performance of γ-logistic regression. © 2017, The International Biometric Society.
Fungible weights in logistic regression.
Jones, Jeff A; Waller, Niels G
2016-06-01
In this article we develop methods for assessing parameter sensitivity in logistic regression models. To set the stage for this work, we first review Waller's (2008) equations for computing fungible weights in linear regression. Next, we describe 2 methods for computing fungible weights in logistic regression. To demonstrate the utility of these methods, we compute fungible logistic regression weights using data from the Centers for Disease Control and Prevention's (2010) Youth Risk Behavior Surveillance Survey, and we illustrate how these alternate weights can be used to evaluate parameter sensitivity. To make our work accessible to the research community, we provide R code (R Core Team, 2015) that will generate both kinds of fungible logistic regression weights. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Should metacognition be measured by logistic regression?
Rausch, Manuel; Zehetleitner, Michael
2017-03-01
Are logistic regression slopes suitable to quantify metacognitive sensitivity, i.e. the efficiency with which subjective reports differentiate between correct and incorrect task responses? We analytically show that logistic regression slopes are independent from rating criteria in one specific model of metacognition, which assumes (i) that rating decisions are based on sensory evidence generated independently of the sensory evidence used for primary task responses and (ii) that the distributions of evidence are logistic. Given a hierarchical model of metacognition, logistic regression slopes depend on rating criteria. According to all considered models, regression slopes depend on the primary task criterion. A reanalysis of previous data revealed that massive numbers of trials are required to distinguish between hierarchical and independent models with tolerable accuracy. It is argued that researchers who wish to use logistic regression as measure of metacognitive sensitivity need to control the primary task criterion and rating criteria. Copyright © 2017 Elsevier Inc. All rights reserved.
London Measure of Unplanned Pregnancy: guidance for its use as an outcome measure
Hall, Jennifer A; Barrett, Geraldine; Copas, Andrew; Stephenson, Judith
2017-01-01
Background The London Measure of Unplanned Pregnancy (LMUP) is a psychometrically validated measure of the degree of intention of a current or recent pregnancy. The LMUP is increasingly being used worldwide, and can be used to evaluate family planning or preconception care programs. However, beyond recommending the use of the full LMUP scale, there is no published guidance on how to use the LMUP as an outcome measure. Ordinal logistic regression has been recommended informally, but studies published to date have all used binary logistic regression and dichotomized the scale at different cut points. There is thus a need for evidence-based guidance to provide a standardized methodology for multivariate analysis and to enable comparison of results. This paper makes recommendations for the regression method for analysis of the LMUP as an outcome measure. Materials and methods Data collected from 4,244 pregnant women in Malawi were used to compare five regression methods: linear, logistic with two cut points, and ordinal logistic with either the full or grouped LMUP score. The recommendations were then tested on the original UK LMUP data. Results There were small but no important differences in the findings across the regression models. Logistic regression resulted in the largest loss of information, and assumptions were violated for the linear and ordinal logistic regression. Consequently, robust standard errors were used for linear regression and a partial proportional odds ordinal logistic regression model attempted. The latter could only be fitted for grouped LMUP score. Conclusion We recommend the linear regression model with robust standard errors to make full use of the LMUP score when analyzed as an outcome measure. Ordinal logistic regression could be considered, but a partial proportional odds model with grouped LMUP score may be required. Logistic regression is the least-favored option, due to the loss of information. For logistic regression, the cut point for un/planned pregnancy should be between nine and ten. These recommendations will standardize the analysis of LMUP data and enhance comparability of results across studies. PMID:28435343
Logistic models--an odd(s) kind of regression.
Jupiter, Daniel C
2013-01-01
The logistic regression model bears some similarity to the multivariable linear regression with which we are familiar. However, the differences are great enough to warrant a discussion of the need for and interpretation of logistic regression. Copyright © 2013 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.
Grav, Siv; Romild, Ulla; Hellzèn, Ove; Stordal, Eystein
2013-08-01
The aim of the current study was to examine the association of personality, neighbourhood, and civic participation with the level of perceived social support if needed. The sample consists of a total of 35,797 men (16,035) and women (19,762) drawn from the Nord-Trøndelag Health Study 3 (HUNT3), aged 20-89, with a fully completed short version of the Eysenck Personality Questionnaire (EPQ) including a complete response to questions regarding perceived social support. A multinomial logistic regression model was used to investigate the association between the three-category outcomes (high, medium, and low) of perceived social support. The Chi-square test detected a significant (p < 0.001) association between personality, sense of community, civic participation, self-rated health, living arrangement, age groups, gender, and perceived social support, except between perceived social support and loss of social network, in which no significance was found. The crude and adjusted multinomial logistic regression models show a relation between medium and low scores on perceived social support, personality, and sources of social support. Interactions were observed between gender and self-rated health. There is an association between the level of perceived social support and personality, sense of community in the neighbourhood, and civic participation. Even if the interaction between men and self-reported health decreases the odds for low and medium social support, health professionals should be aware of men with poor health and their lack of social support.
Can Predictive Modeling Identify Head and Neck Oncology Patients at Risk for Readmission?
Manning, Amy M; Casper, Keith A; Peter, Kay St; Wilson, Keith M; Mark, Jonathan R; Collar, Ryan M
2018-05-01
Objective Unplanned readmission within 30 days is a contributor to health care costs in the United States. The use of predictive modeling during hospitalization to identify patients at risk for readmission offers a novel approach to quality improvement and cost reduction. Study Design Two-phase study including retrospective analysis of prospectively collected data followed by prospective longitudinal study. Setting Tertiary academic medical center. Subjects and Methods Prospectively collected data for patients undergoing surgical treatment for head and neck cancer from January 2013 to January 2015 were used to build predictive models for readmission within 30 days of discharge using logistic regression, classification and regression tree (CART) analysis, and random forests. One model (logistic regression) was then placed prospectively into the discharge workflow from March 2016 to May 2016 to determine the model's ability to predict which patients would be readmitted within 30 days. Results In total, 174 admissions had descriptive data. Thirty-two were excluded due to incomplete data. Logistic regression, CART, and random forest predictive models were constructed using the remaining 142 admissions. When applied to 106 consecutive prospective head and neck oncology patients at the time of discharge, the logistic regression model predicted readmissions with a specificity of 94%, a sensitivity of 47%, a negative predictive value of 90%, and a positive predictive value of 62% (odds ratio, 14.9; 95% confidence interval, 4.02-55.45). Conclusion Prospectively collected head and neck cancer databases can be used to develop predictive models that can accurately predict which patients will be readmitted. This offers valuable support for quality improvement initiatives and readmission-related cost reduction in head and neck cancer care.
Who Is Conducting Teacher Research?
ERIC Educational Resources Information Center
Hahs-Vaughn, Debbie L.; Yanowitz, Karen L.
2009-01-01
Few researchers have gone beyond case studies to examine characteristics of teachers who engage in research activities. Results from the authors' logistic regression models provide evidence that teaching in private schools, teaching in a midsize or large city, participating in professional development programs, and receiving support from the…
Parameters Estimation of Geographically Weighted Ordinal Logistic Regression (GWOLR) Model
NASA Astrophysics Data System (ADS)
Zuhdi, Shaifudin; Retno Sari Saputro, Dewi; Widyaningsih, Purnami
2017-06-01
A regression model is the representation of relationship between independent variable and dependent variable. The dependent variable has categories used in the logistic regression model to calculate odds on. The logistic regression model for dependent variable has levels in the logistics regression model is ordinal. GWOLR model is an ordinal logistic regression model influenced the geographical location of the observation site. Parameters estimation in the model needed to determine the value of a population based on sample. The purpose of this research is to parameters estimation of GWOLR model using R software. Parameter estimation uses the data amount of dengue fever patients in Semarang City. Observation units used are 144 villages in Semarang City. The results of research get GWOLR model locally for each village and to know probability of number dengue fever patient categories.
Parenting styles, parenting practices, and physical activity in 10- to 11-year olds
Jago, Russell; Davison, Kirsten K.; Brockman, Rowan; Page, Angie S.; Thompson, Janice L.; Fox, Kenneth R.
2011-01-01
Objective The objective of this study was to determine whether parenting styles and practices are associated with children's physical activity. Methods Cross-sectional survey of seven hundred ninety-two 10- to 11-year-old UK children in Bristol (UK) in 2008–2009 was conducted. Accelerometer-assessed physical activity and mean minutes of moderate-to-vigorous physical activity (mean MVPA) and mean counts per minute (mean CPM) were obtained. Maternal parenting style and physical activity parenting practices were self-reported. Results In regression analyses, permissive parenting was associated with higher mean MVPA among girls (+ 6.0 min/day, p < 0.001) and greater mean CPM (+ 98.9 accelerometer counts/min, p = 0.014) among boys when compared to children with authoritative parents. Maternal logistic support was associated with mean CPM for girls (+ 36.2 counts/min, p = 0.001), while paternal logistic support was associated with boys' mean MVPA (+ 4.0 min/day, p = 0.049) and mean CPM (+ 55.7 counts/min, p = 0.014). Conclusions Maternal permissive parenting was associated with higher levels of physical activity than authoritative parenting, but associations differed by child gender and type of physical activity. Maternal logistic support was associated with girls' physical activity, while paternal logistic support was associated with boys' physical activity. Health professionals could encourage parents to increase logistic support for their children's physical activity. PMID:21070805
Factors associated with active commuting to work among women.
Bopp, Melissa; Child, Stephanie; Campbell, Matthew
2014-01-01
Active commuting (AC), the act of walking or biking to work, has notable health benefits though rates of AC remain low among women. This study used a social-ecological framework to examine the factors associated with AC among women. A convenience sample of employed, working women (n = 709) completed an online survey about their mode of travel to work. Individual, interpersonal, institutional, community, and environmental influences were assessed. Basic descriptive statistics and frequencies described the sample. Simple logistic regression models examined associations with the independent variables with AC participation and multiple logistic regression analysis determined the relative influence of social ecological factors on AC participation. The sample was primarily middle-aged (44.09±11.38 years) and non-Hispanic White (92%). Univariate analyses revealed several individual, interpersonal, institutional, community and environmental factors significantly associated with AC. The multivariable logistic regression analysis results indicated that significant factors associated with AC included number of children, income, perceived behavioral control, coworker AC, coworker AC normative beliefs, employer and community supports for AC, and traffic. The results of this study contribute to the limited body of knowledge on AC participation for women and may help to inform gender-tailored interventions to enhance AC behavior and improve health.
Supporting Regularized Logistic Regression Privately and Efficiently.
Li, Wenfa; Liu, Hongzhe; Yang, Peng; Xie, Wei
2016-01-01
As one of the most popular statistical and machine learning models, logistic regression with regularization has found wide adoption in biomedicine, social sciences, information technology, and so on. These domains often involve data of human subjects that are contingent upon strict privacy regulations. Concerns over data privacy make it increasingly difficult to coordinate and conduct large-scale collaborative studies, which typically rely on cross-institution data sharing and joint analysis. Our work here focuses on safeguarding regularized logistic regression, a widely-used statistical model while at the same time has not been investigated from a data security and privacy perspective. We consider a common use scenario of multi-institution collaborative studies, such as in the form of research consortia or networks as widely seen in genetics, epidemiology, social sciences, etc. To make our privacy-enhancing solution practical, we demonstrate a non-conventional and computationally efficient method leveraging distributing computing and strong cryptography to provide comprehensive protection over individual-level and summary data. Extensive empirical evaluations on several studies validate the privacy guarantee, efficiency and scalability of our proposal. We also discuss the practical implications of our solution for large-scale studies and applications from various disciplines, including genetic and biomedical studies, smart grid, network analysis, etc.
Supporting Regularized Logistic Regression Privately and Efficiently
Li, Wenfa; Liu, Hongzhe; Yang, Peng; Xie, Wei
2016-01-01
As one of the most popular statistical and machine learning models, logistic regression with regularization has found wide adoption in biomedicine, social sciences, information technology, and so on. These domains often involve data of human subjects that are contingent upon strict privacy regulations. Concerns over data privacy make it increasingly difficult to coordinate and conduct large-scale collaborative studies, which typically rely on cross-institution data sharing and joint analysis. Our work here focuses on safeguarding regularized logistic regression, a widely-used statistical model while at the same time has not been investigated from a data security and privacy perspective. We consider a common use scenario of multi-institution collaborative studies, such as in the form of research consortia or networks as widely seen in genetics, epidemiology, social sciences, etc. To make our privacy-enhancing solution practical, we demonstrate a non-conventional and computationally efficient method leveraging distributing computing and strong cryptography to provide comprehensive protection over individual-level and summary data. Extensive empirical evaluations on several studies validate the privacy guarantee, efficiency and scalability of our proposal. We also discuss the practical implications of our solution for large-scale studies and applications from various disciplines, including genetic and biomedical studies, smart grid, network analysis, etc. PMID:27271738
The purpose of this report is to provide a reference manual that could be used by investigators for making informed use of logistic regression using two methods (standard logistic regression and MARS). The details for analyses of relationships between a dependent binary response ...
ERIC Educational Resources Information Center
Chen, Chau-Kuang
2005-01-01
Logistic and Cox regression methods are practical tools used to model the relationships between certain student learning outcomes and their relevant explanatory variables. The logistic regression model fits an S-shaped curve into a binary outcome with data points of zero and one. The Cox regression model allows investigators to study the duration…
Yusuf, O B; Bamgboye, E A; Afolabi, R F; Shodimu, M A
2014-09-01
Logistic regression model is widely used in health research for description and predictive purposes. Unfortunately, most researchers are sometimes not aware that the underlying principles of the techniques have failed when the algorithm for maximum likelihood does not converge. Young researchers particularly postgraduate students may not know why separation problem whether quasi or complete occurs, how to identify it and how to fix it. This study was designed to critically evaluate convergence issues in articles that employed logistic regression analysis published in an African Journal of Medicine and medical sciences between 2004 and 2013. Problems of quasi or complete separation were described and were illustrated with the National Demographic and Health Survey dataset. A critical evaluation of articles that employed logistic regression was conducted. A total of 581 articles was reviewed, of which 40 (6.9%) used binary logistic regression. Twenty-four (60.0%) stated the use of logistic regression model in the methodology while none of the articles assessed model fit. Only 3 (12.5%) properly described the procedures. Of the 40 that used the logistic regression model, the problem of convergence occurred in 6 (15.0%) of the articles. Logistic regression tends to be poorly reported in studies published between 2004 and 2013. Our findings showed that the procedure may not be well understood by researchers since very few described the process in their reports and may be totally unaware of the problem of convergence or how to deal with it.
Logistic Regression: Concept and Application
ERIC Educational Resources Information Center
Cokluk, Omay
2010-01-01
The main focus of logistic regression analysis is classification of individuals in different groups. The aim of the present study is to explain basic concepts and processes of binary logistic regression analysis intended to determine the combination of independent variables which best explain the membership in certain groups called dichotomous…
Fang, Xingang; Bagui, Sikha; Bagui, Subhash
2017-08-01
The readily available high throughput screening (HTS) data from the PubChem database provides an opportunity for mining of small molecules in a variety of biological systems using machine learning techniques. From the thousands of available molecular descriptors developed to encode useful chemical information representing the characteristics of molecules, descriptor selection is an essential step in building an optimal quantitative structural-activity relationship (QSAR) model. For the development of a systematic descriptor selection strategy, we need the understanding of the relationship between: (i) the descriptor selection; (ii) the choice of the machine learning model; and (iii) the characteristics of the target bio-molecule. In this work, we employed the Signature descriptor to generate a dataset on the Human kallikrein 5 (hK 5) inhibition confirmatory assay data and compared multiple classification models including logistic regression, support vector machine, random forest and k-nearest neighbor. Under optimal conditions, the logistic regression model provided extremely high overall accuracy (98%) and precision (90%), with good sensitivity (65%) in the cross validation test. In testing the primary HTS screening data with more than 200K molecular structures, the logistic regression model exhibited the capability of eliminating more than 99.9% of the inactive structures. As part of our exploration of the descriptor-model-target relationship, the excellent predictive performance of the combination of the Signature descriptor and the logistic regression model on the assay data of the Human kallikrein 5 (hK 5) target suggested a feasible descriptor/model selection strategy on similar targets. Copyright © 2017 Elsevier Ltd. All rights reserved.
Influence of landscape-scale factors in limiting brook trout populations in Pennsylvania streams
Kocovsky, P.M.; Carline, R.F.
2006-01-01
Landscapes influence the capacity of streams to produce trout through their effect on water chemistry and other factors at the reach scale. Trout abundance also fluctuates over time; thus, to thoroughly understand how spatial factors at landscape scales affect trout populations, one must assess the changes in populations over time to provide a context for interpreting the importance of spatial factors. We used data from the Pennsylvania Fish and Boat Commission's fisheries management database to investigate spatial factors that affect the capacity of streams to support brook trout Salvelinus fontinalis and to provide models useful for their management. We assessed the relative importance of spatial and temporal variation by calculating variance components and comparing relative standard errors for spatial and temporal variation. We used binary logistic regression to predict the presence of harvestable-length brook trout and multiple linear regression to assess the mechanistic links between landscapes and trout populations and to predict population density. The variance in trout density among streams was equal to or greater than the temporal variation for several streams, indicating that differences among sites affect population density. Logistic regression models correctly predicted the absence of harvestable-length brook trout in 60% of validation samples. The r 2-value for the linear regression model predicting density was 0.3, indicating low predictive ability. Both logistic and linear regression models supported buffering capacity against acid episodes as an important mechanistic link between landscapes and trout populations. Although our models fail to predict trout densities precisely, their success at elucidating the mechanistic links between landscapes and trout populations, in concert with the importance of spatial variation, increases our understanding of factors affecting brook trout abundance and will help managers and private groups to protect and enhance populations of wild brook trout. ?? Copyright by the American Fisheries Society 2006.
Caregivers' Retirement Congruency: A Case for Caregiver Support
ERIC Educational Resources Information Center
Humble, Aine M.; Keefe, Janice M.; Auton, Greg M.
2012-01-01
Using the concept of "retirement congruency" (RC), which takes into account greater variation in retirement decisions (low, moderate, or high RC) than a dichotomous conceptualization (forced versus chosen), multinomial logistic regression was conducted on a sample of caregivers from the 2002 Canadian General Social Survey who were…
An Entropy-Based Measure for Assessing Fuzziness in Logistic Regression
Weiss, Brandi A.; Dardick, William
2015-01-01
This article introduces an entropy-based measure of data–model fit that can be used to assess the quality of logistic regression models. Entropy has previously been used in mixture-modeling to quantify how well individuals are classified into latent classes. The current study proposes the use of entropy for logistic regression models to quantify the quality of classification and separation of group membership. Entropy complements preexisting measures of data–model fit and provides unique information not contained in other measures. Hypothetical data scenarios, an applied example, and Monte Carlo simulation results are used to demonstrate the application of entropy in logistic regression. Entropy should be used in conjunction with other measures of data–model fit to assess how well logistic regression models classify cases into observed categories. PMID:29795897
Logistic regression applied to natural hazards: rare event logistic regression with replications
NASA Astrophysics Data System (ADS)
Guns, M.; Vanacker, V.
2012-06-01
Statistical analysis of natural hazards needs particular attention, as most of these phenomena are rare events. This study shows that the ordinary rare event logistic regression, as it is now commonly used in geomorphologic studies, does not always lead to a robust detection of controlling factors, as the results can be strongly sample-dependent. In this paper, we introduce some concepts of Monte Carlo simulations in rare event logistic regression. This technique, so-called rare event logistic regression with replications, combines the strength of probabilistic and statistical methods, and allows overcoming some of the limitations of previous developments through robust variable selection. This technique was here developed for the analyses of landslide controlling factors, but the concept is widely applicable for statistical analyses of natural hazards.
Large unbalanced credit scoring using Lasso-logistic regression ensemble.
Wang, Hong; Xu, Qingsong; Zhou, Lifeng
2015-01-01
Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plausibility of ensemble learning using regularized logistic regression as the base classifier to deal with credit scoring problems. In this research, the data is first balanced and diversified by clustering and bagging algorithms. Then we apply a Lasso-logistic regression learning ensemble to evaluate the credit risks. We show that the proposed algorithm outperforms popular credit scoring models such as decision tree, Lasso-logistic regression and random forests in terms of AUC and F-measure. We also provide two importance measures for the proposed model to identify important variables in the data.
An Entropy-Based Measure for Assessing Fuzziness in Logistic Regression.
Weiss, Brandi A; Dardick, William
2016-12-01
This article introduces an entropy-based measure of data-model fit that can be used to assess the quality of logistic regression models. Entropy has previously been used in mixture-modeling to quantify how well individuals are classified into latent classes. The current study proposes the use of entropy for logistic regression models to quantify the quality of classification and separation of group membership. Entropy complements preexisting measures of data-model fit and provides unique information not contained in other measures. Hypothetical data scenarios, an applied example, and Monte Carlo simulation results are used to demonstrate the application of entropy in logistic regression. Entropy should be used in conjunction with other measures of data-model fit to assess how well logistic regression models classify cases into observed categories.
Gilbert, Paul A; Perreira, Krista; Eng, Eugenia; Rhodes, Scott D
2014-09-01
We sought to quantify the association of social stressors with alcohol use among immigrant sexual and gender minority Latinos in North Carolina (n = 190). We modeled any drinking in past year using logistic regression and heavy episodic drinking in past 30 days using Poisson regression. Despite a large proportion of abstainers, there were indications of hazardous drinking. Among current drinkers, 63% reported at least one heavy drinking episode in past 30 days. Ethnic discrimination increased, and social support decreased, odds of any drinking in past year. Social support moderated the associations of English use and ethnic discrimination with heavy episodic drinking.
Qiu, Rong Min; Tao, Ye; Zhou, Yan; Zhi, Qing Hui; Lin, Huan Cai
2016-09-01
Social support might play a role in helping people adopt healthy behaviors and improve their health. Stronger social support from mothers has been found to be positively related to higher tooth brushing frequency in 1- to 3-year-old children. However, little is known regarding the relationship between the caregiver's social support and the oral health-related behaviors of 5-year-old children in China. This study aimed to investigate this relationship. A cross-sectional study was conducted among 1332 5-year-old children and their caregivers in Guangzhou, southern China. Data were collected using questionnaires that were completed by the caregivers and the children's caries status were examined. The caregivers' social support was measured using the Social Support Rating Scale. The measurements of the children's oral health-related behaviors included the frequencies of sugary snack intake and tooth brushing, utilization of dental services, and patterns of dental visits. Univariate and multiple logistic regression analyses were used to analyze the relationships between the variables. No association was found between the caregiver's social support and the child's oral health-related behaviors in a multiple logistic regression analysis. However, other factors, particularly the oral health-related behaviors of the caregiver, were found to be significantly linked to the child's oral health-related behaviors. The oral health-related behaviors of 5-year-old children in Guangzhou are unrelated to the caregiver's social support but are related to other specific factors, particularly the caregiver's oral health-related behaviors.
ERIC Educational Resources Information Center
Pushkarskaya, Helen; Usher, Ellen L.
2010-01-01
Using a unique sample of rural Kentucky residents, we demonstrated that, in the domain of operational and competitive environmental uncertainties, self-efficacy beliefs are significantly higher among nascent entrepreneurs than among non-entrepreneurs. We employed the hierarchical logistic regression analysis to demonstrate that this result is…
Power and Sample Size Calculations for Logistic Regression Tests for Differential Item Functioning
ERIC Educational Resources Information Center
Li, Zhushan
2014-01-01
Logistic regression is a popular method for detecting uniform and nonuniform differential item functioning (DIF) effects. Theoretical formulas for the power and sample size calculations are derived for likelihood ratio tests and Wald tests based on the asymptotic distribution of the maximum likelihood estimators for the logistic regression model.…
A Methodology for Generating Placement Rules that Utilizes Logistic Regression
ERIC Educational Resources Information Center
Wurtz, Keith
2008-01-01
The purpose of this article is to provide the necessary tools for institutional researchers to conduct a logistic regression analysis and interpret the results. Aspects of the logistic regression procedure that are necessary to evaluate models are presented and discussed with an emphasis on cutoff values and choosing the appropriate number of…
John Hogland; Nedret Billor; Nathaniel Anderson
2013-01-01
Discriminant analysis, referred to as maximum likelihood classification within popular remote sensing software packages, is a common supervised technique used by analysts. Polytomous logistic regression (PLR), also referred to as multinomial logistic regression, is an alternative classification approach that is less restrictive, more flexible, and easy to interpret. To...
Postincident Support for Healthcare Workers Experiencing Occupational Violence and Aggression.
Shea, Tracey; Cooper, Brian; De Cieri, Helen; Sheehan, Cathy; Donohue, Ross; Lindsay, Sarah
2018-05-10
To investigate the relative contributions of workplace type, occupational violence and aggression (OVA) strategies and interventions along with perceptions of the occupational health and safety (OHS) environment on the likelihood of receiving postincident support following the experience of OVA. We used a cross-sectional study design with an online survey to collect data from employees in nursing and midwifery in Victoria, Australia. Survey data collected from 3,072 members of the Australian Nursing and Midwifery Federation (Victorian branch) were analyzed using logistic regression. Of the 3,072 respondents who had experienced OVA in the preceding 12 months, 1,287 (42%) reported that they had received postincident support. Hierarchical logistic regression revealed that the OHS environment was the dominant factor that predicted the likelihood of workers receiving postincident support. Working in a positive OHS environment characterized by higher levels of leading indicators of OHS, prioritization of OHS, supervisor support for safety, and team psychological safety was the stronger predictor of postincident support. Being employed in a workplace that offered training in the management and prevention of OVA also increased the likelihood of receiving postincident support. While training in the management and prevention of OVA contributed to the likelihood of receiving postincident support, a greater emphasis on the OHS environment was more important in predicting the likelihood that workers received support. This study identifies workplace practices that facilitate the provision of postincident support for healthcare workers. Facilitating effective postincident support could improve outcomes for workers, their patients and workplaces, and society in general. © 2018 Sigma Theta Tau International.
Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble
Wang, Hong; Xu, Qingsong; Zhou, Lifeng
2015-01-01
Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plausibility of ensemble learning using regularized logistic regression as the base classifier to deal with credit scoring problems. In this research, the data is first balanced and diversified by clustering and bagging algorithms. Then we apply a Lasso-logistic regression learning ensemble to evaluate the credit risks. We show that the proposed algorithm outperforms popular credit scoring models such as decision tree, Lasso-logistic regression and random forests in terms of AUC and F-measure. We also provide two importance measures for the proposed model to identify important variables in the data. PMID:25706988
ERIC Educational Resources Information Center
Brown, Ben
2009-01-01
This article provides an analysis of survey data on perceptions of student misconduct, perceived respect for teachers, and support for corporal punishment among school teachers in South Korea. The data were gathered from a survey of 110 middle and high school teachers in Gyeonggi Province, South Korea. Descriptive, chi square, logistic regression,…
Rainfall-induced Landslide Susceptibility assessment at the Longnan county
NASA Astrophysics Data System (ADS)
Hong, Haoyuan; Zhang, Ying
2017-04-01
Landslides are a serious disaster in Longnan county, China. Therefore landslide susceptibility assessment is useful tool for government or decision making. The main objective of this study is to investigate and compare the frequency ratio, support vector machines, and logistic regression. The Longnan county (Jiangxi province, China) was selected as the case study. First, the landslide inventory map with 354 landslide locations was constructed. Then landslide locations were then randomly divided into a ratio of 70/30 for the training and validating the models. Second, fourteen landslide conditioning factors were prepared such as slope, aspect, altitude, topographic wetness index (TWI), stream power index (SPI), sediment transport index (STI), plan curvature, lithology, distance to faults, distance to rivers, distance to roads, land use, normalized difference vegetation index (NDVI), and rainfall. Using the frequency ratio, support vector machines, and logistic regression, a total of three landslide susceptibility models were constructed. Finally, the overall performance of the resulting models was assessed and compared using the Receiver operating characteristic (ROC) curve technique. The result showed that the support vector machines model is the best model in the study area. The success rate is 88.39 %; and prediction rate is 84.06 %.
Sirichotiratana, Nithat; Yogi, Subash; Prutipinyo, Chardsumon
2013-08-30
This study was conducted during February-March 2012 to determine the perception and support regarding smoke-free policy among tourists at Suvarnabhumi International Airport, Bangkok, Thailand. In this cross-sectional study, 200 tourists (n = 200) were enrolled by convenience sampling and interviewed by structured questionnaire. Descriptive statistics, chi-square, and multinomial logistic regression were adopted in the study. Results revealed that half (50%) of the tourists were current smokers and 55% had visited Thailand twice or more. Three quarter (76%) of tourists indicated that they would visit Thailand again even if it had a 100% smoke-free regulation. Almost all (99%) of the tourists had supported for the smoke-free policy (partial ban and total ban), and current smokers had higher percentage of support than non-smokers. Two factors, current smoking status and knowledge level, were significantly associated with perception level. After analysis with Multinomial Logistic Regression, it was found that perception, country group, and presence of designated smoking room (DSR) were associated with smoke-free policy. Recommendation is that, at institution level effective monitoring system is needed at the airport. At policy level, the recommendation is that effective comprehensive policy needed to be emphasized to ensure smoke-free airport environment.
[Prevalence and risk factors of postpartum depression in Tianhe District of Guangzhou].
Deng, Aiwen; Jiang, Tingting; Luo, Yingping; Xiong, Ribo
2014-01-01
To investigate the prevalence and risk factors of postpartum depression (PPD) in Tianhe district of Guangzhou. A total of 1428 postpartum women in 3 hospitals in Tianhe District of Guangzhou were screened with Edinburg Postnatal Depression Scale (EPDS), Social Support Rating Scale (SSRS) and a self-designed questionnaire of PPD-related factors during the period from May to September, 2013. The prevalence of PPD was 20.03% in these women. Unconditional logistic regression analysis showed a significant correlation of PPD with education, delivery mode, only daughter, relationship between mother-in-law and daughter-in-law, newborn gender satisfaction and housing condition (P<0.05). Multivariate logistic regression analysis identified education, delivery mode, only daughter, relationship between mother-in-law and daughter-in-law, and newborn gender satisfaction as the risk factors for PPD, and housing condition was negatively correlated with the incidence of PPD with an OR value of 0.900. Compared with healthy postpartum women, the patients with PPD exhibited significantly reduced total score of social support rating scale, score of objective support, score of subjective support, and social utilization degree. The prevalence of PPD is high in Tianhe District of Guangzhou, and health education and psychosocial intervention should be offered to prevent PPD.
Yan, Ping; Yang, Yi; Zhang, Li; Li, Fuye; Huang, Amei; Wang, Yanan; Dai, Yali; Yao, Hua
2018-01-01
Abstract We aim to analyze the correlated influential factors between work-related musculoskeletal disorders (WMSDs) and nursing practice environment and quality of life and social support. From January 2015 to October 2015, cluster sampling was performed on the nurses from 12 hospitals in the 6 areas in Xinjiang. The questionnaires including the modified Nordic Musculoskeletal Questionnaire, Practice Environment Scale (PES), the Mos 36-item Short Form Health Survey, and Social Support Rating Scale were used to investigate. Multivariate logistic regression analysis was used to explore the influential factors of WMSDs. The total prevalence of WMSDs was 79.52% in the nurses ever since the working occupation, which was mainly involved waist (64.83%), neck (61.83%), and shoulder (52.36%). Multivariate logistic regression analysis indicated age (≥26 years), working in the Department of Surgery, Department of Critical Care, Outpatient Department, and Department of Anesthesia, working duration of >40 hours per week were the risk factors of WMSDs in the nurses. The physiological function (PF), body pain, total healthy condition, adequate working force and financial support, and social support were the protective factors of WMSDs. The prevalence of WMSDs in the nurses in Xinjiang Autonomous Region was high. PF, bodily pain, total healthy condition, having adequate staff and support resources to provide quality patient care, and social support were the protective factors of WMSDs in the nurses. PMID:29489648
Yan, Ping; Yang, Yi; Zhang, Li; Li, Fuye; Huang, Amei; Wang, Yanan; Dai, Yali; Yao, Hua
2018-03-01
We aim to analyze the correlated influential factors between work-related musculoskeletal disorders (WMSDs) and nursing practice environment and quality of life and social support.From January 2015 to October 2015, cluster sampling was performed on the nurses from 12 hospitals in the 6 areas in Xinjiang. The questionnaires including the modified Nordic Musculoskeletal Questionnaire, Practice Environment Scale (PES), the Mos 36-item Short Form Health Survey, and Social Support Rating Scale were used to investigate. Multivariate logistic regression analysis was used to explore the influential factors of WMSDs.The total prevalence of WMSDs was 79.52% in the nurses ever since the working occupation, which was mainly involved waist (64.83%), neck (61.83%), and shoulder (52.36%). Multivariate logistic regression analysis indicated age (≥26 years), working in the Department of Surgery, Department of Critical Care, Outpatient Department, and Department of Anesthesia, working duration of >40 hours per week were the risk factors of WMSDs in the nurses. The physiological function (PF), body pain, total healthy condition, adequate working force and financial support, and social support were the protective factors of WMSDs.The prevalence of WMSDs in the nurses in Xinjiang Autonomous Region was high. PF, bodily pain, total healthy condition, having adequate staff and support resources to provide quality patient care, and social support were the protective factors of WMSDs in the nurses.
An Entropy-Based Measure for Assessing Fuzziness in Logistic Regression
ERIC Educational Resources Information Center
Weiss, Brandi A.; Dardick, William
2016-01-01
This article introduces an entropy-based measure of data-model fit that can be used to assess the quality of logistic regression models. Entropy has previously been used in mixture-modeling to quantify how well individuals are classified into latent classes. The current study proposes the use of entropy for logistic regression models to quantify…
What Are the Odds of that? A Primer on Understanding Logistic Regression
ERIC Educational Resources Information Center
Huang, Francis L.; Moon, Tonya R.
2013-01-01
The purpose of this Methodological Brief is to present a brief primer on logistic regression, a commonly used technique when modeling dichotomous outcomes. Using data from the National Education Longitudinal Study of 1988 (NELS:88), logistic regression techniques were used to investigate student-level variables in eighth grade (i.e., enrolled in a…
Vilar-Compte, Mireya; Giraldo-Rodríguez, Liliana; Ochoa-Laginas, Adriana; Gaitan-Rossi, Pablo
2018-04-01
We assessed the association between depression and elder abuse, and the mediation effect of social support among elder women in Mexico City. A total of 526 noninstitutionalized elder women, residing in Mexico City and attending public community centers were selected. Logistic regressions and structural equation models (SEM) were estimated. One fifth of the elderly women were at risk of depression, one third suffered some type of abuse in the past 12 months, and 82% reported low social support. Logistic models confirmed that depression was statistically associated with elder abuse and vice versa (odds ratio [OR] = 1.97 and 1.96, respectively). In both models, social support significantly reduced the association between these variables leading to study these associations through SEM. This approach highlighted that social support buffers the association between depression and elder abuse. Findings underline the relevance of programs and strategies targeted at increasing social support among urban older adults.
On the Usefulness of a Multilevel Logistic Regression Approach to Person-Fit Analysis
ERIC Educational Resources Information Center
Conijn, Judith M.; Emons, Wilco H. M.; van Assen, Marcel A. L. M.; Sijtsma, Klaas
2011-01-01
The logistic person response function (PRF) models the probability of a correct response as a function of the item locations. Reise (2000) proposed to use the slope parameter of the logistic PRF as a person-fit measure. He reformulated the logistic PRF model as a multilevel logistic regression model and estimated the PRF parameters from this…
Valle, Denis; Lima, Joanna M Tucker; Millar, Justin; Amratia, Punam; Haque, Ubydul
2015-11-04
Logistic regression is a statistical model widely used in cross-sectional and cohort studies to identify and quantify the effects of potential disease risk factors. However, the impact of imperfect tests on adjusted odds ratios (and thus on the identification of risk factors) is under-appreciated. The purpose of this article is to draw attention to the problem associated with modelling imperfect diagnostic tests, and propose simple Bayesian models to adequately address this issue. A systematic literature review was conducted to determine the proportion of malaria studies that appropriately accounted for false-negatives/false-positives in a logistic regression setting. Inference from the standard logistic regression was also compared with that from three proposed Bayesian models using simulations and malaria data from the western Brazilian Amazon. A systematic literature review suggests that malaria epidemiologists are largely unaware of the problem of using logistic regression to model imperfect diagnostic test results. Simulation results reveal that statistical inference can be substantially improved when using the proposed Bayesian models versus the standard logistic regression. Finally, analysis of original malaria data with one of the proposed Bayesian models reveals that microscopy sensitivity is strongly influenced by how long people have lived in the study region, and an important risk factor (i.e., participation in forest extractivism) is identified that would have been missed by standard logistic regression. Given the numerous diagnostic methods employed by malaria researchers and the ubiquitous use of logistic regression to model the results of these diagnostic tests, this paper provides critical guidelines to improve data analysis practice in the presence of misclassification error. Easy-to-use code that can be readily adapted to WinBUGS is provided, enabling straightforward implementation of the proposed Bayesian models.
Logistic regression for risk factor modelling in stuttering research.
Reed, Phil; Wu, Yaqionq
2013-06-01
To outline the uses of logistic regression and other statistical methods for risk factor analysis in the context of research on stuttering. The principles underlying the application of a logistic regression are illustrated, and the types of questions to which such a technique has been applied in the stuttering field are outlined. The assumptions and limitations of the technique are discussed with respect to existing stuttering research, and with respect to formulating appropriate research strategies to accommodate these considerations. Finally, some alternatives to the approach are briefly discussed. The way the statistical procedures are employed are demonstrated with some hypothetical data. Research into several practical issues concerning stuttering could benefit if risk factor modelling were used. Important examples are early diagnosis, prognosis (whether a child will recover or persist) and assessment of treatment outcome. After reading this article you will: (a) Summarize the situations in which logistic regression can be applied to a range of issues about stuttering; (b) Follow the steps in performing a logistic regression analysis; (c) Describe the assumptions of the logistic regression technique and the precautions that need to be checked when it is employed; (d) Be able to summarize its advantages over other techniques like estimation of group differences and simple regression. Copyright © 2012 Elsevier Inc. All rights reserved.
Dynamic Dimensionality Selection for Bayesian Classifier Ensembles
2015-03-19
learning of weights in an otherwise generatively learned naive Bayes classifier. WANBIA-C is very cometitive to Logistic Regression but much more...classifier, Generative learning, Discriminative learning, Naïve Bayes, Feature selection, Logistic regression , higher order attribute independence 16...discriminative learning of weights in an otherwise generatively learned naive Bayes classifier. WANBIA-C is very cometitive to Logistic Regression but
Travis Woolley; David C. Shaw; Lisa M. Ganio; Stephen Fitzgerald
2012-01-01
Logistic regression models used to predict tree mortality are critical to post-fire management, planning prescribed bums and understanding disturbance ecology. We review literature concerning post-fire mortality prediction using logistic regression models for coniferous tree species in the western USA. We include synthesis and review of: methods to develop, evaluate...
Preserving Institutional Privacy in Distributed binary Logistic Regression.
Wu, Yuan; Jiang, Xiaoqian; Ohno-Machado, Lucila
2012-01-01
Privacy is becoming a major concern when sharing biomedical data across institutions. Although methods for protecting privacy of individual patients have been proposed, it is not clear how to protect the institutional privacy, which is many times a critical concern of data custodians. Built upon our previous work, Grid Binary LOgistic REgression (GLORE)1, we developed an Institutional Privacy-preserving Distributed binary Logistic Regression model (IPDLR) that considers both individual and institutional privacy for building a logistic regression model in a distributed manner. We tested our method using both simulated and clinical data, showing how it is possible to protect the privacy of individuals and of institutions using a distributed strategy.
Covariate Imbalance and Adjustment for Logistic Regression Analysis of Clinical Trial Data
Ciolino, Jody D.; Martin, Reneé H.; Zhao, Wenle; Jauch, Edward C.; Hill, Michael D.; Palesch, Yuko Y.
2014-01-01
In logistic regression analysis for binary clinical trial data, adjusted treatment effect estimates are often not equivalent to unadjusted estimates in the presence of influential covariates. This paper uses simulation to quantify the benefit of covariate adjustment in logistic regression. However, International Conference on Harmonization guidelines suggest that covariate adjustment be pre-specified. Unplanned adjusted analyses should be considered secondary. Results suggest that that if adjustment is not possible or unplanned in a logistic setting, balance in continuous covariates can alleviate some (but never all) of the shortcomings of unadjusted analyses. The case of log binomial regression is also explored. PMID:24138438
Differentially private distributed logistic regression using private and public data.
Ji, Zhanglong; Jiang, Xiaoqian; Wang, Shuang; Xiong, Li; Ohno-Machado, Lucila
2014-01-01
Privacy protecting is an important issue in medical informatics and differential privacy is a state-of-the-art framework for data privacy research. Differential privacy offers provable privacy against attackers who have auxiliary information, and can be applied to data mining models (for example, logistic regression). However, differentially private methods sometimes introduce too much noise and make outputs less useful. Given available public data in medical research (e.g. from patients who sign open-consent agreements), we can design algorithms that use both public and private data sets to decrease the amount of noise that is introduced. In this paper, we modify the update step in Newton-Raphson method to propose a differentially private distributed logistic regression model based on both public and private data. We try our algorithm on three different data sets, and show its advantage over: (1) a logistic regression model based solely on public data, and (2) a differentially private distributed logistic regression model based on private data under various scenarios. Logistic regression models built with our new algorithm based on both private and public datasets demonstrate better utility than models that trained on private or public datasets alone without sacrificing the rigorous privacy guarantee.
Deng, Yingyuan; Wang, Tianfu; Chen, Siping; Liu, Weixiang
2017-01-01
The aim of the study is to screen the significant sonographic features by logistic regression analysis and fit a model to diagnose thyroid nodules. A total of 525 pathological thyroid nodules were retrospectively analyzed. All the nodules underwent conventional ultrasonography (US), strain elastosonography (SE), and contrast -enhanced ultrasound (CEUS). Those nodules’ 12 suspicious sonographic features were used to assess thyroid nodules. The significant features of diagnosing thyroid nodules were picked out by logistic regression analysis. All variables that were statistically related to diagnosis of thyroid nodules, at a level of p < 0.05 were embodied in a logistic regression analysis model. The significant features in the logistic regression model of diagnosing thyroid nodules were calcification, suspected cervical lymph node metastasis, hypoenhancement pattern, margin, shape, vascularity, posterior acoustic, echogenicity, and elastography score. According to the results of logistic regression analysis, the formula that could predict whether or not thyroid nodules are malignant was established. The area under the receiver operating curve (ROC) was 0.930 and the sensitivity, specificity, accuracy, positive predictive value, and negative predictive value were 83.77%, 89.56%, 87.05%, 86.04%, and 87.79% respectively. PMID:29228030
Pang, Tiantian; Huang, Leidan; Deng, Yingyuan; Wang, Tianfu; Chen, Siping; Gong, Xuehao; Liu, Weixiang
2017-01-01
The aim of the study is to screen the significant sonographic features by logistic regression analysis and fit a model to diagnose thyroid nodules. A total of 525 pathological thyroid nodules were retrospectively analyzed. All the nodules underwent conventional ultrasonography (US), strain elastosonography (SE), and contrast -enhanced ultrasound (CEUS). Those nodules' 12 suspicious sonographic features were used to assess thyroid nodules. The significant features of diagnosing thyroid nodules were picked out by logistic regression analysis. All variables that were statistically related to diagnosis of thyroid nodules, at a level of p < 0.05 were embodied in a logistic regression analysis model. The significant features in the logistic regression model of diagnosing thyroid nodules were calcification, suspected cervical lymph node metastasis, hypoenhancement pattern, margin, shape, vascularity, posterior acoustic, echogenicity, and elastography score. According to the results of logistic regression analysis, the formula that could predict whether or not thyroid nodules are malignant was established. The area under the receiver operating curve (ROC) was 0.930 and the sensitivity, specificity, accuracy, positive predictive value, and negative predictive value were 83.77%, 89.56%, 87.05%, 86.04%, and 87.79% respectively.
Amini, Payam; Maroufizadeh, Saman; Samani, Reza Omani; Hamidi, Omid; Sepidarkish, Mahdi
2017-06-01
Preterm birth (PTB) is a leading cause of neonatal death and the second biggest cause of death in children under five years of age. The objective of this study was to determine the prevalence of PTB and its associated factors using logistic regression and decision tree classification methods. This cross-sectional study was conducted on 4,415 pregnant women in Tehran, Iran, from July 6-21, 2015. Data were collected by a researcher-developed questionnaire through interviews with mothers and review of their medical records. To evaluate the accuracy of the logistic regression and decision tree methods, several indices such as sensitivity, specificity, and the area under the curve were used. The PTB rate was 5.5% in this study. The logistic regression outperformed the decision tree for the classification of PTB based on risk factors. Logistic regression showed that multiple pregnancies, mothers with preeclampsia, and those who conceived with assisted reproductive technology had an increased risk for PTB ( p < 0.05). Identifying and training mothers at risk as well as improving prenatal care may reduce the PTB rate. We also recommend that statisticians utilize the logistic regression model for the classification of risk groups for PTB.
Donta, Balaiah; Dasgupta, Anindita; Ghule, Mohan; Battala, Madhusudana; Nair, Saritha; Silverman, Jay G.; Jadhav, Arun; Palaye, Prajakta; Saggurti, Niranjan; Raj, Anita
2015-01-01
Objective Evidence has linked economic hardship with increased intimate partner violence (IPV) perpetration among males. However, less is known about how economic debt or gender norms related to men's roles in relationships or the household, which often underlie IPV perpetration, intersect in or may explain these associations. We assessed the intersection of economic debt, attitudes toward gender norms, and IPV perpetration among married men in India. Methods Data were from the evaluation of a family planning intervention among young married couples (n=1,081) in rural Maharashtra, India. Crude and adjusted logistic regression models for dichotomous outcome variables and linear regression models for continuous outcomes were used to examine debt in relation to husbands' attitudes toward gender-based norms (i.e., beliefs supporting IPV and beliefs regarding male dominance in relationships and the household), as well as sexual and physical IPV perpetration. Results Twenty percent of husbands reported debt. In adjusted linear regression models, debt was associated with husbands' attitudes supportive of IPV (b=0.015, p=0.004) and norms supporting male dominance in relationships and the household (b=0.006, p=0.003). In logistic regression models adjusted for relevant demographics, debt was associated with perpetration of physical IPV (adjusted odds ratio [AOR] = 1.4, 95% confidence interval [CI] 1.1, 1.9) and sexual IPV (AOR=1.6, 95% CI 1.1, 2.1) from husbands. These findings related to debt and relation to IPV were slightly attenuated when further adjusted for men's attitudes toward gender norms. Conclusion Findings suggest the need for combined gender equity and economic promotion interventions to address high levels of debt and related IPV reported among married couples in rural India. PMID:26556938
Reed, Elizabeth; Donta, Balaiah; Dasgupta, Anindita; Ghule, Mohan; Battala, Madhusudana; Nair, Saritha; Silverman, Jay G; Jadhav, Arun; Palaye, Prajakta; Saggurti, Niranjan; Raj, Anita
2015-01-01
Evidence has linked economic hardship with increased intimate partner violence (IPV) perpetration among males. However, less is known about how economic debt or gender norms related to men's roles in relationships or the household, which often underlie IPV perpetration, intersect in or may explain these associations. We assessed the intersection of economic debt, attitudes toward gender norms, and IPV perpetration among married men in India. Data were from the evaluation of a family planning intervention among young married couples (n=1,081) in rural Maharashtra, India. Crude and adjusted logistic regression models for dichotomous outcome variables and linear regression models for continuous outcomes were used to examine debt in relation to husbands' attitudes toward gender-based norms (i.e., beliefs supporting IPV and beliefs regarding male dominance in relationships and the household), as well as sexual and physical IPV perpetration. Twenty percent of husbands reported debt. In adjusted linear regression models, debt was associated with husbands' attitudes supportive of IPV (b=0.015, p=0.004) and norms supporting male dominance in relationships and the household (b=0.006, p=0.003). In logistic regression models adjusted for relevant demographics, debt was associated with perpetration of physical IPV (adjusted odds ratio [AOR] = 1.4, 95% confidence interval [CI] 1.1, 1.9) and sexual IPV (AOR=1.6, 95% CI 1.1, 2.1) from husbands. These findings related to debt and relation to IPV were slightly attenuated when further adjusted for men's attitudes toward gender norms. Findings suggest the need for combined gender equity and economic promotion interventions to address high levels of debt and related IPV reported among married couples in rural India.
Efficace, Fabio; Breccia, Massimo; Cottone, Francesco; Okumura, Iris; Doro, Maribel; Riccardi, Francesca; Rosti, Gianantonio; Baccarani, Michele
2016-12-01
The main objective of this study was to investigate whether social support is independently associated with psychological well-being in chronic myeloid leukemia (CML) patients. Secondary objectives were to compare the psychological well-being profile of CML patients with that of their peers in general population and to examine possible age- and sex-related differences. Analysis was performed on 417 patients in treatment with lifelong molecularly targeted therapies. Mean age of patients analyzed was 56 years (range 19-87 years) and 247 (59 %) were male and 170 (41 %) were female. Social support was assessed with the Multidimensional Scale of Perceived Social Support and psychological well-being was evaluated with the short version of the Psychological General Well-Being Index. Descriptive statistics and multivariate logistic regression analyses were used. Multivariate logistic regression analysis revealed that a greater social support was independently associated with lower anxiety and depression, as well as with higher positive well-being, self-control, and vitality (p < 0.001). Female patients reported statistically significant worse outcomes in all dimensions of psychological well-being. Age- and sex-adjusted comparisons with population norms revealed that depression (ES = -0.42, p < 0.001) and self-control (ES = -0.48, p < 0.001) were the two main impaired psychological dimensions. This study indicates that social support is a critical factor associated with psychological well-being of CML patients treated with modern lifelong targeted therapies.
Baiden, Philip; Fallon, Barbara; Antwi-Boasiako, Kofi
2017-11-16
To examine the proportion of Canadian adults with a history of child abuse who disclosed the abuse to child protection services before age 16 years and identify the effect of social support and disclosure of child abuse on lifetime suicidal ideation. Data for this study came from the Statistics Canada 2012 Canadian Community Health Survey-Mental Health (N = 9,076). Binary logistic regression was conducted to identify the effect of social support and disclosure of child abuse on suicidal ideation while simultaneously adjusting for the effect of type of child abuse and demographic, socioeconomic, health, and mental health factors. Of the 9,076 respondents who experienced at least one child abuse event, 21.5% reported ever experiencing suicidal ideation. Fewer than 6% of the respondents disclosed the abuse to someone from a child protection service before age 16 years. In the multivariate logistic regression model, respondents who disclosed the abuse to someone from child protection services were 1.37 times more likely to report lifetime suicidal ideation (95% CI, 1.10-1.71) than those who did not. Each additional unit increase in social support decreased the odds of lifetime suicidal ideation by a factor of 3% (95% CI, 0.95-0.98). Social support interventions that are effective in improving individuals' perception that support is available to them may help reduce suicidal ideation among those with a history of child abuse. © Copyright 2017 Physicians Postgraduate Press, Inc.
Data mining: Potential applications in research on nutrition and health.
Batterham, Marijka; Neale, Elizabeth; Martin, Allison; Tapsell, Linda
2017-02-01
Data mining enables further insights from nutrition-related research, but caution is required. The aim of this analysis was to demonstrate and compare the utility of data mining methods in classifying a categorical outcome derived from a nutrition-related intervention. Baseline data (23 variables, 8 categorical) on participants (n = 295) in an intervention trial were used to classify participants in terms of meeting the criteria of achieving 10 000 steps per day. Results from classification and regression trees (CARTs), random forests, adaptive boosting, logistic regression, support vector machines and neural networks were compared using area under the curve (AUC) and error assessments. The CART produced the best model when considering the AUC (0.703), overall error (18%) and within class error (28%). Logistic regression also performed reasonably well compared to the other models (AUC 0.675, overall error 23%, within class error 36%). All the methods gave different rankings of variables' importance. CART found that body fat, quality of life using the SF-12 Physical Component Summary (PCS) and the cholesterol: HDL ratio were the most important predictors of meeting the 10 000 steps criteria, while logistic regression showed the SF-12PCS, glucose levels and level of education to be the most significant predictors (P ≤ 0.01). Differing outcomes suggest caution is required with a single data mining method, particularly in a dataset with nonlinear relationships and outliers and when exploring relationships that were not the primary outcomes of the research. © 2017 Dietitians Association of Australia.
Goltz, Annemarie; Janowitz, Deborah; Hannemann, Anke; Nauck, Matthias; Hoffmann, Johanna; Seyfart, Tom; Völzke, Henry; Terock, Jan; Grabe, Hans Jörgen
2018-06-19
Depression and obesity are widespread and closely linked. Brain-derived neurotrophic factor (BDNF) and vitamin D are both assumed to be associated with depression and obesity. Little is known about the interplay between vitamin D and BDNF. We explored the putative associations and interactions between serum BDNF and vitamin D levels with depressive symptoms and abdominal obesity in a large population-based cohort. Data were obtained from the population-based Study of Health in Pomerania (SHIP)-Trend (n = 3,926). The associations of serum BDNF and vitamin D levels with depressive symptoms (measured using the Patient Health Questionnaire) were assessed with binary and multinomial logistic regression models. The associations of serum BDNF and vitamin D levels with obesity (measured by the waist-to-hip ratio [WHR]) were assessed with binary logistic and linear regression models with restricted cubic splines. Logistic regression models revealed inverse associations of vitamin D with depression (OR = 0.966; 95% CI 0.951-0.981) and obesity (OR = 0.976; 95% CI 0.967-0.985). No linear association of serum BDNF with depression or obesity was found. However, linear regression models revealed a U-shaped association of BDNF with WHR (p < 0.001). Vitamin D was inversely associated with depression and obesity. BDNF was associated with abdominal obesity, but not with depression. At the population level, our results support the relevant roles of vitamin D and BDNF in mental and physical health-related outcomes. © 2018 S. Karger AG, Basel.
Biomass Stoves and Lens Opacity and Cataract in Nepalese Women
Pokhrel, Amod K.; Bates, Michael N.; Shrestha, Sachet P.; Bailey, Ian L.; DiMartino, Robert B.; Smith, Kirk R.; Joshi, N. D.
2014-01-01
Purpose Cataract is the most prevalent cause of blindness in Nepal. Several epidemiologic studies have associated cataracts with use of biomass cookstoves. These studies, however, have had limitations, including potential control selection bias and limited adjustment for possible confounding. This study, in Pokhara city, in an area of Nepal where biomass cookstoves are widely used without direct venting of the smoke to the outdoors, focuses on pre-clinical measures of opacity, while avoiding selection bias and taking into account comprehensive data on potential confounding factors Methods Using a cross-sectional study design, severity of lenticular damage, judged on the LOCS III scales, was investigated in females (n=143), aged 20-65 years, without previously diagnosed cataract. Linear and logistic regression analyses were used to examine the relationships with stove type and length of use. Clinically significant cataract, used in the logistic regression models, was defined as a LOCS III score > 2. Results Using gas cookstoves as the reference group, logistic regression analysis for nuclear cataract showed the evidence of relationships with stove type: for biomass stoves, the odds ratio (OR) was 2.58 (95% confidence interval [CI]: 1.22-5.46) and, for kerosene stoves, the OR was 5.18 (95% CI: 0.88-30.38). Similar results were found for nuclear color (LOCS III score > 2), but no association was found with cortical cataracts. Supporting a relationship between biomass stoves and nuclear cataract was a trend with years of exposure to biomass cookstoves (p=0.01). Linear regression analyses did not show clear evidence of an association between lenticular damage and stove types. Biomass fuel used for heating was not associated with any form of opacity. Conclusions This study provides support for associations of biomass and kerosene cookstoves with nuclear opacity and change in nuclear color. The novel associations with kerosene cookstove use deserve further investigation. PMID:23400024
Unequal views of inequality: Cross-national support for redistribution 1985-2011.
VanHeuvelen, Tom
2017-05-01
This research examines public views on government responsibility to reduce income inequality, support for redistribution. While individual-level correlates of support for redistribution are relatively well understood, many questions remain at the country-level. Therefore, I examine how country-level characteristics affect aggregate support for redistribution. I test explanations of aggregate support using a unique dataset combining 18 waves of the International Social Survey Programme and European Social Survey. Results from mixed-effects logistic regression and fixed-effects linear regression models show two primary and contrasting effects. States that reduce inequality through bundles of tax and transfer policies are rewarded with more supportive publics. In contrast, economic development has a seemingly equivalent and dampening effect on public support. Importantly, the effect of economic development grows at higher levels of development, potentially overwhelming the amplifying effect of state redistribution. My results therefore suggest a fundamental challenge to proponents of egalitarian politics. Copyright © 2016 Elsevier Inc. All rights reserved.
Logistic regression for dichotomized counts.
Preisser, John S; Das, Kalyan; Benecha, Habtamu; Stamm, John W
2016-12-01
Sometimes there is interest in a dichotomized outcome indicating whether a count variable is positive or zero. Under this scenario, the application of ordinary logistic regression may result in efficiency loss, which is quantifiable under an assumed model for the counts. In such situations, a shared-parameter hurdle model is investigated for more efficient estimation of regression parameters relating to overall effects of covariates on the dichotomous outcome, while handling count data with many zeroes. One model part provides a logistic regression containing marginal log odds ratio effects of primary interest, while an ancillary model part describes the mean count of a Poisson or negative binomial process in terms of nuisance regression parameters. Asymptotic efficiency of the logistic model parameter estimators of the two-part models is evaluated with respect to ordinary logistic regression. Simulations are used to assess the properties of the models with respect to power and Type I error, the latter investigated under both misspecified and correctly specified models. The methods are applied to data from a randomized clinical trial of three toothpaste formulations to prevent incident dental caries in a large population of Scottish schoolchildren. © The Author(s) 2014.
Interpretation of commonly used statistical regression models.
Kasza, Jessica; Wolfe, Rory
2014-01-01
A review of some regression models commonly used in respiratory health applications is provided in this article. Simple linear regression, multiple linear regression, logistic regression and ordinal logistic regression are considered. The focus of this article is on the interpretation of the regression coefficients of each model, which are illustrated through the application of these models to a respiratory health research study. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.
Choi, Seung Hoan; Labadorf, Adam T; Myers, Richard H; Lunetta, Kathryn L; Dupuis, Josée; DeStefano, Anita L
2017-02-06
Next generation sequencing provides a count of RNA molecules in the form of short reads, yielding discrete, often highly non-normally distributed gene expression measurements. Although Negative Binomial (NB) regression has been generally accepted in the analysis of RNA sequencing (RNA-Seq) data, its appropriateness has not been exhaustively evaluated. We explore logistic regression as an alternative method for RNA-Seq studies designed to compare cases and controls, where disease status is modeled as a function of RNA-Seq reads using simulated and Huntington disease data. We evaluate the effect of adjusting for covariates that have an unknown relationship with gene expression. Finally, we incorporate the data adaptive method in order to compare false positive rates. When the sample size is small or the expression levels of a gene are highly dispersed, the NB regression shows inflated Type-I error rates but the Classical logistic and Bayes logistic (BL) regressions are conservative. Firth's logistic (FL) regression performs well or is slightly conservative. Large sample size and low dispersion generally make Type-I error rates of all methods close to nominal alpha levels of 0.05 and 0.01. However, Type-I error rates are controlled after applying the data adaptive method. The NB, BL, and FL regressions gain increased power with large sample size, large log2 fold-change, and low dispersion. The FL regression has comparable power to NB regression. We conclude that implementing the data adaptive method appropriately controls Type-I error rates in RNA-Seq analysis. Firth's logistic regression provides a concise statistical inference process and reduces spurious associations from inaccurately estimated dispersion parameters in the negative binomial framework.
Differentially private distributed logistic regression using private and public data
2014-01-01
Background Privacy protecting is an important issue in medical informatics and differential privacy is a state-of-the-art framework for data privacy research. Differential privacy offers provable privacy against attackers who have auxiliary information, and can be applied to data mining models (for example, logistic regression). However, differentially private methods sometimes introduce too much noise and make outputs less useful. Given available public data in medical research (e.g. from patients who sign open-consent agreements), we can design algorithms that use both public and private data sets to decrease the amount of noise that is introduced. Methodology In this paper, we modify the update step in Newton-Raphson method to propose a differentially private distributed logistic regression model based on both public and private data. Experiments and results We try our algorithm on three different data sets, and show its advantage over: (1) a logistic regression model based solely on public data, and (2) a differentially private distributed logistic regression model based on private data under various scenarios. Conclusion Logistic regression models built with our new algorithm based on both private and public datasets demonstrate better utility than models that trained on private or public datasets alone without sacrificing the rigorous privacy guarantee. PMID:25079786
Park, Ji Hyun; Kim, Hyeon-Young; Lee, Hanna; Yun, Eun Kyoung
2015-12-01
This study compares the performance of the logistic regression and decision tree analysis methods for assessing the risk factors for infection in cancer patients undergoing chemotherapy. The subjects were 732 cancer patients who were receiving chemotherapy at K university hospital in Seoul, Korea. The data were collected between March 2011 and February 2013 and were processed for descriptive analysis, logistic regression and decision tree analysis using the IBM SPSS Statistics 19 and Modeler 15.1 programs. The most common risk factors for infection in cancer patients receiving chemotherapy were identified as alkylating agents, vinca alkaloid and underlying diabetes mellitus. The logistic regression explained 66.7% of the variation in the data in terms of sensitivity and 88.9% in terms of specificity. The decision tree analysis accounted for 55.0% of the variation in the data in terms of sensitivity and 89.0% in terms of specificity. As for the overall classification accuracy, the logistic regression explained 88.0% and the decision tree analysis explained 87.2%. The logistic regression analysis showed a higher degree of sensitivity and classification accuracy. Therefore, logistic regression analysis is concluded to be the more effective and useful method for establishing an infection prediction model for patients undergoing chemotherapy. Copyright © 2015 Elsevier Ltd. All rights reserved.
Yang, Lixue; Chen, Kean
2015-11-01
To improve the design of underwater target recognition systems based on auditory perception, this study compared human listeners with automatic classifiers. Performances measures and strategies in three discrimination experiments, including discriminations between man-made and natural targets, between ships and submarines, and among three types of ships, were used. In the experiments, the subjects were asked to assign a score to each sound based on how confident they were about the category to which it belonged, and logistic regression, which represents linear discriminative models, also completed three similar tasks by utilizing many auditory features. The results indicated that the performances of logistic regression improved as the ratio between inter- and intra-class differences became larger, whereas the performances of the human subjects were limited by their unfamiliarity with the targets. Logistic regression performed better than the human subjects in all tasks but the discrimination between man-made and natural targets, and the strategies employed by excellent human subjects were similar to that of logistic regression. Logistic regression and several human subjects demonstrated similar performances when discriminating man-made and natural targets, but in this case, their strategies were not similar. An appropriate fusion of their strategies led to further improvement in recognition accuracy.
NASA Astrophysics Data System (ADS)
Mei, Zhixiong; Wu, Hao; Li, Shiyun
2018-06-01
The Conversion of Land Use and its Effects at Small regional extent (CLUE-S), which is a widely used model for land-use simulation, utilizes logistic regression to estimate the relationships between land use and its drivers, and thus, predict land-use change probabilities. However, logistic regression disregards possible spatial autocorrelation and self-organization in land-use data. Autologistic regression can depict spatial autocorrelation but cannot address self-organization, while logistic regression by considering only self-organization (NElogistic regression) fails to capture spatial autocorrelation. Therefore, this study developed a regression (NE-autologistic regression) method, which incorporated both spatial autocorrelation and self-organization, to improve CLUE-S. The Zengcheng District of Guangzhou, China was selected as the study area. The land-use data of 2001, 2005, and 2009, as well as 10 typical driving factors, were used to validate the proposed regression method and the improved CLUE-S model. Then, three future land-use scenarios in 2020: the natural growth scenario, ecological protection scenario, and economic development scenario, were simulated using the improved model. Validation results showed that NE-autologistic regression performed better than logistic regression, autologistic regression, and NE-logistic regression in predicting land-use change probabilities. The spatial allocation accuracy and kappa values of NE-autologistic-CLUE-S were higher than those of logistic-CLUE-S, autologistic-CLUE-S, and NE-logistic-CLUE-S for the simulations of two periods, 2001-2009 and 2005-2009, which proved that the improved CLUE-S model achieved the best simulation and was thereby effective to a certain extent. The scenario simulation results indicated that under all three scenarios, traffic land and residential/industrial land would increase, whereas arable land and unused land would decrease during 2009-2020. Apparent differences also existed in the simulated change sizes and locations of each land-use type under different scenarios. The results not only demonstrate the validity of the improved model but also provide a valuable reference for relevant policy-makers.
Induced abortion: risk factors for adolescent female students, a Brazilian study.
Correia, Divanise S; Cavalcante, Jairo C; Maia, Eulália M C
2009-12-16
The purpose of this study was to analyze risk factors for abortion among female teenagers from 12 to 19 years of age in the city of Maceió, Brazil. This is a cross-sectional study, conducted in ten schools. The sample was calculated by considering the number of admissions for postabortion curettage, obtained from the Information System of Hospitalization. Data were obtained through a semi-structured questionnaire divided into three basic blocks of data: sociodemographic, sexual life, and pregnancy/abortion. To analyze the data, the logistic regression model was used. The Forward Method was chosen to set the final model that minimizes the number of variables and maximizes the accuracy of the model. The significant analysis between the dichotomous variables provided eight significant variables. Two of them are protective for abortion: the ages 12-14 years and talking with parents about sex. After the logistic regression, the receipt of support for abortion was the most significant variable of all. The adolescent with an active sexual life, a previous pregnancy, who is married, and has received support for an abortion has a 99.74% probability for an abortion. The results of this study, demonstrating the importance of the group in adolescence, and the statistical significance of having a partner to support and approve the pregnancy appears as a preventive factor for abortion. It shows the importance of support and companionship for adolescent women.
Social support and amphetamine-type stimulant use among female sex workers in China.
Zhao, Qun; Mao, Yuchen; Li, Xiaoming; Zhou, Yuejiao; Shen, Zhiyong
2017-10-01
Existing research has suggested a positive role of social support in reducing drug use among female sex workers (FSWs). However, there is limited research on the role of social support in amphetamine-type stimulant (ATS) use among FSWs in China. This study explored the present situation of ATS use among FSWs in Guangxi, China and examined the associations of different types of social support from different sources with ATS use. A sample of 1022 FSWs was recruited from 56 commercial sex venues in Guangxi Autonomous Region in China. Bivariate comparison was used to compare demographic characteristics and source of emotional or tangible social support across frequency of ATS use among FSWs. The relationship between social support and ATS use was examined using multiple ordinal logistic regression models controlling for the potential confounding effects of demographic variables. The multiple ordinal logistic regression indicated that FSWs who were from younger age groups (aOR = 10.88 for age group <20; aOR = 2.80 for age group 20-23), and from all higher-income venues (aOR = 1.96 for venue level 1; aOR = 2.28 for venue level 2; aOR = 1.81 for venue level 3) tended to use ATS more frequently. They also tended to use ATS more frequently when they depended on their boyfriends (aOR = 1.08) for emotional support or on their co-workers for tangible support (aOR = 1.17). Different types of social support from different sources can be either positively or negatively associated with ATS use among FSWs, therefore, the future intervention efforts should differentiate and target different types and different sources of social support in response to the living and work conditions of FSWs.
Social support and support groups among people with HIV/AIDS in Ghana.
Abrefa-Gyan, Tina; Wu, Liyun; Lewis, Marilyn W
2016-01-01
HIV/AIDS, a chronic burden in Ghana, poses social and health outcome concerns to those infected. Examining the Medical Outcome Study Social Support Survey (MOS-SSS) instrument among 300 Ghanaians from a cross-sectional design, Principal Component Analysis yielded four factors (positive interaction, trust building, information giving, and essential support), which accounted for 85.73% of the total variance in the MOS-SSS. A logistic regression analysis showed that essential support was the strongest predictor of the length of time an individual stayed in the support group, whereas positive interaction indicated negative association. The study's implications for policy, research, and practice were discussed.
Unitary Response Regression Models
ERIC Educational Resources Information Center
Lipovetsky, S.
2007-01-01
The dependent variable in a regular linear regression is a numerical variable, and in a logistic regression it is a binary or categorical variable. In these models the dependent variable has varying values. However, there are problems yielding an identity output of a constant value which can also be modelled in a linear or logistic regression with…
Binary logistic regression-Instrument for assessing museum indoor air impact on exhibits.
Bucur, Elena; Danet, Andrei Florin; Lehr, Carol Blaziu; Lehr, Elena; Nita-Lazar, Mihai
2017-04-01
This paper presents a new way to assess the environmental impact on historical artifacts using binary logistic regression. The prediction of the impact on the exhibits during certain pollution scenarios (environmental impact) was calculated by a mathematical model based on the binary logistic regression; it allows the identification of those environmental parameters from a multitude of possible parameters with a significant impact on exhibitions and ranks them according to their severity effect. Air quality (NO 2 , SO 2 , O 3 and PM 2.5 ) and microclimate parameters (temperature, humidity) monitoring data from a case study conducted within exhibition and storage spaces of the Romanian National Aviation Museum Bucharest have been used for developing and validating the binary logistic regression method and the mathematical model. The logistic regression analysis was used on 794 data combinations (715 to develop of the model and 79 to validate it) by a Statistical Package for Social Sciences (SPSS 20.0). The results from the binary logistic regression analysis demonstrated that from six parameters taken into consideration, four of them present a significant effect upon exhibits in the following order: O 3 >PM 2.5 >NO 2 >humidity followed at a significant distance by the effects of SO 2 and temperature. The mathematical model, developed in this study, correctly predicted 95.1 % of the cumulated effect of the environmental parameters upon the exhibits. Moreover, this model could also be used in the decisional process regarding the preventive preservation measures that should be implemented within the exhibition space. The paper presents a new way to assess the environmental impact on historical artifacts using binary logistic regression. The mathematical model developed on the environmental parameters analyzed by the binary logistic regression method could be useful in a decision-making process establishing the best measures for pollution reduction and preventive preservation of exhibits.
Determining factors influencing survival of breast cancer by fuzzy logistic regression model.
Nikbakht, Roya; Bahrampour, Abbas
2017-01-01
Fuzzy logistic regression model can be used for determining influential factors of disease. This study explores the important factors of actual predictive survival factors of breast cancer's patients. We used breast cancer data which collected by cancer registry of Kerman University of Medical Sciences during the period of 2000-2007. The variables such as morphology, grade, age, and treatments (surgery, radiotherapy, and chemotherapy) were applied in the fuzzy logistic regression model. Performance of model was determined in terms of mean degree of membership (MDM). The study results showed that almost 41% of patients were in neoplasm and malignant group and more than two-third of them were still alive after 5-year follow-up. Based on the fuzzy logistic model, the most important factors influencing survival were chemotherapy, morphology, and radiotherapy, respectively. Furthermore, the MDM criteria show that the fuzzy logistic regression have a good fit on the data (MDM = 0.86). Fuzzy logistic regression model showed that chemotherapy is more important than radiotherapy in survival of patients with breast cancer. In addition, another ability of this model is calculating possibilistic odds of survival in cancer patients. The results of this study can be applied in clinical research. Furthermore, there are few studies which applied the fuzzy logistic models. Furthermore, we recommend using this model in various research areas.
Chien, Li-Yin; Tai, Chen-Jei; Yeh, Mei-Chiang
2012-01-01
Domestic decision-making power is an integral part of women's empowerment. No study has linked domestic decision-making power and social support concurrently to postpartum depression and compared these between immigrant and native populations. The aim of this study was to examine domestic decision-making power and social support and their relationship to postpartum depressive symptoms among immigrant and native women in Taiwan. This cross-sectional survey included 190 immigrant and 190 native women who had delivered healthy babies during the past year in Taipei City. Depression was measured using the Edinburgh Postnatal Depression Scale, with a cutoff score of 10. Logistic regression was used to determine the factors associated with postpartum depression symptoms. Immigrant mothers had significantly higher prevalence of postpartum depression symptoms (41.1% vs. 8.4%) and had significantly lower levels of domestic decision-making power and social support than native mothers did. Logistic regression showed that insufficient family income was associated with an increased risk of postpartum depression symptoms, whereas social support and domestic decision-making power levels were associated negatively with postpartum depression symptoms. After accounting for these factors, immigrant women remained at higher risk of postpartum depression symptoms than native women did, odds ratio = 2.59, 95% CI [1.27, 5.28]. Domestic decision-making power and social support are independent protective factors for postpartum depression symptoms among immigrant and native women in Taiwan. Social support and empowerment interventions should be tested to discover whether they are able to prevent or alleviate postpartum depression symptoms, with special emphasis on immigrant mothers.
Sirichotiratana, Nithat; Yogi, Subash; Prutipinyo, Chardsumon
2013-01-01
This study was conducted during February-March 2012 to determine the perception and support regarding smoke-free policy among tourists at Suvarnabhumi International Airport, Bangkok, Thailand. In this cross-sectional study, 200 tourists (n = 200) were enrolled by convenience sampling and interviewed by structured questionnaire. Descriptive statistics, chi-square, and multinomial logistic regression were adopted in the study. Results revealed that half (50%) of the tourists were current smokers and 55% had visited Thailand twice or more. Three quarter (76%) of tourists indicated that they would visit Thailand again even if it had a 100% smoke-free regulation. Almost all (99%) of the tourists had supported for the smoke-free policy (partial ban and total ban), and current smokers had higher percentage of support than non-smokers. Two factors, current smoking status and knowledge level, were significantly associated with perception level. After analysis with Multinomial Logistic Regression, it was found that perception, country group, and presence of designated smoking room (DSR) were associated with smoke-free policy. Recommendation is that, at institution level effective monitoring system is needed at the airport. At policy level, the recommendation is that effective comprehensive policy needed to be emphasized to ensure smoke-free airport environment. PMID:23999549
ERIC Educational Resources Information Center
Lubbers, Marcel; Jaspers, Eva; Ultee, Wout
2009-01-01
Two years after the legalization of same-sex marriages in the Netherlands, 65% of the Dutch population largely or completely disagrees with the statement "gay marriage should be abolished." This article shows, by way of multinomial logistic regression analysis of survey data, which socializing agents influence one's attitude toward…
Prediction models for clustered data: comparison of a random intercept and standard regression model
2013-01-01
Background When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. Methods Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. Results The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. Conclusion The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters. PMID:23414436
Bouwmeester, Walter; Twisk, Jos W R; Kappen, Teus H; van Klei, Wilton A; Moons, Karel G M; Vergouwe, Yvonne
2013-02-15
When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters.
NASA Astrophysics Data System (ADS)
Xu, Chao; Zhou, Dongxiang; Zhai, Yongping; Liu, Yunhui
2015-12-01
This paper realizes the automatic segmentation and classification of Mycobacterium tuberculosis with conventional light microscopy. First, the candidate bacillus objects are segmented by the marker-based watershed transform. The markers are obtained by an adaptive threshold segmentation based on the adaptive scale Gaussian filter. The scale of the Gaussian filter is determined according to the color model of the bacillus objects. Then the candidate objects are extracted integrally after region merging and contaminations elimination. Second, the shape features of the bacillus objects are characterized by the Hu moments, compactness, eccentricity, and roughness, which are used to classify the single, touching and non-bacillus objects. We evaluated the logistic regression, random forest, and intersection kernel support vector machines classifiers in classifying the bacillus objects respectively. Experimental results demonstrate that the proposed method yields to high robustness and accuracy. The logistic regression classifier performs best with an accuracy of 91.68%.
Binary logistic regression modelling: Measuring the probability of relapse cases among drug addict
NASA Astrophysics Data System (ADS)
Ismail, Mohd Tahir; Alias, Siti Nor Shadila
2014-07-01
For many years Malaysia faced the drug addiction issues. The most serious case is relapse phenomenon among treated drug addict (drug addict who have under gone the rehabilitation programme at Narcotic Addiction Rehabilitation Centre, PUSPEN). Thus, the main objective of this study is to find the most significant factor that contributes to relapse to happen. The binary logistic regression analysis was employed to model the relationship between independent variables (predictors) and dependent variable. The dependent variable is the status of the drug addict either relapse, (Yes coded as 1) or not, (No coded as 0). Meanwhile the predictors involved are age, age at first taking drug, family history, education level, family crisis, community support and self motivation. The total of the sample is 200 which the data are provided by AADK (National Antidrug Agency). The finding of the study revealed that age and self motivation are statistically significant towards the relapse cases..
Austin, Peter C.; Tu, Jack V.; Ho, Jennifer E.; Levy, Daniel; Lee, Douglas S.
2014-01-01
Objective Physicians classify patients into those with or without a specific disease. Furthermore, there is often interest in classifying patients according to disease etiology or subtype. Classification trees are frequently used to classify patients according to the presence or absence of a disease. However, classification trees can suffer from limited accuracy. In the data-mining and machine learning literature, alternate classification schemes have been developed. These include bootstrap aggregation (bagging), boosting, random forests, and support vector machines. Study design and Setting We compared the performance of these classification methods with those of conventional classification trees to classify patients with heart failure according to the following sub-types: heart failure with preserved ejection fraction (HFPEF) vs. heart failure with reduced ejection fraction (HFREF). We also compared the ability of these methods to predict the probability of the presence of HFPEF with that of conventional logistic regression. Results We found that modern, flexible tree-based methods from the data mining literature offer substantial improvement in prediction and classification of heart failure sub-type compared to conventional classification and regression trees. However, conventional logistic regression had superior performance for predicting the probability of the presence of HFPEF compared to the methods proposed in the data mining literature. Conclusion The use of tree-based methods offers superior performance over conventional classification and regression trees for predicting and classifying heart failure subtypes in a population-based sample of patients from Ontario. However, these methods do not offer substantial improvements over logistic regression for predicting the presence of HFPEF. PMID:23384592
Chen, Ren; Tao, Feng; Ma, Ying; Zhong, Liqin; Qin, Xia; Hu, Zhi
2014-01-01
The aim of this study was to investigate the association between social support and AIDS high-risk behaviors in commercial sex workers (CSWs) in China. A cross-sectional study was performed based on a convenience sample. Data were collected through questionnaire interviews including information about social demographic characteristics, the Social Support Rating Scale (SSRS) and AIDS knowledge. Multiple logistic regression was performed to evaluate the association between social support and AIDS high-risk behaviors, specifically condom use during commercial sex. A total of 581 commercial sex workers from 4 counties in East China participated in the study. The majority of the participants were 15 to 30 years old (79.7%). Sources of individual and family support were mainly provided by their parents (50.3%), relatives and friends (46.3%), spouses (18.4%), respectively. Univariate analysis revealed that marital status, hobbies, smoking habit, individual monthly income and family monthly income were all significantly correlated with current levels of social support being received (P = 0.04, P = 0.00, P = 0.01, P = 0.01, P = 0.01, respectively). Furthermore, Multiple logistic regression analysis indicated that after adjusting for confounding factors, high levels of social support were significantly correlated with increased condom use at the last sexual encounter (P = 0.02, OR = 1.86, 95%CI: 1.10-3.16); and consistently in the past month with clients (P = 0.03, OR = 2.10, 95%CI: 1.09-4.04). CSWs with high levels of social support are more likely to use condoms during commercial sex. This suggests that increasing social support can potentially reduce AIDS-related high-risk behaviors and accordingly play an important role in AIDS prevention.
Mixed conditional logistic regression for habitat selection studies.
Duchesne, Thierry; Fortin, Daniel; Courbin, Nicolas
2010-05-01
1. Resource selection functions (RSFs) are becoming a dominant tool in habitat selection studies. RSF coefficients can be estimated with unconditional (standard) and conditional logistic regressions. While the advantage of mixed-effects models is recognized for standard logistic regression, mixed conditional logistic regression remains largely overlooked in ecological studies. 2. We demonstrate the significance of mixed conditional logistic regression for habitat selection studies. First, we use spatially explicit models to illustrate how mixed-effects RSFs can be useful in the presence of inter-individual heterogeneity in selection and when the assumption of independence from irrelevant alternatives (IIA) is violated. The IIA hypothesis states that the strength of preference for habitat type A over habitat type B does not depend on the other habitat types also available. Secondly, we demonstrate the significance of mixed-effects models to evaluate habitat selection of free-ranging bison Bison bison. 3. When movement rules were homogeneous among individuals and the IIA assumption was respected, fixed-effects RSFs adequately described habitat selection by simulated animals. In situations violating the inter-individual homogeneity and IIA assumptions, however, RSFs were best estimated with mixed-effects regressions, and fixed-effects models could even provide faulty conclusions. 4. Mixed-effects models indicate that bison did not select farmlands, but exhibited strong inter-individual variations in their response to farmlands. Less than half of the bison preferred farmlands over forests. Conversely, the fixed-effect model simply suggested an overall selection for farmlands. 5. Conditional logistic regression is recognized as a powerful approach to evaluate habitat selection when resource availability changes. This regression is increasingly used in ecological studies, but almost exclusively in the context of fixed-effects models. Fitness maximization can imply differences in trade-offs among individuals, which can yield inter-individual differences in selection and lead to departure from IIA. These situations are best modelled with mixed-effects models. Mixed-effects conditional logistic regression should become a valuable tool for ecological research.
Advanced colorectal neoplasia risk stratification by penalized logistic regression.
Lin, Yunzhi; Yu, Menggang; Wang, Sijian; Chappell, Richard; Imperiale, Thomas F
2016-08-01
Colorectal cancer is the second leading cause of death from cancer in the United States. To facilitate the efficiency of colorectal cancer screening, there is a need to stratify risk for colorectal cancer among the 90% of US residents who are considered "average risk." In this article, we investigate such risk stratification rules for advanced colorectal neoplasia (colorectal cancer and advanced, precancerous polyps). We use a recently completed large cohort study of subjects who underwent a first screening colonoscopy. Logistic regression models have been used in the literature to estimate the risk of advanced colorectal neoplasia based on quantifiable risk factors. However, logistic regression may be prone to overfitting and instability in variable selection. Since most of the risk factors in our study have several categories, it was tempting to collapse these categories into fewer risk groups. We propose a penalized logistic regression method that automatically and simultaneously selects variables, groups categories, and estimates their coefficients by penalizing the [Formula: see text]-norm of both the coefficients and their differences. Hence, it encourages sparsity in the categories, i.e. grouping of the categories, and sparsity in the variables, i.e. variable selection. We apply the penalized logistic regression method to our data. The important variables are selected, with close categories simultaneously grouped, by penalized regression models with and without the interactions terms. The models are validated with 10-fold cross-validation. The receiver operating characteristic curves of the penalized regression models dominate the receiver operating characteristic curve of naive logistic regressions, indicating a superior discriminative performance. © The Author(s) 2013.
Rupert, Michael G.; Cannon, Susan H.; Gartner, Joseph E.
2003-01-01
Logistic regression was used to predict the probability of debris flows occurring in areas recently burned by wildland fires. Multiple logistic regression is conceptually similar to multiple linear regression because statistical relations between one dependent variable and several independent variables are evaluated. In logistic regression, however, the dependent variable is transformed to a binary variable (debris flow did or did not occur), and the actual probability of the debris flow occurring is statistically modeled. Data from 399 basins located within 15 wildland fires that burned during 2000-2002 in Colorado, Idaho, Montana, and New Mexico were evaluated. More than 35 independent variables describing the burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated. The models were developed as follows: (1) Basins that did and did not produce debris flows were delineated from National Elevation Data using a Geographic Information System (GIS). (2) Data describing the burn severity, geology, land surface gradient, rainfall, and soil properties were determined for each basin. These data were then downloaded to a statistics software package for analysis using logistic regression. (3) Relations between the occurrence/non-occurrence of debris flows and burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated and several preliminary multivariate logistic regression models were constructed. All possible combinations of independent variables were evaluated to determine which combination produced the most effective model. The multivariate model that best predicted the occurrence of debris flows was selected. (4) The multivariate logistic regression model was entered into a GIS, and a map showing the probability of debris flows was constructed. The most effective model incorporates the percentage of each basin with slope greater than 30 percent, percentage of land burned at medium and high burn severity in each basin, particle size sorting, average storm intensity (millimeters per hour), soil organic matter content, soil permeability, and soil drainage. The results of this study demonstrate that logistic regression is a valuable tool for predicting the probability of debris flows occurring in recently-burned landscapes.
Ebrahimzadeh, Farzad; Hajizadeh, Ebrahim; Vahabi, Nasim; Almasian, Mohammad; Bakhteyar, Katayoon
2015-01-01
Background: Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. Methods: In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were selected by the stratified and cluster sampling; relevant variables were measured and for prediction of unwanted pregnancy, logistic regression, discriminant analysis, and probit regression models and SPSS software version 21 were used. To compare these models, indicators such as sensitivity, specificity, the area under the ROC curve, and the percentage of correct predictions were used. Results: The prevalence of unwanted pregnancies was 25.3%. The logistic and probit regression models indicated that parity and pregnancy spacing, contraceptive methods, household income and number of living male children were related to unwanted pregnancy. The performance of the models based on the area under the ROC curve was 0.735, 0.733, and 0.680 for logistic regression, probit regression, and linear discriminant analysis, respectively. Conclusion: Given the relatively high prevalence of unwanted pregnancies in Khorramabad, it seems necessary to revise family planning programs. Despite the similar accuracy of the models, if the researcher is interested in the interpretability of the results, the use of the logistic regression model is recommended. PMID:26793655
Ebrahimzadeh, Farzad; Hajizadeh, Ebrahim; Vahabi, Nasim; Almasian, Mohammad; Bakhteyar, Katayoon
2015-01-01
Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were selected by the stratified and cluster sampling; relevant variables were measured and for prediction of unwanted pregnancy, logistic regression, discriminant analysis, and probit regression models and SPSS software version 21 were used. To compare these models, indicators such as sensitivity, specificity, the area under the ROC curve, and the percentage of correct predictions were used. The prevalence of unwanted pregnancies was 25.3%. The logistic and probit regression models indicated that parity and pregnancy spacing, contraceptive methods, household income and number of living male children were related to unwanted pregnancy. The performance of the models based on the area under the ROC curve was 0.735, 0.733, and 0.680 for logistic regression, probit regression, and linear discriminant analysis, respectively. Given the relatively high prevalence of unwanted pregnancies in Khorramabad, it seems necessary to revise family planning programs. Despite the similar accuracy of the models, if the researcher is interested in the interpretability of the results, the use of the logistic regression model is recommended.
Kempe, P T; van Oppen, P; de Haan, E; Twisk, J W R; Sluis, A; Smit, J H; van Dyck, R; van Balkom, A J L M
2007-09-01
Two methods for predicting remissions in obsessive-compulsive disorder (OCD) treatment are evaluated. Y-BOCS measurements of 88 patients with a primary OCD (DSM-III-R) diagnosis were performed over a 16-week treatment period, and during three follow-ups. Remission at any measurement was defined as a Y-BOCS score lower than thirteen combined with a reduction of seven points when compared with baseline. Logistic regression models were compared with a Cox regression for recurrent events model. Logistic regression yielded different models at different evaluation times. The recurrent events model remained stable when fewer measurements were used. Higher baseline levels of neuroticism and more severe OCD symptoms were associated with a lower chance of remission, early age of onset and more depressive symptoms with a higher chance. Choice of outcome time affects logistic regression prediction models. Recurrent events analysis uses all information on remissions and relapses. Short- and long-term predictors for OCD remission show overlap.
Impact of low vision on employment.
Mojon-Azzi, Stefania M; Sousa-Poza, Alfonso; Mojon, Daniel S
2010-01-01
We investigated the influence of self-reported corrected eyesight on several variables describing the perception by employees and self-employed persons of their employment. Our study was based on data from the Survey of Health, Ageing and Retirement in Europe (SHARE). SHARE is a multidisciplinary, cross-national database of microdata on health, socioeconomic status, social and family networks, collected on 31,115 individuals in 11 European countries and in Israel. With the help of ordered logistic regressions and binary logistic regressions, we analyzed the influence of perceived visual impairment--corrected by 19 covariates capturing socioeconomic and health-related factors--on 10 variables describing the respondents' employment situation. Based on data covering 10,340 working individuals, the results of the logistic and ordered regressions indicate that respondents with lower levels of self-reported general eyesight were significantly less satisfied with their jobs, felt they had less freedom to decide, less opportunity to develop new skills, less support in difficult situations, less recognition for their work, and an inadequate salary. Respondents with a lower eyesight level more frequently reported that they feared their health might limit their ability to work before regular retirement age and more often indicated that they were seeking early retirement. Analysis of this dataset from 12 countries demonstrates the strong impact of self-reported visual impairment on individual employment, and therefore on job satisfaction, productivity, and well-being. Copyright © 2010 S. Karger AG, Basel.
Estimating the exceedance probability of rain rate by logistic regression
NASA Technical Reports Server (NTRS)
Chiu, Long S.; Kedem, Benjamin
1990-01-01
Recent studies have shown that the fraction of an area with rain intensity above a fixed threshold is highly correlated with the area-averaged rain rate. To estimate the fractional rainy area, a logistic regression model, which estimates the conditional probability that rain rate over an area exceeds a fixed threshold given the values of related covariates, is developed. The problem of dependency in the data in the estimation procedure is bypassed by the method of partial likelihood. Analyses of simulated scanning multichannel microwave radiometer and observed electrically scanning microwave radiometer data during the Global Atlantic Tropical Experiment period show that the use of logistic regression in pixel classification is superior to multiple regression in predicting whether rain rate at each pixel exceeds a given threshold, even in the presence of noisy data. The potential of the logistic regression technique in satellite rain rate estimation is discussed.
NASA Astrophysics Data System (ADS)
Cary, Theodore W.; Cwanger, Alyssa; Venkatesh, Santosh S.; Conant, Emily F.; Sehgal, Chandra M.
2012-03-01
This study compares the performance of two proven but very different machine learners, Naïve Bayes and logistic regression, for differentiating malignant and benign breast masses using ultrasound imaging. Ultrasound images of 266 masses were analyzed quantitatively for shape, echogenicity, margin characteristics, and texture features. These features along with patient age, race, and mammographic BI-RADS category were used to train Naïve Bayes and logistic regression classifiers to diagnose lesions as malignant or benign. ROC analysis was performed using all of the features and using only a subset that maximized information gain. Performance was determined by the area under the ROC curve, Az, obtained from leave-one-out cross validation. Naïve Bayes showed significant variation (Az 0.733 +/- 0.035 to 0.840 +/- 0.029, P < 0.002) with the choice of features, but the performance of logistic regression was relatively unchanged under feature selection (Az 0.839 +/- 0.029 to 0.859 +/- 0.028, P = 0.605). Out of 34 features, a subset of 6 gave the highest information gain: brightness difference, margin sharpness, depth-to-width, mammographic BI-RADs, age, and race. The probabilities of malignancy determined by Naïve Bayes and logistic regression after feature selection showed significant correlation (R2= 0.87, P < 0.0001). The diagnostic performance of Naïve Bayes and logistic regression can be comparable, but logistic regression is more robust. Since probability of malignancy cannot be measured directly, high correlation between the probabilities derived from two basic but dissimilar models increases confidence in the predictive power of machine learning models for characterizing solid breast masses on ultrasound.
Wang, Qingliang; Li, Xiaojie; Hu, Kunpeng; Zhao, Kun; Yang, Peisheng; Liu, Bo
2015-05-12
To explore the risk factors of portal hypertensive gastropathy (PHG) in patients with hepatitis B associated cirrhosis and establish a Logistic regression model of noninvasive prediction. The clinical data of 234 hospitalized patients with hepatitis B associated cirrhosis from March 2012 to March 2014 were analyzed retrospectively. The dependent variable was the occurrence of PHG while the independent variables were screened by binary Logistic analysis. Multivariate Logistic regression was used for further analysis of significant noninvasive independent variables. Logistic regression model was established and odds ratio was calculated for each factor. The accuracy, sensitivity and specificity of model were evaluated by the curve of receiver operating characteristic (ROC). According to univariate Logistic regression, the risk factors included hepatic dysfunction, albumin (ALB), bilirubin (TB), prothrombin time (PT), platelet (PLT), white blood cell (WBC), portal vein diameter, spleen index, splenic vein diameter, diameter ratio, PLT to spleen volume ratio, esophageal varices (EV) and gastric varices (GV). Multivariate analysis showed that hepatic dysfunction (X1), TB (X2), PLT (X3) and splenic vein diameter (X4) were the major occurring factors for PHG. The established regression model was Logit P=-2.667+2.186X1-2.167X2+0.725X3+0.976X4. The accuracy of model for PHG was 79.1% with a sensitivity of 77.2% and a specificity of 80.8%. Hepatic dysfunction, TB, PLT and splenic vein diameter are risk factors for PHG and the noninvasive predicted Logistic regression model was Logit P=-2.667+2.186X1-2.167X2+0.725X3+0.976X4.
Variable Selection in Logistic Regression.
1987-06-01
23 %. AUTIOR(.) S. CONTRACT OR GRANT NUMBE Rf.i %Z. D. Bai, P. R. Krishnaiah and . C. Zhao F49620-85- C-0008 " PERFORMING ORGANIZATION NAME AND AOORESS...d I7 IOK-TK- d 7 -I0 7’ VARIABLE SELECTION IN LOGISTIC REGRESSION Z. D. Bai, P. R. Krishnaiah and L. C. Zhao Center for Multivariate Analysis...University of Pittsburgh Center for Multivariate Analysis University of Pittsburgh Y !I VARIABLE SELECTION IN LOGISTIC REGRESSION Z- 0. Bai, P. R. Krishnaiah
NASA Astrophysics Data System (ADS)
Madhu, B.; Ashok, N. C.; Balasubramanian, S.
2014-11-01
Multinomial logistic regression analysis was used to develop statistical model that can predict the probability of breast cancer in Southern Karnataka using the breast cancer occurrence data during 2007-2011. Independent socio-economic variables describing the breast cancer occurrence like age, education, occupation, parity, type of family, health insurance coverage, residential locality and socioeconomic status of each case was obtained. The models were developed as follows: i) Spatial visualization of the Urban- rural distribution of breast cancer cases that were obtained from the Bharat Hospital and Institute of Oncology. ii) Socio-economic risk factors describing the breast cancer occurrences were complied for each case. These data were then analysed using multinomial logistic regression analysis in a SPSS statistical software and relations between the occurrence of breast cancer across the socio-economic status and the influence of other socio-economic variables were evaluated and multinomial logistic regression models were constructed. iii) the model that best predicted the occurrence of breast cancer were identified. This multivariate logistic regression model has been entered into a geographic information system and maps showing the predicted probability of breast cancer occurrence in Southern Karnataka was created. This study demonstrates that Multinomial logistic regression is a valuable tool for developing models that predict the probability of breast cancer Occurrence in Southern Karnataka.
Parsaeian, M; Mohammad, K; Mahmoudi, M; Zeraati, H
2012-01-01
Background: The purpose of this investigation was to compare empirically predictive ability of an artificial neural network with a logistic regression in prediction of low back pain. Methods: Data from the second national health survey were considered in this investigation. This data includes the information of low back pain and its associated risk factors among Iranian people aged 15 years and older. Artificial neural network and logistic regression models were developed using a set of 17294 data and they were validated in a test set of 17295 data. Hosmer and Lemeshow recommendation for model selection was used in fitting the logistic regression. A three-layer perceptron with 9 inputs, 3 hidden and 1 output neurons was employed. The efficiency of two models was compared by receiver operating characteristic analysis, root mean square and -2 Loglikelihood criteria. Results: The area under the ROC curve (SE), root mean square and -2Loglikelihood of the logistic regression was 0.752 (0.004), 0.3832 and 14769.2, respectively. The area under the ROC curve (SE), root mean square and -2Loglikelihood of the artificial neural network was 0.754 (0.004), 0.3770 and 14757.6, respectively. Conclusions: Based on these three criteria, artificial neural network would give better performance than logistic regression. Although, the difference is statistically significant, it does not seem to be clinically significant. PMID:23113198
Parsaeian, M; Mohammad, K; Mahmoudi, M; Zeraati, H
2012-01-01
The purpose of this investigation was to compare empirically predictive ability of an artificial neural network with a logistic regression in prediction of low back pain. Data from the second national health survey were considered in this investigation. This data includes the information of low back pain and its associated risk factors among Iranian people aged 15 years and older. Artificial neural network and logistic regression models were developed using a set of 17294 data and they were validated in a test set of 17295 data. Hosmer and Lemeshow recommendation for model selection was used in fitting the logistic regression. A three-layer perceptron with 9 inputs, 3 hidden and 1 output neurons was employed. The efficiency of two models was compared by receiver operating characteristic analysis, root mean square and -2 Loglikelihood criteria. The area under the ROC curve (SE), root mean square and -2Loglikelihood of the logistic regression was 0.752 (0.004), 0.3832 and 14769.2, respectively. The area under the ROC curve (SE), root mean square and -2Loglikelihood of the artificial neural network was 0.754 (0.004), 0.3770 and 14757.6, respectively. Based on these three criteria, artificial neural network would give better performance than logistic regression. Although, the difference is statistically significant, it does not seem to be clinically significant.
NASA Astrophysics Data System (ADS)
Kamaruddin, Ainur Amira; Ali, Zalila; Noor, Norlida Mohd.; Baharum, Adam; Ahmad, Wan Muhamad Amir W.
2014-07-01
Logistic regression analysis examines the influence of various factors on a dichotomous outcome by estimating the probability of the event's occurrence. Logistic regression, also called a logit model, is a statistical procedure used to model dichotomous outcomes. In the logit model the log odds of the dichotomous outcome is modeled as a linear combination of the predictor variables. The log odds ratio in logistic regression provides a description of the probabilistic relationship of the variables and the outcome. In conducting logistic regression, selection procedures are used in selecting important predictor variables, diagnostics are used to check that assumptions are valid which include independence of errors, linearity in the logit for continuous variables, absence of multicollinearity, and lack of strongly influential outliers and a test statistic is calculated to determine the aptness of the model. This study used the binary logistic regression model to investigate overweight and obesity among rural secondary school students on the basis of their demographics profile, medical history, diet and lifestyle. The results indicate that overweight and obesity of students are influenced by obesity in family and the interaction between a student's ethnicity and routine meals intake. The odds of a student being overweight and obese are higher for a student having a family history of obesity and for a non-Malay student who frequently takes routine meals as compared to a Malay student.
Understanding logistic regression analysis.
Sperandei, Sandro
2014-01-01
Logistic regression is used to obtain odds ratio in the presence of more than one explanatory variable. The procedure is quite similar to multiple linear regression, with the exception that the response variable is binomial. The result is the impact of each variable on the odds ratio of the observed event of interest. The main advantage is to avoid confounding effects by analyzing the association of all variables together. In this article, we explain the logistic regression procedure using examples to make it as simple as possible. After definition of the technique, the basic interpretation of the results is highlighted and then some special issues are discussed.
Life stressors, coping strategies, and social supports in patients with irritable bowel syndrome
Roohafza, Hamidreza; Keshteli, Ammar Hassanzadeh; Daghaghzadeh, Hamed; Afshar, Hamid; Erfani, Zahra; Adibi, Peyman
2016-01-01
Background: The frequency and the perceived intensity of life stressors, coping strategies, and social supports are very important in everybody's well-being. This study intended to estimate the relation of irritable bowel syndrome (IBS) and these factors. Materials and Methods: This was a cross-sectional study carried out in Isfahan on 2013. Data were extracted from the framework of the study on the epidemiology of psychological, alimentary health, and nutrition. Symptoms of IBS were evaluated by Talley bowel disease questionnaire. Stressful life event, modified COPE scale, and Multidimensional Scale of Perceived Social Support were also used. About 4763 subjects were completed questionnaires. Analyzing data were done by t-test and multivariate logistic regression. Results: Of all returned questionnaire, 1024 (21.5%) were diagnosed with IBS. IBS and clinically-significant IBS (IBS-S) groups have significantly experienced a higher level of perceived intensity of stressors and had a higher frequency of stressors. The mean score of social supports and the mean scores of three coping strategies (problem engagement, support seeking, and positive reinterpretation and growth) were significantly lower in subjects with either IBS-S or IBS than in those with no IBS. Multivariate logistic regression revealed a significant association between frequency of stressors and perceived intensity of stressors with IBS (odds ratio [OR] =1.09 and OR = 1.02, respectively) or IBS-S (OR = 1.09 and OR = 1.03, respectively). Conclusions: People with IBS had higher numbers of stressors, higher perception of the intensity of stressors, less adaptive coping strategies, and less social supports which should be focused in psychosocial interventions. PMID:27761433
Life stressors, coping strategies, and social supports in patients with irritable bowel syndrome.
Roohafza, Hamidreza; Keshteli, Ammar Hassanzadeh; Daghaghzadeh, Hamed; Afshar, Hamid; Erfani, Zahra; Adibi, Peyman
2016-01-01
The frequency and the perceived intensity of life stressors, coping strategies, and social supports are very important in everybody's well-being. This study intended to estimate the relation of irritable bowel syndrome (IBS) and these factors. This was a cross-sectional study carried out in Isfahan on 2013. Data were extracted from the framework of the study on the epidemiology of psychological, alimentary health, and nutrition. Symptoms of IBS were evaluated by Talley bowel disease questionnaire. Stressful life event, modified COPE scale, and Multidimensional Scale of Perceived Social Support were also used. About 4763 subjects were completed questionnaires. Analyzing data were done by t -test and multivariate logistic regression. Of all returned questionnaire, 1024 (21.5%) were diagnosed with IBS. IBS and clinically-significant IBS (IBS-S) groups have significantly experienced a higher level of perceived intensity of stressors and had a higher frequency of stressors. The mean score of social supports and the mean scores of three coping strategies (problem engagement, support seeking, and positive reinterpretation and growth) were significantly lower in subjects with either IBS-S or IBS than in those with no IBS. Multivariate logistic regression revealed a significant association between frequency of stressors and perceived intensity of stressors with IBS (odds ratio [OR] =1.09 and OR = 1.02, respectively) or IBS-S (OR = 1.09 and OR = 1.03, respectively). People with IBS had higher numbers of stressors, higher perception of the intensity of stressors, less adaptive coping strategies, and less social supports which should be focused in psychosocial interventions.
Chen, Sung-Wei; Wang, Po-Chuan; Hsin, Ping-Lung; Oates, Anthony; Sun, I-Wen; Liu, Shen-Ing
2011-01-01
Microelectronic engineers are considered valuable human capital contributing significantly toward economic development, but they may encounter stressful work conditions in the context of a globalized industry. The study aims at identifying risk factors of depressive disorders primarily based on job stress models, the Demand-Control-Support and Effort-Reward Imbalance models, and at evaluating whether depressive disorders impair work performance in microelectronics engineers in Taiwan. The case-control study was conducted among 678 microelectronics engineers, 452 controls and 226 cases with depressive disorders which were defined by a score 17 or more on the Beck Depression Inventory and a psychiatrist's diagnosis. The self-administered questionnaires included the Job Content Questionnaire, Effort-Reward Imbalance Questionnaire, demography, psychosocial factors, health behaviors and work performance. Hierarchical logistic regression was applied to identify risk factors of depressive disorders. Multivariate linear regressions were used to determine factors affecting work performance. By hierarchical logistic regression, risk factors of depressive disorders are high demands, low work social support, high effort/reward ratio and low frequency of physical exercise. Combining the two job stress models may have better predictive power for depressive disorders than adopting either model alone. Three multivariate linear regressions provide similar results indicating that depressive disorders are associated with impaired work performance in terms of absence, role limitation and social functioning limitation. The results may provide insight into the applicability of job stress models in a globalized high-tech industry considerably focused in non-Western countries, and the design of workplace preventive strategies for depressive disorders in Asian electronics engineering population.
Chen, Wei-Qing; Wong, Tze Wai; Yu, Ignatius Tak-Sun
2008-01-01
To explore the relationship of occupational stress and social support with health-related behaviors of smoking, alcohol usage and physical inactivity, a cross-sectional survey was conducted among 561 offshore oil installation workers of a Chinese state-owned oil company. They were investigated with a self-administered questionnaire about socio-demographic characteristics, occupational stress, social support and health-related behaviors. Logistic regression analysis was used to study the association between occupational stress, social support and health-related behaviors and adjusted for age, educational level, marital status, duration of offshore work and job title. Of 561 workers, 218 (38.9%) were current smokers, 124 (22.1%) current drinkers, and 354 (63.1%) physically inactive in their leisure time. Further multivariate logistic regression analysis indicated that: (1) Current smoking was significantly negatively related with perceived stress from "Safety" (OR=0.74; 95% CI=0.58-0.94) and lack of supervisors' instrumental support (OR=0.34; 95% CI=0.18-0.65); (2) Current drinking was significantly positively related to perceived stress from "Interface between job and family/social life" (OR=1.32; 95% CI=1.02-1.70) and "Organizational structure" (OR=1.35; 95% CI=1.06-1.74), but was significantly negatively related to poor emotional support from friends (OR=0.54; 95% CI=0.62-0.96); (3) Physical inactivity after work was significantly positively associated with perceived stress from "Safety" (OR=1.44; 95% CI=1.16-1.79) and lack of instrumental support from both supervisors (OR=1.74; 95% CI=1.16-2.65) and friends (OR=1.68; 95% CI=1.06-2.42). The findings suggest that psychosocial factors of occupational stress and social support at offshore oil work might affect workers' health-related behaviors in different ways.
Pressman, Andrew; Sawyer, Kelly N; Devlin, William; Swor, Robert
2018-05-01
The role of circulatory support in the post-cardiac arrest period remains controversial. Our objective was to investigate the association between treatment with a percutaneous hemodynamic support device and outcome after admission for cardiac arrest. We performed a retrospective study of adult patients with admission diagnosis of cardiac arrest or ventricular fibrillation (VF) from the Michigan Inpatient Database, treated between July 1, 2010, and June 30, 2013. Patient demographics, clinical characteristics, treatments, and disposition were electronically abstracted based on ICD-9 codes at the hospital level. Mixed-effects logistic regression models were fit to test the effect of percutaneous hemodynamic support device defined as either percutaneous left ventricular assist device (pLVAD) or intra-aortic balloon pump (IABP) on survival. These models controlled for age, sex, VF, myocardial infarction (MI), and cardiogenic shock with hospital modeled as a random effect. A total of 103 hospitals contributed 4393 patients for analysis, predominately male (58.8%) with a mean age of 64.1years (SD 15.5). On univariate analysis, younger age, male sex, VF as the initial rhythm, acute MI, percutaneous coronary intervention, percutaneous hemodynamic support device, and absence of cardiogenic shock were associated with survival to discharge (each p<0.001). Mixed-effects logistic regressions revealed use of percutaneous hemodynamic support device was significantly associated with survival among all patients (OR 1.8 (1.28-2.54)), and especially in those with acute MI (OR 1.95 (1.31-2.93)) or cardiogenic shock (OR 1.96 (1.29-2.98)). Treatment with percutaneous hemodynamic support device in the post-arrest period may provide left ventricular support and improve outcome. Copyright © 2017 Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Koon, Sharon; Petscher, Yaacov
2015-01-01
The purpose of this report was to explicate the use of logistic regression and classification and regression tree (CART) analysis in the development of early warning systems. It was motivated by state education leaders' interest in maintaining high classification accuracy while simultaneously improving practitioner understanding of the rules by…
1993-09-01
compared to the male counterparts, the study does not discriminate between the two sexes . Out of the total of about 17000 records, about 30% of them are...few naval officers and pilots. Almost all the officers are in the Army. Hence, for the support vocations and sevice groups effects the study does not
Reduction of Racial Disparities in Prostate Cancer
2008-12-01
inhibitors, aspirin, anti-TNF medications), and other medications of interest (testosterone, finasteride , alpha receptor blockers). 12 We...0.01. There were 14 (7%) control-patients who had finasteride use, with an average of 398.6 doses per individual. None of the prostate cancer...patients had prior finasteride use. In a multiple logistic regression model (Table 2, see supporting materials), after adjustment for the matching
2017-03-23
PUBLIC RELEASE; DISTRIBUTION UNLIMITED Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and... Cost and Probability of Cost and Schedule Overrun for Program Managers Ryan C. Trudelle Follow this and additional works at: https://scholar.afit.edu...afit.edu. Recommended Citation Trudelle, Ryan C., "Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and
2013-11-01
Ptrend 0.78 0.62 0.75 Unconditional logistic regression was used to estimate odds ratios (OR) and 95 % confidence intervals (CI) for risk of node...Ptrend 0.71 0.67 Unconditional logistic regression was used to estimate odds ratios (OR) and 95 % confidence intervals (CI) for risk of high-grade tumors... logistic regression was used to estimate odds ratios (OR) and 95 % confidence intervals (CI) for the associations between each of the seven SNPs and
Kim, Sun Mi; Kim, Yongdai; Jeong, Kuhwan; Jeong, Heeyeong; Kim, Jiyoung
2018-01-01
The aim of this study was to compare the performance of image analysis for predicting breast cancer using two distinct regression models and to evaluate the usefulness of incorporating clinical and demographic data (CDD) into the image analysis in order to improve the diagnosis of breast cancer. This study included 139 solid masses from 139 patients who underwent a ultrasonography-guided core biopsy and had available CDD between June 2009 and April 2010. Three breast radiologists retrospectively reviewed 139 breast masses and described each lesion using the Breast Imaging Reporting and Data System (BI-RADS) lexicon. We applied and compared two regression methods-stepwise logistic (SL) regression and logistic least absolute shrinkage and selection operator (LASSO) regression-in which the BI-RADS descriptors and CDD were used as covariates. We investigated the performances of these regression methods and the agreement of radiologists in terms of test misclassification error and the area under the curve (AUC) of the tests. Logistic LASSO regression was superior (P<0.05) to SL regression, regardless of whether CDD was included in the covariates, in terms of test misclassification errors (0.234 vs. 0.253, without CDD; 0.196 vs. 0.258, with CDD) and AUC (0.785 vs. 0.759, without CDD; 0.873 vs. 0.735, with CDD). However, it was inferior (P<0.05) to the agreement of three radiologists in terms of test misclassification errors (0.234 vs. 0.168, without CDD; 0.196 vs. 0.088, with CDD) and the AUC without CDD (0.785 vs. 0.844, P<0.001), but was comparable to the AUC with CDD (0.873 vs. 0.880, P=0.141). Logistic LASSO regression based on BI-RADS descriptors and CDD showed better performance than SL in predicting the presence of breast cancer. The use of CDD as a supplement to the BI-RADS descriptors significantly improved the prediction of breast cancer using logistic LASSO regression.
Yu, Yuanyuan; Li, Hongkai; Sun, Xiaoru; Su, Ping; Wang, Tingting; Liu, Yi; Yuan, Zhongshang; Liu, Yanxun; Xue, Fuzhong
2017-12-28
Confounders can produce spurious associations between exposure and outcome in observational studies. For majority of epidemiologists, adjusting for confounders using logistic regression model is their habitual method, though it has some problems in accuracy and precision. It is, therefore, important to highlight the problems of logistic regression and search the alternative method. Four causal diagram models were defined to summarize confounding equivalence. Both theoretical proofs and simulation studies were performed to verify whether conditioning on different confounding equivalence sets had the same bias-reducing potential and then to select the optimum adjusting strategy, in which logistic regression model and inverse probability weighting based marginal structural model (IPW-based-MSM) were compared. The "do-calculus" was used to calculate the true causal effect of exposure on outcome, then the bias and standard error were used to evaluate the performances of different strategies. Adjusting for different sets of confounding equivalence, as judged by identical Markov boundaries, produced different bias-reducing potential in the logistic regression model. For the sets satisfied G-admissibility, adjusting for the set including all the confounders reduced the equivalent bias to the one containing the parent nodes of the outcome, while the bias after adjusting for the parent nodes of exposure was not equivalent to them. In addition, all causal effect estimations through logistic regression were biased, although the estimation after adjusting for the parent nodes of exposure was nearest to the true causal effect. However, conditioning on different confounding equivalence sets had the same bias-reducing potential under IPW-based-MSM. Compared with logistic regression, the IPW-based-MSM could obtain unbiased causal effect estimation when the adjusted confounders satisfied G-admissibility and the optimal strategy was to adjust for the parent nodes of outcome, which obtained the highest precision. All adjustment strategies through logistic regression were biased for causal effect estimation, while IPW-based-MSM could always obtain unbiased estimation when the adjusted set satisfied G-admissibility. Thus, IPW-based-MSM was recommended to adjust for confounders set.
Use and interpretation of logistic regression in habitat-selection studies
Keating, Kim A.; Cherry, Steve
2004-01-01
Logistic regression is an important tool for wildlife habitat-selection studies, but the method frequently has been misapplied due to an inadequate understanding of the logistic model, its interpretation, and the influence of sampling design. To promote better use of this method, we review its application and interpretation under 3 sampling designs: random, case-control, and use-availability. Logistic regression is appropriate for habitat use-nonuse studies employing random sampling and can be used to directly model the conditional probability of use in such cases. Logistic regression also is appropriate for studies employing case-control sampling designs, but careful attention is required to interpret results correctly. Unless bias can be estimated or probability of use is small for all habitats, results of case-control studies should be interpreted as odds ratios, rather than probability of use or relative probability of use. When data are gathered under a use-availability design, logistic regression can be used to estimate approximate odds ratios if probability of use is small, at least on average. More generally, however, logistic regression is inappropriate for modeling habitat selection in use-availability studies. In particular, using logistic regression to fit the exponential model of Manly et al. (2002:100) does not guarantee maximum-likelihood estimates, valid probabilities, or valid likelihoods. We show that the resource selection function (RSF) commonly used for the exponential model is proportional to a logistic discriminant function. Thus, it may be used to rank habitats with respect to probability of use and to identify important habitat characteristics or their surrogates, but it is not guaranteed to be proportional to probability of use. Other problems associated with the exponential model also are discussed. We describe an alternative model based on Lancaster and Imbens (1996) that offers a method for estimating conditional probability of use in use-availability studies. Although promising, this model fails to converge to a unique solution in some important situations. Further work is needed to obtain a robust method that is broadly applicable to use-availability studies.
Modeling Governance KB with CATPCA to Overcome Multicollinearity in the Logistic Regression
NASA Astrophysics Data System (ADS)
Khikmah, L.; Wijayanto, H.; Syafitri, U. D.
2017-04-01
The problem often encounters in logistic regression modeling are multicollinearity problems. Data that have multicollinearity between explanatory variables with the result in the estimation of parameters to be bias. Besides, the multicollinearity will result in error in the classification. In general, to overcome multicollinearity in regression used stepwise regression. They are also another method to overcome multicollinearity which involves all variable for prediction. That is Principal Component Analysis (PCA). However, classical PCA in only for numeric data. Its data are categorical, one method to solve the problems is Categorical Principal Component Analysis (CATPCA). Data were used in this research were a part of data Demographic and Population Survey Indonesia (IDHS) 2012. This research focuses on the characteristic of women of using the contraceptive methods. Classification results evaluated using Area Under Curve (AUC) values. The higher the AUC value, the better. Based on AUC values, the classification of the contraceptive method using stepwise method (58.66%) is better than the logistic regression model (57.39%) and CATPCA (57.39%). Evaluation of the results of logistic regression using sensitivity, shows the opposite where CATPCA method (99.79%) is better than logistic regression method (92.43%) and stepwise (92.05%). Therefore in this study focuses on major class classification (using a contraceptive method), then the selected model is CATPCA because it can raise the level of the major class model accuracy.
Logistic regression models of factors influencing the location of bioenergy and biofuels plants
T.M. Young; R.L. Zaretzki; J.H. Perdue; F.M. Guess; X. Liu
2011-01-01
Logistic regression models were developed to identify significant factors that influence the location of existing wood-using bioenergy/biofuels plants and traditional wood-using facilities. Logistic models provided quantitative insight for variables influencing the location of woody biomass-using facilities. Availability of "thinnings to a basal area of 31.7m2/ha...
Factors Associated With Peer Victimization Among Adolescents in Taiwan.
Huang, Hui-Wen; Chen, Jyu-Lin; Wang, Ruey-Hsia
2018-02-01
Adolescents who have experienced peer victimization face a higher risk of negative health outcomes. However, little is known about the factors that are associated with peer victimization among adolescents in Taiwan. The aim of this study was to examine the factors related to peer victimization among Taiwanese adolescents. A cross-sectional design was employed. Three hundred seventy-seven adolescents aged 13-16 years from seven middle schools in southern Taiwan were recruited as participants. Validated, self-reported questionnaires were used to gather data on demographic characteristics, resilience, peer relationship, parental monitoring, school connectedness, social support, and peer victimization. Logistic regression analysis was used to examine the factors that were related to peer victimization. About 17% (n = 64) of the participants experienced peer victimization during the previous 1-year period. Logistic regression analysis indicated that parental monitoring of daily life, school connectedness, and peer support were significant predictors of a reduced risk of peer victimization. The final model explained 23.1% of the total variance in less peer victimization and predicted 80.1% of peer victimization. School connectedness and peer support were identified as important factors facilitating the avoidance of peer victimization among adolescents in Taiwan. Healthcare providers and school personnel should consider school-based programs to improve school connectedness and to build an atmosphere of peer support to reduce peer victimization. Educating parents to monitor their adolescents' daily activities is also encouraged in concert with these school-based programs.
Discrete post-processing of total cloud cover ensemble forecasts
NASA Astrophysics Data System (ADS)
Hemri, Stephan; Haiden, Thomas; Pappenberger, Florian
2017-04-01
This contribution presents an approach to post-process ensemble forecasts for the discrete and bounded weather variable of total cloud cover. Two methods for discrete statistical post-processing of ensemble predictions are tested. The first approach is based on multinomial logistic regression, the second involves a proportional odds logistic regression model. Applying them to total cloud cover raw ensemble forecasts from the European Centre for Medium-Range Weather Forecasts improves forecast skill significantly. Based on station-wise post-processing of raw ensemble total cloud cover forecasts for a global set of 3330 stations over the period from 2007 to early 2014, the more parsimonious proportional odds logistic regression model proved to slightly outperform the multinomial logistic regression model. Reference Hemri, S., Haiden, T., & Pappenberger, F. (2016). Discrete post-processing of total cloud cover ensemble forecasts. Monthly Weather Review 144, 2565-2577.
Fuzzy multinomial logistic regression analysis: A multi-objective programming approach
NASA Astrophysics Data System (ADS)
Abdalla, Hesham A.; El-Sayed, Amany A.; Hamed, Ramadan
2017-05-01
Parameter estimation for multinomial logistic regression is usually based on maximizing the likelihood function. For large well-balanced datasets, Maximum Likelihood (ML) estimation is a satisfactory approach. Unfortunately, ML can fail completely or at least produce poor results in terms of estimated probabilities and confidence intervals of parameters, specially for small datasets. In this study, a new approach based on fuzzy concepts is proposed to estimate parameters of the multinomial logistic regression. The study assumes that the parameters of multinomial logistic regression are fuzzy. Based on the extension principle stated by Zadeh and Bárdossy's proposition, a multi-objective programming approach is suggested to estimate these fuzzy parameters. A simulation study is used to evaluate the performance of the new approach versus Maximum likelihood (ML) approach. Results show that the new proposed model outperforms ML in cases of small datasets.
Häberle, Lothar; Hack, Carolin C; Heusinger, Katharina; Wagner, Florian; Jud, Sebastian M; Uder, Michael; Beckmann, Matthias W; Schulz-Wendtland, Rüdiger; Wittenberg, Thomas; Fasching, Peter A
2017-08-30
Tumors in radiologically dense breast were overlooked on mammograms more often than tumors in low-density breasts. A fast reproducible and automated method of assessing percentage mammographic density (PMD) would be desirable to support decisions whether ultrasonography should be provided for women in addition to mammography in diagnostic mammography units. PMD assessment has still not been included in clinical routine work, as there are issues of interobserver variability and the procedure is quite time consuming. This study investigated whether fully automatically generated texture features of mammograms can replace time-consuming semi-automatic PMD assessment to predict a patient's risk of having an invasive breast tumor that is visible on ultrasound but masked on mammography (mammography failure). This observational study included 1334 women with invasive breast cancer treated at a hospital-based diagnostic mammography unit. Ultrasound was available for the entire cohort as part of routine diagnosis. Computer-based threshold PMD assessments ("observed PMD") were carried out and 363 texture features were obtained from each mammogram. Several variable selection and regression techniques (univariate selection, lasso, boosting, random forest) were applied to predict PMD from the texture features. The predicted PMD values were each used as new predictor for masking in logistic regression models together with clinical predictors. These four logistic regression models with predicted PMD were compared among themselves and with a logistic regression model with observed PMD. The most accurate masking prediction was determined by cross-validation. About 120 of the 363 texture features were selected for predicting PMD. Density predictions with boosting were the best substitute for observed PMD to predict masking. Overall, the corresponding logistic regression model performed better (cross-validated AUC, 0.747) than one without mammographic density (0.734), but less well than the one with the observed PMD (0.753). However, in patients with an assigned mammography failure risk >10%, covering about half of all masked tumors, the boosting-based model performed at least as accurately as the original PMD model. Automatically generated texture features can replace semi-automatically determined PMD in a prediction model for mammography failure, such that more than 50% of masked tumors could be discovered.
A Primer on Logistic Regression.
ERIC Educational Resources Information Center
Woldbeck, Tanya
This paper introduces logistic regression as a viable alternative when the researcher is faced with variables that are not continuous. If one is to use simple regression, the dependent variable must be measured on a continuous scale. In the behavioral sciences, it may not always be appropriate or possible to have a measured dependent variable on a…
A Solution to Separation and Multicollinearity in Multiple Logistic Regression
Shen, Jianzhao; Gao, Sujuan
2010-01-01
In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27–38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth’s penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study. PMID:20376286
A Solution to Separation and Multicollinearity in Multiple Logistic Regression.
Shen, Jianzhao; Gao, Sujuan
2008-10-01
In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27-38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth's penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study.
Wartberg, Lutz; Kriston, Levente; Kammerl, Rudolf
2017-07-01
Internet Gaming Disorder (IGD) has been included in the current edition of the Diagnostic and Statistical Manual of Mental Disorders-Fifth Edition (DSM-5). In the present study, the relationship among social support, friends only known through the Internet, health-related quality of life, and IGD in adolescence was explored for the first time. For this purpose, 1,095 adolescents aged from 12 to 14 years were surveyed with a standardized questionnaire concerning IGD, self-perceived social support, proportion of friends only known through the Internet, and health-related quality of life. The authors conducted unpaired t-tests, a chi-square test, as well as correlation and logistic regression analyses. According to the statistical analyses, adolescents with IGD reported lower self-perceived social support, more friends only known through the Internet, and a lower health-related quality of life compared with the group without IGD. Both in bivariate and multivariate logistic regression models, statistically significant associations between IGD and male gender, a higher proportion of friends only known through the Internet, and a lower health-related quality of life (multivariate model: Nagelkerke's R 2 = 0.37) were revealed. Lower self-perceived social support was related to IGD in the bivariate model only. In summary, quality of life and social aspects seem to be important factors for IGD in adolescence and therefore should be incorporated in further (longitudinal) studies. The findings of the present survey may provide starting points for the development of prevention and intervention programs for adolescents affected by IGD.
Suicidal behavior among homeless people in Japan.
Okamura, Tsuyoshi; Ito, Kae; Morikawa, Suimei; Awata, Shuichi
2014-04-01
The purpose of this study is to investigate the frequency and correlates of suicidal behavior among homeless people in Japan. A face-to-face survey was conducted in two districts of Tokyo, Japan, with 423 subjects who resided on streets and riversides and in urban parks and stations (street homeless) or who were residents of shelters, cheap hotels, or welfare homes for homeless people (sheltered homeless). When questioned about suicidal ideation in the previous 2 weeks, 51 subjects (12.2% of valid responses) had a recurring wish to die, 29 (6.9%) had frequent thoughts of suicide, and 22 (5.3%) had made suicide plans. In addition, 11 (2.9%) subjects had attempted suicide in the previous 2 weeks and 74 (17.7%) reported that they had ever attempted suicide. In univariate logistic regression analyses, street homelessness, lack of perceived emotional social support, poor subjective health perception, visual impairment, pain, insomnia, poor mental well-being, and current depression were significantly associated with recurrent thoughts of suicide in the previous 2 weeks. Among these, current depression had the greatest significance. In multivariate logistic regression analyses after controlling for depression, street homelessness and lack of perceived emotional social support were significantly associated with recurrent thoughts of suicide in the previous 2 weeks. Comprehensive interventions including housing and social support as well as mental health services might be crucial as effective strategies for suicide prevention among homeless people.
Ye, Dong-qing; Hu, Yi-song; Li, Xiang-pei; Huang, Fen; Yang, Shi-gui; Hao, Jia-hu; Yin, Jing; Zhang, Guo-qing; Liu, Hui-hui
2004-11-01
To explore the impact of environmental factors, daily lifestyle, psycho-social factors and the interactions between environmental factors and chemokines genes on systemic lupus erythematosus (SLE). Case-control study was carried out and environmental factors for SLE were analyzed by univariate and multivariate unconditional logistic regression. Interactions between environmental factors and chemokines polymorphism contributing to systemic lupus erythematosus were also analyzed by logistic regression model. There were nineteen factors associated with SLE when univariate unconditional logistic regression was used. However, when multivariate unconditional logistic regression was used, only five factors showed having impacts on the disease, in which drinking well water (OR=0.099) was protective factor for SLE, and multiple drug allergy (OR=8.174), over-exposure to sunshine (OR=18.339), taking antibiotics (OR=9.630) and oral contraceptives were risk factors for SLE. When unconditional logistic regression model was used, results showed that there was interaction between eating irritable food and -2518MCP-1G/G genotype (OR=4.387). No interaction between environmental factors was found that contributing to SLE in this study. Many environmental factors were related to SLE, and there was an interaction between -2518MCP-1G/G genotype and eating irritable food.
Mielniczuk, Jan; Teisseyre, Paweł
2018-03-01
Detection of gene-gene interactions is one of the most important challenges in genome-wide case-control studies. Besides traditional logistic regression analysis, recently the entropy-based methods attracted a significant attention. Among entropy-based methods, interaction information is one of the most promising measures having many desirable properties. Although both logistic regression and interaction information have been used in several genome-wide association studies, the relationship between them has not been thoroughly investigated theoretically. The present paper attempts to fill this gap. We show that although certain connections between the two methods exist, in general they refer two different concepts of dependence and looking for interactions in those two senses leads to different approaches to interaction detection. We introduce ordering between interaction measures and specify conditions for independent and dependent genes under which interaction information is more discriminative measure than logistic regression. Moreover, we show that for so-called perfect distributions those measures are equivalent. The numerical experiments illustrate the theoretical findings indicating that interaction information and its modified version are more universal tools for detecting various types of interaction than logistic regression and linkage disequilibrium measures. © 2017 WILEY PERIODICALS, INC.
A Clinical Decision Support System for Breast Cancer Patients
NASA Astrophysics Data System (ADS)
Fernandes, Ana S.; Alves, Pedro; Jarman, Ian H.; Etchells, Terence A.; Fonseca, José M.; Lisboa, Paulo J. G.
This paper proposes a Web clinical decision support system for clinical oncologists and for breast cancer patients making prognostic assessments, using the particular characteristics of the individual patient. This system comprises three different prognostic modelling methodologies: the clinically widely used Nottingham prognostic index (NPI); the Cox regression modelling and a partial logistic artificial neural network with automatic relevance determination (PLANN-ARD). All three models yield a different prognostic index that can be analysed together in order to obtain a more accurate prognostic assessment of the patient. Missing data is incorporated in the mentioned models, a common issue in medical data that was overcome using multiple imputation techniques. Risk group assignments are also provided through a methodology based on regression trees, where Boolean rules can be obtained expressed with patient characteristics.
ERIC Educational Resources Information Center
Shih, Ching-Lin; Liu, Tien-Hsiang; Wang, Wen-Chung
2014-01-01
The simultaneous item bias test (SIBTEST) method regression procedure and the differential item functioning (DIF)-free-then-DIF strategy are applied to the logistic regression (LR) method simultaneously in this study. These procedures are used to adjust the effects of matching true score on observed score and to better control the Type I error…
ERIC Educational Resources Information Center
Feist, Amber M.
2013-01-01
Hispanic women who are deaf constitute a heterogeneous group of individuals with varying vocational needs. To understand the unique needs of this population, it is important to analyze how consumer characteristics, presence of public supports, and type of services provided influence employment outcomes for Hispanic women who are deaf. The purpose…
Access disparities to Magnet hospitals for patients undergoing neurosurgical operations
Missios, Symeon; Bekelis, Kimon
2017-01-01
Background Centers of excellence focusing on quality improvement have demonstrated superior outcomes for a variety of surgical interventions. We investigated the presence of access disparities to hospitals recognized by the Magnet Recognition Program of the American Nurses Credentialing Center (ANCC) for patients undergoing neurosurgical operations. Methods We performed a cohort study of all neurosurgery patients who were registered in the New York Statewide Planning and Research Cooperative System (SPARCS) database from 2009–2013. We examined the association of African-American race and lack of insurance with Magnet status hospitalization for neurosurgical procedures. A mixed effects propensity adjusted multivariable regression analysis was used to control for confounding. Results During the study period, 190,535 neurosurgical patients met the inclusion criteria. Using a multivariable logistic regression, we demonstrate that African-Americans had lower admission rates to Magnet institutions (OR 0.62; 95% CI, 0.58–0.67). This persisted in a mixed effects logistic regression model (OR 0.77; 95% CI, 0.70–0.83) to adjust for clustering at the patient county level, and a propensity score adjusted logistic regression model (OR 0.75; 95% CI, 0.69–0.82). Additionally, lack of insurance was associated with lower admission rates to Magnet institutions (OR 0.71; 95% CI, 0.68–0.73), in a multivariable logistic regression model. This persisted in a mixed effects logistic regression model (OR 0.72; 95% CI, 0.69–0.74), and a propensity score adjusted logistic regression model (OR 0.72; 95% CI, 0.69–0.75). Conclusions Using a comprehensive all-payer cohort of neurosurgery patients in New York State we identified an association of African-American race and lack of insurance with lower rates of admission to Magnet hospitals. PMID:28684152
Adjusting for Confounding in Early Postlaunch Settings: Going Beyond Logistic Regression Models.
Schmidt, Amand F; Klungel, Olaf H; Groenwold, Rolf H H
2016-01-01
Postlaunch data on medical treatments can be analyzed to explore adverse events or relative effectiveness in real-life settings. These analyses are often complicated by the number of potential confounders and the possibility of model misspecification. We conducted a simulation study to compare the performance of logistic regression, propensity score, disease risk score, and stabilized inverse probability weighting methods to adjust for confounding. Model misspecification was induced in the independent derivation dataset. We evaluated performance using relative bias confidence interval coverage of the true effect, among other metrics. At low events per coefficient (1.0 and 0.5), the logistic regression estimates had a large relative bias (greater than -100%). Bias of the disease risk score estimates was at most 13.48% and 18.83%. For the propensity score model, this was 8.74% and >100%, respectively. At events per coefficient of 1.0 and 0.5, inverse probability weighting frequently failed or reduced to a crude regression, resulting in biases of -8.49% and 24.55%. Coverage of logistic regression estimates became less than the nominal level at events per coefficient ≤5. For the disease risk score, inverse probability weighting, and propensity score, coverage became less than nominal at events per coefficient ≤2.5, ≤1.0, and ≤1.0, respectively. Bias of misspecified disease risk score models was 16.55%. In settings with low events/exposed subjects per coefficient, disease risk score methods can be useful alternatives to logistic regression models, especially when propensity score models cannot be used. Despite better performance of disease risk score methods than logistic regression and propensity score models in small events per coefficient settings, bias, and coverage still deviated from nominal.
Burnout does not help predict depression among French school teachers.
Bianchi, Renzo; Schonfeld, Irvin Sam; Laurent, Eric
2015-11-01
Burnout has been viewed as a phase in the development of depression. However, supportive research is scarce. We examined whether burnout predicted depression among French school teachers. We conducted a 2-wave, 21-month study involving 627 teachers (73% female) working in French primary and secondary schools. Burnout was assessed with the Maslach Burnout Inventory and depression with the 9-item depression module of the Patient Health Questionnaire (PHQ-9). The PHQ-9 grades depressive symptom severity and provides a provisional diagnosis of major depression. Depression was treated both as a continuous and categorical variable using linear and logistic regression analyses. We controlled for gender, age, and length of employment. Controlling for baseline depressive symptoms, linear regression analysis showed that burnout symptoms at time 1 (T1) did not predict depressive symptoms at time 2 (T2). Baseline depressive symptoms accounted for about 88% of the association between T1 burnout and T2 depressive symptoms. Only baseline depressive symptoms predicted depressive symptoms at follow-up. Similarly, logistic regression analysis revealed that burnout symptoms at T1 did not predict incident cases of major depression at T2 when depressive symptoms at T1 were included in the predictive model. Only baseline depressive symptoms predicted cases of major depression at follow-up. This study does not support the view that burnout is a phase in the development of depression. Assessing burnout symptoms in addition to "classical" depressive symptoms may not always improve our ability to predict future depression.
Pfeiffer, R M; Riedl, R
2015-08-15
We assess the asymptotic bias of estimates of exposure effects conditional on covariates when summary scores of confounders, instead of the confounders themselves, are used to analyze observational data. First, we study regression models for cohort data that are adjusted for summary scores. Second, we derive the asymptotic bias for case-control studies when cases and controls are matched on a summary score, and then analyzed either using conditional logistic regression or by unconditional logistic regression adjusted for the summary score. Two scores, the propensity score (PS) and the disease risk score (DRS) are studied in detail. For cohort analysis, when regression models are adjusted for the PS, the estimated conditional treatment effect is unbiased only for linear models, or at the null for non-linear models. Adjustment of cohort data for DRS yields unbiased estimates only for linear regression; all other estimates of exposure effects are biased. Matching cases and controls on DRS and analyzing them using conditional logistic regression yields unbiased estimates of exposure effect, whereas adjusting for the DRS in unconditional logistic regression yields biased estimates, even under the null hypothesis of no association. Matching cases and controls on the PS yield unbiased estimates only under the null for both conditional and unconditional logistic regression, adjusted for the PS. We study the bias for various confounding scenarios and compare our asymptotic results with those from simulations with limited sample sizes. To create realistic correlations among multiple confounders, we also based simulations on a real dataset. Copyright © 2015 John Wiley & Sons, Ltd.
Nie, Z Q; Ou, Y Q; Zhuang, J; Qu, Y J; Mai, J Z; Chen, J M; Liu, X Q
2016-05-01
Conditional logistic regression analysis and unconditional logistic regression analysis are commonly used in case control study, but Cox proportional hazard model is often used in survival data analysis. Most literature only refer to main effect model, however, generalized linear model differs from general linear model, and the interaction was composed of multiplicative interaction and additive interaction. The former is only statistical significant, but the latter has biological significance. In this paper, macros was written by using SAS 9.4 and the contrast ratio, attributable proportion due to interaction and synergy index were calculated while calculating the items of logistic and Cox regression interactions, and the confidence intervals of Wald, delta and profile likelihood were used to evaluate additive interaction for the reference in big data analysis in clinical epidemiology and in analysis of genetic multiplicative and additive interactions.
Rhodes, Darson L; Kirchofer, Gregg; Hammig, Bart J; Ogletree, Roberta J
2013-05-01
This study examined the impact of professional preparation and class structure on sexuality topics taught and use of practice-based instructional strategies in US middle and high school health classes. Data from the classroom-level file of the 2006 School Health Policies and Programs were used. A series of multivariable logistic regression models were employed to determine if sexuality content taught was dependent on professional preparation and /or class structure (HE only versus HE/another subject combined). Additional multivariable logistic regression models were employed to determine if use of practice-based instructional strategies was dependent upon professional preparation and/or class structure. Years of teaching health topics and size of the school district were included as covariates in the multivariable logistic regression models. Findings indicated professionally prepared health educators were significantly more likely to teach 7 of the 13 sexuality topics as compared to nonprofessionally prepared health educators. There was no statistically significant difference in the instructional strategies used by professionally prepared and nonprofessionally prepared health educators. Exclusively health education classes versus combined classes were significantly more likely to have included 6 of the 13 topics and to have incorporated practice-based instructional strategies in the curricula. This study indicated professional preparation and class structure impacted sexuality content taught. Class structure also impacted whether opportunities for students to practice skills were made available. Results support the need for continued advocacy for professionally prepared health educators and health only courses. © 2013, American School Health Association.
Low, Ashley; Dixon, Shannan; Higgs, Amanda; Joines, Jessica; Hippman, Catriona
2018-02-01
Mental illness is extremely common and genetic counselors frequently see patients with mental illness. Genetic counselors report discomfort in providing psychiatric genetic counseling (GC), suggesting the need to look critically at training for psychiatric GC. This study aimed to investigate psychiatric GC training and its impact on perceived preparedness to provide psychiatric GC (preparedness). Current students and recent graduates were invited to complete an anonymous survey evaluating psychiatric GC training and outcomes. Bivariate correlations (p<.10) identified variables for inclusion in a logistic regression model to predict preparedness. Data were checked for assumptions underlying logistic regression. The logistic regression model for the 286 respondents [χ 2 (8)=84.87, p<.001] explained between 37.1% (Cox & Snell R 2 =.371) and 49.7% (Nagelkerke R 2 =.497) of the variance in preparedness scores. More frequent psychiatric GC instruction (OR=5.13), more active methods for practicing risk assessment (OR=4.43), and education on providing resources for mental illness (OR=4.99) made uniquely significant contributions to the model (p<.001). Responses to open-ended questions revealed interest in further psychiatric GC training, particularly enabling "hands on" experience. This exploratory study suggests that enriching GC training through more frequent psychiatric GC instruction and more active opportunities to practice psychiatric GC skills will support students in feeling more prepared to provide psychiatric GC after graduation.
Kruppa, Jochen; Liu, Yufeng; Biau, Gérard; Kohler, Michael; König, Inke R; Malley, James D; Ziegler, Andreas
2014-07-01
Probability estimation for binary and multicategory outcome using logistic and multinomial logistic regression has a long-standing tradition in biostatistics. However, biases may occur if the model is misspecified. In contrast, outcome probabilities for individuals can be estimated consistently with machine learning approaches, including k-nearest neighbors (k-NN), bagged nearest neighbors (b-NN), random forests (RF), and support vector machines (SVM). Because machine learning methods are rarely used by applied biostatisticians, the primary goal of this paper is to explain the concept of probability estimation with these methods and to summarize recent theoretical findings. Probability estimation in k-NN, b-NN, and RF can be embedded into the class of nonparametric regression learning machines; therefore, we start with the construction of nonparametric regression estimates and review results on consistency and rates of convergence. In SVMs, outcome probabilities for individuals are estimated consistently by repeatedly solving classification problems. For SVMs we review classification problem and then dichotomous probability estimation. Next we extend the algorithms for estimating probabilities using k-NN, b-NN, and RF to multicategory outcomes and discuss approaches for the multicategory probability estimation problem using SVM. In simulation studies for dichotomous and multicategory dependent variables we demonstrate the general validity of the machine learning methods and compare it with logistic regression. However, each method fails in at least one simulation scenario. We conclude with a discussion of the failures and give recommendations for selecting and tuning the methods. Applications to real data and example code are provided in a companion article (doi:10.1002/bimj.201300077). © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
A New Approach for Mobile Advertising Click-Through Rate Estimation Based on Deep Belief Nets.
Chen, Jie-Hao; Zhao, Zi-Qian; Shi, Ji-Yun; Zhao, Chong
2017-01-01
In recent years, with the rapid development of mobile Internet and its business applications, mobile advertising Click-Through Rate (CTR) estimation has become a hot research direction in the field of computational advertising, which is used to achieve accurate advertisement delivery for the best benefits in the three-side game between media, advertisers, and audiences. Current research on the estimation of CTR mainly uses the methods and models of machine learning, such as linear model or recommendation algorithms. However, most of these methods are insufficient to extract the data features and cannot reflect the nonlinear relationship between different features. In order to solve these problems, we propose a new model based on Deep Belief Nets to predict the CTR of mobile advertising, which combines together the powerful data representation and feature extraction capability of Deep Belief Nets, with the advantage of simplicity of traditional Logistic Regression models. Based on the training dataset with the information of over 40 million mobile advertisements during a period of 10 days, our experiments show that our new model has better estimation accuracy than the classic Logistic Regression (LR) model by 5.57% and Support Vector Regression (SVR) model by 5.80%.
A New Approach for Mobile Advertising Click-Through Rate Estimation Based on Deep Belief Nets
Zhao, Zi-Qian; Shi, Ji-Yun; Zhao, Chong
2017-01-01
In recent years, with the rapid development of mobile Internet and its business applications, mobile advertising Click-Through Rate (CTR) estimation has become a hot research direction in the field of computational advertising, which is used to achieve accurate advertisement delivery for the best benefits in the three-side game between media, advertisers, and audiences. Current research on the estimation of CTR mainly uses the methods and models of machine learning, such as linear model or recommendation algorithms. However, most of these methods are insufficient to extract the data features and cannot reflect the nonlinear relationship between different features. In order to solve these problems, we propose a new model based on Deep Belief Nets to predict the CTR of mobile advertising, which combines together the powerful data representation and feature extraction capability of Deep Belief Nets, with the advantage of simplicity of traditional Logistic Regression models. Based on the training dataset with the information of over 40 million mobile advertisements during a period of 10 days, our experiments show that our new model has better estimation accuracy than the classic Logistic Regression (LR) model by 5.57% and Support Vector Regression (SVR) model by 5.80%. PMID:29209363
Li, Yi; Tseng, Yufeng J.; Pan, Dahua; Liu, Jianzhong; Kern, Petra S.; Gerberick, G. Frank; Hopfinger, Anton J.
2008-01-01
Currently, the only validated methods to identify skin sensitization effects are in vivo models, such as the Local Lymph Node Assay (LLNA) and guinea pig studies. There is a tremendous need, in particular due to novel legislation, to develop animal alternatives, eg. Quantitative Structure-Activity Relationship (QSAR) models. Here, QSAR models for skin sensitization using LLNA data have been constructed. The descriptors used to generate these models are derived from the 4D-molecular similarity paradigm and are referred to as universal 4D-fingerprints. A training set of 132 structurally diverse compounds and a test set of 15 structurally diverse compounds were used in this study. The statistical methodologies used to build the models are logistic regression (LR), and partial least square coupled logistic regression (PLS-LR), which prove to be effective tools for studying skin sensitization measures expressed in the two categorical terms of sensitizer and non-sensitizer. QSAR models with low values of the Hosmer-Lemeshow goodness-of-fit statistic, χHL2, are significant and predictive. For the training set, the cross-validated prediction accuracy of the logistic regression models ranges from 77.3% to 78.0%, while that of PLS-logistic regression models ranges from 87.1% to 89.4%. For the test set, the prediction accuracy of logistic regression models ranges from 80.0%-86.7%, while that of PLS-logistic regression models ranges from 73.3%-80.0%. The QSAR models are made up of 4D-fingerprints related to aromatic atoms, hydrogen bond acceptors and negatively partially charged atoms. PMID:17226934
Are math readiness and personality predictive of first-year retention in engineering?
Moses, Laurie; Hall, Cathy; Wuensch, Karl; De Urquidi, Karen; Kauffmann, Paul; Swart, William; Duncan, Steve; Dixon, Gene
2011-01-01
On the basis of J. G. Borkowski, L. K. Chan, and N. Muthukrishna's model of academic success (2000), the present authors hypothesized that freshman retention in an engineering program would be related to not only basic aptitude but also affective factors. Participants were 129 college freshmen with engineering as their stated major. Aptitude was measured by SAT verbal and math scores, high school grade-point average (GPA), and an assessment of calculus readiness. Affective factors were assessed by the NEO-Five Factor Inventory (FFI; P. I. Costa & R. R. McCrae, 2007), and the Nowicki-Duke Locus of Control (LOC) scale (S. Nowicki & M. Duke, 1974). A binary logistic regression analysis found that calculus readiness and high school GPA were predictive of retention. Scores on the Neuroticism and Openness subscales from the NEO-FFI and LOC were correlated with retention status, but Openness was the only affective factor with a significant unique effect in the binary logistic regression. Results of the study lend modest support to Borkowski's model.
HIV testing among MSM in Bogotá, Colombia: The role of structural and individual characteristics
Reisen, Carol A.; Zea, Maria Cecilia; Bianchi, Fernanda T.; Poppen, Paul J.; del Río González, Ana Maria; Romero, Rodrigo A. Aguayo; Pérez, Carolin
2014-01-01
This study used mixed methods to examine characteristics related to HIV testing among men who have sex with men (MSM) in Bogotá, Colombia. A sample of 890 MSM responded to a computerized quantitative survey. Follow-up qualitative data included 20 in-depth interviews with MSM and 12 key informant interviews. Hierarchical logistic set regression indicated that sequential sets of variables reflecting demographic characteristics, insurance coverage, risk appraisal, and social context each added to the explanation of HIV testing. Follow-up logistic regression showed that individuals who were older, had higher income, paid for their own insurance, had had a sexually transmitted infection, knew more people living with HIV, and had greater social support were more likely to have been tested for HIV at least once. Qualitative findings provided details of personal and structural barriers to testing, as well as interrelationships among these factors. Recommendations to increase HIV testing among Colombian MSM are offered. PMID:25068180
Price, James
2015-01-01
Propoxyphene was withdrawn from the US market in November 2010. This drug is still tested for in the workplace as part of expanded panel nonregulated testing. A convenience sample of urine specimens (n = 7838) were provided by workers from various industries. The percentage of positive specimens with 95% confidence intervals was calculated for each year of the study. Logistic regression was used to assess the impact of the year upon the propoxyphene result. The prevalence of positive propoxyphene tests was much higher before the product's withdrawal from the market. Logistic regression provided evidence of a decreasing linear trend (P < 0.000; β = -0.71). The odds ratio signifies that for every additional year the urine specimens were 0.49 times less likely to be positive for propoxyphene. This favors the determination that the change in propoxyphene positive drug test over the years is not by chance. The conclusion supports no longer performing nonregulated workplace propoxyphene urine drug testing for this population.
MODELING SNAKE MICROHABITAT FROM RADIOTELEMETRY STUDIES USING POLYTOMOUS LOGISTIC REGRESSION
Multivariate analysis of snake microhabitat has historically used techniques that were derived under assumptions of normality and common covariance structure (e.g., discriminant function analysis, MANOVA). In this study, polytomous logistic regression (PLR which does not require ...
Roadside sobriety tests and attitudes toward a regulated cannabis market
Looby, Alison; Earleywine, Mitch; Gieringer, Dale
2007-01-01
Background Many argue that prohibition creates more troubles than alternative policies, but fewer than half of American voters support a taxed and regulated market for cannabis. Some oppose a regulated market because of concerns about driving after smoking cannabis. Although a roadside sobriety test for impairment exists, few voters know about it. The widespread use of a roadside sobriety test that could detect recent cannabis use might lead some voters who currently oppose a regulated market to support it. In contrast, a question that primes respondents about the potential for driving after cannabis use might lead respondents to be less likely to support a regulated market. Methods Phone interviews with a national sample of 1002 registered voters asked about support for a regulated cannabis market and support for such a market if a reliable roadside sobriety test were widely available. Results In this sample of registered voters, 36% supported a regulated cannabis market. Exploratory chi-square tests revealed significantly higher support among men and Caucasians but no link to age or education. These demographic variables covaried significantly. Logistic regression revealed that gender, ethnicity, and political party were significant when all predictors were included. Support increased significantly with a reliable roadside sobriety test to 44%, but some respondents who had agreed to the regulated market no longer agreed when the sobriety test was mentioned. Logistic regression revealed that ethnicity and political affiliation were again significant predictors of support with a reliable sobriety test, but gender was no longer significant. None of these demographic variables could identify who would change their votes in response to the reliable roadside test. Conclusion Increased awareness and use of roadside sobriety tests that detect recent cannabis use could increase support for a regulated cannabis market. Identifying concerns of voters who are not Caucasian or Democrats could help alter cannabis policy. PMID:17266759
Brenn, T; Arnesen, E
1985-01-01
For comparative evaluation, discriminant analysis, logistic regression and Cox's model were used to select risk factors for total and coronary deaths among 6595 men aged 20-49 followed for 9 years. Groups with mortality between 5 and 93 per 1000 were considered. Discriminant analysis selected variable sets only marginally different from the logistic and Cox methods which always selected the same sets. A time-saving option, offered for both the logistic and Cox selection, showed no advantage compared with discriminant analysis. Analysing more than 3800 subjects, the logistic and Cox methods consumed, respectively, 80 and 10 times more computer time than discriminant analysis. When including the same set of variables in non-stepwise analyses, all methods estimated coefficients that in most cases were almost identical. In conclusion, discriminant analysis is advocated for preliminary or stepwise analysis, otherwise Cox's method should be used.
ERIC Educational Resources Information Center
DeMars, Christine E.
2009-01-01
The Mantel-Haenszel (MH) and logistic regression (LR) differential item functioning (DIF) procedures have inflated Type I error rates when there are large mean group differences, short tests, and large sample sizes.When there are large group differences in mean score, groups matched on the observed number-correct score differ on true score,…
Satellite rainfall retrieval by logistic regression
NASA Technical Reports Server (NTRS)
Chiu, Long S.
1986-01-01
The potential use of logistic regression in rainfall estimation from satellite measurements is investigated. Satellite measurements provide covariate information in terms of radiances from different remote sensors.The logistic regression technique can effectively accommodate many covariates and test their significance in the estimation. The outcome from the logistical model is the probability that the rainrate of a satellite pixel is above a certain threshold. By varying the thresholds, a rainrate histogram can be obtained, from which the mean and the variant can be estimated. A logistical model is developed and applied to rainfall data collected during GATE, using as covariates the fractional rain area and a radiance measurement which is deduced from a microwave temperature-rainrate relation. It is demonstrated that the fractional rain area is an important covariate in the model, consistent with the use of the so-called Area Time Integral in estimating total rain volume in other studies. To calibrate the logistical model, simulated rain fields generated by rainfield models with prescribed parameters are needed. A stringent test of the logistical model is its ability to recover the prescribed parameters of simulated rain fields. A rain field simulation model which preserves the fractional rain area and lognormality of rainrates as found in GATE is developed. A stochastic regression model of branching and immigration whose solutions are lognormally distributed in some asymptotic limits has also been developed.
Linkages between gender equity and intimate partner violence among urban Brazilian youth.
Gomez, Anu Manchikanti; Speizer, Ilene S; Moracco, Kathryn E
2011-10-01
Gender inequity is a risk factor for intimate partner violence (IPV), although there is little research on this relationship that focuses on youth or males. Using survey data collected from 240 male and 198 female youth aged 15-24 in Rio de Janeiro, Brazil, we explore the association between individual-level support for gender equity and IPV experiences in the past 6 months and describe responses to and motivations for IPV. Factor analysis was used to construct gender equity scales for males and females. Logistic and multinomial logistic regression models were used to examine the relationship between gender equity and IPV. About half of female youth reported some form of recent IPV, including any victimization (32%), any perpetration (40%), and both victimization and perpetration (22%). A total of 18% of male youth reported recently perpetrating IPV. In logistic regression models, support for gender equity had a protective effect against any female IPV victimization and any male IPV perpetration and was not associated with female IPV perpetration. Female victims reported leaving the abusive partner, but later returning to him as the most frequent response to IPV. Male perpetrators said the most common response of their victims was to retaliate with violence. Jealousy was the most frequently reported motivation of females perpetrating IPV. Gender equity is an important predictor of IPV among youth. Examining the gendered context of IPV will be useful in the development of targeted interventions to promote gender equity and healthy relationships and to help reduce IPV among youth. Copyright © 2011 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.
Practical Session: Logistic Regression
NASA Astrophysics Data System (ADS)
Clausel, M.; Grégoire, G.
2014-12-01
An exercise is proposed to illustrate the logistic regression. One investigates the different risk factors in the apparition of coronary heart disease. It has been proposed in Chapter 5 of the book of D.G. Kleinbaum and M. Klein, "Logistic Regression", Statistics for Biology and Health, Springer Science Business Media, LLC (2010) and also by D. Chessel and A.B. Dufour in Lyon 1 (see Sect. 6 of http://pbil.univ-lyon1.fr/R/pdf/tdr341.pdf). This example is based on data given in the file evans.txt coming from http://www.sph.emory.edu/dkleinb/logreg3.htm#data.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ghazali, Amirul Syafiq Mohd; Ali, Zalila; Noor, Norlida Mohd
Multinomial logistic regression is widely used to model the outcomes of a polytomous response variable, a categorical dependent variable with more than two categories. The model assumes that the conditional mean of the dependent categorical variables is the logistic function of an affine combination of predictor variables. Its procedure gives a number of logistic regression models that make specific comparisons of the response categories. When there are q categories of the response variable, the model consists of q-1 logit equations which are fitted simultaneously. The model is validated by variable selection procedures, tests of regression coefficients, a significant test ofmore » the overall model, goodness-of-fit measures, and validation of predicted probabilities using odds ratio. This study used the multinomial logistic regression model to investigate obesity and overweight among primary school students in a rural area on the basis of their demographic profiles, lifestyles and on the diet and food intake. The results indicated that obesity and overweight of students are related to gender, religion, sleep duration, time spent on electronic games, breakfast intake in a week, with whom meals are taken, protein intake, and also, the interaction between breakfast intake in a week with sleep duration, and the interaction between gender and protein intake.« less
NASA Astrophysics Data System (ADS)
Ghazali, Amirul Syafiq Mohd; Ali, Zalila; Noor, Norlida Mohd; Baharum, Adam
2015-10-01
Multinomial logistic regression is widely used to model the outcomes of a polytomous response variable, a categorical dependent variable with more than two categories. The model assumes that the conditional mean of the dependent categorical variables is the logistic function of an affine combination of predictor variables. Its procedure gives a number of logistic regression models that make specific comparisons of the response categories. When there are q categories of the response variable, the model consists of q-1 logit equations which are fitted simultaneously. The model is validated by variable selection procedures, tests of regression coefficients, a significant test of the overall model, goodness-of-fit measures, and validation of predicted probabilities using odds ratio. This study used the multinomial logistic regression model to investigate obesity and overweight among primary school students in a rural area on the basis of their demographic profiles, lifestyles and on the diet and food intake. The results indicated that obesity and overweight of students are related to gender, religion, sleep duration, time spent on electronic games, breakfast intake in a week, with whom meals are taken, protein intake, and also, the interaction between breakfast intake in a week with sleep duration, and the interaction between gender and protein intake.
The cross-validated AUC for MCP-logistic regression with high-dimensional data.
Jiang, Dingfeng; Huang, Jian; Zhang, Ying
2013-10-01
We propose a cross-validated area under the receiving operator characteristic (ROC) curve (CV-AUC) criterion for tuning parameter selection for penalized methods in sparse, high-dimensional logistic regression models. We use this criterion in combination with the minimax concave penalty (MCP) method for variable selection. The CV-AUC criterion is specifically designed for optimizing the classification performance for binary outcome data. To implement the proposed approach, we derive an efficient coordinate descent algorithm to compute the MCP-logistic regression solution surface. Simulation studies are conducted to evaluate the finite sample performance of the proposed method and its comparison with the existing methods including the Akaike information criterion (AIC), Bayesian information criterion (BIC) or Extended BIC (EBIC). The model selected based on the CV-AUC criterion tends to have a larger predictive AUC and smaller classification error than those with tuning parameters selected using the AIC, BIC or EBIC. We illustrate the application of the MCP-logistic regression with the CV-AUC criterion on three microarray datasets from the studies that attempt to identify genes related to cancers. Our simulation studies and data examples demonstrate that the CV-AUC is an attractive method for tuning parameter selection for penalized methods in high-dimensional logistic regression models.
Lee, Ahyoung Anna; Jang, Yuri
2016-01-01
Based on the job demands-resources (JD-R) model, this study explored the role of physical injury and organizational support in predicting home health workers' turnover intention. In a sample of home health workers in Central Texas (n = 150), about 37% reported turnover intention. The logistic regression model showed that turnover intention was 3.23 times more likely among those who had experienced work-related injury. On the other hand, organizational support was found to reduce the likelihood of turnover intention. Findings suggest that injury and organizational support should be prioritized in prevention and intervention efforts to promote home health workers' safety and retention.
ERIC Educational Resources Information Center
Daly-Smith, Andy J. W.; McKenna, Jim; Radley, Duncan; Long, Jonathan
2011-01-01
Objective: To investigate the value of additional days of active commuting for meeting a criterion of 300+ minutes of moderate-to-vigorous physical activity (MVPA; 60+ mins/day x 5) during the school week. Methods: Based on seven-day diaries supported by teachers, binary logistic regression analyses were used to predict achievement of MVPA…
Detecting Dementia Through Interactive Computer Avatars
Adachi, Hiroyoshi; Ukita, Norimichi; Ikeda, Manabu; Kazui, Hiroaki; Kudo, Takashi; Nakamura, Satoshi
2017-01-01
This paper proposes a new approach to automatically detect dementia. Even though some works have detected dementia from speech and language attributes, most have applied detection using picture descriptions, narratives, and cognitive tasks. In this paper, we propose a new computer avatar with spoken dialog functionalities that produces spoken queries based on the mini-mental state examination, the Wechsler memory scale-revised, and other related neuropsychological questions. We recorded the interactive data of spoken dialogues from 29 participants (14 dementia and 15 healthy controls) and extracted various audiovisual features. We tried to predict dementia using audiovisual features and two machine learning algorithms (support vector machines and logistic regression). Here, we show that the support vector machines outperformed logistic regression, and by using the extracted features they classified the participants into two groups with 0.93 detection performance, as measured by the areas under the receiver operating characteristic curve. We also newly identified some contributing features, e.g., gap before speaking, the variations of fundamental frequency, voice quality, and the ratio of smiling. We concluded that our system has the potential to detect dementia through spoken dialog systems and that the system can assist health care workers. In addition, these findings could help medical personnel detect signs of dementia. PMID:29018636
Vaeth, Michael; Skovlund, Eva
2004-06-15
For a given regression problem it is possible to identify a suitably defined equivalent two-sample problem such that the power or sample size obtained for the two-sample problem also applies to the regression problem. For a standard linear regression model the equivalent two-sample problem is easily identified, but for generalized linear models and for Cox regression models the situation is more complicated. An approximately equivalent two-sample problem may, however, also be identified here. In particular, we show that for logistic regression and Cox regression models the equivalent two-sample problem is obtained by selecting two equally sized samples for which the parameters differ by a value equal to the slope times twice the standard deviation of the independent variable and further requiring that the overall expected number of events is unchanged. In a simulation study we examine the validity of this approach to power calculations in logistic regression and Cox regression models. Several different covariate distributions are considered for selected values of the overall response probability and a range of alternatives. For the Cox regression model we consider both constant and non-constant hazard rates. The results show that in general the approach is remarkably accurate even in relatively small samples. Some discrepancies are, however, found in small samples with few events and a highly skewed covariate distribution. Comparison with results based on alternative methods for logistic regression models with a single continuous covariate indicates that the proposed method is at least as good as its competitors. The method is easy to implement and therefore provides a simple way to extend the range of problems that can be covered by the usual formulas for power and sample size determination. Copyright 2004 John Wiley & Sons, Ltd.
Kesselmeier, Miriam; Lorenzo Bermejo, Justo
2017-11-01
Logistic regression is the most common technique used for genetic case-control association studies. A disadvantage of standard maximum likelihood estimators of the genotype relative risk (GRR) is their strong dependence on outlier subjects, for example, patients diagnosed at unusually young age. Robust methods are available to constrain outlier influence, but they are scarcely used in genetic studies. This article provides a non-intimidating introduction to robust logistic regression, and investigates its benefits and limitations in genetic association studies. We applied the bounded Huber and extended the R package 'robustbase' with the re-descending Hampel functions to down-weight outlier influence. Computer simulations were carried out to assess the type I error rate, mean squared error (MSE) and statistical power according to major characteristics of the genetic study and investigated markers. Simulations were complemented with the analysis of real data. Both standard and robust estimation controlled type I error rates. Standard logistic regression showed the highest power but standard GRR estimates also showed the largest bias and MSE, in particular for associated rare and recessive variants. For illustration, a recessive variant with a true GRR=6.32 and a minor allele frequency=0.05 investigated in a 1000 case/1000 control study by standard logistic regression resulted in power=0.60 and MSE=16.5. The corresponding figures for Huber-based estimation were power=0.51 and MSE=0.53. Overall, Hampel- and Huber-based GRR estimates did not differ much. Robust logistic regression may represent a valuable alternative to standard maximum likelihood estimation when the focus lies on risk prediction rather than identification of susceptibility variants. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Item Response Theory Modeling of the Philadelphia Naming Test.
Fergadiotis, Gerasimos; Kellough, Stacey; Hula, William D
2015-06-01
In this study, we investigated the fit of the Philadelphia Naming Test (PNT; Roach, Schwartz, Martin, Grewal, & Brecher, 1996) to an item-response-theory measurement model, estimated the precision of the resulting scores and item parameters, and provided a theoretical rationale for the interpretation of PNT overall scores by relating explanatory variables to item difficulty. This article describes the statistical model underlying the computer adaptive PNT presented in a companion article (Hula, Kellough, & Fergadiotis, 2015). Using archival data, we evaluated the fit of the PNT to 1- and 2-parameter logistic models and examined the precision of the resulting parameter estimates. We regressed the item difficulty estimates on three predictor variables: word length, age of acquisition, and contextual diversity. The 2-parameter logistic model demonstrated marginally better fit, but the fit of the 1-parameter logistic model was adequate. Precision was excellent for both person ability and item difficulty estimates. Word length, age of acquisition, and contextual diversity all independently contributed to variance in item difficulty. Item-response-theory methods can be productively used to analyze and quantify anomia severity in aphasia. Regression of item difficulty on lexical variables supported the validity of the PNT and interpretation of anomia severity scores in the context of current word-finding models.
Rooks, Ronica N.; Simonsick, Eleanor M.; Schulz, Richard; Rubin, Susan; Harris, Tamara
2017-01-01
Objective: The aim of this study is to examine social, economic, and health factors related to paid work in well-functioning older adults and if and how these factors vary by race. Method: We used sex-stratified logistic and multinomial logistic regression to examine cross-sectional data in the Health, Aging, and Body Composition cohort study. The sample included 3,075 community-dwelling Black (42%) and White adults aged 70 to 79 at baseline. Results: Multinomial logistic regression analyses show Black men were more likely to work full-time, and Black women were more likely to work part-time. Men with ≥US$50,000 family income were more likely to work full-time. Men with better physical functioning were more likely to work full- and part-time. Women with ≥US$50,000 family income and fewer chronic diseases were more likely to work full-time. Women who were overweight and had fewer chronic diseases were more likely to work part-time. Discussion: Results suggest that well-functioning, older Black adults were more likely to work than their White counterparts, and working relates to better health and higher income, providing support for a productive or successful aging perspective. PMID:28894767
Perceived resource support for chronic illnesses among diabetics in north-western China.
Zhong, Huiqin; Shao, Ya; Fan, Ling; Zhong, Tangshen; Ren, Lu; Wang, Yan
2016-06-01
A high level of social support can improve long-term diabetes self-management. Support from a single source has been evaluated. This study aims to analyze support from multiple and multilevel sources for diabetic patients by using the Chronic Illness Resources Survey (CIRS). Factors influencing the utilization of the CIRS were also evaluated. A total of 297 patients with diabetes were investigated using the CIRS and Perceived Diabetes Self-management Scale in Shihezi City, China. Descriptive statistics were used to explain demographic variables and scores of the scales. Factors affecting the utilization of chronic illness resources were determined through univariate analysis and then examined by multivariate logistic regression analysis. Of the 297 diabetic patients surveyed, 67% failed to reach the standard (more than 3 points) of utilizing chronic illness resources. Moreover, utilization of chronic illness resources was positively moderately correlated with self-management of diabetes (r = 0.75, P < 0.05). According to the multivariate logistic regression analysis, age (OR, 3.42; 95%CI, 1.19-9.84) and monthly income (OR, 5.27; 95%CI, 1.86-14.90) were significantly positively associated with the CIRS score. Individuals with high school (OR, 2.61; 95%CI, 1.13-6.05) and college (OR, 3.02; 95%CI, 1.13-8.04) degrees obtained higher scores in the survey than those with elementary school education. Results indicated that utilization of resources and support for chronic illness self-management, particularly personal adjustment and organization, were not ideal among diabetics in the communities of north-western China. Improved utilization of chronic illness resources was conducive for proper diabetes self-management. Furthermore, the level of utilization of chronic illness resources increased with age, literacy level, and monthly income.
Sampson, Maureen L; Gounden, Verena; van Deventer, Hendrik E; Remaley, Alan T
2016-02-01
The main drawback of the periodic analysis of quality control (QC) material is that test performance is not monitored in time periods between QC analyses, potentially leading to the reporting of faulty test results. The objective of this study was to develop a patient based QC procedure for the more timely detection of test errors. Results from a Chem-14 panel measured on the Beckman LX20 analyzer were used to develop the model. Each test result was predicted from the other 13 members of the panel by multiple regression, which resulted in correlation coefficients between the predicted and measured result of >0.7 for 8 of the 14 tests. A logistic regression model, which utilized the measured test result, the predicted test result, the day of the week and time of day, was then developed for predicting test errors. The output of the logistic regression was tallied by a daily CUSUM approach and used to predict test errors, with a fixed specificity of 90%. The mean average run length (ARL) before error detection by CUSUM-Logistic Regression (CSLR) was 20 with a mean sensitivity of 97%, which was considerably shorter than the mean ARL of 53 (sensitivity 87.5%) for a simple prediction model that only used the measured result for error detection. A CUSUM-Logistic Regression analysis of patient laboratory data can be an effective approach for the rapid and sensitive detection of clinical laboratory errors. Published by Elsevier Inc.
Fatigue design of a cellular phone folder using regression model-based multi-objective optimization
NASA Astrophysics Data System (ADS)
Kim, Young Gyun; Lee, Jongsoo
2016-08-01
In a folding cellular phone, the folding device is repeatedly opened and closed by the user, which eventually results in fatigue damage, particularly to the front of the folder. Hence, it is important to improve the safety and endurance of the folder while also reducing its weight. This article presents an optimal design for the folder front that maximizes its fatigue endurance while minimizing its thickness. Design data for analysis and optimization were obtained experimentally using a test jig. Multi-objective optimization was carried out using a nonlinear regression model. Three regression methods were employed: back-propagation neural networks, logistic regression and support vector machines. The AdaBoost ensemble technique was also used to improve the approximation. Two-objective Pareto-optimal solutions were identified using the non-dominated sorting genetic algorithm (NSGA-II). Finally, a numerically optimized solution was validated against experimental product data, in terms of both fatigue endurance and thickness index.
Nonconvex Sparse Logistic Regression With Weakly Convex Regularization
NASA Astrophysics Data System (ADS)
Shen, Xinyue; Gu, Yuantao
2018-06-01
In this work we propose to fit a sparse logistic regression model by a weakly convex regularized nonconvex optimization problem. The idea is based on the finding that a weakly convex function as an approximation of the $\\ell_0$ pseudo norm is able to better induce sparsity than the commonly used $\\ell_1$ norm. For a class of weakly convex sparsity inducing functions, we prove the nonconvexity of the corresponding sparse logistic regression problem, and study its local optimality conditions and the choice of the regularization parameter to exclude trivial solutions. Despite the nonconvexity, a method based on proximal gradient descent is used to solve the general weakly convex sparse logistic regression, and its convergence behavior is studied theoretically. Then the general framework is applied to a specific weakly convex function, and a necessary and sufficient local optimality condition is provided. The solution method is instantiated in this case as an iterative firm-shrinkage algorithm, and its effectiveness is demonstrated in numerical experiments by both randomly generated and real datasets.
A comparative study on entrepreneurial attitudes modeled with logistic regression and Bayes nets.
López Puga, Jorge; García García, Juan
2012-11-01
Entrepreneurship research is receiving increasing attention in our context, as entrepreneurs are key social agents involved in economic development. We compare the success of the dichotomic logistic regression model and the Bayes simple classifier to predict entrepreneurship, after manipulating the percentage of missing data and the level of categorization in predictors. A sample of undergraduate university students (N = 1230) completed five scales (motivation, attitude towards business creation, obstacles, deficiencies, and training needs) and we found that each of them predicted different aspects of the tendency to business creation. Additionally, our results show that the receiver operating characteristic (ROC) curve is affected by the rate of missing data in both techniques, but logistic regression seems to be more vulnerable when faced with missing data, whereas Bayes nets underperform slightly when categorization has been manipulated. Our study sheds light on the potential entrepreneur profile and we propose to use Bayesian networks as an additional alternative to overcome the weaknesses of logistic regression when missing data are present in applied research.
Campos-Filho, N; Franco, E L
1989-02-01
A frequent procedure in matched case-control studies is to report results from the multivariate unmatched analyses if they do not differ substantially from the ones obtained after conditioning on the matching variables. Although conceptually simple, this rule requires that an extensive series of logistic regression models be evaluated by both the conditional and unconditional maximum likelihood methods. Most computer programs for logistic regression employ only one maximum likelihood method, which requires that the analyses be performed in separate steps. This paper describes a Pascal microcomputer (IBM PC) program that performs multiple logistic regression by both maximum likelihood estimation methods, which obviates the need for switching between programs to obtain relative risk estimates from both matched and unmatched analyses. The program calculates most standard statistics and allows factoring of categorical or continuous variables by two distinct methods of contrast. A built-in, descriptive statistics option allows the user to inspect the distribution of cases and controls across categories of any given variable.
Comparison of cranial sex determination by discriminant analysis and logistic regression.
Amores-Ampuero, Anabel; Alemán, Inmaculada
2016-04-05
Various methods have been proposed for estimating dimorphism. The objective of this study was to compare sex determination results from cranial measurements using discriminant analysis or logistic regression. The study sample comprised 130 individuals (70 males) of known sex, age, and cause of death from San José cemetery in Granada (Spain). Measurements of 19 neurocranial dimensions and 11 splanchnocranial dimensions were subjected to discriminant analysis and logistic regression, and the percentages of correct classification were compared between the sex functions obtained with each method. The discriminant capacity of the selected variables was evaluated with a cross-validation procedure. The percentage accuracy with discriminant analysis was 78.2% for the neurocranium (82.4% in females and 74.6% in males) and 73.7% for the splanchnocranium (79.6% in females and 68.8% in males). These percentages were higher with logistic regression analysis: 85.7% for the neurocranium (in both sexes) and 94.1% for the splanchnocranium (100% in females and 91.7% in males).
Race differences in depression vulnerability following Hurricane Katrina.
Ali, Jeanelle S; Farrell, Amy S; Alexander, Adam C; Forde, David R; Stockton, Michelle; Ward, Kenneth D
2017-05-01
This study investigated whether racial disparities in depression were present after Hurricane Katrina. Data were gathered from 932 New Orleans residents who were present when Hurricane Katrina struck, and who returned to New Orleans the following year. Multiple logistic regression models evaluated racial differences in screening positive for depression (a score ≥16 on the Center for Epidemiologic Studies Depression Scale), and explored whether differential vulnerability (prehurricane physical and mental health functioning and education level), differential exposure to hurricane-related stressors, and loss of social support moderated and/or reduced the association of race with depression. A univariate logistic regression analysis showed the odds for screening positive for depression were 86% higher for African Americans than for Caucasians (odds ratio [OR] = 1.86 [1.28-2.71], p = .0012). However, after controlling simultaneously for sociodemographic characteristics, preexisting vulnerabilities, social support, and trauma-specific factors, race was no longer a significant correlate for screening positive for depression (OR = 1.54 [0.95-2.48], p = .0771). The racial disparity in postdisaster depression seems to be confounded by sociodemographic characteristics, preexisting vulnerabilities, social support, and trauma-specific factors. Nonetheless, even after adjusting for these factors, there was a nonsignificant trend effect for race, which could suggest race played an important role in depression outcomes following Hurricane Katrina. Future studies should examine these associations prospectively, using stronger assessments for depression, and incorporate measures for discrimination and segregation, to further understand possible racial disparities in depression after Hurricane Katrina. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Race Differences in Depression Vulnerability Following Hurricane Katrina
Ali, Jeanelle S.; Farrell, Amy S.; Alexander, Adam C.; Forde, David R.; Stockton, Michelle; Ward, Kenneth D.
2016-01-01
OBJECTIVE This study investigated whether racial disparities in depression were present after Hurricane Katrina. METHOD Data were gathered from 932 New Orleans residents who were present when Hurricane Katrina struck, and who returned to New Orleans the following year. Multiple logistic regression models evaluated racial differences in screening positive for depression (a score ≥16 on the Center for Epidemiologic Studies Depression scale), and explored whether differential vulnerability (pre-hurricane physical and mental health functioning and education level), differential exposure to hurricane-related stressors, and loss of social support moderated and/or reduced the association of race with depression. RESULTS A univariate logistic regression analysis showed the odds for screening positive for depression were 86% higher for African Americans than for Caucasians (OR=1.86 [1.28–2.71], p=.0012). However, after controlling simultaneously for sociodemographic characteristics, preexisting vulnerabilities, social support, and trauma-specific factors, race was no longer a significant correlate for screening positive for depression (OR=1.54 [0.95–2.48], p=.0771). CONCLUSIONS The racial disparity in post disaster depression seems to be confounded by sociodemographic characteristics, preexisting vulnerabilities, social support, and trauma-specific factors. Nonetheless, even after adjusting for these factors, there was a non-significant trend effect for race, which could suggest race played an important role in depression outcomes following Hurricane Katrina. Future studies should examine these associations prospectively, using stronger assessments for depression, and incorporate measures for discrimination and segregation, to further understand possible racial disparities in depression after Hurricane Katrina. PMID:27869461
Lin, Chao-Cheng; Bai, Ya-Mei; Chen, Jen-Yeu; Hwang, Tzung-Jeng; Chen, Tzu-Ting; Chiu, Hung-Wen; Li, Yu-Chuan
2010-03-01
Metabolic syndrome (MetS) is an important side effect of second-generation antipsychotics (SGAs). However, many SGA-treated patients with MetS remain undetected. In this study, we trained and validated artificial neural network (ANN) and multiple logistic regression models without biochemical parameters to rapidly identify MetS in patients with SGA treatment. A total of 383 patients with a diagnosis of schizophrenia or schizoaffective disorder (DSM-IV criteria) with SGA treatment for more than 6 months were investigated to determine whether they met the MetS criteria according to the International Diabetes Federation. The data for these patients were collected between March 2005 and September 2005. The input variables of ANN and logistic regression were limited to demographic and anthropometric data only. All models were trained by randomly selecting two-thirds of the patient data and were internally validated with the remaining one-third of the data. The models were then externally validated with data from 69 patients from another hospital, collected between March 2008 and June 2008. The area under the receiver operating characteristic curve (AUC) was used to measure the performance of all models. Both the final ANN and logistic regression models had high accuracy (88.3% vs 83.6%), sensitivity (93.1% vs 86.2%), and specificity (86.9% vs 83.8%) to identify MetS in the internal validation set. The mean +/- SD AUC was high for both the ANN and logistic regression models (0.934 +/- 0.033 vs 0.922 +/- 0.035, P = .63). During external validation, high AUC was still obtained for both models. Waist circumference and diastolic blood pressure were the common variables that were left in the final ANN and logistic regression models. Our study developed accurate ANN and logistic regression models to detect MetS in patients with SGA treatment. The models are likely to provide a noninvasive tool for large-scale screening of MetS in this group of patients. (c) 2010 Physicians Postgraduate Press, Inc.
Bayesian logistic regression in detection of gene-steroid interaction for cancer at PDLIM5 locus.
Wang, Ke-Sheng; Owusu, Daniel; Pan, Yue; Xie, Changchun
2016-06-01
The PDZ and LIM domain 5 (PDLIM5) gene may play a role in cancer, bipolar disorder, major depression, alcohol dependence and schizophrenia; however, little is known about the interaction effect of steroid and PDLIM5 gene on cancer. This study examined 47 single-nucleotide polymorphisms (SNPs) within the PDLIM5 gene in the Marshfield sample with 716 cancer patients (any diagnosed cancer, excluding minor skin cancer) and 2848 noncancer controls. Multiple logistic regression model in PLINK software was used to examine the association of each SNP with cancer. Bayesian logistic regression in PROC GENMOD in SAS statistical software, ver. 9.4 was used to detect gene- steroid interactions influencing cancer. Single marker analysis using PLINK identified 12 SNPs associated with cancer (P< 0.05); especially, SNP rs6532496 revealed the strongest association with cancer (P = 6.84 × 10⁻³); while the next best signal was rs951613 (P = 7.46 × 10⁻³). Classic logistic regression in PROC GENMOD showed that both rs6532496 and rs951613 revealed strong gene-steroid interaction effects (OR=2.18, 95% CI=1.31-3.63 with P = 2.9 × 10⁻³ for rs6532496 and OR=2.07, 95% CI=1.24-3.45 with P = 5.43 × 10⁻³ for rs951613, respectively). Results from Bayesian logistic regression showed stronger interaction effects (OR=2.26, 95% CI=1.2-3.38 for rs6532496 and OR=2.14, 95% CI=1.14-3.2 for rs951613, respectively). All the 12 SNPs associated with cancer revealed significant gene-steroid interaction effects (P < 0.05); whereas 13 SNPs showed gene-steroid interaction effects without main effect on cancer. SNP rs4634230 revealed the strongest gene-steroid interaction effect (OR=2.49, 95% CI=1.5-4.13 with P = 4.0 × 10⁻⁴ based on the classic logistic regression and OR=2.59, 95% CI=1.4-3.97 from Bayesian logistic regression; respectively). This study provides evidence of common genetic variants within the PDLIM5 gene and interactions between PLDIM5 gene polymorphisms and steroid use influencing cancer.
Deletion Diagnostics for Alternating Logistic Regressions
Preisser, John S.; By, Kunthel; Perin, Jamie; Qaqish, Bahjat F.
2013-01-01
Deletion diagnostics are introduced for the regression analysis of clustered binary outcomes estimated with alternating logistic regressions, an implementation of generalized estimating equations (GEE) that estimates regression coefficients in a marginal mean model and in a model for the intracluster association given by the log odds ratio. The diagnostics are developed within an estimating equations framework that recasts the estimating functions for association parameters based upon conditional residuals into equivalent functions based upon marginal residuals. Extensions of earlier work on GEE diagnostics follow directly, including computational formulae for one-step deletion diagnostics that measure the influence of a cluster of observations on the estimated regression parameters and on the overall marginal mean or association model fit. The diagnostic formulae are evaluated with simulations studies and with an application concerning an assessment of factors associated with health maintenance visits in primary care medical practices. The application and the simulations demonstrate that the proposed cluster-deletion diagnostics for alternating logistic regressions are good approximations of their exact fully iterated counterparts. PMID:22777960
Knol, Mirjam J; van der Tweel, Ingeborg; Grobbee, Diederick E; Numans, Mattijs E; Geerlings, Mirjam I
2007-10-01
To determine the presence of interaction in epidemiologic research, typically a product term is added to the regression model. In linear regression, the regression coefficient of the product term reflects interaction as departure from additivity. However, in logistic regression it refers to interaction as departure from multiplicativity. Rothman has argued that interaction estimated as departure from additivity better reflects biologic interaction. So far, literature on estimating interaction on an additive scale using logistic regression only focused on dichotomous determinants. The objective of the present study was to provide the methods to estimate interaction between continuous determinants and to illustrate these methods with a clinical example. and results From the existing literature we derived the formulas to quantify interaction as departure from additivity between one continuous and one dichotomous determinant and between two continuous determinants using logistic regression. Bootstrapping was used to calculate the corresponding confidence intervals. To illustrate the theory with an empirical example, data from the Utrecht Health Project were used, with age and body mass index as risk factors for elevated diastolic blood pressure. The methods and formulas presented in this article are intended to assist epidemiologists to calculate interaction on an additive scale between two variables on a certain outcome. The proposed methods are included in a spreadsheet which is freely available at: http://www.juliuscenter.nl/additive-interaction.xls.
Park, Sookyung; Kim, Haeryun; Kim, Haesung
2009-01-01
This study examined the roles played by parental alcohol abuse and social support, peer substance abuse risk and social support, and substance abuse risk among adolescents in South Korea. Participants were adolescents between the ages of 15 and 22 years (mean, 18), residing in Seoul city and in surrounding Kyung-gi Province. Of 259 participants, 41.3% scored 2 or more on the POSIT scale, which suggested they met the problematic criteria for substance abuse risk. Logistic regression results suggested that the influence of social support on substance abuse risk among adolescents depended on the source of support--parents or peers. These findings need to be considered in the development of intervention programs for adolescents at risk for substance abuse.
ERIC Educational Resources Information Center
Osborne, Jason W.
2012-01-01
Logistic regression is slowly gaining acceptance in the social sciences, and fills an important niche in the researcher's toolkit: being able to predict important outcomes that are not continuous in nature. While OLS regression is a valuable tool, it cannot routinely be used to predict outcomes that are binary or categorical in nature. These…
Mocellin, Simone; Thompson, John F; Pasquali, Sandro; Montesco, Maria C; Pilati, Pierluigi; Nitti, Donato; Saw, Robyn P; Scolyer, Richard A; Stretch, Jonathan R; Rossi, Carlo R
2009-12-01
To improve selection for sentinel node (SN) biopsy (SNB) in patients with cutaneous melanoma using statistical models predicting SN status. About 80% of patients currently undergoing SNB are node negative. In the absence of conclusive evidence of a SNBassociated survival benefit, these patients may be over-treated. Here, we tested the efficiency of 4 different models in predicting SN status. The clinicopathologic data (age, gender, tumor thickness, Clark level, regression, ulceration, histologic subtype, and mitotic index) of 1132 melanoma patients who had undergone SNB at institutions in Italy and Australia were analyzed. Logistic regression, classification tree, random forest, and support vector machine models were fitted to the data. The predictive models were built with the aim of maximizing the negative predictive value (NPV) and reducing the rate of SNB procedures though minimizing the error rate. After cross-validation logistic regression, classification tree, random forest, and support vector machine predictive models obtained clinically relevant NPV (93.6%, 94.0%, 97.1%, and 93.0%, respectively), SNB reduction (27.5%, 29.8%, 18.2%, and 30.1%, respectively), and error rates (1.8%, 1.8%, 0.5%, and 2.1%, respectively). Using commonly available clinicopathologic variables, predictive models can preoperatively identify a proportion of patients ( approximately 25%) who might be spared SNB, with an acceptable (1%-2%) error. If validated in large prospective series, these models might be implemented in the clinical setting for improved patient selection, which ultimately would lead to better quality of life for patients and optimization of resource allocation for the health care system.
Snyder, Marcia; Freeman, Mary C.; Purucker, S. Thomas; Pringle, Catherine M.
2016-01-01
Freshwater shrimps are an important biotic component of tropical ecosystems. However, they can have a low probability of detection when abundances are low. We sampled 3 of the most common freshwater shrimp species, Macrobrachium olfersii, Macrobrachium carcinus, and Macrobrachium heterochirus, and used occupancy modeling and logistic regression models to improve our limited knowledge of distribution of these cryptic species by investigating both local- and landscape-scale effects at La Selva Biological Station in Costa Rica. Local-scale factors included substrate type and stream size, and landscape-scale factors included presence or absence of regional groundwater inputs. Capture rates for 2 of the sampled species (M. olfersii and M. carcinus) were sufficient to compare the fit of occupancy models. Occupancy models did not converge for M. heterochirus, but M. heterochirus had high enough occupancy rates that logistic regression could be used to model the relationship between occupancy rates and predictors. The best-supported models for M. olfersii and M. carcinus included conductivity, discharge, and substrate parameters. Stream size was positively correlated with occupancy rates of all 3 species. High stream conductivity, which reflects the quantity of regional groundwater input into the stream, was positively correlated with M. olfersii occupancy rates. Boulder substrates increased occupancy rate of M. carcinus and decreased the detection probability of M. olfersii. Our models suggest that shrimp distribution is driven by factors that function at local (substrate and discharge) and landscape (conductivity) scales.
Intermediate and advanced topics in multilevel logistic regression analysis
Merlo, Juan
2017-01-01
Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher‐level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within‐cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population‐average effect of covariates measured at the subject and cluster level, in contrast to the within‐cluster or cluster‐specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster‐level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. PMID:28543517
Intermediate and advanced topics in multilevel logistic regression analysis.
Austin, Peter C; Merlo, Juan
2017-09-10
Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher-level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within-cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population-average effect of covariates measured at the subject and cluster level, in contrast to the within-cluster or cluster-specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster-level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.
Predicting Social Trust with Binary Logistic Regression
ERIC Educational Resources Information Center
Adwere-Boamah, Joseph; Hufstedler, Shirley
2015-01-01
This study used binary logistic regression to predict social trust with five demographic variables from a national sample of adult individuals who participated in The General Social Survey (GSS) in 2012. The five predictor variables were respondents' highest degree earned, race, sex, general happiness and the importance of personally assisting…
Effect of folic acid on appetite in children: ordinal logistic and fuzzy logistic regressions.
Namdari, Mahshid; Abadi, Alireza; Taheri, S Mahmoud; Rezaei, Mansour; Kalantari, Naser; Omidvar, Nasrin
2014-03-01
Reduced appetite and low food intake are often a concern in preschool children, since it can lead to malnutrition, a leading cause of impaired growth and mortality in childhood. It is occasionally considered that folic acid has a positive effect on appetite enhancement and consequently growth in children. The aim of this study was to assess the effect of folic acid on the appetite of preschool children 3 to 6 y old. The study sample included 127 children ages 3 to 6 who were randomly selected from 20 preschools in the city of Tehran in 2011. Since appetite was measured by linguistic terms, a fuzzy logistic regression was applied for modeling. The obtained results were compared with a statistical ordinal logistic model. After controlling for the potential confounders, in a statistical ordinal logistic model, serum folate showed a significantly positive effect on appetite. A small but positive effect of folate was detected by fuzzy logistic regression. Based on fuzzy regression, the risk for poor appetite in preschool children was related to the employment status of their mothers. In this study, a positive association was detected between the levels of serum folate and improved appetite. For further investigation, a randomized controlled, double-blind clinical trial could be helpful to address causality. Copyright © 2014 Elsevier Inc. All rights reserved.
Who cares about health inequalities? Cross-country evidence from the World Health Survey
King, Nicholas B; Harper, Sam; Young, Meredith E
2013-01-01
Reduction of health inequalities within and between countries is a global health priority, but little is known about the determinants of popular support for this goal. We used data from the World Health Survey to assess individual preferences for prioritizing reductions in health and health care inequalities. We used descriptive tables and regression analysis to study the determinants of preferences for reducing health inequalities as the primary health system goal. Determinants included individual socio-demographic characteristics (age, sex, urban residence, education, marital status, household income, self-rated health, health care use, satisfaction with health care system) and country-level characteristics [gross domestic product (GDP) per capita, disability-free life expectancy, equality in child mortality, income inequality, health and public health expenditures]. We used logistic regression to assess the likelihood that individuals ranked minimizing inequalities first, and rank-ordered logistic regression to compare the ranking of other priorities against minimizing health inequalities. Individuals tended to prioritize health system goals related to overall improvement (improving population health and health care responsiveness) over those related to equality and fairness (minimizing inequalities in health and responsiveness, and promoting fairness of financial contribution). Individuals in countries with higher GDP per capita, life expectancy, and equality in child mortality were more likely to prioritize minimizing health inequalities. PMID:23059735
Estimation of Nasal Tip Support Using Computer-Aided Design and 3-Dimensional Printed Models
Gray, Eric; Maducdoc, Marlon; Manuel, Cyrus; Wong, Brian J. F.
2016-01-01
IMPORTANCE Palpation of the nasal tip is an essential component of the preoperative rhinoplasty examination. Measuring tip support is challenging, and the forces that correspond to ideal tip support are unknown. OBJECTIVE To identify the integrated reaction force and the minimum and ideal mechanical properties associated with nasal tip support. DESIGN, SETTING, AND PARTICIPANTS Three-dimensional (3-D) printed anatomic silicone nasal models were created using a computed tomographic scan and computer-aided design software. From this model, 3-D printing and casting methods were used to create 5 anatomically correct nasal models of varying constitutive Young moduli (0.042, 0.086, 0.098, 0.252, and 0.302 MPa) from silicone. Thirty rhinoplasty surgeons who attended a regional rhinoplasty course evaluated the reaction force (nasal tip recoil) of each model by palpation and selected the model that satisfied their requirements for minimum and ideal tip support. Data were collected from May 3 to 4, 2014. RESULTS Of the 30 respondents, 4 surgeons had been in practice for 1 to 5 years; 9 surgeons, 6 to 15 years; 7 surgeons, 16 to 25 years; and 10 surgeons, 26 or more years. Seventeen surgeons considered themselves in the advanced to expert skill competency levels. Logistic regression estimated the minimum threshold for the Young moduli for adequate and ideal tip support to be 0.096 and 0.154 MPa, respectively. Logistic regression estimated the thresholds for the reaction force associated with the absolute minimum and ideal requirements for good tip recoil to be 0.26 to 4.74 N and 0.37 to 7.19 N during 1- to 8-mm displacement, respectively. CONCLUSIONS AND RELEVANCE This study presents a method to estimate clinically relevant nasal tip reaction forces, which serve as a proxy for nasal tip support. This information will become increasingly important in computational modeling of nasal tip mechanics and ultimately will enhance surgical planning for rhinoplasty. LEVEL OF EVIDENCE NA. PMID:27124818
Clustering performance comparison using K-means and expectation maximization algorithms.
Jung, Yong Gyu; Kang, Min Soo; Heo, Jun
2014-11-14
Clustering is an important means of data mining based on separating data categories by similar features. Unlike the classification algorithm, clustering belongs to the unsupervised type of algorithms. Two representatives of the clustering algorithms are the K -means and the expectation maximization (EM) algorithm. Linear regression analysis was extended to the category-type dependent variable, while logistic regression was achieved using a linear combination of independent variables. To predict the possibility of occurrence of an event, a statistical approach is used. However, the classification of all data by means of logistic regression analysis cannot guarantee the accuracy of the results. In this paper, the logistic regression analysis is applied to EM clusters and the K -means clustering method for quality assessment of red wine, and a method is proposed for ensuring the accuracy of the classification results.
Delva, J; Spencer, M S; Lin, J K
2000-01-01
This article compares estimates of the relative odds of nitrite use obtained from weighted unconditional logistic regression with estimates obtained from conditional logistic regression after post-stratification and matching of cases with controls by neighborhood of residence. We illustrate these methods by comparing the odds associated with nitrite use among adults of four racial/ethnic groups, with and without a high school education. We used aggregated data from the 1994-B through 1996 National Household Survey on Drug Abuse (NHSDA). Difference between the methods and implications for analysis and inference are discussed.
Austin, Peter C; Lee, Douglas S; Steyerberg, Ewout W; Tu, Jack V
2012-01-01
In biomedical research, the logistic regression model is the most commonly used method for predicting the probability of a binary outcome. While many clinical researchers have expressed an enthusiasm for regression trees, this method may have limited accuracy for predicting health outcomes. We aimed to evaluate the improvement that is achieved by using ensemble-based methods, including bootstrap aggregation (bagging) of regression trees, random forests, and boosted regression trees. We analyzed 30-day mortality in two large cohorts of patients hospitalized with either acute myocardial infarction (N = 16,230) or congestive heart failure (N = 15,848) in two distinct eras (1999–2001 and 2004–2005). We found that both the in-sample and out-of-sample prediction of ensemble methods offered substantial improvement in predicting cardiovascular mortality compared to conventional regression trees. However, conventional logistic regression models that incorporated restricted cubic smoothing splines had even better performance. We conclude that ensemble methods from the data mining and machine learning literature increase the predictive performance of regression trees, but may not lead to clear advantages over conventional logistic regression models for predicting short-term mortality in population-based samples of subjects with cardiovascular disease. PMID:22777999
Giocos, Georgina; Kagee, Ashraf; Swartz, Leslie
2008-11-01
The present study sought to determine whether the Theory of Planned Behaviour predicted stated hypothetical willingness to participate (WTP) in future Phase III HIV vaccine trials among South African adolescents. Hierarchical logistic regression analyses showed that The Theory of Planned Behaviour (TPB) significantly predicted WTP. Of all the predictors, Subjective norms significantly predicted WTP (OR = 1.19, 95% C.I. = 1.06-1.34). A stepwise logistic regression analysis revealed that Subjective Norms (OR = 1.19, 95% C.I. = 1.07-1.34) and Attitude towards participation in an HIV vaccine trial (OR = 1.32, 95% C.I. = 1.00-1.74) were significant predictors of WTP. The addition of Knowledge of HIV vaccines and HIV vaccine trials, Perceived self-risk of HIV infection, Health-promoting behaviours and Attitudes towards HIV/AIDS yielded non-significant results. These findings provide support for the Theory of Reasoned Action (TRA) and suggest that psychosocial factors may play an important role in WTP in Phase III HIV vaccine trials among adolescents.
Andu, Eaden; Wagenaar, Brad H; Kemp, Chris G; Nevin, Paul E; Simoni, Jane M; Andrasik, Michele; Cohn, Susan E; French, Audrey L; Rao, Deepa
2018-04-26
We sought to examine risk and protective factors for Posttraumatic Stress Disorder (PTSD) among African American women living with HIV. This is a cross-sectional analysis of baseline data from a randomized trial of an HIV stigma reduction intervention. We examined data from two-hundred and thirty-nine African American women living with HIV. We examined whether age, marital status, level of education, internalized HIV-related stigma, and social support as potential protective and risk factors for PTSD symptoms using logistic regression. We analyzed bi-variate associations between each variable and PTSD symptoms, and constructed a multivariate logistic regression model adjusting for all variables. We found 67% reported clinically significant PTSD symptoms at baseline. Our results suggest that age, education, and internalized stigma were found to be associated with PTSD symptoms (p < 0.001), with older age and more education as protective factors and stigma as a risk factor for PTSD. Therefore, understanding this relationship may help improve assessment and treatment through evidence- based and trauma-informed strategies.
Predicting β-Turns in Protein Using Kernel Logistic Regression
Elbashir, Murtada Khalafallah; Sheng, Yu; Wang, Jianxin; Wu, FangXiang; Li, Min
2013-01-01
A β-turn is a secondary protein structure type that plays a significant role in protein configuration and function. On average 25% of amino acids in protein structures are located in β-turns. It is very important to develope an accurate and efficient method for β-turns prediction. Most of the current successful β-turns prediction methods use support vector machines (SVMs) or neural networks (NNs). The kernel logistic regression (KLR) is a powerful classification technique that has been applied successfully in many classification problems. However, it is often not found in β-turns classification, mainly because it is computationally expensive. In this paper, we used KLR to obtain sparse β-turns prediction in short evolution time. Secondary structure information and position-specific scoring matrices (PSSMs) are utilized as input features. We achieved Q total of 80.7% and MCC of 50% on BT426 dataset. These results show that KLR method with the right algorithm can yield performance equivalent to or even better than NNs and SVMs in β-turns prediction. In addition, KLR yields probabilistic outcome and has a well-defined extension to multiclass case. PMID:23509793
Predicting β-turns in protein using kernel logistic regression.
Elbashir, Murtada Khalafallah; Sheng, Yu; Wang, Jianxin; Wu, Fangxiang; Li, Min
2013-01-01
A β-turn is a secondary protein structure type that plays a significant role in protein configuration and function. On average 25% of amino acids in protein structures are located in β-turns. It is very important to develope an accurate and efficient method for β-turns prediction. Most of the current successful β-turns prediction methods use support vector machines (SVMs) or neural networks (NNs). The kernel logistic regression (KLR) is a powerful classification technique that has been applied successfully in many classification problems. However, it is often not found in β-turns classification, mainly because it is computationally expensive. In this paper, we used KLR to obtain sparse β-turns prediction in short evolution time. Secondary structure information and position-specific scoring matrices (PSSMs) are utilized as input features. We achieved Q total of 80.7% and MCC of 50% on BT426 dataset. These results show that KLR method with the right algorithm can yield performance equivalent to or even better than NNs and SVMs in β-turns prediction. In addition, KLR yields probabilistic outcome and has a well-defined extension to multiclass case.
Wu, Ping-An; Li, Yun-Liang; Wu, Han-Jiang; Wang, Kai; Fan, Guo-Zheng
2007-09-01
To investigate the relationship between muscle segment homeobox gene-1 (MSX1) and the genetic susceptibility of nonsyndromic cleft lip and palate (NSCLP) in Hunan Hans. One microsatellite DNA marker CA repeat in MSX1 intron region was used as genetic marker. The genotypes of 387 members in 129 NSCLP nuclear family trios were analyzed by polymerase chain reaction (PCR) and denaturing polyacrylamide gel electrophoresis. Then transmission disequilibrium test (TDT) and Logistic regression analysis were used to conduct association analysis. TDT analysis confirmed that CA4 allele in CL/P and CPO groups preferentially transmitted to the affected offspring (P = 0.018, P = 0.041). Logistic regression analysis indicated that the recessive model of inheritance was supported, and CA4 itself or CA4 acting as a marker for a disease allele or haplotype was inherited in a recessive fashion (P = 0.009). MSX1 gene is associated with NSCLP, and MSX1 gene may be directly involved either in the etiology of NSCLP or in linkage disequilibrium with disease-predisposing sites.
NASA Astrophysics Data System (ADS)
Lawrence, R.; Landenburger, L.; Jewett, J.
2007-12-01
Whitebark pine seeds have long been identified as the most significant vegetative food source for grizzly bears in the Greater Yellowstone Ecosystem (GYE) and, hence, a crucial element of suitable grizzly bear habitat. The overall health and status of whitebark pine in the GYE is currently threatened by mountain pine beetle infestations and the spread of whitepine blister rust. Whitebark pine distribution (presence/absence) was mapped for the GYE using Landsat 7 Enhanced Thematic Mapper (ETM+) imagery and topographic data as part of a long-term inter-agency monitoring program. Logistic regression was compared with classification tree analysis (CTA) with and without boosting. Overall comparative classification accuracies for the central portion of the GYE covering three ETM+ images along a single path ranged from 91.6% using logistic regression to 95.8% with See5's CTA algorithm with the maximum 99 boosts. The analysis is being extended to the entire northern Rocky Mountain Ecosystem and extended over decadal time scales. The analysis is being extended to the entire northern Rocky Mountain Ecosystem and extended over decadal time scales.
Cohen, Leonard A; Bonito, Arthur J; Eicheldinger, Celia; Manski, Richard J; Macek, Mark D; Edwards, Robert R; Khanna, Niharika
2010-01-01
Patient-centered care has a positive impact on patient health status. This report compares patient assessments of patient centeredness during treatment in hospital emergency departments (EDs) and physician and dentist offices for dental problems and injuries. Participants included low-income White, Black, and Hispanic adults who had experienced a dental problem or injury during the previous 12 months and who visited an emergency department, physician, or dentist for treatment. A stratified random sample of Maryland households participated in a cross-sectional telephone survey. Interviews were completed with 94.8% (401/423) of eligible individuals. Multivariable logistic regression analyses were performed. The measure of predictive power, the pseudo-R2s, calculated for the logistic regression models ranged from 12% to 18% for the analyses of responses to the measures of patient centeredness (satisfaction with treatment, careful listening, thorough explaining, spending enough time, and treated with courtesy and respect). EDs were less likely than dentists to treat patients with great courtesy and respect. Further research is needed to identify factors that support patient-centered care.
Hyperhomocysteinemia is a risk factor for Alzheimer's disease in an Algerian population.
Nazef, Khaled; Khelil, Malika; Chelouti, Hiba; Kacimi, Ghouti; Bendini, Mohamed; Tazir, Meriem; Belarbi, Soraya; El Hadi Cherifi, Mohamed; Djerdjouri, Bahia
2014-04-01
There is growing evidence that increased blood concentration of total homocysteine (tHcy) may be a risk factor for Alzheimer's disease (AD). The present study was conducted to evaluate the association of serum tHcy and other biochemical risk factors with AD. This is a case-control study including 41 individuals diagnosed with AD and 46 nondemented controls. Serum levels of all studied biochemical parameters were performed. Univariate logistic regression showed a significant increase of tHcy (p = 0.008), urea (p = 0.036) and a significant decrease of vitamin B12 (p = 0.012) in AD group vs. controls. Using multivariate logistic regression, tHcy (p = 0.007, OR = 1.376) appeared as an independent risk factor predictor of AD. There was a significant positive correlation between tHcy and creatinine (p <0.0001). A negative correlation was found between tHcy and vitamin B12 (p <0.0001). Our findings support that hyperhomocysteinemia is a risk factor for AD in an Algerian population and is also associated with vitamin B12 deficiency. Copyright © 2014 IMSS. Published by Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Fidalgo, Angel M.; Alavi, Seyed Mohammad; Amirian, Seyed Mohammad Reza
2014-01-01
This study examines three controversial aspects in differential item functioning (DIF) detection by logistic regression (LR) models: first, the relative effectiveness of different analytical strategies for detecting DIF; second, the suitability of the Wald statistic for determining the statistical significance of the parameters of interest; and…
ERIC Educational Resources Information Center
French, Brian F.; Maller, Susan J.
2007-01-01
Two unresolved implementation issues with logistic regression (LR) for differential item functioning (DIF) detection include ability purification and effect size use. Purification is suggested to control inaccuracies in DIF detection as a result of DIF items in the ability estimate. Additionally, effect size use may be beneficial in controlling…
A Note on Three Statistical Tests in the Logistic Regression DIF Procedure
ERIC Educational Resources Information Center
Paek, Insu
2012-01-01
Although logistic regression became one of the well-known methods in detecting differential item functioning (DIF), its three statistical tests, the Wald, likelihood ratio (LR), and score tests, which are readily available under the maximum likelihood, do not seem to be consistently distinguished in DIF literature. This paper provides a clarifying…
Comparison of Two Approaches for Handling Missing Covariates in Logistic Regression
ERIC Educational Resources Information Center
Peng, Chao-Ying Joanne; Zhu, Jin
2008-01-01
For the past 25 years, methodological advances have been made in missing data treatment. Most published work has focused on missing data in dependent variables under various conditions. The present study seeks to fill the void by comparing two approaches for handling missing data in categorical covariates in logistic regression: the…
Comparison of IRT Likelihood Ratio Test and Logistic Regression DIF Detection Procedures
ERIC Educational Resources Information Center
Atar, Burcu; Kamata, Akihito
2011-01-01
The Type I error rates and the power of IRT likelihood ratio test and cumulative logit ordinal logistic regression procedures in detecting differential item functioning (DIF) for polytomously scored items were investigated in this Monte Carlo simulation study. For this purpose, 54 simulation conditions (combinations of 3 sample sizes, 2 sample…
Multiple Logistic Regression Analysis of Cigarette Use among High School Students
ERIC Educational Resources Information Center
Adwere-Boamah, Joseph
2011-01-01
A binary logistic regression analysis was performed to predict high school students' cigarette smoking behavior from selected predictors from 2009 CDC Youth Risk Behavior Surveillance Survey. The specific target student behavior of interest was frequent cigarette use. Five predictor variables included in the model were: a) race, b) frequency of…
ERIC Educational Resources Information Center
Anderson, Carolyn J.; Verkuilen, Jay; Peyton, Buddy L.
2010-01-01
Survey items with multiple response categories and multiple-choice test questions are ubiquitous in psychological and educational research. We illustrate the use of log-multiplicative association (LMA) models that are extensions of the well-known multinomial logistic regression model for multiple dependent outcome variables to reanalyze a set of…
Propensity Score Estimation with Data Mining Techniques: Alternatives to Logistic Regression
ERIC Educational Resources Information Center
Keller, Bryan S. B.; Kim, Jee-Seon; Steiner, Peter M.
2013-01-01
Propensity score analysis (PSA) is a methodological technique which may correct for selection bias in a quasi-experiment by modeling the selection process using observed covariates. Because logistic regression is well understood by researchers in a variety of fields and easy to implement in a number of popular software packages, it has…
Two-factor logistic regression in pediatric liver transplantation
NASA Astrophysics Data System (ADS)
Uzunova, Yordanka; Prodanova, Krasimira; Spasov, Lyubomir
2017-12-01
Using a two-factor logistic regression analysis an estimate is derived for the probability of absence of infections in the early postoperative period after pediatric liver transplantation. The influence of both the bilirubin level and the international normalized ratio of prothrombin time of blood coagulation at the 5th postoperative day is studied.
ERIC Educational Resources Information Center
Courtney, Jon R.; Prophet, Retta
2011-01-01
Placement instability is often associated with a number of negative outcomes for children. To gain state level contextual knowledge of factors associated with placement stability/instability, logistic regression was applied to selected variables from the New Mexico Adoption and Foster Care Administrative Reporting System dataset. Predictors…
Length bias correction in gene ontology enrichment analysis using logistic regression.
Mi, Gu; Di, Yanming; Emerson, Sarah; Cumbie, Jason S; Chang, Jeff H
2012-01-01
When assessing differential gene expression from RNA sequencing data, commonly used statistical tests tend to have greater power to detect differential expression of genes encoding longer transcripts. This phenomenon, called "length bias", will influence subsequent analyses such as Gene Ontology enrichment analysis. In the presence of length bias, Gene Ontology categories that include longer genes are more likely to be identified as enriched. These categories, however, are not necessarily biologically more relevant. We show that one can effectively adjust for length bias in Gene Ontology analysis by including transcript length as a covariate in a logistic regression model. The logistic regression model makes the statistical issue underlying length bias more transparent: transcript length becomes a confounding factor when it correlates with both the Gene Ontology membership and the significance of the differential expression test. The inclusion of the transcript length as a covariate allows one to investigate the direct correlation between the Gene Ontology membership and the significance of testing differential expression, conditional on the transcript length. We present both real and simulated data examples to show that the logistic regression approach is simple, effective, and flexible.
Hansson, Lisbeth; Khamis, Harry J
2008-12-01
Simulated data sets are used to evaluate conditional and unconditional maximum likelihood estimation in an individual case-control design with continuous covariates when there are different rates of excluded cases and different levels of other design parameters. The effectiveness of the estimation procedures is measured by method bias, variance of the estimators, root mean square error (RMSE) for logistic regression and the percentage of explained variation. Conditional estimation leads to higher RMSE than unconditional estimation in the presence of missing observations, especially for 1:1 matching. The RMSE is higher for the smaller stratum size, especially for the 1:1 matching. The percentage of explained variation appears to be insensitive to missing data, but is generally higher for the conditional estimation than for the unconditional estimation. It is particularly good for the 1:2 matching design. For minimizing RMSE, a high matching ratio is recommended; in this case, conditional and unconditional logistic regression models yield comparable levels of effectiveness. For maximizing the percentage of explained variation, the 1:2 matching design with the conditional logistic regression model is recommended.
Lee, Seokho; Shin, Hyejin; Lee, Sang Han
2016-12-01
Alzheimer's disease (AD) is usually diagnosed by clinicians through cognitive and functional performance test with a potential risk of misdiagnosis. Since the progression of AD is known to cause structural changes in the corpus callosum (CC), the CC thickness can be used as a functional covariate in AD classification problem for a diagnosis. However, misclassified class labels negatively impact the classification performance. Motivated by AD-CC association studies, we propose a logistic regression for functional data classification that is robust to misdiagnosis or label noise. Specifically, our logistic regression model is constructed by adopting individual intercepts to functional logistic regression model. This approach enables to indicate which observations are possibly mislabeled and also lead to a robust and efficient classifier. An effective algorithm using MM algorithm provides simple closed-form update formulas. We test our method using synthetic datasets to demonstrate its superiority over an existing method, and apply it to differentiating patients with AD from healthy normals based on CC from MRI. © 2016, The International Biometric Society.
Szekér, Szabolcs; Vathy-Fogarassy, Ágnes
2018-01-01
Logistic regression based propensity score matching is a widely used method in case-control studies to select the individuals of the control group. This method creates a suitable control group if all factors affecting the output variable are known. However, if relevant latent variables exist as well, which are not taken into account during the calculations, the quality of the control group is uncertain. In this paper, we present a statistics-based research in which we try to determine the relationship between the accuracy of the logistic regression model and the uncertainty of the dependent variable of the control group defined by propensity score matching. Our analyses show that there is a linear correlation between the fit of the logistic regression model and the uncertainty of the output variable. In certain cases, a latent binary explanatory variable can result in a relative error of up to 70% in the prediction of the outcome variable. The observed phenomenon calls the attention of analysts to an important point, which must be taken into account when deducting conclusions.
Disclosure of HIV Status and Social Support Among People Living With HIV
Jorjoran Shushtari, Zahra; Sajjadi, Homeira; Forouzan, Ameneh Setareh; Salimi, Yahya; Dejman, Masoumeh
2014-01-01
Background: Disclosure of HIV is important for improving self-care behaviors, psychological well-being, commitment to the treatment, and reducing risk of transmission. One of the major benefits of disclosure is social support, which is an essential resource for effective coping with HIV infection. However, receiving any social support requires disclosing of HIV status. Objectives: This study aimed to determine the disclosure of HIV status and its related factors such as social support in addition to demographic and disease characteristics among people living with HIV in Iran. Patients and Methods: This cross-sectional study, using simple random sampling, was carried out on 175 people with HIV/AIDS who referred to Behavioral Counseling Centers. The self-administrated, Norbeck Social Support Questionnaire was used to measure social support. Disclosure of HIV status was assessed with an investigator-designed questions. Multiple logistic regression analysis with backward Likelihood Ratio method was applied to identify the adjusted odds ratio between disclosure as dependent variable and demographic variables, social support as independent variables. Results: Participants were often disclosed their HIV status to family members. But there were differences about disclosure of HIV status within the context of the family. Family members were perceived as more supportive. Multiple logistic regression analysis demonstrates that the gender (adjusted OR = 0.181; 95% CI .068-0.479), CD4 cell count (adjusted OR = 0.997; 95% CI 0.994-0.999), route of transmission (injection-drug user [adjusted OR = 9.366; 95% CI 3.358-26.123] and other routes [tattooing, mother to child, dental services, etc.], [adjusted OR = 3.752; 95% CI 1.157-12.167]), and functional support variable (adjusted OR = 1.007; 95% CI 1.001-1.013) remained in the model as significant predictors for disclosure. Conclusions: The results of this study regarding disclosure of HIV status and its relations to social support and some demographic variables can provide an understanding based on the evidence for promotion of knowledge and coping interventions about people living with HIV/AIDS and their perceived social support status. PMID:25389470
Logistic regression for circular data
NASA Astrophysics Data System (ADS)
Al-Daffaie, Kadhem; Khan, Shahjahan
2017-05-01
This paper considers the relationship between a binary response and a circular predictor. It develops the logistic regression model by employing the linear-circular regression approach. The maximum likelihood method is used to estimate the parameters. The Newton-Raphson numerical method is used to find the estimated values of the parameters. A data set from weather records of Toowoomba city is analysed by the proposed methods. Moreover, a simulation study is considered. The R software is used for all computations and simulations.
Naval Research Logistics Quarterly. Volume 28. Number 3,
1981-09-01
denotes component-wise maximum. f has antone (isotone) differences on C x D if for cl < c2 and d, < d2, NAVAL RESEARCH LOGISTICS QUARTERLY VOL. 28...or negative correlations and linear or nonlinear regressions. Given are the mo- ments to order two and, for special cases, (he regression function and...data sets. We designate this bnb distribution as G - B - N(a, 0, v). The distribution admits only of positive correlation and linear regressions
Gupta, Jhumka; Reed, Elizabeth; Kelly, Jocelyn; Stein, Dan J; Williams, David R
2011-01-01
Background Despite widespread apartheid-related human rights violations (HRV) and intimate partner violence (IPV) in South Africa, research investigating the influence of HRV on IPV perpetration is scarce. Methods This study analysed data from the South Africa Stress and Health Study, a cross-sectional survey conducted from 2003 to 2004 with 4351 South Africans examining public health concerns associated with apartheid. Analyses were restricted to men who had ever been married or had ever cohabited with a female partner. Logistic regression was used to examine associations between experiences of HRV and lifetime physical IPV perpetration. Results A total of 772 South Africa men met the study criteria (389 liberation supporters and 383 government supporters). Adjusted logistic regression analyses indicated that among liberation supporters, a significant association existed between experiencing major HRV (AOR 2.40, 95% CI 1.20 to 4.81), custody-related HRV (AOR 6.61, 95% CI 2.00 to 21.83), victimisation of close friends/family members (AOR 3.38, 95% CI 1.26 to 9.07) and physical IPV perpetration. Among government supporters, a significant association was observed between experiencing HRV (AOR 2.99, 95% CI 1.34 to 6.65) and victimisation of close friends/immediate family (AOR 5.42, 95% CI 1.44 to 19.02) and IPV perpetration. Conclusion This work indicates the importance of men’s experiences with HRV with regard to IPV perpetration risk. Future work is needed to understand the mechanisms underlying the observed relationships, particularly regarding mental health and gender norms as suggested by current literature, in order to inform interventions in South Africa and other regions affected by politically motivated conflict. PMID:21148138
Gupta, Jhumka; Reed, Elizabeth; Kelly, Jocelyn; Stein, Dan J; Williams, David R
2012-06-01
Despite widespread apartheid-related human rights violations (HRV) and intimate partner violence (IPV) in South Africa, research investigating the influence of HRV on IPV perpetration is scarce. This study analysed data from the South Africa Stress and Health Study, a cross-sectional survey conducted from 2003 to 2004 with 4351 South Africans examining public health concerns associated with apartheid. Analyses were restricted to men who had ever been married or had ever cohabited with a female partner. Logistic regression was used to examine associations between experiences of HRV and lifetime physical IPV perpetration. A total of 772 South Africa men met the study criteria (389 liberation supporters and 383 government supporters). Adjusted logistic regression analyses indicated that among liberation supporters, a significant association existed between experiencing major HRV (AOR 2.40, 95% CI 1.20 to 4.81), custody-related HRV (AOR 6.61, 95% CI 2.00 to 21.83), victimisation of close friends/family members (AOR 3.38, 95% CI 1.26 to 9.07) and physical IPV perpetration. Among government supporters, a significant association was observed between experiencing HRV (AOR 2.99, 95% CI 1.34 to 6.65) and victimisation of close friends/immediate family (AOR 5.42, 95% CI 1.44 to 19.02) and IPV perpetration. This work indicates the importance of men's experiences with HRV with regard to IPV perpetration risk. Future work is needed to understand the mechanisms underlying the observed relationships, particularly regarding mental health and gender norms as suggested by current literature, in order to inform interventions in South Africa and other regions affected by politically motivated conflict.
Workplace support for employees with cancer
Nowrouzi, B.; Lightfoot, N.; Cote, K.; Watson, R.
2009-01-01
Objective The aim of the present study was to survey human resources personnel about how their northeastern Ontario workplaces assist employees with cancer. Study Design and Setting This cross-sectional study was conducted from December 2007 to April 2008. Surveys were sent to 255 workplaces in northeastern Ontario with 25 or more employees, and 101 workplaces responded (39.6% response rate). Logistic regression modelling was used to identify factors associated with more or less workplace support. More or less workplace support was defined by provision of paid time to employees with medical appointments and an offer of a return-to-work meeting and reduced hours for employees with cancer. Factors considered in the model included organization size, geographic location (urban, rural), and workplace type (private sector, public sector). Results Most of the human resources staff who completed the surveys were women (67.4%), and respondents ranged in age from 25 to 70 years (mean: 45.30 ± 8.10 years). Respondents reported working for organizations that ranged in size from 25 to more than 9000 employees. In the logistic regression model, large organization size [odds ratio (or): 6.97; 95% confidence interval (ci): 1.34 to 36.2] and public sector (or: 4.98; 95% ci: 1.16 to 21.3) were associated with employer assistance. Public sector employers provided assistance at a rate 5 times that of private sector employers, and large organizations (>50 employees) provided assistance at a rate 7 times that of smaller organizations. Conclusions In the population studied, employees with cancer benefit from working in larger and public sector organizations. The data suggest a need for further support for employees with cancer in some other organizations. PMID:19862358
Hillen, T; Schaub, R; Hiestermann, A; Kirschner, W; Robra, B P
2000-08-01
To compare the health status and factors influencing the health of populations that had previously lived under different political systems. Cross sectional health and social survey using postal interviews. The relation between self reported health and psychosocial factors (stressful life events, social support, education, health promoting life style and health endangering behaviour) was investigated. To determine East-West differences a logistic regression model including interaction terms was fitted. East and West Berlin shortly after reunification 1991. Representative sample of 4430 Berlin residents aged 18 years and over (response rate 63%). Of all respondents, 15.4% rated their health as unsatisfactory. Residents of East Berlin rated their health more frequently as unsatisfactory than residents of West Berlin (Or(age adjusted)= 1.29, 95%CI 1.08, 1.52), these differences occurred predominantly in the over 60 years age group. Logistic regression showed significant independent effects of stressful life events, social support, education, and health promoting life style on self rated health. The effects of education and health promoting life style were observed to be more pronounced in the western part of Berlin. Old age and female sex showed a stronger association with unsatisfactory health status in the eastern part of Berlin. For subjects aged over 60 years there was evidence that living in the former East Berlin had an adverse effect on health compared with West Berlin. The impact of education and a health promoting lifestyle on self rated health seemed to be weaker in a former socialist society compared with that of a Western democracy. This study supports an "additive model" rather than a "buffering model" in explaining the effects of psychosocial factors on health.
Blended learning in situated contexts: 3-year evaluation of an online peer review project.
Bridges, S; Chang, J W W; Chu, C H; Gardner, K
2014-08-01
Situated and sociocultural perspectives on learning indicate that the design of complex tasks supported by educational technologies holds potential for dental education in moving novices towards closer approximation of the clinical outcomes of their expert mentors. A cross-faculty-, student-centred, web-based project in operative dentistry was established within the Universitas 21 (U21) network of higher education institutions to support university goals for internationalisation in clinical learning by enabling distributed interactions across sites and institutions. This paper aims to present evaluation of one dental faculty's project experience of curriculum redesign for deeper student learning. A mixed-method case study approach was utilised. Three cohorts of second-year students from a 5-year bachelor of dental surgery (BDS) programme were invited to participate in annual surveys and focus group interviews on project completion. Survey data were analysed for differences between years using multivariate logistical regression analysis. Thematic analysis of questionnaire open responses and interview transcripts was conducted. Multivariate logistic regression analysis noted significant differences across items over time indicating learning improvements, attainment of university aims and the positive influence of redesign. Students perceived the enquiry-based project as stimulating and motivating, and building confidence in operative techniques. Institutional goals for greater understanding of others and lifelong learning showed improvement over time. Despite positive scores, students indicated global citizenship and intercultural understanding were conceptually challenging. Establishment of online student learning communities through a blended approach to learning stimulated motivation and intellectual engagement, thereby supporting a situated approach to cognition. Sociocultural perspectives indicate that novice-expert interactions supported student development of professional identities. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Bond, H S; Sullivan, S G; Cowling, B J
2016-06-01
Influenza vaccination is the most practical means available for preventing influenza virus infection and is widely used in many countries. Because vaccine components and circulating strains frequently change, it is important to continually monitor vaccine effectiveness (VE). The test-negative design is frequently used to estimate VE. In this design, patients meeting the same clinical case definition are recruited and tested for influenza; those who test positive are the cases and those who test negative form the comparison group. When determining VE in these studies, the typical approach has been to use logistic regression, adjusting for potential confounders. Because vaccine coverage and influenza incidence change throughout the season, time is included among these confounders. While most studies use unconditional logistic regression, adjusting for time, an alternative approach is to use conditional logistic regression, matching on time. Here, we used simulation data to examine the potential for both regression approaches to permit accurate and robust estimates of VE. In situations where vaccine coverage changed during the influenza season, the conditional model and unconditional models adjusting for categorical week and using a spline function for week provided more accurate estimates. We illustrated the two approaches on data from a test-negative study of influenza VE against hospitalization in children in Hong Kong which resulted in the conditional logistic regression model providing the best fit to the data.
Asghari, Mehdi Poursheikhali; Hayatshahi, Sayyed Hamed Sadat; Abdolmaleki, Parviz
2012-01-01
From both the structural and functional points of view, β-turns play important biological roles in proteins. In the present study, a novel two-stage hybrid procedure has been developed to identify β-turns in proteins. Binary logistic regression was initially used for the first time to select significant sequence parameters in identification of β-turns due to a re-substitution test procedure. Sequence parameters were consisted of 80 amino acid positional occurrences and 20 amino acid percentages in sequence. Among these parameters, the most significant ones which were selected by binary logistic regression model, were percentages of Gly, Ser and the occurrence of Asn in position i+2, respectively, in sequence. These significant parameters have the highest effect on the constitution of a β-turn sequence. A neural network model was then constructed and fed by the parameters selected by binary logistic regression to build a hybrid predictor. The networks have been trained and tested on a non-homologous dataset of 565 protein chains. With applying a nine fold cross-validation test on the dataset, the network reached an overall accuracy (Qtotal) of 74, which is comparable with results of the other β-turn prediction methods. In conclusion, this study proves that the parameter selection ability of binary logistic regression together with the prediction capability of neural networks lead to the development of more precise models for identifying β-turns in proteins. PMID:27418910
Asghari, Mehdi Poursheikhali; Hayatshahi, Sayyed Hamed Sadat; Abdolmaleki, Parviz
2012-01-01
From both the structural and functional points of view, β-turns play important biological roles in proteins. In the present study, a novel two-stage hybrid procedure has been developed to identify β-turns in proteins. Binary logistic regression was initially used for the first time to select significant sequence parameters in identification of β-turns due to a re-substitution test procedure. Sequence parameters were consisted of 80 amino acid positional occurrences and 20 amino acid percentages in sequence. Among these parameters, the most significant ones which were selected by binary logistic regression model, were percentages of Gly, Ser and the occurrence of Asn in position i+2, respectively, in sequence. These significant parameters have the highest effect on the constitution of a β-turn sequence. A neural network model was then constructed and fed by the parameters selected by binary logistic regression to build a hybrid predictor. The networks have been trained and tested on a non-homologous dataset of 565 protein chains. With applying a nine fold cross-validation test on the dataset, the network reached an overall accuracy (Qtotal) of 74, which is comparable with results of the other β-turn prediction methods. In conclusion, this study proves that the parameter selection ability of binary logistic regression together with the prediction capability of neural networks lead to the development of more precise models for identifying β-turns in proteins.
Crane, Paul K; Gibbons, Laura E; Jolley, Lance; van Belle, Gerald
2006-11-01
We present an ordinal logistic regression model for identification of items with differential item functioning (DIF) and apply this model to a Mini-Mental State Examination (MMSE) dataset. We employ item response theory ability estimation in our models. Three nested ordinal logistic regression models are applied to each item. Model testing begins with examination of the statistical significance of the interaction term between ability and the group indicator, consistent with nonuniform DIF. Then we turn our attention to the coefficient of the ability term in models with and without the group term. If including the group term has a marked effect on that coefficient, we declare that it has uniform DIF. We examined DIF related to language of test administration in addition to self-reported race, Hispanic ethnicity, age, years of education, and sex. We used PARSCALE for IRT analyses and STATA for ordinal logistic regression approaches. We used an iterative technique for adjusting IRT ability estimates on the basis of DIF findings. Five items were found to have DIF related to language. These same items also had DIF related to other covariates. The ordinal logistic regression approach to DIF detection, when combined with IRT ability estimates, provides a reasonable alternative for DIF detection. There appear to be several items with significant DIF related to language of test administration in the MMSE. More attention needs to be paid to the specific criteria used to determine whether an item has DIF, not just the technique used to identify DIF.
Conditional Poisson models: a flexible alternative to conditional logistic case cross-over analysis.
Armstrong, Ben G; Gasparrini, Antonio; Tobias, Aurelio
2014-11-24
The time stratified case cross-over approach is a popular alternative to conventional time series regression for analysing associations between time series of environmental exposures (air pollution, weather) and counts of health outcomes. These are almost always analyzed using conditional logistic regression on data expanded to case-control (case crossover) format, but this has some limitations. In particular adjusting for overdispersion and auto-correlation in the counts is not possible. It has been established that a Poisson model for counts with stratum indicators gives identical estimates to those from conditional logistic regression and does not have these limitations, but it is little used, probably because of the overheads in estimating many stratum parameters. The conditional Poisson model avoids estimating stratum parameters by conditioning on the total event count in each stratum, thus simplifying the computing and increasing the number of strata for which fitting is feasible compared with the standard unconditional Poisson model. Unlike the conditional logistic model, the conditional Poisson model does not require expanding the data, and can adjust for overdispersion and auto-correlation. It is available in Stata, R, and other packages. By applying to some real data and using simulations, we demonstrate that conditional Poisson models were simpler to code and shorter to run than are conditional logistic analyses and can be fitted to larger data sets than possible with standard Poisson models. Allowing for overdispersion or autocorrelation was possible with the conditional Poisson model but when not required this model gave identical estimates to those from conditional logistic regression. Conditional Poisson regression models provide an alternative to case crossover analysis of stratified time series data with some advantages. The conditional Poisson model can also be used in other contexts in which primary control for confounding is by fine stratification.
Use of generalized ordered logistic regression for the analysis of multidrug resistance data.
Agga, Getahun E; Scott, H Morgan
2015-10-01
Statistical analysis of antimicrobial resistance data largely focuses on individual antimicrobial's binary outcome (susceptible or resistant). However, bacteria are becoming increasingly multidrug resistant (MDR). Statistical analysis of MDR data is mostly descriptive often with tabular or graphical presentations. Here we report the applicability of generalized ordinal logistic regression model for the analysis of MDR data. A total of 1,152 Escherichia coli, isolated from the feces of weaned pigs experimentally supplemented with chlortetracycline (CTC) and copper, were tested for susceptibilities against 15 antimicrobials and were binary classified into resistant or susceptible. The 15 antimicrobial agents tested were grouped into eight different antimicrobial classes. We defined MDR as the number of antimicrobial classes to which E. coli isolates were resistant ranging from 0 to 8. Proportionality of the odds assumption of the ordinal logistic regression model was violated only for the effect of treatment period (pre-treatment, during-treatment and post-treatment); but not for the effect of CTC or copper supplementation. Subsequently, a partially constrained generalized ordinal logistic model was built that allows for the effect of treatment period to vary while constraining the effects of treatment (CTC and copper supplementation) to be constant across the levels of MDR classes. Copper (Proportional Odds Ratio [Prop OR]=1.03; 95% CI=0.73-1.47) and CTC (Prop OR=1.1; 95% CI=0.78-1.56) supplementation were not significantly associated with the level of MDR adjusted for the effect of treatment period. MDR generally declined over the trial period. In conclusion, generalized ordered logistic regression can be used for the analysis of ordinal data such as MDR data when the proportionality assumptions for ordered logistic regression are violated. Published by Elsevier B.V.
Assessing LULC changes over Chilika Lake watershed in Eastern India using Driving Force Analysis
NASA Astrophysics Data System (ADS)
Jadav, S.; Syed, T. H.
2017-12-01
Rapid population growth and industrial development has brought about significant changes in Land Use Land Cover (LULC) of many developing countries in the world. This study investigates LULC changes in the Chilika Lake watershed of Eastern India for the period of 1988 to 2016. The methodology involves pre-processing and classification of Landsat satellite images using support vector machine (SVM) supervised classification algorithm. Results reveal that `Cropland', `Emergent Vegetation' and `Settlement' has expanded over the study period by 284.61 km², 106.83 km² and 98.83 km² respectively. Contemporaneously, `Lake Area', `Vegetation' and `Scrub Land' have decreased by 121.62 km², 96.05 km² and 80.29 km² respectively. This study also analyzes five major driving force variables of socio-economic and climatological factors triggering LULC changes through a bivariate logistic regression model. The outcome gives credible relative operating characteristics (ROC) value of 0.76 that indicate goodness fit of logistic regression model. In addition, independent variables like distance to drainage network and average annual rainfall have negative regression coefficient values that represent decreased rate of dependent variable (changed LULC) whereas independent variables (population density, distance to road and distance to railway) have positive regression coefficient indicates increased rate of changed LULC . Results from this study will be crucial for planning and restoration of this vital lake water body that has major implications over the society and environment at large.
Fei, Y; Hu, J; Li, W-Q; Wang, W; Zong, G-Q
2017-03-01
Essentials Predicting the occurrence of portosplenomesenteric vein thrombosis (PSMVT) is difficult. We studied 72 patients with acute pancreatitis. Artificial neural networks modeling was more accurate than logistic regression in predicting PSMVT. Additional predictive factors may be incorporated into artificial neural networks. Objective To construct and validate artificial neural networks (ANNs) for predicting the occurrence of portosplenomesenteric venous thrombosis (PSMVT) and compare the predictive ability of the ANNs with that of logistic regression. Methods The ANNs and logistic regression modeling were constructed using simple clinical and laboratory data of 72 acute pancreatitis (AP) patients. The ANNs and logistic modeling were first trained on 48 randomly chosen patients and validated on the remaining 24 patients. The accuracy and the performance characteristics were compared between these two approaches by SPSS17.0 software. Results The training set and validation set did not differ on any of the 11 variables. After training, the back propagation network training error converged to 1 × 10 -20 , and it retained excellent pattern recognition ability. When the ANNs model was applied to the validation set, it revealed a sensitivity of 80%, specificity of 85.7%, a positive predictive value of 77.6% and negative predictive value of 90.7%. The accuracy was 83.3%. Differences could be found between ANNs modeling and logistic regression modeling in these parameters (10.0% [95% CI, -14.3 to 34.3%], 14.3% [95% CI, -8.6 to 37.2%], 15.7% [95% CI, -9.9 to 41.3%], 11.8% [95% CI, -8.2 to 31.8%], 22.6% [95% CI, -1.9 to 47.1%], respectively). When ANNs modeling was used to identify PSMVT, the area under receiver operating characteristic curve was 0.849 (95% CI, 0.807-0.901), which demonstrated better overall properties than logistic regression modeling (AUC = 0.716) (95% CI, 0.679-0.761). Conclusions ANNs modeling was a more accurate tool than logistic regression in predicting the occurrence of PSMVT following AP. More clinical factors or biomarkers may be incorporated into ANNs modeling to improve its predictive ability. © 2016 International Society on Thrombosis and Haemostasis.
McLaren, Christine E.; Chen, Wen-Pin; Nie, Ke; Su, Min-Ying
2009-01-01
Rationale and Objectives Dynamic contrast enhanced MRI (DCE-MRI) is a clinical imaging modality for detection and diagnosis of breast lesions. Analytical methods were compared for diagnostic feature selection and performance of lesion classification to differentiate between malignant and benign lesions in patients. Materials and Methods The study included 43 malignant and 28 benign histologically-proven lesions. Eight morphological parameters, ten gray level co-occurrence matrices (GLCM) texture features, and fourteen Laws’ texture features were obtained using automated lesion segmentation and quantitative feature extraction. Artificial neural network (ANN) and logistic regression analysis were compared for selection of the best predictors of malignant lesions among the normalized features. Results Using ANN, the final four selected features were compactness, energy, homogeneity, and Law_LS, with area under the receiver operating characteristic curve (AUC) = 0.82, and accuracy = 0.76. The diagnostic performance of these 4-features computed on the basis of logistic regression yielded AUC = 0.80 (95% CI, 0.688 to 0.905), similar to that of ANN. The analysis also shows that the odds of a malignant lesion decreased by 48% (95% CI, 25% to 92%) for every increase of 1 SD in the Law_LS feature, adjusted for differences in compactness, energy, and homogeneity. Using logistic regression with z-score transformation, a model comprised of compactness, NRL entropy, and gray level sum average was selected, and it had the highest overall accuracy of 0.75 among all models, with AUC = 0.77 (95% CI, 0.660 to 0.880). When logistic modeling of transformations using the Box-Cox method was performed, the most parsimonious model with predictors, compactness and Law_LS, had an AUC of 0.79 (95% CI, 0.672 to 0.898). Conclusion The diagnostic performance of models selected by ANN and logistic regression was similar. The analytic methods were found to be roughly equivalent in terms of predictive ability when a small number of variables were chosen. The robust ANN methodology utilizes a sophisticated non-linear model, while logistic regression analysis provides insightful information to enhance interpretation of the model features. PMID:19409817
Qin, Gang; Bian, Zhao-Lian; Shen, Yi; Zhang, Lei; Zhu, Xiao-Hong; Liu, Yan-Mei; Shao, Jian-Guo
2016-06-04
Several models have been proposed to predict the short-term outcome of acute-on-chronic liver failure (ACLF) after treatment. We aimed to determine whether better decisions for artificial liver support system (ALSS) treatment could be made with a model than without, through decision curve analysis (DCA). The medical profiles of a cohort of 232 patients with hepatitis B virus (HBV)-associated ACLF were retrospectively analyzed to explore the role of plasma prothrombin activity (PTA), model for end-stage liver disease (MELD) and logistic regression model (LRM) in identifying patients who could benefit from ALSS. The accuracy and reliability of PTA, MELD and LRM were evaluated with previously reported cutoffs. DCA was performed to evaluate the clinical role of these models in predicting the treatment outcome. With the cut-off value of 0.2, LRM had sensitivity of 92.6 %, specificity of 42.3 % and an area under the receiving operating characteristic curve (AUC) of 0.68, which showed superior discrimination over PTA and MELD. DCA revealed that the LRM-guided ALSS treatment was superior over other strategies including "treating all" and MELD-guided therapy, for the midrange threshold probabilities of 16 to 64 %. The use of LRM-guided ALSS treatment could increase both the accuracy and efficiency of this procedure, allowing the avoidance of unnecessary ALSS.
Coly, A; Morisky, D
2004-06-01
Two health clinics in Los Angeles County, California. To identify factors associated with completion of care among foreign-born adolescents treated for latent tuberculosis infection (LTBI). A total of 766 low-income adolescents (79% participation rate), including 610 foreign-born, were recruited. In prospective face-to-face interviews, data were obtained on socio-demographic and lifestyle characteristics, psychosocial factors and clinic-related variables. Medical chart data were abstracted regarding clinic appointment keeping and completion of treatment. Univariate and multivariate logistic regression analyses were performed to identify factors associated with completion of care. Foreign-born adolescents were more likely to complete care than US-born adolescents, with 82% completion of care rate. In logistic regression analyses after controlling for age, medication taking behavior (OR 1.26, 95%CI 1.15-1.39), living with both parents (OR 1.74, 95%CI 1.02-2.97), sexual intercourse (OR 0.66, 95%CI 0.36-1.19) and speaking mostly or only English with parents (OR 0.39, 95%CI 0.15-1.03) were independently associated with completion of care. These findings contribute to our understanding of the factors that may explain why some adolescents complete care whereas others do not. They provide supportive evidence that tailored intervention programs should be developed to support the screening and completion of treatment of foreign-born adolescents.
Multivariate Models for Prediction of Human Skin Sensitization ...
One of the lnteragency Coordinating Committee on the Validation of Alternative Method's (ICCVAM) top priorities is the development and evaluation of non-animal approaches to identify potential skin sensitizers. The complexity of biological events necessary to produce skin sensitization suggests that no single alternative method will replace the currently accepted animal tests. ICCVAM is evaluating an integrated approach to testing and assessment based on the adverse outcome pathway for skin sensitization that uses machine learning approaches to predict human skin sensitization hazard. We combined data from three in chemico or in vitro assays - the direct peptide reactivity assay (DPRA), human cell line activation test (h-CLAT) and KeratinoSens TM assay - six physicochemical properties and an in silico read-across prediction of skin sensitization hazard into 12 variable groups. The variable groups were evaluated using two machine learning approaches , logistic regression and support vector machine, to predict human skin sensitization hazard. Models were trained on 72 substances and tested on an external set of 24 substances. The six models (three logistic regression and three support vector machine) with the highest accuracy (92%) used: (1) DPRA, h-CLAT and read-across; (2) DPRA, h-CLAT, read-across and KeratinoSens; or (3) DPRA, h-CLAT, read-across, KeratinoSens and log P. The models performed better at predicting human skin sensitization hazard than the murine
Ai, Zi-Sheng; Gao, You-Shui; Sun, Yuan; Liu, Yue; Zhang, Chang-Qing; Jiang, Cheng-Hua
2013-03-01
Risk factors for femoral neck fracture-induced avascular necrosis of the femoral head have not been elucidated clearly in middle-aged and elderly patients. Moreover, the high incidence of screw removal in China and its effect on the fate of the involved femoral head require statistical methods to reflect their intrinsic relationship. Ninety-nine patients older than 45 years with femoral neck fracture were treated by internal fixation between May 1999 and April 2004. Descriptive analysis, interaction analysis between associated factors, single factor logistic regression, multivariate logistic regression, and detailed interaction analysis were employed to explore potential relationships among associated factors. Avascular necrosis of the femoral head was found in 15 cases (15.2 %). Age × the status of implants (removal vs. maintenance) and gender × the timing of reduction were interactive according to two-factor interactive analysis. Age, the displacement of fractures, the quality of reduction, and the status of implants were found to be significant factors in single factor logistic regression analysis. Age, age × the status of implants, and the quality of reduction were found to be significant factors in multivariate logistic regression analysis. In fine interaction analysis after multivariate logistic regression analysis, implant removal was the most important risk factor for avascular necrosis in 56-to-85-year-old patients, with a risk ratio of 26.00 (95 % CI = 3.076-219.747). The middle-aged and elderly have less incidence of avascular necrosis of the femoral head following femoral neck fractures treated by cannulated screws. The removal of cannulated screws can induce a significantly high incidence of avascular necrosis of the femoral head in elderly patients, while a high-quality reduction is helpful to reduce avascular necrosis.
Zhou, Jinzhe; Zhou, Yanbing; Cao, Shougen; Li, Shikuan; Wang, Hao; Niu, Zhaojian; Chen, Dong; Wang, Dongsheng; Lv, Liang; Zhang, Jian; Li, Yu; Jiao, Xuelong; Tan, Xiaojie; Zhang, Jianli; Wang, Haibo; Zhang, Bingyuan; Lu, Yun; Sun, Zhenqing
2016-01-01
Reporting of surgical complications is common, but few provide information about the severity and estimate risk factors of complications. If have, but lack of specificity. We retrospectively analyzed data on 2795 gastric cancer patients underwent surgical procedure at the Affiliated Hospital of Qingdao University between June 2007 and June 2012, established multivariate logistic regression model to predictive risk factors related to the postoperative complications according to the Clavien-Dindo classification system. Twenty-four out of 86 variables were identified statistically significant in univariate logistic regression analysis, 11 significant variables entered multivariate analysis were employed to produce the risk model. Liver cirrhosis, diabetes mellitus, Child classification, invasion of neighboring organs, combined resection, introperative transfusion, Billroth II anastomosis of reconstruction, malnutrition, surgical volume of surgeons, operating time and age were independent risk factors for postoperative complications after gastrectomy. Based on logistic regression equation, p=Exp∑BiXi / (1+Exp∑BiXi), multivariate logistic regression predictive model that calculated the risk of postoperative morbidity was developed, p = 1/(1 + e((4.810-1.287X1-0.504X2-0.500X3-0.474X4-0.405X5-0.318X6-0.316X7-0.305X8-0.278X9-0.255X10-0.138X11))). The accuracy, sensitivity and specificity of the model to predict the postoperative complications were 86.7%, 76.2% and 88.6%, respectively. This risk model based on Clavien-Dindo grading severity of complications system and logistic regression analysis can predict severe morbidity specific to an individual patient's risk factors, estimate patients' risks and benefits of gastric surgery as an accurate decision-making tool and may serve as a template for the development of risk models for other surgical groups.
Support for smoke-free policies in the Cyprus hospitality industry.
Lazuras, Lambros; Savva, Christos S; Talias, Michael A; Soteriades, Elpidoforos S
2015-12-01
The present study used attitudinal and behavioural indicators to measure support for smoke-free policies among employers and employees in the hospitality industry in Cyprus. A representative sample of 600 participants (95 % response rate) completed anonymous structured questionnaires on demographic variables, smoking status, exposure to second-hand smoke at work and related health beliefs, social norms, and smoke-free policy support. Participants were predominantly males (68.3 %), with a mean age of 40 years (SD = 12.69), and 39.7 % were employers/owners of the hospitality venue. Analysis of variance showed that employers and smokers were less supportive of smoke-free policies, as compared to employees and non-smokers. Linear regression models showed that attitudes towards smoke-free policy were predicted by smoking status, SHS exposure and related health beliefs, and social norm variables. Logistic regression analysis showed that willingness to confront a policy violator was predicted by SHS exposure, perceived prevalence of smoker clients, and smoke-free policy attitudes. SHS exposure and related health beliefs, and normative factors should be targeted by interventions aiming to promote policy support in the hospitality industry in Cyprus.
DOT National Transportation Integrated Search
1988-10-01
An analysis of the current environment within the Acquisition stage of the Weapon System Life Cycle Pertaining to the Logistics Support Analysis (LSA) process, the Logistics Support Analysis Record (LSAR), and other Logistics Support data was underta...
DOT National Transportation Integrated Search
1988-10-01
An analysis of the current environment within the Acquisition stage of the Weapon System Life Cycle Pertaining to the Logistics Support Analysis (LSA) process, the Logistics Support Analysis Record (LSAR), and other Logistics Support data was underta...
Ioannidis, J P; McQueen, P G; Goedert, J J; Kaslow, R A
1998-03-01
Complex immunogenetic associations of disease involving a large number of gene products are difficult to evaluate with traditional statistical methods and may require complex modeling. The authors evaluated the performance of feed-forward backpropagation neural networks in predicting rapid progression to acquired immunodeficiency syndrome (AIDS) for patients with human immunodeficiency virus (HIV) infection on the basis of major histocompatibility complex variables. Networks were trained on data from patients from the Multicenter AIDS Cohort Study (n = 139) and then validated on patients from the DC Gay cohort (n = 102). The outcome of interest was rapid disease progression, defined as progression to AIDS in <6 years from seroconversion. Human leukocyte antigen (HLA) variables were selected as network inputs with multivariate regression and a previously described algorithm selecting markers with extreme point estimates for progression risk. Network performance was compared with that of logistic regression. Networks with 15 HLA inputs and a single hidden layer of five nodes achieved a sensitivity of 87.5% and specificity of 95.6% in the training set, vs. 77.0% and 76.9%, respectively, achieved by logistic regression. When validated on the DC Gay cohort, networks averaged a sensitivity of 59.1% and specificity of 74.3%, vs. 53.1% and 61.4%, respectively, for logistic regression. Neural networks offer further support to the notion that HIV disease progression may be dependent on complex interactions between different class I and class II alleles and transporters associated with antigen processing variants. The effect in the current models is of moderate magnitude, and more data as well as other host and pathogen variables may need to be considered to improve the performance of the models. Artificial intelligence methods may complement linear statistical methods for evaluating immunogenetic associations of disease.
Rank-Optimized Logistic Matrix Regression toward Improved Matrix Data Classification.
Zhang, Jianguang; Jiang, Jianmin
2018-02-01
While existing logistic regression suffers from overfitting and often fails in considering structural information, we propose a novel matrix-based logistic regression to overcome the weakness. In the proposed method, 2D matrices are directly used to learn two groups of parameter vectors along each dimension without vectorization, which allows the proposed method to fully exploit the underlying structural information embedded inside the 2D matrices. Further, we add a joint [Formula: see text]-norm on two parameter matrices, which are organized by aligning each group of parameter vectors in columns. This added co-regularization term has two roles-enhancing the effect of regularization and optimizing the rank during the learning process. With our proposed fast iterative solution, we carried out extensive experiments. The results show that in comparison to both the traditional tensor-based methods and the vector-based regression methods, our proposed solution achieves better performance for matrix data classifications.
Anxiety and Depression among Breast Cancer Patients in an Urban Setting in Malaysia.
Hassan, Mohd Rohaizat; Shah, Shamsul Azhar; Ghazi, Hasanain Faisal; Mohd Mujar, Noor Mastura; Samsuri, Mohd Fadhli; Baharom, Nizam
2015-01-01
Breast cancer is one of the most feared diseases among women and it could induce the development of psychological disorders like anxiety and depression. An assessment was here performed of the status and to determine contributory factors. A cross-sectional study was conducted among breast cancer patients at University Kebangsaan Malaysia Medical Center (UKMMC), Kuala Lumpur. A total of 205 patients who were diagnosed between 2007 until 2010 were interviewed using the questionnaires of Hospital Anxiety and Depression (HADS). The associated factors investigated concerned socio-demographics, socio economic background and the cancer status. Descriptive analysis, chi-squared tests and logistic regression were used for the statistical test analysis. The prevalence of anxiety was 31.7% (n=65 ) and of depression was 22.0% (n=45) among the breast cancer patients. Age group (p= 0.032), monthly income (p=0.015) and number of visits per month (p=0.007) were significantly associated with anxiety. For depression, marital status (p=0.012), accompanying person (p=0.041), financial support (p-0.007) and felt burden (p=0.038) were significantly associated. In binary logistic regression, those in the younger age group were low monthly income were 2 times more likely to be associated with anxiety. Having less financial support and being single were 3 and 4 times more likely to be associated with depression. In management of breast cancer patients, more care or support should be given to the young and low socio economic status as they are at high risk of anxiety and depression.
Detecting DIF in Polytomous Items Using MACS, IRT and Ordinal Logistic Regression
ERIC Educational Resources Information Center
Elosua, Paula; Wells, Craig
2013-01-01
The purpose of the present study was to compare the Type I error rate and power of two model-based procedures, the mean and covariance structure model (MACS) and the item response theory (IRT), and an observed-score based procedure, ordinal logistic regression, for detecting differential item functioning (DIF) in polytomous items. A simulation…
ERIC Educational Resources Information Center
Rudner, Lawrence
2016-01-01
In the machine learning literature, it is commonly accepted as fact that as calibration sample sizes increase, Naïve Bayes classifiers initially outperform Logistic Regression classifiers in terms of classification accuracy. Applied to subtests from an on-line final examination and from a highly regarded certification examination, this study shows…
ERIC Educational Resources Information Center
Fan, Xitao; Wang, Lin
The Monte Carlo study compared the performance of predictive discriminant analysis (PDA) and that of logistic regression (LR) for the two-group classification problem. Prior probabilities were used for classification, but the cost of misclassification was assumed to be equal. The study used a fully crossed three-factor experimental design (with…
ERIC Educational Resources Information Center
Nguyen, Phuong L.
2006-01-01
This study examines the effects of parental SES, school quality, and community factors on children's enrollment and achievement in rural areas in Viet Nam, using logistic regression and ordered logistic regression. Multivariate analysis reveals significant differences in educational enrollment and outcomes by level of household expenditures and…
School Exits in the Milwaukee Parental Choice Program: Evidence of a Marketplace?
ERIC Educational Resources Information Center
Ford, Michael
2011-01-01
This article examines whether the large number of school exits from the Milwaukee school voucher program is evidence of a marketplace. Two logistic regression and multinomial logistic regression models tested the relation between the inability to draw large numbers of voucher students and the ability for a private school to remain viable. Data on…
Hierarchical Bayesian Logistic Regression to forecast metabolic control in type 2 DM patients.
Dagliati, Arianna; Malovini, Alberto; Decata, Pasquale; Cogni, Giulia; Teliti, Marsida; Sacchi, Lucia; Cerra, Carlo; Chiovato, Luca; Bellazzi, Riccardo
2016-01-01
In this work we present our efforts in building a model able to forecast patients' changes in clinical conditions when repeated measurements are available. In this case the available risk calculators are typically not applicable. We propose a Hierarchical Bayesian Logistic Regression model, which allows taking into account individual and population variability in model parameters estimate. The model is used to predict metabolic control and its variation in type 2 diabetes mellitus. In particular we have analyzed a population of more than 1000 Italian type 2 diabetic patients, collected within the European project Mosaic. The results obtained in terms of Matthews Correlation Coefficient are significantly better than the ones gathered with standard logistic regression model, based on data pooling.
Model building strategy for logistic regression: purposeful selection.
Zhang, Zhongheng
2016-03-01
Logistic regression is one of the most commonly used models to account for confounders in medical literature. The article introduces how to perform purposeful selection model building strategy with R. I stress on the use of likelihood ratio test to see whether deleting a variable will have significant impact on model fit. A deleted variable should also be checked for whether it is an important adjustment of remaining covariates. Interaction should be checked to disentangle complex relationship between covariates and their synergistic effect on response variable. Model should be checked for the goodness-of-fit (GOF). In other words, how the fitted model reflects the real data. Hosmer-Lemeshow GOF test is the most widely used for logistic regression model.
Butler, Sandra S; Simpson, Nan; Brennan, Mark; Turner, Winston
2010-11-01
Recruiting and retaining an adequate number of personal support workers in home care is both challenging and essential to allowing elders to age in place. A mixed-method, longitudinal study examined turnover in a sample of 261 personal support workers in Maine; 70 workers (26.8%) left their employment in the first year of the study. Logistic regression analysis indicated that younger age and lack of health insurance were significant predictors of turnover. Analysis of telephone interviews revealed three overarching themes related to termination: job not worthwhile, personal reasons, and burnout. Implications of study findings for gerontological social workers are outlined.
NASA Astrophysics Data System (ADS)
Ceppi, C.; Mancini, F.; Ritrovato, G.
2009-04-01
This study aim at the landslide susceptibility mapping within an area of the Daunia (Apulian Apennines, Italy) by a multivariate statistical method and data manipulation in a Geographical Information System (GIS) environment. Among the variety of existing statistical data analysis techniques, the logistic regression was chosen to produce a susceptibility map all over an area where small settlements are historically threatened by landslide phenomena. By logistic regression a best fitting between the presence or absence of landslide (dependent variable) and the set of independent variables is performed on the basis of a maximum likelihood criterion, bringing to the estimation of regression coefficients. The reliability of such analysis is therefore due to the ability to quantify the proneness to landslide occurrences by the probability level produced by the analysis. The inventory of dependent and independent variables were managed in a GIS, where geometric properties and attributes have been translated into raster cells in order to proceed with the logistic regression by means of SPSS (Statistical Package for the Social Sciences) package. A landslide inventory was used to produce the bivariate dependent variable whereas the independent set of variable concerned with slope, aspect, elevation, curvature, drained area, lithology and land use after their reductions to dummy variables. The effect of independent parameters on landslide occurrence was assessed by the corresponding coefficient in the logistic regression function, highlighting a major role played by the land use variable in determining occurrence and distribution of phenomena. Once the outcomes of the logistic regression are determined, data are re-introduced in the GIS to produce a map reporting the proneness to landslide as predicted level of probability. As validation of results and regression model a cell-by-cell comparison between the susceptibility map and the initial inventory of landslide events was performed and an agreement at 75% level achieved.
Determination of riverbank erosion probability using Locally Weighted Logistic Regression
NASA Astrophysics Data System (ADS)
Ioannidou, Elena; Flori, Aikaterini; Varouchakis, Emmanouil A.; Giannakis, Georgios; Vozinaki, Anthi Eirini K.; Karatzas, George P.; Nikolaidis, Nikolaos
2015-04-01
Riverbank erosion is a natural geomorphologic process that affects the fluvial environment. The most important issue concerning riverbank erosion is the identification of the vulnerable locations. An alternative to the usual hydrodynamic models to predict vulnerable locations is to quantify the probability of erosion occurrence. This can be achieved by identifying the underlying relations between riverbank erosion and the geomorphological or hydrological variables that prevent or stimulate erosion. Thus, riverbank erosion can be determined by a regression model using independent variables that are considered to affect the erosion process. The impact of such variables may vary spatially, therefore, a non-stationary regression model is preferred instead of a stationary equivalent. Locally Weighted Regression (LWR) is proposed as a suitable choice. This method can be extended to predict the binary presence or absence of erosion based on a series of independent local variables by using the logistic regression model. It is referred to as Locally Weighted Logistic Regression (LWLR). Logistic regression is a type of regression analysis used for predicting the outcome of a categorical dependent variable (e.g. binary response) based on one or more predictor variables. The method can be combined with LWR to assign weights to local independent variables of the dependent one. LWR allows model parameters to vary over space in order to reflect spatial heterogeneity. The probabilities of the possible outcomes are modelled as a function of the independent variables using a logistic function. Logistic regression measures the relationship between a categorical dependent variable and, usually, one or several continuous independent variables by converting the dependent variable to probability scores. Then, a logistic regression is formed, which predicts success or failure of a given binary variable (e.g. erosion presence or absence) for any value of the independent variables. The erosion occurrence probability can be calculated in conjunction with the model deviance regarding the independent variables tested. The most straightforward measure for goodness of fit is the G statistic. It is a simple and effective way to study and evaluate the Logistic Regression model efficiency and the reliability of each independent variable. The developed statistical model is applied to the Koiliaris River Basin on the island of Crete, Greece. Two datasets of river bank slope, river cross-section width and indications of erosion were available for the analysis (12 and 8 locations). Two different types of spatial dependence functions, exponential and tricubic, were examined to determine the local spatial dependence of the independent variables at the measurement locations. The results show a significant improvement when the tricubic function is applied as the erosion probability is accurately predicted at all eight validation locations. Results for the model deviance show that cross-section width is more important than bank slope in the estimation of erosion probability along the Koiliaris riverbanks. The proposed statistical model is a useful tool that quantifies the erosion probability along the riverbanks and can be used to assist managing erosion and flooding events. Acknowledgements This work is part of an on-going THALES project (CYBERSENSORS - High Frequency Monitoring System for Integrated Water Resources Management of Rivers). The project has been co-financed by the European Union (European Social Fund - ESF) and Greek national funds through the Operational Program "Education and Lifelong Learning" of the National Strategic Reference Framework (NSRF) - Research Funding Program: THALES. Investing in knowledge society through the European Social Fund.
NASA Astrophysics Data System (ADS)
Yilmaz, Işık
2009-06-01
The purpose of this study is to compare the landslide susceptibility mapping methods of frequency ratio (FR), logistic regression and artificial neural networks (ANN) applied in the Kat County (Tokat—Turkey). Digital elevation model (DEM) was first constructed using GIS software. Landslide-related factors such as geology, faults, drainage system, topographical elevation, slope angle, slope aspect, topographic wetness index (TWI) and stream power index (SPI) were used in the landslide susceptibility analyses. Landslide susceptibility maps were produced from the frequency ratio, logistic regression and neural networks models, and they were then compared by means of their validations. The higher accuracies of the susceptibility maps for all three models were obtained from the comparison of the landslide susceptibility maps with the known landslide locations. However, respective area under curve (AUC) values of 0.826, 0.842 and 0.852 for frequency ratio, logistic regression and artificial neural networks showed that the map obtained from ANN model is more accurate than the other models, accuracies of all models can be evaluated relatively similar. The results obtained in this study also showed that the frequency ratio model can be used as a simple tool in assessment of landslide susceptibility when a sufficient number of data were obtained. Input process, calculations and output process are very simple and can be readily understood in the frequency ratio model, however logistic regression and neural networks require the conversion of data to ASCII or other formats. Moreover, it is also very hard to process the large amount of data in the statistical package.
2007-03-01
simulation are analyzed using regression, statistical and marginal benefit techniques to show how the MOEs are affected by varying levels of the...being supported by the seabase increases. A large marginal benefit is realized in reducing a unit’s frequency and time spent in a balk state by...units. SOF units operate within the range of sea-based helicopter assets; therefore the risk of a ‘ bingo ’ (i.e., near empty) fuel state is nearly
2004-03-01
Breusch - Pagan test for constant variance of the residuals. Using Microsoft Excel® we calculate a p-value of 0.841237. This high p-value, which is above...our alpha of 0.05, indicates that our residuals indeed pass the Breusch - Pagan test for constant variance. In addition to the assumption tests , we...Wilk Test for Normality – Support (Reduced) Model (OLS) Finally, we perform a Breusch - Pagan test for constant variance of the residuals. Using
ERIC Educational Resources Information Center
Schumacher, Phyllis; Olinsky, Alan; Quinn, John; Smith, Richard
2010-01-01
The authors extended previous research by 2 of the authors who conducted a study designed to predict the successful completion of students enrolled in an actuarial program. They used logistic regression to determine the probability of an actuarial student graduating in the major or dropping out. They compared the results of this study with those…
Carolyn B. Meyer; Sherri L. Miller; C. John Ralph
2004-01-01
The scale at which habitat variables are measured affects the accuracy of resource selection functions in predicting animal use of sites. We used logistic regression models for a wide-ranging species, the marbled murrelet, (Brachyramphus marmoratus) in a large region in California to address how much changing the spatial or temporal scale of...
ERIC Educational Resources Information Center
Monahan, Patrick O.; McHorney, Colleen A.; Stump, Timothy E.; Perkins, Anthony J.
2007-01-01
Previous methodological and applied studies that used binary logistic regression (LR) for detection of differential item functioning (DIF) in dichotomously scored items either did not report an effect size or did not employ several useful measures of DIF magnitude derived from the LR model. Equations are provided for these effect size indices.…
ERIC Educational Resources Information Center
Magis, David; Raiche, Gilles; Beland, Sebastien; Gerard, Paul
2011-01-01
We present an extension of the logistic regression procedure to identify dichotomous differential item functioning (DIF) in the presence of more than two groups of respondents. Starting from the usual framework of a single focal group, we propose a general approach to estimate the item response functions in each group and to test for the presence…
Risk Factors of Falls in Community-Dwelling Older Adults: Logistic Regression Tree Analysis
ERIC Educational Resources Information Center
Yamashita, Takashi; Noe, Douglas A.; Bailer, A. John
2012-01-01
Purpose of the Study: A novel logistic regression tree-based method was applied to identify fall risk factors and possible interaction effects of those risk factors. Design and Methods: A nationally representative sample of American older adults aged 65 years and older (N = 9,592) in the Health and Retirement Study 2004 and 2006 modules was used.…
ERIC Educational Resources Information Center
Gordovil-Merino, Amalia; Guardia-Olmos, Joan; Pero-Cebollero, Maribel
2012-01-01
In this paper, we used simulations to compare the performance of classical and Bayesian estimations in logistic regression models using small samples. In the performed simulations, conditions were varied, including the type of relationship between independent and dependent variable values (i.e., unrelated and related values), the type of variable…
Ohlmacher, G.C.; Davis, J.C.
2003-01-01
Landslides in the hilly terrain along the Kansas and Missouri rivers in northeastern Kansas have caused millions of dollars in property damage during the last decade. To address this problem, a statistical method called multiple logistic regression has been used to create a landslide-hazard map for Atchison, Kansas, and surrounding areas. Data included digitized geology, slopes, and landslides, manipulated using ArcView GIS. Logistic regression relates predictor variables to the occurrence or nonoccurrence of landslides within geographic cells and uses the relationship to produce a map showing the probability of future landslides, given local slopes and geologic units. Results indicated that slope is the most important variable for estimating landslide hazard in the study area. Geologic units consisting mostly of shale, siltstone, and sandstone were most susceptible to landslides. Soil type and aspect ratio were considered but excluded from the final analysis because these variables did not significantly add to the predictive power of the logistic regression. Soil types were highly correlated with the geologic units, and no significant relationships existed between landslides and slope aspect. ?? 2003 Elsevier Science B.V. All rights reserved.
A Method for Calculating the Probability of Successfully Completing a Rocket Propulsion Ground Test
NASA Technical Reports Server (NTRS)
Messer, Bradley
2007-01-01
Propulsion ground test facilities face the daily challenge of scheduling multiple customers into limited facility space and successfully completing their propulsion test projects. Over the last decade NASA s propulsion test facilities have performed hundreds of tests, collected thousands of seconds of test data, and exceeded the capabilities of numerous test facility and test article components. A logistic regression mathematical modeling technique has been developed to predict the probability of successfully completing a rocket propulsion test. A logistic regression model is a mathematical modeling approach that can be used to describe the relationship of several independent predictor variables X(sub 1), X(sub 2),.., X(sub k) to a binary or dichotomous dependent variable Y, where Y can only be one of two possible outcomes, in this case Success or Failure of accomplishing a full duration test. The use of logistic regression modeling is not new; however, modeling propulsion ground test facilities using logistic regression is both a new and unique application of the statistical technique. Results from this type of model provide project managers with insight and confidence into the effectiveness of rocket propulsion ground testing.
Fei, Yang; Hu, Jian; Gao, Kun; Tu, Jianfeng; Li, Wei-Qin; Wang, Wei
2017-06-01
To construct a radical basis function (RBF) artificial neural networks (ANNs) model to predict the incidence of acute pancreatitis (AP)-induced portal vein thrombosis. The analysis included 353 patients with AP who had admitted between January 2011 and December 2015. RBF ANNs model and logistic regression model were constructed based on eleven factors relevant to AP respectively. Statistical indexes were used to evaluate the value of the prediction in two models. The predict sensitivity, specificity, positive predictive value, negative predictive value and accuracy by RBF ANNs model for PVT were 73.3%, 91.4%, 68.8%, 93.0% and 87.7%, respectively. There were significant differences between the RBF ANNs and logistic regression models in these parameters (P<0.05). In addition, a comparison of the area under receiver operating characteristic curves of the two models showed a statistically significant difference (P<0.05). The RBF ANNs model is more likely to predict the occurrence of PVT induced by AP than logistic regression model. D-dimer, AMY, Hct and PT were important prediction factors of approval for AP-induced PVT. Copyright © 2017 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Zhou, Yan; Zhou, Yang; Yuan, Kai; Jia, Zhiyu; Li, Shuo
2018-05-01
Aiming at the demonstration of autonomic logistics system to be used at the new generation of aviation materiel in our country, the modeling and simulating method of aviation materiel support effectiveness considering autonomic logistics are studied. Firstly, this paper introduced the idea of JSF autonomic logistics and analyzed the influence of autonomic logistics on support effectiveness from aspects of reliability, false alarm rate, troubleshooting time, and support delay time and maintenance level. On this basis, the paper studies the modeling and simulating methods of support effectiveness considering autonomic logistics, and puts forward the maintenance support simulation process considering autonomic logistics. Finally, taking the typical aviation materiel as an example, this paper analyzes and verifies the above-mentioned support effectiveness modeling and simulating method of aviation materiel considering autonomic logistics.
A simple measure of cognitive reserve is relevant for cognitive performance in MS patients.
Della Corte, Marida; Santangelo, Gabriella; Bisecco, Alvino; Sacco, Rosaria; Siciliano, Mattia; d'Ambrosio, Alessandro; Docimo, Renato; Cuomo, Teresa; Lavorgna, Luigi; Bonavita, Simona; Tedeschi, Gioacchino; Gallo, Antonio
2018-05-04
Cognitive reserve (CR) contributes to preserve cognition despite brain damage. This theory has been applied to multiple sclerosis (MS) to explain the partial relationship between cognition and MRI markers of brain pathology. Our aim was to determine the relationship between two measures of CR and cognition in MS. One hundred and forty-seven MS patients were enrolled. Cognition was assessed using the Rao's Brief Repeatable Battery and the Stroop Test. CR was measured as the vocabulary subtest of the WAIS-R score (VOC) and the number of years of formal education (EDU). Regression analysis included raw score data on each neuropsychological (NP) test as dependent variables and demographic/clinical parameters, VOC, and EDU as independent predictors. A binary logistic regression analysis including clinical/CR parameters as covariates and absence/presence of cognitive deficits as dependent variables was performed too. VOC, but not EDU, was strongly correlated with performances at all ten NP tests. EDU was correlated with executive performances. The binary logistic regression showed that only the Expanded Disability Status Scale (EDSS) and VOC were independently correlated with the presence/absence of CD. The lower the VOC and/or the higher the EDSS, the higher the frequency of CD. In conclusion, our study supports the relevance of CR in subtending cognitive performances and the presence of CD in MS patients.
Dietary consumption patterns and laryngeal cancer risk.
Vlastarakos, Petros V; Vassileiou, Andrianna; Delicha, Evie; Kikidis, Dimitrios; Protopapas, Dimosthenis; Nikolopoulos, Thomas P
2016-06-01
We conducted a case-control study to investigate the effect of diet on laryngeal carcinogenesis. Our study population was made up of 140 participants-70 patients with laryngeal cancer (LC) and 70 controls with a non-neoplastic condition that was unrelated to diet, smoking, or alcohol. A food-frequency questionnaire determined the mean consumption of 113 different items during the 3 years prior to symptom onset. Total energy intake and cooking mode were also noted. The relative risk, odds ratio (OR), and 95% confidence interval (CI) were estimated by multiple logistic regression analysis. We found that the total energy intake was significantly higher in the LC group (p < 0.001), and that the difference remained statistically significant after logistic regression analysis (p < 0.001; OR: 118.70). Notably, meat consumption was higher in the LC group (p < 0.001), and the difference remained significant after logistic regression analysis (p = 0.029; OR: 1.16). LC patients also consumed significantly more fried food (p = 0.036); this difference also remained significant in the logistic regression model (p = 0.026; OR: 5.45). The LC group also consumed significantly more seafood (p = 0.012); the difference persisted after logistic regression analysis (p = 0.009; OR: 2.48), with the consumption of shrimp proving detrimental (p = 0.049; OR: 2.18). Finally, the intake of zinc was significantly higher in the LC group before and after logistic regression analysis (p = 0.034 and p = 0.011; OR: 30.15, respectively). Cereal consumption (including pastas) was also higher among the LC patients (p = 0.043), with logistic regression analysis showing that their negative effect was possibly associated with the sauces and dressings that traditionally accompany pasta dishes (p = 0.006; OR: 4.78). Conversely, a higher consumption of dairy products was found in controls (p < 0.05); logistic regression analysis showed that calcium appeared to be protective at the micronutrient level (p < 0.001; OR: 0.27). We found no difference in the overall consumption of fruits and vegetables between the LC patients and controls; however, the LC patients did have a greater consumption of cooked tomatoes and cooked root vegetables (p = 0.039 for both), and the controls had more consumption of leeks (p = 0.042) and, among controls younger than 65 years, cooked beans (p = 0.037). Lemon (p = 0.037), squeezed fruit juice (p = 0.032), and watermelon (p = 0.018) were also more frequently consumed by the controls. Other differences at the micronutrient level included greater consumption by the LC patients of retinol (p = 0.044), polyunsaturated fats (p = 0.041), and linoleic acid (p = 0.008); LC patients younger than 65 years also had greater intake of riboflavin (p = 0.045). We conclude that the differences in dietary consumption patterns between LC patients and controls indicate a possible role for lifestyle modifications involving nutritional factors as a means of decreasing the risk of laryngeal cancer.
Genital hiatus size is associated with and predictive of apical vaginal support loss.
Lowder, Jerry L; Oliphant, Sallie S; Shepherd, Jonathan P; Ghetti, Chiara; Sutkin, Gary
2016-06-01
Recognition and assessment of apical vaginal support defects remains a significant challenge in the evaluation and management of prolapse. There are several reasons that this is likely: (1) Although the Pelvic Organ Prolapse-Quantification examination is the standard prolapse staging system used in the Female Pelvic Medicine and Reconstructive Surgery field for reporting outcomes, this assessment is not used commonly in clinical care outside the subspecialty; (2) no clinically useful and accepted definition of apical support loss exists, and (3) no consensus or guidelines address the degree of apical support loss at which an apical support procedure should be performed routinely. The purpose of this study was to identify a simple screening measure for significant loss of apical vaginal support. This was an analysis of women with Pelvic Organ Prolapse-Quantification stage 0-IV prolapse. Women with total vaginal length of ≥7 cm were included to define a population with "normal" vaginal length. Univariable and linear regression analyses were used to identify Pelvic Organ Prolapse-Quantification points that were associated with 3 definitions of apical support loss: the International Consultation on Incontinence, the Pelvic Floor Disorders Network revised eCARE, and a Pelvic Organ Prolapse-Quantification point C cut-point developed by Dietz et al. Linear and logistic regression models were created to assess predictors of overall apical support loss according to these definitions. Receiver operator characteristic curves were generated to determine test characteristics of the predictor variables and the areas under the curves were calculated. Of 469 women, 453 women met the inclusion criterion. The median Pelvic Organ Prolapse-Quantification stage was III, and the median leading edge of prolapse was +2 cm (range, -3 to 12 cm). By stage of prolapse (0-IV), mean genital hiatus size (genital hiatus; mid urethra to posterior fourchette) increased: 2.0 ± 0.5, 3.0 ± 0.5, 4.0 ± 1.0, 5.0 ± 1.0, and 6.5 ± 1.5 cm, respectively (P < .01). Pelvic Organ Prolapse-Quantification points B anterior, B posterior, and genital hiatus had moderate-to-strong associations with overall apical support loss and all definitions of apical support loss. Linear regression models that predict overall apical support loss and logistic regression models predict apical support loss as defined by International Continence Society, eCARE, and the point C; cut-point definitions were fit with points B anterior, B posterior, and genital hiatus; these 3 points explained more than one-half of the model variance. Receiver operator characteristic analysis for all definitions of apical support loss found that genital hiatus >3.75 cm was highly predictive of apical support loss (area under the curve, >0.8 in all models). Increasing genital hiatus size is associated highly with and predictive of apical vaginal support loss. Specifically, the Pelvic Organ Prolapse-Quantification measurement genital hiatus of ≥3.75 cm is highly predictive of apical support loss by all study definitions. This simple measurement can be used to screen for apical support loss and the need for further evaluation of apical vaginal support before planning a hysterectomy or prolapse surgery. Copyright © 2015 Elsevier Inc. All rights reserved.
Lee, Bum Ju; Kim, Keun Ho; Ku, Boncho; Jang, Jun-Su; Kim, Jong Yeol
2013-05-01
The body mass index (BMI) provides essential medical information related to body weight for the treatment and prognosis prediction of diseases such as cardiovascular disease, diabetes, and stroke. We propose a method for the prediction of normal, overweight, and obese classes based only on the combination of voice features that are associated with BMI status, independently of weight and height measurements. A total of 1568 subjects were divided into 4 groups according to age and gender differences. We performed statistical analyses by analysis of variance (ANOVA) and Scheffe test to find significant features in each group. We predicted BMI status (normal, overweight, and obese) by a logistic regression algorithm and two ensemble classification algorithms (bagging and random forests) based on statistically significant features. In the Female-2030 group (females aged 20-40 years), classification experiments using an imbalanced (original) data set gave area under the receiver operating characteristic curve (AUC) values of 0.569-0.731 by logistic regression, whereas experiments using a balanced data set gave AUC values of 0.893-0.994 by random forests. AUC values in Female-4050 (females aged 41-60 years), Male-2030 (males aged 20-40 years), and Male-4050 (males aged 41-60 years) groups by logistic regression in imbalanced data were 0.585-0.654, 0.581-0.614, and 0.557-0.653, respectively. AUC values in Female-4050, Male-2030, and Male-4050 groups in balanced data were 0.629-0.893 by bagging, 0.707-0.916 by random forests, and 0.695-0.854 by bagging, respectively. In each group, we found discriminatory features showing statistical differences among normal, overweight, and obese classes. The results showed that the classification models built by logistic regression in imbalanced data were better than those built by the other two algorithms, and significant features differed according to age and gender groups. Our results could support the development of BMI diagnosis tools for real-time monitoring; such tools are considered helpful in improving automated BMI status diagnosis in remote healthcare or telemedicine and are expected to have applications in forensic and medical science. Copyright © 2013 Elsevier B.V. All rights reserved.
Hillen, T.; Schaub, R.; Hiestermann, A.; Kirschner, W.; Robra, B.
2000-01-01
STUDY OBJECTIVE—To compare the health status and factors influencing the health of populations that had previously lived under different political systems. DESIGN—Cross sectional health and social survey using postal interviews. The relation between self reported health and psychosocial factors (stressful life events, social support, education, health promoting life style and health endangering behaviour) was investigated. To determine East-West differences a logistic regression model including interaction terms was fitted. SETTING—East and West Berlin shortly after reunification 1991. PARTICIPANTS—Representative sample of 4430 Berlin residents aged 18 years and over (response rate 63%). RESULTS—Of all respondents, 15.4% rated their health as unsatisfactory. Residents of East Berlin rated their health more frequently as unsatisfactory than residents of West Berlin (Orage adjusted= 1.29, 95%CI 1.08, 1.52), these differences occurred predominantly in the over 60 years age group. Logistic regression showed significant independent effects of stressful life events, social support, education, and health promoting life style on self rated health. The effects of education and health promoting life style were observed to be more pronounced in the western part of Berlin. Old age and female sex showed a stronger association with unsatisfactory health status in the eastern part of Berlin. CONCLUSIONS—For subjects aged over 60 years there was evidence that living in the former East Berlin had an adverse effect on health compared with West Berlin. The impact of education and a health promoting lifestyle on self rated health seemed to be weaker in a former socialist society compared with that of a Western democracy. This study supports an "additive model" rather than a "buffering model" in explaining the effects of psychosocial factors on health. Keywords: self rated health; health inequalities; stress; social support PMID:10890868
A decision support model for investment on P2P lending platform.
Zeng, Xiangxiang; Liu, Li; Leung, Stephen; Du, Jiangze; Wang, Xun; Li, Tao
2017-01-01
Peer-to-peer (P2P) lending, as a novel economic lending model, has triggered new challenges on making effective investment decisions. In a P2P lending platform, one lender can invest N loans and a loan may be accepted by M investors, thus forming a bipartite graph. Basing on the bipartite graph model, we built an iteration computation model to evaluate the unknown loans. To validate the proposed model, we perform extensive experiments on real-world data from the largest American P2P lending marketplace-Prosper. By comparing our experimental results with those obtained by Bayes and Logistic Regression, we show that our computation model can help borrowers select good loans and help lenders make good investment decisions. Experimental results also show that the Logistic classification model is a good complement to our iterative computation model, which motivates us to integrate the two classification models. The experimental results of the hybrid classification model demonstrate that the logistic classification model and our iteration computation model are complementary to each other. We conclude that the hybrid model (i.e., the integration of iterative computation model and Logistic classification model) is more efficient and stable than the individual model alone.
A decision support model for investment on P2P lending platform
Liu, Li; Leung, Stephen; Du, Jiangze; Wang, Xun; Li, Tao
2017-01-01
Peer-to-peer (P2P) lending, as a novel economic lending model, has triggered new challenges on making effective investment decisions. In a P2P lending platform, one lender can invest N loans and a loan may be accepted by M investors, thus forming a bipartite graph. Basing on the bipartite graph model, we built an iteration computation model to evaluate the unknown loans. To validate the proposed model, we perform extensive experiments on real-world data from the largest American P2P lending marketplace—Prosper. By comparing our experimental results with those obtained by Bayes and Logistic Regression, we show that our computation model can help borrowers select good loans and help lenders make good investment decisions. Experimental results also show that the Logistic classification model is a good complement to our iterative computation model, which motivates us to integrate the two classification models. The experimental results of the hybrid classification model demonstrate that the logistic classification model and our iteration computation model are complementary to each other. We conclude that the hybrid model (i.e., the integration of iterative computation model and Logistic classification model) is more efficient and stable than the individual model alone. PMID:28877234
ERIC Educational Resources Information Center
Guler, Nese; Penfield, Randall D.
2009-01-01
In this study, we investigate the logistic regression (LR), Mantel-Haenszel (MH), and Breslow-Day (BD) procedures for the simultaneous detection of both uniform and nonuniform differential item functioning (DIF). A simulation study was used to assess and compare the Type I error rate and power of a combined decision rule (CDR), which assesses DIF…
ERIC Educational Resources Information Center
Le, Huy; Marcus, Justin
2012-01-01
This study used Monte Carlo simulation to examine the properties of the overall odds ratio (OOR), which was recently introduced as an index for overall effect size in multiple logistic regression. It was found that the OOR was relatively independent of study base rate and performed better than most commonly used R-square analogs in indexing model…
Predicting Student Success on the Texas Chemistry STAAR Test: A Logistic Regression Analysis
ERIC Educational Resources Information Center
Johnson, William L.; Johnson, Annabel M.; Johnson, Jared
2012-01-01
Background: The context is the new Texas STAAR end-of-course testing program. Purpose: The authors developed a logistic regression model to predict who would pass-or-fail the new Texas chemistry STAAR end-of-course exam. Setting: Robert E. Lee High School (5A) with an enrollment of 2700 students, Tyler, Texas. Date of the study was the 2011-2012…
Susan L. King
2003-01-01
The performance of two classifiers, logistic regression and neural networks, are compared for modeling noncatastrophic individual tree mortality for 21 species of trees in West Virginia. The output of the classifier is usually a continuous number between 0 and 1. A threshold is selected between 0 and 1 and all of the trees below the threshold are classified as...
Logistic regression trees for initial selection of interesting loci in case-control studies
Nickolov, Radoslav Z; Milanov, Valentin B
2007-01-01
Modern genetic epidemiology faces the challenge of dealing with hundreds of thousands of genetic markers. The selection of a small initial subset of interesting markers for further investigation can greatly facilitate genetic studies. In this contribution we suggest the use of a logistic regression tree algorithm known as logistic tree with unbiased selection. Using the simulated data provided for Genetic Analysis Workshop 15, we show how this algorithm, with incorporation of multifactor dimensionality reduction method, can reduce an initial large pool of markers to a small set that includes the interesting markers with high probability. PMID:18466557
Rupert, Michael G.; Cannon, Susan H.; Gartner, Joseph E.; Michael, John A.; Helsel, Dennis R.
2008-01-01
Logistic regression was used to develop statistical models that can be used to predict the probability of debris flows in areas recently burned by wildfires by using data from 14 wildfires that burned in southern California during 2003-2006. Twenty-eight independent variables describing the basin morphology, burn severity, rainfall, and soil properties of 306 drainage basins located within those burned areas were evaluated. The models were developed as follows: (1) Basins that did and did not produce debris flows soon after the 2003 to 2006 fires were delineated from data in the National Elevation Dataset using a geographic information system; (2) Data describing the basin morphology, burn severity, rainfall, and soil properties were compiled for each basin. These data were then input to a statistics software package for analysis using logistic regression; and (3) Relations between the occurrence or absence of debris flows and the basin morphology, burn severity, rainfall, and soil properties were evaluated, and five multivariate logistic regression models were constructed. All possible combinations of independent variables were evaluated to determine which combinations produced the most effective models, and the multivariate models that best predicted the occurrence of debris flows were identified. Percentage of high burn severity and 3-hour peak rainfall intensity were significant variables in all models. Soil organic matter content and soil clay content were significant variables in all models except Model 5. Soil slope was a significant variable in all models except Model 4. The most suitable model can be selected from these five models on the basis of the availability of independent variables in the particular area of interest and field checking of probability maps. The multivariate logistic regression models can be entered into a geographic information system, and maps showing the probability of debris flows can be constructed in recently burned areas of southern California. This study demonstrates that logistic regression is a valuable tool for developing models that predict the probability of debris flows occurring in recently burned landscapes.
Hein, R; Abbas, S; Seibold, P; Salazar, R; Flesch-Janys, D; Chang-Claude, J
2012-01-01
Menopausal hormone therapy (MHT) is associated with an increased breast cancer risk in postmenopausal women, with combined estrogen-progestagen therapy posing a greater risk than estrogen monotherapy. However, few studies focused on potential effect modification of MHT-associated breast cancer risk by genetic polymorphisms in the progesterone metabolism. We assessed effect modification of MHT use by five coding single nucleotide polymorphisms (SNPs) in the progesterone metabolizing enzymes AKR1C3 (rs7741), AKR1C4 (rs3829125, rs17134592), and SRD5A1 (rs248793, rs3736316) using a two-center population-based case-control study from Germany with 2,502 postmenopausal breast cancer patients and 4,833 matched controls. An empirical-Bayes procedure that tests for interaction using a weighted combination of the prospective and the retrospective case-control estimators as well as standard prospective logistic regression were applied to assess multiplicative statistical interaction between polymorphisms and duration of MHT use with regard to breast cancer risk assuming a log-additive mode of inheritance. No genetic marginal effects were observed. Breast cancer risk associated with duration of combined therapy was significantly modified by SRD5A1_rs3736316, showing a reduced risk elevation in carriers of the minor allele (p (interaction,empirical-Bayes) = 0.006 using the empirical-Bayes method, p (interaction,logistic regression) = 0.013 using logistic regression). The risk associated with duration of use of monotherapy was increased by AKR1C3_rs7741 in minor allele carriers (p (interaction,empirical-Bayes) = 0.083, p (interaction,logistic regression) = 0.029) and decreased in minor allele carriers of two SNPs in AKR1C4 (rs3829125: p (interaction,empirical-Bayes) = 0.07, p (interaction,logistic regression) = 0.021; rs17134592: p (interaction,empirical-Bayes) = 0.101, p (interaction,logistic regression) = 0.038). After Bonferroni correction for multiple testing only SRD5A1_rs3736316 assessed using the empirical-Bayes method remained significant. Postmenopausal breast cancer risk associated with combined therapy may be modified by genetic variation in SRD5A1. Further well-powered studies are, however, required to replicate our finding.
The Relationship Between Emotional Support and Health-Related Self-Efficacy in Older Prisoners.
Noujaim, Deborah; Fortinsky, Richard H; Barry, Lisa C
2017-09-01
To determine whether emotional support, and proportion of emotional support provided by specific sources (e.g., family, other prisoners, clinicians), is associated with health-related self-efficacy among older prisoners. Cross-sectional study of 140 older prisoners age ≥50 with chronic medical illness who completed face-to-face interviews. Logistic regression, controlling for demographic, incarceration, and clinical/behavioral factors evaluated the association between emotional support, operationalized as a score and as a proportion of total emotional support from specific sources, and health-related self-efficacy. Higher emotional support scores, and greater proportion of support from clinicians, were associated with lower likelihood of poor health-related self-efficacy. Those with >50% of their emotional support coming from other prisoners had higher likelihood of poor self-efficacy. Among older prisoners with chronic illness, higher emotional support, particularly from clinicians, is associated with lower likelihood of poor self-efficacy; relying on other prisoners for emotional support is associated with poor health-related self-efficacy.
Brody, Gene H; Yu, Tianyi; Chen, Edith; Miller, Gregory E
2017-07-01
Individuals exposed to adverse childhood experiences (ACEs) are vulnerable to various health problems later in life. This study was designed to determine whether participation in an efficacious program to enhance supportive parenting would ameliorate the association between ACEs and prediabetes status at age 25. Rural African American parents and their 11-year-old children (N=390) participated in the Strong African American Families (SAAF) program or a control condition. Each youth at age 25 provided a total ACEs score and a blood sample from which overnight fasting glucose was assayed. Logistic regression equations were used to test the hypotheses. The logistic regression analyses revealed a significant interaction between total ACEs and random assignment to SAAF or control, OR=0.56, 95% CI [0.36, 0.88]. Follow-up analyses indicated that, for participants in the control condition, a 1-point increase in ACEs was associated with a 37.3% increase in risk of having prediabetes. ACEs were not associated with the likelihood of having prediabetes among participants in the SAAF condition. Control participants with high total ACEs scores were 3.54 times more likely to have prediabetes than were SAAF participants with similar scores. This study indicated that participation at age 11 in a randomized controlled trial designed to enhance supportive parenting ameliorated the association of ACEs with prediabetes at age 25. If substantiated, these findings may provide a strategy for preventing negative health consequences of ACEs. Copyright © 2017 Elsevier Inc. All rights reserved.
Císař, Petr; Labbé, Laurent; Souček, Pavel; Pelissier, Pablo; Kerneis, Thierry
2018-01-01
The main aim of this study was to develop a new objective method for evaluating the impacts of different diets on the live fish skin using image-based features. In total, one-hundred and sixty rainbow trout (Oncorhynchus mykiss) were fed either a fish-meal based diet (80 fish) or a 100% plant-based diet (80 fish) and photographed using consumer-grade digital camera. Twenty-three colour features and four texture features were extracted. Four different classification methods were used to evaluate fish diets including Random forest (RF), Support vector machine (SVM), Logistic regression (LR) and k-Nearest neighbours (k-NN). The SVM with radial based kernel provided the best classifier with correct classification rate (CCR) of 82% and Kappa coefficient of 0.65. Although the both LR and RF methods were less accurate than SVM, they achieved good classification with CCR 75% and 70% respectively. The k-NN was the least accurate (40%) classification model. Overall, it can be concluded that consumer-grade digital cameras could be employed as the fast, accurate and non-invasive sensor for classifying rainbow trout based on their diets. Furthermore, these was a close association between image-based features and fish diet received during cultivation. These procedures can be used as non-invasive, accurate and precise approaches for monitoring fish status during the cultivation by evaluating diet’s effects on fish skin. PMID:29596375
Saberioon, Mohammadmehdi; Císař, Petr; Labbé, Laurent; Souček, Pavel; Pelissier, Pablo; Kerneis, Thierry
2018-03-29
The main aim of this study was to develop a new objective method for evaluating the impacts of different diets on the live fish skin using image-based features. In total, one-hundred and sixty rainbow trout ( Oncorhynchus mykiss ) were fed either a fish-meal based diet (80 fish) or a 100% plant-based diet (80 fish) and photographed using consumer-grade digital camera. Twenty-three colour features and four texture features were extracted. Four different classification methods were used to evaluate fish diets including Random forest (RF), Support vector machine (SVM), Logistic regression (LR) and k -Nearest neighbours ( k -NN). The SVM with radial based kernel provided the best classifier with correct classification rate (CCR) of 82% and Kappa coefficient of 0.65. Although the both LR and RF methods were less accurate than SVM, they achieved good classification with CCR 75% and 70% respectively. The k -NN was the least accurate (40%) classification model. Overall, it can be concluded that consumer-grade digital cameras could be employed as the fast, accurate and non-invasive sensor for classifying rainbow trout based on their diets. Furthermore, these was a close association between image-based features and fish diet received during cultivation. These procedures can be used as non-invasive, accurate and precise approaches for monitoring fish status during the cultivation by evaluating diet's effects on fish skin.
Kim, Dong Wook; Kim, Hwiyoung; Nam, Woong; Kim, Hyung Jun; Cha, In-Ho
2018-04-23
The aim of this study was to build and validate five types of machine learning models that can predict the occurrence of BRONJ associated with dental extraction in patients taking bisphosphonates for the management of osteoporosis. A retrospective review of the medical records was conducted to obtain cases and controls for the study. Total 125 patients consisting of 41 cases and 84 controls were selected for the study. Five machine learning prediction algorithms including multivariable logistic regression model, decision tree, support vector machine, artificial neural network, and random forest were implemented. The outputs of these models were compared with each other and also with conventional methods, such as serum CTX level. Area under the receiver operating characteristic (ROC) curve (AUC) was used to compare the results. The performance of machine learning models was significantly superior to conventional statistical methods and single predictors. The random forest model yielded the best performance (AUC = 0.973), followed by artificial neural network (AUC = 0.915), support vector machine (AUC = 0.882), logistic regression (AUC = 0.844), decision tree (AUC = 0.821), drug holiday alone (AUC = 0.810), and CTX level alone (AUC = 0.630). Machine learning methods showed superior performance in predicting BRONJ associated with dental extraction compared to conventional statistical methods using drug holiday and serum CTX level. Machine learning can thus be applied in a wide range of clinical studies. Copyright © 2017. Published by Elsevier Inc.
Applications of statistics to medical science, III. Correlation and regression.
Watanabe, Hiroshi
2012-01-01
In this third part of a series surveying medical statistics, the concepts of correlation and regression are reviewed. In particular, methods of linear regression and logistic regression are discussed. Arguments related to survival analysis will be made in a subsequent paper.
Schell, Greggory J; Lavieri, Mariel S; Stein, Joshua D; Musch, David C
2013-12-21
Open-angle glaucoma (OAG) is a prevalent, degenerate ocular disease which can lead to blindness without proper clinical management. The tests used to assess disease progression are susceptible to process and measurement noise. The aim of this study was to develop a methodology which accounts for the inherent noise in the data and improve significant disease progression identification. Longitudinal observations from the Collaborative Initial Glaucoma Treatment Study (CIGTS) were used to parameterize and validate a Kalman filter model and logistic regression function. The Kalman filter estimates the true value of biomarkers associated with OAG and forecasts future values of these variables. We develop two logistic regression models via generalized estimating equations (GEE) for calculating the probability of experiencing significant OAG progression: one model based on the raw measurements from CIGTS and another model based on the Kalman filter estimates of the CIGTS data. Receiver operating characteristic (ROC) curves and associated area under the ROC curve (AUC) estimates are calculated using cross-fold validation. The logistic regression model developed using Kalman filter estimates as data input achieves higher sensitivity and specificity than the model developed using raw measurements. The mean AUC for the Kalman filter-based model is 0.961 while the mean AUC for the raw measurements model is 0.889. Hence, using the probability function generated via Kalman filter estimates and GEE for logistic regression, we are able to more accurately classify patients and instances as experiencing significant OAG progression. A Kalman filter approach for estimating the true value of OAG biomarkers resulted in data input which improved the accuracy of a logistic regression classification model compared to a model using raw measurements as input. This methodology accounts for process and measurement noise to enable improved discrimination between progression and nonprogression in chronic diseases.
Computing group cardinality constraint solutions for logistic regression problems.
Zhang, Yong; Kwon, Dongjin; Pohl, Kilian M
2017-01-01
We derive an algorithm to directly solve logistic regression based on cardinality constraint, group sparsity and use it to classify intra-subject MRI sequences (e.g. cine MRIs) of healthy from diseased subjects. Group cardinality constraint models are often applied to medical images in order to avoid overfitting of the classifier to the training data. Solutions within these models are generally determined by relaxing the cardinality constraint to a weighted feature selection scheme. However, these solutions relate to the original sparse problem only under specific assumptions, which generally do not hold for medical image applications. In addition, inferring clinical meaning from features weighted by a classifier is an ongoing topic of discussion. Avoiding weighing features, we propose to directly solve the group cardinality constraint logistic regression problem by generalizing the Penalty Decomposition method. To do so, we assume that an intra-subject series of images represents repeated samples of the same disease patterns. We model this assumption by combining series of measurements created by a feature across time into a single group. Our algorithm then derives a solution within that model by decoupling the minimization of the logistic regression function from enforcing the group sparsity constraint. The minimum to the smooth and convex logistic regression problem is determined via gradient descent while we derive a closed form solution for finding a sparse approximation of that minimum. We apply our method to cine MRI of 38 healthy controls and 44 adult patients that received reconstructive surgery of Tetralogy of Fallot (TOF) during infancy. Our method correctly identifies regions impacted by TOF and generally obtains statistically significant higher classification accuracy than alternative solutions to this model, i.e., ones relaxing group cardinality constraints. Copyright © 2016 Elsevier B.V. All rights reserved.
Ren, Yilong; Wang, Yunpeng; Wu, Xinkai; Yu, Guizhen; Ding, Chuan
2016-10-01
Red light running (RLR) has become a major safety concern at signalized intersection. To prevent RLR related crashes, it is critical to identify the factors that significantly impact the drivers' behaviors of RLR, and to predict potential RLR in real time. In this research, 9-month's RLR events extracted from high-resolution traffic data collected by loop detectors from three signalized intersections were applied to identify the factors that significantly affect RLR behaviors. The data analysis indicated that occupancy time, time gap, used yellow time, time left to yellow start, whether the preceding vehicle runs through the intersection during yellow, and whether there is a vehicle passing through the intersection on the adjacent lane were significantly factors for RLR behaviors. Furthermore, due to the rare events nature of RLR, a modified rare events logistic regression model was developed for RLR prediction. The rare events logistic regression method has been applied in many fields for rare events studies and shows impressive performance, but so far none of previous research has applied this method to study RLR. The results showed that the rare events logistic regression model performed significantly better than the standard logistic regression model. More importantly, the proposed RLR prediction method is purely based on loop detector data collected from a single advance loop detector located 400 feet away from stop-bar. This brings great potential for future field applications of the proposed method since loops have been widely implemented in many intersections and can collect data in real time. This research is expected to contribute to the improvement of intersection safety significantly. Copyright © 2016 Elsevier Ltd. All rights reserved.
Engoren, Milo; Habib, Robert H; Dooner, John J; Schwann, Thomas A
2013-08-01
As many as 14 % of patients undergoing coronary artery bypass surgery are readmitted within 30 days. Readmission is usually the result of morbidity and may lead to death. The purpose of this study is to develop and compare statistical and genetic programming models to predict readmission. Patients were divided into separate Construction and Validation populations. Using 88 variables, logistic regression, genetic programs, and artificial neural nets were used to develop predictive models. Models were first constructed and tested on the Construction populations, then validated on the Validation population. Areas under the receiver operator characteristic curves (AU ROC) were used to compare the models. Two hundred and two patients (7.6 %) in the 2,644 patient Construction group and 216 (8.0 %) of the 2,711 patient Validation group were re-admitted within 30 days of CABG surgery. Logistic regression predicted readmission with AU ROC = .675 ± .021 in the Construction group. Genetic programs significantly improved the accuracy, AU ROC = .767 ± .001, p < .001). Artificial neural nets were less accurate with AU ROC = 0.597 ± .001 in the Construction group. Predictive accuracy of all three techniques fell in the Validation group. However, the accuracy of genetic programming (AU ROC = .654 ± .001) was still trivially but statistically non-significantly better than that of the logistic regression (AU ROC = .644 ± .020, p = .61). Genetic programming and logistic regression provide alternative methods to predict readmission that are similarly accurate.
Eken, Cenker; Bilge, Ugur; Kartal, Mutlu; Eray, Oktay
2009-06-03
Logistic regression is the most common statistical model for processing multivariate data in the medical literature. Artificial intelligence models like an artificial neural network (ANN) and genetic algorithm (GA) may also be useful to interpret medical data. The purpose of this study was to perform artificial intelligence models on a medical data sheet and compare to logistic regression. ANN, GA, and logistic regression analysis were carried out on a data sheet of a previously published article regarding patients presenting to an emergency department with flank pain suspicious for renal colic. The study population was composed of 227 patients: 176 patients had a diagnosis of urinary stone, while 51 ultimately had no calculus. The GA found two decision rules in predicting urinary stones. Rule 1 consisted of being male, pain not spreading to back, and no fever. In rule 2, pelvicaliceal dilatation on bedside ultrasonography replaced no fever. ANN, GA rule 1, GA rule 2, and logistic regression had a sensitivity of 94.9, 67.6, 56.8, and 95.5%, a specificity of 78.4, 76.47, 86.3, and 47.1%, a positive likelihood ratio of 4.4, 2.9, 4.1, and 1.8, and a negative likelihood ratio of 0.06, 0.42, 0.5, and 0.09, respectively. The area under the curve was found to be 0.867, 0.720, 0.715, and 0.713 for all applications, respectively. Data mining techniques such as ANN and GA can be used for predicting renal colic in emergency settings and to constitute clinical decision rules. They may be an alternative to conventional multivariate analysis applications used in biostatistics.
NASA Astrophysics Data System (ADS)
Duman, T. Y.; Can, T.; Gokceoglu, C.; Nefeslioglu, H. A.; Sonmez, H.
2006-11-01
As a result of industrialization, throughout the world, cities have been growing rapidly for the last century. One typical example of these growing cities is Istanbul, the population of which is over 10 million. Due to rapid urbanization, new areas suitable for settlement and engineering structures are necessary. The Cekmece area located west of the Istanbul metropolitan area is studied, because the landslide activity is extensive in this area. The purpose of this study is to develop a model that can be used to characterize landslide susceptibility in map form using logistic regression analysis of an extensive landslide database. A database of landslide activity was constructed using both aerial-photography and field studies. About 19.2% of the selected study area is covered by deep-seated landslides. The landslides that occur in the area are primarily located in sandstones with interbedded permeable and impermeable layers such as claystone, siltstone and mudstone. About 31.95% of the total landslide area is located at this unit. To apply logistic regression analyses, a data matrix including 37 variables was constructed. The variables used in the forwards stepwise analyses are different measures of slope, aspect, elevation, stream power index (SPI), plan curvature, profile curvature, geology, geomorphology and relative permeability of lithological units. A total of 25 variables were identified as exerting strong influence on landslide occurrence, and included by the logistic regression equation. Wald statistics values indicate that lithology, SPI and slope are more important than the other parameters in the equation. Beta coefficients of the 25 variables included the logistic regression equation provide a model for landslide susceptibility in the Cekmece area. This model is used to generate a landslide susceptibility map that correctly classified 83.8% of the landslide-prone areas.
NASA Astrophysics Data System (ADS)
Thompson, E. David; Bowling, Bethany V.; Markle, Ross E.
2018-02-01
Studies over the last 30 years have considered various factors related to student success in introductory biology courses. While much of the available literature suggests that the best predictors of success in a college course are prior college grade point average (GPA) and class attendance, faculty often require a valuable predictor of success in those courses wherein the majority of students are in the first semester and have no previous record of college GPA or attendance. In this study, we evaluated the efficacy of the ACT Mathematics subject exam and Lawson's Classroom Test of Scientific Reasoning in predicting success in a major's introductory biology course. A logistic regression was utilized to determine the effectiveness of a combination of scientific reasoning (SR) scores and ACT math (ACT-M) scores to predict student success. In summary, we found that the model—with both SR and ACT-M as significant predictors—could be an effective predictor of student success and thus could potentially be useful in practical decision making for the course, such as directing students to support services at an early point in the semester.
2018-01-01
Background Many studies have tried to develop predictors for return-to-work (RTW). However, since complex factors have been demonstrated to predict RTW, it is difficult to use them practically. This study investigated whether factors used in previous studies could predict whether an individual had returned to his/her original work by four years after termination of the worker's recovery period. Methods An initial logistic regression analysis of 1,567 participants of the fourth Panel Study of Worker's Compensation Insurance yielded odds ratios. The participants were divided into two subsets, a training dataset and a test dataset. Using the training dataset, logistic regression, decision tree, random forest, and support vector machine models were established, and important variables of each model were identified. The predictive abilities of the different models were compared. Results The analysis showed that only earned income and company-related factors significantly affected return-to-original-work (RTOW). The random forest model showed the best accuracy among the tested machine learning models; however, the difference was not prominent. Conclusion It is possible to predict a worker's probability of RTOW using machine learning techniques with moderate accuracy. PMID:29736160
Relation between serum creatinine and postoperative results of open-heart surgery.
Ezeldin, Tamer H
2013-10-01
To determine the impact of preoperative serum creatinine level in non-dialyzable patients on postoperative morbidity and mortality. This is a prospective study, where serum creatinine was used to give primary assessment on renal function status preoperatively. This study includes 1,033 patients, who underwent coronary artery bypass grafting, or valve(s) operations. The study took place at Al-Hada Military Hospital, Taif, Kingdom of Saudi between May 2008 and January 2012. Data were statistically analyzed using Chi square (x2) test and multivariable logistic regression, to evaluate the postoperative morbidity and mortality risks associated with low serum creatinine levels. Postoperative mortality increased with high serum creatinine level >1.8 mg/dL (p=0.0005). Multivariable logistic regression, adjusting for potentially confounding variables demonstrated that a creatinine level of more than 1.8 mg/dL was associated with increased risk of re-operation for bleeding, postoperative renal failure, prolonged ventilatory support, ICU stay, and total hospital stay. Perioperative serum creatinine is strongly related to post operative morbidity and mortality in open heart surgery. High serum creatinine in non-dialyzable patients can predict the increased morbidity and mortality after cardiac operations.
Musculoskeletal disorders among workers in plastic manufacturing plants.
Fernandes, Rita de Cássia Pereira; Assunção, Ada Avila; Silvany Neto, Annibal Muniz; Carvalho, Fernando Martins
2010-03-01
Epidemiological studies have indicated an association between musculoskeletal disorders (MSDs) and physical work demands. Psychosocial work demands have also been identified as possible risk factors, but findings have been inconsistent. To evaluate factors associated with upper back, neck and upper limb MSD among workers from 14 plastic manufacturing companies located in the city of Salvador, Brazil. A cross-sectional study design was used to survey a stratified proportional random sample of 577 workers. Data were collected by questionnaire interviews. Factor analysis was carried out on 11 physical demands variables. Psychosocial work demands were measured by demand, control and social support questions. The role of socio-demographic factors, lifestyle and household tasks was also examined. Multiple logistic regression was used to identify factors related to upper back, neck and upper limb MSDs. Results from multiple logistic regression showed that distal upper limb MSDs were related to manual handling, work repetitiveness, psychosocial demands, job dissatisfaction, and gender. Neck, shoulder or upper back MSDs were related to manual handling, work repetitiveness, psychosocial demands, job dissatisfaction, and physical unfitness. Reducing the prevalence of musculoskeletal disorders requires: improving the work environment, reducing biomechanical risk factors, and replanning work organization. Programs must also be aware of gender specificities related to MSDs.
Multivariate prediction of upper limb prosthesis acceptance or rejection.
Biddiss, Elaine A; Chau, Tom T
2008-07-01
To develop a model for prediction of upper limb prosthesis use or rejection. A questionnaire exploring factors in prosthesis acceptance was distributed internationally to individuals with upper limb absence through community-based support groups and rehabilitation hospitals. A total of 191 participants (59 prosthesis rejecters and 132 prosthesis wearers) were included in this study. A logistic regression model, a C5.0 decision tree, and a radial basis function neural network were developed and compared in terms of sensitivity (prediction of prosthesis rejecters), specificity (prediction of prosthesis wearers), and overall cross-validation accuracy. The logistic regression and neural network provided comparable overall accuracies of approximately 84 +/- 3%, specificity of 93%, and sensitivity of 61%. Fitting time-frame emerged as the predominant predictor. Individuals fitted within two years of birth (congenital) or six months of amputation (acquired) were 16 times more likely to continue prosthesis use. To increase rates of prosthesis acceptance, clinical directives should focus on timely, client-centred fitting strategies and the development of improved prostheses and healthcare for individuals with high-level or bilateral limb absence. Multivariate analyses are useful in determining the relative importance of the many factors involved in prosthesis acceptance and rejection.
Barriers to health-care and psychological distress among mothers living with HIV in Quebec (Canada).
Blais, Martin; Fernet, Mylène; Proulx-Boucher, Karène; Lebouché, Bertrand; Rodrigue, Carl; Lapointe, Normand; Otis, Joanne; Samson, Johanne
2015-01-01
Health-care providers play a major role in providing good quality care and in preventing psychological distress among mothers living with HIV (MLHIV). The objectives of this study are to explore the impact of health-care services and satisfaction with care providers on psychological distress in MLHIV. One hundred MLHIV were recruited from community and clinical settings in the province of Quebec (Canada). Prevalence estimation of clinical psychological distress and univariate and multivariable logistic regression models were performed to predict clinical psychological distress. Forty-five percent of the participants reported clinical psychological distress. In the multivariable regression, the following variables were significantly associated with psychological distress while controlling for sociodemographic variables: resilience, quality of communication with the care providers, resources, and HIV disclosure concerns. The multivariate results support the key role of personal, structural, and medical resources in understanding psychological distress among MLHIV. Interventions that can support the psychological health of MLHIV are discussed.
Evolahti, Annika; Hultcrantz, Malou; Collins, Aila
2006-11-01
The aim of the present study was to investigate whether there is an association between serum cortisol and work-related stress, as defined by the demand-control model in a longitudinal design. One hundred ten women aged 47-53 years completed a health questionnaire, including the Swedish version of the Job Content Scale, and participated in a psychological interview at baseline and in a follow-up session 2 years later. Morning blood samples were drawn for analyses of cortisol. Multiple stepwise regression analyses and logistic regression analyses showed that work demands and lack of social support were significantly associated with cortisol. The results of this study showed that negative work characteristics in terms of high demands and low social support contributed significantly to the biological stress levels in middle-aged women. Participation in the study may have served as an intervention, increasing the women's awareness and thus improving their health profiles on follow-up.
Adams, Richard E.; Urosevich, Thomas G.; Hoffman, Stuart N.; Kirchner, H. Lester; Hyacinthe, Johanna C.; Figley, Charles R.; Boscarino, Joseph J.; Boscarino, Joseph A.
2017-01-01
Using a stress process model, the authors examined social and psychological resources to better understand mental health outcomes among veterans. For this study, we surveyed 700 U.S. veterans who were outpatients in the Geisinger Health System. Independent variables included demographic factors, stressful and traumatic events, social support measures, and psychosocial factors. Using logistic regression, the authors examined 4 types of social connections: social support, help-seeking support, social capital, and other mental health support to predict mental health outcomes, including posttraumatic stress disorder, depression, suicide ideation, alcohol misuse, mental health service use, and Veterans Affairs service use. Results suggested that help-seeking support since deployment was a risk factor for 5 adverse outcomes, whereas social support was protective for 1 outcome. We concluded that high levels of help-seeking support since deployment among veterans was associated with a higher prevalence of mental health problems. These findings were unexpected and suggest the need for additional social support-related research among veterans. PMID:29098116
Qadir, Farah; Khalid, Amna; Medhin, Girmay
2015-01-01
This study aimed to identify prevalence rates of psychological distress among Pakistani women seeking help for primary infertility. The associations of social support, marital adjustment, and sociodemographic factors with psychological distress were also examined. A total of 177 women with primary infertility were interviewed from one hospital in Islamabad using a Self-Reporting Questionnaire, the Multidimensional Scale of Perceived Social Support, and the Locke-Wallace Marital Adjustment Test. The data were collected between November 2012 and March 2013. The prevalence of psychological distress was 37.3 percent. The results of the logistic regression suggested that marital adjustment and social support were significantly negatively associated with psychological distress in this sample. These associations were not confounded by any of the demographic variables controlled in the multivariable regression models. The role of perceived social support and adjustment in marriage among women experiencing primary infertility are important factors in understanding their psychological distress. The results of this small-scale effort highlight the need for social and familial awareness to help tackle the psychological distress related to infertility. Future research needs to focus on the way the experience of infertility is conditioned by social structural realities. New ways need to be developed to better take into account the process and nature of the infertility experience.
Extended family and friendship support and suicidality among African Americans.
Nguyen, Ann W; Taylor, Robert Joseph; Chatters, Linda M; Taylor, Harry Owen; Lincoln, Karen D; Mitchell, Uchechi A
2017-03-01
This study examined the relationship between informal social support from extended family and friends and suicidality among African Americans. Logistic regression analysis was based on a nationally representative sample of African Americans from the National Survey of American Life (N = 3263). Subjective closeness and frequency of contact with extended family and friends and negative family interaction were examined in relation to lifetime suicide ideation and attempts. Subjective closeness to family and frequency of contact with friends were negatively associated with suicide ideation and attempts. Subjective closeness to friends and negative family interaction were positively associated with suicide ideation and attempts. Significant interactions between social support and negative interaction showed that social support buffers against the harmful effects of negative interaction on suicidality. Findings are discussed in relation to the functions of positive and negative social ties in suicidality.
Development of a web service for analysis in a distributed network.
Jiang, Xiaoqian; Wu, Yuan; Marsolo, Keith; Ohno-Machado, Lucila
2014-01-01
We describe functional specifications and practicalities in the software development process for a web service that allows the construction of the multivariate logistic regression model, Grid Logistic Regression (GLORE), by aggregating partial estimates from distributed sites, with no exchange of patient-level data. We recently developed and published a web service for model construction and data analysis in a distributed environment. This recent paper provided an overview of the system that is useful for users, but included very few details that are relevant for biomedical informatics developers or network security personnel who may be interested in implementing this or similar systems. We focus here on how the system was conceived and implemented. We followed a two-stage development approach by first implementing the backbone system and incrementally improving the user experience through interactions with potential users during the development. Our system went through various stages such as concept proof, algorithm validation, user interface development, and system testing. We used the Zoho Project management system to track tasks and milestones. We leveraged Google Code and Apache Subversion to share code among team members, and developed an applet-servlet architecture to support the cross platform deployment. During the development process, we encountered challenges such as Information Technology (IT) infrastructure gaps and limited team experience in user-interface design. We figured out solutions as well as enabling factors to support the translation of an innovative privacy-preserving, distributed modeling technology into a working prototype. Using GLORE (a distributed model that we developed earlier) as a pilot example, we demonstrated the feasibility of building and integrating distributed modeling technology into a usable framework that can support privacy-preserving, distributed data analysis among researchers at geographically dispersed institutes.
Development of a Web Service for Analysis in a Distributed Network
Jiang, Xiaoqian; Wu, Yuan; Marsolo, Keith; Ohno-Machado, Lucila
2014-01-01
Objective: We describe functional specifications and practicalities in the software development process for a web service that allows the construction of the multivariate logistic regression model, Grid Logistic Regression (GLORE), by aggregating partial estimates from distributed sites, with no exchange of patient-level data. Background: We recently developed and published a web service for model construction and data analysis in a distributed environment. This recent paper provided an overview of the system that is useful for users, but included very few details that are relevant for biomedical informatics developers or network security personnel who may be interested in implementing this or similar systems. We focus here on how the system was conceived and implemented. Methods: We followed a two-stage development approach by first implementing the backbone system and incrementally improving the user experience through interactions with potential users during the development. Our system went through various stages such as concept proof, algorithm validation, user interface development, and system testing. We used the Zoho Project management system to track tasks and milestones. We leveraged Google Code and Apache Subversion to share code among team members, and developed an applet-servlet architecture to support the cross platform deployment. Discussion: During the development process, we encountered challenges such as Information Technology (IT) infrastructure gaps and limited team experience in user-interface design. We figured out solutions as well as enabling factors to support the translation of an innovative privacy-preserving, distributed modeling technology into a working prototype. Conclusion: Using GLORE (a distributed model that we developed earlier) as a pilot example, we demonstrated the feasibility of building and integrating distributed modeling technology into a usable framework that can support privacy-preserving, distributed data analysis among researchers at geographically dispersed institutes. PMID:25848586
Preparing patients with cancer who work and treatment responsiveness.
Kamau, Caroline
2017-03-01
Many patients with life-limiting illnesses continue to work because of financial reasons and because work provides good psychosocial support. A lack of appropriate advice/support through patient education could, however, make having a job detrimental to well-being (eg, symptom worsening). This study investigated the frequency with which patients received information that empowers their understanding of their condition, treatment, side effects of treatment and the likely impact on occupational functioning. A cross-sectional study. An analysis of survey data from 3457 patients with cancer in employment. Logistic regression showed that patients who received information about the impact of cancer on work life or education are 1.72 times more likely to have a positive treatment outcome. Patients who receive written information about the type of cancer are 1.99 times more likely to have a positive treatment outcome. Also, patients who receive written information before a cancer-related operation are 1.90 times more likely to have a positive treatment outcome. Information about the side effects of cancer treatment produces worse odds of a positive treatment outcome (0.65-1). A stepwise logistic regression analysing the effects irrespective of current employment status in 6710 patients showed that preparing them produces nearly twice better odds of cancer treatment responsiveness. Palliative care teams should consider ways of actively advising patients who work. Whereas the results showed evidence of good practice in cancer care, there is a need to ensure that all working patients with potentially life-limiting illnesses receive similar support. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
Kashiwagi, Masayo; Tamiya, Nanako; Murata, Masako
2015-08-01
The purpose of the present study was to identify characteristics of visiting nurse agencies (VNA) in Japan with high home death rates by a prefecture-wide survey. A cross-sectional study of visiting nurse agencies (n = 101) in Ibaraki Prefecture, Japan, was completed. Data included the basic characteristics of each VNA, the type of services provided, level of coordination with other service providers, total number of VNA patients who died per year and place of death and contractual relationship with home-care supporting clinics providing end-of-life care services in the home 24 h a day. The VNA characteristics were analyzed by logistic regression, using the home death rate per VNA as a dependent variable. A total 69 agencies, excluding those that did not report number of deaths (n = 14) and those without deaths during the year (n = 6), were analyzed. The median home death rate of the 69 VNA was 29.8%. The results of logistic regression analysis showed that higher home death rate was significantly associated with lack of attachment to a hospital, existence of a contractual relationship with home-care supporting clinics and existence of an interactive information exchange through telephone/face-to-face communication with attending physicians. In order to increase the home death rate of people using VNA, policymakers must consider establishing home-based service systems within the community that can provide home end-of-life care services 24 h a day, and support the interactive exchange of information between the visiting nurse and the attending physician. © 2014 The Authors. Geriatrics & Gerontology International published by Wiley Publishing Asia Pty Ltd on behalf of Japanese Geriatrics Society.
New robust statistical procedures for the polytomous logistic regression models.
Castilla, Elena; Ghosh, Abhik; Martin, Nirian; Pardo, Leandro
2018-05-17
This article derives a new family of estimators, namely the minimum density power divergence estimators, as a robust generalization of the maximum likelihood estimator for the polytomous logistic regression model. Based on these estimators, a family of Wald-type test statistics for linear hypotheses is introduced. Robustness properties of both the proposed estimators and the test statistics are theoretically studied through the classical influence function analysis. Appropriate real life examples are presented to justify the requirement of suitable robust statistical procedures in place of the likelihood based inference for the polytomous logistic regression model. The validity of the theoretical results established in the article are further confirmed empirically through suitable simulation studies. Finally, an approach for the data-driven selection of the robustness tuning parameter is proposed with empirical justifications. © 2018, The International Biometric Society.
Staley, Dennis M.; Negri, Jacquelyn A.; Kean, Jason W.; Laber, Jayme L.; Tillery, Anne C.; Youberg, Ann M.
2016-06-30
Wildfire can significantly alter the hydrologic response of a watershed to the extent that even modest rainstorms can generate dangerous flash floods and debris flows. To reduce public exposure to hazard, the U.S. Geological Survey produces post-fire debris-flow hazard assessments for select fires in the western United States. We use publicly available geospatial data describing basin morphology, burn severity, soil properties, and rainfall characteristics to estimate the statistical likelihood that debris flows will occur in response to a storm of a given rainfall intensity. Using an empirical database and refined geospatial analysis methods, we defined new equations for the prediction of debris-flow likelihood using logistic regression methods. We showed that the new logistic regression model outperformed previous models used to predict debris-flow likelihood.
A computational approach to compare regression modelling strategies in prediction research.
Pajouheshnia, Romin; Pestman, Wiebe R; Teerenstra, Steven; Groenwold, Rolf H H
2016-08-25
It is often unclear which approach to fit, assess and adjust a model will yield the most accurate prediction model. We present an extension of an approach for comparing modelling strategies in linear regression to the setting of logistic regression and demonstrate its application in clinical prediction research. A framework for comparing logistic regression modelling strategies by their likelihoods was formulated using a wrapper approach. Five different strategies for modelling, including simple shrinkage methods, were compared in four empirical data sets to illustrate the concept of a priori strategy comparison. Simulations were performed in both randomly generated data and empirical data to investigate the influence of data characteristics on strategy performance. We applied the comparison framework in a case study setting. Optimal strategies were selected based on the results of a priori comparisons in a clinical data set and the performance of models built according to each strategy was assessed using the Brier score and calibration plots. The performance of modelling strategies was highly dependent on the characteristics of the development data in both linear and logistic regression settings. A priori comparisons in four empirical data sets found that no strategy consistently outperformed the others. The percentage of times that a model adjustment strategy outperformed a logistic model ranged from 3.9 to 94.9 %, depending on the strategy and data set. However, in our case study setting the a priori selection of optimal methods did not result in detectable improvement in model performance when assessed in an external data set. The performance of prediction modelling strategies is a data-dependent process and can be highly variable between data sets within the same clinical domain. A priori strategy comparison can be used to determine an optimal logistic regression modelling strategy for a given data set before selecting a final modelling approach.
Perceptions of social support, empowerment and youth risk behaviors.
Reininger, Belinda M; Pérez, Adriana; Aguirre Flores, Maria I; Chen, Zhongxue; Rahbar, Mohammad H
2012-02-01
This study examined the association of perceived social support and community empowerment among urban middle-school students living in Matamoros, Mexico and the risk behaviors of fighting, alcohol and tobacco use, and sexual activity. Middle school students (n = 1,181) from 32 public and private Mexican schools were surveyed. Weighted multiple logistic regression analyses were conducted. Among girls, lack of parent/teacher interactions regarding school increased odds for fighting, alcohol and tobacco use. Among boys, lack of empowerment increased odds of alcohol and tobacco use and lack of parent/teacher interactions regarding school increased odds for sexual activity. Community empowerment and perceived social support are uniquely associated with risk behaviors for girls and boys. Additionally, perceived social support from individuals most immediate to the youth are associated with protection against risk for some behaviors, while perceived social support from individuals more removed from youth have mixed association with risk behaviors.
Cakir, Ebru; Kucuk, Ulku; Pala, Emel Ebru; Sezer, Ozlem; Ekin, Rahmi Gokhan; Cakmak, Ozgur
2017-05-01
Conventional cytomorphologic assessment is the first step to establish an accurate diagnosis in urinary cytology. In cytologic preparations, the separation of low-grade urothelial carcinoma (LGUC) from reactive urothelial proliferation (RUP) can be exceedingly difficult. The bladder washing cytologies of 32 LGUC and 29 RUP were reviewed. The cytologic slides were examined for the presence or absence of the 28 cytologic features. The cytologic criteria showing statistical significance in LGUC were increased numbers of monotonous single (non-umbrella) cells, three-dimensional cellular papillary clusters without fibrovascular cores, irregular bordered clusters, atypical single cells, irregular nuclear overlap, cytoplasmic homogeneity, increased N/C ratio, pleomorphism, nuclear border irregularity, nuclear eccentricity, elongated nuclei, and hyperchromasia (p ˂ 0.05), and the cytologic criteria showing statistical significance in RUP were inflammatory background, mixture of small and large urothelial cells, loose monolayer aggregates, and vacuolated cytoplasm (p ˂ 0.05). When these variables were subjected to a stepwise logistic regression analysis, four features were selected to distinguish LGUC from RUP: increased numbers of monotonous single (non-umbrella) cells, increased nuclear cytoplasmic ratio, hyperchromasia, and presence of small and large urothelial cells (p = 0.0001). By this logistic model of the 32 cases with proven LGUC, the stepwise logistic regression analysis correctly predicted 31 (96.9%) patients with this diagnosis, and of the 29 patients with RUP, the logistic model correctly predicted 26 (89.7%) patients as having this disease. There are several cytologic features to separate LGUC from RUP. Stepwise logistic regression analysis is a valuable tool for determining the most useful cytologic criteria to distinguish these entities. © 2017 APMIS. Published by John Wiley & Sons Ltd.
Ooms, Linda; Leemrijse, Chantal; Collard, Dorine; Schipper-van Veldhoven, Nicolette; Veenhof, Cindy
2018-06-01
Health-enhancing physical activity (HEPA) promotion programs are implemented in sports clubs. The purpose of this study was to examine the characteristics of the insufficiently active participants that benefit from these programs. Data of three sporting programs, developed for insufficiently active adults, were used for this study. These sporting programs were implemented in different sports clubs in the Netherlands. Participants completed an online questionnaire at baseline and after six months (n = 458). Of this sample, 35.1% (n = 161) was insufficiently active (i.e. not meeting HEPA levels) at baseline. Accordingly, two groups were compared: participants who were insufficiently active at baseline, but increased their physical activity to HEPA levels after six months (activated group, n = 86) versus participants who were insufficiently active both at baseline and after six months (non-activated group, n = 75). Potential associated characteristics (demographic, social, sport history, physical activity) were included as independent variables in bivariate and multivariate logistic regression analyses. The percentage of active participants increased significantly from baseline to six months (from 64.9 to 76.9%, p < 0.05). The bivariate logistic regression analyses showed that participants in the activated group were more likely to receive support from family members with regard to their sport participation (62.8% vs. 42.7%, p = 0.02) and spent more time in moderate-intensity physical activity (128 ± 191 min/week vs. 70 ± 106 min/week, p = 0.02) at baseline compared with participants in the non-activated group. These results were confirmed in the multivariate logistic regression analyses: when receiving support from most family members, there is a 216% increase in the odds of being in the activated group (OR = 2.155; 95% CI: 1.118-4.154, p = 0.02) and for each additional 1 min/week spent in moderate-intensity physical activity, the odds increases with 0.3% (OR = 1.003; 95% CI: 1.001-1.006, p = 0.02). The results suggest that HEPA sporting programs can be used to increase HEPA levels of insufficiently active people, but it seems a challenge to reach the least active ones. It is important that promotional strategies and channels are tailored to the target group. Furthermore, strategies that promote family support may enhance the impact of the programs.
When women tell: intimate partner violence and the factors related to police notification.
Novisky, Meghan A; Peralta, Robert L
2015-01-01
We analyze how victim perceptions of mandatory arrest policies, perpetrator substance use, and presence of children are related to decisions to invoke law enforcement assistance. Logistic regression was used on survey responses from women receiving care in domestic violence shelters. Results suggest that as victim support for mandatory arrest increases, the odds of law enforcement notification of the abuse also increase. Accordingly, mandatory arrest may simply be reducing the probability of reporting intimate partner violence (IPV) among those who do not support the policy, instead of reducing IPV. Results also suggest that perpetrator substance use plays a significant role in law enforcement notification. © The Author(s) 2014.
Science of Test Research Consortium: Year Two Final Report
2012-10-02
July 2012. Analysis of an Intervention for Small Unmanned Aerial System ( SUAS ) Accidents, submitted to Quality Engineering, LQEN-2012-0056. Stone... Systems Engineering. Wolf, S. E., R. R. Hill, and J. J. Pignatiello. June 2012. Using Neural Networks and Logistic Regression to Model Small Unmanned ...Human Retina. 6. Wolf, S. E. March 2012. Modeling Small Unmanned Aerial System Mishaps using Logistic Regression and Artificial Neural Networks. 7
ERIC Educational Resources Information Center
Hidalgo, Mª Dolores; Gómez-Benito, Juana; Zumbo, Bruno D.
2014-01-01
The authors analyze the effectiveness of the R[superscript 2] and delta log odds ratio effect size measures when using logistic regression analysis to detect differential item functioning (DIF) in dichotomous items. A simulation study was carried out, and the Type I error rate and power estimates under conditions in which only statistical testing…
Brian S. Cade; Barry R. Noon; Rick D. Scherer; John J. Keane
2017-01-01
Counts of avian fledglings, nestlings, or clutch size that are bounded below by zero and above by some small integer form a discrete random variable distribution that is not approximated well by conventional parametric count distributions such as the Poisson or negative binomial. We developed a logistic quantile regression model to provide estimates of the empirical...
Mohammed, Mohammed A; Manktelow, Bradley N; Hofer, Timothy P
2016-04-01
There is interest in deriving case-mix adjusted standardised mortality ratios so that comparisons between healthcare providers, such as hospitals, can be undertaken in the controversial belief that variability in standardised mortality ratios reflects quality of care. Typically standardised mortality ratios are derived using a fixed effects logistic regression model, without a hospital term in the model. This fails to account for the hierarchical structure of the data - patients nested within hospitals - and so a hierarchical logistic regression model is more appropriate. However, four methods have been advocated for deriving standardised mortality ratios from a hierarchical logistic regression model, but their agreement is not known and neither do we know which is to be preferred. We found significant differences between the four types of standardised mortality ratios because they reflect a range of underlying conceptual issues. The most subtle issue is the distinction between asking how an average patient fares in different hospitals versus how patients at a given hospital fare at an average hospital. Since the answers to these questions are not the same and since the choice between these two approaches is not obvious, the extent to which profiling hospitals on mortality can be undertaken safely and reliably, without resolving these methodological issues, remains questionable. © The Author(s) 2012.
Chan, Siew Foong; Deeks, Jonathan J; Macaskill, Petra; Irwig, Les
2008-01-01
To compare three predictive models based on logistic regression to estimate adjusted likelihood ratios allowing for interdependency between diagnostic variables (tests). This study was a review of the theoretical basis, assumptions, and limitations of published models; and a statistical extension of methods and application to a case study of the diagnosis of obstructive airways disease based on history and clinical examination. Albert's method includes an offset term to estimate an adjusted likelihood ratio for combinations of tests. Spiegelhalter and Knill-Jones method uses the unadjusted likelihood ratio for each test as a predictor and computes shrinkage factors to allow for interdependence. Knottnerus' method differs from the other methods because it requires sequencing of tests, which limits its application to situations where there are few tests and substantial data. Although parameter estimates differed between the models, predicted "posttest" probabilities were generally similar. Construction of predictive models using logistic regression is preferred to the independence Bayes' approach when it is important to adjust for dependency of tests errors. Methods to estimate adjusted likelihood ratios from predictive models should be considered in preference to a standard logistic regression model to facilitate ease of interpretation and application. Albert's method provides the most straightforward approach.
Cameron, Isobel M; Scott, Neil W; Adler, Mats; Reid, Ian C
2014-12-01
It is important for clinical practice and research that measurement scales of well-being and quality of life exhibit only minimal differential item functioning (DIF). DIF occurs where different groups of people endorse items in a scale to different extents after being matched by the intended scale attribute. We investigate the equivalence or otherwise of common methods of assessing DIF. Three methods of measuring age- and sex-related DIF (ordinal logistic regression, Rasch analysis and Mantel χ(2) procedure) were applied to Hospital Anxiety Depression Scale (HADS) data pertaining to a sample of 1,068 patients consulting primary care practitioners. Three items were flagged by all three approaches as having either age- or sex-related DIF with a consistent direction of effect; a further three items identified did not meet stricter criteria for important DIF using at least one method. When applying strict criteria for significant DIF, ordinal logistic regression was slightly less sensitive. Ordinal logistic regression, Rasch analysis and contingency table methods yielded consistent results when identifying DIF in the HADS depression and HADS anxiety scales. Regardless of methods applied, investigators should use a combination of statistical significance, magnitude of the DIF effect and investigator judgement when interpreting the results.
NASA Astrophysics Data System (ADS)
Cao, Faxian; Yang, Zhijing; Ren, Jinchang; Ling, Wing-Kuen; Zhao, Huimin; Marshall, Stephen
2017-12-01
Although the sparse multinomial logistic regression (SMLR) has provided a useful tool for sparse classification, it suffers from inefficacy in dealing with high dimensional features and manually set initial regressor values. This has significantly constrained its applications for hyperspectral image (HSI) classification. In order to tackle these two drawbacks, an extreme sparse multinomial logistic regression (ESMLR) is proposed for effective classification of HSI. First, the HSI dataset is projected to a new feature space with randomly generated weight and bias. Second, an optimization model is established by the Lagrange multiplier method and the dual principle to automatically determine a good initial regressor for SMLR via minimizing the training error and the regressor value. Furthermore, the extended multi-attribute profiles (EMAPs) are utilized for extracting both the spectral and spatial features. A combinational linear multiple features learning (MFL) method is proposed to further enhance the features extracted by ESMLR and EMAPs. Finally, the logistic regression via the variable splitting and the augmented Lagrangian (LORSAL) is adopted in the proposed framework for reducing the computational time. Experiments are conducted on two well-known HSI datasets, namely the Indian Pines dataset and the Pavia University dataset, which have shown the fast and robust performance of the proposed ESMLR framework.
Latin hypercube approach to estimate uncertainty in ground water vulnerability
Gurdak, J.J.; McCray, J.E.; Thyne, G.; Qi, S.L.
2007-01-01
A methodology is proposed to quantify prediction uncertainty associated with ground water vulnerability models that were developed through an approach that coupled multivariate logistic regression with a geographic information system (GIS). This method uses Latin hypercube sampling (LHS) to illustrate the propagation of input error and estimate uncertainty associated with the logistic regression predictions of ground water vulnerability. Central to the proposed method is the assumption that prediction uncertainty in ground water vulnerability models is a function of input error propagation from uncertainty in the estimated logistic regression model coefficients (model error) and the values of explanatory variables represented in the GIS (data error). Input probability distributions that represent both model and data error sources of uncertainty were simultaneously sampled using a Latin hypercube approach with logistic regression calculations of probability of elevated nonpoint source contaminants in ground water. The resulting probability distribution represents the prediction intervals and associated uncertainty of the ground water vulnerability predictions. The method is illustrated through a ground water vulnerability assessment of the High Plains regional aquifer. Results of the LHS simulations reveal significant prediction uncertainties that vary spatially across the regional aquifer. Additionally, the proposed method enables a spatial deconstruction of the prediction uncertainty that can lead to improved prediction of ground water vulnerability. ?? 2007 National Ground Water Association.
The Impact of Worksite Supports for Healthy Eating on Dietary Behaviors.
Dodson, Elizabeth Anne; Hipp, James Aaron; Gao, Mengchao; Tabak, Rachel Gail; Yang, Lin; Brownson, Ross Charles
2016-08-01
The purpose of this study was to assess the availability of worksite supports (WSS) for healthy eating and examine associations between existing supports and dietary behaviors. A cross-sectional, telephone-based study was conducted with 2013 participants in four metropolitan areas in 2012. Logistic regression was used to examine associations between dietary behaviors and the availability or use of WSS. Those reporting the availability of a cafeteria/snack bar/food services at the worksite were more likely to consume fruits and vegetables more than twice/day, and less likely to consume fast food more than twice/week. Study results highlight the utility of specific WSS to improve employee dietary behaviors while raising questions about why the presence of healthy foods at the worksite may not translate into employee consumption of such foods.
Who supports whom? Gender and intergenerational transfers in post-industrial Barbados.
Quashie, Nekehia T
2015-06-01
This study examines the likelihood that older adults and their children in Bridgetown, Barbados engage in exchanges of financial, functional, and material support and the extent to which gender influences transfers. Data come from the 2000 Survey of Health, Well-Being and Aging in Latin America and the Caribbean (SABE) of Bridgetown, Barbados N = 3876 children, representing 1135 families. Multivariate logistic regression models examine the demographic and economic situations of both older and younger cohorts that encourage or constrain intergenerational exchanges. Results confirm, as in many developing countries, a higher proportion of older Barbadians receive rather than provide support. Gender differentiation in support transfers depends on the type of support examined and the living arrangements of parents and children. Support exchanges are highly conditioned by the socioeconomic circumstances of both generations but gender stratification in the labor market does not appear to mediate support exchanges. These findings suggest some flexibility in gender systems with respect to intergenerational support within Barbado.
Kapadia, Farzana; Halkitis, Perry; Barton, Staci; Siconolfi, Daniel; Figueroa, Rafael Perez
2014-01-01
Few studies have examined how social support network characteristics are related to perceived receipt of social support among male sexual minority youth. Using egocentric network data collected from a study of male sexual minority youth (n=592), multivariable logistic regression analyses examined distinct associations between individual and social network characteristics with receipt of (1) emotional and (2) material support. In multivariable models, frequent communication and having friends in one’s network yielded a two-fold increase in the likelihood of receiving emotional support whereas frequent communication was associated with an almost three-fold higher likelihood of perceived material support. Finally, greater internalized homophobia and personal experiences of gay-related stigma were inversely associated with perceived receipt of emotional and material support, respectively. Understanding the evolving social context and social interactions of this new generation of male sexual minority youth is warranted in order to understand the broader, contextual factors associated with their overall health and well-being. PMID:25214756
Kupek, Emil
2006-03-15
Structural equation modelling (SEM) has been increasingly used in medical statistics for solving a system of related regression equations. However, a great obstacle for its wider use has been its difficulty in handling categorical variables within the framework of generalised linear models. A large data set with a known structure among two related outcomes and three independent variables was generated to investigate the use of Yule's transformation of odds ratio (OR) into Q-metric by (OR-1)/(OR+1) to approximate Pearson's correlation coefficients between binary variables whose covariance structure can be further analysed by SEM. Percent of correctly classified events and non-events was compared with the classification obtained by logistic regression. The performance of SEM based on Q-metric was also checked on a small (N = 100) random sample of the data generated and on a real data set. SEM successfully recovered the generated model structure. SEM of real data suggested a significant influence of a latent confounding variable which would have not been detectable by standard logistic regression. SEM classification performance was broadly similar to that of the logistic regression. The analysis of binary data can be greatly enhanced by Yule's transformation of odds ratios into estimated correlation matrix that can be further analysed by SEM. The interpretation of results is aided by expressing them as odds ratios which are the most frequently used measure of effect in medical statistics.
NASA Technical Reports Server (NTRS)
Palguta, T.; Bradley, W.; Stockton, T.
1988-01-01
The purpose is to outline an Office of Space Science and Applications (OSSA) integrated logistics support strategy that will ensure effective logistics support of OSSA payloads at an affordable life-cycle cost. Program objectives, organizational relationships, and implementation of the logistics strategy are discussed.
Kumar, Santosh; Calvo, Rocio; Avendano, Mauricio; Sivaramakrishnan, Kavita; Berkman, Lisa F
2012-03-01
High levels of social capital and social integration are associated with self-rated health in many developed countries. However, it is not known whether this association extends to non-western and less economically advanced countries. We examine associations between social support, volunteering, and self-rated health in 139 low-, middle- and high-income countries. Data come from the Gallup World Poll, an internationally comparable survey conducted yearly from 2005 to 2009 for those 15 and over. Volunteering was measured by self-reports of volunteering to an organization in the past month. Social support was based on self-reports of access to support from relatives and friends. We started by estimating random coefficient (multi-level) models and then used multivariate logistic regression to model health as a function of social support and volunteering, controlling for age, gender, education, marital status, and religiosity. We found statistically significant evidence of cross-national variation in the association between social capital variables and self-rated health. In the multivariate logistic model, self-rated health were significantly associated with having social support from friends and relatives and volunteering. Results from stratified analyses indicate that these associations are strikingly consistent across countries. Our results indicate that the link between social capital and health is not restricted to high-income countries but extends across many geographical regions regardless of their national-income level. Copyright © 2012 Elsevier Ltd. All rights reserved.
Risk modeling for ventricular assist device support in post-cardiotomy shock.
Alsoufi, Bahaaldin; Rao, Vivek; Tang, Augustine; Maganti, Manjula; Cusimano, Robert
2012-04-01
Post-cardiotomy shock (PCS) has a complex etiology. Although treatment with inotrops and intra-aortic balloon pump (IABP) support improves cardiac performance, end-organ injuries are common and lead to prolonged ICU stay, extended hospitalization and increased mortality. Early consideration of mechanical circulatory support may prevent such complications and improve outcome. Between January 1997 and January 2002, 321 patients required IABP and inotropic support for PCS following coronary artery bypass grafting (CABG) at our institution. Perioperative variables including age, mixed venous saturation (MVO2), inotropic requirements and LV function were analyzed using multivariate statistical methods. All explanatory variables with a univariate p value <0.10 were entered into a stepwise logistic regression model to predict hospital mortality. Odds ratios from significant variables (p < 0.05) in the regression model were used to compose a risk score. Overall hospital mortality was 16%. The independent risk factors for mortality in this population were: MVO2 < 60% (OR = 3.2), milrinone > 0.5 μg/kg/min (OR = 3.2), age > 75 (OR = 2.7), adrenaline > 0.1 μg/kg/min (OR = 1.5). A 15-point risk score was developed based on the regression model. Hospital mortality in patients with a score >6 was 46% (n = 13/28), 3-6 was 31% (n = 9/29) and <3 was 11% (n = 29/264). A significant proportion of patients with PCS continue to face high mortality despite IABP and inotropic support. Advanced age, heavy inotropic dependency and poor oxygen delivery all predicted increased risk for death. Further investigation is needed to assess whether early institution of VAD support could improve outcome in this high-risk group of patients.
ERIC Educational Resources Information Center
Kasapoglu, Koray
2014-01-01
This study aims to investigate which factors are associated with Turkey's 15-year-olds' scoring above the OECD average (493) on the PISA'09 reading assessment. Collected from a total of 4,996 15-year-old students from Turkey, data were analyzed by logistic regression analysis in order to model the data of students who were split into two: (1)…
Upgrade Summer Severe Weather Tool
NASA Technical Reports Server (NTRS)
Watson, Leela
2011-01-01
The goal of this task was to upgrade to the existing severe weather database by adding observations from the 2010 warm season, update the verification dataset with results from the 2010 warm season, use statistical logistic regression analysis on the database and develop a new forecast tool. The AMU analyzed 7 stability parameters that showed the possibility of providing guidance in forecasting severe weather, calculated verification statistics for the Total Threat Score (TTS), and calculated warm season verification statistics for the 2010 season. The AMU also performed statistical logistic regression analysis on the 22-year severe weather database. The results indicated that the logistic regression equation did not show an increase in skill over the previously developed TTS. The equation showed less accuracy than TTS at predicting severe weather, little ability to distinguish between severe and non-severe weather days, and worse standard categorical accuracy measures and skill scores over TTS.
Estimating the Probability of Rare Events Occurring Using a Local Model Averaging.
Chen, Jin-Hua; Chen, Chun-Shu; Huang, Meng-Fan; Lin, Hung-Chih
2016-10-01
In statistical applications, logistic regression is a popular method for analyzing binary data accompanied by explanatory variables. But when one of the two outcomes is rare, the estimation of model parameters has been shown to be severely biased and hence estimating the probability of rare events occurring based on a logistic regression model would be inaccurate. In this article, we focus on estimating the probability of rare events occurring based on logistic regression models. Instead of selecting a best model, we propose a local model averaging procedure based on a data perturbation technique applied to different information criteria to obtain different probability estimates of rare events occurring. Then an approximately unbiased estimator of Kullback-Leibler loss is used to choose the best one among them. We design complete simulations to show the effectiveness of our approach. For illustration, a necrotizing enterocolitis (NEC) data set is analyzed. © 2016 Society for Risk Analysis.
Evaluating the perennial stream using logistic regression in central Taiwan
NASA Astrophysics Data System (ADS)
Ruljigaljig, T.; Cheng, Y. S.; Lin, H. I.; Lee, C. H.; Yu, T. T.
2014-12-01
This study produces a perennial stream head potential map, based on a logistic regression method with a Geographic Information System (GIS). Perennial stream initiation locations, indicates the location of the groundwater and surface contact, were identified in the study area from field survey. The perennial stream potential map in central Taiwan was constructed using the relationship between perennial stream and their causative factors, such as Catchment area, slope gradient, aspect, elevation, groundwater recharge and precipitation. Here, the field surveys of 272 streams were determined in the study area. The areas under the curve for logistic regression methods were calculated as 0.87. The results illustrate the importance of catchment area and groundwater recharge as key factors within the model. The results obtained from the model within the GIS were then used to produce a map of perennial stream and estimate the location of perennial stream head.
Menditto, Anthony A; Linhorst, Donald M; Coleman, James C; Beck, Niels C
2006-04-01
Development of policies and procedures to contend with the risks presented by elopement, aggression, and suicidal behaviors are long-standing challenges for mental health administrators. Guidance in making such judgments can be obtained through the use of a multivariate statistical technique known as logistic regression. This procedure can be used to develop a predictive equation that is mathematically formulated to use the best combination of predictors, rather than considering just one factor at a time. This paper presents an overview of logistic regression and its utility in mental health administrative decision making. A case example of its application is presented using data on elopements from Missouri's long-term state psychiatric hospitals. Ultimately, the use of statistical prediction analyses tempered with differential qualitative weighting of classification errors can augment decision-making processes in a manner that provides guidance and flexibility while wrestling with the complex problem of risk assessment and decision making.
Lei, Yang; Nollen, Nikki; Ahluwahlia, Jasjit S; Yu, Qing; Mayo, Matthew S
2015-04-09
Other forms of tobacco use are increasing in prevalence, yet most tobacco control efforts are aimed at cigarettes. In light of this, it is important to identify individuals who are using both cigarettes and alternative tobacco products (ATPs). Most previous studies have used regression models. We conducted a traditional logistic regression model and a classification and regression tree (CART) model to illustrate and discuss the added advantages of using CART in the setting of identifying high-risk subgroups of ATP users among cigarettes smokers. The data were collected from an online cross-sectional survey administered by Survey Sampling International between July 5, 2012 and August 15, 2012. Eligible participants self-identified as current smokers, African American, White, or Latino (of any race), were English-speaking, and were at least 25 years old. The study sample included 2,376 participants and was divided into independent training and validation samples for a hold out validation. Logistic regression and CART models were used to examine the important predictors of cigarettes + ATP users. The logistic regression model identified nine important factors: gender, age, race, nicotine dependence, buying cigarettes or borrowing, whether the price of cigarettes influences the brand purchased, whether the participants set limits on cigarettes per day, alcohol use scores, and discrimination frequencies. The C-index of the logistic regression model was 0.74, indicating good discriminatory capability. The model performed well in the validation cohort also with good discrimination (c-index = 0.73) and excellent calibration (R-square = 0.96 in the calibration regression). The parsimonious CART model identified gender, age, alcohol use score, race, and discrimination frequencies to be the most important factors. It also revealed interesting partial interactions. The c-index is 0.70 for the training sample and 0.69 for the validation sample. The misclassification rate was 0.342 for the training sample and 0.346 for the validation sample. The CART model was easier to interpret and discovered target populations that possess clinical significance. This study suggests that the non-parametric CART model is parsimonious, potentially easier to interpret, and provides additional information in identifying the subgroups at high risk of ATP use among cigarette smokers.
Falkenberg, A; Nyfjäll, M; Hellgren, C; Vingård, E
2012-01-01
The aim of this longitudinal study is to investigate how different aspects of social support at work and in leisure time are associated with self rated health and sickness absence. The 541 participants in the study were representative for a working population in the public sector in Sweden with a majority being woman. Most of the variables were created from data from a questionnaire in March-April 2005. There were four independent variables and two dependent variables. The dependent were based on data from November 2006. A logistic regression model was used for the analysis of associations. A separate model was adapted for each of the explanatory variables for each outcome, which gave five models per independent variable. The study has given a greater awareness of the importance of employees receiving social support, regardless of type of support or from whom the support is coming. Social support has a strong association with SRH in a longitudinal perspective and no association between social support and sickness absence.
Akkus, Zeki; Camdeviren, Handan; Celik, Fatma; Gur, Ali; Nas, Kemal
2005-09-01
To determine the risk factors of osteoporosis using a multiple binary logistic regression method and to assess the risk variables for osteoporosis, which is a major and growing health problem in many countries. We presented a case-control study, consisting of 126 postmenopausal healthy women as control group and 225 postmenopausal osteoporotic women as the case group. The study was carried out in the Department of Physical Medicine and Rehabilitation, Dicle University, Diyarbakir, Turkey between 1999-2002. The data from the 351 participants were collected using a standard questionnaire that contains 43 variables. A multiple logistic regression model was then used to evaluate the data and to find the best regression model. We classified 80.1% (281/351) of the participants using the regression model. Furthermore, the specificity value of the model was 67% (84/126) of the control group while the sensitivity value was 88% (197/225) of the case group. We found the distribution of residual values standardized for final model to be exponential using the Kolmogorow-Smirnow test (p=0.193). The receiver operating characteristic curve was found successful to predict patients with risk for osteoporosis. This study suggests that low levels of dietary calcium intake, physical activity, education, and longer duration of menopause are independent predictors of the risk of low bone density in our population. Adequate dietary calcium intake in combination with maintaining a daily physical activity, increasing educational level, decreasing birth rate, and duration of breast-feeding may contribute to healthy bones and play a role in practical prevention of osteoporosis in Southeast Anatolia. In addition, the findings of the present study indicate that the use of multivariate statistical method as a multiple logistic regression in osteoporosis, which maybe influenced by many variables, is better than univariate statistical evaluation.
Shi, K-Q; Zhou, Y-Y; Yan, H-D; Li, H; Wu, F-L; Xie, Y-Y; Braddock, M; Lin, X-Y; Zheng, M-H
2017-02-01
At present, there is no ideal model for predicting the short-term outcome of patients with acute-on-chronic hepatitis B liver failure (ACHBLF). This study aimed to establish and validate a prognostic model by using the classification and regression tree (CART) analysis. A total of 1047 patients from two separate medical centres with suspected ACHBLF were screened in the study, which were recognized as derivation cohort and validation cohort, respectively. CART analysis was applied to predict the 3-month mortality of patients with ACHBLF. The accuracy of the CART model was tested using the area under the receiver operating characteristic curve, which was compared with the model for end-stage liver disease (MELD) score and a new logistic regression model. CART analysis identified four variables as prognostic factors of ACHBLF: total bilirubin, age, serum sodium and INR, and three distinct risk groups: low risk (4.2%), intermediate risk (30.2%-53.2%) and high risk (81.4%-96.9%). The new logistic regression model was constructed with four independent factors, including age, total bilirubin, serum sodium and prothrombin activity by multivariate logistic regression analysis. The performances of the CART model (0.896), similar to the logistic regression model (0.914, P=.382), exceeded that of MELD score (0.667, P<.001). The results were confirmed in the validation cohort. We have developed and validated a novel CART model superior to MELD for predicting three-month mortality of patients with ACHBLF. Thus, the CART model could facilitate medical decision-making and provide clinicians with a validated practical bedside tool for ACHBLF risk stratification. © 2016 John Wiley & Sons Ltd.
Arevalillo, Jorge M; Sztein, Marcelo B; Kotloff, Karen L; Levine, Myron M; Simon, Jakub K
2017-10-01
Immunologic correlates of protection are important in vaccine development because they give insight into mechanisms of protection, assist in the identification of promising vaccine candidates, and serve as endpoints in bridging clinical vaccine studies. Our goal is the development of a methodology to identify immunologic correlates of protection using the Shigella challenge as a model. The proposed methodology utilizes the Random Forests (RF) machine learning algorithm as well as Classification and Regression Trees (CART) to detect immune markers that predict protection, identify interactions between variables, and define optimal cutoffs. Logistic regression modeling is applied to estimate the probability of protection and the confidence interval (CI) for such a probability is computed by bootstrapping the logistic regression models. The results demonstrate that the combination of Classification and Regression Trees and Random Forests complements the standard logistic regression and uncovers subtle immune interactions. Specific levels of immunoglobulin IgG antibody in blood on the day of challenge predicted protection in 75% (95% CI 67-86). Of those subjects that did not have blood IgG at or above a defined threshold, 100% were protected if they had IgA antibody secreting cells above a defined threshold. Comparison with the results obtained by applying only logistic regression modeling with standard Akaike Information Criterion for model selection shows the usefulness of the proposed method. Given the complexity of the immune system, the use of machine learning methods may enhance traditional statistical approaches. When applied together, they offer a novel way to quantify important immune correlates of protection that may help the development of vaccines. Copyright © 2017 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Schaeben, Helmut; Semmler, Georg
2016-09-01
The objective of prospectivity modeling is prediction of the conditional probability of the presence T = 1 or absence T = 0 of a target T given favorable or prohibitive predictors B, or construction of a two classes 0,1 classification of T. A special case of logistic regression called weights-of-evidence (WofE) is geologists' favorite method of prospectivity modeling due to its apparent simplicity. However, the numerical simplicity is deceiving as it is implied by the severe mathematical modeling assumption of joint conditional independence of all predictors given the target. General weights of evidence are explicitly introduced which are as simple to estimate as conventional weights, i.e., by counting, but do not require conditional independence. Complementary to the regression view is the classification view on prospectivity modeling. Boosting is the construction of a strong classifier from a set of weak classifiers. From the regression point of view it is closely related to logistic regression. Boost weights-of-evidence (BoostWofE) was introduced into prospectivity modeling to counterbalance violations of the assumption of conditional independence even though relaxation of modeling assumptions with respect to weak classifiers was not the (initial) purpose of boosting. In the original publication of BoostWofE a fabricated dataset was used to "validate" this approach. Using the same fabricated dataset it is shown that BoostWofE cannot generally compensate lacking conditional independence whatever the consecutively processing order of predictors. Thus the alleged features of BoostWofE are disproved by way of counterexamples, while theoretical findings are confirmed that logistic regression including interaction terms can exactly compensate violations of joint conditional independence if the predictors are indicators.
Ross, Whitney Trotter; Meister, Melanie R; Shepherd, Jonathan P; Olsen, Margaret A; Lowder, Jerry L
2017-10-01
Apical vaginal support is considered the keystone of pelvic organ support. Level I evidence supports reestablishment of apical support at time of hysterectomy, regardless of whether the hysterectomy is performed for prolapse. National rates of apical support procedure performance at time of inpatient hysterectomy have not been well described. We sought to estimate trends and factors associated with use of apical support procedures at time of inpatient hysterectomy for benign indications in a large national database. The National (Nationwide) Inpatient Sample was used to identify hysterectomies performed from 2004 through 2013 for benign indications. International Classification of Diseases, Ninth Revision, Clinical Modification codes were used to select both procedures and diagnoses. The primary outcome was performance of an apical support procedure at time of hysterectomy. Descriptive and multivariable analyses were performed. There were 3,509,230 inpatient hysterectomies performed for benign disease from 2004 through 2013. In both nonprolapse and prolapse groups, there was a significant decrease in total number of annual hysterectomies performed over the study period (P < .0001). There were 2,790,652 (79.5%) hysterectomies performed without a diagnosis of prolapse, and an apical support procedure was performed in only 85,879 (3.1%). There was a significant decrease in the proportion of hysterectomies with concurrent apical support procedure (high of 4.0% in 2004 to 2.5% in 2013, P < .0001). In the multivariable logistic regression model, increasing age, hospital type (urban teaching), hospital bed size (large and medium), and hysterectomy type (vaginal and laparoscopically assisted vaginal) were associated with performance of an apical support procedure. During the study period, 718,578 (20.5%) inpatient hysterectomies were performed for prolapse diagnoses and 266,743 (37.1%) included an apical support procedure. There was a significant increase in the proportion of hysterectomies with concurrent apical support procedure (low of 31.3% in 2005 to 49.3% in 2013, P < .0001). In the multivariable logistic regression model, increasing age, hospital type (urban teaching), hospital bed size (medium and large), and hysterectomy type (total laparoscopic and laparoscopic supracervical) were associated with performance of an apical support procedure. This national database study demonstrates that apical support procedures are not routinely performed at time of inpatient hysterectomy regardless of presence of prolapse diagnosis. Educational efforts are needed to increase awareness of the importance of reestablishing apical vaginal support at time of hysterectomy regardless of indication. Copyright © 2017 Elsevier Inc. All rights reserved.
Dai, Wenjie; Kaminga, Atipatsa C.; Tan, Hongzhuan; Wang, Jieru; Lai, Zhiwei; Wu, Xin; Liu, Aizhong
2017-01-01
Background Although numerous studies have indicated that exposure to natural disasters may increase survivors’ risk of post-traumatic stress disorder (PTSD) and anxiety, studies focusing on the long-term psychological outcomes of flood survivors are limited. Thus, this study aimed to estimate the prevalence of PTSD and anxiety among flood survivors 17 years after the 1998 Dongting Lake flood and to identify the risk factors for PTSD and anxiety. Methods This cross-sectional study was conducted in December 2015, 17 years after the 1998 Dongting Lake flood. Survivors in hard-hit areas of the flood disaster were enrolled in this study using a stratified, systematic random sampling method. Well qualified investigators conducted face-to-face interviews with participants using the PTSD Checklist-Civilian version, the Zung Self-Rating Anxiety Scale, the Chinese version of the Social Support Rating Scale and the Revised Eysenck Personality Questionnaire-Short Scale for Chinese to assess PTSD, anxiety, social support and personality traits, respectively. Logistic regression analyses were used to identify factors associated with PTSD and anxiety. Results A total of 325 participants were recruited in this study, and the prevalence of PTSD and anxiety was 9.5% and 9.2%, respectively. Multivariable logistic regression analyses indicated that female sex, experiencing at least three flood-related stressors, having a low level of social support, and having the trait of emotional instability were risk factors for long-term adverse psychological outcomes among flood survivors after the disaster. Conclusions PTSD and anxiety were common long-term adverse psychological outcomes among flood survivors. Early and effective psychological interventions for flood survivors are needed to prevent the development of PTSD and anxiety in the long run after a flood, especially for individuals who are female, experience at least three flood-related stressors, have a low level of social support and have the trait of emotional instability. PMID:28170427
Alquaiz, ALJohara M; Almuneef, Maha; Kazi, Ambreen; Almeneessier, Aljohara
2017-12-01
Intimate partner violence is a worldwide public health problem. The objectives of this study were to measure the prevalence and types of domestic violence, and to explore the association between social determinants (sociodemographic factors, husband-related factors, and social support) and violence against women by their intimate partner (husband). We conducted a cross-sectional survey in 18 randomly selected primary health care centers and 13 private institutions (teaching institutes, government offices, social welfare organizations) in Riyadh, Saudi Arabia. Female data collectors took interview from 1,883 married Saudi females aged 30 to 75 years. Interviews included sociodemographic information, reproductive health variables, and social support questionnaire. Violence was measured using modified Intimate Partner Violence Against Women questionnaire developed by the World Health Organization. Multivariate logistic regression analysis was conducted. The lifetime prevalence for any type of violence was 43.0% ( n = 810). The most frequent type was controlling behavior (36.8%), followed by emotional violence (22%), sexual violence (12.7%), and physical violence (9.0%). Multivariate logistic regression analysis revealed that the following were associated with greater odds of reporting domestic violence: younger age 30 to 40 years (adjusted odds ratio [aOR] = 1.9, 95% confidence interval [CI] = [1.3, 3.0]), 41 to 50 years (aOR = 1.6, 95% CI = [1.1, 2.5]); lack of emotional support (aOR = 1.7, 95% CI = [1.2, 2.5]); lack of tangible support (aOR = 1.4, 95% CI = [1.1, 1.9]); and perceived poor self-health (aOR = 1.7, 95% CI = [1.0, 3.0]), husbands' poor health (aOR = 1.9, 95% CI = [1.2, 2.0]), and polygamy (aOR = 1.6, 95% CI = [1.5, 2.6]). Domestic violence occurs frequently in Saudi Arabia. Both social conditions and social relations are significantly associated with domestic violence against Saudi women. Furthermore, improvement in implementation of the local policies and multisectoral protection services can prevent women from domestic violence.
Patregnani, Jason T; Borgman, Matthew A; Maegele, Marc; Wade, Charles E; Blackbourne, Lorne H; Spinella, Philip C
2012-05-01
In adults, early traumatic coagulopathy and shock are both common and independently associated with mortality. There are little data regarding both the incidence and association of early coagulopathy and shock on outcomes in pediatric patients with traumatic injuries. Our objective was to determine whether coagulopathy and shock on admission are independently associated with mortality in children with traumatic injuries. A retrospective review of the Joint Theater Trauma Registry from U.S. combat support hospitals in Iraq and Afghanistan from 2002 to 2009 was performed. Coagulopathy was defined as an international normalized ratio of ≥1.5 and shock as a base deficit of ≥6. Laboratory values were measured on admission. Primary outcome was inhospital mortality. Univariate analyses were performed on all admission variables followed by reverse stepwise multivariate logistic regression to determine independent associations. Combat support hospitals in Iraq and Afghanistan. Patients <18 yrs of age with Injury Severity Score, international normalized ratio, base deficit, and inhospital mortality were included. Of 1998 in the cohort, 744 (37%) had a complete set of data for analysis. None. The incidence of early coagulopathy and shock were 27% and 38.3% and associated with mortality of 22% and 16.8%, respectively. After multivariate logistic regression, early coagulopathy had an odds ratio of 2.2 (95% confidence interval 1.1-4.5) and early shock had an odds ratio of 3.0 (95% confidence interval 1.2-7.5) for mortality. Patients with coagulopathy and shock had an odds ratio of 3.8 (95% confidence interval 2.0-7.4) for mortality. In children with traumatic injuries treated at combat support hospitals, coagulopathy and shock on admission are common and independently associated with a high incidence of inhospital mortality. Future studies are needed to determine whether more rapid and accurate methods of measuring coagulopathy and shock as well as if early goal-directed treatment of these states can improve outcomes in children.
Prediction of sickness absence: development of a screening instrument
Duijts, S F A; Kant, IJ; Landeweerd, J A; Swaen, G M H
2006-01-01
Objectives To develop a concise screening instrument for early identification of employees at risk for sickness absence due to psychosocial health complaints. Methods Data from the Maastricht Cohort Study on “Fatigue at Work” were used to identify items to be associated with an increased risk of sickness absence. The analytical procedures univariate logistic regression, backward stepwise linear regression, and multiple logistic regression were successively applied. For both men and women, sum scores were calculated, and sensitivity and specificity rates of different cut‐off points on the screening instrument were defined. Results In women, results suggested that feeling depressed, having a burnout, being tired, being less interested in work, experiencing obligatory change in working days, and living alone, were strong predictors of sickness absence due to psychosocial health complaints. In men, statistically significant predictors were having a history of sickness absence, compulsive thinking, being mentally fatigued, finding it hard to relax, lack of supervisor support, and having no hobbies. A potential cut‐off point of 10 on the screening instrument resulted in a sensitivity score of 41.7% for women and 38.9% for men, and a specificity score of 91.3% for women and 90.6% for men. Conclusions This study shows that it is possible to identify predictive factors for sickness absence and to develop an instrument for early identification of employees at risk for sickness absence. The results of this study increase the possibility for both employers and policymakers to implement interventions directed at the prevention of sickness absence. PMID:16698807
Quirke, Michael; Curran, Emma May; O'Kelly, Patrick; Moran, Ruth; Daly, Eimear; Aylward, Seamus; McElvaney, Gerry; Wakai, Abel
2018-01-01
To measure the percentage rate and risk factors for amendment in the type, duration and setting of outpatient parenteral antimicrobial therapy ( OPAT) for the treatment of cellulitis. A retrospective cohort study of adult patients receiving OPAT for cellulitis was performed. Treatment amendment (TA) was defined as hospital admission or change in antibiotic therapy in order to achieve clinical response. Multivariable logistic regression (MVLR) and classification and regression tree (CART) analysis were performed. There were 307 patients enrolled. TA occurred in 36 patients (11.7%). Significant risk factors for TA on MVLR were increased age, increased Numerical Pain Scale Score (NPSS) and immunocompromise. The median OPAT duration was 7 days. Increased age, heart rate and C reactive protein were associated with treatment prolongation. CART analysis selected age <64.5 years, female gender and NPSS <2.5 in the final model, generating a low-sensitivity (27.8%), high-specificity (97.1%) decision tree. Increased age, NPSS and immunocompromise were associated with OPAT amendment. These identified risk factors can be used to support an evidence-based approach to patient selection for OPAT in cellulitis. The CART algorithm has good specificity but lacks sensitivity and is shown to be inferior in this study to logistic regression modelling. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Separation in Logistic Regression: Causes, Consequences, and Control.
Mansournia, Mohammad Ali; Geroldinger, Angelika; Greenland, Sander; Heinze, Georg
2018-04-01
Separation is encountered in regression models with a discrete outcome (such as logistic regression) where the covariates perfectly predict the outcome. It is most frequent under the same conditions that lead to small-sample and sparse-data bias, such as presence of a rare outcome, rare exposures, highly correlated covariates, or covariates with strong effects. In theory, separation will produce infinite estimates for some coefficients. In practice, however, separation may be unnoticed or mishandled because of software limits in recognizing and handling the problem and in notifying the user. We discuss causes of separation in logistic regression and describe how common software packages deal with it. We then describe methods that remove separation, focusing on the same penalized-likelihood techniques used to address more general sparse-data problems. These methods improve accuracy, avoid software problems, and allow interpretation as Bayesian analyses with weakly informative priors. We discuss likelihood penalties, including some that can be implemented easily with any software package, and their relative advantages and disadvantages. We provide an illustration of ideas and methods using data from a case-control study of contraceptive practices and urinary tract infection.
NASA Astrophysics Data System (ADS)
Nong, Yu; Du, Qingyun; Wang, Kun; Miao, Lei; Zhang, Weiwei
2008-10-01
Urban growth modeling, one of the most important aspects of land use and land cover change study, has attracted substantial attention because it helps to comprehend the mechanisms of land use change thus helps relevant policies made. This study applied multinomial logistic regression to model urban growth in the Jiayu county of Hubei province, China to discover the relationship between urban growth and the driving forces of which biophysical and social-economic factors are selected as independent variables. This type of regression is similar to binary logistic regression, but it is more general because the dependent variable is not restricted to two categories, as those previous studies did. The multinomial one can simulate the process of multiple land use competition between urban land, bare land, cultivated land and orchard land. Taking the land use type of Urban as reference category, parameters could be estimated with odds ratio. A probability map is generated from the model to predict where urban growth will occur as a result of the computation.
Mollon, Lea; Bhattacharjee, Sandipan
2017-12-04
Little is known regarding the health-related quality of life among myocardial infarction (MI) survivors in the United States. The purpose of this population-based study was to identify differences in health-related quality of life domains between MI survivors and propensity score matched controls. This retrospective, cross-sectional matched case-control study examined differences in health-related quality of life (HRQoL) among MI survivors of myocardial infarction compared to propensity score matched controls using data from the 2015 Behavioral Risk Factor Surveillance System (BRFSS) survey. Propensity scores were generated via logistic regression for MI survivors and controls based on gender, race/ethnicity, age, body mass index (BMI), smoking status, and comorbidities. Chi-square tests were used to compare differences between MI survivors to controls for demographic variables. A multivariate analysis of HRQoL domains estimated odds ratios. Life satisfaction, sleep quality, and activity limitations were estimated using binary logistic regression. Social support, perceived general health, perceived physical health, and perceived mental health were estimated using multinomial logistic regression. Significance was set at p < 0.05. The final sample consisted of 16,729 MI survivors matched to 50,187 controls (n = 66,916). Survivors were approximately 2.7 times more likely to report fair/poor general health compared to control (AOR = 2.72, 95% CI: 2.43-3.05) and 1.5 times more likely to report limitations to daily activities (AOR = 1.46, 95% CI: 1.34-1.59). Survivors were more likely to report poor physical health >15 days in the month (AOR = 1.63, 95% CI: 1.46-1.83) and poor mental health >15 days in the month (AOR = 1.25, 95% CI: 1.07-1.46) compared to matched controls. There was no difference in survivors compared to controls in level of emotional support (rarely/never: AOR = 0.75, 95% CI: 0.48-1.18; sometimes: AOR = 0.73, 95% CI: 0.41-1.28), hours of recommended sleep (AOR = 1.14, 95% CI: 0.94-1.38), or life satisfaction (AOR = 1.62, 95% CI: 0.99-2.63). MI survivors experienced lower HRQoL on domains of general health, physical health, daily activity, and mental health compared to the general population.
Shi, Xiao; Zhang, Ting-Ting; Hu, Wei-Ping; Ji, Qing-Hai
2017-04-25
The relationship between marital status and oral cavity squamous cell carcinoma (OCSCC) survival has not been explored. The objective of our study was to evaluate the impact of marital status on OCSCC survival and investigate the potential mechanisms. Married patients had better 5-year cancer-specific survival (CSS) (66.7% vs 54.9%) and 5-year overall survival (OS) (56.0% vs 41.1%). In multivariate Cox regression models, unmarried patients also showed higher mortality risk for both CSS (Hazard Ratio [HR]: 1.260, 95% confidence interval (CI): 1.187-1.339, P < 0.001) and OS (HR: 1.328, 95% CI: 1.266-1.392, P < 0.001). Multivariate logistic regression showed married patients were more likely to be diagnosed at earlier stage (P < 0.001) and receive surgery (P < 0.001). Married patients still demonstrated better prognosis in the 1:1 matched group analysis (CSS: 62.9% vs 60.8%, OS: 52.3% vs 46.5%). 11022 eligible OCSCC patients were identified from Surveillance, Epidemiology, and End Results (SEER) database, including 5902 married and 5120 unmarried individuals. Kaplan-Meier analysis, Log-rank test and Cox proportional hazards regression model were used to analyze survival and mortality risk. Influence of marital status on stage, age at diagnosis and selection of treatment was determined by binomial and multinomial logistic regression. Propensity score matching method was adopted to perform a 1:1 matched cohort. Marriage has an independently protective effect on OCSCC survival. Earlier diagnosis and more sufficient treatment are possible explanations. Besides, even after 1:1 matching, survival advantage of married group still exists, indicating that spousal support from other aspects may also play an important role.
Shi, Xiao; Zhang, Ting-ting; Hu, Wei-ping; Ji, Qing-hai
2017-01-01
Background The relationship between marital status and oral cavity squamous cell carcinoma (OCSCC) survival has not been explored. The objective of our study was to evaluate the impact of marital status on OCSCC survival and investigate the potential mechanisms. Results Married patients had better 5-year cancer-specific survival (CSS) (66.7% vs 54.9%) and 5-year overall survival (OS) (56.0% vs 41.1%). In multivariate Cox regression models, unmarried patients also showed higher mortality risk for both CSS (Hazard Ratio [HR]: 1.260, 95% confidence interval (CI): 1.187–1.339, P < 0.001) and OS (HR: 1.328, 95% CI: 1.266–1.392, P < 0.001). Multivariate logistic regression showed married patients were more likely to be diagnosed at earlier stage (P < 0.001) and receive surgery (P < 0.001). Married patients still demonstrated better prognosis in the 1:1 matched group analysis (CSS: 62.9% vs 60.8%, OS: 52.3% vs 46.5%). Materials and Methods 11022 eligible OCSCC patients were identified from Surveillance, Epidemiology, and End Results (SEER) database, including 5902 married and 5120 unmarried individuals. Kaplan-Meier analysis, Log-rank test and Cox proportional hazards regression model were used to analyze survival and mortality risk. Influence of marital status on stage, age at diagnosis and selection of treatment was determined by binomial and multinomial logistic regression. Propensity score matching method was adopted to perform a 1:1 matched cohort. Conclusions Marriage has an independently protective effect on OCSCC survival. Earlier diagnosis and more sufficient treatment are possible explanations. Besides, even after 1:1 matching, survival advantage of married group still exists, indicating that spousal support from other aspects may also play an important role. PMID:28415710
Ghaddar, Suad; Brown, Cynthia J; Pagán, José A; Díaz, Violeta
2010-09-01
To explore the relationship between acculturation and healthy lifestyle habits in the largely Hispanic populations living in underserved communities in the United States of America along the U.S.-Mexico border. A cross-sectional study was conducted from April 2006 to June 2008 using survey data from the Alliance for a Healthy Border, a program designed to reduce health disparities in the U.S.-Mexico border region by funding nutrition and physical activity education programs at 12 federally qualified community health centers in Arizona, California, New Mexico, and Texas. The survey included questions on acculturation, diet, exercise, and demographic factors and was completed by 2,381 Alliance program participants, of whom 95.3% were Hispanic and 45.4% were under the U.S. poverty level for 2007. Chi-square (χ2) and Student's t tests were used for bivariate comparisons between acculturation and dietary and physical activity measures. Linear regression and binary logistic regression were used to control for factors associated with nutrition and exercise. Based on univariate tests and confirmed by regression analysis controlling for sociodemographic and health variables, less acculturated survey respondents reported a significantly higher frequency of fruit and vegetable consumption and healthier dietary habits than those who were more acculturated. Adjusted binary logistic regression confirmed that individuals with low language acculturation were less likely to engage in physical activity than those with moderate to high acculturation (odds ratio 0.75, 95% confidence interval 0.59-0.95). Findings confirmed an association between acculturation and healthy lifestyle habits and supported the hypothesis that acculturation in border community populations tends to decrease the practice of some healthy dietary habits while increasing exposure to and awareness of the importance of other healthy behaviors.
Fan, Yin-Guang; Xiao, Qin; Wang, Qian; Li, Wen-Xian; Dong, Ma-Xia; Ye, Dong-Qing
2008-03-01
To explore the relationships between quality of life, negative life events, social support and suicide ideation among undergraduates in colleges. 3517 undergraduates in colleges were recruited by multistage stratified random clustered sampling method. Factors associated with suicide ideation were analyzed with logistic regression by scores of Beck Scale for Suicide Ideation(BSSI), Generic Quality of Life Inventory (GQOLI), Adolescent Self-rate Life Events Checklist (ASLEC), Social Support Rating Scale (SSRS) and a questionnaire on background information. The rate of suicide ideation within 7 days was 14.1%, especially in females (15.96%), with single parent (23.79%) and disabled undergraduates (25.00%). The primary risk factors for suicide ideation were with low psychological function, material life, family/social support, lower availability of support and more negative life events. The prevalence of suicide ideation among these undergraduates was high, appropriate measures focusing on these risk factors should be implemented.
Extended family and friendship support and suicidality among African Americans
Taylor, Robert Joseph; Chatters, Linda M.; Taylor, Harry Owen; Lincoln, Karen D.; Mitchell, Uchechi A.
2016-01-01
Purpose This study examined the relationship between informal social support from extended family and friends and suicidality among African Americans. Methods Logistic regression analysis was based on a nationally representative sample of African Americans from the National Survey of American Life (N = 3263). Subjective closeness and frequency of contact with extended family and friends and negative family interaction were examined in relation to lifetime suicide ideation and attempts. Results Subjective closeness to family and frequency of contact with friends were negatively associated with suicide ideation and attempts. Subjective closeness to friends and negative family interaction were positively associated with suicide ideation and attempts. Significant interactions between social support and negative interaction showed that social support buffers against the harmful effects of negative interaction on suicidality. Conclusions Findings are discussed in relation to the functions of positive and negative social ties in suicidality. PMID:27838732
Working women making it work: intimate partner violence, employment, and workplace support.
Swanberg, Jennifer; Macke, Caroline; Logan, T K
2007-03-01
Partner violence may have significant consequences on women's employment, yet limited information is available about how women cope on the job with perpetrators' tactics and the consequences of her coping methods on employment status. This article investigates whether there is an association between workplace disclosure of victimization and current employment status; and whether there is an association between receiving workplace support and current employment status among women who disclosed victimization circumstances to someone at work. Using a sample of partner victimized women who were employed within the past year (N = 485), cross-tabulation and ANOVA procedures were conducted to examine the differences between currently employed and unemployed women. Binary logistic regressions were conducted to examine whether disclosure and receiving workplace support were significantly associated with current employment. Results indicate that disclosure and workplace support are associated with employment. Implications for clinical practice, workplace policies, and future research are discussed.
Wu, Lin-Na; Yang, Guo-Yun; Ge, Ning
2013-03-01
To investigate the influence of depression, social supports and quality of sleep and quality of life on old women who were 60 years or older and postmenopause with coronary heart disease. 125 old women with coronary heart disease completed questionnaires of Seattle Angina Questionnaire (SAQ), Social Support Scale (SSRS) and Self-rating Depression Scale (SDS). Logistic regression analysis and Spearman correlation analysis were performed to evaluate the relationship between social-psycological factors and quality of life. 120 of questionnaires wereeffective (representing 96% of all collected questionnaires). Regression analysis showed that marital status (OR = 2.450), education (OR = 0.520), income (OR = 19.541) and course of disease (OR = 0.309) were associated with QOL in CHD (P < 0.05). Spearman analysis demonstrated that there were negative correlations between SQA score and PSQI and depression scores (r = -0.771, P < 0.01; r = -0.703, P < 0.05); and positive correlation between SQA score and Social support score (r = 0.565, P < 0.05). Social-psychological factors might influence the quality of life in old women with coronary heart disease, it is important that physicians pay attention to these factors when they treat old women with coronary heart disease.
Factors Associated With Caregivers' Resilience in a Terminal Cancer Care Setting.
Hwang, In Cheol; Kim, Young Sung; Lee, Yong Joo; Choi, Youn Seon; Hwang, Sun Wook; Kim, Hyo Min; Koh, Su-Jin
2018-04-01
Resilience implies characteristics such as self-efficacy, adaptability to change, optimism, and the ability to recover from traumatic stress. Studies on resilience in family caregivers (FCs) of patients with terminal cancer are rare. This study aims to examine the factors associated with FCs' resilience in a terminal cancer care setting. This is a cross-sectional study of 273 FCs from 7 hospice and palliative care units in Korea. Resilience was categorized as high and low, and factors associated with resilience were grouped or categorized into subscales. A multivariate logistic regression analysis was used to examine relevant factors. High FCs' resilience was significantly associated with FCs' health status, depression, and social support. In a multivariate regression model, FCs' perception of good health (adjusted odds ratio [aOR] = 2.26, 95% confidence interval [CI] = 1.16-4.40), positive social support (aOR = 3.70, 95% CI = 1.07-12.87), and absence of depression (aOR = 3.12, 95% CI = 1.59-6.13) remained significantly associated with high FCs' resilience. Lack of family support is associated with and may be a cause of diminished resilience. And more concern should be paid to FCs to improve FCs' health and emotional status. Education programs might be effective for improving caregivers' resilience. Further research with supportive interventions is indicated.
Logistic Mixed Models to Investigate Implicit and Explicit Belief Tracking.
Lages, Martin; Scheel, Anne
2016-01-01
We investigated the proposition of a two-systems Theory of Mind in adults' belief tracking. A sample of N = 45 participants predicted the choice of one of two opponent players after observing several rounds in an animated card game. Three matches of this card game were played and initial gaze direction on target and subsequent choice predictions were recorded for each belief task and participant. We conducted logistic regressions with mixed effects on the binary data and developed Bayesian logistic mixed models to infer implicit and explicit mentalizing in true belief and false belief tasks. Although logistic regressions with mixed effects predicted the data well a Bayesian logistic mixed model with latent task- and subject-specific parameters gave a better account of the data. As expected explicit choice predictions suggested a clear understanding of true and false beliefs (TB/FB). Surprisingly, however, model parameters for initial gaze direction also indicated belief tracking. We discuss why task-specific parameters for initial gaze directions are different from choice predictions yet reflect second-order perspective taking.
Model selection for logistic regression models
NASA Astrophysics Data System (ADS)
Duller, Christine
2012-09-01
Model selection for logistic regression models decides which of some given potential regressors have an effect and hence should be included in the final model. The second interesting question is whether a certain factor is heterogeneous among some subsets, i.e. whether the model should include a random intercept or not. In this paper these questions will be answered with classical as well as with Bayesian methods. The application show some results of recent research projects in medicine and business administration.
Radiomorphometric analysis of frontal sinus for sex determination.
Verma, Saumya; Mahima, V G; Patil, Karthikeya
2014-09-01
Sex determination of unknown individuals carries crucial significance in forensic research, in cases where fragments of skull persist with no likelihood of identification based on dental arch. In these instances sex determination becomes important to rule out certain number of possibilities instantly and helps in establishing a biological profile of human remains. The aim of the study is to evaluate a mathematical method based on logistic regression analysis capable of ascertaining the sex of individuals in the South Indian population. The study was conducted in the department of Oral Medicine and Radiology. The right and left areas, maximum height, width of frontal sinus were determined in 100 Caldwell views of 50 women and 50 men aged 20 years and above, with the help of Vernier callipers and a square grid with 1 square measuring 1mm(2) in area. Student's t-test, logistic regression analysis. The mean values of variables were greater in men, based on Student's t-test at 5% level of significance. The mathematical model based on logistic regression analysis gave percentage agreement of total area to correctly predict the female gender as 55.2%, of right area as 60.9% and of left area as 55.2%. The areas of the frontal sinus and the logistic regression proved to be unreliable in sex determination. (Logit = 0.924 - 0.00217 × right area).
Genetic prediction of type 2 diabetes using deep neural network.
Kim, J; Kim, J; Kwak, M J; Bajaj, M
2018-04-01
Type 2 diabetes (T2DM) has strong heritability but genetic models to explain heritability have been challenging. We tested deep neural network (DNN) to predict T2DM using the nested case-control study of Nurses' Health Study (3326 females, 45.6% T2DM) and Health Professionals Follow-up Study (2502 males, 46.5% T2DM). We selected 96, 214, 399, and 678 single-nucleotide polymorphism (SNPs) through Fisher's exact test and L1-penalized logistic regression. We split each dataset randomly in 4:1 to train prediction models and test their performance. DNN and logistic regressions showed better area under the curve (AUC) of ROC curves than the clinical model when 399 or more SNPs included. DNN was superior than logistic regressions in AUC with 399 or more SNPs in male and 678 SNPs in female. Addition of clinical factors consistently increased AUC of DNN but failed to improve logistic regressions with 214 or more SNPs. In conclusion, we show that DNN can be a versatile tool to predict T2DM incorporating large numbers of SNPs and clinical information. Limitations include a relatively small number of the subjects mostly of European ethnicity. Further studies are warranted to confirm and improve performance of genetic prediction models using DNN in different ethnic groups. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Austin, Peter C
2010-04-22
Multilevel logistic regression models are increasingly being used to analyze clustered data in medical, public health, epidemiological, and educational research. Procedures for estimating the parameters of such models are available in many statistical software packages. There is currently little evidence on the minimum number of clusters necessary to reliably fit multilevel regression models. We conducted a Monte Carlo study to compare the performance of different statistical software procedures for estimating multilevel logistic regression models when the number of clusters was low. We examined procedures available in BUGS, HLM, R, SAS, and Stata. We found that there were qualitative differences in the performance of different software procedures for estimating multilevel logistic models when the number of clusters was low. Among the likelihood-based procedures, estimation methods based on adaptive Gauss-Hermite approximations to the likelihood (glmer in R and xtlogit in Stata) or adaptive Gaussian quadrature (Proc NLMIXED in SAS) tended to have superior performance for estimating variance components when the number of clusters was small, compared to software procedures based on penalized quasi-likelihood. However, only Bayesian estimation with BUGS allowed for accurate estimation of variance components when there were fewer than 10 clusters. For all statistical software procedures, estimation of variance components tended to be poor when there were only five subjects per cluster, regardless of the number of clusters.
Product unit neural network models for predicting the growth limits of Listeria monocytogenes.
Valero, A; Hervás, C; García-Gimeno, R M; Zurera, G
2007-08-01
A new approach to predict the growth/no growth interface of Listeria monocytogenes as a function of storage temperature, pH, citric acid (CA) and ascorbic acid (AA) is presented. A linear logistic regression procedure was performed and a non-linear model was obtained by adding new variables by means of a Neural Network model based on Product Units (PUNN). The classification efficiency of the training data set and the generalization data of the new Logistic Regression PUNN model (LRPU) were compared with Linear Logistic Regression (LLR) and Polynomial Logistic Regression (PLR) models. 92% of the total cases from the LRPU model were correctly classified, an improvement on the percentage obtained using the PLR model (90%) and significantly higher than the results obtained with the LLR model, 80%. On the other hand predictions of LRPU were closer to data observed which permits to design proper formulations in minimally processed foods. This novel methodology can be applied to predictive microbiology for describing growth/no growth interface of food-borne microorganisms such as L. monocytogenes. The optimal balance is trying to find models with an acceptable interpretation capacity and with good ability to fit the data on the boundaries of variable range. The results obtained conclude that these kinds of models might well be very a valuable tool for mathematical modeling.
Lacagnina, Valerio; Leto-Barone, Maria S; La Piana, Simona; Seidita, Aurelio; Pingitore, Giuseppe; Di Lorenzo, Gabriele
2014-01-01
This article uses the logistic regression model for diagnostic decision making in patients with chronic nasal symptoms. We studied the ability of the logistic regression model, obtained by the evaluation of a database, to detect patients with positive allergy skin-prick test (SPT) and patients with negative SPT. The model developed was validated using the data set obtained from another medical institution. The analysis was performed using a database obtained from a questionnaire administered to the patients with nasal symptoms containing personal data, clinical data, and results of allergy testing (SPT). All variables found to be significantly different between patients with positive and negative SPT (p < 0.05) were selected for the logistic regression models and were analyzed with backward stepwise logistic regression, evaluated with area under the curve of the receiver operating characteristic curve. A second set of patients from another institution was used to prove the model. The accuracy of the model in identifying, over the second set, both patients whose SPT will be positive and negative was high. The model detected 96% of patients with nasal symptoms and positive SPT and classified 94% of those with negative SPT. This study is preliminary to the creation of a software that could help the primary care doctors in a diagnostic decision making process (need of allergy testing) in patients complaining of chronic nasal symptoms.
De Marco, Molly; Thorburn, Sheryl
2009-11-01
Millions of US households experienced food insecurity in 2005. Research indicates that low wages and little social support contribute to food insecurity. The present study aimed to examine whether social support moderates the relationship between income and food insecurity. Using a mail survey, we collected data on social support sources (social network, intimate partner and community) and social support functions from a social network (instrumental, informational and emotional). We used hierarchical logistic regression to examine the potential moderation of various measures of social support on the relationship between income and food insecurity, adjusting for potential confounding variables. Oregon, USA. A stratified random sample of Oregonians aged 18-64 years (n 343). We found no evidence of an association between social support and food insecurity, nor any evidence that social support acts as a moderator between income and food insecurity, regardless of the measure of social support used. Although previous research suggested that social support could offset the negative impact of low income on food security, our study did not find support for such an effect.
77 FR 32598 - 36(b)(1) Arms Sales Notification
Federal Register 2010, 2011, 2012, 2013, 2014
2012-06-01
... engineering and logistics support services, and other related elements of logistics support. (iv) Military.... Government and contractor engineering and logistics support services, and other related elements of logistics... acceptable military balance in the area. The Republic of Korea (ROK) intends to use the HARPOON Block II...
Zhang, Dongdong; Chen, Ling; Yin, Dan; Miao, Jinping; Sun, Yehuan
2014-07-01
To explore the correlation between suicide ideation and family function & negative life events, as well as other influential factors in adolescents, thus present a theoretical base for clinicians and school staff to develop intervention for those problems. By adopting current situation random sampling method, Self-Rating Idea of Suicide Scale, Adolescent Self-Rating Life Events Check List and Family APGAR Index were used to assess adolescents at random in a hygiene vocational school in Changzhou City, Jiangsu Province and a collage in Wuhu City, Anhui Province. 3700 questionnaires were granted, 3675 questionnaires were collected, among which 3620 were valid. Chi-square test, t-test, and univariate logistic regression were employed in univariate analysis, multivariate logistic regression was used in multivariate analysis. The detection rate of suicide ideation is 7.0%, and the top five suicide ideation characteristics were: poor academic performance (33.6%), serious family functional impairment (25.8%), lower-middle academic performance (11.7%), bad economic conditions (10.8%) and study in Grade Three (9.9%). Multiple logistic regression showed that the following three high-level stress amount in negative life events are most crucial for suicide ideation. They are "relationships" (OR = 1.135, 95% CI 1.071 - 1. 202), "academic pressure" (OR = 1.169, 95% CI 1.101 - 1.241), and "external events" (OR = 1.278, 95% CI 1.187 - 1.376). What' s more, the stress of attending higher grades (OR = 1.980, 95% CI 1.302 - 3.008), poor academic performance (OR = 7.206, 95% CI 1.745 - 9.789), moderate family functional impairment (OR = 2.562, 95% CI 1.527 - 2.892) and its serious level (OR = 8.287, 95% CI 3.154 - 6.917) are also influential factors for suicide ideation. Severe family functional impairment and high-level stress amount of negative life events produced the main factors of suicide ideation. Therefore, necessary and sufficient support should be given to adolescents by families and schools.
Real, J; Cleries, R; Forné, C; Roso-Llorach, A; Martínez-Sánchez, J M
In medicine and biomedical research, statistical techniques like logistic, linear, Cox and Poisson regression are widely known. The main objective is to describe the evolution of multivariate techniques used in observational studies indexed in PubMed (1970-2013), and to check the requirements of the STROBE guidelines in the author guidelines in Spanish journals indexed in PubMed. A targeted PubMed search was performed to identify papers that used logistic linear Cox and Poisson models. Furthermore, a review was also made of the author guidelines of journals published in Spain and indexed in PubMed and Web of Science. Only 6.1% of the indexed manuscripts included a term related to multivariate analysis, increasing from 0.14% in 1980 to 12.3% in 2013. In 2013, 6.7, 2.5, 3.5, and 0.31% of the manuscripts contained terms related to logistic, linear, Cox and Poisson regression, respectively. On the other hand, 12.8% of journals author guidelines explicitly recommend to follow the STROBE guidelines, and 35.9% recommend the CONSORT guideline. A low percentage of Spanish scientific journals indexed in PubMed include the STROBE statement requirement in the author guidelines. Multivariate regression models in published observational studies such as logistic regression, linear, Cox and Poisson are increasingly used both at international level, as well as in journals published in Spanish. Copyright © 2015 Sociedad Española de Médicos de Atención Primaria (SEMERGEN). Publicado por Elsevier España, S.L.U. All rights reserved.
2011-01-01
Introduction Necrotizing fasciitis (NF) is a life threatening infectious disease with a high mortality rate. We carried out a microbiological characterization of the causative pathogens. We investigated the correlation of mortality in NF with bloodstream infection and with the presence of co-morbidities. Methods In this retrospective study, we analyzed 323 patients who presented with necrotizing fasciitis at two different institutions. Bloodstream infection (BSI) was defined as a positive blood culture result. The patients were categorized as survivors and non-survivors. Eleven clinically important variables which were statistically significant by univariate analysis were selected for multivariate regression analysis and a stepwise logistic regression model was developed to determine the association between BSI and mortality. Results Univariate logistic regression analysis showed that patients with hypotension, heart disease, liver disease, presence of Vibrio spp. in wound cultures, presence of fungus in wound cultures, and presence of Streptococcus group A, Aeromonas spp. or Vibrio spp. in blood cultures, had a significantly higher risk of in-hospital mortality. Our multivariate logistic regression analysis showed a higher risk of mortality in patients with pre-existing conditions like hypotension, heart disease, and liver disease. Multivariate logistic regression analysis also showed that presence of Vibrio spp in wound cultures, and presence of Streptococcus Group A in blood cultures were associated with a high risk of mortality while debridement > = 3 was associated with improved survival. Conclusions Mortality in patients with necrotizing fasciitis was significantly associated with the presence of Vibrio in wound cultures and Streptococcus group A in blood cultures. PMID:21693053
Prediction of siRNA potency using sparse logistic regression.
Hu, Wei; Hu, John
2014-06-01
RNA interference (RNAi) can modulate gene expression at post-transcriptional as well as transcriptional levels. Short interfering RNA (siRNA) serves as a trigger for the RNAi gene inhibition mechanism, and therefore is a crucial intermediate step in RNAi. There have been extensive studies to identify the sequence characteristics of potent siRNAs. One such study built a linear model using LASSO (Least Absolute Shrinkage and Selection Operator) to measure the contribution of each siRNA sequence feature. This model is simple and interpretable, but it requires a large number of nonzero weights. We have introduced a novel technique, sparse logistic regression, to build a linear model using single-position specific nucleotide compositions which has the same prediction accuracy of the linear model based on LASSO. The weights in our new model share the same general trend as those in the previous model, but have only 25 nonzero weights out of a total 84 weights, a 54% reduction compared to the previous model. Contrary to the linear model based on LASSO, our model suggests that only a few positions are influential on the efficacy of the siRNA, which are the 5' and 3' ends and the seed region of siRNA sequences. We also employed sparse logistic regression to build a linear model using dual-position specific nucleotide compositions, a task LASSO is not able to accomplish well due to its high dimensional nature. Our results demonstrate the superiority of sparse logistic regression as a technique for both feature selection and regression over LASSO in the context of siRNA design.
Somebody to lean on: Social relationships predict post-treatment depression severity in adults.
Hallgren, Mats; Lundin, Andreas; Tee, Fwo Yi; Burström, Bo; Forsell, Yvonne
2017-03-01
Supportive social relationships can help protect against depression, but few studies have examined how social relationships influence the response to depression treatment. We examined longitudinal associations between the availability of social relationships and depression severity following a 12-week intervention. In total, 946 adults aged 18-71 years with mild-to-moderate depression were recruited from primary care centres across Sweden and treated for 12 weeks. The interventions included internet-based cognitive behavioural therapy (ICBT), 'usual care' (CBT or supportive counselling) and exercise. The primary outcome was the change in depression severity. The availability of social relationships were self-rated and based on the Interview Schedule for Social Interaction (ISSI). Prospective associations were explored using and logistic regression models. Participants with greater access to supportive social relationships reported larger improvements in depression compared to those with 'low' availability of relationships (β= -3.95, 95% CI= -5.49, -2.41, p< .01). Binary logistic models indicated a significantly better 'treatment response' (50% score reduction) in those reporting high compared to low availability of relationships (OR= 2.17, 95% CI= 1.40, 3.36, p< .01). Neither gender nor the type of treatment received moderated these effects. In conclusion, social relationships appear to play a key role in recovery from depression. Copyright © 2017 Elsevier Ireland Ltd. All rights reserved.
Is the influence of social support on mental health the same for immigrants and non-immigrants?
Puyat, Joseph H
2013-06-01
The association between social support and mental health across immigrant groups were examined in this study. A population-based sample was extracted from a 2009/10 Canadian community health survey. Self-reported mood or anxiety disorders and a standardized social support scale were used as outcome and explanatory variables. The association between these variables was measured using logistic regression controlling for sex, age, marital status, education, self-rated health and perceived stress. Stratified analyses were performed to test if the strength of association differed by immigrant status. In comparison with individuals who had moderate levels of social support, individuals with low social support had higher odds of reporting mental disorders and this association appeared strongest among recent immigrants. Using the same comparison group, individuals with high social support had lower odds of reporting mental disorders and this association appeared stronger among long-term immigrants. Findings were discussed within the context of immigration stress and acculturation strategies.
Santos, Frédéric; Guyomarc'h, Pierre; Bruzek, Jaroslav
2014-12-01
Accuracy of identification tools in forensic anthropology primarily rely upon the variations inherent in the data upon which they are built. Sex determination methods based on craniometrics are widely used and known to be specific to several factors (e.g. sample distribution, population, age, secular trends, measurement technique, etc.). The goal of this study is to discuss the potential variations linked to the statistical treatment of the data. Traditional craniometrics of four samples extracted from documented osteological collections (from Portugal, France, the U.S.A., and Thailand) were used to test three different classification methods: linear discriminant analysis (LDA), logistic regression (LR), and support vector machines (SVM). The Portuguese sample was set as a training model on which the other samples were applied in order to assess the validity and reliability of the different models. The tests were performed using different parameters: some included the selection of the best predictors; some included a strict decision threshold (sex assessed only if the related posterior probability was high, including the notion of indeterminate result); and some used an unbalanced sex-ratio. Results indicated that LR tends to perform slightly better than the other techniques and offers a better selection of predictors. Also, the use of a decision threshold (i.e. p>0.95) is essential to ensure an acceptable reliability of sex determination methods based on craniometrics. Although the Portuguese, French, and American samples share a similar sexual dimorphism, application of Western models on the Thai sample (that displayed a lower degree of dimorphism) was unsuccessful. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Ultrasound based computer-aided-diagnosis of kidneys for pediatric hydronephrosis
NASA Astrophysics Data System (ADS)
Cerrolaza, Juan J.; Peters, Craig A.; Martin, Aaron D.; Myers, Emmarie; Safdar, Nabile; Linguraru, Marius G.
2014-03-01
Ultrasound is the mainstay of imaging for pediatric hydronephrosis, though its potential as diagnostic tool is limited by its subjective assessment, and lack of correlation with renal function. Therefore, all cases showing signs of hydronephrosis undergo further invasive studies, like diuretic renogram, in order to assess the actual renal function. Under the hypothesis that renal morphology is correlated with renal function, a new ultrasound based computer-aided diagnosis (CAD) tool for pediatric hydronephrosis is presented. From 2D ultrasound, a novel set of morphological features of the renal collecting systems and the parenchyma, is automatically extracted using image analysis techniques. From the original set of features, including size, geometric and curvature descriptors, a subset of ten features are selected as predictive variables, combining a feature selection technique and area under the curve filtering. Using the washout half time (T1/2) as indicative of renal obstruction, two groups are defined. Those cases whose T1/2 is above 30 minutes are considered to be severe, while the rest would be in the safety zone, where diuretic renography could be avoided. Two different classification techniques are evaluated (logistic regression, and support vector machines). Adjusting the probability decision thresholds to operate at the point of maximum sensitivity, i.e., preventing any severe case be misclassified, specificities of 53%, and 75% are achieved, for the logistic regression and the support vector machine classifier, respectively. The proposed CAD system allows to establish a link between non-invasive non-ionizing imaging techniques and renal function, limiting the need for invasive and ionizing diuretic renography.
Fraccaro, Paolo; Nicolo, Massimo; Bonetto, Monica; Giacomini, Mauro; Weller, Peter; Traverso, Carlo Enrico; Prosperi, Mattia; OSullivan, Dympna
2015-01-27
To investigate machine learning methods, ranging from simpler interpretable techniques to complex (non-linear) "black-box" approaches, for automated diagnosis of Age-related Macular Degeneration (AMD). Data from healthy subjects and patients diagnosed with AMD or other retinal diseases were collected during routine visits via an Electronic Health Record (EHR) system. Patients' attributes included demographics and, for each eye, presence/absence of major AMD-related clinical signs (soft drusen, retinal pigment epitelium, defects/pigment mottling, depigmentation area, subretinal haemorrhage, subretinal fluid, macula thickness, macular scar, subretinal fibrosis). Interpretable techniques known as white box methods including logistic regression and decision trees as well as less interpreitable techniques known as black box methods, such as support vector machines (SVM), random forests and AdaBoost, were used to develop models (trained and validated on unseen data) to diagnose AMD. The gold standard was confirmed diagnosis of AMD by physicians. Sensitivity, specificity and area under the receiver operating characteristic (AUC) were used to assess performance. Study population included 487 patients (912 eyes). In terms of AUC, random forests, logistic regression and adaboost showed a mean performance of (0.92), followed by SVM and decision trees (0.90). All machine learning models identified soft drusen and age as the most discriminating variables in clinicians' decision pathways to diagnose AMD. Both black-box and white box methods performed well in identifying diagnoses of AMD and their decision pathways. Machine learning models developed through the proposed approach, relying on clinical signs identified by retinal specialists, could be embedded into EHR to provide physicians with real time (interpretable) support.
Turk, Tahir; Newton, Fiona; Choudhury, Sohel; Islam, Md Shafiqul
2018-06-01
Tobacco use contributes to an estimated 14.6% of male and 5.7% of female deaths in Bangladesh. We examine the determinants of tobacco-related quit attempts among Bangladeshis with and without awareness of the synergized "People Behind the Packs" (PBTP) communication campaign used to support the introduction of pack-based graphic warning labels (GWLs) in 2016. Data from 1,796 adults were collected using multistage sampling and a cross-sectional face-to-face survey. Analyses used a normalized design weight to ensure representativeness to the national population of smokers within Bangladesh. For the overall sample, the multivariable logistic regression model revealed quit attempts were associated with having seen the pack-based GWLs, recalling ≥1 PBTP campaign message, higher levels of self-efficacy to quit, and recognizing more potential side-effects associated with using tobacco products. Conversely, the likelihood of quitting attempts were lower among dual tobacco users (relative to smokers) and those using tobacco at least daily (vs. less than daily). The hierarchical multivariable logistic regression model among those aware of ≥1 PBTP campaign message indicated quitting attempts were positively associated with recalling more of the campaign messages and discussing them with others. This national evaluation of pack-based GWLs and accompanying PBTP campaign within Bangladesh supports the efficacy of using synergized communication messages when introducing such labels. That quit attempts are more likely among those discussing PBTP campaign messages with others and recalling more PBTP campaign messages highlights the importance of ensuring message content is both memorable and engaging.
The Columbus logistics support at the APMC: Requirements and implementation aspects
NASA Technical Reports Server (NTRS)
Canu, C.; Battocchio, L.; Masullo, S.
1993-01-01
This paper focuses on the logistics support to be provided by the APM Center (APMC). Among the Columbus ground infrastructures, this center is tasked to provide logistics, sustaining engineering and P/L integration support to the ongoing missions of the APM, i.e. the Columbus Laboratory attached to the Freedom Space Station. The following is illustrated: an analysis of the requirements that are levied on the logistics support of the APM; how such requirements are reflected in the corresponding support to be available on-ground and at APMC; the functional components of the APMC logistics support and how such components interact each other; how the logistics support function interfaces with the other functions of the ground support; and how the logistics support is being designed in terms of resources (such as hardware, data bases, etc.). Emphasis is given to the data handling aspects and to the related data bases that will constitute for the logistics activities the fundamental source of information during the APM planned lifetime. Functional and physical architectures, together with trades for possible implementation, are addressed. Commonalities with other centers are taken into account and recommendations are made for possible reuse of tools already developed in the C/D phase. Finally, programmatic considerations are discussed for the actual implementation of the center.
ISS Logistics Hardware Disposition and Metrics Validation
NASA Technical Reports Server (NTRS)
Rogers, Toneka R.
2010-01-01
I was assigned to the Logistics Division of the International Space Station (ISS)/Spacecraft Processing Directorate. The Division consists of eight NASA engineers and specialists that oversee the logistics portion of the Checkout, Assembly, and Payload Processing Services (CAPPS) contract. Boeing, their sub-contractors and the Boeing Prime contract out of Johnson Space Center, provide the Integrated Logistics Support for the ISS activities at Kennedy Space Center. Essentially they ensure that spares are available to support flight hardware processing and the associated ground support equipment (GSE). Boeing maintains a Depot for electrical, mechanical and structural modifications and/or repair capability as required. My assigned task was to learn project management techniques utilized by NASA and its' contractors to provide an efficient and effective logistics support infrastructure to the ISS program. Within the Space Station Processing Facility (SSPF) I was exposed to Logistics support components, such as, the NASA Spacecraft Services Depot (NSSD) capabilities, Mission Processing tools, techniques and Warehouse support issues, required for integrating Space Station elements at the Kennedy Space Center. I also supported the identification of near-term ISS Hardware and Ground Support Equipment (GSE) candidates for excessing/disposition prior to October 2010; and the validation of several Logistics Metrics used by the contractor to measure logistics support effectiveness.
Kusano, Kristofer; Gabler, Hampton C
2014-01-01
The odds of death for a seriously injured crash victim are drastically reduced if he or she received care at a trauma center. Advanced automated crash notification (AACN) algorithms are postcrash safety systems that use data measured by the vehicles during the crash to predict the likelihood of occupants being seriously injured. The accuracy of these models are crucial to the success of an AACN. The objective of this study was to compare the predictive performance of competing injury risk models and algorithms: logistic regression, random forest, AdaBoost, naïve Bayes, support vector machine, and classification k-nearest neighbors. This study compared machine learning algorithms to the widely adopted logistic regression modeling approach. Machine learning algorithms have not been commonly studied in the motor vehicle injury literature. Machine learning algorithms may have higher predictive power than logistic regression, despite the drawback of lacking the ability to perform statistical inference. To evaluate the performance of these algorithms, data on 16,398 vehicles involved in non-rollover collisions were extracted from the NASS-CDS. Vehicles with any occupants having an Injury Severity Score (ISS) of 15 or greater were defined as those requiring victims to be treated at a trauma center. The performance of each model was evaluated using cross-validation. Cross-validation assesses how a model will perform in the future given new data not used for model training. The crash ΔV (change in velocity during the crash), damage side (struck side of the vehicle), seat belt use, vehicle body type, number of events, occupant age, and occupant sex were used as predictors in each model. Logistic regression slightly outperformed the machine learning algorithms based on sensitivity and specificity of the models. Previous studies on AACN risk curves used the same data to train and test the power of the models and as a result had higher sensitivity compared to the cross-validated results from this study. Future studies should account for future data; for example, by using cross-validation or risk presenting optimistic predictions of field performance. Past algorithms have been criticized for relying on age and sex, being difficult to measure by vehicle sensors, and inaccuracies in classifying damage side. The models with accurate damage side and including age/sex did outperform models with less accurate damage side and without age/sex, but the differences were small, suggesting that the success of AACN is not reliant on these predictors.
Untas, Aurélie; Thumma, Jyothi; Rascle, Nicole; Rayner, Hugh; Mapes, Donna; Lopes, Antonio A; Fukuhara, Shunichi; Akizawa, Tadao; Morgenstern, Hal; Robinson, Bruce M; Pisoni, Ronald L; Combe, Christian
2011-01-01
This study aimed to investigate the influence of social support and other psychosocial factors on mortality, adherence to medical care recommendations, and physical quality of life among hemodialysis patients. Data on 32,332 hemodialysis patients enrolled in the Dialysis Outcomes and Practice Patterns Study (1996 to 2008) in 12 countries were analyzed. Social support and other psychosocial factors related to ESRD and its treatment were measured by patient self-reports of health interference with social activities, isolation, feeling like a burden, and support from family and dialysis staff. Cox regression and logistic regression were used to examine associations of baseline social support and other psychosocial factors with all-cause mortality and with other measured outcomes at baseline, adjusting for potential confounders. Mortality was higher among patients reporting that their health interfered with social activities, were isolated, felt like a burden, and were dissatisfied with family support. Poorer family support and several psychosocial measures also were associated with lower adherence to the prescribed hemodialysis length and the recommended weight gain between sessions. Some international differences were observed. Poorer self-reported social support and other psychosocial factors were associated with poor physical quality of life. Poorer social support and other psychosocial factors are associated with higher mortality risk, lower adherence to medical care, and poorer physical quality of life in hemodialysis patients. More research is needed to assess whether interventions to improve social support and other psychosocial factors will lengthen survival and enhance quality of life.
The impact of worksite supports for healthy eating on dietary behaviors
Dodson, Elizabeth A.; Hipp, J. Aaron; Gao, Mengchao; Tabak, Rachel G.; Yang, Lin; Brownson, Ross C.
2016-01-01
Objective The purpose of this study was to assess the availability of worksite supports (WSS) for healthy eating and examine associations between existing supports and dietary behaviors. Methods A cross-sectional, telephone-based study was conducted with 2013 participants in four metropolitan areas in 2012. Logistic regression was used to examine associations between dietary behaviors and the availability or use of WSS. Results Those reporting the availability of a cafeteria/snack bar/food services at the worksite were more likely to consume fruits and vegetables more than twice/day, and less likely to consume fast food more than twice/week. Conclusions Study results highlight the utility of specific WSS to improve employee dietary behaviors while raising questions about why the presence of healthy foods at the worksite may not translate into employee consumption of such foods. PMID:27414016
Correlates of Illicit Drug Use Among Indigenous Peoples in Canada: A Test of Social Support Theory.
Cao, Liqun; Burton, Velmer S; Liu, Liu
2018-02-01
Relying on a national stratified random sample of Indigenous peoples aged 19 years old and above in Canada, this study investigates the correlates of illicit drug use among Indigenous peoples, paying special attention to the association between social support measures and illegal drug use. Results from multivariate logistical regression show that measures of social support, such as residential mobility, strength of ties within communities, and lack of timely counseling, are statistically significant correlates of illicit drug use. Those identifying as Christian are significantly less likely to use illegal drugs. This is the first nationwide analysis of the illicit drug usage of Indigenous peoples in Canada. The results are robust because we have controlled for a range of comorbidity variables as well as a series of sociodemographic variables. Policy implications from these findings are discussed.
Boivin, Rémi; Leclerc, Chloé
2016-01-01
This article analyzes reported incidents of domestic violence according to the source of the complaint and whether the victim initially supported judicial action against the offender. Almost three quarters of incidents studied were reported by the victim (72%), and a little more than half of victims initially wanted to press charges (55%). Using multinomial logistic regression models, situational and individual factors are used to distinguish 4 incident profiles. Incidents in which the victim made the initial report to the police and wished to press charges are the most distinct and involve partners who were already separated at the time of the incident or had a history of domestic violence. The other profiles also show important differences.
Guo, Huey-Ming; Shyu, Yea-Ing Lotus; Chang, Her-Kun
2006-01-01
In this article, the authors provide an overview of a research method to predict quality of care in home health nursing data set. The results of this study can be visualized through classification an regression tree (CART) graphs. The analysis was more effective, and the results were more informative since the home health nursing dataset was analyzed with a combination of the logistic regression and CART, these two techniques complete each other. And the results more informative that more patients' characters were related to quality of care in home care. The results contributed to home health nurse predict patient outcome in case management. Improved prediction is needed for interventions to be appropriately targeted for improved patient outcome and quality of care.
A general framework for the use of logistic regression models in meta-analysis.
Simmonds, Mark C; Higgins, Julian Pt
2016-12-01
Where individual participant data are available for every randomised trial in a meta-analysis of dichotomous event outcomes, "one-stage" random-effects logistic regression models have been proposed as a way to analyse these data. Such models can also be used even when individual participant data are not available and we have only summary contingency table data. One benefit of this one-stage regression model over conventional meta-analysis methods is that it maximises the correct binomial likelihood for the data and so does not require the common assumption that effect estimates are normally distributed. A second benefit of using this model is that it may be applied, with only minor modification, in a range of meta-analytic scenarios, including meta-regression, network meta-analyses and meta-analyses of diagnostic test accuracy. This single model can potentially replace the variety of often complex methods used in these areas. This paper considers, with a range of meta-analysis examples, how random-effects logistic regression models may be used in a number of different types of meta-analyses. This one-stage approach is compared with widely used meta-analysis methods including Bayesian network meta-analysis and the bivariate and hierarchical summary receiver operating characteristic (ROC) models for meta-analyses of diagnostic test accuracy. © The Author(s) 2014.
2011-01-01
Background The relationship between asthma and traffic-related pollutants has received considerable attention. The use of individual-level exposure measures, such as residence location or proximity to emission sources, may avoid ecological biases. Method This study focused on the pediatric Medicaid population in Detroit, MI, a high-risk population for asthma-related events. A population-based matched case-control analysis was used to investigate associations between acute asthma outcomes and proximity of residence to major roads, including freeways. Asthma cases were identified as all children who made at least one asthma claim, including inpatient and emergency department visits, during the three-year study period, 2004-06. Individually matched controls were randomly selected from the rest of the Medicaid population on the basis of non-respiratory related illness. We used conditional logistic regression with distance as both categorical and continuous variables, and examined non-linear relationships with distance using polynomial splines. The conditional logistic regression models were then extended by considering multiple asthma states (based on the frequency of acute asthma outcomes) using polychotomous conditional logistic regression. Results Asthma events were associated with proximity to primary roads with an odds ratio of 0.97 (95% CI: 0.94, 0.99) for a 1 km increase in distance using conditional logistic regression, implying that asthma events are less likely as the distance between the residence and a primary road increases. Similar relationships and effect sizes were found using polychotomous conditional logistic regression. Another plausible exposure metric, a reduced form response surface model that represents atmospheric dispersion of pollutants from roads, was not associated under that exposure model. Conclusions There is moderately strong evidence of elevated risk of asthma close to major roads based on the results obtained in this population-based matched case-control study. PMID:21513554
Viswanathan, M; Pearl, D L; Taboada, E N; Parmley, E J; Mutschall, S K; Jardine, C M
2017-05-01
Using data collected from a cross-sectional study of 25 farms (eight beef, eight swine and nine dairy) in 2010, we assessed clustering of molecular subtypes of C. jejuni based on a Campylobacter-specific 40 gene comparative genomic fingerprinting assay (CGF40) subtypes, using unweighted pair-group method with arithmetic mean (UPGMA) analysis, and multiple correspondence analysis. Exact logistic regression was used to determine which genes differentiate wildlife and livestock subtypes in our study population. A total of 33 bovine livestock (17 beef and 16 dairy), 26 wildlife (20 raccoon (Procyon lotor), five skunk (Mephitis mephitis) and one mouse (Peromyscus spp.) C. jejuni isolates were subtyped using CGF40. Dendrogram analysis, based on UPGMA, showed distinct branches separating bovine livestock and mammalian wildlife isolates. Furthermore, two-dimensional multiple correspondence analysis was highly concordant with dendrogram analysis showing clear differentiation between livestock and wildlife CGF40 subtypes. Based on multilevel logistic regression models with a random intercept for farm of origin, we found that isolates in general, and raccoons more specifically, were significantly more likely to be part of the wildlife branch. Exact logistic regression conducted gene by gene revealed 15 genes that were predictive of whether an isolate was of wildlife or bovine livestock isolate origin. Both multiple correspondence analysis and exact logistic regression revealed that in most cases, the presence of a particular gene (13 of 15) was associated with an isolate being of livestock rather than wildlife origin. In conclusion, the evidence gained from dendrogram analysis, multiple correspondence analysis and exact logistic regression indicates that mammalian wildlife carry CGF40 subtypes of C. jejuni distinct from those carried by bovine livestock. Future studies focused on source attribution of C. jejuni in human infections will help determine whether wildlife transmit Campylobacter jejuni directly to humans. © 2016 Blackwell Verlag GmbH.
GIS-based spatial decision support system for grain logistics management
NASA Astrophysics Data System (ADS)
Zhen, Tong; Ge, Hongyi; Jiang, Yuying; Che, Yi
2010-07-01
Grain logistics is the important component of the social logistics, which can be attributed to frequent circulation and the great quantity. At present time, there is no modern grain logistics distribution management system, and the logistics cost is the high. Geographic Information Systems (GIS) have been widely used for spatial data manipulation and model operations and provide effective decision support through its spatial database management capabilities and cartographic visualization. In the present paper, a spatial decision support system (SDSS) is proposed to support policy makers and to reduce the cost of grain logistics. The system is composed of two major components: grain logistics goods tracking model and vehicle routing problem optimization model and also allows incorporation of data coming from external sources. The proposed system is an effective tool to manage grain logistics in order to increase the speed of grain logistics and reduce the grain circulation cost.
Pütter, Carolin; Pechlivanis, Sonali; Nöthen, Markus M; Jöckel, Karl-Heinz; Wichmann, Heinz-Erich; Scherag, André
2011-01-01
Genome-wide association studies have identified robust associations between single nucleotide polymorphisms and complex traits. As the proportion of phenotypic variance explained is still limited for most of the traits, larger and larger meta-analyses are being conducted to detect additional associations. Here we investigate the impact of the study design and the underlying assumption about the true genetic effect in a bimodal mixture situation on the power to detect associations. We performed simulations of quantitative phenotypes analysed by standard linear regression and dichotomized case-control data sets from the extremes of the quantitative trait analysed by standard logistic regression. Using linear regression, markers with an effect in the extremes of the traits were almost undetectable, whereas analysing extremes by case-control design had superior power even for much smaller sample sizes. Two real data examples are provided to support our theoretical findings and to explore our mixture and parameter assumption. Our findings support the idea to re-analyse the available meta-analysis data sets to detect new loci in the extremes. Moreover, our investigation offers an explanation for discrepant findings when analysing quantitative traits in the general population and in the extremes. Copyright © 2011 S. Karger AG, Basel.
Messersmith, Lisa J; Semrau, Katherine; Hammett, Theodore M; Phong, Nguyen Tuan; Tung, Nguyen Duy; Nguyen, Ha; Glandon, Douglas; Huong, Nguyen Mai; Anh, Hoang Tu
2013-01-01
In Vietnam, discrimination against people living with HIV/AIDS (PLHIV) is defined within and prohibited by the 2007 national HIV/AIDS law. Despite the law, PLHIV face discrimination in health care, employment, education and other spheres. This study presents the first national estimates of the levels and types of discrimination that are defined in Vietnamese law and experienced by PLHIV in Vietnam. A nationally representative sample of 1200 PLHIV was surveyed, and 129 PLHIV participated in focus group discussions (FGDs). In the last 12 months, nearly half of the survey population experienced at least one form of discrimination and many experienced up to six different types of discrimination. The most common forms of discrimination included disclosure of HIV status without consent; denial of access to education for children; loss of employment; advice, primarily from health care providers, to abstain from sex; and physical and emotional harm. In logistic regression analysis, the experience of discrimination differed by gender, region of residence and membership status in a PLHIV support group. The logistic regression and FGD results indicate that disclosure of HIV status without consent was associated with experiencing other forms of discrimination. Key programme and policy recommendations are discussed.
Zhao, Qian; Chen, Haoyang; Yan, Hongyan; He, Yan; Zhu, Li; Fu, WenTing; Shen, Biyu
2018-01-31
This study aimed (i) to complement existing research by focusing on body image disturbance issues in Chinese Systemic Lupus Erythematosus (SLE) patients; (ii) to investigate how Chinese patients make sense of disease diagnosis and perceived cultural influences within the context of their SLE. A total of 118 SLE patients underwent standardized laboratory examinations and completed several questionnaires. Independent sample t-test, Mann-Whitney U-test, Chi-square test, and multivariate analysis using backward stepwise logistic regression model were used to analyze these data. We found 18.3% SLE patients had BID, which were significantly higher than the control group (.8%). SLE patients are more concerned about their physical changes caused by disease. There were significant correlations among personal health insurance, complication of diabetes, appearance of new rash, depression, anxiety, self-esteem and BID in patients with SLE. Meanwhile, logistic regression analysis revealed that appearance of new rash and high anxiety were significantly associated with BID in SLE patients. In conclusion, it is beneficial to pay attention to the physical and mental health of patients with rheumatic disease from the perspective of body image, to understand their needs and to provide effective and effective service for them.
Classification of vegetation types in military region
NASA Astrophysics Data System (ADS)
Gonçalves, Miguel; Silva, Jose Silvestre; Bioucas-Dias, Jose
2015-10-01
In decision-making process regarding planning and execution of military operations, the terrain is a determining factor. Aerial photographs are a source of vital information for the success of an operation in hostile region, namely when the cartographic information behind enemy lines is scarce or non-existent. The objective of present work is the development of a tool capable of processing aerial photos. The methodology implemented starts with feature extraction, followed by the application of an automatic selector of features. The next step, using the k-fold cross validation technique, estimates the input parameters for the following classifiers: Sparse Multinomial Logist Regression (SMLR), K Nearest Neighbor (KNN), Linear Classifier using Principal Component Expansion on the Joint Data (PCLDC) and Multi-Class Support Vector Machine (MSVM). These classifiers were used in two different studies with distinct objectives: discrimination of vegetation's density and identification of vegetation's main components. It was found that the best classifier on the first approach is the Sparse Logistic Multinomial Regression (SMLR). On the second approach, the implemented methodology applied to high resolution images showed that the better performance was achieved by KNN classifier and PCLDC. Comparing the two approaches there is a multiscale issue, in which for different resolutions, the best solution to the problem requires different classifiers and the extraction of different features.
Eisen, Jane L; Coles, Meredith E; Shea, M Tracie; Pagano, Maria E; Stout, Robert L; Yen, Shirley; Grilo, Carlos M; Rasmussen, Steven A
2006-06-01
In this study we examined the convergence between obsessive-compulsive personality disorder (OCPD) criteria and obsessive-compulsive disorder (OCD). Baseline assessments of 629 participants of the Collaborative Longitudinal Personality Disorders Study were used to examine the associations between OCPD criteria and diagnoses of OCD. Three of the eight OCPD criteria--hoarding, perfectionism, and preoccupation with details--were significantly more frequent in subjects with OCD (n = 89) than in subjects without OCD (n = 540). Logistic regressions were used to predict the probability of each OCPD criterion as a function of Axis I diagnoses (OCD, additional anxiety disorders, and major depressive disorder). Associations between OCD and these three OCPD criteria remained significant in the logistic regressions, showing unique associations with OCD and odds ratios ranging from 2.71 to 2.99. In addition, other anxiety disorders and major depressive disorder showed few associations with specific OCPD criteria. This study suggests variability in the strength of the relationships between specific OCPD criteria and OCD. The findings also support a unique relationship between OCPD symptoms and OCD, compared to other anxiety disorders or major depression. Future efforts to explore the link between Axis I and Axis II disorders may be enriched by conducting analyses at the symptom level.
Eisen, Jane L.; Coles, Meredith E.; Shea, M. Tracie; Pagano, Maria E.; Stout, Robert L.; Yen, Shirley; Grilo, Carlos M.; Rasmussen, Steven A.
2008-01-01
In this study we examined the convergence between obsessive-compulsive personality disorder (OCPD) criteria and obsessive-compulsive disorder (OCD). Baseline assessments of 629 participants of the Collaborative Longitudinal Personality Disorders Study were used to examine the associations between OCPD criteria and diagnoses of OCD. Three of the eight OCPD criteria—hoarding, perfectionism, and preoccupation with details—were significantly more frequent in subjects with OCD (n = 89) than in subjects without OCD (n = 540). Logistic regressions were used to predict the probability of each OCPD criterion as a function of Axis I diagnoses (OCD, additional anxiety disorders, and major depressive disorder). Associations between OCD and these three OCPD criteria remained significant in the logistic regressions, showing unique associations with OCD and odds ratios ranging from 2.71 to 2.99. In addition, other anxiety disorders and major depressive disorder showed few associations with specific OCPD criteria. This study suggests variability in the strength of the relationships between specific OCPD criteria and OCD. The findings also support a unique relationship between OCPD symptoms and OCD, compared to other anxiety disorders or major depression. Future efforts to explore the link between Axis I and Axis II disorders may be enriched by conducting analyses at the symptom level. PMID:16776557
Swan, Emily; Bouwman, Laura; Hiddink, Gerrit Jan; Aarts, Noelle; Koelen, Maria
2015-06-01
Research has identified multiple factors that predict unhealthy eating practices. However what remains poorly understood are factors that promote healthy eating practices. This study aimed to determine a set of factors that represent a profile of healthy eaters. This research applied Antonovsky's salutogenic framework for health development to examine a set of factors that predict healthy eating in a cross-sectional study of Dutch adults. Data were analyzed from participants (n = 703) who completed the study's survey in January 2013. Logistic regression analysis was performed to test the association of survey factors on the outcome variable high dietary score. In the multivariate logistic regression model, five factors contributed significantly (p < .05) to the predictive ability of the overall model: being female; living with a partner; a strong sense of coherence (construct from the salutogenic framework), flexible restraint of eating, and self-efficacy for healthy eating. Findings complement what is already known of the factors that relate to poor eating practices. This can provide nutrition promotion with a more comprehensive picture of the factors that both support and hinder healthy eating practices. Future research should explore these factors to better understand their origins and mechanisms in relation to healthy eating practices. Copyright © 2015 Elsevier Ltd. All rights reserved.
Nematollahi, M; Akbari, R; Nikeghbalian, S; Salehnasab, C
2017-01-01
Kidney transplantation is the treatment of choice for patients with end-stage renal disease (ESRD). Prediction of the transplant survival is of paramount importance. The objective of this study was to develop a model for predicting survival in kidney transplant recipients. In a cross-sectional study, 717 patients with ESRD admitted to Nemazee Hospital during 2008-2012 for renal transplantation were studied and the transplant survival was predicted for 5 years. The multilayer perceptron of artificial neural networks (MLP-ANN), logistic regression (LR), Support Vector Machine (SVM), and evaluation tools were used to verify the determinant models of the predictions and determine the independent predictors. The accuracy, area under curve (AUC), sensitivity, and specificity of SVM, MLP-ANN, and LR models were 90.4%, 86.5%, 98.2%, and 49.6%; 85.9%, 76.9%, 97.3%, and 26.1%; and 84.7%, 77.4%, 97.5%, and 17.4%, respectively. Meanwhile, the independent predictors were discharge time creatinine level, recipient age, donor age, donor blood group, cause of ESRD, recipient hypertension after transplantation, and duration of dialysis before transplantation. SVM and MLP-ANN models could efficiently be used for determining survival prediction in kidney transplant recipients.
Pals, Regitze A S; Olesen, Kasper; Willaing, Ingrid
2016-06-01
To explore the effects of the Next Education (NEED) patient education approach in diabetes education. We tested the use of the NEED approach at eight intervention sites (n=193). Six additional sites served as controls (n=58). Data were collected through questionnaires, interviews and observations. We analysed data using descriptive statistics, logistic regression and systematic text condensation. Results from logistic regression demonstrated better overall assessment of education program experiences and enhanced self-reported improvements in maintaining medications correctly among patients from intervention sites, as compared to control sites. Interviews and observations suggested that improvements in health behavior could be explained by mechanisms related to the education setting, including using person-centeredness and dialogue. However, similar mechanisms were observed at control sites. Observations suggested that the quality of group dynamics, patients' motivation and educators' ability to facilitate participation in education, supported by the NEED approach, contributed to better results at intervention sites. The use of participatory approaches and, in particular, the NEED patient education approach in group-based diabetes education improved self-management skills and health behavior outcomes among individuals with diabetes. The use of dialogue tools in diabetes education is advised for educators. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Sharma, Bimala; Nam, Eun Woo; Kim, Ha Yun; Kim, Jong Koo
2015-01-01
The study examines the prevalence of suicidal ideation and suicide attempt, and associated factors among school-going urban adolescents in Peru. A cross-sectional survey was conducted in a sample of 916 secondary school adolescents in 2014. A structured questionnaire adapted from Global School-based Student Health Survey was used to obtain information. Data were analyzed using logistic regression models at 5% level of significance. Overall, 26.3% reported having suicidal ideation, and 17.5% reported having attempted suicide during the past 12 months. Multivariate logistic regression analysis showed that female sex, being in a fight, being insulted, being attacked, perceived unhappiness, smoking and sexual intercourse initiation were significantly associated with increased risk of suicidal ideation, while female sex, being in a fight, being insulted, being attacked, perceived unhappiness, alcohol and illicit drug use were related to suicide attempt. The prevalence of suicidal ideation and suicide attempts observed in the survey area is relatively high. Female adolescents are particularly vulnerable to report suicidal ideation and suicide attempt. Interventions that address the issue of violence against adolescents, fighting with peers, health risk behaviors particularly initiation of smoking, alcohol and illicit drug use and encourage supportive role of parents may reduce the risk of suicidal behaviors. PMID:26610536
Space exploration initiative (SEI) logistics support lessons from the DoD
NASA Astrophysics Data System (ADS)
Cox, John R.; McCoy, Walbert G.; Jenkins, Terence
Proven and innovative logistics management approaches and techniques used for developing and supporting DoD and Strategic Defense Initiative Office (SDIO) systems are described on the basis of input from DoD to the SEI Synthesis Group; SDIO-developed logistics initiatives, innovative tools, and methodologies; and logistics planning support provided to the NASA/Johnson Planet Surface System Office. The approach is tailored for lunar/Martian surface operations, and provides guidelines for the development and management of a crucial element of the SEI logistics support program. A case study is presented which shows how incorporation of DoD's proven and innovative logistics management approach, tools, and techniques can substantially benefit early logistics planning for SEI, while also implementing many of DoD's recommendations for SEI.
2012-09-01
3,435 10,461 9.1 3.1 63 Unmarried with Children+ Unmarried without Children 439,495 0.01 10,350 43,870 10.1 2.2 64 Married with Children+ Married ...logistic regression model was used to predict the probability of eligibility for the survey (known eligibility vs . unknown eligibility). A second logistic...regression model was used to predict the probability of response among eligible sample members (complete response vs . non-response). CHAID (Chi
Habitat features and predictive habitat modeling for the Colorado chipmunk in southern New Mexico
Rivieccio, M.; Thompson, B.C.; Gould, W.R.; Boykin, K.G.
2003-01-01
Two subspecies of Colorado chipmunk (state threatened and federal species of concern) occur in southern New Mexico: Tamias quadrivittatus australis in the Organ Mountains and T. q. oscuraensis in the Oscura Mountains. We developed a GIS model of potentially suitable habitat based on vegetation and elevation features, evaluated site classifications of the GIS model, and determined vegetation and terrain features associated with chipmunk occurrence. We compared GIS model classifications with actual vegetation and elevation features measured at 37 sites. At 60 sites we measured 18 habitat variables regarding slope, aspect, tree species, shrub species, and ground cover. We used logistic regression to analyze habitat variables associated with chipmunk presence/absence. All (100%) 37 sample sites (28 predicted suitable, 9 predicted unsuitable) were classified correctly by the GIS model regarding elevation and vegetation. For 28 sites predicted suitable by the GIS model, 18 sites (64%) appeared visually suitable based on habitat variables selected from logistic regression analyses, of which 10 sites (36%) were specifically predicted as suitable habitat via logistic regression. We detected chipmunks at 70% of sites deemed suitable via the logistic regression models. Shrub cover, tree density, plant proximity, presence of logs, and presence of rock outcrop were retained in the logistic model for the Oscura Mountains; litter, shrub cover, and grass cover were retained in the logistic model for the Organ Mountains. Evaluation of predictive models illustrates the need for multi-stage analyses to best judge performance. Microhabitat analyses indicate prospective needs for different management strategies between the subspecies. Sensitivities of each population of the Colorado chipmunk to natural and prescribed fire suggest that partial burnings of areas inhabited by Colorado chipmunks in southern New Mexico may be beneficial. These partial burnings may later help avoid a fire that could substantially reduce habitat of chipmunks over a mountain range.
Space Station - An integrated approach to operational logistics support
NASA Technical Reports Server (NTRS)
Hosmer, G. J.
1986-01-01
Development of an efficient and cost effective operational logistics system for the Space Station will require logistics planning early in the program's design and development phase. This paper will focus on Integrated Logistics Support (ILS) Program techniques and their application to the Space Station program design, production and deployment phases to assure the development of an effective and cost efficient operational logistics system. The paper will provide the methodology and time-phased programmatic steps required to establish a Space Station ILS Program that will provide an operational logistics system based on planned Space Station program logistics support.
A comparison of rule-based and machine learning approaches for classifying patient portal messages.
Cronin, Robert M; Fabbri, Daniel; Denny, Joshua C; Rosenbloom, S Trent; Jackson, Gretchen Purcell
2017-09-01
Secure messaging through patient portals is an increasingly popular way that consumers interact with healthcare providers. The increasing burden of secure messaging can affect clinic staffing and workflows. Manual management of portal messages is costly and time consuming. Automated classification of portal messages could potentially expedite message triage and delivery of care. We developed automated patient portal message classifiers with rule-based and machine learning techniques using bag of words and natural language processing (NLP) approaches. To evaluate classifier performance, we used a gold standard of 3253 portal messages manually categorized using a taxonomy of communication types (i.e., main categories of informational, medical, logistical, social, and other communications, and subcategories including prescriptions, appointments, problems, tests, follow-up, contact information, and acknowledgement). We evaluated our classifiers' accuracies in identifying individual communication types within portal messages with area under the receiver-operator curve (AUC). Portal messages often contain more than one type of communication. To predict all communication types within single messages, we used the Jaccard Index. We extracted the variables of importance for the random forest classifiers. The best performing approaches to classification for the major communication types were: logistic regression for medical communications (AUC: 0.899); basic (rule-based) for informational communications (AUC: 0.842); and random forests for social communications and logistical communications (AUCs: 0.875 and 0.925, respectively). The best performing classification approach of classifiers for individual communication subtypes was random forests for Logistical-Contact Information (AUC: 0.963). The Jaccard Indices by approach were: basic classifier, Jaccard Index: 0.674; Naïve Bayes, Jaccard Index: 0.799; random forests, Jaccard Index: 0.859; and logistic regression, Jaccard Index: 0.861. For medical communications, the most predictive variables were NLP concepts (e.g., Temporal_Concept, which maps to 'morning', 'evening' and Idea_or_Concept which maps to 'appointment' and 'refill'). For logistical communications, the most predictive variables contained similar numbers of NLP variables and words (e.g., Telephone mapping to 'phone', 'insurance'). For social and informational communications, the most predictive variables were words (e.g., social: 'thanks', 'much', informational: 'question', 'mean'). This study applies automated classification methods to the content of patient portal messages and evaluates the application of NLP techniques on consumer communications in patient portal messages. We demonstrated that random forest and logistic regression approaches accurately classified the content of portal messages, although the best approach to classification varied by communication type. Words were the most predictive variables for classification of most communication types, although NLP variables were most predictive for medical communication types. As adoption of patient portals increases, automated techniques could assist in understanding and managing growing volumes of messages. Further work is needed to improve classification performance to potentially support message triage and answering. Copyright © 2017 Elsevier B.V. All rights reserved.
Attitudes towards drug legalization among drug users.
Trevino, Roberto A; Richard, Alan J
2002-01-01
Research shows that support for legalization of drugs varies significantly among different sociodemographic and political groups. Yet there is little research examining the degree of support for legalization of drugs among drug users. This paper examines how frequency and type of drug use affect the support for legalization of drugs after adjusting for the effects of political affiliation and sociodemographic characteristics. A sample of 188 drug users and non-drug users were asked whether they would support the legalization of marijuana, cocaine, and heroin. Respondents reported their use of marijuana, crack, cocaine, heroin, speedball, and/or methamphetamines during the previous 30 days. Support for legalization of drugs was analyzed by estimating three separate logistic regressions. The results showed that the support for the legalization of drugs depended on the definition of "drug user" and the type of drug. In general, however, the results showed that marijuana users were more likely to support legalizing marijuana, but they were less likely to support the legalization of cocaine and heroin. On the other hand, users of crack, cocaine, heroin, speedball, and/or methamphetamines were more likely to support legalizing all drugs including cocaine and heroin.
The logistic model for predicting the non-gonoactive Aedes aegypti females.
Reyes-Villanueva, Filiberto; Rodríguez-Pérez, Mario A
2004-01-01
To estimate, using logistic regression, the likelihood of occurrence of a non-gonoactive Aedes aegypti female, previously fed human blood, with relation to body size and collection method. This study was conducted in Monterrey, Mexico, between 1994 and 1996. Ten samplings of 60 mosquitoes of Ae. aegypti females were carried out in three dengue endemic areas: six of biting females, two of emerging mosquitoes, and two of indoor resting females. Gravid females, as well as those with blood in the gut were removed. Mosquitoes were taken to the laboratory and engorged on human blood. After 48 hours, ovaries were dissected to register whether they were gonoactive or non-gonoactive. Wing-length in mm was an indicator for body size. The logistic regression model was used to assess the likelihood of non-gonoactivity, as a binary variable, in relation to wing-length and collection method. Of the 600 females, 164 (27%) remained non-gonoactive, with a wing-length range of 1.9-3.2 mm, almost equal to that of all females (1.8-3.3 mm). The logistic regression model showed a significant likelihood of a female remaining non-gonoactive (Y=1). The collection method did not influence the binary response, but there was an inverse relationship between non-gonoactivity and wing-length. Dengue vector populations from Monterrey, Mexico display a wide-range body size. Logistic regression was a useful tool to estimate the likelihood for an engorged female to remain non-gonoactive. The necessity for a second blood meal is present in any female, but small mosquitoes are more likely to bite again within a 2-day interval, in order to attain egg maturation. The English version of this paper is available too at: http://www.insp.mx/salud/index.html.
The Application of the Cumulative Logistic Regression Model to Automated Essay Scoring
ERIC Educational Resources Information Center
Haberman, Shelby J.; Sinharay, Sandip
2010-01-01
Most automated essay scoring programs use a linear regression model to predict an essay score from several essay features. This article applied a cumulative logit model instead of the linear regression model to automated essay scoring. Comparison of the performances of the linear regression model and the cumulative logit model was performed on a…
Predictors of self-rated health: a 12-month prospective study of IT and media workers.
Hasson, Dan; Arnetz, Bengt B; Theorell, Töres; Anderberg, Ulla Maria
2006-07-31
The aim of the present study was to determine health-related risk and salutogenic factors and to use these to construct prediction models for future self-rated health (SRH), i.e. find possible characteristics predicting individuals improving or worsening in SRH over time (0-12 months). A prospective study was conducted with measurements (physiological markers and self-ratings) at 0, 6 and 12 months, involving 303 employees (187 men and 116 women, age 23-64) from four information technology and two media companies. There were a multitude of statistically significant cross-sectional correlations (Spearman's Rho) between SRH and other self-ratings as well as physiological markers. Predictors of future SRH were baseline ratings of SRH, self-esteem and social support (logistic regression), and SRH, sleep quality and sense of coherence (linear regression). The results of the present study indicate that baseline SRH and other self-ratings are predictive of future SRH. It is cautiously implied that SRH, self-esteem, social support, sleep quality and sense of coherence might be predictors of future SRH and therefore possibly also of various future health outcomes.
78 FR 54242 - 36(b)(1) Arms Sales Notification
Federal Register 2010, 2011, 2012, 2013, 2014
2013-09-03
... elements of logistical and program support. The estimated cost is $1.2 billion. This proposed sale will... support services, and other related elements of logistical and program support. (iv) Military Department... logistical support to sustain the combat and operational readiness of its existing aircraft fleet. The...
Regularization Paths for Conditional Logistic Regression: The clogitL1 Package.
Reid, Stephen; Tibshirani, Rob
2014-07-01
We apply the cyclic coordinate descent algorithm of Friedman, Hastie, and Tibshirani (2010) to the fitting of a conditional logistic regression model with lasso [Formula: see text] and elastic net penalties. The sequential strong rules of Tibshirani, Bien, Hastie, Friedman, Taylor, Simon, and Tibshirani (2012) are also used in the algorithm and it is shown that these offer a considerable speed up over the standard coordinate descent algorithm with warm starts. Once implemented, the algorithm is used in simulation studies to compare the variable selection and prediction performance of the conditional logistic regression model against that of its unconditional (standard) counterpart. We find that the conditional model performs admirably on datasets drawn from a suitable conditional distribution, outperforming its unconditional counterpart at variable selection. The conditional model is also fit to a small real world dataset, demonstrating how we obtain regularization paths for the parameters of the model and how we apply cross validation for this method where natural unconditional prediction rules are hard to come by.
Computational tools for exact conditional logistic regression.
Corcoran, C; Mehta, C; Patel, N; Senchaudhuri, P
Logistic regression analyses are often challenged by the inability of unconditional likelihood-based approximations to yield consistent, valid estimates and p-values for model parameters. This can be due to sparseness or separability in the data. Conditional logistic regression, though useful in such situations, can also be computationally unfeasible when the sample size or number of explanatory covariates is large. We review recent developments that allow efficient approximate conditional inference, including Monte Carlo sampling and saddlepoint approximations. We demonstrate through real examples that these methods enable the analysis of significantly larger and more complex data sets. We find in this investigation that for these moderately large data sets Monte Carlo seems a better alternative, as it provides unbiased estimates of the exact results and can be executed in less CPU time than can the single saddlepoint approximation. Moreover, the double saddlepoint approximation, while computationally the easiest to obtain, offers little practical advantage. It produces unreliable results and cannot be computed when a maximum likelihood solution does not exist. Copyright 2001 John Wiley & Sons, Ltd.
Regularization Paths for Conditional Logistic Regression: The clogitL1 Package
Reid, Stephen; Tibshirani, Rob
2014-01-01
We apply the cyclic coordinate descent algorithm of Friedman, Hastie, and Tibshirani (2010) to the fitting of a conditional logistic regression model with lasso (ℓ1) and elastic net penalties. The sequential strong rules of Tibshirani, Bien, Hastie, Friedman, Taylor, Simon, and Tibshirani (2012) are also used in the algorithm and it is shown that these offer a considerable speed up over the standard coordinate descent algorithm with warm starts. Once implemented, the algorithm is used in simulation studies to compare the variable selection and prediction performance of the conditional logistic regression model against that of its unconditional (standard) counterpart. We find that the conditional model performs admirably on datasets drawn from a suitable conditional distribution, outperforming its unconditional counterpart at variable selection. The conditional model is also fit to a small real world dataset, demonstrating how we obtain regularization paths for the parameters of the model and how we apply cross validation for this method where natural unconditional prediction rules are hard to come by. PMID:26257587
Ordinal logistic regression analysis on the nutritional status of children in KarangKitri village
NASA Astrophysics Data System (ADS)
Ohyver, Margaretha; Yongharto, Kimmy Octavian
2015-09-01
Ordinal logistic regression is a statistical technique that can be used to describe the relationship between ordinal response variable with one or more independent variables. This method has been used in various fields including in the health field. In this research, ordinal logistic regression is used to describe the relationship between nutritional status of children with age, gender, height, and family status. Nutritional status of children in this research is divided into over nutrition, well nutrition, less nutrition, and malnutrition. The purpose for this research is to describe the characteristics of children in the KarangKitri Village and to determine the factors that influence the nutritional status of children in the KarangKitri village. There are three things that obtained from this research. First, there are still children who are not categorized as well nutritional status. Second, there are children who come from sufficient economic level which include in not normal status. Third, the factors that affect the nutritional level of children are age, family status, and height.
The arcsine is asinine: the analysis of proportions in ecology.
Warton, David I; Hui, Francis K C
2011-01-01
The arcsine square root transformation has long been standard procedure when analyzing proportional data in ecology, with applications in data sets containing binomial and non-binomial response variables. Here, we argue that the arcsine transform should not be used in either circumstance. For binomial data, logistic regression has greater interpretability and higher power than analyses of transformed data. However, it is important to check the data for additional unexplained variation, i.e., overdispersion, and to account for it via the inclusion of random effects in the model if found. For non-binomial data, the arcsine transform is undesirable on the grounds of interpretability, and because it can produce nonsensical predictions. The logit transformation is proposed as an alternative approach to address these issues. Examples are presented in both cases to illustrate these advantages, comparing various methods of analyzing proportions including untransformed, arcsine- and logit-transformed linear models and logistic regression (with or without random effects). Simulations demonstrate that logistic regression usually provides a gain in power over other methods.
Sukkarieh-Haraty, Ola; Howard, Elizabeth
2015-01-01
The purpose of this study was to assess the relationship between diabetes self-care, diabetes-specific emotional distress, and social support and glycemic control (hemoglobin A1C levels: HbA1c) among a sample of Lebanese adults with type 2 diabetes. A descriptive correlational design was adapted with descriptive statistics and multiple logistic regressions for analyses. A convenience sample of 140 adults diagnosed with type 2 diabetes was recruited from 2 diabetes clinics in Greater Beirut. Participants were asked to complete 4 questionnaires in Arabic. Significant associations (P < .05) were found between following a general diet for more than 3.5 days per week and higher social support and HbA1c levels of 7% or more. Social support was positively associated with HbA1c levels such that participants with uncontrolled glycemic levels, as evidenced by higher values for HbA1c, received more support from their social network.
Hébert, Martine; Lavoie, Francine; Blais, Martin
2015-01-01
The present analysis explored the contribution of personal (resilience), familial (maternal and paternal support, sibling support) and extra-familial (peer support, other adult) to the prediction of clinical levels of PTSD symptoms in teenagers reporting sexual abuse while controlling for abuse-related variables (type of abuse, severity, and multiple abuse). In a representative sample of high schools students in the province of Quebec, a total of 15.2% of high school girls and 4.4% of high school boys reported a history of child sexual abuse. Sexually abused girls (27.8%) were more likely than boys (14.9%) to obtain scores reaching clinical levels of PTSD symptoms. A logistic hierarchical regression revealed that over and above the characteristics of the sexual abuse experienced, resilience, maternal as well as peer support contributed to the prediction of symptoms of PTSD reaching the clinical threshold. Avenues for intervention practices and prevention among adolescent victims of sexual assault are discussed. PMID:24714884
Religious Social Support and Hypertension Among Older North American Seventh-Day Adventists.
Charlemagne-Badal, Sherma J; Lee, Jerry W
2016-04-01
Seventh-day Adventists have been noted for their unique lifestyle, religious practices and longevity. However, we know little about how religion is directly related to health in this group. Specifically, we know nothing about how religious social support is related to hypertension. Using data from the Biopsychosocial Religion and Health Study, we carried out a cross-sectional study of 9581 and a prospective study of 5720 North American Seventh-day Adventists examining new 534 cases of hypertension occurring up to 4 years later. We used binary logistic regression analyses to examine study hypotheses. Of the religious social support variables, in both the cross-sectional and prospective study only anticipated support significantly predicted hypertension, but the relationship was mediated by BMI. There were no significant race or gender differences. The favorable relationships between anticipated support and hypertension appear to be mediated by BMI and are an indication of how this dimension of religion combined with lifestyle promotes good health, specifically, reduced risk of hypertension.
Avalos, Marta; Adroher, Nuria Duran; Lagarde, Emmanuel; Thiessard, Frantz; Grandvalet, Yves; Contrand, Benjamin; Orriols, Ludivine
2012-09-01
Large data sets with many variables provide particular challenges when constructing analytic models. Lasso-related methods provide a useful tool, although one that remains unfamiliar to most epidemiologists. We illustrate the application of lasso methods in an analysis of the impact of prescribed drugs on the risk of a road traffic crash, using a large French nationwide database (PLoS Med 2010;7:e1000366). In the original case-control study, the authors analyzed each exposure separately. We use the lasso method, which can simultaneously perform estimation and variable selection in a single model. We compare point estimates and confidence intervals using (1) a separate logistic regression model for each drug with a Bonferroni correction and (2) lasso shrinkage logistic regression analysis. Shrinkage regression had little effect on (bias corrected) point estimates, but led to less conservative results, noticeably for drugs with moderate levels of exposure. Carbamates, carboxamide derivative and fatty acid derivative antiepileptics, drugs used in opioid dependence, and mineral supplements of potassium showed stronger associations. Lasso is a relevant method in the analysis of databases with large number of exposures and can be recommended as an alternative to conventional strategies.
NASA Astrophysics Data System (ADS)
Shafizadeh-Moghadam, Hossein; Helbich, Marco
2015-03-01
The rapid growth of megacities requires special attention among urban planners worldwide, and particularly in Mumbai, India, where growth is very pronounced. To cope with the planning challenges this will bring, developing a retrospective understanding of urban land-use dynamics and the underlying driving-forces behind urban growth is a key prerequisite. This research uses regression-based land-use change models - and in particular non-spatial logistic regression models (LR) and auto-logistic regression models (ALR) - for the Mumbai region over the period 1973-2010, in order to determine the drivers behind spatiotemporal urban expansion. Both global models are complemented by a local, spatial model, the so-called geographically weighted logistic regression (GWLR) model, one that explicitly permits variations in driving-forces across space. The study comes to two main conclusions. First, both global models suggest similar driving-forces behind urban growth over time, revealing that LRs and ALRs result in estimated coefficients with comparable magnitudes. Second, all the local coefficients show distinctive temporal and spatial variations. It is therefore concluded that GWLR aids our understanding of urban growth processes, and so can assist context-related planning and policymaking activities when seeking to secure a sustainable urban future.
Yoshida, Yilin; Broyles, Stephanie; Scribner, Richard; Chen, Liwei; Phillippi, Stephen; Jackson-Thompson, Jeanette; Simoes, Eduardo J; Tseng, Tung-Sung
2018-06-26
This study examined the moderating role of social support in the acculturation-obesity/central obesity relationship in Mexican American (MA) men and women. Data from NHANES 1999-2008 were used. Acculturation derived from language use, country of birth and length of residence in the U.S. Social support assessed emotional and financial support. BMI (≥30) and waist circumference (≥88 cm for women; ≥102 cm for men) measured obesity and central obesity, respectively. Weighted multivariate logistic regression models were used to describe associations. Compared to less acculturation, more acculturation was associated with higher odds of obesity (ORs 2.48; 95% CI 1.06-5.83) and central obesity (2.90; 1.39-6.08) among MA men with low/no social support, but not among MA men reporting high social support. The modifying effects was not observed among women. Higher amounts of social support appeared to attenuate the risk of obesity/central obesity associated with acculturation. Interventions enhancing social support maybe effective among acculturated MAs, particularly among men.
Interlenghi, Gabriela dos Santos; Salles-Costa, Rosana
2015-11-01
To verify the association between perceived social support and household food insecurity (HFI). A cross-sectional survey. A population-based study with a representative sample of households from a metropolitan area of Rio de Janeiro, Brazil, conducted in 2010. HFI was estimated with the Brazilian Food Insecurity Scale (EBIA). Social support was assessed using the adapted and validated Brazilian version of the Medical Outcomes Study Social Support Survey. Multinomial logistic regression was used to evaluate the association between social support and HFI, adjusting for potential confounders. Adults (n 1022) aged 19-60 years old (27% men, 73% women) who were responsible for feeding the household. Individuals with high scores of social support were less likely to experience moderate HFI (OR=0·96; 95% CI 0·94, 0·99) and severe HFI (OR=0·96; 95% CI 0·94, 0·98). These findings indicate that social support may contribute to reducing HFI in populations vulnerable to poverty. Strategies to increase social relationships should be encouraged in this group to enhance their perceived social support.
Disparities in health-related Internet use among African American men, 2010.
Mitchell, Jamie A; Thompson, Hayley S; Watkins, Daphne C; Shires, Deirdre; Modlin, Charles S
2014-03-20
Given the benefits of health-related Internet use, we examined whether sociodemographic, medical, and access-related factors predicted this outcome among African American men, a population burdened with health disparities. African American men (n = 329) completed an anonymous survey at a community health fair in 2010; logistic regression was used to identify predictors. Only education (having attended some college or more) predicted health-related Internet use (P < .001). African American men may vary in how they prefer to receive health information; those with less education may need support to engage effectively with health-related Internet use.
Roland, Lauren T.; Kallogjeri, Dorina; Sinks, Belinda C.; Rauch, Steven D.; Shepard, Neil T.; White, Judith A.; Goebel, Joel A.
2015-01-01
Objective Test performance of a focused dizziness questionnaire’s ability to discriminate between peripheral and non-peripheral causes of vertigo. Study Design Prospective multi-center Setting Four academic centers with experienced balance specialists Patients New dizzy patients Interventions A 32-question survey was given to participants. Balance specialists were blinded and a diagnosis was established for all participating patients within 6 months. Main outcomes Multinomial logistic regression was used to evaluate questionnaire performance in predicting final diagnosis and differentiating between peripheral and non-peripheral vertigo. Univariate and multivariable stepwise logistic regression were used to identify questions as significant predictors of the ultimate diagnosis. C-index was used to evaluate performance and discriminative power of the multivariable models. Results 437 patients participated in the study. Eight participants without confirmed diagnoses were excluded and 429 were included in the analysis. Multinomial regression revealed that the model had good overall predictive accuracy of 78.5% for the final diagnosis and 75.5% for differentiating between peripheral and non-peripheral vertigo. Univariate logistic regression identified significant predictors of three main categories of vertigo: peripheral, central and other. Predictors were entered into forward stepwise multivariable logistic regression. The discriminative power of the final models for peripheral, central and other causes were considered good as measured by c-indices of 0.75, 0.7 and 0.78, respectively. Conclusions This multicenter study demonstrates a focused dizziness questionnaire can accurately predict diagnosis for patients with chronic/relapsing dizziness referred to outpatient clinics. Additionally, this survey has significant capability to differentiate peripheral from non-peripheral causes of vertigo and may, in the future, serve as a screening tool for specialty referral. Clinical utility of this questionnaire to guide specialty referral is discussed. PMID:26485598
Roland, Lauren T; Kallogjeri, Dorina; Sinks, Belinda C; Rauch, Steven D; Shepard, Neil T; White, Judith A; Goebel, Joel A
2015-12-01
Test performance of a focused dizziness questionnaire's ability to discriminate between peripheral and nonperipheral causes of vertigo. Prospective multicenter. Four academic centers with experienced balance specialists. New dizzy patients. A 32-question survey was given to participants. Balance specialists were blinded and a diagnosis was established for all participating patients within 6 months. Multinomial logistic regression was used to evaluate questionnaire performance in predicting final diagnosis and differentiating between peripheral and nonperipheral vertigo. Univariate and multivariable stepwise logistic regression were used to identify questions as significant predictors of the ultimate diagnosis. C-index was used to evaluate performance and discriminative power of the multivariable models. In total, 437 patients participated in the study. Eight participants without confirmed diagnoses were excluded and 429 were included in the analysis. Multinomial regression revealed that the model had good overall predictive accuracy of 78.5% for the final diagnosis and 75.5% for differentiating between peripheral and nonperipheral vertigo. Univariate logistic regression identified significant predictors of three main categories of vertigo: peripheral, central, and other. Predictors were entered into forward stepwise multivariable logistic regression. The discriminative power of the final models for peripheral, central, and other causes was considered good as measured by c-indices of 0.75, 0.7, and 0.78, respectively. This multicenter study demonstrates a focused dizziness questionnaire can accurately predict diagnosis for patients with chronic/relapsing dizziness referred to outpatient clinics. Additionally, this survey has significant capability to differentiate peripheral from nonperipheral causes of vertigo and may, in the future, serve as a screening tool for specialty referral. Clinical utility of this questionnaire to guide specialty referral is discussed.
Prediction of cold and heat patterns using anthropometric measures based on machine learning.
Lee, Bum Ju; Lee, Jae Chul; Nam, Jiho; Kim, Jong Yeol
2018-01-01
To examine the association of body shape with cold and heat patterns, to determine which anthropometric measure is the best indicator for discriminating between the two patterns, and to investigate whether using a combination of measures can improve the predictive power to diagnose these patterns. Based on a total of 4,859 subjects (3,000 women and 1,859 men), statistical analyses using binary logistic regression were performed to assess the significance of the difference and the predictive power of each anthropometric measure, and binary logistic regression and Naive Bayes with the variable selection technique were used to assess the improvement in the predictive power of the patterns using the combined measures. In women, the strongest indicators for determining the cold and heat patterns among anthropometric measures were body mass index (BMI) and rib circumference; in men, the best indicator was BMI. In experiments using a combination of measures, the values of the area under the receiver operating characteristic curve in women were 0.776 by Naive Bayes and 0.772 by logistic regression, and the values in men were 0.788 by Naive Bayes and 0.779 by logistic regression. Individuals with a higher BMI have a tendency toward a heat pattern in both women and men. The use of a combination of anthropometric measures can slightly improve the diagnostic accuracy. Our findings can provide fundamental information for the diagnosis of cold and heat patterns based on body shape for personalized medicine.
Teng, Ju-Hsi; Lin, Kuan-Chia; Ho, Bin-Shenq
2007-10-01
A community-based aboriginal study was conducted and analysed to explore the application of classification tree and logistic regression. A total of 1066 aboriginal residents in Yilan County were screened during 2003-2004. The independent variables include demographic characteristics, physical examinations, geographic location, health behaviours, dietary habits and family hereditary diseases history. Risk factors of cardiovascular diseases were selected as the dependent variables in further analysis. The completion rate for heath interview is 88.9%. The classification tree results find that if body mass index is higher than 25.72 kg m(-2) and the age is above 51 years, the predicted probability for number of cardiovascular risk factors > or =3 is 73.6% and the population is 322. If body mass index is higher than 26.35 kg m(-2) and geographical latitude of the village is lower than 24 degrees 22.8', the predicted probability for number of cardiovascular risk factors > or =4 is 60.8% and the population is 74. As the logistic regression results indicate that body mass index, drinking habit and menopause are the top three significant independent variables. The classification tree model specifically shows the discrimination paths and interactions between the risk groups. The logistic regression model presents and analyses the statistical independent factors of cardiovascular risks. Applying both models to specific situations will provide a different angle for the design and management of future health intervention plans after community-based study.
Gong, Xu; Cui, Jianli; Jiang, Ziping; Lu, Laijin; Li, Xiucun
2018-03-01
Few clinical retrospective studies have reported the risk factors of pedicled flap necrosis in hand soft tissue reconstruction. The aim of this study was to identify non-technical risk factors associated with pedicled flap perioperative necrosis in hand soft tissue reconstruction via a multivariate logistic regression analysis. For patients with hand soft tissue reconstruction, we carefully reviewed hospital records and identified 163 patients who met the inclusion criteria. The characteristics of these patients, flap transfer procedures and postoperative complications were recorded. Eleven predictors were identified. The correlations between pedicled flap necrosis and risk factors were analysed using a logistic regression model. Of 163 skin flaps, 125 flaps survived completely without any complications. The pedicled flap necrosis rate in hands was 11.04%, which included partial flap necrosis (7.36%) and total flap necrosis (3.68%). Soft tissue defects in fingers were noted in 68.10% of all cases. The logistic regression analysis indicated that the soft tissue defect site (P = 0.046, odds ratio (OR) = 0.079, confidence interval (CI) (0.006, 0.959)), flap size (P = 0.020, OR = 1.024, CI (1.004, 1.045)) and postoperative wound infection (P < 0.001, OR = 17.407, CI (3.821, 79.303)) were statistically significant risk factors for pedicled flap necrosis of the hand. Soft tissue defect site, flap size and postoperative wound infection were risk factors associated with pedicled flap necrosis in hand soft tissue defect reconstruction. © 2017 Royal Australasian College of Surgeons.
Lincoln, Karen D.; Taylor, Robert Joseph; Bullard, Kai McKeever; Chatters, Linda M.; Himle, Joseph A.; Woodward, Amanda Toler; Jackson, James S.
2010-01-01
Objectives Both emotional support and negative interaction with family members have been linked to mental health. However, few studies have examined the associations between emotional support and negative interaction and psychiatric disorders in late life. This study investigated the relationship between emotional support and negative interaction on lifetime prevalence of mood and anxiety disorders among older African Americans. Design The analyses utilized the National Survey of American Life. Methods Logistic regression and negative binomial regression analyses were used to examine the effect of emotional support and negative interaction with family members on the prevalence of lifetime DSM-IV mood and anxiety disorders. Participants Data from 786 African Americans aged 55 years and older were used. Measurement The DSM-IV World Mental Health Composite International Diagnostic Interview (WMH-CIDI) was used to assess mental disorders. Three dependent variables were investigated: the prevalence of lifetime mood disorders, the prevalence of lifetime anxiety disorders, and the total number of lifetime mood and anxiety disorders. Results Multivariate analysis found that emotional support was not associated with any of the three dependent variables. Negative interaction was significantly and positively associated with the odds of having a lifetime mood disorder, a lifetime anxiety disorder and the number of lifetime mood and anxiety disorders. Conclusions This is the first study to investigate the relationship between emotional support, negative interaction with family members and psychiatric disorders among older African Americans. Negative interaction was a risk factor for mood and anxiety disorders among older African Americans, whereas emotional support was not significant. PMID:20157904
Marques dos Santos, Letícia; Neves dos Santos, Darci; Rodrigues, Laura Cunha; Barreto, Maurício Lima
2012-11-01
Atopic and non-atopic asthma have distinct risk factors and immunological mechanisms, and few studies differentiate between the impacts of psychosocial factors on the prevalence of these disease phenotypes. The authors aimed to identify whether the effect of maternal mental health on prevalence of asthma symptoms differs between atopic and non-atopic children, taking into account family social support. This is a cross-sectional study of 1013 children participating in the Social Change Allergy and Asthma in Latin America project. Psychosocial data were collected through a household survey utilising Self-Reporting Questionnaire and Medical Outcome Study Social Support Scale. Socioeconomic and wheezing information was obtained through the questionnaire of the International Study of Allergy and Asthma in Childhood, and level of allergen-specific IgE was measured to identify atopy. Polytomous logistic regression was used to estimate the association between maternal mental health, social support and atopic and non-atopic wheezing. Effect modification was evaluated through stratified polytomous regression according to social support level. Maternal mental disorder had the same impact on atopic and non-atopic wheezing, even after adjusting for confounding variables. Affective, material and informational supports had protective effects on non-atopic asthma, and there is some evidence that social supports may act as a buffer for the impact of maternal mental disorder on non-atopic wheezing. Poor maternal mental health is positively associated with wheezing, independent of whether asthma is atopic or non-atopic, but perception of high levels of social support appears to buffer this relationship in non-atopic wheezers only.
2011-01-01
Background The majority of studies of the local food environment in relation to obesity risk have been conducted in the US, UK, and Australia. The evidence remains limited to western societies. The aim of this paper is to examine the association of local food environment to body mass index (BMI) in a study of older Japanese individuals. Methods The analysis was based on 12,595 respondents from cross-sectional data of the Aichi Gerontological Evaluation Study (AGES), conducted in 2006 and 2007. Using Geographic Information Systems (GIS), we mapped respondents' access to supermarkets, convenience stores, and fast food outlets, based on a street network (both the distance to the nearest stores and the number of stores within 500 m of the respondents' home). Multiple linear regression and logistic regression analyses were performed to examine the association between food environment and BMI. Results In contrast to previous reports, we found that better access to supermarkets was related to higher BMI. Better access to fast food outlets or convenience stores was also associated with higher BMI, but only among those living alone. The logistic regression analysis, using categorized BMI, showed that the access to supermarkets was only related to being overweight or obese, but not related to being underweight. Conclusions Our findings provide mixed support for the types of food environment measures previously used in western settings. Importantly, our results suggest the need to develop culture-specific approaches to characterizing neighborhood contexts when hypotheses are extrapolated across national borders. PMID:21777439
A new computational strategy for predicting essential genes.
Cheng, Jian; Wu, Wenwu; Zhang, Yinwen; Li, Xiangchen; Jiang, Xiaoqian; Wei, Gehong; Tao, Shiheng
2013-12-21
Determination of the minimum gene set for cellular life is one of the central goals in biology. Genome-wide essential gene identification has progressed rapidly in certain bacterial species; however, it remains difficult to achieve in most eukaryotic species. Several computational models have recently been developed to integrate gene features and used as alternatives to transfer gene essentiality annotations between organisms. We first collected features that were widely used by previous predictive models and assessed the relationships between gene features and gene essentiality using a stepwise regression model. We found two issues that could significantly reduce model accuracy: (i) the effect of multicollinearity among gene features and (ii) the diverse and even contrasting correlations between gene features and gene essentiality existing within and among different species. To address these issues, we developed a novel model called feature-based weighted Naïve Bayes model (FWM), which is based on Naïve Bayes classifiers, logistic regression, and genetic algorithm. The proposed model assesses features and filters out the effects of multicollinearity and diversity. The performance of FWM was compared with other popular models, such as support vector machine, Naïve Bayes model, and logistic regression model, by applying FWM to reciprocally predict essential genes among and within 21 species. Our results showed that FWM significantly improves the accuracy and robustness of essential gene prediction. FWM can remarkably improve the accuracy of essential gene prediction and may be used as an alternative method for other classification work. This method can contribute substantially to the knowledge of the minimum gene sets required for living organisms and the discovery of new drug targets.
Measuring Combat Logistics Force (CLF) Adequacy in Supporting Naval Operations
2012-03-01
existing fuel consumption rates and the hotel services load. Because logistics planning factors for foreign carriers were not available, existing... LOGISTICS FORCE (CLF) ADEQUACY IN SUPPORTING NAVAL OPERATIONS by Philip J. Mock March 2012 Thesis Advisor: Wayne Hughes Second Reader...DATES COVERED Master’s Thesis 4. TITLE AND SUBTITLE Measuring Combat Logistics Force (CLF) Adequacy in Supporting Naval Operations 5. FUNDING
A regularization corrected score method for nonlinear regression models with covariate error.
Zucker, David M; Gorfine, Malka; Li, Yi; Tadesse, Mahlet G; Spiegelman, Donna
2013-03-01
Many regression analyses involve explanatory variables that are measured with error, and failing to account for this error is well known to lead to biased point and interval estimates of the regression coefficients. We present here a new general method for adjusting for covariate error. Our method consists of an approximate version of the Stefanski-Nakamura corrected score approach, using the method of regularization to obtain an approximate solution of the relevant integral equation. We develop the theory in the setting of classical likelihood models; this setting covers, for example, linear regression, nonlinear regression, logistic regression, and Poisson regression. The method is extremely general in terms of the types of measurement error models covered, and is a functional method in the sense of not involving assumptions on the distribution of the true covariate. We discuss the theoretical properties of the method and present simulation results in the logistic regression setting (univariate and multivariate). For illustration, we apply the method to data from the Harvard Nurses' Health Study concerning the relationship between physical activity and breast cancer mortality in the period following a diagnosis of breast cancer. Copyright © 2013, The International Biometric Society.
Logistic Mixed Models to Investigate Implicit and Explicit Belief Tracking
Lages, Martin; Scheel, Anne
2016-01-01
We investigated the proposition of a two-systems Theory of Mind in adults’ belief tracking. A sample of N = 45 participants predicted the choice of one of two opponent players after observing several rounds in an animated card game. Three matches of this card game were played and initial gaze direction on target and subsequent choice predictions were recorded for each belief task and participant. We conducted logistic regressions with mixed effects on the binary data and developed Bayesian logistic mixed models to infer implicit and explicit mentalizing in true belief and false belief tasks. Although logistic regressions with mixed effects predicted the data well a Bayesian logistic mixed model with latent task- and subject-specific parameters gave a better account of the data. As expected explicit choice predictions suggested a clear understanding of true and false beliefs (TB/FB). Surprisingly, however, model parameters for initial gaze direction also indicated belief tracking. We discuss why task-specific parameters for initial gaze directions are different from choice predictions yet reflect second-order perspective taking. PMID:27853440
Distiller, Larry A; Joffe, Barry I; Melville, Vanessa; Welman, Tania; Distiller, Greg B
2006-01-01
The factors responsible for premature coronary atherosclerosis in patients with type 1 diabetes are ill defined. We therefore assessed carotid intima-media complex thickness (IMT) in relatively long-surviving patients with type 1 diabetes as a marker of atherosclerosis and correlated this with traditional risk factors. Cross-sectional study of 148 patients with relatively long-surviving (>18 years) type 1 diabetes (76 men and 72 women) attending the Centre for Diabetes and Endocrinology, Johannesburg. The mean common carotid artery IMT and presence or absence of plaque was evaluated by high-resolution B-mode ultrasound. Their median age was 48 years and duration of diabetes 26 years (range 18-59 years). Traditional risk factors (age, duration of diabetes, glycemic control, hypertension, smoking and lipoprotein concentrations) were recorded. Three response variables were defined and modeled. Standard multiple regression was used for a continuous IMT variable, logistic regression for the presence/absence of plaque and ordinal logistic regression to model three categories of "risk." The median common carotid IMT was 0.62 mm (range 0.44-1.23 mm) with plaque detected in 28 cases. The multiple regression model found significant associations between IMT and current age (P=.001), duration of diabetes (P=.033), BMI (P=.008) and diagnosed hypertension (P=.046) with HDL showing a protective effect (P=.022). Current age (P=.001) and diagnosed hypertension (P=.004), smoking (P=.008) and retinopathy (P=.033) were significant in the logistic regression model. Current age was also significant in the ordinal logistic regression model (P<.001), as was total cholesterol/HDL ratio (P<.001) and mean HbA(1c) concentration (P=.073). The major factors influencing common carotid IMT in patients with relatively long-surviving type 1 diabetes are age, duration of diabetes, existing hypertension and HDL (protective) with a relatively minor role ascribed to relatively long-standing glycemic control.
NASA Supportability Engineering Implementation Utilizing DoD Practices and Processes
NASA Technical Reports Server (NTRS)
Smith, David A.; Smith, John V.
2010-01-01
The Ares I design and development program made the determination early in the System Design Review Phase to utilize DoD ILS and LSA approach for supportability engineering as an integral part of the system engineering process. This paper is to provide a review of the overall approach to design Ares-I with an emphasis on a more affordable, supportable, and sustainable launch vehicle. Discussions will include the requirements development, design influence, support concept alternatives, ILS and LSA planning, Logistics support analyses/trades performed, LSA tailoring for NASA Ares Program, support system infrastructure identification, ILS Design Review documentation, Working Group coordination, and overall ILS implementation. At the outset, the Ares I Project initiated the development of the Integrated Logistics Support Plan (ILSP) and a Logistics Support Analysis process to provide a path forward for the management of the Ares-I ILS program and supportability analysis activities. The ILSP provide the initial planning and coordination between the Ares-I Project Elements and Ground Operation Project. The LSA process provided a system engineering approach in the development of the Ares-I supportability requirements; influence the design for supportability and development of alternative support concepts that satisfies the program operability requirements. The LSA planning and analysis results are documented in the Logistics Support Analysis Report. This document was required during the Ares-I System Design Review (SDR) and Preliminary Design Review (PDR) review cycles. To help coordinate the LSA process across the Ares-I project and between programs, the LSA Report is updated and released quarterly. A System Requirement Analysis was performed to determine the supportability requirements and technical performance measurements (TPMs). Two working groups were established to provide support in the management and implement the Ares-I ILS program, the Integrated Logistics Support Working Group (ILSWG) and the Logistics Support Analysis Record Working Group (LSARWG). The Ares I ILSWG is established to assess the requirements and conduct, evaluate analyses and trade studies associated with acquisition logistic and supportability processes and to resolve Ares I integrated logistics and supportability issues. It established a strategic collaborative alliance for coordination of Logistics Support Analysis activates in support of the integrated Ares I vehicle design and development of logistics support infrastructure. A Joint Ares I - Orion LSAR Working Group was established to: 1) Guide the development of Ares-I and Orion LSAR data and serve as a model for future Constellation programs, 2) Develop rules and assumptions that will apply across the Constellation program with regards to the program's LSAR development, and 3) Maintain the Constellation LSAR Style Guide.
Correlation and simple linear regression.
Eberly, Lynn E
2007-01-01
This chapter highlights important steps in using correlation and simple linear regression to address scientific questions about the association of two continuous variables with each other. These steps include estimation and inference, assessing model fit, the connection between regression and ANOVA, and study design. Examples in microbiology are used throughout. This chapter provides a framework that is helpful in understanding more complex statistical techniques, such as multiple linear regression, linear mixed effects models, logistic regression, and proportional hazards regression.
Multiple Imputation of a Randomly Censored Covariate Improves Logistic Regression Analysis.
Atem, Folefac D; Qian, Jing; Maye, Jacqueline E; Johnson, Keith A; Betensky, Rebecca A
2016-01-01
Randomly censored covariates arise frequently in epidemiologic studies. The most commonly used methods, including complete case and single imputation or substitution, suffer from inefficiency and bias. They make strong parametric assumptions or they consider limit of detection censoring only. We employ multiple imputation, in conjunction with semi-parametric modeling of the censored covariate, to overcome these shortcomings and to facilitate robust estimation. We develop a multiple imputation approach for randomly censored covariates within the framework of a logistic regression model. We use the non-parametric estimate of the covariate distribution or the semiparametric Cox model estimate in the presence of additional covariates in the model. We evaluate this procedure in simulations, and compare its operating characteristics to those from the complete case analysis and a survival regression approach. We apply the procedures to an Alzheimer's study of the association between amyloid positivity and maternal age of onset of dementia. Multiple imputation achieves lower standard errors and higher power than the complete case approach under heavy and moderate censoring and is comparable under light censoring. The survival regression approach achieves the highest power among all procedures, but does not produce interpretable estimates of association. Multiple imputation offers a favorable alternative to complete case analysis and ad hoc substitution methods in the presence of randomly censored covariates within the framework of logistic regression.
Liu, Huijun; Li, Shuzhuo; Feldman, Marc. W.
2015-01-01
This study examined gender differences in the influence of marital status and marital quality on life satisfaction. The roles of intergenerational support and perceived socioeconomic status in the relationship between marriage and life satisfaction were also explored. The analysis was conducted with data from the Chinese General Social Survey (CGSS) in 2006, representing 1,317 women and 1,152 men at least 25 years old. Chi-squared tests and logistic regression models were used in this process. Marriage, including marital status and relationship quality, has a protective function for life satisfaction. Marital status is more important for males, but marital quality is more important for females. The moderating roles of intergenerational support and perceived socioeconomic status are gender specific, perhaps due to norms that ascribe different roles to men and women in marriage. PMID:26640317
Caspers, Kristin M; Cadoret, Remi J; Langbehn, Douglas; Yucuis, Rebecca; Troutman, Beth
2005-06-01
Research has shown insecure attachment style is associated with ineffective emotional regulation leading to maladaptive behaviors in adulthood. In the present study, we examined the association between attachment style and illicit substance use within a sample of adoptees (n=148). It was predicted that insecure attachment style would be associated with a higher incidence of lifetime illicit substance use and that perceived social support would mediate this association. Logistic regression analyses showed higher prevalence of illicit substance use among both insecure attachment groups as compared to the secure group. No difference was found between the two insecure types. Perceived social support was found to mediate the association between attachment style and illicit substance use for the insecure-preoccupied group only. The findings from the present study further implicate attachment style in the risk for illicit substance use, as well as preventions designed to identify those at risk for use.
Potochnick, Stephanie R.; Perreira, Krista M.
2011-01-01
We examined how the migration and acculturation experiences of first-generation Latino youth contributed to their psychological well-being. Data came from the Latino Adolescent Migration, Health, and Adaptation (LAMHA) study, which surveyed 281 first-generation Latino immigrant youth, ages 12–19. Using logistic regression, we evaluated how migration stressors (i.e. traumatic events, choice of migration, discrimination, and documentation status) and migration supports (i.e. family and teacher support, acculturation, and personal-motivation) were associated with depressive symptoms and anxiety. We found that migration stressors increased the risk of both depressive symptoms and anxiety. Time in the US and support from family and teachers reduced the risk of depressive symptoms and anxiety. Compared to documented adolescents, undocumented adolescents were at greater risk of anxiety, and children in mixed-status families were at greater risk of anxiety and marginally greater risk of depressive symptoms. PMID:20611049
Farias Júnior, José Cazuza de; Reis, Rodrigo Siqueira; Hallal, Pedro Curi
2014-05-01
The aim of this study was to evaluate the association between levels of physical activity, psychosocial and perceived environmental factors in adolescents from Northeastern Brazil. A cross-sectional epidemiologic study was conducted with 2,859 adolescents enrolled in secondary schools (57.8% females; mean = 16.5 years; SD = 1.2) in the city of João Pessoa, Paraíba State, Brazil. The following physical activity correlates were measured: attitude, self-efficacy, social support from friends and parents, and perceived environmental characteristics. Physical activity was measured using a questionnaire. Multivariable ordinal logistic regression with proportional odds model analysis showed that the following factors are positively related to physical activity levels in adolescents: attitude, self-efficacy, as well as social support from parents and friends. Physical activity intervention programs should increase self-efficacy and social support from parents and friends, as well as a positive attitude toward physical activity.
Farren, G L; Zhang, T; Martin, S B; Thomas, K T
2017-01-01
To examine the relations of sex, exercise self-efficacy, outcome expectations, and social support with meeting physical activity guidelines (PAGs). Three hundred ninety-six college students participated in this study in the summer 2013. Students completed online questionnaires that assessed physical activity behaviors and psychosocial factors (ie, self-efficacy, outcome expectancies, and social support). Students' physical activity profile was categorized as meeting no PAGs, meeting aerobic PAGs only, meeting muscle-strengthening PAGs only, or meeting both PAGs. A multinomial logistic regression revealed that students' sex and psychosocial factors significantly affected the odds of meeting any and all PAGs. Sex significantly moderated the relationship between outcome expectancy and meeting aerobic PAGs and between outcome expectancy meeting muscle-strengthening PAGs. Results indicate that interventions designed to increase psychosocial factors may increase the likelihood of students meeting any and all PAGs. Social support may be especially beneficial for increasing muscle-strengthening activity.
Evidence in support of foster care during acute refugee crises.
Duerr, Ann; Posner, Samuel F; Gilbert, Mark
2003-11-01
The United Nations High Commissioner on Refugees (UNHCR) and United Nations Children's Fund (UNICEF) policy encourages foster care during refugee emergencies. We examined evidence to support this policy using data from the 1994 Rwandan refugee crisis. The association of weight gain and acute illness with family status (foster children vs children living with their biological families) was examined using latent growth curve and repeated measures logistic regression analysis. Weight gain for all children averaged 0.40 kg/month and was associated with child's age but not with family status, child's or caregiver's sex, caregiver's marital status, possession of blankets or plastic sheeting, severe malnutrition, month of enrollment, or acute illness. Illness was not more common among foster children than among children living with their biological families. This analysis supports the UNHCR/UNICEF recommendation of fostering for unaccompanied children during an acute refugee crisis.
Salihu, Hamisu M; Adegoke, Korede; Turner, DeAnne; Al Agili, Dania; Berry, Estrellita Lo
2017-04-01
This study examined the association between social support and health-related quality of life (HRQoL) among low-income women in the southeastern region of the United States. Analysis was performed on data from a community needs assessment survey that was designed to explore social determinants of health and QoL indicators using a community-based participatory research approach. The study sample comprised 132 women aged 18 years old and older. Bivariate analysis and logistic regressions with bootstrapping were performed. Social support was predictive of physical and mental HRQoL in a contrasting fashion, suggesting a complex relation. Other social determinants of global HRQoL independent of social support status include marital and employment status, maternal age, and income. Our results also demonstrate complex interaction patterns across race, social support, and HRQoL. The linkage between social support and HRQoL may not be a simple relation, as previously assumed. Rather, it is characterized by multifaceted interactions through which social determinants of health modulate the impact of social support on HRQoL. These are new findings.
Zhang, Xiaoyan; Ra, Chaelin Karen; Zhang, Donglan; Zhang, Yunting; MacLeod, Kara E
2016-01-01
National reports showed that over 20% of high school students were victims of bullying, which could potentially lead to psychological problems. School social support may be protective against mental distress linked with victimization. This study examined the main and moderating effects of social support from adults in schools on non-specific serious psychological distress (SPD) related to victimization among California adolescents. Utilizing the 2011-2012 California Health Interview Survey (CHIS), we analyzed a representative sample of 2,799 adolescents aged 12-17 years old. Logistic regression analyses were conducted modeling the odds of SPD in relation to school social support and victimization. Adolescents who were victimized were twice as likely to have SPD compared to non-victims. Higher level of social support from adults in schools was protective against SPD, but did not buffer the effect of bullying exposure. Findings from the present study suggested that adult support from schools can help with students' psychological problems but does not appear to prevent the psychological consequences of victimization. Additional intervention is needed, above and beyond social support, to prevent victimization and its psychological consequences.
Multinomial logistic regression in workers' health
NASA Astrophysics Data System (ADS)
Grilo, Luís M.; Grilo, Helena L.; Gonçalves, Sónia P.; Junça, Ana
2017-11-01
In European countries, namely in Portugal, it is common to hear some people mentioning that they are exposed to excessive and continuous psychosocial stressors at work. This is increasing in diverse activity sectors, such as, the Services sector. A representative sample was collected from a Portuguese Services' organization, by applying a survey (internationally validated), which variables were measured in five ordered categories in Likert-type scale. A multinomial logistic regression model is used to estimate the probability of each category of the dependent variable general health perception where, among other independent variables, burnout appear as statistically significant.
Du, Qing-Yun; Wang, En-Yin; Huang, Yan; Guo, Xiao-Yi; Xiong, Yu-Jing; Yu, Yi-Ping; Yao, Gui-Dong; Shi, Sen-Lin; Sun, Ying-Pu
2016-04-01
To evaluate the independent effects of the degree of blastocoele expansion and re-expansion and the inner cell mass (ICM) and trophectoderm (TE) grades on predicting live birth after fresh and vitrified/warmed single blastocyst transfer. Retrospective study. Reproductive medical center. Women undergoing 844 fresh and 370 vitrified/warmed single blastocyst transfer cycles. None. Live-birth rate correlated with blastocyst morphology parameters by logistic regression analysis and Spearman correlations analysis. The degree of blastocoele expansion and re-expansion was the only blastocyst morphology parameter that exhibited a significant ability to predict live birth in both fresh and vitrified/warmed single blastocyst transfer cycles respectively by multivariate logistic regression and Spearman correlations analysis. Although the ICM grade was significantly related to live birth in fresh cycles according to the univariate model, its effect was not maintained in the multivariate logistic analysis. In vitrified/warmed cycles, neither ICM nor TE grade was correlated with live birth by logistic regression analysis. This study is the first to confirm that the degree of blastocoele expansion and re-expansion is a better predictor of live birth after both fresh and vitrified/warmed single blastocyst transfer cycles than ICM or TE grade. Copyright © 2016. Published by Elsevier Inc.