What Are the Odds of that? A Primer on Understanding Logistic Regression
ERIC Educational Resources Information Center
Huang, Francis L.; Moon, Tonya R.
2013-01-01
The purpose of this Methodological Brief is to present a brief primer on logistic regression, a commonly used technique when modeling dichotomous outcomes. Using data from the National Education Longitudinal Study of 1988 (NELS:88), logistic regression techniques were used to investigate student-level variables in eighth grade (i.e., enrolled in a…
Li, Ji; Gray, B.R.; Bates, D.M.
2008-01-01
Partitioning the variance of a response by design levels is challenging for binomial and other discrete outcomes. Goldstein (2003) proposed four definitions for variance partitioning coefficients (VPC) under a two-level logistic regression model. In this study, we explicitly derived formulae for multi-level logistic regression model and subsequently studied the distributional properties of the calculated VPCs. Using simulations and a vegetation dataset, we demonstrated associations between different VPC definitions, the importance of methods for estimating VPCs (by comparing VPC obtained using Laplace and penalized quasilikehood methods), and bivariate dependence between VPCs calculated at different levels. Such an empirical study lends an immediate support to wider applications of VPC in scientific data analysis.
Parameters Estimation of Geographically Weighted Ordinal Logistic Regression (GWOLR) Model
NASA Astrophysics Data System (ADS)
Zuhdi, Shaifudin; Retno Sari Saputro, Dewi; Widyaningsih, Purnami
2017-06-01
A regression model is the representation of relationship between independent variable and dependent variable. The dependent variable has categories used in the logistic regression model to calculate odds on. The logistic regression model for dependent variable has levels in the logistics regression model is ordinal. GWOLR model is an ordinal logistic regression model influenced the geographical location of the observation site. Parameters estimation in the model needed to determine the value of a population based on sample. The purpose of this research is to parameters estimation of GWOLR model using R software. Parameter estimation uses the data amount of dengue fever patients in Semarang City. Observation units used are 144 villages in Semarang City. The results of research get GWOLR model locally for each village and to know probability of number dengue fever patient categories.
ERIC Educational Resources Information Center
Courtney, Jon R.; Prophet, Retta
2011-01-01
Placement instability is often associated with a number of negative outcomes for children. To gain state level contextual knowledge of factors associated with placement stability/instability, logistic regression was applied to selected variables from the New Mexico Adoption and Foster Care Administrative Reporting System dataset. Predictors…
NASA Astrophysics Data System (ADS)
Lin, Yingzhi; Deng, Xiangzheng; Li, Xing; Ma, Enjun
2014-12-01
Spatially explicit simulation of land use change is the basis for estimating the effects of land use and cover change on energy fluxes, ecology and the environment. At the pixel level, logistic regression is one of the most common approaches used in spatially explicit land use allocation models to determine the relationship between land use and its causal factors in driving land use change, and thereby to evaluate land use suitability. However, these models have a drawback in that they do not determine/allocate land use based on the direct relationship between land use change and its driving factors. Consequently, a multinomial logistic regression method was introduced to address this flaw, and thereby, judge the suitability of a type of land use in any given pixel in a case study area of the Jiangxi Province, China. A comparison of the two regression methods indicated that the proportion of correctly allocated pixels using multinomial logistic regression was 92.98%, which was 8.47% higher than that obtained using logistic regression. Paired t-test results also showed that pixels were more clearly distinguished by multinomial logistic regression than by logistic regression. In conclusion, multinomial logistic regression is a more efficient and accurate method for the spatial allocation of land use changes. The application of this method in future land use change studies may improve the accuracy of predicting the effects of land use and cover change on energy fluxes, ecology, and environment.
Choi, Seung Hoan; Labadorf, Adam T; Myers, Richard H; Lunetta, Kathryn L; Dupuis, Josée; DeStefano, Anita L
2017-02-06
Next generation sequencing provides a count of RNA molecules in the form of short reads, yielding discrete, often highly non-normally distributed gene expression measurements. Although Negative Binomial (NB) regression has been generally accepted in the analysis of RNA sequencing (RNA-Seq) data, its appropriateness has not been exhaustively evaluated. We explore logistic regression as an alternative method for RNA-Seq studies designed to compare cases and controls, where disease status is modeled as a function of RNA-Seq reads using simulated and Huntington disease data. We evaluate the effect of adjusting for covariates that have an unknown relationship with gene expression. Finally, we incorporate the data adaptive method in order to compare false positive rates. When the sample size is small or the expression levels of a gene are highly dispersed, the NB regression shows inflated Type-I error rates but the Classical logistic and Bayes logistic (BL) regressions are conservative. Firth's logistic (FL) regression performs well or is slightly conservative. Large sample size and low dispersion generally make Type-I error rates of all methods close to nominal alpha levels of 0.05 and 0.01. However, Type-I error rates are controlled after applying the data adaptive method. The NB, BL, and FL regressions gain increased power with large sample size, large log2 fold-change, and low dispersion. The FL regression has comparable power to NB regression. We conclude that implementing the data adaptive method appropriately controls Type-I error rates in RNA-Seq analysis. Firth's logistic regression provides a concise statistical inference process and reduces spurious associations from inaccurately estimated dispersion parameters in the negative binomial framework.
Intermediate and advanced topics in multilevel logistic regression analysis
Merlo, Juan
2017-01-01
Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher‐level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within‐cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population‐average effect of covariates measured at the subject and cluster level, in contrast to the within‐cluster or cluster‐specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster‐level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. PMID:28543517
Intermediate and advanced topics in multilevel logistic regression analysis.
Austin, Peter C; Merlo, Juan
2017-09-10
Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher-level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within-cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population-average effect of covariates measured at the subject and cluster level, in contrast to the within-cluster or cluster-specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster-level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.
Deng, Yingyuan; Wang, Tianfu; Chen, Siping; Liu, Weixiang
2017-01-01
The aim of the study is to screen the significant sonographic features by logistic regression analysis and fit a model to diagnose thyroid nodules. A total of 525 pathological thyroid nodules were retrospectively analyzed. All the nodules underwent conventional ultrasonography (US), strain elastosonography (SE), and contrast -enhanced ultrasound (CEUS). Those nodules’ 12 suspicious sonographic features were used to assess thyroid nodules. The significant features of diagnosing thyroid nodules were picked out by logistic regression analysis. All variables that were statistically related to diagnosis of thyroid nodules, at a level of p < 0.05 were embodied in a logistic regression analysis model. The significant features in the logistic regression model of diagnosing thyroid nodules were calcification, suspected cervical lymph node metastasis, hypoenhancement pattern, margin, shape, vascularity, posterior acoustic, echogenicity, and elastography score. According to the results of logistic regression analysis, the formula that could predict whether or not thyroid nodules are malignant was established. The area under the receiver operating curve (ROC) was 0.930 and the sensitivity, specificity, accuracy, positive predictive value, and negative predictive value were 83.77%, 89.56%, 87.05%, 86.04%, and 87.79% respectively. PMID:29228030
Pang, Tiantian; Huang, Leidan; Deng, Yingyuan; Wang, Tianfu; Chen, Siping; Gong, Xuehao; Liu, Weixiang
2017-01-01
The aim of the study is to screen the significant sonographic features by logistic regression analysis and fit a model to diagnose thyroid nodules. A total of 525 pathological thyroid nodules were retrospectively analyzed. All the nodules underwent conventional ultrasonography (US), strain elastosonography (SE), and contrast -enhanced ultrasound (CEUS). Those nodules' 12 suspicious sonographic features were used to assess thyroid nodules. The significant features of diagnosing thyroid nodules were picked out by logistic regression analysis. All variables that were statistically related to diagnosis of thyroid nodules, at a level of p < 0.05 were embodied in a logistic regression analysis model. The significant features in the logistic regression model of diagnosing thyroid nodules were calcification, suspected cervical lymph node metastasis, hypoenhancement pattern, margin, shape, vascularity, posterior acoustic, echogenicity, and elastography score. According to the results of logistic regression analysis, the formula that could predict whether or not thyroid nodules are malignant was established. The area under the receiver operating curve (ROC) was 0.930 and the sensitivity, specificity, accuracy, positive predictive value, and negative predictive value were 83.77%, 89.56%, 87.05%, 86.04%, and 87.79% respectively.
Kempe, P T; van Oppen, P; de Haan, E; Twisk, J W R; Sluis, A; Smit, J H; van Dyck, R; van Balkom, A J L M
2007-09-01
Two methods for predicting remissions in obsessive-compulsive disorder (OCD) treatment are evaluated. Y-BOCS measurements of 88 patients with a primary OCD (DSM-III-R) diagnosis were performed over a 16-week treatment period, and during three follow-ups. Remission at any measurement was defined as a Y-BOCS score lower than thirteen combined with a reduction of seven points when compared with baseline. Logistic regression models were compared with a Cox regression for recurrent events model. Logistic regression yielded different models at different evaluation times. The recurrent events model remained stable when fewer measurements were used. Higher baseline levels of neuroticism and more severe OCD symptoms were associated with a lower chance of remission, early age of onset and more depressive symptoms with a higher chance. Choice of outcome time affects logistic regression prediction models. Recurrent events analysis uses all information on remissions and relapses. Short- and long-term predictors for OCD remission show overlap.
ERIC Educational Resources Information Center
West, Lindsey M.; Davis, Telsie A.; Thompson, Martie P.; Kaslow, Nadine J.
2011-01-01
Protective factors for fostering reasons for living were examined among low-income, suicidal, African American women. Bivariate logistic regressions revealed that higher levels of optimism, spiritual well-being, and family social support predicted reasons for living. Multivariate logistic regressions indicated that spiritual well-being showed…
Two-factor logistic regression in pediatric liver transplantation
NASA Astrophysics Data System (ADS)
Uzunova, Yordanka; Prodanova, Krasimira; Spasov, Lyubomir
2017-12-01
Using a two-factor logistic regression analysis an estimate is derived for the probability of absence of infections in the early postoperative period after pediatric liver transplantation. The influence of both the bilirubin level and the international normalized ratio of prothrombin time of blood coagulation at the 5th postoperative day is studied.
Hansson, Lisbeth; Khamis, Harry J
2008-12-01
Simulated data sets are used to evaluate conditional and unconditional maximum likelihood estimation in an individual case-control design with continuous covariates when there are different rates of excluded cases and different levels of other design parameters. The effectiveness of the estimation procedures is measured by method bias, variance of the estimators, root mean square error (RMSE) for logistic regression and the percentage of explained variation. Conditional estimation leads to higher RMSE than unconditional estimation in the presence of missing observations, especially for 1:1 matching. The RMSE is higher for the smaller stratum size, especially for the 1:1 matching. The percentage of explained variation appears to be insensitive to missing data, but is generally higher for the conditional estimation than for the unconditional estimation. It is particularly good for the 1:2 matching design. For minimizing RMSE, a high matching ratio is recommended; in this case, conditional and unconditional logistic regression models yield comparable levels of effectiveness. For maximizing the percentage of explained variation, the 1:2 matching design with the conditional logistic regression model is recommended.
Independent Prognostic Factors for Acute Organophosphorus Pesticide Poisoning.
Tang, Weidong; Ruan, Feng; Chen, Qi; Chen, Suping; Shao, Xuebo; Gao, Jianbo; Zhang, Mao
2016-07-01
Acute organophosphorus pesticide poisoning (AOPP) is becoming a significant problem and a potential cause of human mortality because of the abuse of organophosphate compounds. This study aims to determine the independent prognostic factors of AOPP by using multivariate logistic regression analysis. The clinical data for 71 subjects with AOPP admitted to our hospital were retrospectively analyzed. This information included the Acute Physiology and Chronic Health Evaluation II (APACHE II) scores, 6-h post-admission blood lactate levels, post-admission 6-h lactate clearance rates, admission blood cholinesterase levels, 6-h post-admission blood cholinesterase levels, cholinesterase activity, blood pH, and other factors. Univariate analysis and multivariate logistic regression analyses were conducted to identify all prognostic factors and independent prognostic factors, respectively. A receiver operating characteristic curve was plotted to analyze the testing power of independent prognostic factors. Twelve of 71 subjects died. Admission blood lactate levels, 6-h post-admission blood lactate levels, post-admission 6-h lactate clearance rates, blood pH, and APACHE II scores were identified as prognostic factors for AOPP according to the univariate analysis, whereas only 6-h post-admission blood lactate levels, post-admission 6-h lactate clearance rates, and blood pH were independent prognostic factors identified by multivariate logistic regression analysis. The receiver operating characteristic analysis suggested that post-admission 6-h lactate clearance rates were of moderate diagnostic value. High 6-h post-admission blood lactate levels, low blood pH, and low post-admission 6-h lactate clearance rates were independent prognostic factors identified by multivariate logistic regression analysis. Copyright © 2016 by Daedalus Enterprises.
ERIC Educational Resources Information Center
Nguyen, Phuong L.
2006-01-01
This study examines the effects of parental SES, school quality, and community factors on children's enrollment and achievement in rural areas in Viet Nam, using logistic regression and ordered logistic regression. Multivariate analysis reveals significant differences in educational enrollment and outcomes by level of household expenditures and…
NASA Astrophysics Data System (ADS)
Ceppi, C.; Mancini, F.; Ritrovato, G.
2009-04-01
This study aim at the landslide susceptibility mapping within an area of the Daunia (Apulian Apennines, Italy) by a multivariate statistical method and data manipulation in a Geographical Information System (GIS) environment. Among the variety of existing statistical data analysis techniques, the logistic regression was chosen to produce a susceptibility map all over an area where small settlements are historically threatened by landslide phenomena. By logistic regression a best fitting between the presence or absence of landslide (dependent variable) and the set of independent variables is performed on the basis of a maximum likelihood criterion, bringing to the estimation of regression coefficients. The reliability of such analysis is therefore due to the ability to quantify the proneness to landslide occurrences by the probability level produced by the analysis. The inventory of dependent and independent variables were managed in a GIS, where geometric properties and attributes have been translated into raster cells in order to proceed with the logistic regression by means of SPSS (Statistical Package for the Social Sciences) package. A landslide inventory was used to produce the bivariate dependent variable whereas the independent set of variable concerned with slope, aspect, elevation, curvature, drained area, lithology and land use after their reductions to dummy variables. The effect of independent parameters on landslide occurrence was assessed by the corresponding coefficient in the logistic regression function, highlighting a major role played by the land use variable in determining occurrence and distribution of phenomena. Once the outcomes of the logistic regression are determined, data are re-introduced in the GIS to produce a map reporting the proneness to landslide as predicted level of probability. As validation of results and regression model a cell-by-cell comparison between the susceptibility map and the initial inventory of landslide events was performed and an agreement at 75% level achieved.
The use of generalized estimating equations in the analysis of motor vehicle crash data.
Hutchings, Caroline B; Knight, Stacey; Reading, James C
2003-01-01
The purpose of this study was to determine if it is necessary to use generalized estimating equations (GEEs) in the analysis of seat belt effectiveness in preventing injuries in motor vehicle crashes. The 1992 Utah crash dataset was used, excluding crash participants where seat belt use was not appropriate (n=93,633). The model used in the 1996 Report to Congress [Report to congress on benefits of safety belts and motorcycle helmets, based on data from the Crash Outcome Data Evaluation System (CODES). National Center for Statistics and Analysis, NHTSA, Washington, DC, February 1996] was analyzed for all occupants with logistic regression, one level of nesting (occupants within crashes), and two levels of nesting (occupants within vehicles within crashes) to compare the use of GEEs with logistic regression. When using one level of nesting compared to logistic regression, 13 of 16 variance estimates changed more than 10%, and eight of 16 parameter estimates changed more than 10%. In addition, three of the independent variables changed from significant to insignificant (alpha=0.05). With the use of two levels of nesting, two of 16 variance estimates and three of 16 parameter estimates changed more than 10% from the variance and parameter estimates in one level of nesting. One of the independent variables changed from insignificant to significant (alpha=0.05) in the two levels of nesting model; therefore, only two of the independent variables changed from significant to insignificant when the logistic regression model was compared to the two levels of nesting model. The odds ratio of seat belt effectiveness in preventing injuries was 12% lower when a one-level nested model was used. Based on these results, we stress the need to use a nested model and GEEs when analyzing motor vehicle crash data.
Use of generalized ordered logistic regression for the analysis of multidrug resistance data.
Agga, Getahun E; Scott, H Morgan
2015-10-01
Statistical analysis of antimicrobial resistance data largely focuses on individual antimicrobial's binary outcome (susceptible or resistant). However, bacteria are becoming increasingly multidrug resistant (MDR). Statistical analysis of MDR data is mostly descriptive often with tabular or graphical presentations. Here we report the applicability of generalized ordinal logistic regression model for the analysis of MDR data. A total of 1,152 Escherichia coli, isolated from the feces of weaned pigs experimentally supplemented with chlortetracycline (CTC) and copper, were tested for susceptibilities against 15 antimicrobials and were binary classified into resistant or susceptible. The 15 antimicrobial agents tested were grouped into eight different antimicrobial classes. We defined MDR as the number of antimicrobial classes to which E. coli isolates were resistant ranging from 0 to 8. Proportionality of the odds assumption of the ordinal logistic regression model was violated only for the effect of treatment period (pre-treatment, during-treatment and post-treatment); but not for the effect of CTC or copper supplementation. Subsequently, a partially constrained generalized ordinal logistic model was built that allows for the effect of treatment period to vary while constraining the effects of treatment (CTC and copper supplementation) to be constant across the levels of MDR classes. Copper (Proportional Odds Ratio [Prop OR]=1.03; 95% CI=0.73-1.47) and CTC (Prop OR=1.1; 95% CI=0.78-1.56) supplementation were not significantly associated with the level of MDR adjusted for the effect of treatment period. MDR generally declined over the trial period. In conclusion, generalized ordered logistic regression can be used for the analysis of ordinal data such as MDR data when the proportionality assumptions for ordered logistic regression are violated. Published by Elsevier B.V.
Modeling Governance KB with CATPCA to Overcome Multicollinearity in the Logistic Regression
NASA Astrophysics Data System (ADS)
Khikmah, L.; Wijayanto, H.; Syafitri, U. D.
2017-04-01
The problem often encounters in logistic regression modeling are multicollinearity problems. Data that have multicollinearity between explanatory variables with the result in the estimation of parameters to be bias. Besides, the multicollinearity will result in error in the classification. In general, to overcome multicollinearity in regression used stepwise regression. They are also another method to overcome multicollinearity which involves all variable for prediction. That is Principal Component Analysis (PCA). However, classical PCA in only for numeric data. Its data are categorical, one method to solve the problems is Categorical Principal Component Analysis (CATPCA). Data were used in this research were a part of data Demographic and Population Survey Indonesia (IDHS) 2012. This research focuses on the characteristic of women of using the contraceptive methods. Classification results evaluated using Area Under Curve (AUC) values. The higher the AUC value, the better. Based on AUC values, the classification of the contraceptive method using stepwise method (58.66%) is better than the logistic regression model (57.39%) and CATPCA (57.39%). Evaluation of the results of logistic regression using sensitivity, shows the opposite where CATPCA method (99.79%) is better than logistic regression method (92.43%) and stepwise (92.05%). Therefore in this study focuses on major class classification (using a contraceptive method), then the selected model is CATPCA because it can raise the level of the major class model accuracy.
NASA Astrophysics Data System (ADS)
Zeraatpisheh, Mojtaba; Ayoubi, Shamsollah; Jafari, Azam; Finke, Peter
2017-05-01
The efficiency of different digital and conventional soil mapping approaches to produce categorical maps of soil types is determined by cost, sample size, accuracy and the selected taxonomic level. The efficiency of digital and conventional soil mapping approaches was examined in the semi-arid region of Borujen, central Iran. This research aimed to (i) compare two digital soil mapping approaches including Multinomial logistic regression and random forest, with the conventional soil mapping approach at four soil taxonomic levels (order, suborder, great group and subgroup levels), (ii) validate the predicted soil maps by the same validation data set to determine the best method for producing the soil maps, and (iii) select the best soil taxonomic level by different approaches at three sample sizes (100, 80, and 60 point observations), in two scenarios with and without a geomorphology map as a spatial covariate. In most predicted maps, using both digital soil mapping approaches, the best results were obtained using the combination of terrain attributes and the geomorphology map, although differences between the scenarios with and without the geomorphology map were not significant. Employing the geomorphology map increased map purity and the Kappa index, and led to a decrease in the 'noisiness' of soil maps. Multinomial logistic regression had better performance at higher taxonomic levels (order and suborder levels); however, random forest showed better performance at lower taxonomic levels (great group and subgroup levels). Multinomial logistic regression was less sensitive than random forest to a decrease in the number of training observations. The conventional soil mapping method produced a map with larger minimum polygon size because of traditional cartographic criteria used to make the geological map 1:100,000 (on which the conventional soil mapping map was largely based). Likewise, conventional soil mapping map had also a larger average polygon size that resulted in a lower level of detail. Multinomial logistic regression at the order level (map purity of 0.80), random forest at the suborder (map purity of 0.72) and great group level (map purity of 0.60), and conventional soil mapping at the subgroup level (map purity of 0.48) produced the most accurate maps in the study area. The multinomial logistic regression method was identified as the most effective approach based on a combined index of map purity, map information content, and map production cost. The combined index also showed that smaller sample size led to a preference for the order level, while a larger sample size led to a preference for the great group level.
A comparative study on entrepreneurial attitudes modeled with logistic regression and Bayes nets.
López Puga, Jorge; García García, Juan
2012-11-01
Entrepreneurship research is receiving increasing attention in our context, as entrepreneurs are key social agents involved in economic development. We compare the success of the dichotomic logistic regression model and the Bayes simple classifier to predict entrepreneurship, after manipulating the percentage of missing data and the level of categorization in predictors. A sample of undergraduate university students (N = 1230) completed five scales (motivation, attitude towards business creation, obstacles, deficiencies, and training needs) and we found that each of them predicted different aspects of the tendency to business creation. Additionally, our results show that the receiver operating characteristic (ROC) curve is affected by the rate of missing data in both techniques, but logistic regression seems to be more vulnerable when faced with missing data, whereas Bayes nets underperform slightly when categorization has been manipulated. Our study sheds light on the potential entrepreneur profile and we propose to use Bayesian networks as an additional alternative to overcome the weaknesses of logistic regression when missing data are present in applied research.
McLaren, Christine E.; Chen, Wen-Pin; Nie, Ke; Su, Min-Ying
2009-01-01
Rationale and Objectives Dynamic contrast enhanced MRI (DCE-MRI) is a clinical imaging modality for detection and diagnosis of breast lesions. Analytical methods were compared for diagnostic feature selection and performance of lesion classification to differentiate between malignant and benign lesions in patients. Materials and Methods The study included 43 malignant and 28 benign histologically-proven lesions. Eight morphological parameters, ten gray level co-occurrence matrices (GLCM) texture features, and fourteen Laws’ texture features were obtained using automated lesion segmentation and quantitative feature extraction. Artificial neural network (ANN) and logistic regression analysis were compared for selection of the best predictors of malignant lesions among the normalized features. Results Using ANN, the final four selected features were compactness, energy, homogeneity, and Law_LS, with area under the receiver operating characteristic curve (AUC) = 0.82, and accuracy = 0.76. The diagnostic performance of these 4-features computed on the basis of logistic regression yielded AUC = 0.80 (95% CI, 0.688 to 0.905), similar to that of ANN. The analysis also shows that the odds of a malignant lesion decreased by 48% (95% CI, 25% to 92%) for every increase of 1 SD in the Law_LS feature, adjusted for differences in compactness, energy, and homogeneity. Using logistic regression with z-score transformation, a model comprised of compactness, NRL entropy, and gray level sum average was selected, and it had the highest overall accuracy of 0.75 among all models, with AUC = 0.77 (95% CI, 0.660 to 0.880). When logistic modeling of transformations using the Box-Cox method was performed, the most parsimonious model with predictors, compactness and Law_LS, had an AUC of 0.79 (95% CI, 0.672 to 0.898). Conclusion The diagnostic performance of models selected by ANN and logistic regression was similar. The analytic methods were found to be roughly equivalent in terms of predictive ability when a small number of variables were chosen. The robust ANN methodology utilizes a sophisticated non-linear model, while logistic regression analysis provides insightful information to enhance interpretation of the model features. PMID:19409817
Ordinal logistic regression analysis on the nutritional status of children in KarangKitri village
NASA Astrophysics Data System (ADS)
Ohyver, Margaretha; Yongharto, Kimmy Octavian
2015-09-01
Ordinal logistic regression is a statistical technique that can be used to describe the relationship between ordinal response variable with one or more independent variables. This method has been used in various fields including in the health field. In this research, ordinal logistic regression is used to describe the relationship between nutritional status of children with age, gender, height, and family status. Nutritional status of children in this research is divided into over nutrition, well nutrition, less nutrition, and malnutrition. The purpose for this research is to describe the characteristics of children in the KarangKitri Village and to determine the factors that influence the nutritional status of children in the KarangKitri village. There are three things that obtained from this research. First, there are still children who are not categorized as well nutritional status. Second, there are children who come from sufficient economic level which include in not normal status. Third, the factors that affect the nutritional level of children are age, family status, and height.
Access disparities to Magnet hospitals for patients undergoing neurosurgical operations
Missios, Symeon; Bekelis, Kimon
2017-01-01
Background Centers of excellence focusing on quality improvement have demonstrated superior outcomes for a variety of surgical interventions. We investigated the presence of access disparities to hospitals recognized by the Magnet Recognition Program of the American Nurses Credentialing Center (ANCC) for patients undergoing neurosurgical operations. Methods We performed a cohort study of all neurosurgery patients who were registered in the New York Statewide Planning and Research Cooperative System (SPARCS) database from 2009–2013. We examined the association of African-American race and lack of insurance with Magnet status hospitalization for neurosurgical procedures. A mixed effects propensity adjusted multivariable regression analysis was used to control for confounding. Results During the study period, 190,535 neurosurgical patients met the inclusion criteria. Using a multivariable logistic regression, we demonstrate that African-Americans had lower admission rates to Magnet institutions (OR 0.62; 95% CI, 0.58–0.67). This persisted in a mixed effects logistic regression model (OR 0.77; 95% CI, 0.70–0.83) to adjust for clustering at the patient county level, and a propensity score adjusted logistic regression model (OR 0.75; 95% CI, 0.69–0.82). Additionally, lack of insurance was associated with lower admission rates to Magnet institutions (OR 0.71; 95% CI, 0.68–0.73), in a multivariable logistic regression model. This persisted in a mixed effects logistic regression model (OR 0.72; 95% CI, 0.69–0.74), and a propensity score adjusted logistic regression model (OR 0.72; 95% CI, 0.69–0.75). Conclusions Using a comprehensive all-payer cohort of neurosurgery patients in New York State we identified an association of African-American race and lack of insurance with lower rates of admission to Magnet hospitals. PMID:28684152
Adjusting for Confounding in Early Postlaunch Settings: Going Beyond Logistic Regression Models.
Schmidt, Amand F; Klungel, Olaf H; Groenwold, Rolf H H
2016-01-01
Postlaunch data on medical treatments can be analyzed to explore adverse events or relative effectiveness in real-life settings. These analyses are often complicated by the number of potential confounders and the possibility of model misspecification. We conducted a simulation study to compare the performance of logistic regression, propensity score, disease risk score, and stabilized inverse probability weighting methods to adjust for confounding. Model misspecification was induced in the independent derivation dataset. We evaluated performance using relative bias confidence interval coverage of the true effect, among other metrics. At low events per coefficient (1.0 and 0.5), the logistic regression estimates had a large relative bias (greater than -100%). Bias of the disease risk score estimates was at most 13.48% and 18.83%. For the propensity score model, this was 8.74% and >100%, respectively. At events per coefficient of 1.0 and 0.5, inverse probability weighting frequently failed or reduced to a crude regression, resulting in biases of -8.49% and 24.55%. Coverage of logistic regression estimates became less than the nominal level at events per coefficient ≤5. For the disease risk score, inverse probability weighting, and propensity score, coverage became less than nominal at events per coefficient ≤2.5, ≤1.0, and ≤1.0, respectively. Bias of misspecified disease risk score models was 16.55%. In settings with low events/exposed subjects per coefficient, disease risk score methods can be useful alternatives to logistic regression models, especially when propensity score models cannot be used. Despite better performance of disease risk score methods than logistic regression and propensity score models in small events per coefficient settings, bias, and coverage still deviated from nominal.
Wang, Shuang; Jiang, Xiaoqian; Wu, Yuan; Cui, Lijuan; Cheng, Samuel; Ohno-Machado, Lucila
2013-01-01
We developed an EXpectation Propagation LOgistic REgRession (EXPLORER) model for distributed privacy-preserving online learning. The proposed framework provides a high level guarantee for protecting sensitive information, since the information exchanged between the server and the client is the encrypted posterior distribution of coefficients. Through experimental results, EXPLORER shows the same performance (e.g., discrimination, calibration, feature selection etc.) as the traditional frequentist Logistic Regression model, but provides more flexibility in model updating. That is, EXPLORER can be updated one point at a time rather than having to retrain the entire data set when new observations are recorded. The proposed EXPLORER supports asynchronized communication, which relieves the participants from coordinating with one another, and prevents service breakdown from the absence of participants or interrupted communications. PMID:23562651
Effect of folic acid on appetite in children: ordinal logistic and fuzzy logistic regressions.
Namdari, Mahshid; Abadi, Alireza; Taheri, S Mahmoud; Rezaei, Mansour; Kalantari, Naser; Omidvar, Nasrin
2014-03-01
Reduced appetite and low food intake are often a concern in preschool children, since it can lead to malnutrition, a leading cause of impaired growth and mortality in childhood. It is occasionally considered that folic acid has a positive effect on appetite enhancement and consequently growth in children. The aim of this study was to assess the effect of folic acid on the appetite of preschool children 3 to 6 y old. The study sample included 127 children ages 3 to 6 who were randomly selected from 20 preschools in the city of Tehran in 2011. Since appetite was measured by linguistic terms, a fuzzy logistic regression was applied for modeling. The obtained results were compared with a statistical ordinal logistic model. After controlling for the potential confounders, in a statistical ordinal logistic model, serum folate showed a significantly positive effect on appetite. A small but positive effect of folate was detected by fuzzy logistic regression. Based on fuzzy regression, the risk for poor appetite in preschool children was related to the employment status of their mothers. In this study, a positive association was detected between the levels of serum folate and improved appetite. For further investigation, a randomized controlled, double-blind clinical trial could be helpful to address causality. Copyright © 2014 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Ariffin, Syaiba Balqish; Midi, Habshah
2014-06-01
This article is concerned with the performance of logistic ridge regression estimation technique in the presence of multicollinearity and high leverage points. In logistic regression, multicollinearity exists among predictors and in the information matrix. The maximum likelihood estimator suffers a huge setback in the presence of multicollinearity which cause regression estimates to have unduly large standard errors. To remedy this problem, a logistic ridge regression estimator is put forward. It is evident that the logistic ridge regression estimator outperforms the maximum likelihood approach for handling multicollinearity. The effect of high leverage points are then investigated on the performance of the logistic ridge regression estimator through real data set and simulation study. The findings signify that logistic ridge regression estimator fails to provide better parameter estimates in the presence of both high leverage points and multicollinearity.
Akkus, Zeki; Camdeviren, Handan; Celik, Fatma; Gur, Ali; Nas, Kemal
2005-09-01
To determine the risk factors of osteoporosis using a multiple binary logistic regression method and to assess the risk variables for osteoporosis, which is a major and growing health problem in many countries. We presented a case-control study, consisting of 126 postmenopausal healthy women as control group and 225 postmenopausal osteoporotic women as the case group. The study was carried out in the Department of Physical Medicine and Rehabilitation, Dicle University, Diyarbakir, Turkey between 1999-2002. The data from the 351 participants were collected using a standard questionnaire that contains 43 variables. A multiple logistic regression model was then used to evaluate the data and to find the best regression model. We classified 80.1% (281/351) of the participants using the regression model. Furthermore, the specificity value of the model was 67% (84/126) of the control group while the sensitivity value was 88% (197/225) of the case group. We found the distribution of residual values standardized for final model to be exponential using the Kolmogorow-Smirnow test (p=0.193). The receiver operating characteristic curve was found successful to predict patients with risk for osteoporosis. This study suggests that low levels of dietary calcium intake, physical activity, education, and longer duration of menopause are independent predictors of the risk of low bone density in our population. Adequate dietary calcium intake in combination with maintaining a daily physical activity, increasing educational level, decreasing birth rate, and duration of breast-feeding may contribute to healthy bones and play a role in practical prevention of osteoporosis in Southeast Anatolia. In addition, the findings of the present study indicate that the use of multivariate statistical method as a multiple logistic regression in osteoporosis, which maybe influenced by many variables, is better than univariate statistical evaluation.
Dembo, Richard; Belenko, Steven; Childs, Kristina; Wareham, Jennifer; Schmeidler, James
2009-08-01
High rates of infection for chlamydia and gonorrhea have been noted among youths involved in the juvenile justice system. Although both individual and community-level factors have been found to be associated with sexually transmitted disease (STD) risk, their relative importance has not been tested in this population. A two-level logistic regression analysis was completed to assess the influence of individual-level and community-level predictors on STD test results among arrested youths processed at a centralized intake facility. Results from weighted two level logistic regression analyses (n = 1,368) indicated individual-level factors of gender (being female), age, race (being African American), and criminal history predicted the youths' positive STD status. For the community-level predictors, concentrated disadvantage significantly and positively predicted the youths' STD status. Implications of these findings for future research and public health policy are discussed.
Sample size determination for logistic regression on a logit-normal distribution.
Kim, Seongho; Heath, Elisabeth; Heilbrun, Lance
2017-06-01
Although the sample size for simple logistic regression can be readily determined using currently available methods, the sample size calculation for multiple logistic regression requires some additional information, such as the coefficient of determination ([Formula: see text]) of a covariate of interest with other covariates, which is often unavailable in practice. The response variable of logistic regression follows a logit-normal distribution which can be generated from a logistic transformation of a normal distribution. Using this property of logistic regression, we propose new methods of determining the sample size for simple and multiple logistic regressions using a normal transformation of outcome measures. Simulation studies and a motivating example show several advantages of the proposed methods over the existing methods: (i) no need for [Formula: see text] for multiple logistic regression, (ii) available interim or group-sequential designs, and (iii) much smaller required sample size.
Goltz, Annemarie; Janowitz, Deborah; Hannemann, Anke; Nauck, Matthias; Hoffmann, Johanna; Seyfart, Tom; Völzke, Henry; Terock, Jan; Grabe, Hans Jörgen
2018-06-19
Depression and obesity are widespread and closely linked. Brain-derived neurotrophic factor (BDNF) and vitamin D are both assumed to be associated with depression and obesity. Little is known about the interplay between vitamin D and BDNF. We explored the putative associations and interactions between serum BDNF and vitamin D levels with depressive symptoms and abdominal obesity in a large population-based cohort. Data were obtained from the population-based Study of Health in Pomerania (SHIP)-Trend (n = 3,926). The associations of serum BDNF and vitamin D levels with depressive symptoms (measured using the Patient Health Questionnaire) were assessed with binary and multinomial logistic regression models. The associations of serum BDNF and vitamin D levels with obesity (measured by the waist-to-hip ratio [WHR]) were assessed with binary logistic and linear regression models with restricted cubic splines. Logistic regression models revealed inverse associations of vitamin D with depression (OR = 0.966; 95% CI 0.951-0.981) and obesity (OR = 0.976; 95% CI 0.967-0.985). No linear association of serum BDNF with depression or obesity was found. However, linear regression models revealed a U-shaped association of BDNF with WHR (p < 0.001). Vitamin D was inversely associated with depression and obesity. BDNF was associated with abdominal obesity, but not with depression. At the population level, our results support the relevant roles of vitamin D and BDNF in mental and physical health-related outcomes. © 2018 S. Karger AG, Basel.
Staley, James R; Jones, Edmund; Kaptoge, Stephen; Butterworth, Adam S; Sweeting, Michael J; Wood, Angela M; Howson, Joanna M M
2017-06-01
Logistic regression is often used instead of Cox regression to analyse genome-wide association studies (GWAS) of single-nucleotide polymorphisms (SNPs) and disease outcomes with cohort and case-cohort designs, as it is less computationally expensive. Although Cox and logistic regression models have been compared previously in cohort studies, this work does not completely cover the GWAS setting nor extend to the case-cohort study design. Here, we evaluated Cox and logistic regression applied to cohort and case-cohort genetic association studies using simulated data and genetic data from the EPIC-CVD study. In the cohort setting, there was a modest improvement in power to detect SNP-disease associations using Cox regression compared with logistic regression, which increased as the disease incidence increased. In contrast, logistic regression had more power than (Prentice weighted) Cox regression in the case-cohort setting. Logistic regression yielded inflated effect estimates (assuming the hazard ratio is the underlying measure of association) for both study designs, especially for SNPs with greater effect on disease. Given logistic regression is substantially more computationally efficient than Cox regression in both settings, we propose a two-step approach to GWAS in cohort and case-cohort studies. First to analyse all SNPs with logistic regression to identify associated variants below a pre-defined P-value threshold, and second to fit Cox regression (appropriately weighted in case-cohort studies) to those identified SNPs to ensure accurate estimation of association with disease.
Wang, Shuang; Jiang, Xiaoqian; Wu, Yuan; Cui, Lijuan; Cheng, Samuel; Ohno-Machado, Lucila
2013-06-01
We developed an EXpectation Propagation LOgistic REgRession (EXPLORER) model for distributed privacy-preserving online learning. The proposed framework provides a high level guarantee for protecting sensitive information, since the information exchanged between the server and the client is the encrypted posterior distribution of coefficients. Through experimental results, EXPLORER shows the same performance (e.g., discrimination, calibration, feature selection, etc.) as the traditional frequentist logistic regression model, but provides more flexibility in model updating. That is, EXPLORER can be updated one point at a time rather than having to retrain the entire data set when new observations are recorded. The proposed EXPLORER supports asynchronized communication, which relieves the participants from coordinating with one another, and prevents service breakdown from the absence of participants or interrupted communications. Copyright © 2013 Elsevier Inc. All rights reserved.
Dudley, Robert W.; Hodgkins, Glenn A.; Dickinson, Jesse
2017-01-01
We present a logistic regression approach for forecasting the probability of future groundwater levels declining or maintaining below specific groundwater-level thresholds. We tested our approach on 102 groundwater wells in different climatic regions and aquifers of the United States that are part of the U.S. Geological Survey Groundwater Climate Response Network. We evaluated the importance of current groundwater levels, precipitation, streamflow, seasonal variability, Palmer Drought Severity Index, and atmosphere/ocean indices for developing the logistic regression equations. Several diagnostics of model fit were used to evaluate the regression equations, including testing of autocorrelation of residuals, goodness-of-fit metrics, and bootstrap validation testing. The probabilistic predictions were most successful at wells with high persistence (low month-to-month variability) in their groundwater records and at wells where the groundwater level remained below the defined low threshold for sustained periods (generally three months or longer). The model fit was weakest at wells with strong seasonal variability in levels and with shorter duration low-threshold events. We identified challenges in deriving probabilistic-forecasting models and possible approaches for addressing those challenges.
The crux of the method: assumptions in ordinary least squares and logistic regression.
Long, Rebecca G
2008-10-01
Logistic regression has increasingly become the tool of choice when analyzing data with a binary dependent variable. While resources relating to the technique are widely available, clear discussions of why logistic regression should be used in place of ordinary least squares regression are difficult to find. The current paper compares and contrasts the assumptions of ordinary least squares with those of logistic regression and explains why logistic regression's looser assumptions make it adept at handling violations of the more important assumptions in ordinary least squares.
Sparse Logistic Regression for Diagnosis of Liver Fibrosis in Rat by Using SCAD-Penalized Likelihood
Yan, Fang-Rong; Lin, Jin-Guan; Liu, Yu
2011-01-01
The objective of the present study is to find out the quantitative relationship between progression of liver fibrosis and the levels of certain serum markers using mathematic model. We provide the sparse logistic regression by using smoothly clipped absolute deviation (SCAD) penalized function to diagnose the liver fibrosis in rats. Not only does it give a sparse solution with high accuracy, it also provides the users with the precise probabilities of classification with the class information. In the simulative case and the experiment case, the proposed method is comparable to the stepwise linear discriminant analysis (SLDA) and the sparse logistic regression with least absolute shrinkage and selection operator (LASSO) penalty, by using receiver operating characteristic (ROC) with bayesian bootstrap estimating area under the curve (AUC) diagnostic sensitivity for selected variable. Results show that the new approach provides a good correlation between the serum marker levels and the liver fibrosis induced by thioacetamide (TAA) in rats. Meanwhile, this approach might also be used in predicting the development of liver cirrhosis. PMID:21716672
Using Dominance Analysis to Determine Predictor Importance in Logistic Regression
ERIC Educational Resources Information Center
Azen, Razia; Traxel, Nicole
2009-01-01
This article proposes an extension of dominance analysis that allows researchers to determine the relative importance of predictors in logistic regression models. Criteria for choosing logistic regression R[superscript 2] analogues were determined and measures were selected that can be used to perform dominance analysis in logistic regression. A…
Ngo, Long H; Inouye, Sharon K; Jones, Richard N; Travison, Thomas G; Libermann, Towia A; Dillon, Simon T; Kuchel, George A; Vasunilashorn, Sarinnapha M; Alsop, David C; Marcantonio, Edward R
2017-06-06
The nested case-control study (NCC) design within a prospective cohort study is used when outcome data are available for all subjects, but the exposure of interest has not been collected, and is difficult or prohibitively expensive to obtain for all subjects. A NCC analysis with good matching procedures yields estimates that are as efficient and unbiased as estimates from the full cohort study. We present methodological considerations in a matched NCC design and analysis, which include the choice of match algorithms, analysis methods to evaluate the association of exposures of interest with outcomes, and consideration of overmatching. Matched, NCC design within a longitudinal observational prospective cohort study in the setting of two academic hospitals. Study participants are patients aged over 70 years who underwent scheduled major non-cardiac surgery. The primary outcome was postoperative delirium from in-hospital interviews and medical record review. The main exposure was IL-6 concentration (pg/ml) from blood sampled at three time points before delirium occurred. We used nonparametric signed ranked test to test for the median of the paired differences. We used conditional logistic regression to model the risk of IL-6 on delirium incidence. Simulation was used to generate a sample of cohort data on which unconditional multivariable logistic regression was used, and the results were compared to those of the conditional logistic regression. Partial R-square was used to assess the level of overmatching. We found that the optimal match algorithm yielded more matched pairs than the greedy algorithm. The choice of analytic strategy-whether to consider measured cytokine levels as the predictor or outcome-- yielded inferences that have different clinical interpretations but similar levels of statistical significance. Estimation results from NCC design using conditional logistic regression, and from simulated cohort design using unconditional logistic regression, were similar. We found minimal evidence for overmatching. Using a matched NCC approach introduces methodological challenges into the study design and data analysis. Nonetheless, with careful selection of the match algorithm, match factors, and analysis methods, this design is cost effective and, for our study, yields estimates that are similar to those from a prospective cohort study design.
Applying Kaplan-Meier to Item Response Data
ERIC Educational Resources Information Center
McNeish, Daniel
2018-01-01
Some IRT models can be equivalently modeled in alternative frameworks such as logistic regression. Logistic regression can also model time-to-event data, which concerns the probability of an event occurring over time. Using the relation between time-to-event models and logistic regression and the relation between logistic regression and IRT, this…
Jovanovic, Milos; Radovanovic, Sandro; Vukicevic, Milan; Van Poucke, Sven; Delibasic, Boris
2016-09-01
Quantification and early identification of unplanned readmission risk have the potential to improve the quality of care during hospitalization and after discharge. However, high dimensionality, sparsity, and class imbalance of electronic health data and the complexity of risk quantification, challenge the development of accurate predictive models. Predictive models require a certain level of interpretability in order to be applicable in real settings and create actionable insights. This paper aims to develop accurate and interpretable predictive models for readmission in a general pediatric patient population, by integrating a data-driven model (sparse logistic regression) and domain knowledge based on the international classification of diseases 9th-revision clinical modification (ICD-9-CM) hierarchy of diseases. Additionally, we propose a way to quantify the interpretability of a model and inspect the stability of alternative solutions. The analysis was conducted on >66,000 pediatric hospital discharge records from California, State Inpatient Databases, Healthcare Cost and Utilization Project between 2009 and 2011. We incorporated domain knowledge based on the ICD-9-CM hierarchy in a data driven, Tree-Lasso regularized logistic regression model, providing the framework for model interpretation. This approach was compared with traditional Lasso logistic regression resulting in models that are easier to interpret by fewer high-level diagnoses, with comparable prediction accuracy. The results revealed that the use of a Tree-Lasso model was as competitive in terms of accuracy (measured by area under the receiver operating characteristic curve-AUC) as the traditional Lasso logistic regression, but integration with the ICD-9-CM hierarchy of diseases provided more interpretable models in terms of high-level diagnoses. Additionally, interpretations of models are in accordance with existing medical understanding of pediatric readmission. Best performing models have similar performances reaching AUC values 0.783 and 0.779 for traditional Lasso and Tree-Lasso, respectfully. However, information loss of Lasso models is 0.35 bits higher compared to Tree-Lasso model. We propose a method for building predictive models applicable for the detection of readmission risk based on Electronic Health records. Integration of domain knowledge (in the form of ICD-9-CM taxonomy) and a data-driven, sparse predictive algorithm (Tree-Lasso Logistic Regression) resulted in an increase of interpretability of the resulting model. The models are interpreted for the readmission prediction problem in general pediatric population in California, as well as several important subpopulations, and the interpretations of models comply with existing medical understanding of pediatric readmission. Finally, quantitative assessment of the interpretability of the models is given, that is beyond simple counts of selected low-level features. Copyright © 2016 Elsevier B.V. All rights reserved.
Dietary consumption patterns and laryngeal cancer risk.
Vlastarakos, Petros V; Vassileiou, Andrianna; Delicha, Evie; Kikidis, Dimitrios; Protopapas, Dimosthenis; Nikolopoulos, Thomas P
2016-06-01
We conducted a case-control study to investigate the effect of diet on laryngeal carcinogenesis. Our study population was made up of 140 participants-70 patients with laryngeal cancer (LC) and 70 controls with a non-neoplastic condition that was unrelated to diet, smoking, or alcohol. A food-frequency questionnaire determined the mean consumption of 113 different items during the 3 years prior to symptom onset. Total energy intake and cooking mode were also noted. The relative risk, odds ratio (OR), and 95% confidence interval (CI) were estimated by multiple logistic regression analysis. We found that the total energy intake was significantly higher in the LC group (p < 0.001), and that the difference remained statistically significant after logistic regression analysis (p < 0.001; OR: 118.70). Notably, meat consumption was higher in the LC group (p < 0.001), and the difference remained significant after logistic regression analysis (p = 0.029; OR: 1.16). LC patients also consumed significantly more fried food (p = 0.036); this difference also remained significant in the logistic regression model (p = 0.026; OR: 5.45). The LC group also consumed significantly more seafood (p = 0.012); the difference persisted after logistic regression analysis (p = 0.009; OR: 2.48), with the consumption of shrimp proving detrimental (p = 0.049; OR: 2.18). Finally, the intake of zinc was significantly higher in the LC group before and after logistic regression analysis (p = 0.034 and p = 0.011; OR: 30.15, respectively). Cereal consumption (including pastas) was also higher among the LC patients (p = 0.043), with logistic regression analysis showing that their negative effect was possibly associated with the sauces and dressings that traditionally accompany pasta dishes (p = 0.006; OR: 4.78). Conversely, a higher consumption of dairy products was found in controls (p < 0.05); logistic regression analysis showed that calcium appeared to be protective at the micronutrient level (p < 0.001; OR: 0.27). We found no difference in the overall consumption of fruits and vegetables between the LC patients and controls; however, the LC patients did have a greater consumption of cooked tomatoes and cooked root vegetables (p = 0.039 for both), and the controls had more consumption of leeks (p = 0.042) and, among controls younger than 65 years, cooked beans (p = 0.037). Lemon (p = 0.037), squeezed fruit juice (p = 0.032), and watermelon (p = 0.018) were also more frequently consumed by the controls. Other differences at the micronutrient level included greater consumption by the LC patients of retinol (p = 0.044), polyunsaturated fats (p = 0.041), and linoleic acid (p = 0.008); LC patients younger than 65 years also had greater intake of riboflavin (p = 0.045). We conclude that the differences in dietary consumption patterns between LC patients and controls indicate a possible role for lifestyle modifications involving nutritional factors as a means of decreasing the risk of laryngeal cancer.
Cancer prevalence and education by cancer site: logistic regression analysis.
Johnson, Stephanie; Corsten, Martin J; McDonald, James T; Gupta, Michael
2010-10-01
Previously, using the American National Health Interview Survey (NHIS) and a logistic regression analysis, we found that upper aerodigestive tract (UADT) cancer is correlated with low socioeconomic status (SES). The objective of this study was to determine if this correlation between low SES and cancer prevalence exists for other cancers. We again used the NHIS and employed education level as our main measure of SES. We controlled for potentially confounding factors, including smoking status and alcohol consumption. We found that only two cancer subsites shared the pattern of increased prevalence with low education level and decreased prevalence with high education level: UADT cancer and cervical cancer. UADT cancer and cervical cancer were the only two cancers identified that had a link between prevalence and lower education level. This raises the possibility that an associated risk factor for the two cancers is causing the relationship between lower education level and prevalence.
Impact of Contextual Factors on Prostate Cancer Risk and Outcomes
2013-07-01
framework with random effects (“frailty models”) while the case-control analyses (Aim 4) will use multilevel unconditional logistic regression models...contextual-level SES on prostate cancer risk within racial/ethnic groups. The survival analyses (Aims 1-3) will utilize a proportional hazards regression
ERIC Educational Resources Information Center
Davidson, J. Cody
2016-01-01
Mathematics is the most common subject area of remedial need and the majority of remedial math students never pass a college-level credit-bearing math class. The majorities of studies that investigate this phenomenon are conducted at community colleges and use some type of regression model; however, none have used a continuation ratio model. The…
Jiang, Yanlin; Xu, Hong; Zhang, Hao; Ou, Xunyan; Xu, Zhen; Ai, Liping; Sun, Lisha; Liu, Caigang
2017-09-22
The current management of the axilla in level 1 node-positive breast cancer patients is axillary lymph node dissection regardless of the status of the level 2 axillary lymph nodes. The goal of this study was to develop a nomogram predicting the probability of level 2 axillary lymph node metastasis (L-2-ALNM) in patients with level 1 axillary node-positive breast cancer. We reviewed the records of 974 patients with pathology-confirmed level 1 node-positive breast cancer between 2010 and 2014 at the Liaoning Cancer Hospital and Institute. The patients were randomized 1:1 and divided into a modeling group and a validation group. Clinical and pathological features of the patients were assessed with uni- and multivariate logistic regression. A nomogram based on independent predictors for the L-2-ALNM identified by multivariate logistic regression was constructed. Independent predictors of L-2-ALNM by the multivariate logistic regression analysis included tumor size, Ki-67 status, histological grade, and number of positive level 1 axillary lymph nodes. The areas under the receiver operating characteristic curve of the modeling set and the validation set were 0.828 and 0.816, respectively. The false-negative rates of the L-2-ALNM nomogram were 1.82% and 7.41% for the predicted probability cut-off points of < 6% and < 10%, respectively, when applied to the validation group. Our nomogram could help predict L-2-ALNM in patients with level 1 axillary lymph node metastasis. Patients with a low probability of L-2-ALNM could be spared level 2 axillary lymph node dissection, thereby reducing postoperative morbidity.
Standards for Standardized Logistic Regression Coefficients
ERIC Educational Resources Information Center
Menard, Scott
2011-01-01
Standardized coefficients in logistic regression analysis have the same utility as standardized coefficients in linear regression analysis. Although there has been no consensus on the best way to construct standardized logistic regression coefficients, there is now sufficient evidence to suggest a single best approach to the construction of a…
Schörgendorfer, Angela; Branscum, Adam J; Hanson, Timothy E
2013-06-01
Logistic regression is a popular tool for risk analysis in medical and population health science. With continuous response data, it is common to create a dichotomous outcome for logistic regression analysis by specifying a threshold for positivity. Fitting a linear regression to the nondichotomized response variable assuming a logistic sampling model for the data has been empirically shown to yield more efficient estimates of odds ratios than ordinary logistic regression of the dichotomized endpoint. We illustrate that risk inference is not robust to departures from the parametric logistic distribution. Moreover, the model assumption of proportional odds is generally not satisfied when the condition of a logistic distribution for the data is violated, leading to biased inference from a parametric logistic analysis. We develop novel Bayesian semiparametric methodology for testing goodness of fit of parametric logistic regression with continuous measurement data. The testing procedures hold for any cutoff threshold and our approach simultaneously provides the ability to perform semiparametric risk estimation. Bayes factors are calculated using the Savage-Dickey ratio for testing the null hypothesis of logistic regression versus a semiparametric generalization. We propose a fully Bayesian and a computationally efficient empirical Bayesian approach to testing, and we present methods for semiparametric estimation of risks, relative risks, and odds ratios when parametric logistic regression fails. Theoretical results establish the consistency of the empirical Bayes test. Results from simulated data show that the proposed approach provides accurate inference irrespective of whether parametric assumptions hold or not. Evaluation of risk factors for obesity shows that different inferences are derived from an analysis of a real data set when deviations from a logistic distribution are permissible in a flexible semiparametric framework. © 2013, The International Biometric Society.
Westreich, Daniel; Lessler, Justin; Funk, Michele Jonsson
2010-01-01
Summary Objective Propensity scores for the analysis of observational data are typically estimated using logistic regression. Our objective in this Review was to assess machine learning alternatives to logistic regression which may accomplish the same goals but with fewer assumptions or greater accuracy. Study Design and Setting We identified alternative methods for propensity score estimation and/or classification from the public health, biostatistics, discrete mathematics, and computer science literature, and evaluated these algorithms for applicability to the problem of propensity score estimation, potential advantages over logistic regression, and ease of use. Results We identified four techniques as alternatives to logistic regression: neural networks, support vector machines, decision trees (CART), and meta-classifiers (in particular, boosting). Conclusion While the assumptions of logistic regression are well understood, those assumptions are frequently ignored. All four alternatives have advantages and disadvantages compared with logistic regression. Boosting (meta-classifiers) and to a lesser extent decision trees (particularly CART) appear to be most promising for use in the context of propensity score analysis, but extensive simulation studies are needed to establish their utility in practice. PMID:20630332
Robust mislabel logistic regression without modeling mislabel probabilities.
Hung, Hung; Jou, Zhi-Yu; Huang, Su-Yun
2018-03-01
Logistic regression is among the most widely used statistical methods for linear discriminant analysis. In many applications, we only observe possibly mislabeled responses. Fitting a conventional logistic regression can then lead to biased estimation. One common resolution is to fit a mislabel logistic regression model, which takes into consideration of mislabeled responses. Another common method is to adopt a robust M-estimation by down-weighting suspected instances. In this work, we propose a new robust mislabel logistic regression based on γ-divergence. Our proposal possesses two advantageous features: (1) It does not need to model the mislabel probabilities. (2) The minimum γ-divergence estimation leads to a weighted estimating equation without the need to include any bias correction term, that is, it is automatically bias-corrected. These features make the proposed γ-logistic regression more robust in model fitting and more intuitive for model interpretation through a simple weighting scheme. Our method is also easy to implement, and two types of algorithms are included. Simulation studies and the Pima data application are presented to demonstrate the performance of γ-logistic regression. © 2017, The International Biometric Society.
Fungible weights in logistic regression.
Jones, Jeff A; Waller, Niels G
2016-06-01
In this article we develop methods for assessing parameter sensitivity in logistic regression models. To set the stage for this work, we first review Waller's (2008) equations for computing fungible weights in linear regression. Next, we describe 2 methods for computing fungible weights in logistic regression. To demonstrate the utility of these methods, we compute fungible logistic regression weights using data from the Centers for Disease Control and Prevention's (2010) Youth Risk Behavior Surveillance Survey, and we illustrate how these alternate weights can be used to evaluate parameter sensitivity. To make our work accessible to the research community, we provide R code (R Core Team, 2015) that will generate both kinds of fungible logistic regression weights. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Westreich, Daniel; Lessler, Justin; Funk, Michele Jonsson
2010-08-01
Propensity scores for the analysis of observational data are typically estimated using logistic regression. Our objective in this review was to assess machine learning alternatives to logistic regression, which may accomplish the same goals but with fewer assumptions or greater accuracy. We identified alternative methods for propensity score estimation and/or classification from the public health, biostatistics, discrete mathematics, and computer science literature, and evaluated these algorithms for applicability to the problem of propensity score estimation, potential advantages over logistic regression, and ease of use. We identified four techniques as alternatives to logistic regression: neural networks, support vector machines, decision trees (classification and regression trees [CART]), and meta-classifiers (in particular, boosting). Although the assumptions of logistic regression are well understood, those assumptions are frequently ignored. All four alternatives have advantages and disadvantages compared with logistic regression. Boosting (meta-classifiers) and, to a lesser extent, decision trees (particularly CART), appear to be most promising for use in the context of propensity score analysis, but extensive simulation studies are needed to establish their utility in practice. Copyright (c) 2010 Elsevier Inc. All rights reserved.
Addressing data privacy in matched studies via virtual pooling.
Saha-Chaudhuri, P; Weinberg, C R
2017-09-07
Data confidentiality and shared use of research data are two desirable but sometimes conflicting goals in research with multi-center studies and distributed data. While ideal for straightforward analysis, confidentiality restrictions forbid creation of a single dataset that includes covariate information of all participants. Current approaches such as aggregate data sharing, distributed regression, meta-analysis and score-based methods can have important limitations. We propose a novel application of an existing epidemiologic tool, specimen pooling, to enable confidentiality-preserving analysis of data arising from a matched case-control, multi-center design. Instead of pooling specimens prior to assay, we apply the methodology to virtually pool (aggregate) covariates within nodes. Such virtual pooling retains most of the information used in an analysis with individual data and since individual participant data is not shared externally, within-node virtual pooling preserves data confidentiality. We show that aggregated covariate levels can be used in a conditional logistic regression model to estimate individual-level odds ratios of interest. The parameter estimates from the standard conditional logistic regression are compared to the estimates based on a conditional logistic regression model with aggregated data. The parameter estimates are shown to be similar to those without pooling and to have comparable standard errors and confidence interval coverage. Virtual data pooling can be used to maintain confidentiality of data from multi-center study and can be particularly useful in research with large-scale distributed data.
Should metacognition be measured by logistic regression?
Rausch, Manuel; Zehetleitner, Michael
2017-03-01
Are logistic regression slopes suitable to quantify metacognitive sensitivity, i.e. the efficiency with which subjective reports differentiate between correct and incorrect task responses? We analytically show that logistic regression slopes are independent from rating criteria in one specific model of metacognition, which assumes (i) that rating decisions are based on sensory evidence generated independently of the sensory evidence used for primary task responses and (ii) that the distributions of evidence are logistic. Given a hierarchical model of metacognition, logistic regression slopes depend on rating criteria. According to all considered models, regression slopes depend on the primary task criterion. A reanalysis of previous data revealed that massive numbers of trials are required to distinguish between hierarchical and independent models with tolerable accuracy. It is argued that researchers who wish to use logistic regression as measure of metacognitive sensitivity need to control the primary task criterion and rating criteria. Copyright © 2017 Elsevier Inc. All rights reserved.
London Measure of Unplanned Pregnancy: guidance for its use as an outcome measure
Hall, Jennifer A; Barrett, Geraldine; Copas, Andrew; Stephenson, Judith
2017-01-01
Background The London Measure of Unplanned Pregnancy (LMUP) is a psychometrically validated measure of the degree of intention of a current or recent pregnancy. The LMUP is increasingly being used worldwide, and can be used to evaluate family planning or preconception care programs. However, beyond recommending the use of the full LMUP scale, there is no published guidance on how to use the LMUP as an outcome measure. Ordinal logistic regression has been recommended informally, but studies published to date have all used binary logistic regression and dichotomized the scale at different cut points. There is thus a need for evidence-based guidance to provide a standardized methodology for multivariate analysis and to enable comparison of results. This paper makes recommendations for the regression method for analysis of the LMUP as an outcome measure. Materials and methods Data collected from 4,244 pregnant women in Malawi were used to compare five regression methods: linear, logistic with two cut points, and ordinal logistic with either the full or grouped LMUP score. The recommendations were then tested on the original UK LMUP data. Results There were small but no important differences in the findings across the regression models. Logistic regression resulted in the largest loss of information, and assumptions were violated for the linear and ordinal logistic regression. Consequently, robust standard errors were used for linear regression and a partial proportional odds ordinal logistic regression model attempted. The latter could only be fitted for grouped LMUP score. Conclusion We recommend the linear regression model with robust standard errors to make full use of the LMUP score when analyzed as an outcome measure. Ordinal logistic regression could be considered, but a partial proportional odds model with grouped LMUP score may be required. Logistic regression is the least-favored option, due to the loss of information. For logistic regression, the cut point for un/planned pregnancy should be between nine and ten. These recommendations will standardize the analysis of LMUP data and enhance comparability of results across studies. PMID:28435343
Logistic models--an odd(s) kind of regression.
Jupiter, Daniel C
2013-01-01
The logistic regression model bears some similarity to the multivariable linear regression with which we are familiar. However, the differences are great enough to warrant a discussion of the need for and interpretation of logistic regression. Copyright © 2013 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.
Dahlin, Johanna; Härkönen, Juho
2013-12-01
Multiple studies have found that women report being in worse health despite living longer. Gender gaps vary cross-nationally, but relatively little is known about the causes of comparative differences. Existing literature is inconclusive as to whether gender gaps in health are smaller in more gender equal societies. We analyze gender gaps in self-rated health (SRH) and limiting longstanding illness (LLI) with five waves of European Social Survey data for 191,104 respondents from 28 countries. We use means, odds ratios, logistic regressions, and multilevel random slopes logistic regressions. Gender gaps in subjective health vary visibly across Europe. In many countries (especially in Eastern and Southern Europe), women report distinctly worse health, while in others (such as Estonia, Finland, and Great Britain) there are small or no differences. Logistic regressions ran separately for each country revealed that individual-level socioeconomic and demographic variables explain a majority of these gaps in some countries, but contribute little to their understanding in most countries. In yet other countries, men had worse health when these variables were controlled for. Cross-national variation in the gender gaps exists after accounting for individual-level factors. Against expectations, the remaining gaps are not systematically related to societal-level gender inequality in the multilevel analyses. Our findings stress persistent cross-national variability in gender gaps in health and call for further analysis. Copyright © 2013 Elsevier Ltd. All rights reserved.
Liu, Chaoqun; Zhong, Chunrong; Zhou, Xuezhen; Chen, Renjuan; Wu, Jiangyue; Wang, Weiye; Li, Xiating; Ding, Huisi; Guo, Yanfang; Gao, Qin; Hu, Xingwen; Xiong, Guoping; Yang, Xuefeng; Hao, Liping; Xiao, Mei; Yang, Nianhong
2017-01-01
Bilirubin concentrations have been recently reported to be negatively associated with type 2 diabetes mellitus. We examined the association between bilirubin concentrations and gestational diabetes mellitus. In a prospective cohort study, 2969 pregnant women were recruited prior to 16 weeks of gestation and were followed up until delivery. The value of bilirubin was tested and oral glucose tolerance test was conducted to screen gestational diabetes mellitus. The relationship between serum bilirubin concentration and gestational weeks was studied by two-piecewise linear regression. A subsample of 1135 participants with serum bilirubin test during 16-18 weeks gestation was conducted to research the association between serum bilirubin levels and risk of gestational diabetes mellitus by logistic regression. Gestational diabetes mellitus developed in 8.5 % of the participants (223 of 2969). Two-piecewise linear regression analyses demonstrated that the levels of bilirubin decreased with gestational week up to the turning point 23 and after that point, levels of bilirubin were increased slightly. In multiple logistic regression analysis, the relative risk of developing gestational diabetes mellitus was lower in the highest tertile of direct bilirubin than that in the lowest tertile (RR 0.60; 95 % CI, 0.35-0.89). The results suggested that women with higher serum direct bilirubin levels during the second trimester of pregnancy have lower risk for development of gestational diabetes mellitus.
Reboussin, Beth A; Preisser, John S; Song, Eun-Young; Wolfson, Mark
2012-07-01
Under-age drinking is an enormous public health issue in the USA. Evidence that community level structures may impact on under-age drinking has led to a proliferation of efforts to change the environment surrounding the use of alcohol. Although the focus of these efforts is to reduce drinking by individual youths, environmental interventions are typically implemented at the community level with entire communities randomized to the same intervention condition. A distinct feature of these trials is the tendency of the behaviours of individuals residing in the same community to be more alike than that of others residing in different communities, which is herein called 'clustering'. Statistical analyses and sample size calculations must account for this clustering to avoid type I errors and to ensure an appropriately powered trial. Clustering itself may also be of scientific interest. We consider the alternating logistic regressions procedure within the population-averaged modelling framework to estimate the effect of a law enforcement intervention on the prevalence of under-age drinking behaviours while modelling the clustering at multiple levels, e.g. within communities and within neighbourhoods nested within communities, by using pairwise odds ratios. We then derive sample size formulae for estimating intervention effects when planning a post-test-only or repeated cross-sectional community-randomized trial using the alternating logistic regressions procedure.
Bidirectional relationship between renal function and periodontal disease in older Japanese women.
Yoshihara, Akihiro; Iwasaki, Masanori; Miyazaki, Hideo; Nakamura, Kazutoshi
2016-09-01
The purpose of this study was to evaluate the reciprocal effects of chronic kidney disease (CKD) and periodontal disease. A total of 332 postmenopausal never smoking women were enrolled, and their serum high-sensitivity C-reactive protein, serum osteocalcin and serum cystatin C levels were measured. Poor renal function was defined as serum cystatin C > 0.91 mg/l. Periodontal disease markers, including clinical attachment level and the periodontal inflamed surface area (PISA), were also evaluated. Logistic regression analysis was conducted to evaluate the relationships between renal function and periodontal disease markers, serum osteocalcin level and hsCRP level. The prevalence-rate ratios (PRRs) on multiple Poisson regression analyses were determined to evaluate the relationships between periodontal disease markers and serum osteocalcin, serum cystatin C and serum hsCRP levels. On logistic regression analysis, PISA was significantly associated with serum cystatin C level. The odds ratio for serum cystatin C level was 2.44 (p = 0.011). The PRR between serum cystatin C level and periodontal disease markers such as number of sites with clinical attachment level ≥6 mm was significantly positive (3.12, p < 0.001). Similar tendencies were shown for serum osteocalcin level. This study suggests that CKD and periodontal disease can have reciprocal effects. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hu, H.; Kim, Rokho; Korrick, S.
1996-12-31
In an earlier report based on participants in the Veterans Administration Normative Aging Study, we found a significant association between the risk of hypertension and lead levels in tibia. To examine the possible confounding effects of education and occupation, we considered in this study five levels of education and three levels of occupation as independent variables in the statistical model. Of 1,171 active subjects seen between August 1991 and December 1994, 563 provided complete data for this analysis. In the initial logistic regression model, acre and body mass index, family history of hypertension, and dietary sodium intake, but neither cumulativemore » smoking nor alcohol ingestion, conferred increased odds ratios for being hypertensive that were statistically significant. When the lead biomarkers were added separately to this initial logistic model, tibia lead and patella lead levels were associated with significantly elevated odds ratios for hypertension. In the final backward elimination logistic regression model that included categorical variables for education and occupation, the only variables retained were body mass index, family history of hypertension, and tibia lead level. We conclude that education and occupation variables were not confounding the association between the lead biomarkers and hypertension that we reported previously. 27 refs., 3 tabs.« less
Xu, Jun-Fang; Xu, Jing; Li, Shi-Zhu; Jia, Tia-Wu; Huang, Xi-Bao; Zhang, Hua-Ming; Chen, Mei; Yang, Guo-Jing; Gao, Shu-Jing; Wang, Qing-Yun; Zhou, Xiao-Nong
2013-01-01
Background The transmission of schistosomiasis japonica in a local setting is still poorly understood in the lake regions of the People's Republic of China (P. R. China), and its transmission patterns are closely related to human, social and economic factors. Methodology/Principal Findings We aimed to apply the integrated approach of artificial neural network (ANN) and logistic regression model in assessment of transmission risks of Schistosoma japonicum with epidemiological data collected from 2339 villagers from 1247 households in six villages of Jiangling County, P.R. China. By using the back-propagation (BP) of the ANN model, 16 factors out of 27 factors were screened, and the top five factors ranked by the absolute value of mean impact value (MIV) were mainly related to human behavior, i.e. integration of water contact history and infection history, family with past infection, history of water contact, infection history, and infection times. The top five factors screened by the logistic regression model were mainly related to the social economics, i.e. village level, economic conditions of family, age group, education level, and infection times. The risk of human infection with S. japonicum is higher in the population who are at age 15 or younger, or with lower education, or with the higher infection rate of the village, or with poor family, and in the population with more than one time to be infected. Conclusion/Significance Both BP artificial neural network and logistic regression model established in a small scale suggested that individual behavior and socioeconomic status are the most important risk factors in the transmission of schistosomiasis japonica. It was reviewed that the young population (≤15) in higher-risk areas was the main target to be intervened for the disease transmission control. PMID:23556015
Black, L E; Brion, G M; Freitas, S J
2007-06-01
Predicting the presence of enteric viruses in surface waters is a complex modeling problem. Multiple water quality parameters that indicate the presence of human fecal material, the load of fecal material, and the amount of time fecal material has been in the environment are needed. This paper presents the results of a multiyear study of raw-water quality at the inlet of a potable-water plant that related 17 physical, chemical, and biological indices to the presence of enteric viruses as indicated by cytopathic changes in cell cultures. It was found that several simple, multivariate logistic regression models that could reliably identify observations of the presence or absence of total culturable virus could be fitted. The best models developed combined a fecal age indicator (the atypical coliform [AC]/total coliform [TC] ratio), the detectable presence of a human-associated sterol (epicoprostanol) to indicate the fecal source, and one of several fecal load indicators (the levels of Giardia species cysts, coliform bacteria, and coprostanol). The best fit to the data was found when the AC/TC ratio, the presence of epicoprostanol, and the density of fecal coliform bacteria were input into a simple, multivariate logistic regression equation, resulting in 84.5% and 78.6% accuracies for the identification of the presence and absence of total culturable virus, respectively. The AC/TC ratio was the most influential input variable in all of the models generated, but producing the best prediction required additional input related to the fecal source and the fecal load. The potential for replacing microbial indicators of fecal load with levels of coprostanol was proposed and evaluated by multivariate logistic regression modeling for the presence and absence of virus.
Radiomorphometric analysis of frontal sinus for sex determination.
Verma, Saumya; Mahima, V G; Patil, Karthikeya
2014-09-01
Sex determination of unknown individuals carries crucial significance in forensic research, in cases where fragments of skull persist with no likelihood of identification based on dental arch. In these instances sex determination becomes important to rule out certain number of possibilities instantly and helps in establishing a biological profile of human remains. The aim of the study is to evaluate a mathematical method based on logistic regression analysis capable of ascertaining the sex of individuals in the South Indian population. The study was conducted in the department of Oral Medicine and Radiology. The right and left areas, maximum height, width of frontal sinus were determined in 100 Caldwell views of 50 women and 50 men aged 20 years and above, with the help of Vernier callipers and a square grid with 1 square measuring 1mm(2) in area. Student's t-test, logistic regression analysis. The mean values of variables were greater in men, based on Student's t-test at 5% level of significance. The mathematical model based on logistic regression analysis gave percentage agreement of total area to correctly predict the female gender as 55.2%, of right area as 60.9% and of left area as 55.2%. The areas of the frontal sinus and the logistic regression proved to be unreliable in sex determination. (Logit = 0.924 - 0.00217 × right area).
The purpose of this report is to provide a reference manual that could be used by investigators for making informed use of logistic regression using two methods (standard logistic regression and MARS). The details for analyses of relationships between a dependent binary response ...
Predicting U.S. Army Reserve Unit Manning Using Market Demographics
2015-06-01
develops linear regression , classification tree, and logistic regression models to determine the ability of the location to support manning requirements... logistic regression model delivers predictive results that allow decision-makers to identify locations with a high probability of meeting unit...manning requirements. The recommendation of this thesis is that the USAR implement the logistic regression model. 14. SUBJECT TERMS U.S
ERIC Educational Resources Information Center
Chen, Chau-Kuang
2005-01-01
Logistic and Cox regression methods are practical tools used to model the relationships between certain student learning outcomes and their relevant explanatory variables. The logistic regression model fits an S-shaped curve into a binary outcome with data points of zero and one. The Cox regression model allows investigators to study the duration…
Yusuf, O B; Bamgboye, E A; Afolabi, R F; Shodimu, M A
2014-09-01
Logistic regression model is widely used in health research for description and predictive purposes. Unfortunately, most researchers are sometimes not aware that the underlying principles of the techniques have failed when the algorithm for maximum likelihood does not converge. Young researchers particularly postgraduate students may not know why separation problem whether quasi or complete occurs, how to identify it and how to fix it. This study was designed to critically evaluate convergence issues in articles that employed logistic regression analysis published in an African Journal of Medicine and medical sciences between 2004 and 2013. Problems of quasi or complete separation were described and were illustrated with the National Demographic and Health Survey dataset. A critical evaluation of articles that employed logistic regression was conducted. A total of 581 articles was reviewed, of which 40 (6.9%) used binary logistic regression. Twenty-four (60.0%) stated the use of logistic regression model in the methodology while none of the articles assessed model fit. Only 3 (12.5%) properly described the procedures. Of the 40 that used the logistic regression model, the problem of convergence occurred in 6 (15.0%) of the articles. Logistic regression tends to be poorly reported in studies published between 2004 and 2013. Our findings showed that the procedure may not be well understood by researchers since very few described the process in their reports and may be totally unaware of the problem of convergence or how to deal with it.
Logistic Regression: Concept and Application
ERIC Educational Resources Information Center
Cokluk, Omay
2010-01-01
The main focus of logistic regression analysis is classification of individuals in different groups. The aim of the present study is to explain basic concepts and processes of binary logistic regression analysis intended to determine the combination of independent variables which best explain the membership in certain groups called dichotomous…
NASA Astrophysics Data System (ADS)
Pradhan, Biswajeet
2010-05-01
This paper presents the results of the cross-validation of a multivariate logistic regression model using remote sensing data and GIS for landslide hazard analysis on the Penang, Cameron, and Selangor areas in Malaysia. Landslide locations in the study areas were identified by interpreting aerial photographs and satellite images, supported by field surveys. SPOT 5 and Landsat TM satellite imagery were used to map landcover and vegetation index, respectively. Maps of topography, soil type, lineaments and land cover were constructed from the spatial datasets. Ten factors which influence landslide occurrence, i.e., slope, aspect, curvature, distance from drainage, lithology, distance from lineaments, soil type, landcover, rainfall precipitation, and normalized difference vegetation index (ndvi), were extracted from the spatial database and the logistic regression coefficient of each factor was computed. Then the landslide hazard was analysed using the multivariate logistic regression coefficients derived not only from the data for the respective area but also using the logistic regression coefficients calculated from each of the other two areas (nine hazard maps in all) as a cross-validation of the model. For verification of the model, the results of the analyses were then compared with the field-verified landslide locations. Among the three cases of the application of logistic regression coefficient in the same study area, the case of Selangor based on the Selangor logistic regression coefficients showed the highest accuracy (94%), where as Penang based on the Penang coefficients showed the lowest accuracy (86%). Similarly, among the six cases from the cross application of logistic regression coefficient in other two areas, the case of Selangor based on logistic coefficient of Cameron showed highest (90%) prediction accuracy where as the case of Penang based on the Selangor logistic regression coefficients showed the lowest accuracy (79%). Qualitatively, the cross application model yields reasonable results which can be used for preliminary landslide hazard mapping.
Avalos, Marta; Adroher, Nuria Duran; Lagarde, Emmanuel; Thiessard, Frantz; Grandvalet, Yves; Contrand, Benjamin; Orriols, Ludivine
2012-09-01
Large data sets with many variables provide particular challenges when constructing analytic models. Lasso-related methods provide a useful tool, although one that remains unfamiliar to most epidemiologists. We illustrate the application of lasso methods in an analysis of the impact of prescribed drugs on the risk of a road traffic crash, using a large French nationwide database (PLoS Med 2010;7:e1000366). In the original case-control study, the authors analyzed each exposure separately. We use the lasso method, which can simultaneously perform estimation and variable selection in a single model. We compare point estimates and confidence intervals using (1) a separate logistic regression model for each drug with a Bonferroni correction and (2) lasso shrinkage logistic regression analysis. Shrinkage regression had little effect on (bias corrected) point estimates, but led to less conservative results, noticeably for drugs with moderate levels of exposure. Carbamates, carboxamide derivative and fatty acid derivative antiepileptics, drugs used in opioid dependence, and mineral supplements of potassium showed stronger associations. Lasso is a relevant method in the analysis of databases with large number of exposures and can be recommended as an alternative to conventional strategies.
Individual relocation decisions after tornadoes: a multi-level analysis.
Cong, Zhen; Nejat, Ali; Liang, Daan; Pei, Yaolin; Javid, Roxana J
2018-04-01
This study examines how multi-level factors affected individuals' relocation decisions after EF4 and EF5 (Enhanced Fujita Tornado Intensity Scale) tornadoes struck the United States in 2013. A telephone survey was conducted with 536 respondents, including oversampled older adults, one year after these two disaster events. Respondents' addresses were used to associate individual information with block group-level variables recorded by the American Community Survey. Logistic regression revealed that residential damage and homeownership are important predictors of relocation. There was also significant interaction between these two variables, indicating less difference between homeowners and renters at higher damage levels. Homeownership diminished the likelihood of relocation among younger respondents. Random effects logistic regression found that the percentage of homeownership and of higher income households in the community buffered the effect of damage on relocation; the percentage of older adults reduced the likelihood of this group relocating. The findings are assessed from the standpoint of age difference, policy implications, and social capital and vulnerability. © 2018 The Author(s). Disasters © Overseas Development Institute, 2018.
Data mining: Potential applications in research on nutrition and health.
Batterham, Marijka; Neale, Elizabeth; Martin, Allison; Tapsell, Linda
2017-02-01
Data mining enables further insights from nutrition-related research, but caution is required. The aim of this analysis was to demonstrate and compare the utility of data mining methods in classifying a categorical outcome derived from a nutrition-related intervention. Baseline data (23 variables, 8 categorical) on participants (n = 295) in an intervention trial were used to classify participants in terms of meeting the criteria of achieving 10 000 steps per day. Results from classification and regression trees (CARTs), random forests, adaptive boosting, logistic regression, support vector machines and neural networks were compared using area under the curve (AUC) and error assessments. The CART produced the best model when considering the AUC (0.703), overall error (18%) and within class error (28%). Logistic regression also performed reasonably well compared to the other models (AUC 0.675, overall error 23%, within class error 36%). All the methods gave different rankings of variables' importance. CART found that body fat, quality of life using the SF-12 Physical Component Summary (PCS) and the cholesterol: HDL ratio were the most important predictors of meeting the 10 000 steps criteria, while logistic regression showed the SF-12PCS, glucose levels and level of education to be the most significant predictors (P ≤ 0.01). Differing outcomes suggest caution is required with a single data mining method, particularly in a dataset with nonlinear relationships and outliers and when exploring relationships that were not the primary outcomes of the research. © 2017 Dietitians Association of Australia.
An Entropy-Based Measure for Assessing Fuzziness in Logistic Regression
Weiss, Brandi A.; Dardick, William
2015-01-01
This article introduces an entropy-based measure of data–model fit that can be used to assess the quality of logistic regression models. Entropy has previously been used in mixture-modeling to quantify how well individuals are classified into latent classes. The current study proposes the use of entropy for logistic regression models to quantify the quality of classification and separation of group membership. Entropy complements preexisting measures of data–model fit and provides unique information not contained in other measures. Hypothetical data scenarios, an applied example, and Monte Carlo simulation results are used to demonstrate the application of entropy in logistic regression. Entropy should be used in conjunction with other measures of data–model fit to assess how well logistic regression models classify cases into observed categories. PMID:29795897
Logistic regression applied to natural hazards: rare event logistic regression with replications
NASA Astrophysics Data System (ADS)
Guns, M.; Vanacker, V.
2012-06-01
Statistical analysis of natural hazards needs particular attention, as most of these phenomena are rare events. This study shows that the ordinary rare event logistic regression, as it is now commonly used in geomorphologic studies, does not always lead to a robust detection of controlling factors, as the results can be strongly sample-dependent. In this paper, we introduce some concepts of Monte Carlo simulations in rare event logistic regression. This technique, so-called rare event logistic regression with replications, combines the strength of probabilistic and statistical methods, and allows overcoming some of the limitations of previous developments through robust variable selection. This technique was here developed for the analyses of landslide controlling factors, but the concept is widely applicable for statistical analyses of natural hazards.
Large unbalanced credit scoring using Lasso-logistic regression ensemble.
Wang, Hong; Xu, Qingsong; Zhou, Lifeng
2015-01-01
Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plausibility of ensemble learning using regularized logistic regression as the base classifier to deal with credit scoring problems. In this research, the data is first balanced and diversified by clustering and bagging algorithms. Then we apply a Lasso-logistic regression learning ensemble to evaluate the credit risks. We show that the proposed algorithm outperforms popular credit scoring models such as decision tree, Lasso-logistic regression and random forests in terms of AUC and F-measure. We also provide two importance measures for the proposed model to identify important variables in the data.
An Entropy-Based Measure for Assessing Fuzziness in Logistic Regression.
Weiss, Brandi A; Dardick, William
2016-12-01
This article introduces an entropy-based measure of data-model fit that can be used to assess the quality of logistic regression models. Entropy has previously been used in mixture-modeling to quantify how well individuals are classified into latent classes. The current study proposes the use of entropy for logistic regression models to quantify the quality of classification and separation of group membership. Entropy complements preexisting measures of data-model fit and provides unique information not contained in other measures. Hypothetical data scenarios, an applied example, and Monte Carlo simulation results are used to demonstrate the application of entropy in logistic regression. Entropy should be used in conjunction with other measures of data-model fit to assess how well logistic regression models classify cases into observed categories.
Arevalillo, Jorge M; Sztein, Marcelo B; Kotloff, Karen L; Levine, Myron M; Simon, Jakub K
2017-10-01
Immunologic correlates of protection are important in vaccine development because they give insight into mechanisms of protection, assist in the identification of promising vaccine candidates, and serve as endpoints in bridging clinical vaccine studies. Our goal is the development of a methodology to identify immunologic correlates of protection using the Shigella challenge as a model. The proposed methodology utilizes the Random Forests (RF) machine learning algorithm as well as Classification and Regression Trees (CART) to detect immune markers that predict protection, identify interactions between variables, and define optimal cutoffs. Logistic regression modeling is applied to estimate the probability of protection and the confidence interval (CI) for such a probability is computed by bootstrapping the logistic regression models. The results demonstrate that the combination of Classification and Regression Trees and Random Forests complements the standard logistic regression and uncovers subtle immune interactions. Specific levels of immunoglobulin IgG antibody in blood on the day of challenge predicted protection in 75% (95% CI 67-86). Of those subjects that did not have blood IgG at or above a defined threshold, 100% were protected if they had IgA antibody secreting cells above a defined threshold. Comparison with the results obtained by applying only logistic regression modeling with standard Akaike Information Criterion for model selection shows the usefulness of the proposed method. Given the complexity of the immune system, the use of machine learning methods may enhance traditional statistical approaches. When applied together, they offer a novel way to quantify important immune correlates of protection that may help the development of vaccines. Copyright © 2017 Elsevier Inc. All rights reserved.
Exploring students' patterns of reasoning
NASA Astrophysics Data System (ADS)
Matloob Haghanikar, Mojgan
As part of a collaborative study of the science preparation of elementary school teachers, we investigated the quality of students' reasoning and explored the relationship between sophistication of reasoning and the degree to which the courses were considered inquiry oriented. To probe students' reasoning, we developed open-ended written content questions with the distinguishing feature of applying recently learned concepts in a new context. We devised a protocol for developing written content questions that provided a common structure for probing and classifying students' sophistication level of reasoning. In designing our protocol, we considered several distinct criteria, and classified students' responses based on their performance for each criterion. First, we classified concepts into three types: Descriptive, Hypothetical, and Theoretical and categorized the abstraction levels of the responses in terms of the types of concepts and the inter-relationship between the concepts. Second, we devised a rubric based on Bloom's revised taxonomy with seven traits (both knowledge types and cognitive processes) and a defined set of criteria to evaluate each trait. Along with analyzing students' reasoning, we visited universities and observed the courses in which the students were enrolled. We used the Reformed Teaching Observation Protocol (RTOP) to rank the courses with respect to characteristics that are valued for the inquiry courses. We conducted logistic regression for a sample of 18courses with about 900 students and reported the results for performing logistic regression to estimate the relationship between traits of reasoning and RTOP score. In addition, we analyzed conceptual structure of students' responses, based on conceptual classification schemes, and clustered students' responses into six categories. We derived regression model, to estimate the relationship between the sophistication of the categories of conceptual structure and RTOP scores. However, the outcome variable with six categories required a more complicated regression model, known as multinomial logistic regression, generalized from binary logistic regression. With the large amount of collected data, we found that the likelihood of the higher cognitive processes were in favor of classes with higher measures on inquiry. However, the usage of more abstract concepts with higher order conceptual structures was less prevalent in higher RTOP courses.
Real, J; Cleries, R; Forné, C; Roso-Llorach, A; Martínez-Sánchez, J M
In medicine and biomedical research, statistical techniques like logistic, linear, Cox and Poisson regression are widely known. The main objective is to describe the evolution of multivariate techniques used in observational studies indexed in PubMed (1970-2013), and to check the requirements of the STROBE guidelines in the author guidelines in Spanish journals indexed in PubMed. A targeted PubMed search was performed to identify papers that used logistic linear Cox and Poisson models. Furthermore, a review was also made of the author guidelines of journals published in Spain and indexed in PubMed and Web of Science. Only 6.1% of the indexed manuscripts included a term related to multivariate analysis, increasing from 0.14% in 1980 to 12.3% in 2013. In 2013, 6.7, 2.5, 3.5, and 0.31% of the manuscripts contained terms related to logistic, linear, Cox and Poisson regression, respectively. On the other hand, 12.8% of journals author guidelines explicitly recommend to follow the STROBE guidelines, and 35.9% recommend the CONSORT guideline. A low percentage of Spanish scientific journals indexed in PubMed include the STROBE statement requirement in the author guidelines. Multivariate regression models in published observational studies such as logistic regression, linear, Cox and Poisson are increasingly used both at international level, as well as in journals published in Spanish. Copyright © 2015 Sociedad Española de Médicos de Atención Primaria (SEMERGEN). Publicado por Elsevier España, S.L.U. All rights reserved.
Prediction of siRNA potency using sparse logistic regression.
Hu, Wei; Hu, John
2014-06-01
RNA interference (RNAi) can modulate gene expression at post-transcriptional as well as transcriptional levels. Short interfering RNA (siRNA) serves as a trigger for the RNAi gene inhibition mechanism, and therefore is a crucial intermediate step in RNAi. There have been extensive studies to identify the sequence characteristics of potent siRNAs. One such study built a linear model using LASSO (Least Absolute Shrinkage and Selection Operator) to measure the contribution of each siRNA sequence feature. This model is simple and interpretable, but it requires a large number of nonzero weights. We have introduced a novel technique, sparse logistic regression, to build a linear model using single-position specific nucleotide compositions which has the same prediction accuracy of the linear model based on LASSO. The weights in our new model share the same general trend as those in the previous model, but have only 25 nonzero weights out of a total 84 weights, a 54% reduction compared to the previous model. Contrary to the linear model based on LASSO, our model suggests that only a few positions are influential on the efficacy of the siRNA, which are the 5' and 3' ends and the seed region of siRNA sequences. We also employed sparse logistic regression to build a linear model using dual-position specific nucleotide compositions, a task LASSO is not able to accomplish well due to its high dimensional nature. Our results demonstrate the superiority of sparse logistic regression as a technique for both feature selection and regression over LASSO in the context of siRNA design.
Power and Sample Size Calculations for Logistic Regression Tests for Differential Item Functioning
ERIC Educational Resources Information Center
Li, Zhushan
2014-01-01
Logistic regression is a popular method for detecting uniform and nonuniform differential item functioning (DIF) effects. Theoretical formulas for the power and sample size calculations are derived for likelihood ratio tests and Wald tests based on the asymptotic distribution of the maximum likelihood estimators for the logistic regression model.…
A Methodology for Generating Placement Rules that Utilizes Logistic Regression
ERIC Educational Resources Information Center
Wurtz, Keith
2008-01-01
The purpose of this article is to provide the necessary tools for institutional researchers to conduct a logistic regression analysis and interpret the results. Aspects of the logistic regression procedure that are necessary to evaluate models are presented and discussed with an emphasis on cutoff values and choosing the appropriate number of…
John Hogland; Nedret Billor; Nathaniel Anderson
2013-01-01
Discriminant analysis, referred to as maximum likelihood classification within popular remote sensing software packages, is a common supervised technique used by analysts. Polytomous logistic regression (PLR), also referred to as multinomial logistic regression, is an alternative classification approach that is less restrictive, more flexible, and easy to interpret. To...
Vázquez-Nava, Francisco; Treviño-Garcia-Manzo, Norberto; Vázquez-Rodríguez, Carlos F; Vázquez-Rodríguez, Eliza M
2013-01-01
To determine the association between family structure, maternal education level, and maternal employment with sedentary lifestyle in primary school-age children. Data were obtained from 897 children aged 6 to 12 years. A questionnaire was used to collect information. Body mass index (BMI) was determined using the age- and gender-specific Centers for Disease Control and Prevention definition. Children were categorized as: normal weight (5(th) percentile≤BMI<85(th) percentile), at risk for overweight (85(th)≤BMI<95(th) percentile), overweight (≥ 95(th) percentile). For the analysis, overweight was defined as BMI at or above the 85(th) percentile for each gender. Adjusted odds ratios (adjusted ORs) for physical inactivity were determined using a logistic regression model. The prevalence of overweight was 40.7%, and of sedentary lifestyle, 57.2%. The percentage of non-intact families was 23.5%. Approximately 48.7% of the mothers had a non-acceptable educational level, and 38.8% of the mothers worked outside of the home. The logistic regression model showed that living in a non-intact family household (adjusted OR=1.67; 95% CI=1.04-2.66) is associated with sedentary lifestyle in overweight children. In the group of normal weight children, logistic regression analysis show that living in a non-intact family, having a mother with a non-acceptable education level, and having a mother who works outside of the home were not associated with sedentary lifestyle. Living in a non-intact family, more than low maternal educational level and having a working mother, appears to be associated with sedentary lifestyle in overweight primary school-age children. Copyright © 2013 Sociedade Brasileira de Pediatria. Published by Elsevier Editora Ltda. All rights reserved.
Goldman, S A
1996-10-01
Neurotoxicity in relation to concomitant administration of lithium and neuroleptic drugs, particularly haloperidol, has been an ongoing issue. This study examined whether use of lithium with neuroleptic drugs enhances neurotoxicity leading to permanent sequelae. The Spontaneous Reporting System database of the United States Food and Drug Administration and extant literature were reviewed for spectrum cases of lithium/neuroleptic neurotoxicity. Groups taking lithium alone (Li), lithium/haloperidol (LiHal) and lithium/ nonhaloperidol neuroleptics (LiNeuro), each paired for recovery and sequelae, were established for 237 cases. Statistical analyses included pairwise comparisons of lithium levels using the Wilcoxon Rank Sum procedure and logistic regression to analyze the relationship between independent variables and development of sequelae. The Li and Li-Neuro groups showed significant statistical differences in median lithium levels between recovery and sequelae pairs, whereas the LiHal pair did not differ significantly. Lithium level was associated with sequelae development overall and within the Li and LiNeuro groups; no such association was evident in the LiHal group. On multivariable logistic regression analysis, lithium level and taking lithium/haloperidol were significant factors in the development of sequelae, with multiple possibly confounding factors (e.g., age, sex) not statistically significant. Multivariable logistic regression analyses with neuroleptic dose as five discrete dose ranges or actual dose did not show an association between development of sequelae and dose. Database limitations notwithstanding, the lack of apparent impact of serum lithium level on the development of sequelae in patients treated with haloperidol contrasts notably with results in the Li and LiNeuro groups. These findings may suggest a possible effect of pharmacodynamic factors in lithium/neuroleptic combination therapy.
Measuring Productivity of Depot-Level Aircraft Maintenance in the Air Force Logistics Command.
1985-09-01
of Figures...... . . . . . . . . . . . . vi List of Tables . . . . . . . . . ............ vii Abstract . . . ...................... viii I...59 6. DEA Efficiency Values (Third DEA Model) . .... 62 7. DMU 5 Input Efficiencies ................ 64 vi F "-’ List of Tables Table Page I. DEA...Regression Results for 20 Months . . . ..... 68 V. Regression Results for 7 Quarters . . ..... 70 VI . Coefficients of Correlation (Using Quarterly Data
Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble
Wang, Hong; Xu, Qingsong; Zhou, Lifeng
2015-01-01
Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plausibility of ensemble learning using regularized logistic regression as the base classifier to deal with credit scoring problems. In this research, the data is first balanced and diversified by clustering and bagging algorithms. Then we apply a Lasso-logistic regression learning ensemble to evaluate the credit risks. We show that the proposed algorithm outperforms popular credit scoring models such as decision tree, Lasso-logistic regression and random forests in terms of AUC and F-measure. We also provide two importance measures for the proposed model to identify important variables in the data. PMID:25706988
Design, innovation, and rural creative places: Are the arts the cherry on top, or the secret sauce?
Wojan, Timothy R; Nichols, Bonnie
2018-01-01
Creative class theory explains the positive relationship between the arts and commercial innovation as the mutual attraction of artists and other creative workers by an unobserved creative milieu. This study explores alternative theories for rural settings, by analyzing establishment-level survey data combined with data on the local arts scene. The study identifies the local contextual factors associated with a strong design orientation, and estimates the impact that a strong design orientation has on the local economy. Data on innovation and design come from a nationally representative sample of establishments in tradable industries. Latent class analysis allows identifying unobserved subpopulations comprised of establishments with different design and innovation orientations. Logistic regression allows estimating the association between an establishment's design orientation and local contextual factors. A quantile instrumental variable regression allows assessing the robustness of the logistic regression results with respect to endogeneity. An estimate of design orientation at the local level derived from the survey is used to examine variation in economic performance during the period of recovery from the Great Recession (2010-2014). Three distinct innovation (substantive, nominal, and non-innovators) and design orientations (design-integrated, "design last finish," and no systematic approach to design) are identified. Innovation- and design-intensive establishments were identified in both rural and urban areas. Rural design-integrated establishments tended to locate in counties with more highly educated workforces and containing at least one performing arts organization. A quantile instrumental variable regression confirmed that the logistic regression result is robust to endogeneity concerns. Finally, rural areas characterized by design-integrated establishments experienced faster growth in wages relative to rural areas characterized by establishments using no systematic approach to design.
Design, innovation, and rural creative places: Are the arts the cherry on top, or the secret sauce?
Nichols, Bonnie
2018-01-01
Objective Creative class theory explains the positive relationship between the arts and commercial innovation as the mutual attraction of artists and other creative workers by an unobserved creative milieu. This study explores alternative theories for rural settings, by analyzing establishment-level survey data combined with data on the local arts scene. The study identifies the local contextual factors associated with a strong design orientation, and estimates the impact that a strong design orientation has on the local economy. Method Data on innovation and design come from a nationally representative sample of establishments in tradable industries. Latent class analysis allows identifying unobserved subpopulations comprised of establishments with different design and innovation orientations. Logistic regression allows estimating the association between an establishment’s design orientation and local contextual factors. A quantile instrumental variable regression allows assessing the robustness of the logistic regression results with respect to endogeneity. An estimate of design orientation at the local level derived from the survey is used to examine variation in economic performance during the period of recovery from the Great Recession (2010–2014). Results Three distinct innovation (substantive, nominal, and non-innovators) and design orientations (design-integrated, “design last finish,” and no systematic approach to design) are identified. Innovation- and design-intensive establishments were identified in both rural and urban areas. Rural design-integrated establishments tended to locate in counties with more highly educated workforces and containing at least one performing arts organization. A quantile instrumental variable regression confirmed that the logistic regression result is robust to endogeneity concerns. Finally, rural areas characterized by design-integrated establishments experienced faster growth in wages relative to rural areas characterized by establishments using no systematic approach to design. PMID:29489884
New machine-learning algorithms for prediction of Parkinson's disease
NASA Astrophysics Data System (ADS)
Mandal, Indrajit; Sairam, N.
2014-03-01
This article presents an enhanced prediction accuracy of diagnosis of Parkinson's disease (PD) to prevent the delay and misdiagnosis of patients using the proposed robust inference system. New machine-learning methods are proposed and performance comparisons are based on specificity, sensitivity, accuracy and other measurable parameters. The robust methods of treating Parkinson's disease (PD) includes sparse multinomial logistic regression, rotation forest ensemble with support vector machines and principal components analysis, artificial neural networks, boosting methods. A new ensemble method comprising of the Bayesian network optimised by Tabu search algorithm as classifier and Haar wavelets as projection filter is used for relevant feature selection and ranking. The highest accuracy obtained by linear logistic regression and sparse multinomial logistic regression is 100% and sensitivity, specificity of 0.983 and 0.996, respectively. All the experiments are conducted over 95% and 99% confidence levels and establish the results with corrected t-tests. This work shows a high degree of advancement in software reliability and quality of the computer-aided diagnosis system and experimentally shows best results with supportive statistical inference.
Landslide Hazard Mapping in Rwanda Using Logistic Regression
NASA Astrophysics Data System (ADS)
Piller, A.; Anderson, E.; Ballard, H.
2015-12-01
Landslides in the United States cause more than $1 billion in damages and 50 deaths per year (USGS 2014). Globally, figures are much more grave, yet monitoring, mapping and forecasting of these hazards are less than adequate. Seventy-five percent of the population of Rwanda earns a living from farming, mostly subsistence. Loss of farmland, housing, or life, to landslides is a very real hazard. Landslides in Rwanda have an impact at the economic, social, and environmental level. In a developing nation that faces challenges in tracking, cataloging, and predicting the numerous landslides that occur each year, satellite imagery and spatial analysis allow for remote study. We have focused on the development of a landslide inventory and a statistical methodology for assessing landslide hazards. Using logistic regression on approximately 30 test variables (i.e. slope, soil type, land cover, etc.) and a sample of over 200 landslides, we determine which variables are statistically most relevant to landslide occurrence in Rwanda. A preliminary predictive hazard map for Rwanda has been produced, using the variables selected from the logistic regression analysis.
Evaluating penalized logistic regression models to predict Heat-Related Electric grid stress days
DOE Office of Scientific and Technical Information (OSTI.GOV)
Bramer, Lisa M.; Rounds, J.; Burleyson, C. D.
Understanding the conditions associated with stress on the electricity grid is important in the development of contingency plans for maintaining reliability during periods when the grid is stressed. In this paper, heat-related grid stress and the relationship with weather conditions were examined using data from the eastern United States. Penalized logistic regression models were developed and applied to predict stress on the electric grid using weather data. The inclusion of other weather variables, such as precipitation, in addition to temperature improved model performance. Several candidate models and combinations of predictive variables were examined. A penalized logistic regression model which wasmore » fit at the operation-zone level was found to provide predictive value and interpretability. Additionally, the importance of different weather variables observed at various time scales were examined. Maximum temperature and precipitation were identified as important across all zones while the importance of other weather variables was zone specific. In conclusion, the methods presented in this work are extensible to other regions and can be used to aid in planning and development of the electrical grid.« less
Evaluating penalized logistic regression models to predict Heat-Related Electric grid stress days
Bramer, Lisa M.; Rounds, J.; Burleyson, C. D.; ...
2017-09-22
Understanding the conditions associated with stress on the electricity grid is important in the development of contingency plans for maintaining reliability during periods when the grid is stressed. In this paper, heat-related grid stress and the relationship with weather conditions were examined using data from the eastern United States. Penalized logistic regression models were developed and applied to predict stress on the electric grid using weather data. The inclusion of other weather variables, such as precipitation, in addition to temperature improved model performance. Several candidate models and combinations of predictive variables were examined. A penalized logistic regression model which wasmore » fit at the operation-zone level was found to provide predictive value and interpretability. Additionally, the importance of different weather variables observed at various time scales were examined. Maximum temperature and precipitation were identified as important across all zones while the importance of other weather variables was zone specific. In conclusion, the methods presented in this work are extensible to other regions and can be used to aid in planning and development of the electrical grid.« less
An Entropy-Based Measure for Assessing Fuzziness in Logistic Regression
ERIC Educational Resources Information Center
Weiss, Brandi A.; Dardick, William
2016-01-01
This article introduces an entropy-based measure of data-model fit that can be used to assess the quality of logistic regression models. Entropy has previously been used in mixture-modeling to quantify how well individuals are classified into latent classes. The current study proposes the use of entropy for logistic regression models to quantify…
On the Usefulness of a Multilevel Logistic Regression Approach to Person-Fit Analysis
ERIC Educational Resources Information Center
Conijn, Judith M.; Emons, Wilco H. M.; van Assen, Marcel A. L. M.; Sijtsma, Klaas
2011-01-01
The logistic person response function (PRF) models the probability of a correct response as a function of the item locations. Reise (2000) proposed to use the slope parameter of the logistic PRF as a person-fit measure. He reformulated the logistic PRF model as a multilevel logistic regression model and estimated the PRF parameters from this…
Stylianou, Neophytos; Akbarov, Artur; Kontopantelis, Evangelos; Buchan, Iain; Dunn, Ken W
2015-08-01
Predicting mortality from burn injury has traditionally employed logistic regression models. Alternative machine learning methods have been introduced in some areas of clinical prediction as the necessary software and computational facilities have become accessible. Here we compare logistic regression and machine learning predictions of mortality from burn. An established logistic mortality model was compared to machine learning methods (artificial neural network, support vector machine, random forests and naïve Bayes) using a population-based (England & Wales) case-cohort registry. Predictive evaluation used: area under the receiver operating characteristic curve; sensitivity; specificity; positive predictive value and Youden's index. All methods had comparable discriminatory abilities, similar sensitivities, specificities and positive predictive values. Although some machine learning methods performed marginally better than logistic regression the differences were seldom statistically significant and clinically insubstantial. Random forests were marginally better for high positive predictive value and reasonable sensitivity. Neural networks yielded slightly better prediction overall. Logistic regression gives an optimal mix of performance and interpretability. The established logistic regression model of burn mortality performs well against more complex alternatives. Clinical prediction with a small set of strong, stable, independent predictors is unlikely to gain much from machine learning outside specialist research contexts. Copyright © 2015 Elsevier Ltd and ISBI. All rights reserved.
Valle, Denis; Lima, Joanna M Tucker; Millar, Justin; Amratia, Punam; Haque, Ubydul
2015-11-04
Logistic regression is a statistical model widely used in cross-sectional and cohort studies to identify and quantify the effects of potential disease risk factors. However, the impact of imperfect tests on adjusted odds ratios (and thus on the identification of risk factors) is under-appreciated. The purpose of this article is to draw attention to the problem associated with modelling imperfect diagnostic tests, and propose simple Bayesian models to adequately address this issue. A systematic literature review was conducted to determine the proportion of malaria studies that appropriately accounted for false-negatives/false-positives in a logistic regression setting. Inference from the standard logistic regression was also compared with that from three proposed Bayesian models using simulations and malaria data from the western Brazilian Amazon. A systematic literature review suggests that malaria epidemiologists are largely unaware of the problem of using logistic regression to model imperfect diagnostic test results. Simulation results reveal that statistical inference can be substantially improved when using the proposed Bayesian models versus the standard logistic regression. Finally, analysis of original malaria data with one of the proposed Bayesian models reveals that microscopy sensitivity is strongly influenced by how long people have lived in the study region, and an important risk factor (i.e., participation in forest extractivism) is identified that would have been missed by standard logistic regression. Given the numerous diagnostic methods employed by malaria researchers and the ubiquitous use of logistic regression to model the results of these diagnostic tests, this paper provides critical guidelines to improve data analysis practice in the presence of misclassification error. Easy-to-use code that can be readily adapted to WinBUGS is provided, enabling straightforward implementation of the proposed Bayesian models.
NASA Astrophysics Data System (ADS)
García-Rodríguez, M. J.; Malpica, J. A.; Benito, B.; Díaz, M.
2008-03-01
This work has evaluated the probability of earthquake-triggered landslide occurrence in the whole of El Salvador, with a Geographic Information System (GIS) and a logistic regression model. Slope gradient, elevation, aspect, mean annual precipitation, lithology, land use, and terrain roughness are the predictor variables used to determine the dependent variable of occurrence or non-occurrence of landslides within an individual grid cell. The results illustrate the importance of terrain roughness and soil type as key factors within the model — using only these two variables the analysis returned a significance level of 89.4%. The results obtained from the model within the GIS were then used to produce a map of relative landslide susceptibility.
Logistic regression for risk factor modelling in stuttering research.
Reed, Phil; Wu, Yaqionq
2013-06-01
To outline the uses of logistic regression and other statistical methods for risk factor analysis in the context of research on stuttering. The principles underlying the application of a logistic regression are illustrated, and the types of questions to which such a technique has been applied in the stuttering field are outlined. The assumptions and limitations of the technique are discussed with respect to existing stuttering research, and with respect to formulating appropriate research strategies to accommodate these considerations. Finally, some alternatives to the approach are briefly discussed. The way the statistical procedures are employed are demonstrated with some hypothetical data. Research into several practical issues concerning stuttering could benefit if risk factor modelling were used. Important examples are early diagnosis, prognosis (whether a child will recover or persist) and assessment of treatment outcome. After reading this article you will: (a) Summarize the situations in which logistic regression can be applied to a range of issues about stuttering; (b) Follow the steps in performing a logistic regression analysis; (c) Describe the assumptions of the logistic regression technique and the precautions that need to be checked when it is employed; (d) Be able to summarize its advantages over other techniques like estimation of group differences and simple regression. Copyright © 2012 Elsevier Inc. All rights reserved.
A Predictive Model for Readmissions Among Medicare Patients in a California Hospital.
Duncan, Ian; Huynh, Nhan
2017-11-17
Predictive models for hospital readmission rates are in high demand because of the Centers for Medicare & Medicaid Services (CMS) Hospital Readmission Reduction Program (HRRP). The LACE index is one of the most popular predictive tools among hospitals in the United States. The LACE index is a simple tool with 4 parameters: Length of stay, Acuity of admission, Comorbidity, and Emergency visits in the previous 6 months. The authors applied logistic regression to develop a predictive model for a medium-sized not-for-profit community hospital in California using patient-level data with more specific patient information (including 13 explanatory variables). Specifically, the logistic regression is applied to 2 populations: a general population including all patients and the specific group of patients targeted by the CMS penalty (characterized as ages 65 or older with select conditions). The 2 resulting logistic regression models have a higher sensitivity rate compared to the sensitivity of the LACE index. The C statistic values of the model applied to both populations demonstrate moderate levels of predictive power. The authors also build an economic model to demonstrate the potential financial impact of the use of the model for targeting high-risk patients in a sample hospital and demonstrate that, on balance, whether the hospital gains or loses from reducing readmissions depends on its margin and the extent of its readmission penalties.
Impact of low vision on employment.
Mojon-Azzi, Stefania M; Sousa-Poza, Alfonso; Mojon, Daniel S
2010-01-01
We investigated the influence of self-reported corrected eyesight on several variables describing the perception by employees and self-employed persons of their employment. Our study was based on data from the Survey of Health, Ageing and Retirement in Europe (SHARE). SHARE is a multidisciplinary, cross-national database of microdata on health, socioeconomic status, social and family networks, collected on 31,115 individuals in 11 European countries and in Israel. With the help of ordered logistic regressions and binary logistic regressions, we analyzed the influence of perceived visual impairment--corrected by 19 covariates capturing socioeconomic and health-related factors--on 10 variables describing the respondents' employment situation. Based on data covering 10,340 working individuals, the results of the logistic and ordered regressions indicate that respondents with lower levels of self-reported general eyesight were significantly less satisfied with their jobs, felt they had less freedom to decide, less opportunity to develop new skills, less support in difficult situations, less recognition for their work, and an inadequate salary. Respondents with a lower eyesight level more frequently reported that they feared their health might limit their ability to work before regular retirement age and more often indicated that they were seeking early retirement. Analysis of this dataset from 12 countries demonstrates the strong impact of self-reported visual impairment on individual employment, and therefore on job satisfaction, productivity, and well-being. Copyright © 2010 S. Karger AG, Basel.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hooman, A.; Mohammadzadeh, M
Some medical and epidemiological surveys have been designed to predict a nominal response variable with several levels. With regard to the type of pregnancy there are four possible states: wanted, unwanted by wife, unwanted by husband and unwanted by couple. In this paper, we have predicted the type of pregnancy, as well as the factors influencing it using three different models and comparing them. Regarding the type of pregnancy with several levels, we developed a multinomial logistic regression, a neural network and a flexible discrimination based on the data and compared their results using tow statistical indices: Surface under curvemore » (ROC) and kappa coefficient. Based on these tow indices, flexible discrimination proved to be a better fit for prediction on data in comparison to other methods. When the relations among variables are complex, one can use flexible discrimination instead of multinomial logistic regression and neural network to predict the nominal response variables with several levels in order to gain more accurate predictions.« less
Dynamic Dimensionality Selection for Bayesian Classifier Ensembles
2015-03-19
learning of weights in an otherwise generatively learned naive Bayes classifier. WANBIA-C is very cometitive to Logistic Regression but much more...classifier, Generative learning, Discriminative learning, Naïve Bayes, Feature selection, Logistic regression , higher order attribute independence 16...discriminative learning of weights in an otherwise generatively learned naive Bayes classifier. WANBIA-C is very cometitive to Logistic Regression but
Travis Woolley; David C. Shaw; Lisa M. Ganio; Stephen Fitzgerald
2012-01-01
Logistic regression models used to predict tree mortality are critical to post-fire management, planning prescribed bums and understanding disturbance ecology. We review literature concerning post-fire mortality prediction using logistic regression models for coniferous tree species in the western USA. We include synthesis and review of: methods to develop, evaluate...
Preserving Institutional Privacy in Distributed binary Logistic Regression.
Wu, Yuan; Jiang, Xiaoqian; Ohno-Machado, Lucila
2012-01-01
Privacy is becoming a major concern when sharing biomedical data across institutions. Although methods for protecting privacy of individual patients have been proposed, it is not clear how to protect the institutional privacy, which is many times a critical concern of data custodians. Built upon our previous work, Grid Binary LOgistic REgression (GLORE)1, we developed an Institutional Privacy-preserving Distributed binary Logistic Regression model (IPDLR) that considers both individual and institutional privacy for building a logistic regression model in a distributed manner. We tested our method using both simulated and clinical data, showing how it is possible to protect the privacy of individuals and of institutions using a distributed strategy.
Covariate Imbalance and Adjustment for Logistic Regression Analysis of Clinical Trial Data
Ciolino, Jody D.; Martin, Reneé H.; Zhao, Wenle; Jauch, Edward C.; Hill, Michael D.; Palesch, Yuko Y.
2014-01-01
In logistic regression analysis for binary clinical trial data, adjusted treatment effect estimates are often not equivalent to unadjusted estimates in the presence of influential covariates. This paper uses simulation to quantify the benefit of covariate adjustment in logistic regression. However, International Conference on Harmonization guidelines suggest that covariate adjustment be pre-specified. Unplanned adjusted analyses should be considered secondary. Results suggest that that if adjustment is not possible or unplanned in a logistic setting, balance in continuous covariates can alleviate some (but never all) of the shortcomings of unadjusted analyses. The case of log binomial regression is also explored. PMID:24138438
Differentially private distributed logistic regression using private and public data.
Ji, Zhanglong; Jiang, Xiaoqian; Wang, Shuang; Xiong, Li; Ohno-Machado, Lucila
2014-01-01
Privacy protecting is an important issue in medical informatics and differential privacy is a state-of-the-art framework for data privacy research. Differential privacy offers provable privacy against attackers who have auxiliary information, and can be applied to data mining models (for example, logistic regression). However, differentially private methods sometimes introduce too much noise and make outputs less useful. Given available public data in medical research (e.g. from patients who sign open-consent agreements), we can design algorithms that use both public and private data sets to decrease the amount of noise that is introduced. In this paper, we modify the update step in Newton-Raphson method to propose a differentially private distributed logistic regression model based on both public and private data. We try our algorithm on three different data sets, and show its advantage over: (1) a logistic regression model based solely on public data, and (2) a differentially private distributed logistic regression model based on private data under various scenarios. Logistic regression models built with our new algorithm based on both private and public datasets demonstrate better utility than models that trained on private or public datasets alone without sacrificing the rigorous privacy guarantee.
Amini, Payam; Maroufizadeh, Saman; Samani, Reza Omani; Hamidi, Omid; Sepidarkish, Mahdi
2017-06-01
Preterm birth (PTB) is a leading cause of neonatal death and the second biggest cause of death in children under five years of age. The objective of this study was to determine the prevalence of PTB and its associated factors using logistic regression and decision tree classification methods. This cross-sectional study was conducted on 4,415 pregnant women in Tehran, Iran, from July 6-21, 2015. Data were collected by a researcher-developed questionnaire through interviews with mothers and review of their medical records. To evaluate the accuracy of the logistic regression and decision tree methods, several indices such as sensitivity, specificity, and the area under the curve were used. The PTB rate was 5.5% in this study. The logistic regression outperformed the decision tree for the classification of PTB based on risk factors. Logistic regression showed that multiple pregnancies, mothers with preeclampsia, and those who conceived with assisted reproductive technology had an increased risk for PTB ( p < 0.05). Identifying and training mothers at risk as well as improving prenatal care may reduce the PTB rate. We also recommend that statisticians utilize the logistic regression model for the classification of risk groups for PTB.
Tangen, C M; Koch, G G
1999-03-01
In the randomized clinical trial setting, controlling for covariates is expected to produce variance reduction for the treatment parameter estimate and to adjust for random imbalances of covariates between the treatment groups. However, for the logistic regression model, variance reduction is not obviously obtained. This can lead to concerns about the assumptions of the logistic model. We introduce a complementary nonparametric method for covariate adjustment. It provides results that are usually compatible with expectations for analysis of covariance. The only assumptions required are based on randomization and sampling arguments. The resulting treatment parameter is a (unconditional) population average log-odds ratio that has been adjusted for random imbalance of covariates. Data from a randomized clinical trial are used to compare results from the traditional maximum likelihood logistic method with those from the nonparametric logistic method. We examine treatment parameter estimates, corresponding standard errors, and significance levels in models with and without covariate adjustment. In addition, we discuss differences between unconditional population average treatment parameters and conditional subpopulation average treatment parameters. Additional features of the nonparametric method, including stratified (multicenter) and multivariate (multivisit) analyses, are illustrated. Extensions of this methodology to the proportional odds model are also made.
2011-01-01
Background The relationship between asthma and traffic-related pollutants has received considerable attention. The use of individual-level exposure measures, such as residence location or proximity to emission sources, may avoid ecological biases. Method This study focused on the pediatric Medicaid population in Detroit, MI, a high-risk population for asthma-related events. A population-based matched case-control analysis was used to investigate associations between acute asthma outcomes and proximity of residence to major roads, including freeways. Asthma cases were identified as all children who made at least one asthma claim, including inpatient and emergency department visits, during the three-year study period, 2004-06. Individually matched controls were randomly selected from the rest of the Medicaid population on the basis of non-respiratory related illness. We used conditional logistic regression with distance as both categorical and continuous variables, and examined non-linear relationships with distance using polynomial splines. The conditional logistic regression models were then extended by considering multiple asthma states (based on the frequency of acute asthma outcomes) using polychotomous conditional logistic regression. Results Asthma events were associated with proximity to primary roads with an odds ratio of 0.97 (95% CI: 0.94, 0.99) for a 1 km increase in distance using conditional logistic regression, implying that asthma events are less likely as the distance between the residence and a primary road increases. Similar relationships and effect sizes were found using polychotomous conditional logistic regression. Another plausible exposure metric, a reduced form response surface model that represents atmospheric dispersion of pollutants from roads, was not associated under that exposure model. Conclusions There is moderately strong evidence of elevated risk of asthma close to major roads based on the results obtained in this population-based matched case-control study. PMID:21513554
Shi, Huilan; Jia, Junya; Li, Dong; Wei, Li; Shang, Wenya; Zheng, Zhenfeng
2018-02-09
Precise renal histopathological diagnosis will guide therapy strategy in patients with lupus nephritis. Blood oxygen level dependent (BOLD) magnetic resonance imaging (MRI) has been applicable noninvasive technique in renal disease. This current study was performed to explore whether BOLD MRI could contribute to diagnose renal pathological pattern. Adult patients with lupus nephritis renal pathological diagnosis were recruited for this study. Renal biopsy tissues were assessed based on the lupus nephritis ISN/RPS 2003 classification. The Blood oxygen level dependent magnetic resonance imaging (BOLD-MRI) was used to obtain functional magnetic resonance parameter, R2* values. Several functions of R2* values were calculated and used to construct algorithmic models for renal pathological patterns. In addition, the algorithmic models were compared as to their diagnostic capability. Both Histopathology and BOLD MRI were used to examine a total of twelve patients. Renal pathological patterns included five classes III (including 3 as class III + V) and seven classes IV (including 4 as class IV + V). Three algorithmic models, including decision tree, line discriminant, and logistic regression, were constructed to distinguish the renal pathological pattern of class III and class IV. The sensitivity of the decision tree model was better than that of the line discriminant model (71.87% vs 59.48%, P < 0.001) and inferior to that of the Logistic regression model (71.87% vs 78.71%, P < 0.001). The specificity of decision tree model was equivalent to that of the line discriminant model (63.87% vs 63.73%, P = 0.939) and higher than that of the logistic regression model (63.87% vs 38.0%, P < 0.001). The Area under the ROC curve (AUROCC) of the decision tree model was greater than that of the line discriminant model (0.765 vs 0.629, P < 0.001) and logistic regression model (0.765 vs 0.662, P < 0.001). BOLD MRI is a useful non-invasive imaging technique for the evaluation of lupus nephritis. Decision tree models constructed using functions of R2* values may facilitate the prediction of renal pathological patterns.
Ashtari, Fereshte; Esmaeil, Nafiseh; Mansourian, Marjan; Poursafa, Parinaz; Mirmosayyeb, Omid; Barzegar, Mahdi; Pourgheisari, Hajar
2018-06-15
The evidence for an impact of ambient air pollution on the incidence and severity of multiple sclerosis (MS) is still limited. In the present study, we assessed the association between daily air pollution levels and MS prevalence and severity in Isfahan city, Iran. Data related to MS patients has been collected from 2008 to 2016 in a referral university clinic. The air quality index (AQI) data, were collected from 6 monitoring stations of Isfahan department of environment. The distribution map presenting the sites of air pollution monitoring stations as well as the residential address of MS patients was plotted on geographical information system (GIS). An increase in AQI level in four areas of the city (north, west, east and south) was associated with higher expanded disability status scale (EDSS) of MS patients[logistic regression odds ratio = 1.01 (95% CI = 1.008,1.012)]. Moreover, significant inverse association between the complete remission after the first attack with AQI level in total areas [logistic regression odds ratio = 0.987 (95% CI = 0.977, 0.997)] was found in crude model. However, after adjustment for confounding variables through multivariate logistic regression, AQI level was associated with degree of complete remission after first attack 1.005 (95% CI = 1.004, 1.006). The results of our study suggest that air pollution could play a role in the severity and remission of MS disease. However, more detailed studies with considering the complex involvement of different environmental factors including sunlight exposure, diet, depression and vitamin D are needed to determine the outcome of MS. Copyright © 2018 Elsevier B.V. All rights reserved.
Sperm Retrieval in Patients with Klinefelter Syndrome: A Skewed Regression Model Analysis.
Chehrazi, Mohammad; Rahimiforoushani, Abbas; Sabbaghian, Marjan; Nourijelyani, Keramat; Sadighi Gilani, Mohammad Ali; Hoseini, Mostafa; Vesali, Samira; Yaseri, Mehdi; Alizadeh, Ahad; Mohammad, Kazem; Samani, Reza Omani
2017-01-01
The most common chromosomal abnormality due to non-obstructive azoospermia (NOA) is Klinefelter syndrome (KS) which occurs in 1-1.72 out of 500-1000 male infants. The probability of retrieving sperm as the outcome could be asymmetrically different between patients with and without KS, therefore logistic regression analysis is not a well-qualified test for this type of data. This study has been designed to evaluate skewed regression model analysis for data collected from microsurgical testicular sperm extraction (micro-TESE) among azoospermic patients with and without non-mosaic KS syndrome. This cohort study compared the micro-TESE outcome between 134 men with classic KS and 537 men with NOA and normal karyotype who were referred to Royan Institute between 2009 and 2011. In addition to our main outcome, which was sperm retrieval, we also used logistic and skewed regression analyses to compare the following demographic and hormonal factors: age, level of follicle stimulating hormone (FSH), luteinizing hormone (LH), and testosterone between the two groups. A comparison of the micro-TESE between the KS and control groups showed a success rate of 28.4% (38/134) for the KS group and 22.2% (119/537) for the control group. In the KS group, a significantly difference (P<0.001) existed between testosterone levels for the successful sperm retrieval group (3.4 ± 0.48 mg/mL) compared to the unsuccessful sperm retrieval group (2.33 ± 0.23 mg/mL). The index for quasi Akaike information criterion (QAIC) had a goodness of fit of 74 for the skewed model which was lower than logistic regression (QAIC=85). According to the results, skewed regression is more efficient in estimating sperm retrieval success when the data from patients with KS are analyzed. This finding should be investigated by conducting additional studies with different data structures.
Are Women More Likely to Be Hired or Promoted into Management Positions?
ERIC Educational Resources Information Center
Lyness, Karen S.; Judiesch, Michael K.
1999-01-01
In a three-year study of 30,996 financial-services managers, logistic regression analyses showed that women were more likely to be promoted rather than hired into management positions. Relative to men, women in higher-level positions received fewer promotions than women in lower-level positions. (63 references) (SK)
Logistic regression for dichotomized counts.
Preisser, John S; Das, Kalyan; Benecha, Habtamu; Stamm, John W
2016-12-01
Sometimes there is interest in a dichotomized outcome indicating whether a count variable is positive or zero. Under this scenario, the application of ordinary logistic regression may result in efficiency loss, which is quantifiable under an assumed model for the counts. In such situations, a shared-parameter hurdle model is investigated for more efficient estimation of regression parameters relating to overall effects of covariates on the dichotomous outcome, while handling count data with many zeroes. One model part provides a logistic regression containing marginal log odds ratio effects of primary interest, while an ancillary model part describes the mean count of a Poisson or negative binomial process in terms of nuisance regression parameters. Asymptotic efficiency of the logistic model parameter estimators of the two-part models is evaluated with respect to ordinary logistic regression. Simulations are used to assess the properties of the models with respect to power and Type I error, the latter investigated under both misspecified and correctly specified models. The methods are applied to data from a randomized clinical trial of three toothpaste formulations to prevent incident dental caries in a large population of Scottish schoolchildren. © The Author(s) 2014.
Zhu, K; Lou, Z; Zhou, J; Ballester, N; Kong, N; Parikh, P
2015-01-01
This article is part of the Focus Theme of Methods of Information in Medicine on "Big Data and Analytics in Healthcare". Hospital readmissions raise healthcare costs and cause significant distress to providers and patients. It is, therefore, of great interest to healthcare organizations to predict what patients are at risk to be readmitted to their hospitals. However, current logistic regression based risk prediction models have limited prediction power when applied to hospital administrative data. Meanwhile, although decision trees and random forests have been applied, they tend to be too complex to understand among the hospital practitioners. Explore the use of conditional logistic regression to increase the prediction accuracy. We analyzed an HCUP statewide inpatient discharge record dataset, which includes patient demographics, clinical and care utilization data from California. We extracted records of heart failure Medicare beneficiaries who had inpatient experience during an 11-month period. We corrected the data imbalance issue with under-sampling. In our study, we first applied standard logistic regression and decision tree to obtain influential variables and derive practically meaning decision rules. We then stratified the original data set accordingly and applied logistic regression on each data stratum. We further explored the effect of interacting variables in the logistic regression modeling. We conducted cross validation to assess the overall prediction performance of conditional logistic regression (CLR) and compared it with standard classification models. The developed CLR models outperformed several standard classification models (e.g., straightforward logistic regression, stepwise logistic regression, random forest, support vector machine). For example, the best CLR model improved the classification accuracy by nearly 20% over the straightforward logistic regression model. Furthermore, the developed CLR models tend to achieve better sensitivity of more than 10% over the standard classification models, which can be translated to correct labeling of additional 400 - 500 readmissions for heart failure patients in the state of California over a year. Lastly, several key predictor identified from the HCUP data include the disposition location from discharge, the number of chronic conditions, and the number of acute procedures. It would be beneficial to apply simple decision rules obtained from the decision tree in an ad-hoc manner to guide the cohort stratification. It could be potentially beneficial to explore the effect of pairwise interactions between influential predictors when building the logistic regression models for different data strata. Judicious use of the ad-hoc CLR models developed offers insights into future development of prediction models for hospital readmissions, which can lead to better intuition in identifying high-risk patients and developing effective post-discharge care strategies. Lastly, this paper is expected to raise the awareness of collecting data on additional markers and developing necessary database infrastructure for larger-scale exploratory studies on readmission risk prediction.
ERIC Educational Resources Information Center
Leatherdale, Scott T.
2010-01-01
The objective is to examine school-level program and policy characteristics and student-level behavioural characteristics associated with being overweight. Multilevel logistic regression analysis were used to examine the school- and student-level characteristics associated with the odds of a student being overweight among 1264 Grade 5-8 students…
The Impact of Household Heads' Education Levels on the Poverty Risk: The Evidence from Turkey
ERIC Educational Resources Information Center
Bilenkisi, Fikret; Gungor, Mahmut Sami; Tapsin, Gulcin
2015-01-01
This study aims to analyze the relationship between the education levels of household heads and the poverty risk of households in Turkey. The logistic regression models have been estimated with the poverty risk of a household as a dependent variable and a set of educational levels as explanatory variables for all households. There are subgroups of…
ERIC Educational Resources Information Center
Childs, Kristina; Dembo, Richard; Belenko, Steven; Wareham, Jennifer; Schmeidler, James
2011-01-01
Variations in drug use have been found across individual-level factors and community characteristics, and by type of drug used. Relatively little research, however, has examined this variation among juvenile offenders. Based on a sample of 924 newly arrested juvenile offenders, two multilevel logistic regression models predicting marijuana test…
Interpretation of commonly used statistical regression models.
Kasza, Jessica; Wolfe, Rory
2014-01-01
A review of some regression models commonly used in respiratory health applications is provided in this article. Simple linear regression, multiple linear regression, logistic regression and ordinal logistic regression are considered. The focus of this article is on the interpretation of the regression coefficients of each model, which are illustrated through the application of these models to a respiratory health research study. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.
Jiang, Jun; Lei, Lan; Zhou, Xiaowan; Li, Peng; Wei, Ren
2018-02-20
Recent studies have shown that low hemoglobin (Hb) level promote the progression of chronic kidney disease. This study assessed the relationship between Hb level and type 1 diabetic nephropathy (DN) in Anhui Han's patients. There were a total of 236 patients diagnosed with type 1 diabetes mellitus and (T1DM) seen between January 2014 and December 2016 in our centre. Hemoglobin levels in patients with DN were compared with those without DN. The relationship between Hb level and the urinary albumin-creatinine ratio (ACR) was examined by Spearman's correlational analysis and multiple stepwise regression analysis. The binary logistic multivariate regression analysis was performed to analyze the correlated factors for type 1 DN, calculate the Odds Ratio (OR) and 95%confidence interval (CI). The predicting value of Hb level for DN was evaluated by area under receiver operation characteristic curve (AUROC) for discrimination and Hosmer-Lemeshow goodness-of-fit test for calibration. The average Hb levels in the DN group (116.1 ± 20.8 g/L) were significantly lower than the non-DN group (131.9 ± 14.4 g/L) , P < 0.001. Hb levels were independently correlated with the urinary ACR in multiple stepwise regression analysis. The logistic multivariate regression analysis showed that the Hb level (OR: 0.936, 95% CI: 0.910 to 0.963, P < 0.001) was inversely correlated with DN in patients with T1DM. In sub-analysis, low Hb level (Hb < 120g/L in female, Hb < 130g/L in male) was still negatively associated with DN in patients with T1DM. The AUROC was 0.721 (95% CI: 0.655 to 0.787) in assessing the discrimination of the Hb level for DN. The value of P was 0.593 in Hosmer-Lemeshow goodness-of-fit test. In Anhui Han's patients with T1DM, the Hb level is inversely correlated with urinary ACR and DN. This article is protected by copyright. All rights reserved.
Differentially private distributed logistic regression using private and public data
2014-01-01
Background Privacy protecting is an important issue in medical informatics and differential privacy is a state-of-the-art framework for data privacy research. Differential privacy offers provable privacy against attackers who have auxiliary information, and can be applied to data mining models (for example, logistic regression). However, differentially private methods sometimes introduce too much noise and make outputs less useful. Given available public data in medical research (e.g. from patients who sign open-consent agreements), we can design algorithms that use both public and private data sets to decrease the amount of noise that is introduced. Methodology In this paper, we modify the update step in Newton-Raphson method to propose a differentially private distributed logistic regression model based on both public and private data. Experiments and results We try our algorithm on three different data sets, and show its advantage over: (1) a logistic regression model based solely on public data, and (2) a differentially private distributed logistic regression model based on private data under various scenarios. Conclusion Logistic regression models built with our new algorithm based on both private and public datasets demonstrate better utility than models that trained on private or public datasets alone without sacrificing the rigorous privacy guarantee. PMID:25079786
Park, Ji Hyun; Kim, Hyeon-Young; Lee, Hanna; Yun, Eun Kyoung
2015-12-01
This study compares the performance of the logistic regression and decision tree analysis methods for assessing the risk factors for infection in cancer patients undergoing chemotherapy. The subjects were 732 cancer patients who were receiving chemotherapy at K university hospital in Seoul, Korea. The data were collected between March 2011 and February 2013 and were processed for descriptive analysis, logistic regression and decision tree analysis using the IBM SPSS Statistics 19 and Modeler 15.1 programs. The most common risk factors for infection in cancer patients receiving chemotherapy were identified as alkylating agents, vinca alkaloid and underlying diabetes mellitus. The logistic regression explained 66.7% of the variation in the data in terms of sensitivity and 88.9% in terms of specificity. The decision tree analysis accounted for 55.0% of the variation in the data in terms of sensitivity and 89.0% in terms of specificity. As for the overall classification accuracy, the logistic regression explained 88.0% and the decision tree analysis explained 87.2%. The logistic regression analysis showed a higher degree of sensitivity and classification accuracy. Therefore, logistic regression analysis is concluded to be the more effective and useful method for establishing an infection prediction model for patients undergoing chemotherapy. Copyright © 2015 Elsevier Ltd. All rights reserved.
Yang, Lixue; Chen, Kean
2015-11-01
To improve the design of underwater target recognition systems based on auditory perception, this study compared human listeners with automatic classifiers. Performances measures and strategies in three discrimination experiments, including discriminations between man-made and natural targets, between ships and submarines, and among three types of ships, were used. In the experiments, the subjects were asked to assign a score to each sound based on how confident they were about the category to which it belonged, and logistic regression, which represents linear discriminative models, also completed three similar tasks by utilizing many auditory features. The results indicated that the performances of logistic regression improved as the ratio between inter- and intra-class differences became larger, whereas the performances of the human subjects were limited by their unfamiliarity with the targets. Logistic regression performed better than the human subjects in all tasks but the discrimination between man-made and natural targets, and the strategies employed by excellent human subjects were similar to that of logistic regression. Logistic regression and several human subjects demonstrated similar performances when discriminating man-made and natural targets, but in this case, their strategies were not similar. An appropriate fusion of their strategies led to further improvement in recognition accuracy.
NASA Astrophysics Data System (ADS)
Mei, Zhixiong; Wu, Hao; Li, Shiyun
2018-06-01
The Conversion of Land Use and its Effects at Small regional extent (CLUE-S), which is a widely used model for land-use simulation, utilizes logistic regression to estimate the relationships between land use and its drivers, and thus, predict land-use change probabilities. However, logistic regression disregards possible spatial autocorrelation and self-organization in land-use data. Autologistic regression can depict spatial autocorrelation but cannot address self-organization, while logistic regression by considering only self-organization (NElogistic regression) fails to capture spatial autocorrelation. Therefore, this study developed a regression (NE-autologistic regression) method, which incorporated both spatial autocorrelation and self-organization, to improve CLUE-S. The Zengcheng District of Guangzhou, China was selected as the study area. The land-use data of 2001, 2005, and 2009, as well as 10 typical driving factors, were used to validate the proposed regression method and the improved CLUE-S model. Then, three future land-use scenarios in 2020: the natural growth scenario, ecological protection scenario, and economic development scenario, were simulated using the improved model. Validation results showed that NE-autologistic regression performed better than logistic regression, autologistic regression, and NE-logistic regression in predicting land-use change probabilities. The spatial allocation accuracy and kappa values of NE-autologistic-CLUE-S were higher than those of logistic-CLUE-S, autologistic-CLUE-S, and NE-logistic-CLUE-S for the simulations of two periods, 2001-2009 and 2005-2009, which proved that the improved CLUE-S model achieved the best simulation and was thereby effective to a certain extent. The scenario simulation results indicated that under all three scenarios, traffic land and residential/industrial land would increase, whereas arable land and unused land would decrease during 2009-2020. Apparent differences also existed in the simulated change sizes and locations of each land-use type under different scenarios. The results not only demonstrate the validity of the improved model but also provide a valuable reference for relevant policy-makers.
Unitary Response Regression Models
ERIC Educational Resources Information Center
Lipovetsky, S.
2007-01-01
The dependent variable in a regular linear regression is a numerical variable, and in a logistic regression it is a binary or categorical variable. In these models the dependent variable has varying values. However, there are problems yielding an identity output of a constant value which can also be modelled in a linear or logistic regression with…
Binary logistic regression-Instrument for assessing museum indoor air impact on exhibits.
Bucur, Elena; Danet, Andrei Florin; Lehr, Carol Blaziu; Lehr, Elena; Nita-Lazar, Mihai
2017-04-01
This paper presents a new way to assess the environmental impact on historical artifacts using binary logistic regression. The prediction of the impact on the exhibits during certain pollution scenarios (environmental impact) was calculated by a mathematical model based on the binary logistic regression; it allows the identification of those environmental parameters from a multitude of possible parameters with a significant impact on exhibitions and ranks them according to their severity effect. Air quality (NO 2 , SO 2 , O 3 and PM 2.5 ) and microclimate parameters (temperature, humidity) monitoring data from a case study conducted within exhibition and storage spaces of the Romanian National Aviation Museum Bucharest have been used for developing and validating the binary logistic regression method and the mathematical model. The logistic regression analysis was used on 794 data combinations (715 to develop of the model and 79 to validate it) by a Statistical Package for Social Sciences (SPSS 20.0). The results from the binary logistic regression analysis demonstrated that from six parameters taken into consideration, four of them present a significant effect upon exhibits in the following order: O 3 >PM 2.5 >NO 2 >humidity followed at a significant distance by the effects of SO 2 and temperature. The mathematical model, developed in this study, correctly predicted 95.1 % of the cumulated effect of the environmental parameters upon the exhibits. Moreover, this model could also be used in the decisional process regarding the preventive preservation measures that should be implemented within the exhibition space. The paper presents a new way to assess the environmental impact on historical artifacts using binary logistic regression. The mathematical model developed on the environmental parameters analyzed by the binary logistic regression method could be useful in a decision-making process establishing the best measures for pollution reduction and preventive preservation of exhibits.
Determining factors influencing survival of breast cancer by fuzzy logistic regression model.
Nikbakht, Roya; Bahrampour, Abbas
2017-01-01
Fuzzy logistic regression model can be used for determining influential factors of disease. This study explores the important factors of actual predictive survival factors of breast cancer's patients. We used breast cancer data which collected by cancer registry of Kerman University of Medical Sciences during the period of 2000-2007. The variables such as morphology, grade, age, and treatments (surgery, radiotherapy, and chemotherapy) were applied in the fuzzy logistic regression model. Performance of model was determined in terms of mean degree of membership (MDM). The study results showed that almost 41% of patients were in neoplasm and malignant group and more than two-third of them were still alive after 5-year follow-up. Based on the fuzzy logistic model, the most important factors influencing survival were chemotherapy, morphology, and radiotherapy, respectively. Furthermore, the MDM criteria show that the fuzzy logistic regression have a good fit on the data (MDM = 0.86). Fuzzy logistic regression model showed that chemotherapy is more important than radiotherapy in survival of patients with breast cancer. In addition, another ability of this model is calculating possibilistic odds of survival in cancer patients. The results of this study can be applied in clinical research. Furthermore, there are few studies which applied the fuzzy logistic models. Furthermore, we recommend using this model in various research areas.
Improving power and robustness for detecting genetic association with extreme-value sampling design.
Chen, Hua Yun; Li, Mingyao
2011-12-01
Extreme-value sampling design that samples subjects with extremely large or small quantitative trait values is commonly used in genetic association studies. Samples in such designs are often treated as "cases" and "controls" and analyzed using logistic regression. Such a case-control analysis ignores the potential dose-response relationship between the quantitative trait and the underlying trait locus and thus may lead to loss of power in detecting genetic association. An alternative approach to analyzing such data is to model the dose-response relationship by a linear regression model. However, parameter estimation from this model can be biased, which may lead to inflated type I errors. We propose a robust and efficient approach that takes into consideration of both the biased sampling design and the potential dose-response relationship. Extensive simulations demonstrate that the proposed method is more powerful than the traditional logistic regression analysis and is more robust than the linear regression analysis. We applied our method to the analysis of a candidate gene association study on high-density lipoprotein cholesterol (HDL-C) which includes study subjects with extremely high or low HDL-C levels. Using our method, we identified several SNPs showing a stronger evidence of association with HDL-C than the traditional case-control logistic regression analysis. Our results suggest that it is important to appropriately model the quantitative traits and to adjust for the biased sampling when dose-response relationship exists in extreme-value sampling designs. © 2011 Wiley Periodicals, Inc.
Supporting Regularized Logistic Regression Privately and Efficiently.
Li, Wenfa; Liu, Hongzhe; Yang, Peng; Xie, Wei
2016-01-01
As one of the most popular statistical and machine learning models, logistic regression with regularization has found wide adoption in biomedicine, social sciences, information technology, and so on. These domains often involve data of human subjects that are contingent upon strict privacy regulations. Concerns over data privacy make it increasingly difficult to coordinate and conduct large-scale collaborative studies, which typically rely on cross-institution data sharing and joint analysis. Our work here focuses on safeguarding regularized logistic regression, a widely-used statistical model while at the same time has not been investigated from a data security and privacy perspective. We consider a common use scenario of multi-institution collaborative studies, such as in the form of research consortia or networks as widely seen in genetics, epidemiology, social sciences, etc. To make our privacy-enhancing solution practical, we demonstrate a non-conventional and computationally efficient method leveraging distributing computing and strong cryptography to provide comprehensive protection over individual-level and summary data. Extensive empirical evaluations on several studies validate the privacy guarantee, efficiency and scalability of our proposal. We also discuss the practical implications of our solution for large-scale studies and applications from various disciplines, including genetic and biomedical studies, smart grid, network analysis, etc.
Supporting Regularized Logistic Regression Privately and Efficiently
Li, Wenfa; Liu, Hongzhe; Yang, Peng; Xie, Wei
2016-01-01
As one of the most popular statistical and machine learning models, logistic regression with regularization has found wide adoption in biomedicine, social sciences, information technology, and so on. These domains often involve data of human subjects that are contingent upon strict privacy regulations. Concerns over data privacy make it increasingly difficult to coordinate and conduct large-scale collaborative studies, which typically rely on cross-institution data sharing and joint analysis. Our work here focuses on safeguarding regularized logistic regression, a widely-used statistical model while at the same time has not been investigated from a data security and privacy perspective. We consider a common use scenario of multi-institution collaborative studies, such as in the form of research consortia or networks as widely seen in genetics, epidemiology, social sciences, etc. To make our privacy-enhancing solution practical, we demonstrate a non-conventional and computationally efficient method leveraging distributing computing and strong cryptography to provide comprehensive protection over individual-level and summary data. Extensive empirical evaluations on several studies validate the privacy guarantee, efficiency and scalability of our proposal. We also discuss the practical implications of our solution for large-scale studies and applications from various disciplines, including genetic and biomedical studies, smart grid, network analysis, etc. PMID:27271738
Classification of Effective Soil Depth by Using Multinomial Logistic Regression Analysis
NASA Astrophysics Data System (ADS)
Chang, C. H.; Chan, H. C.; Chen, B. A.
2016-12-01
Classification of effective soil depth is a task of determining the slopeland utilizable limitation in Taiwan. The "Slopeland Conservation and Utilization Act" categorizes the slopeland into agriculture and husbandry land, land suitable for forestry and land for enhanced conservation according to the factors including average slope, effective soil depth, soil erosion and parental rock. However, sit investigation of the effective soil depth requires a cost-effective field work. This research aimed to classify the effective soil depth by using multinomial logistic regression with the environmental factors. The Wen-Shui Watershed located at the central Taiwan was selected as the study areas. The analysis of multinomial logistic regression is performed by the assistance of a Geographic Information Systems (GIS). The effective soil depth was categorized into four levels including deeper, deep, shallow and shallower. The environmental factors of slope, aspect, digital elevation model (DEM), curvature and normalized difference vegetation index (NDVI) were selected for classifying the soil depth. An Error Matrix was then used to assess the model accuracy. The results showed an overall accuracy of 75%. At the end, a map of effective soil depth was produced to help planners and decision makers in determining the slopeland utilizable limitation in the study areas.
Does Group-Level Commitment Predict Employee Well-Being?: A Prospective Analysis.
Clausen, Thomas; Christensen, Karl Bang; Nielsen, Karina
2015-11-01
To investigate the links between group-level affective organizational commitment (AOC) and individual-level psychological well-being, self-reported sickness absence, and sleep disturbances. A total of 5085 care workers from 301 workgroups in the Danish eldercare services participated in both waves of the study (T1 [2005] and T2 [2006]). The three outcomes were analyzed using linear multilevel regression analysis, multilevel Poisson regression analysis, and multilevel logistic regression analysis, respectively. Group-level AOC (T1) significantly predicted individual-level psychological well-being, self-reported sickness absence, and sleep disturbances (T2). The association between group-level AOC (T1) and psychological well-being (T2) was fully mediated by individual-level AOC (T1), and the associations between group-level AOC (T1) and self-reported sickness absence and sleep disturbances (T2) were partially mediated by individual-level AOC (T1). Group-level AOC is an important predictor of employee well-being in contemporary health care organizations.
Ding, Xiaohan; Ye, Ping; Wang, Xiaona; Cao, Ruihua; Yang, Xu; Xiao, Wenkai; Zhang, Yun; Bai, Yongyi; Wu, Hongmei
2017-03-01
This prospective cohort study aimed at identifying association between uric acid (UA) and peripheral arterial stiffness. A prospective cohort longitudinal study was performed according to an average of 4.8 years' follow-up. The demographic data, anthropometric parameters, peripheral arterial stiffness (carotid-radial pulse-wave velocity, cr-PWV) and biomarker variables including UA were examined at both baseline and follow-up. Pearson's correlations were used to identify the associations between UA and peripheral arterial stiffness. Further logistic regressions were employed to determine the associations between UA and arterial stiffness. At the end of follow-up, 1447 subjects were included in the analyses. At baseline, cr-PWV ( r = 0.200, p < 0.001) was closely associated with UA. Furthermore, the follow-up cr-PWV ( r = 0.145, p < 0.001) was also strongly correlated to baseline UA in Pearson's correlation analysis. Multiple regressions also indicated the association between follow-up cr-PWV ( β = 0.493, p = 0.013) and baseline UA level. Logistic regressions revealed that higher baseline UA level was an independent predictor of arterial stiffness severity assessed by cr-PWV at follow-up cross-section. Peripheral arterial stiffness is closely associated with higher baseline UA level. Furthermore, a higher baseline UA level is an independent risk factor and predictor for peripheral arterial stiffness.
Mixed conditional logistic regression for habitat selection studies.
Duchesne, Thierry; Fortin, Daniel; Courbin, Nicolas
2010-05-01
1. Resource selection functions (RSFs) are becoming a dominant tool in habitat selection studies. RSF coefficients can be estimated with unconditional (standard) and conditional logistic regressions. While the advantage of mixed-effects models is recognized for standard logistic regression, mixed conditional logistic regression remains largely overlooked in ecological studies. 2. We demonstrate the significance of mixed conditional logistic regression for habitat selection studies. First, we use spatially explicit models to illustrate how mixed-effects RSFs can be useful in the presence of inter-individual heterogeneity in selection and when the assumption of independence from irrelevant alternatives (IIA) is violated. The IIA hypothesis states that the strength of preference for habitat type A over habitat type B does not depend on the other habitat types also available. Secondly, we demonstrate the significance of mixed-effects models to evaluate habitat selection of free-ranging bison Bison bison. 3. When movement rules were homogeneous among individuals and the IIA assumption was respected, fixed-effects RSFs adequately described habitat selection by simulated animals. In situations violating the inter-individual homogeneity and IIA assumptions, however, RSFs were best estimated with mixed-effects regressions, and fixed-effects models could even provide faulty conclusions. 4. Mixed-effects models indicate that bison did not select farmlands, but exhibited strong inter-individual variations in their response to farmlands. Less than half of the bison preferred farmlands over forests. Conversely, the fixed-effect model simply suggested an overall selection for farmlands. 5. Conditional logistic regression is recognized as a powerful approach to evaluate habitat selection when resource availability changes. This regression is increasingly used in ecological studies, but almost exclusively in the context of fixed-effects models. Fitness maximization can imply differences in trade-offs among individuals, which can yield inter-individual differences in selection and lead to departure from IIA. These situations are best modelled with mixed-effects models. Mixed-effects conditional logistic regression should become a valuable tool for ecological research.
A Multilevel Study of Students' Motivations of Studying Accounting: Implications for Employers
ERIC Educational Resources Information Center
Law, Philip; Yuen, Desmond
2012-01-01
Purpose: The purpose of this study is to examine the influence of factors affecting students' choice of accounting as a study major in Hong Kong. Design/methodology/approach: Multinomial logistic regression and Hierarchical Generalized Linear Modeling (HGLM) are used to analyze the survey data for the level one and level two data, which is the…
Modeling individual tree survial
Quang V. Cao
2016-01-01
Information provided by growth and yield models is the basis for forest managers to make decisions on how to manage their forests. Among different types of growth models, whole-stand models offer predictions at stand level, whereas individual-tree models give detailed information at tree level. The well-known logistic regression is commonly used to predict tree...
Is It Considered Violence? The Acceptability of Physical Punishment of Children in Europe
ERIC Educational Resources Information Center
Gracia, Enrique; Herrero, Juan
2008-01-01
This study analyzes correlates of the acceptability of physical punishment of children in Europe. The design was a three-level ordinal logistic regression of 10,812 people nested within 208 localities (cities), nested within 14 countries of the European Union. Results showed that higher levels of acceptability were reported by men, the older, the…
John W. Coulston
2011-01-01
Tropospheric ozone occurs at phytotoxic levels in the United States (Lefohn and Pinkerton 1988). Several plant species, including commercially important timber species, are sensitive to elevated ozone levels. Exposure to elevated ozone can cause growth reduction and foliar injury and make trees more susceptible to secondary stressors such as insects and pathogens (...
Advanced colorectal neoplasia risk stratification by penalized logistic regression.
Lin, Yunzhi; Yu, Menggang; Wang, Sijian; Chappell, Richard; Imperiale, Thomas F
2016-08-01
Colorectal cancer is the second leading cause of death from cancer in the United States. To facilitate the efficiency of colorectal cancer screening, there is a need to stratify risk for colorectal cancer among the 90% of US residents who are considered "average risk." In this article, we investigate such risk stratification rules for advanced colorectal neoplasia (colorectal cancer and advanced, precancerous polyps). We use a recently completed large cohort study of subjects who underwent a first screening colonoscopy. Logistic regression models have been used in the literature to estimate the risk of advanced colorectal neoplasia based on quantifiable risk factors. However, logistic regression may be prone to overfitting and instability in variable selection. Since most of the risk factors in our study have several categories, it was tempting to collapse these categories into fewer risk groups. We propose a penalized logistic regression method that automatically and simultaneously selects variables, groups categories, and estimates their coefficients by penalizing the [Formula: see text]-norm of both the coefficients and their differences. Hence, it encourages sparsity in the categories, i.e. grouping of the categories, and sparsity in the variables, i.e. variable selection. We apply the penalized logistic regression method to our data. The important variables are selected, with close categories simultaneously grouped, by penalized regression models with and without the interactions terms. The models are validated with 10-fold cross-validation. The receiver operating characteristic curves of the penalized regression models dominate the receiver operating characteristic curve of naive logistic regressions, indicating a superior discriminative performance. © The Author(s) 2013.
[Use of data display screens and ocular hypertension in local public sector workers].
Abellán Torró, Rosana; Merelles Tormo, Antoni
2014-01-01
The main objective of this study is to examine the association between work with data display screens (DDS) and ocular hypertension (OHT). A cross-sectional study among local public sector workers (Diputación Provincial de Valencia). Data from 620 people were collected over 25 months, from periodic medical examinations performed at an occupational health unit. Intraocular pressure (IOP) was obtained with a portable puff tonometer validated for screening, establishing the cut-off point for OHT at 22 mmHg. Both biological characteristics and other work-related variables were taken into account as covariates. Descriptive statistics of the data were obtained, together with nonparametric tests with a level of significance of 95% and logistic regression with p 〈0.1 as the level of significance of the likelihood test. The average age of the study population is 52.8 years. The prevalence of OHT was 3.5% (5.1% among men and 1.2% among women; p=0.012). No significant associations were found between hours of DDS-related work and OHT were found (p=0.395). Logistic regression corroborated the association between gender and OHT, with women less affected (OR = 0.234; 95%CI: 0.068 - 0.799; p=0.020). In our study, no associations were found between time of exposure to data display screens and ocular hypertension. Logistic regression points to a certain association between ocular hypertension and gender, with men being more predisposed. Copyright belongs to the Societat Catalana de Salut Laboral.
Rupert, Michael G.; Cannon, Susan H.; Gartner, Joseph E.
2003-01-01
Logistic regression was used to predict the probability of debris flows occurring in areas recently burned by wildland fires. Multiple logistic regression is conceptually similar to multiple linear regression because statistical relations between one dependent variable and several independent variables are evaluated. In logistic regression, however, the dependent variable is transformed to a binary variable (debris flow did or did not occur), and the actual probability of the debris flow occurring is statistically modeled. Data from 399 basins located within 15 wildland fires that burned during 2000-2002 in Colorado, Idaho, Montana, and New Mexico were evaluated. More than 35 independent variables describing the burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated. The models were developed as follows: (1) Basins that did and did not produce debris flows were delineated from National Elevation Data using a Geographic Information System (GIS). (2) Data describing the burn severity, geology, land surface gradient, rainfall, and soil properties were determined for each basin. These data were then downloaded to a statistics software package for analysis using logistic regression. (3) Relations between the occurrence/non-occurrence of debris flows and burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated and several preliminary multivariate logistic regression models were constructed. All possible combinations of independent variables were evaluated to determine which combination produced the most effective model. The multivariate model that best predicted the occurrence of debris flows was selected. (4) The multivariate logistic regression model was entered into a GIS, and a map showing the probability of debris flows was constructed. The most effective model incorporates the percentage of each basin with slope greater than 30 percent, percentage of land burned at medium and high burn severity in each basin, particle size sorting, average storm intensity (millimeters per hour), soil organic matter content, soil permeability, and soil drainage. The results of this study demonstrate that logistic regression is a valuable tool for predicting the probability of debris flows occurring in recently-burned landscapes.
Ebrahimzadeh, Farzad; Hajizadeh, Ebrahim; Vahabi, Nasim; Almasian, Mohammad; Bakhteyar, Katayoon
2015-01-01
Background: Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. Methods: In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were selected by the stratified and cluster sampling; relevant variables were measured and for prediction of unwanted pregnancy, logistic regression, discriminant analysis, and probit regression models and SPSS software version 21 were used. To compare these models, indicators such as sensitivity, specificity, the area under the ROC curve, and the percentage of correct predictions were used. Results: The prevalence of unwanted pregnancies was 25.3%. The logistic and probit regression models indicated that parity and pregnancy spacing, contraceptive methods, household income and number of living male children were related to unwanted pregnancy. The performance of the models based on the area under the ROC curve was 0.735, 0.733, and 0.680 for logistic regression, probit regression, and linear discriminant analysis, respectively. Conclusion: Given the relatively high prevalence of unwanted pregnancies in Khorramabad, it seems necessary to revise family planning programs. Despite the similar accuracy of the models, if the researcher is interested in the interpretability of the results, the use of the logistic regression model is recommended. PMID:26793655
Ebrahimzadeh, Farzad; Hajizadeh, Ebrahim; Vahabi, Nasim; Almasian, Mohammad; Bakhteyar, Katayoon
2015-01-01
Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were selected by the stratified and cluster sampling; relevant variables were measured and for prediction of unwanted pregnancy, logistic regression, discriminant analysis, and probit regression models and SPSS software version 21 were used. To compare these models, indicators such as sensitivity, specificity, the area under the ROC curve, and the percentage of correct predictions were used. The prevalence of unwanted pregnancies was 25.3%. The logistic and probit regression models indicated that parity and pregnancy spacing, contraceptive methods, household income and number of living male children were related to unwanted pregnancy. The performance of the models based on the area under the ROC curve was 0.735, 0.733, and 0.680 for logistic regression, probit regression, and linear discriminant analysis, respectively. Given the relatively high prevalence of unwanted pregnancies in Khorramabad, it seems necessary to revise family planning programs. Despite the similar accuracy of the models, if the researcher is interested in the interpretability of the results, the use of the logistic regression model is recommended.
Estimating the exceedance probability of rain rate by logistic regression
NASA Technical Reports Server (NTRS)
Chiu, Long S.; Kedem, Benjamin
1990-01-01
Recent studies have shown that the fraction of an area with rain intensity above a fixed threshold is highly correlated with the area-averaged rain rate. To estimate the fractional rainy area, a logistic regression model, which estimates the conditional probability that rain rate over an area exceeds a fixed threshold given the values of related covariates, is developed. The problem of dependency in the data in the estimation procedure is bypassed by the method of partial likelihood. Analyses of simulated scanning multichannel microwave radiometer and observed electrically scanning microwave radiometer data during the Global Atlantic Tropical Experiment period show that the use of logistic regression in pixel classification is superior to multiple regression in predicting whether rain rate at each pixel exceeds a given threshold, even in the presence of noisy data. The potential of the logistic regression technique in satellite rain rate estimation is discussed.
NASA Astrophysics Data System (ADS)
Cary, Theodore W.; Cwanger, Alyssa; Venkatesh, Santosh S.; Conant, Emily F.; Sehgal, Chandra M.
2012-03-01
This study compares the performance of two proven but very different machine learners, Naïve Bayes and logistic regression, for differentiating malignant and benign breast masses using ultrasound imaging. Ultrasound images of 266 masses were analyzed quantitatively for shape, echogenicity, margin characteristics, and texture features. These features along with patient age, race, and mammographic BI-RADS category were used to train Naïve Bayes and logistic regression classifiers to diagnose lesions as malignant or benign. ROC analysis was performed using all of the features and using only a subset that maximized information gain. Performance was determined by the area under the ROC curve, Az, obtained from leave-one-out cross validation. Naïve Bayes showed significant variation (Az 0.733 +/- 0.035 to 0.840 +/- 0.029, P < 0.002) with the choice of features, but the performance of logistic regression was relatively unchanged under feature selection (Az 0.839 +/- 0.029 to 0.859 +/- 0.028, P = 0.605). Out of 34 features, a subset of 6 gave the highest information gain: brightness difference, margin sharpness, depth-to-width, mammographic BI-RADs, age, and race. The probabilities of malignancy determined by Naïve Bayes and logistic regression after feature selection showed significant correlation (R2= 0.87, P < 0.0001). The diagnostic performance of Naïve Bayes and logistic regression can be comparable, but logistic regression is more robust. Since probability of malignancy cannot be measured directly, high correlation between the probabilities derived from two basic but dissimilar models increases confidence in the predictive power of machine learning models for characterizing solid breast masses on ultrasound.
Wang, Qingliang; Li, Xiaojie; Hu, Kunpeng; Zhao, Kun; Yang, Peisheng; Liu, Bo
2015-05-12
To explore the risk factors of portal hypertensive gastropathy (PHG) in patients with hepatitis B associated cirrhosis and establish a Logistic regression model of noninvasive prediction. The clinical data of 234 hospitalized patients with hepatitis B associated cirrhosis from March 2012 to March 2014 were analyzed retrospectively. The dependent variable was the occurrence of PHG while the independent variables were screened by binary Logistic analysis. Multivariate Logistic regression was used for further analysis of significant noninvasive independent variables. Logistic regression model was established and odds ratio was calculated for each factor. The accuracy, sensitivity and specificity of model were evaluated by the curve of receiver operating characteristic (ROC). According to univariate Logistic regression, the risk factors included hepatic dysfunction, albumin (ALB), bilirubin (TB), prothrombin time (PT), platelet (PLT), white blood cell (WBC), portal vein diameter, spleen index, splenic vein diameter, diameter ratio, PLT to spleen volume ratio, esophageal varices (EV) and gastric varices (GV). Multivariate analysis showed that hepatic dysfunction (X1), TB (X2), PLT (X3) and splenic vein diameter (X4) were the major occurring factors for PHG. The established regression model was Logit P=-2.667+2.186X1-2.167X2+0.725X3+0.976X4. The accuracy of model for PHG was 79.1% with a sensitivity of 77.2% and a specificity of 80.8%. Hepatic dysfunction, TB, PLT and splenic vein diameter are risk factors for PHG and the noninvasive predicted Logistic regression model was Logit P=-2.667+2.186X1-2.167X2+0.725X3+0.976X4.
McEgan, Rachel; Mootian, Gabriel; Goodridge, Lawrence D; Schaffner, Donald W; Danyluk, Michelle D
2013-07-01
Coliforms, Escherichia coli, and various physicochemical water characteristics have been suggested as indicators of microbial water quality or index organisms for pathogen populations. The relationship between the presence and/or concentration of Salmonella and biological, physical, or chemical indicators in Central Florida surface water samples over 12 consecutive months was explored. Samples were taken monthly for 12 months from 18 locations throughout Central Florida (n = 202). Air and water temperature, pH, oxidation-reduction potential (ORP), turbidity, and conductivity were measured. Weather data were obtained from nearby weather stations. Aerobic plate counts and most probable numbers (MPN) for Salmonella, E. coli, and coliforms were performed. Weak linear relationships existed between biological indicators (E. coli/coliforms) and Salmonella levels (R(2) < 0.1) and between physicochemical indicators and Salmonella levels (R(2) < 0.1). The average rainfall (previous day, week, and month) before sampling did not correlate well with bacterial levels. Logistic regression analysis showed that E. coli concentration can predict the probability of enumerating selected Salmonella levels. The lack of good correlations between biological indicators and Salmonella levels and between physicochemical indicators and Salmonella levels shows that the relationship between pathogens and indicators is complex. However, Escherichia coli provides a reasonable way to predict Salmonella levels in Central Florida surface water through logistic regression.
McEgan, Rachel; Mootian, Gabriel; Goodridge, Lawrence D.; Schaffner, Donald W.
2013-01-01
Coliforms, Escherichia coli, and various physicochemical water characteristics have been suggested as indicators of microbial water quality or index organisms for pathogen populations. The relationship between the presence and/or concentration of Salmonella and biological, physical, or chemical indicators in Central Florida surface water samples over 12 consecutive months was explored. Samples were taken monthly for 12 months from 18 locations throughout Central Florida (n = 202). Air and water temperature, pH, oxidation-reduction potential (ORP), turbidity, and conductivity were measured. Weather data were obtained from nearby weather stations. Aerobic plate counts and most probable numbers (MPN) for Salmonella, E. coli, and coliforms were performed. Weak linear relationships existed between biological indicators (E. coli/coliforms) and Salmonella levels (R2 < 0.1) and between physicochemical indicators and Salmonella levels (R2 < 0.1). The average rainfall (previous day, week, and month) before sampling did not correlate well with bacterial levels. Logistic regression analysis showed that E. coli concentration can predict the probability of enumerating selected Salmonella levels. The lack of good correlations between biological indicators and Salmonella levels and between physicochemical indicators and Salmonella levels shows that the relationship between pathogens and indicators is complex. However, Escherichia coli provides a reasonable way to predict Salmonella levels in Central Florida surface water through logistic regression. PMID:23624476
Variable Selection in Logistic Regression.
1987-06-01
23 %. AUTIOR(.) S. CONTRACT OR GRANT NUMBE Rf.i %Z. D. Bai, P. R. Krishnaiah and . C. Zhao F49620-85- C-0008 " PERFORMING ORGANIZATION NAME AND AOORESS...d I7 IOK-TK- d 7 -I0 7’ VARIABLE SELECTION IN LOGISTIC REGRESSION Z. D. Bai, P. R. Krishnaiah and L. C. Zhao Center for Multivariate Analysis...University of Pittsburgh Center for Multivariate Analysis University of Pittsburgh Y !I VARIABLE SELECTION IN LOGISTIC REGRESSION Z- 0. Bai, P. R. Krishnaiah
NASA Astrophysics Data System (ADS)
Madhu, B.; Ashok, N. C.; Balasubramanian, S.
2014-11-01
Multinomial logistic regression analysis was used to develop statistical model that can predict the probability of breast cancer in Southern Karnataka using the breast cancer occurrence data during 2007-2011. Independent socio-economic variables describing the breast cancer occurrence like age, education, occupation, parity, type of family, health insurance coverage, residential locality and socioeconomic status of each case was obtained. The models were developed as follows: i) Spatial visualization of the Urban- rural distribution of breast cancer cases that were obtained from the Bharat Hospital and Institute of Oncology. ii) Socio-economic risk factors describing the breast cancer occurrences were complied for each case. These data were then analysed using multinomial logistic regression analysis in a SPSS statistical software and relations between the occurrence of breast cancer across the socio-economic status and the influence of other socio-economic variables were evaluated and multinomial logistic regression models were constructed. iii) the model that best predicted the occurrence of breast cancer were identified. This multivariate logistic regression model has been entered into a geographic information system and maps showing the predicted probability of breast cancer occurrence in Southern Karnataka was created. This study demonstrates that Multinomial logistic regression is a valuable tool for developing models that predict the probability of breast cancer Occurrence in Southern Karnataka.
Parsaeian, M; Mohammad, K; Mahmoudi, M; Zeraati, H
2012-01-01
Background: The purpose of this investigation was to compare empirically predictive ability of an artificial neural network with a logistic regression in prediction of low back pain. Methods: Data from the second national health survey were considered in this investigation. This data includes the information of low back pain and its associated risk factors among Iranian people aged 15 years and older. Artificial neural network and logistic regression models were developed using a set of 17294 data and they were validated in a test set of 17295 data. Hosmer and Lemeshow recommendation for model selection was used in fitting the logistic regression. A three-layer perceptron with 9 inputs, 3 hidden and 1 output neurons was employed. The efficiency of two models was compared by receiver operating characteristic analysis, root mean square and -2 Loglikelihood criteria. Results: The area under the ROC curve (SE), root mean square and -2Loglikelihood of the logistic regression was 0.752 (0.004), 0.3832 and 14769.2, respectively. The area under the ROC curve (SE), root mean square and -2Loglikelihood of the artificial neural network was 0.754 (0.004), 0.3770 and 14757.6, respectively. Conclusions: Based on these three criteria, artificial neural network would give better performance than logistic regression. Although, the difference is statistically significant, it does not seem to be clinically significant. PMID:23113198
Parsaeian, M; Mohammad, K; Mahmoudi, M; Zeraati, H
2012-01-01
The purpose of this investigation was to compare empirically predictive ability of an artificial neural network with a logistic regression in prediction of low back pain. Data from the second national health survey were considered in this investigation. This data includes the information of low back pain and its associated risk factors among Iranian people aged 15 years and older. Artificial neural network and logistic regression models were developed using a set of 17294 data and they were validated in a test set of 17295 data. Hosmer and Lemeshow recommendation for model selection was used in fitting the logistic regression. A three-layer perceptron with 9 inputs, 3 hidden and 1 output neurons was employed. The efficiency of two models was compared by receiver operating characteristic analysis, root mean square and -2 Loglikelihood criteria. The area under the ROC curve (SE), root mean square and -2Loglikelihood of the logistic regression was 0.752 (0.004), 0.3832 and 14769.2, respectively. The area under the ROC curve (SE), root mean square and -2Loglikelihood of the artificial neural network was 0.754 (0.004), 0.3770 and 14757.6, respectively. Based on these three criteria, artificial neural network would give better performance than logistic regression. Although, the difference is statistically significant, it does not seem to be clinically significant.
NASA Astrophysics Data System (ADS)
Kamaruddin, Ainur Amira; Ali, Zalila; Noor, Norlida Mohd.; Baharum, Adam; Ahmad, Wan Muhamad Amir W.
2014-07-01
Logistic regression analysis examines the influence of various factors on a dichotomous outcome by estimating the probability of the event's occurrence. Logistic regression, also called a logit model, is a statistical procedure used to model dichotomous outcomes. In the logit model the log odds of the dichotomous outcome is modeled as a linear combination of the predictor variables. The log odds ratio in logistic regression provides a description of the probabilistic relationship of the variables and the outcome. In conducting logistic regression, selection procedures are used in selecting important predictor variables, diagnostics are used to check that assumptions are valid which include independence of errors, linearity in the logit for continuous variables, absence of multicollinearity, and lack of strongly influential outliers and a test statistic is calculated to determine the aptness of the model. This study used the binary logistic regression model to investigate overweight and obesity among rural secondary school students on the basis of their demographics profile, medical history, diet and lifestyle. The results indicate that overweight and obesity of students are influenced by obesity in family and the interaction between a student's ethnicity and routine meals intake. The odds of a student being overweight and obese are higher for a student having a family history of obesity and for a non-Malay student who frequently takes routine meals as compared to a Malay student.
Understanding logistic regression analysis.
Sperandei, Sandro
2014-01-01
Logistic regression is used to obtain odds ratio in the presence of more than one explanatory variable. The procedure is quite similar to multiple linear regression, with the exception that the response variable is binomial. The result is the impact of each variable on the odds ratio of the observed event of interest. The main advantage is to avoid confounding effects by analyzing the association of all variables together. In this article, we explain the logistic regression procedure using examples to make it as simple as possible. After definition of the technique, the basic interpretation of the results is highlighted and then some special issues are discussed.
Contributions of sociodemographic factors to criminal behavior
Mundia, Lawrence; Matzin, Rohani; Mahalle, Salwa; Hamid, Malai Hayati; Osman, Ratna Suriani
2016-01-01
We explored the extent to which prisoner sociodemographic variables (age, education, marital status, employment, and whether their parents were married or not) influenced offending in 64 randomly selected Brunei inmates, comprising both sexes. A quantitative field survey design ideal for the type of participants used in a prison context was employed to investigate the problem. Hierarchical multiple regression analysis with backward elimination identified prisoner marital status and age groups as significantly related to offending. Furthermore, hierarchical multinomial logistic regression analysis with backward elimination indicated that prisoners’ age, primary level education, marital status, employment status, and parental marital status as significantly related to stealing offenses with high odds ratios. All 29 nonrecidivists were false negatives and predicted to reoffend upon release. Similarly, all 33 recidivists were projected to reoffend after release. Hierarchical binary logistic regression analysis revealed age groups (24–29 years and 30–35 years), employed prisoner, and primary level education as variables with high likelihood trends for reoffending. The results suggested that prisoner interventions (educational, counseling, and psychotherapy) in Brunei should treat not only antisocial personality, psychopathy, and mental health problems but also sociodemographic factors. The study generated offending patterns, trends, and norms that may inform subsequent investigations on Brunei prisoners. PMID:27382342
Serum Irisin Predicts Mortality Risk in Acute Heart Failure Patients.
Shen, Shutong; Gao, Rongrong; Bei, Yihua; Li, Jin; Zhang, Haifeng; Zhou, Yanli; Yao, Wenming; Xu, Dongjie; Zhou, Fang; Jin, Mengchao; Wei, Siqi; Wang, Kai; Xu, Xuejuan; Li, Yongqin; Xiao, Junjie; Li, Xinli
2017-01-01
Irisin is a peptide hormone cleaved from a plasma membrane protein fibronectin type III domain containing protein 5 (FNDC5). Emerging studies have indicated association between serum irisin and many major chronic diseases including cardiovascular diseases. However, the role of serum irisin as a predictor for mortality risk in acute heart failure (AHF) patients is not clear. AHF patients were enrolled and serum was collected at the admission and all patients were followed up for 1 year. Enzyme-linked immunosorbent assay was used to measure serum irisin levels. To explore predictors for AHF mortality, the univariate and multivariate logistic regression analysis, and receiver-operator characteristic (ROC) curve analysis were used. To determine the role of serum irisin levels in predicting survival, Kaplan-Meier survival analysis was used. In this study, 161 AHF patients were enrolled and serum irisin level was found to be significantly higher in patients deceased in 1-year follow-up. The univariate logistic regression analysis identified 18 variables associated with all-cause mortality in AHF patients, while the multivariate logistic regression analysis identified 2 variables namely blood urea nitrogen and serum irisin. ROC curve analysis indicated that blood urea nitrogen and the most commonly used biomarker, NT-pro-BNP, displayed poor prognostic value for AHF (AUCs ≤ 0.700) compared to serum irisin (AUC = 0.753). Kaplan-Meier survival analysis demonstrated that AHF patients with higher serum irisin had significantly higher mortality (P<0.001). Collectively, our study identified serum irisin as a predictive biomarker for 1-year all-cause mortality in AHF patients though large multicenter studies are highly needed. © 2017 The Author(s). Published by S. Karger AG, Basel.
Katić, Mašenjka; Pirsl, Filip; Steinberg, Seth M.; Dobbin, Marnie; Curtis, Lauren M.; Pulanić, Dražen; Desnica, Lana; Titarenko, Irina; Pavletic, Steven Z.
2016-01-01
Aim To identify the factors associated with vitamin D status in patients with chronic graft-vs-host disease (cGVHD) and evaluate the association between serum vitamin D (25(OH)D) levels and cGVHD characteristics and clinical outcomes defined by the National Institutes of Health (NIH) criteria. Methods 310 cGVHD patients enrolled in the NIH cGVHD natural history study (clinicaltrials.gov: NCT00092235) were analyzed. Univariate analysis and multiple logistic regression were used to determine the associations between various parameters and 25(OH)D levels, dichotomized into categorical variables: ≤20 and >20 ng/mL, and as a continuous parameter. Multiple logistic regression was used to develop a predictive model for low vitamin D. Survival analysis and association between cGVHD outcomes and 25(OH)D as a continuous as well as categorical variable: ≤20 and >20 ng/mL; <50 and ≥50 ng/mL, and among three ordered categories: ≤20, 20-50, and ≥50 ng/mL, was performed. PMID:27374829
ERIC Educational Resources Information Center
Koon, Sharon; Petscher, Yaacov
2015-01-01
The purpose of this report was to explicate the use of logistic regression and classification and regression tree (CART) analysis in the development of early warning systems. It was motivated by state education leaders' interest in maintaining high classification accuracy while simultaneously improving practitioner understanding of the rules by…
A Multilevel Assessment of Differential Item Functioning.
ERIC Educational Resources Information Center
Shen, Linjun
A multilevel approach was proposed for the assessment of differential item functioning and compared with the traditional logistic regression approach. Data from the Comprehensive Osteopathic Medical Licensing Examination for 2,300 freshman osteopathic medical students were analyzed. The multilevel approach used three-level hierarchical generalized…
Eshkoor, Sima Ataollahi; Hamid, Tengku Aizan; Nudin, Siti Sa'adiah Hassan; Mun, Chan Yoke
2013-06-01
This study aimed to identify the effects of sleep quality, physical activity, environmental quality, age, ethnicity, sex differences, marital status, and educational level on the risk of falls in the elderly individuals with dementia. Data were derived from a group of 1210 Malaysian elderly individuals who were noninstitutionalized and demented. The multiple logistic regression model was applied to estimate the risk of falls in respondents. Approximately the prevalence of falls was 17% among the individuals. The results of multiple logistic regression analysis revealed that age (odds ratio [OR] = 1.03), ethnicity (OR = 1.76), sleep quality (OR = 1.46), and environmental quality (OR = 0.62) significantly affected the risk of falls in individuals (P < .05). Furthermore, sex differences, marital status, educational level, and physical activity were not significant predictors of falls in samples (P > .05). It was found that age, ethnic non-Malay, and sleep disruption increased the risk of falls in respondents, but high environmental quality reduced the risk of falls.
2017-03-23
PUBLIC RELEASE; DISTRIBUTION UNLIMITED Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and... Cost and Probability of Cost and Schedule Overrun for Program Managers Ryan C. Trudelle Follow this and additional works at: https://scholar.afit.edu...afit.edu. Recommended Citation Trudelle, Ryan C., "Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and
2013-11-01
Ptrend 0.78 0.62 0.75 Unconditional logistic regression was used to estimate odds ratios (OR) and 95 % confidence intervals (CI) for risk of node...Ptrend 0.71 0.67 Unconditional logistic regression was used to estimate odds ratios (OR) and 95 % confidence intervals (CI) for risk of high-grade tumors... logistic regression was used to estimate odds ratios (OR) and 95 % confidence intervals (CI) for the associations between each of the seven SNPs and
Kim, Sun Mi; Kim, Yongdai; Jeong, Kuhwan; Jeong, Heeyeong; Kim, Jiyoung
2018-01-01
The aim of this study was to compare the performance of image analysis for predicting breast cancer using two distinct regression models and to evaluate the usefulness of incorporating clinical and demographic data (CDD) into the image analysis in order to improve the diagnosis of breast cancer. This study included 139 solid masses from 139 patients who underwent a ultrasonography-guided core biopsy and had available CDD between June 2009 and April 2010. Three breast radiologists retrospectively reviewed 139 breast masses and described each lesion using the Breast Imaging Reporting and Data System (BI-RADS) lexicon. We applied and compared two regression methods-stepwise logistic (SL) regression and logistic least absolute shrinkage and selection operator (LASSO) regression-in which the BI-RADS descriptors and CDD were used as covariates. We investigated the performances of these regression methods and the agreement of radiologists in terms of test misclassification error and the area under the curve (AUC) of the tests. Logistic LASSO regression was superior (P<0.05) to SL regression, regardless of whether CDD was included in the covariates, in terms of test misclassification errors (0.234 vs. 0.253, without CDD; 0.196 vs. 0.258, with CDD) and AUC (0.785 vs. 0.759, without CDD; 0.873 vs. 0.735, with CDD). However, it was inferior (P<0.05) to the agreement of three radiologists in terms of test misclassification errors (0.234 vs. 0.168, without CDD; 0.196 vs. 0.088, with CDD) and the AUC without CDD (0.785 vs. 0.844, P<0.001), but was comparable to the AUC with CDD (0.873 vs. 0.880, P=0.141). Logistic LASSO regression based on BI-RADS descriptors and CDD showed better performance than SL in predicting the presence of breast cancer. The use of CDD as a supplement to the BI-RADS descriptors significantly improved the prediction of breast cancer using logistic LASSO regression.
Yu, Yuanyuan; Li, Hongkai; Sun, Xiaoru; Su, Ping; Wang, Tingting; Liu, Yi; Yuan, Zhongshang; Liu, Yanxun; Xue, Fuzhong
2017-12-28
Confounders can produce spurious associations between exposure and outcome in observational studies. For majority of epidemiologists, adjusting for confounders using logistic regression model is their habitual method, though it has some problems in accuracy and precision. It is, therefore, important to highlight the problems of logistic regression and search the alternative method. Four causal diagram models were defined to summarize confounding equivalence. Both theoretical proofs and simulation studies were performed to verify whether conditioning on different confounding equivalence sets had the same bias-reducing potential and then to select the optimum adjusting strategy, in which logistic regression model and inverse probability weighting based marginal structural model (IPW-based-MSM) were compared. The "do-calculus" was used to calculate the true causal effect of exposure on outcome, then the bias and standard error were used to evaluate the performances of different strategies. Adjusting for different sets of confounding equivalence, as judged by identical Markov boundaries, produced different bias-reducing potential in the logistic regression model. For the sets satisfied G-admissibility, adjusting for the set including all the confounders reduced the equivalent bias to the one containing the parent nodes of the outcome, while the bias after adjusting for the parent nodes of exposure was not equivalent to them. In addition, all causal effect estimations through logistic regression were biased, although the estimation after adjusting for the parent nodes of exposure was nearest to the true causal effect. However, conditioning on different confounding equivalence sets had the same bias-reducing potential under IPW-based-MSM. Compared with logistic regression, the IPW-based-MSM could obtain unbiased causal effect estimation when the adjusted confounders satisfied G-admissibility and the optimal strategy was to adjust for the parent nodes of outcome, which obtained the highest precision. All adjustment strategies through logistic regression were biased for causal effect estimation, while IPW-based-MSM could always obtain unbiased estimation when the adjusted set satisfied G-admissibility. Thus, IPW-based-MSM was recommended to adjust for confounders set.
Use and interpretation of logistic regression in habitat-selection studies
Keating, Kim A.; Cherry, Steve
2004-01-01
Logistic regression is an important tool for wildlife habitat-selection studies, but the method frequently has been misapplied due to an inadequate understanding of the logistic model, its interpretation, and the influence of sampling design. To promote better use of this method, we review its application and interpretation under 3 sampling designs: random, case-control, and use-availability. Logistic regression is appropriate for habitat use-nonuse studies employing random sampling and can be used to directly model the conditional probability of use in such cases. Logistic regression also is appropriate for studies employing case-control sampling designs, but careful attention is required to interpret results correctly. Unless bias can be estimated or probability of use is small for all habitats, results of case-control studies should be interpreted as odds ratios, rather than probability of use or relative probability of use. When data are gathered under a use-availability design, logistic regression can be used to estimate approximate odds ratios if probability of use is small, at least on average. More generally, however, logistic regression is inappropriate for modeling habitat selection in use-availability studies. In particular, using logistic regression to fit the exponential model of Manly et al. (2002:100) does not guarantee maximum-likelihood estimates, valid probabilities, or valid likelihoods. We show that the resource selection function (RSF) commonly used for the exponential model is proportional to a logistic discriminant function. Thus, it may be used to rank habitats with respect to probability of use and to identify important habitat characteristics or their surrogates, but it is not guaranteed to be proportional to probability of use. Other problems associated with the exponential model also are discussed. We describe an alternative model based on Lancaster and Imbens (1996) that offers a method for estimating conditional probability of use in use-availability studies. Although promising, this model fails to converge to a unique solution in some important situations. Further work is needed to obtain a robust method that is broadly applicable to use-availability studies.
Relation between serum creatinine and postoperative results of open-heart surgery.
Ezeldin, Tamer H
2013-10-01
To determine the impact of preoperative serum creatinine level in non-dialyzable patients on postoperative morbidity and mortality. This is a prospective study, where serum creatinine was used to give primary assessment on renal function status preoperatively. This study includes 1,033 patients, who underwent coronary artery bypass grafting, or valve(s) operations. The study took place at Al-Hada Military Hospital, Taif, Kingdom of Saudi between May 2008 and January 2012. Data were statistically analyzed using Chi square (x2) test and multivariable logistic regression, to evaluate the postoperative morbidity and mortality risks associated with low serum creatinine levels. Postoperative mortality increased with high serum creatinine level >1.8 mg/dL (p=0.0005). Multivariable logistic regression, adjusting for potentially confounding variables demonstrated that a creatinine level of more than 1.8 mg/dL was associated with increased risk of re-operation for bleeding, postoperative renal failure, prolonged ventilatory support, ICU stay, and total hospital stay. Perioperative serum creatinine is strongly related to post operative morbidity and mortality in open heart surgery. High serum creatinine in non-dialyzable patients can predict the increased morbidity and mortality after cardiac operations.
Vitamin D and Male Sexual Function: A Transversal and Longitudinal Study.
Tirabassi, Giacomo; Sudano, Maurizio; Salvio, Gianmaria; Cutini, Melissa; Muscogiuri, Giovanna; Corona, Giovanni; Balercia, Giancarlo
2018-01-01
The effects of vitamin D on sexual function are very unclear. Therefore, we aimed at evaluating the possible association between vitamin D and sexual function and at assessing the influence of vitamin D administration on sexual function. We retrospectively studied 114 men by evaluating clinical, biochemical, and sexual parameters. A subsample ( n = 41) was also studied longitudinally before and after vitamin D replacement therapy. In the whole sample, after performing logistic regression models, higher levels of 25(OH) vitamin D were significantly associated with high values of total testosterone and of all the International Index of Erectile Function (IIEF) questionnaire parameters. On the other hand, higher levels of total testosterone were positively and significantly associated with high levels of erectile function and IIEF total score. After vitamin D replacement therapy, total and free testosterone increased and erectile function improved, whereas other sexual parameters did not change significantly. At logistic regression analysis, higher levels of vitamin D increase (Δ-) were significantly associated with high values of Δ-erectile function after adjustment for Δ-testosterone. Vitamin D is important for the wellness of male sexual function, and vitamin D administration improves sexual function.
Sirichotiratana, Nithat; Yogi, Subash; Prutipinyo, Chardsumon
2013-08-30
This study was conducted during February-March 2012 to determine the perception and support regarding smoke-free policy among tourists at Suvarnabhumi International Airport, Bangkok, Thailand. In this cross-sectional study, 200 tourists (n = 200) were enrolled by convenience sampling and interviewed by structured questionnaire. Descriptive statistics, chi-square, and multinomial logistic regression were adopted in the study. Results revealed that half (50%) of the tourists were current smokers and 55% had visited Thailand twice or more. Three quarter (76%) of tourists indicated that they would visit Thailand again even if it had a 100% smoke-free regulation. Almost all (99%) of the tourists had supported for the smoke-free policy (partial ban and total ban), and current smokers had higher percentage of support than non-smokers. Two factors, current smoking status and knowledge level, were significantly associated with perception level. After analysis with Multinomial Logistic Regression, it was found that perception, country group, and presence of designated smoking room (DSR) were associated with smoke-free policy. Recommendation is that, at institution level effective monitoring system is needed at the airport. At policy level, the recommendation is that effective comprehensive policy needed to be emphasized to ensure smoke-free airport environment.
[Associations between dormitory environment/other factors and sleep quality of medical students].
Zheng, Bang; Wang, Kailu; Pan, Ziqi; Li, Man; Pan, Yuting; Liu, Ting; Xu, Dan; Lyu, Jun
2016-03-01
To investigate the sleep quality and related factors among medical students in China, understand the association between dormitory environment and sleep quality, and provide evidence and recommendations for sleep hygiene intervention. A total of 555 undergraduate students were selected from a medical school of an university in Beijing through stratified-cluster random-sampling to conduct a questionnaire survey by using Chinese version of Pittsburgh Sleep Quality Index (PSQI) and self-designed questionnaire. Analyses were performed by using multiple logistic regression model as well as multilevel linear regression model. The prevalence of sleep disorder was 29.1%(149/512), and 39.1%(200/512) of the students reported that the sleep quality was influenced by dormitory environment. PSQI score was negatively correlated with self-reported rating of dormitory environment (γs=-0.310, P<0.001). Logistic regression analysis showed the related factors of sleep disorder included grade, sleep regularity, self-rated health status, pressures of school work and employment, as well as dormitory environment. RESULTS of multilevel regression analysis also indicated that perception on dormitory environment (individual level) was associated with sleep quality with the dormitory level random effects under control (b=-0.619, P<0.001). The prevalence of sleep disorder was high in medical students, which was associated with multiple factors. Dormitory environment should be taken into consideration when the interventions are taken to improve the sleep quality of students.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dean, Jamie A., E-mail: jamie.dean@icr.ac.uk; Wong, Kee H.; Gay, Hiram
Purpose: Current normal tissue complication probability modeling using logistic regression suffers from bias and high uncertainty in the presence of highly correlated radiation therapy (RT) dose data. This hinders robust estimates of dose-response associations and, hence, optimal normal tissue–sparing strategies from being elucidated. Using functional data analysis (FDA) to reduce the dimensionality of the dose data could overcome this limitation. Methods and Materials: FDA was applied to modeling of severe acute mucositis and dysphagia resulting from head and neck RT. Functional partial least squares regression (FPLS) and functional principal component analysis were used for dimensionality reduction of the dose-volume histogrammore » data. The reduced dose data were input into functional logistic regression models (functional partial least squares–logistic regression [FPLS-LR] and functional principal component–logistic regression [FPC-LR]) along with clinical data. This approach was compared with penalized logistic regression (PLR) in terms of predictive performance and the significance of treatment covariate–response associations, assessed using bootstrapping. Results: The area under the receiver operating characteristic curve for the PLR, FPC-LR, and FPLS-LR models was 0.65, 0.69, and 0.67, respectively, for mucositis (internal validation) and 0.81, 0.83, and 0.83, respectively, for dysphagia (external validation). The calibration slopes/intercepts for the PLR, FPC-LR, and FPLS-LR models were 1.6/−0.67, 0.45/0.47, and 0.40/0.49, respectively, for mucositis (internal validation) and 2.5/−0.96, 0.79/−0.04, and 0.79/0.00, respectively, for dysphagia (external validation). The bootstrapped odds ratios indicated significant associations between RT dose and severe toxicity in the mucositis and dysphagia FDA models. Cisplatin was significantly associated with severe dysphagia in the FDA models. None of the covariates was significantly associated with severe toxicity in the PLR models. Dose levels greater than approximately 1.0 Gy/fraction were most strongly associated with severe acute mucositis and dysphagia in the FDA models. Conclusions: FPLS and functional principal component analysis marginally improved predictive performance compared with PLR and provided robust dose-response associations. FDA is recommended for use in normal tissue complication probability modeling.« less
Dean, Jamie A; Wong, Kee H; Gay, Hiram; Welsh, Liam C; Jones, Ann-Britt; Schick, Ulrike; Oh, Jung Hun; Apte, Aditya; Newbold, Kate L; Bhide, Shreerang A; Harrington, Kevin J; Deasy, Joseph O; Nutting, Christopher M; Gulliford, Sarah L
2016-11-15
Current normal tissue complication probability modeling using logistic regression suffers from bias and high uncertainty in the presence of highly correlated radiation therapy (RT) dose data. This hinders robust estimates of dose-response associations and, hence, optimal normal tissue-sparing strategies from being elucidated. Using functional data analysis (FDA) to reduce the dimensionality of the dose data could overcome this limitation. FDA was applied to modeling of severe acute mucositis and dysphagia resulting from head and neck RT. Functional partial least squares regression (FPLS) and functional principal component analysis were used for dimensionality reduction of the dose-volume histogram data. The reduced dose data were input into functional logistic regression models (functional partial least squares-logistic regression [FPLS-LR] and functional principal component-logistic regression [FPC-LR]) along with clinical data. This approach was compared with penalized logistic regression (PLR) in terms of predictive performance and the significance of treatment covariate-response associations, assessed using bootstrapping. The area under the receiver operating characteristic curve for the PLR, FPC-LR, and FPLS-LR models was 0.65, 0.69, and 0.67, respectively, for mucositis (internal validation) and 0.81, 0.83, and 0.83, respectively, for dysphagia (external validation). The calibration slopes/intercepts for the PLR, FPC-LR, and FPLS-LR models were 1.6/-0.67, 0.45/0.47, and 0.40/0.49, respectively, for mucositis (internal validation) and 2.5/-0.96, 0.79/-0.04, and 0.79/0.00, respectively, for dysphagia (external validation). The bootstrapped odds ratios indicated significant associations between RT dose and severe toxicity in the mucositis and dysphagia FDA models. Cisplatin was significantly associated with severe dysphagia in the FDA models. None of the covariates was significantly associated with severe toxicity in the PLR models. Dose levels greater than approximately 1.0 Gy/fraction were most strongly associated with severe acute mucositis and dysphagia in the FDA models. FPLS and functional principal component analysis marginally improved predictive performance compared with PLR and provided robust dose-response associations. FDA is recommended for use in normal tissue complication probability modeling. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.
Zhong, P; Sun, D M; Wu, D H; Li, T M; Liu, X Y; Liu, H Y
2017-01-26
We evaluated serum total bilirubin levels as a predictor for metabolic syndrome (MetS) and investigated the relationship between serum total bilirubin levels and MetS prevalence. This cross-sectional study included 1728 participants over 65 years of age from Eastern China. Anthropometric data, lifestyle information, and previous medical history were collected. We then measured serum levels of fasting blood-glucose, total cholesterol, triglycerides, and total bilirubin, as well as alanine aminotransferase activity. The prevalence of MetS and each of its individual component were calculated per quartile of total bilirubin level. Logistic regression was used to assess the correlation between serum total bilirubin levels and MetS. Total bilirubin level in the women who did not have MetS was significantly higher than in those who had MetS (P<0.001). Serum total bilirubin quartiles were linearly and negatively correlated with MetS prevalence and hypertriglyceridemia (HTG) in females (P<0.005). Logistic regression showed that serum total bilirubin was an independent predictor of MetS for females (OR: 0.910, 95%CI: 0.863-0.960; P=0.001). The present study suggests that physiological levels of serum total bilirubin might be an independent risk factor for aged Chinese women, and the prevalence of MetS and HTG are negatively correlated to serum total bilirubin levels.
Logistic regression models of factors influencing the location of bioenergy and biofuels plants
T.M. Young; R.L. Zaretzki; J.H. Perdue; F.M. Guess; X. Liu
2011-01-01
Logistic regression models were developed to identify significant factors that influence the location of existing wood-using bioenergy/biofuels plants and traditional wood-using facilities. Logistic models provided quantitative insight for variables influencing the location of woody biomass-using facilities. Availability of "thinnings to a basal area of 31.7m2/ha...
Discrete post-processing of total cloud cover ensemble forecasts
NASA Astrophysics Data System (ADS)
Hemri, Stephan; Haiden, Thomas; Pappenberger, Florian
2017-04-01
This contribution presents an approach to post-process ensemble forecasts for the discrete and bounded weather variable of total cloud cover. Two methods for discrete statistical post-processing of ensemble predictions are tested. The first approach is based on multinomial logistic regression, the second involves a proportional odds logistic regression model. Applying them to total cloud cover raw ensemble forecasts from the European Centre for Medium-Range Weather Forecasts improves forecast skill significantly. Based on station-wise post-processing of raw ensemble total cloud cover forecasts for a global set of 3330 stations over the period from 2007 to early 2014, the more parsimonious proportional odds logistic regression model proved to slightly outperform the multinomial logistic regression model. Reference Hemri, S., Haiden, T., & Pappenberger, F. (2016). Discrete post-processing of total cloud cover ensemble forecasts. Monthly Weather Review 144, 2565-2577.
Fuzzy multinomial logistic regression analysis: A multi-objective programming approach
NASA Astrophysics Data System (ADS)
Abdalla, Hesham A.; El-Sayed, Amany A.; Hamed, Ramadan
2017-05-01
Parameter estimation for multinomial logistic regression is usually based on maximizing the likelihood function. For large well-balanced datasets, Maximum Likelihood (ML) estimation is a satisfactory approach. Unfortunately, ML can fail completely or at least produce poor results in terms of estimated probabilities and confidence intervals of parameters, specially for small datasets. In this study, a new approach based on fuzzy concepts is proposed to estimate parameters of the multinomial logistic regression. The study assumes that the parameters of multinomial logistic regression are fuzzy. Based on the extension principle stated by Zadeh and Bárdossy's proposition, a multi-objective programming approach is suggested to estimate these fuzzy parameters. A simulation study is used to evaluate the performance of the new approach versus Maximum likelihood (ML) approach. Results show that the new proposed model outperforms ML in cases of small datasets.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Wu, M.-M.; Graduate Institute of Medicine, College of Medicine, Fu-Jen Catholic University, Taipei, Taiwan; Chiou, H.-Y.
2006-10-01
Arsenic-contaminated well water has been shown to increase the risk of atherosclerosis. Because of involving S-adenosylmethionine, homocysteine may modify the risk by interfering with the biomethylation of ingested arsenic. In this study, we assessed the effect of plasma homocysteine level and urinary monomethylarsonic acid (MMA{sup V}) on the risk of atherosclerosis associated with arsenic. In total, 163 patients with carotid atherosclerosis and 163 controls were studied. Lifetime cumulative arsenic exposure from well water for study subjects was measured as index of arsenic exposure. Homocysteine level was determined by high-performance liquid chromatography (HPLC). Proportion of MMA{sup V} (MMA%) was calculated bymore » dividing with total arsenic species in urine, including arsenite, arsenate, MMA{sup V}, and dimethylarsinic acid (DMA{sup V}). Results of multiple linear regression analysis show a positive correlation of plasma homocysteine levels to the cumulative arsenic exposure after controlling for atherosclerosis status and nutritional factors (P < 0.05). This correlation, however, did not change substantially the effect of arsenic exposure on the risk of atherosclerosis as analyzed in a subsequent logistic regression model. Logistic regression analyses also show that elevated plasma homocysteine levels did not confer an independent risk for developing atherosclerosis in the study population. However, the risk of having atherosclerosis was increased to 5.4-fold (95% CI, 2.0-15.0) for the study subjects with high MMA% ({>=}16.5%) and high homocysteine levels ({>=}12.7 {mu}mol/l) as compared to those with low MMA% (<9.9%) and low homocysteine levels (<12.7 {mu}mol/l). Elevated homocysteinemia may exacerbate the formation of atherosclerosis related to arsenic exposure in individuals with high levels of MMA% in urine.« less
Serum Vitamin D Levels and Markers of Severity of Childhood Asthma in Costa Rica
Brehm, John M.; Celedón, Juan C.; Soto-Quiros, Manuel E.; Avila, Lydiana; Hunninghake, Gary M.; Forno, Erick; Laskey, Daniel; Sylvia, Jody S.; Hollis, Bruce W.; Weiss, Scott T.; Litonjua, Augusto A.
2009-01-01
Rationale: Maternal vitamin D intake during pregnancy has been inversely associated with asthma symptoms in early childhood. However, no study has examined the relationship between measured vitamin D levels and markers of asthma severity in childhood. Objectives: To determine the relationship between measured vitamin D levels and both markers of asthma severity and allergy in childhood. Methods: We examined the relation between 25-hydroxyvitamin D levels (the major circulating form of vitamin D) and markers of allergy and asthma severity in a cross-sectional study of 616 Costa Rican children between the ages of 6 and 14 years. Linear, logistic, and negative binomial regressions were used for the univariate and multivariate analyses. Measurements and Main Results: Of the 616 children with asthma, 175 (28%) had insufficient levels of vitamin D (<30 ng/ml). In multivariate linear regression models, vitamin D levels were significantly and inversely associated with total IgE and eosinophil count. In multivariate logistic regression models, a log10 unit increase in vitamin D levels was associated with reduced odds of any hospitalization in the previous year (odds ratio [OR], 0.05; 95% confidence interval [CI], 0.004–0.71; P = 0.03), any use of antiinflammatory medications in the previous year (OR, 0.18; 95% CI, 0.05–0.67; P = 0.01), and increased airway responsiveness (a ≤8.58-μmol provocative dose of methacholine producing a 20% fall in baseline FEV1 [OR, 0.15; 95% CI, 0.024–0.97; P = 0.05]). Conclusions: Our results suggest that vitamin D insufficiency is relatively frequent in an equatorial population of children with asthma. In these children, lower vitamin D levels are associated with increased markers of allergy and asthma severity. PMID:19179486
A Primer on Logistic Regression.
ERIC Educational Resources Information Center
Woldbeck, Tanya
This paper introduces logistic regression as a viable alternative when the researcher is faced with variables that are not continuous. If one is to use simple regression, the dependent variable must be measured on a continuous scale. In the behavioral sciences, it may not always be appropriate or possible to have a measured dependent variable on a…
de Oliveira, Elaine Cristina; dos Santos, Emerson Soares; Zeilhofer, Peter; Souza-Santos, Reinaldo; Atanaka-Santos, Marina
2013-11-15
In Brazil, 99% of the cases of malaria are concentrated in the Amazon region, with high level of transmission. The objectives of the study were to use geographic information systems (GIS) analysis and logistic regression as a tool to identify and analyse the relative likelihood and its socio-environmental determinants of malaria infection in the Vale do Amanhecer rural settlement, Brazil. A GIS database of georeferenced malaria cases, recorded in 2005, and multiple explanatory data layers was built, based on a multispectral Landsat 5 TM image, digital map of the settlement blocks and a SRTM digital elevation model. Satellite imagery was used to map the spatial patterns of land use and cover (LUC) and to derive spectral indices of vegetation density (NDVI) and soil/vegetation humidity (VSHI). An Euclidian distance operator was applied to measure proximity of domiciles to potential mosquito breeding habitats and gold mining areas. The malaria risk model was generated by multiple logistic regression, in which environmental factors were considered as independent variables and the number of cases, binarized by a threshold value was the dependent variable. Out of a total of 336 cases of malaria, 133 positive slides were from inhabitants at Road 08, which corresponds to 37.60% of the notifications. The southern region of the settlement presented 276 cases and a greater number of domiciles in which more than ten cases/home were notified. From these, 102 (30.36%) cases were caused by Plasmodium falciparum and 174 (51.79%) cases by Plasmodium vivax. Malaria risk is the highest in the south of the settlement, associated with proximity to gold mining sites, intense land use, high levels of soil/vegetation humidity and low vegetation density. Mid-resolution, remote sensing data and GIS-derived distance measures can be successfully combined with digital maps of the housing location of (non-) infected inhabitants to predict relative likelihood of disease infection through the analysis by logistic regression. Obtained findings on the relation between malaria cases and environmental factors should be applied in the future for land use planning in rural settlements in the Southern Amazon to minimize risks of disease transmission.
Tan, Ge; Yuan, Ruozhen; Wei, ChenChen; Xu, Mangmang; Liu, Ming
2018-05-26
Association between serum calcium and magnesium versus hemorrhagic transformation (HT) remains to be identified. A total of 1212 non-thrombolysis patients with serum calcium and magnesium collected within 24 h from stroke onset were enrolled. Backward stepwise multivariate logistic regression analysis was conducted to investigate association between calcium and magnesium versus HT. Calcium and magnesium were entered into logistic regression analysis in two models, separately: model 1, as continuous variable (per 1-mmol/L increase), and model 2, as four-categorized variable (being collapsed into quartiles). HT occurred in 140 patients (11.6%). Serum calcium was slightly lower in patients with HT than in patient without HT (P = 0.273). But serum magnesium was significantly lower in patients with HT than in patients without HT (P = 0.007). In logistic regression analysis, calcium displayed no association with HT. Magnesium, as either continuous or four-categorized variable, was independently and inversely associated with HT in stroke overall and stroke of large-artery atherosclerosis (LAA). The results demonstrated that serum calcium had no association with HT in patients without thrombolysis after acute ischemic stroke. Serum magnesium in low level was independently associated with increasing HT in stroke overall and particularly in stroke of LAA.
Development and validation of a mortality risk model for pediatric sepsis.
Chen, Mengshi; Lu, Xiulan; Hu, Li; Liu, Pingping; Zhao, Wenjiao; Yan, Haipeng; Tang, Liang; Zhu, Yimin; Xiao, Zhenghui; Chen, Lizhang; Tan, Hongzhuan
2017-05-01
Pediatric sepsis is a burdensome public health problem. Assessing the mortality risk of pediatric sepsis patients, offering effective treatment guidance, and improving prognosis to reduce mortality rates, are crucial.We extracted data derived from electronic medical records of pediatric sepsis patients that were collected during the first 24 hours after admission to the pediatric intensive care unit (PICU) of the Hunan Children's hospital from January 2012 to June 2014. A total of 788 children were randomly divided into a training (592, 75%) and validation group (196, 25%). The risk factors for mortality among these patients were identified by conducting multivariate logistic regression in the training group. Based on the established logistic regression equation, the logit probabilities for all patients (in both groups) were calculated to verify the model's internal and external validities.According to the training group, 6 variables (brain natriuretic peptide, albumin, total bilirubin, D-dimer, lactate levels, and mechanical ventilation in 24 hours) were included in the final logistic regression model. The areas under the curves of the model were 0.854 (0.826, 0.881) and 0.844 (0.816, 0.873) in the training and validation groups, respectively.The Mortality Risk Model for Pediatric Sepsis we established in this study showed acceptable accuracy to predict the mortality risk in pediatric sepsis patients.
Development and validation of a mortality risk model for pediatric sepsis
Chen, Mengshi; Lu, Xiulan; Hu, Li; Liu, Pingping; Zhao, Wenjiao; Yan, Haipeng; Tang, Liang; Zhu, Yimin; Xiao, Zhenghui; Chen, Lizhang; Tan, Hongzhuan
2017-01-01
Abstract Pediatric sepsis is a burdensome public health problem. Assessing the mortality risk of pediatric sepsis patients, offering effective treatment guidance, and improving prognosis to reduce mortality rates, are crucial. We extracted data derived from electronic medical records of pediatric sepsis patients that were collected during the first 24 hours after admission to the pediatric intensive care unit (PICU) of the Hunan Children's hospital from January 2012 to June 2014. A total of 788 children were randomly divided into a training (592, 75%) and validation group (196, 25%). The risk factors for mortality among these patients were identified by conducting multivariate logistic regression in the training group. Based on the established logistic regression equation, the logit probabilities for all patients (in both groups) were calculated to verify the model's internal and external validities. According to the training group, 6 variables (brain natriuretic peptide, albumin, total bilirubin, D-dimer, lactate levels, and mechanical ventilation in 24 hours) were included in the final logistic regression model. The areas under the curves of the model were 0.854 (0.826, 0.881) and 0.844 (0.816, 0.873) in the training and validation groups, respectively. The Mortality Risk Model for Pediatric Sepsis we established in this study showed acceptable accuracy to predict the mortality risk in pediatric sepsis patients. PMID:28514310
A Solution to Separation and Multicollinearity in Multiple Logistic Regression
Shen, Jianzhao; Gao, Sujuan
2010-01-01
In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27–38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth’s penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study. PMID:20376286
A Solution to Separation and Multicollinearity in Multiple Logistic Regression.
Shen, Jianzhao; Gao, Sujuan
2008-10-01
In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27-38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth's penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study.
Chang, Brian A; Pearson, William S; Owusu-Edusei, Kwame
2017-04-01
We used a combination of hot spot analysis (HSA) and spatial regression to examine county-level hot spot correlates for the most commonly reported nonviral sexually transmitted infections (STIs) in the 48 contiguous states in the United States (US). We obtained reported county-level total case rates of chlamydia, gonorrhea, and primary and secondary (P&S) syphilis in all counties in the 48 contiguous states from national surveillance data and computed temporally smoothed rates using 2008-2012 data. Covariates were obtained from county-level multiyear (2008-2012) American Community Surveys from the US census. We conducted HSA to identify hot spot counties for all three STIs. We then applied spatial logistic regression with the spatial error model to determine the association between the identified hot spots and the covariates. HSA indicated that ≥84% of hot spots for each STI were in the South. Spatial regression results indicated that, a 10-unit increase in the percentage of Black non-Hispanics was associated with ≈42% (P < 0.01) [≈22% (P < 0.01), for Hispanics] increase in the odds of being a hot spot county for chlamydia and gonorrhea, and ≈27% (P < 0.01) [≈11% (P < 0.01) for Hispanics] for P&S syphilis. Compared with the other regions (West, Midwest, and Northeast), counties in the South were 6.5 (P < 0.01; chlamydia), 9.6 (P < 0.01; gonorrhea), and 4.7 (P < 0.01; P&S syphilis) times more likely to be hot spots. Our study provides important information on hot spot clusters of nonviral STIs in the entire United States, including associations between hot spot counties and sociodemographic factors. Published by Elsevier Inc.
Prevalence and correlates of cognitive impairment in kidney transplant recipients.
Gupta, Aditi; Mahnken, Jonathan D; Johnson, David K; Thomas, Tashra S; Subramaniam, Dipti; Polshak, Tyler; Gani, Imran; John Chen, G; Burns, Jeffrey M; Sarnak, Mark J
2017-05-12
There is a high prevalence of cognitive impairment in dialysis patients. The prevalence of cognitive impairment after kidney transplantation is unknown. Study Design: Cross-sectional study. Single center study of prevalent kidney transplant recipients from a transplant clinic in a large academic center. Assessment of cognition using the Montreal Cognitive Assessment (MoCA). Demographic and clinical variables associated with cognitive impairment were also examined. Outcomes and Measurements: a) Prevalence of cognitive impairment defined by a MoCA score of <26. b) Multivariable linear and logistic regression to examine the association of demographic and clinical factors with cognitive impairment. Data from 226 patients were analyzed. Mean (SD) age was 54 (13.4) years, 73% were white, 60% were male, 37% had diabetes, 58% had an education level of college or above, and the mean (SD) time since kidney transplant was 3.4 (4.1) years. The prevalence of cognitive impairment was 58.0%. Multivariable linear regression demonstrated that older age, male gender and absence of diabetes were associated with lower MoCA scores (p < 0.01 for all). Estimated glomerular filtration rate (eGFR) was not associated with level of cognition. The logistic regression analysis confirmed the association of older age with cognitive impairment. Cognitive impairment is common in prevalent kidney transplant recipients, at a younger age compared to general population, and is associated with certain demographic variables, but not level of eGFR.
Denham, Melinda; Schell, Lawrence M; Deane, Glenn; Gallo, Mia V; Ravenscroft, Julia; DeCaprio, Anthony P
2005-02-01
Children are commonly exposed at background levels to several ubiquitous environmental pollutants, such as lead and persistent organic pollutants, that have been linked to neurologic and endocrine effects. These effects have prompted concern about alterations in human reproductive development. Few studies have examined the effects of these toxicants on human sexual maturation at levels commonly found in the general population, and none has been able to examine multiple toxicant exposures. The aim of the current investigation was to examine the relationship between attainment of menarche and levels of 6 environmental pollutants to which children are commonly exposed at low levels, ie, dichlorodiphenyldichloroethylene (p,p'-DDE), hexachlorobenzene (HCB), polychlorinated biphenyls (PCBs), mirex, lead, and mercury. This study was conducted with residents of the Akwesasne Mohawk Nation, a sovereign territory that spans the St Lawrence River and the boundaries of New York State and Ontario and Quebec, Canada. Since the 1950s, the St Lawrence River has been a site of substantial industrial development, and the Nation is currently adjacent to a US National Priority Superfund site. PCB, p,p'-DDE, HCB, and mirex levels exceeding the US Food and Drug Administration recommended tolerance limits for human consumption have been found in local animal species. The present analysis included 138 Akwesasne Mohawk Nation girls 10 to 16.9 years of age. Blood samples and sociodemographic data were collected by Akwesasne community members, without prior knowledge of participants' exposure status. Attainment of menses (menarche) was assessed as present or absent at the time of the interview. Congener-specific PCB analysis was available, and all 16 PCB congeners detected in >50% of the sample were included in analyses (International Union of Pure and Applied Chemistry numbers 52, 70, 74, 84, 87, 95, 99, 101 [+90], 105, 110, 118, 138 [+163 and 164], 149 [+123], 153, 180, and 187). Probit analysis was used to determine the median age at menarche for the sample. Binary logistic regression analysis was used to determine predictors of menarcheal status. Six toxicants (p,p'-DDE, HCB, PCBs, mirex, lead, and mercury) were entered into the logistic regression model. Age, socioeconomic status (SES), and BMI were tested as potential cofounders and were included in the model at P < .05. Interactions among toxicants were also evaluated. Toxicant levels were measured in blood for this sample and were consistent with long-term exposure to a variety of toxicants in multiple media. Mercury levels were at or below background levels, all lead levels were well below the Centers for Disease Control and Prevention action limit of 10 microg/dL, and PCB levels were consistent with a cumulative, continuing exposure pattern. The median age at menarche for the total sample was 12.2 years. The predicted age at menarche for girls with lead levels above the median (1.2 microg/dL) was 10.5 months later than that for girls with lead levels below the median. In the logistic regression analysis, age was the strongest predictor of menarcheal status and SES was also a significant predictor but BMI was not. The logistic regression analysis that corrected for age, SES, and other pollutants (p,p'-DDE, HCB, mirex, and mercury) indicated that, at their respective geometric means, lead (geometric mean: 0.49 microg/dL) was associated with a significantly lower probability of having reached menarche (beta = -1.29) and a group of 4 potentially estrogenic PCB congeners (E-PCB) (geometric mean: 0.12 ppb; International Union of Pure and Applied Chemistry numbers 52, 70, 101 [+90], and 187) was associated with a significantly greater probability of having reached menarche (beta = 2.13). Predicted probabilities at different levels of lead and PCBs were calculated on the basis of the logistic regression model. At the respective means of all toxicants and SES, 69% of 12-year-old girls were predicted to have reached menarche. However, at the 75th percentile of lead levels, only 10% of 12-year-old Mohawk girls were predicted to have reached menarche; at the 75th percentile of E-PCB levels, 86% of 12-year-old Mohawk girls were predicted to have reached menarche. No association was observed between mirex, p,p'-DDE, or HCB and menarcheal status. Although BMI was not a significant predictor, we tested BMI in the logistic regression model; it had little effect on the relationships between menarcheal status and either lead or E-PCB. In models testing toxicant interactions, age, SES, lead levels, and PCB levels continued to be significant predictors of menarcheal status. When each toxicant was tested in a logistic regression model correcting only for age and SES, we observed little change in the effects of lead or E-PCB on menarcheal status. The analysis of multichemical exposure among Akwesasne Mohawk Nation adolescent girls suggests that the attainment of menarche may be sensitive to relatively low levels of lead and certain PCB congeners. This study is distinguished by the ability to test many toxicants simultaneously and thus to exclude effects from unmeasured but coexisting exposures. By testing several PCB congener groupings, we were able to determine that specifically a group of potentially estrogenic PCB congeners affected the odds of reaching menarche. The lead and PCB findings are consistent with the literature and are biologically plausible. The sample size, cross-sectional study design, and possible occurrence of confounders beyond those tested suggest that results should be interpreted cautiously. Additional investigation to determine whether such low toxicant levels may affect reproduction and disorders of the reproductive system is warranted.
Mehrsai, Abdolrasoul; Guitynavard, Fateme; Nikoobakht, Mohammad Reza; Gooran, Shahram; Ahmadi, Ayat
2017-01-01
Mineralization inhibitors are required to prevent the precipitation of minerals and inhibit the formation of kidney stones and other ectopic calcifications. In laboratory studies, Fetuin-A as a glycoprotein has inhibited hydroxyapatite precipitation in calcium and phosphate supersaturated solutions; however, information about patients with kidney stones is limited. The aim of this study was to investigate the association of serum and urinary Fetuin-A levels with calcium oxalate kidney stones. In this case-control study, 30 patients with kidney stones and 30 healthy individuals without any history of urolithiasis who were referred to the urology ward of Sina Hospital of Tehran, Iran, in 2015 were entered into the study. All patients underwent computerized tomography scans. After collecting demographic information, serum and urine levels of Fetuin-A and some other calcification inhibitors and promoters, were measured and compared using T-test, Mann-Whitney and logistic regression between the two study groups. Patients with kidney stones, on average, had lower levels of Serum Fetuin-A (1522.27 ±755.39 vs. 1914.64 ±733.76 μg/ml; P = 0.046) as well as lower levels of Urine Fetuin-A (944.62 ±188.5 vs. 1409.68 ±295.26 μg/ml; P <0.001). Multivariate logistic analysis showed that urinary calcium and serum creatinine are the risk factors and Fetuin-A is a urinary protective factor for kidney stones. PFC Our study showed that patients with kidney stones had lower serum and urinary levels of Fetuin-A. In the logistic regression model, urinary Fetuin-A was reported as a protective factor for kidney stones.
Ye, Dong-qing; Hu, Yi-song; Li, Xiang-pei; Huang, Fen; Yang, Shi-gui; Hao, Jia-hu; Yin, Jing; Zhang, Guo-qing; Liu, Hui-hui
2004-11-01
To explore the impact of environmental factors, daily lifestyle, psycho-social factors and the interactions between environmental factors and chemokines genes on systemic lupus erythematosus (SLE). Case-control study was carried out and environmental factors for SLE were analyzed by univariate and multivariate unconditional logistic regression. Interactions between environmental factors and chemokines polymorphism contributing to systemic lupus erythematosus were also analyzed by logistic regression model. There were nineteen factors associated with SLE when univariate unconditional logistic regression was used. However, when multivariate unconditional logistic regression was used, only five factors showed having impacts on the disease, in which drinking well water (OR=0.099) was protective factor for SLE, and multiple drug allergy (OR=8.174), over-exposure to sunshine (OR=18.339), taking antibiotics (OR=9.630) and oral contraceptives were risk factors for SLE. When unconditional logistic regression model was used, results showed that there was interaction between eating irritable food and -2518MCP-1G/G genotype (OR=4.387). No interaction between environmental factors was found that contributing to SLE in this study. Many environmental factors were related to SLE, and there was an interaction between -2518MCP-1G/G genotype and eating irritable food.
Mielniczuk, Jan; Teisseyre, Paweł
2018-03-01
Detection of gene-gene interactions is one of the most important challenges in genome-wide case-control studies. Besides traditional logistic regression analysis, recently the entropy-based methods attracted a significant attention. Among entropy-based methods, interaction information is one of the most promising measures having many desirable properties. Although both logistic regression and interaction information have been used in several genome-wide association studies, the relationship between them has not been thoroughly investigated theoretically. The present paper attempts to fill this gap. We show that although certain connections between the two methods exist, in general they refer two different concepts of dependence and looking for interactions in those two senses leads to different approaches to interaction detection. We introduce ordering between interaction measures and specify conditions for independent and dependent genes under which interaction information is more discriminative measure than logistic regression. Moreover, we show that for so-called perfect distributions those measures are equivalent. The numerical experiments illustrate the theoretical findings indicating that interaction information and its modified version are more universal tools for detecting various types of interaction than logistic regression and linkage disequilibrium measures. © 2017 WILEY PERIODICALS, INC.
ERIC Educational Resources Information Center
Shih, Ching-Lin; Liu, Tien-Hsiang; Wang, Wen-Chung
2014-01-01
The simultaneous item bias test (SIBTEST) method regression procedure and the differential item functioning (DIF)-free-then-DIF strategy are applied to the logistic regression (LR) method simultaneously in this study. These procedures are used to adjust the effects of matching true score on observed score and to better control the Type I error…
Medina-Solis, Carlo Eduardo; Maupomé, Gerardo; del Socorro, Herrera Miriam; Pérez-Núñez, Ricardo; Avila-Burgos, Leticia; Lamadrid-Figueroa, Hector
2008-01-01
To determine the factors associated with the dental health services utilization among children ages 6 to 12 in León, Nicaragua. A cross-sectional study was carried out in 1,400 schoolchildren. Using a questionnaire, we determined information related to utilization and independent variables in the previous year. Oral health needs were established by means of a dental examination. To identify the independent variables associated with dental health services utilization, two types of multivariate regression models were used, according to the measurement scale of the outcome variable: a) frequency of utilization as (0) none, (1) one, and (2) two or more, analyzed with the ordered logistic regression and b) the type of service utilized as (0) none, (1) preventive services, (2) curative services, and (3) both services, analyzed with the multinomial logistic regression. The proportion of children who received at least one dental service in the 12 months prior to the study was 27.7 percent. The variables associated with utilization in the two models were older age, female sex, more frequent toothbrushing, positive attitude of the mother toward the child's oral health, higher socioeconomic level, and higher oral health needs. Various predisposing, enabling, and oral health needs variables were associated with higher dental health services utilization. As in prior reports elsewhere, these results from Nicaragua confirmed that utilization inequalities exist between socioeconomic groups. The multinomial logistic regression model evidenced the association of different variables depending on the type of service used.
Administrative Climate and Novices' Intent to Remain Teaching
ERIC Educational Resources Information Center
Pogodzinski, Ben; Youngs, Peter; Frank, Kenneth A.; Belman, Dale
2012-01-01
Using survey data from novice teachers at the elementary and middle school level across 11 districts, multilevel logistic regressions were estimated to examine the association between novices' perceptions of the administrative climate and their desire to remain teaching within their schools. We find that the probability that a novice teacher…
The Radius of Trust: Religion, Social Embeddedness and Trust in Strangers
ERIC Educational Resources Information Center
Welch, Michael R.; Sikkink, David; Loveland, Matthew T.
2007-01-01
Data from the 2002 Religion and Public Activism Survey were used to examine relationships among measures of religious orientation, embeddedness in social networks and the level of trust individuals direct toward others. Results from ordered logistic regression analysis demonstrate that Catholics and members of other denominations show…
ERIC Educational Resources Information Center
Lundetrae, Kjersti; Gabrielsen, Egil; Mykletun, Reidar
2010-01-01
Basic skills and educational level are closely related, and both might affect employment. Data from the Adult Literacy and Life Skills Survey were used to examine whether basic skills in terms of literacy and numeracy predicted youth unemployment (16-24 years) while controlling for educational level. Stepwise logistic regression showed that in…
ERIC Educational Resources Information Center
Thompson, Ronald G., Jr.; Auslander, Wendy F.; Alonzo, Dana
2012-01-01
The purpose of this study is to identify individual-level characteristics of foster care adolescents who are more likely to not participate in, and drop out of, a life-skills HIV prevention program delivered over 8 months. Structured interviews were conducted with 320 foster care adolescents (15-18 years). Logistic regression and survival analyses…
Pfeiffer, R M; Riedl, R
2015-08-15
We assess the asymptotic bias of estimates of exposure effects conditional on covariates when summary scores of confounders, instead of the confounders themselves, are used to analyze observational data. First, we study regression models for cohort data that are adjusted for summary scores. Second, we derive the asymptotic bias for case-control studies when cases and controls are matched on a summary score, and then analyzed either using conditional logistic regression or by unconditional logistic regression adjusted for the summary score. Two scores, the propensity score (PS) and the disease risk score (DRS) are studied in detail. For cohort analysis, when regression models are adjusted for the PS, the estimated conditional treatment effect is unbiased only for linear models, or at the null for non-linear models. Adjustment of cohort data for DRS yields unbiased estimates only for linear regression; all other estimates of exposure effects are biased. Matching cases and controls on DRS and analyzing them using conditional logistic regression yields unbiased estimates of exposure effect, whereas adjusting for the DRS in unconditional logistic regression yields biased estimates, even under the null hypothesis of no association. Matching cases and controls on the PS yield unbiased estimates only under the null for both conditional and unconditional logistic regression, adjusted for the PS. We study the bias for various confounding scenarios and compare our asymptotic results with those from simulations with limited sample sizes. To create realistic correlations among multiple confounders, we also based simulations on a real dataset. Copyright © 2015 John Wiley & Sons, Ltd.
Nie, Z Q; Ou, Y Q; Zhuang, J; Qu, Y J; Mai, J Z; Chen, J M; Liu, X Q
2016-05-01
Conditional logistic regression analysis and unconditional logistic regression analysis are commonly used in case control study, but Cox proportional hazard model is often used in survival data analysis. Most literature only refer to main effect model, however, generalized linear model differs from general linear model, and the interaction was composed of multiplicative interaction and additive interaction. The former is only statistical significant, but the latter has biological significance. In this paper, macros was written by using SAS 9.4 and the contrast ratio, attributable proportion due to interaction and synergy index were calculated while calculating the items of logistic and Cox regression interactions, and the confidence intervals of Wald, delta and profile likelihood were used to evaluate additive interaction for the reference in big data analysis in clinical epidemiology and in analysis of genetic multiplicative and additive interactions.
No rationale for 1 variable per 10 events criterion for binary logistic regression analysis.
van Smeden, Maarten; de Groot, Joris A H; Moons, Karel G M; Collins, Gary S; Altman, Douglas G; Eijkemans, Marinus J C; Reitsma, Johannes B
2016-11-24
Ten events per variable (EPV) is a widely advocated minimal criterion for sample size considerations in logistic regression analysis. Of three previous simulation studies that examined this minimal EPV criterion only one supports the use of a minimum of 10 EPV. In this paper, we examine the reasons for substantial differences between these extensive simulation studies. The current study uses Monte Carlo simulations to evaluate small sample bias, coverage of confidence intervals and mean square error of logit coefficients. Logistic regression models fitted by maximum likelihood and a modified estimation procedure, known as Firth's correction, are compared. The results show that besides EPV, the problems associated with low EPV depend on other factors such as the total sample size. It is also demonstrated that simulation results can be dominated by even a few simulated data sets for which the prediction of the outcome by the covariates is perfect ('separation'). We reveal that different approaches for identifying and handling separation leads to substantially different simulation results. We further show that Firth's correction can be used to improve the accuracy of regression coefficients and alleviate the problems associated with separation. The current evidence supporting EPV rules for binary logistic regression is weak. Given our findings, there is an urgent need for new research to provide guidance for supporting sample size considerations for binary logistic regression analysis.
Prediction models for clustered data: comparison of a random intercept and standard regression model
2013-01-01
Background When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. Methods Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. Results The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. Conclusion The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters. PMID:23414436
Bouwmeester, Walter; Twisk, Jos W R; Kappen, Teus H; van Klei, Wilton A; Moons, Karel G M; Vergouwe, Yvonne
2013-02-15
When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters.
Li, Yi; Tseng, Yufeng J.; Pan, Dahua; Liu, Jianzhong; Kern, Petra S.; Gerberick, G. Frank; Hopfinger, Anton J.
2008-01-01
Currently, the only validated methods to identify skin sensitization effects are in vivo models, such as the Local Lymph Node Assay (LLNA) and guinea pig studies. There is a tremendous need, in particular due to novel legislation, to develop animal alternatives, eg. Quantitative Structure-Activity Relationship (QSAR) models. Here, QSAR models for skin sensitization using LLNA data have been constructed. The descriptors used to generate these models are derived from the 4D-molecular similarity paradigm and are referred to as universal 4D-fingerprints. A training set of 132 structurally diverse compounds and a test set of 15 structurally diverse compounds were used in this study. The statistical methodologies used to build the models are logistic regression (LR), and partial least square coupled logistic regression (PLS-LR), which prove to be effective tools for studying skin sensitization measures expressed in the two categorical terms of sensitizer and non-sensitizer. QSAR models with low values of the Hosmer-Lemeshow goodness-of-fit statistic, χHL2, are significant and predictive. For the training set, the cross-validated prediction accuracy of the logistic regression models ranges from 77.3% to 78.0%, while that of PLS-logistic regression models ranges from 87.1% to 89.4%. For the test set, the prediction accuracy of logistic regression models ranges from 80.0%-86.7%, while that of PLS-logistic regression models ranges from 73.3%-80.0%. The QSAR models are made up of 4D-fingerprints related to aromatic atoms, hydrogen bond acceptors and negatively partially charged atoms. PMID:17226934
MODELING SNAKE MICROHABITAT FROM RADIOTELEMETRY STUDIES USING POLYTOMOUS LOGISTIC REGRESSION
Multivariate analysis of snake microhabitat has historically used techniques that were derived under assumptions of normality and common covariance structure (e.g., discriminant function analysis, MANOVA). In this study, polytomous logistic regression (PLR which does not require ...
Brenn, T; Arnesen, E
1985-01-01
For comparative evaluation, discriminant analysis, logistic regression and Cox's model were used to select risk factors for total and coronary deaths among 6595 men aged 20-49 followed for 9 years. Groups with mortality between 5 and 93 per 1000 were considered. Discriminant analysis selected variable sets only marginally different from the logistic and Cox methods which always selected the same sets. A time-saving option, offered for both the logistic and Cox selection, showed no advantage compared with discriminant analysis. Analysing more than 3800 subjects, the logistic and Cox methods consumed, respectively, 80 and 10 times more computer time than discriminant analysis. When including the same set of variables in non-stepwise analyses, all methods estimated coefficients that in most cases were almost identical. In conclusion, discriminant analysis is advocated for preliminary or stepwise analysis, otherwise Cox's method should be used.
ERIC Educational Resources Information Center
DeMars, Christine E.
2009-01-01
The Mantel-Haenszel (MH) and logistic regression (LR) differential item functioning (DIF) procedures have inflated Type I error rates when there are large mean group differences, short tests, and large sample sizes.When there are large group differences in mean score, groups matched on the observed number-correct score differ on true score,…
Meng, Ge; Feng, Yan; Nie, Zhiqing; Wu, Xiaomeng; Wei, Hongying; Wu, Shaowei; Yin, Yong; Wang, Yan
2016-04-01
Polybrominated diphenyl ethers (PBDEs), polychlorinated biphenyls (PCBs) and organochlorine pesticides (OCPs) are common persistent organic pollutants (POPs) that may be associated with childhood asthma. The concentrations of PBDEs, PCBs and OCPs were analyzed in pooled serum samples from both asthmatic and non-asthmatic children. The differences in the internal exposure levels between the case and control groups were tested (p value <0.0012). The associations between the internal exposure concentrations of the POPs and childhood asthma were estimated based on the odds ratios (ORs) calculated using logistic regression models. There were significant differences in three PBDEs, 26 PCBs and seven OCPs between the two groups, with significantly higher levels in the cases. The multiple logistic regression models demonstrated that the internal exposure concentrations of a number of the POPs (23 PCBs, p,p'-DDE and α-HCH) were positively associated with childhood asthma. Some synergistic effects were observed when the children were co-exposed to the chemicals. BDE-209 was positively associated with asthma aggravation. This study indicates the potential relationships between the internal exposure concentrations of particular POPs and the development of childhood asthma. Copyright © 2015 Elsevier Inc. All rights reserved.
Sze, N N; Wong, S C; Lee, C Y
2014-12-01
In past several decades, many countries have set quantified road safety targets to motivate transport authorities to develop systematic road safety strategies and measures and facilitate the achievement of continuous road safety improvement. Studies have been conducted to evaluate the association between the setting of quantified road safety targets and road fatality reduction, in both the short and long run, by comparing road fatalities before and after the implementation of a quantified road safety target. However, not much work has been done to evaluate whether the quantified road safety targets are actually achieved. In this study, we used a binary logistic regression model to examine the factors - including vehicle ownership, fatality rate, and national income, in addition to level of ambition and duration of target - that contribute to a target's success. We analyzed 55 quantified road safety targets set by 29 countries from 1981 to 2009, and the results indicate that targets that are in progress and with lower level of ambitions had a higher likelihood of eventually being achieved. Moreover, possible interaction effects on the association between level of ambition and the likelihood of success are also revealed. Copyright © 2014 Elsevier Ltd. All rights reserved.
Satellite rainfall retrieval by logistic regression
NASA Technical Reports Server (NTRS)
Chiu, Long S.
1986-01-01
The potential use of logistic regression in rainfall estimation from satellite measurements is investigated. Satellite measurements provide covariate information in terms of radiances from different remote sensors.The logistic regression technique can effectively accommodate many covariates and test their significance in the estimation. The outcome from the logistical model is the probability that the rainrate of a satellite pixel is above a certain threshold. By varying the thresholds, a rainrate histogram can be obtained, from which the mean and the variant can be estimated. A logistical model is developed and applied to rainfall data collected during GATE, using as covariates the fractional rain area and a radiance measurement which is deduced from a microwave temperature-rainrate relation. It is demonstrated that the fractional rain area is an important covariate in the model, consistent with the use of the so-called Area Time Integral in estimating total rain volume in other studies. To calibrate the logistical model, simulated rain fields generated by rainfield models with prescribed parameters are needed. A stringent test of the logistical model is its ability to recover the prescribed parameters of simulated rain fields. A rain field simulation model which preserves the fractional rain area and lognormality of rainrates as found in GATE is developed. A stochastic regression model of branching and immigration whose solutions are lognormally distributed in some asymptotic limits has also been developed.
Practical Session: Logistic Regression
NASA Astrophysics Data System (ADS)
Clausel, M.; Grégoire, G.
2014-12-01
An exercise is proposed to illustrate the logistic regression. One investigates the different risk factors in the apparition of coronary heart disease. It has been proposed in Chapter 5 of the book of D.G. Kleinbaum and M. Klein, "Logistic Regression", Statistics for Biology and Health, Springer Science Business Media, LLC (2010) and also by D. Chessel and A.B. Dufour in Lyon 1 (see Sect. 6 of http://pbil.univ-lyon1.fr/R/pdf/tdr341.pdf). This example is based on data given in the file evans.txt coming from http://www.sph.emory.edu/dkleinb/logreg3.htm#data.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ghazali, Amirul Syafiq Mohd; Ali, Zalila; Noor, Norlida Mohd
Multinomial logistic regression is widely used to model the outcomes of a polytomous response variable, a categorical dependent variable with more than two categories. The model assumes that the conditional mean of the dependent categorical variables is the logistic function of an affine combination of predictor variables. Its procedure gives a number of logistic regression models that make specific comparisons of the response categories. When there are q categories of the response variable, the model consists of q-1 logit equations which are fitted simultaneously. The model is validated by variable selection procedures, tests of regression coefficients, a significant test ofmore » the overall model, goodness-of-fit measures, and validation of predicted probabilities using odds ratio. This study used the multinomial logistic regression model to investigate obesity and overweight among primary school students in a rural area on the basis of their demographic profiles, lifestyles and on the diet and food intake. The results indicated that obesity and overweight of students are related to gender, religion, sleep duration, time spent on electronic games, breakfast intake in a week, with whom meals are taken, protein intake, and also, the interaction between breakfast intake in a week with sleep duration, and the interaction between gender and protein intake.« less
NASA Astrophysics Data System (ADS)
Ghazali, Amirul Syafiq Mohd; Ali, Zalila; Noor, Norlida Mohd; Baharum, Adam
2015-10-01
Multinomial logistic regression is widely used to model the outcomes of a polytomous response variable, a categorical dependent variable with more than two categories. The model assumes that the conditional mean of the dependent categorical variables is the logistic function of an affine combination of predictor variables. Its procedure gives a number of logistic regression models that make specific comparisons of the response categories. When there are q categories of the response variable, the model consists of q-1 logit equations which are fitted simultaneously. The model is validated by variable selection procedures, tests of regression coefficients, a significant test of the overall model, goodness-of-fit measures, and validation of predicted probabilities using odds ratio. This study used the multinomial logistic regression model to investigate obesity and overweight among primary school students in a rural area on the basis of their demographic profiles, lifestyles and on the diet and food intake. The results indicated that obesity and overweight of students are related to gender, religion, sleep duration, time spent on electronic games, breakfast intake in a week, with whom meals are taken, protein intake, and also, the interaction between breakfast intake in a week with sleep duration, and the interaction between gender and protein intake.
The cross-validated AUC for MCP-logistic regression with high-dimensional data.
Jiang, Dingfeng; Huang, Jian; Zhang, Ying
2013-10-01
We propose a cross-validated area under the receiving operator characteristic (ROC) curve (CV-AUC) criterion for tuning parameter selection for penalized methods in sparse, high-dimensional logistic regression models. We use this criterion in combination with the minimax concave penalty (MCP) method for variable selection. The CV-AUC criterion is specifically designed for optimizing the classification performance for binary outcome data. To implement the proposed approach, we derive an efficient coordinate descent algorithm to compute the MCP-logistic regression solution surface. Simulation studies are conducted to evaluate the finite sample performance of the proposed method and its comparison with the existing methods including the Akaike information criterion (AIC), Bayesian information criterion (BIC) or Extended BIC (EBIC). The model selected based on the CV-AUC criterion tends to have a larger predictive AUC and smaller classification error than those with tuning parameters selected using the AIC, BIC or EBIC. We illustrate the application of the MCP-logistic regression with the CV-AUC criterion on three microarray datasets from the studies that attempt to identify genes related to cancers. Our simulation studies and data examples demonstrate that the CV-AUC is an attractive method for tuning parameter selection for penalized methods in high-dimensional logistic regression models.
Perioperative factors associated with pressure ulcer development after major surgery.
Kim, Jeong Min; Lee, Hyunjeong; Ha, Taehoon; Na, Sungwon
2018-02-01
Postoperative pressure ulcers are important indicators of perioperative care quality, and are serious and expensive complications during critical care. This study aimed to identify perioperative risk factors for postoperative pressure ulcers. This retrospective case-control study evaluated 2,498 patients who underwent major surgery. Forty-three patients developed postoperative pressure ulcers and were matched to 86 control patients based on age, sex, surgery, and comorbidities. The pressure ulcer group had lower baseline hemoglobin and albumin levels, compared to the control group. The pressure ulcer group also had higher values for lactate levels, blood loss, and number of packed red blood cell ( p RBC) units. Univariate analysis revealed that pressure ulcer development was associated with preoperative hemoglobin levels, albumin levels, lactate levels, intraoperative blood loss, number of p RBC units, Acute Physiologic and Chronic Health Evaluation II score, Braden scale score, postoperative ventilator care, and patient restraint. In the multiple logistic regression analysis, only preoperative low albumin levels (odds ratio [OR]: 0.21, 95% CI: 0.05-0.82; P < 0.05) and high lactate levels (OR: 1.70, 95% CI: 1.07-2.71; P < 0.05) were independently associated with pressure ulcer development. A receiver operating characteristic curve was used to assess the predictive power of the logistic regression model, and the area under the curve was 0.88 (95% CI: 0.79-0.97; P < 0.001). The present study revealed that preoperative low albumin levels and high lactate levels were significantly associated with pressure ulcer development after surgery.
Risk factors for persistent gestational trophoblastic neoplasia.
Kuyumcuoglu, Umur; Guzel, Ali Irfan; Erdemoglu, Mahmut; Celik, Yusuf
2011-01-01
This retrospective study evaluated the risk factors for persistent gestational trophoblastic disease (GTN) and determined their odds ratios. This study included 100 cases with GTN admitted to our clinic. Possible risk factors recorded were age, gravidity, parity, size of the neoplasia, and beta-human chorionic gonadotropin levels (beta-hCG) before and after the procedure. Statistical analyses consisted of the independent sample t-test and logistic regression using the statistical package SPSS ver. 15.0 for Windows (SPSS, Chicago, IL, USA). Twenty of the cases had persistent GTN, and the differences between these and the others cases were evaluated. The size of the neoplasia and histopathological type of GTN had no statistical relationship with persistence, whereas age, gravidity, and beta-hCG levels were significant risk factors for persistent GTN (p < 0.05). The odds ratios (95% confidence interval (CI)) for age, gravidity, and pre- and post-evacuation beta-hCG levels determined using logistic regression were 4.678 (0.97-22.44), 7.315 (1.16-46.16), 2.637 (1.41-4.94), and 2.339 (1.52-3.60), respectively. Patient age, gravidity, and beta-hCG levels were risk factors for persistent GTN, whereas the size of the neoplasia and histopathological type of GTN were not significant risk factors.
Lawal, A O; Kolude, B; Adeyemi, B F; Lawoyin, J O; Akang, E E
2012-01-01
Tobacco and alcohol are major risk factors of oral cancer, but nutritional deficiency may also contribute to development of oral cancer. This study compared serum antioxidant vitamin levels in oral cancer patients and controls in order to validate the role of vitamin deficiencies in the etiology of oral cancer. Serum vitamin A, C, and E levels of 33 oral cancer patients and 30 controls at University College Hospital, Ibadan, Nigeria, were determined using standard methods. The data obtained were analyzed using the Student t-test, odds ratio, and logistic regression. Mean vitamin A, C, and E levels were significantly lower in oral cancer patients (P=0.022, P=0.000, and P=0.013 respectively). Risk of oral cancer was 10.89, 11.35, and 5.6 times more in patients with low serum vitamins A, C, and E, respectively. However, on logistic regression analysis, only low serum vitamin E independently predicted occurrence of oral cancer. The lower serum vitamin A, C, and E levels in oral cancer patients could be either a cause or an effect of the oral cancer. Further studies using a larger sample size and cohort studies with long-term follow-up of subjects are desirable.
Tabała, Klaudia; Wrzesińska, Magdalena; Stecz, Patryk; Kocur, Józef
2016-12-23
Chronic obstructive pulmonary disease (COPD) and asthma are a challenge to public health, with the sufferers experiencing a range of psychological factors affecting their health and behavior. The aim of the present study was to determine the level of anxiety, personality traits and stress-coping ability of patients with obstructive lung disease and comparison with a group of healthy controls. The research was conducted on a group of 150 people with obstructive lung diseases (asthma and COPD) and healthy controls (mean age = 56.0 ± 16.00). Four surveys were used: a sociodemographic survey, NEO-FFI Personality Inventory, State-Trait Anxiety Inventory (STAI), and Brief Cope Inventory. Logistic regression was used to identify the investigated variables which best differentiated the healthy and sick individuals. Patients with asthma or COPD demonstrated a significantly lower level of conscientiousness, openness to experience, active coping and planning, as well as higher levels of neuroticism and a greater tendency to behavioral disengagement. Logistic regression found trait-anxiety, openness to experience, positive reframing, acceptance, humor and behavioral disengagement to be best at distinguishing people with lung diseases from healthy individuals. The results indicate the need for intervention in the psychological functioning of people with obstructive diseases.
Sirichotiratana, Nithat; Yogi, Subash; Prutipinyo, Chardsumon
2013-01-01
This study was conducted during February-March 2012 to determine the perception and support regarding smoke-free policy among tourists at Suvarnabhumi International Airport, Bangkok, Thailand. In this cross-sectional study, 200 tourists (n = 200) were enrolled by convenience sampling and interviewed by structured questionnaire. Descriptive statistics, chi-square, and multinomial logistic regression were adopted in the study. Results revealed that half (50%) of the tourists were current smokers and 55% had visited Thailand twice or more. Three quarter (76%) of tourists indicated that they would visit Thailand again even if it had a 100% smoke-free regulation. Almost all (99%) of the tourists had supported for the smoke-free policy (partial ban and total ban), and current smokers had higher percentage of support than non-smokers. Two factors, current smoking status and knowledge level, were significantly associated with perception level. After analysis with Multinomial Logistic Regression, it was found that perception, country group, and presence of designated smoking room (DSR) were associated with smoke-free policy. Recommendation is that, at institution level effective monitoring system is needed at the airport. At policy level, the recommendation is that effective comprehensive policy needed to be emphasized to ensure smoke-free airport environment. PMID:23999549
Kim, So Young; Sim, Songyong; Choi, Hyo Geun
2017-01-01
Although an association between energy drinks and suicide has been suggested, few prior studies have considered the role of emotional factors including stress, sleep, and school performance in adolescents. This study aimed to evaluate the association of energy drinks with suicide, independent of possible confounders including stress, sleep, and school performance. In total, 121,106 adolescents with 13-18 years olds from the 2014 and 2015 Korea Youth Risk Behavior Web-based Survey were surveyed for age, sex, region of residence, economic level, paternal and maternal education level, sleep time, stress level, school performance, frequency of energy drink intake, and suicide attempts. Subjective stress levels were classified into severe, moderate, mild, a little, and no stress. Sleep time was divided into 6 groups: < 6 h; 6 ≤ h < 7; 7 ≤ h < 8; 8 ≤ h < 9; and ≥ 9 h. School performance was classified into 5 levels: A (highest), B (middle, high), C (middle), D (middle, low), and E (lowest). Frequency of energy drink consumption was divided into 3 groups: ≥ 3, 1-2, and 0 times a week. The associations of sleep time, stress level, and school performance with suicide attempts and the frequency of energy drink intake were analyzed using multiple and ordinal logistic regression analysis, respectively, with complex sampling. The relationship between frequency of energy drink intake and suicide attempts was analyzed using multiple logistic regression analysis with complex sampling. Higher stress levels, lack of sleep, and low school performance were significantly associated with suicide attempts (each P < 0.001). These variables of high stress level, abnormal sleep time, and low school performance were also proportionally related with higher energy drink intake (P < 0.001). Frequent energy drink intake was significantly associated with suicide attempts in multiple logistic regression analyses (AOR for frequency of energy intake ≥ 3 times a week = 3.03, 95% CI = 2.64-3.49, P < 0.001). Severe stress, inadequate sleep, and low school performance were related with more energy drink intake and suicide attempts in Korean adolescents. Frequent energy drink intake was positively related with suicide attempts, even after adjusting for stress, sleep time, and school performance.
Kim, So Young; Sim, Songyong
2017-01-01
Objective Although an association between energy drinks and suicide has been suggested, few prior studies have considered the role of emotional factors including stress, sleep, and school performance in adolescents. This study aimed to evaluate the association of energy drinks with suicide, independent of possible confounders including stress, sleep, and school performance. Methods In total, 121,106 adolescents with 13–18 years olds from the 2014 and 2015 Korea Youth Risk Behavior Web-based Survey were surveyed for age, sex, region of residence, economic level, paternal and maternal education level, sleep time, stress level, school performance, frequency of energy drink intake, and suicide attempts. Subjective stress levels were classified into severe, moderate, mild, a little, and no stress. Sleep time was divided into 6 groups: < 6 h; 6 ≤ h < 7; 7 ≤ h < 8; 8 ≤ h < 9; and ≥ 9 h. School performance was classified into 5 levels: A (highest), B (middle, high), C (middle), D (middle, low), and E (lowest). Frequency of energy drink consumption was divided into 3 groups: ≥ 3, 1–2, and 0 times a week. The associations of sleep time, stress level, and school performance with suicide attempts and the frequency of energy drink intake were analyzed using multiple and ordinal logistic regression analysis, respectively, with complex sampling. The relationship between frequency of energy drink intake and suicide attempts was analyzed using multiple logistic regression analysis with complex sampling. Results Higher stress levels, lack of sleep, and low school performance were significantly associated with suicide attempts (each P < 0.001). These variables of high stress level, abnormal sleep time, and low school performance were also proportionally related with higher energy drink intake (P < 0.001). Frequent energy drink intake was significantly associated with suicide attempts in multiple logistic regression analyses (AOR for frequency of energy intake ≥ 3 times a week = 3.03, 95% CI = 2.64–3.49, P < 0.001). Conclusion Severe stress, inadequate sleep, and low school performance were related with more energy drink intake and suicide attempts in Korean adolescents. Frequent energy drink intake was positively related with suicide attempts, even after adjusting for stress, sleep time, and school performance. PMID:29135989
Vaeth, Michael; Skovlund, Eva
2004-06-15
For a given regression problem it is possible to identify a suitably defined equivalent two-sample problem such that the power or sample size obtained for the two-sample problem also applies to the regression problem. For a standard linear regression model the equivalent two-sample problem is easily identified, but for generalized linear models and for Cox regression models the situation is more complicated. An approximately equivalent two-sample problem may, however, also be identified here. In particular, we show that for logistic regression and Cox regression models the equivalent two-sample problem is obtained by selecting two equally sized samples for which the parameters differ by a value equal to the slope times twice the standard deviation of the independent variable and further requiring that the overall expected number of events is unchanged. In a simulation study we examine the validity of this approach to power calculations in logistic regression and Cox regression models. Several different covariate distributions are considered for selected values of the overall response probability and a range of alternatives. For the Cox regression model we consider both constant and non-constant hazard rates. The results show that in general the approach is remarkably accurate even in relatively small samples. Some discrepancies are, however, found in small samples with few events and a highly skewed covariate distribution. Comparison with results based on alternative methods for logistic regression models with a single continuous covariate indicates that the proposed method is at least as good as its competitors. The method is easy to implement and therefore provides a simple way to extend the range of problems that can be covered by the usual formulas for power and sample size determination. Copyright 2004 John Wiley & Sons, Ltd.
ERIC Educational Resources Information Center
Bozpolat, Ebru
2017-01-01
The purpose of this study is to determine whether Cumhuriyet University Faculty of Education students' levels of speaking anxiety are predicted by the variables of gender, department, grade, such sub-dimensions of "Speaking Self-Efficacy Scale for Pre-Service Teachers" as "public speaking," "effective speaking,"…
The Differential Effects of Preschool: Evidence from Virginia
ERIC Educational Resources Information Center
Huang, Francis L.; Invernizzi, Marcia A.; Drake, E. Allison
2012-01-01
This study investigated the differential and persistent effects of a state-funded pre-K program, the Virginia Preschool Initiative (VPI). We analyzed data from a cohort of over 60,000 students nested in approximately 1000 schools from the beginning of kindergarten to the end of first grade using two-level hierarchical logistic regression models.…
ERIC Educational Resources Information Center
Balfour, Danny L.; Neff, Donna M.
1993-01-01
A logistic regression model applied to data from 171 child service caseworkers identified variables determining job turnover during times of intense external criticism of the agency (length of service, professional commitment, level of education). A special training program did not significantly reduce the probability of turnover. (SK)
Racial Threat and White Opposition to Bilingual Education in Texas
ERIC Educational Resources Information Center
Hempel, Lynn M.; Dowling, Julie A.; Boardman, Jason D.; Ellison, Christopher G.
2013-01-01
This study examines local contextual conditions that influence opposition to bilingual education among non-Hispanic Whites, net of individual-level characteristics. Data from the Texas Poll (N = 615) are used in conjunction with U.S. Census data to test five competing hypotheses using binomial and multinomial logistic regression models. Our…
Analyzing Army Reserve Unsatisfactory Participants through Logistic Regression
2012-06-08
Unsatisfactory Participants by State/Territory ...............................45 Figure 14. Observed vs . Expected Unsatisfactory Participants by Grade...47 Figure 15. Observed vs . Expected Unsatisfactory Participants by Age ............................47 x TABLES Page Table...rank, and previous military experience. “A typical 1995-96 USAR Unsatisfactory Participant was a white, unmarried male whose highest level of
The Impact of Teacher Collaboration on School Management in Canada
ERIC Educational Resources Information Center
Bouchamma, Yamina; Savoie, Andrea A.; Basque, Marc
2012-01-01
This study examined the level of collaboration between Francophone and Anglophone language teachers of 13- and 16- year-old Canadian students (N = 4,494) using data from the 2002 SAIP (School Achievement Indicators Program) of the Council of Ministers of Education of Canada. Among 32 factors, logistic regression identified six predictors of…
Ethnicity and Economic Well-Being: The Case of Ghana
ERIC Educational Resources Information Center
Addai, Isaac; Pokimica, Jelena
2010-01-01
In the context of decades of successful economic reforms in Ghana, this study investigates whether ethnicity influences economic well-being (perceived and actual) among Ghanaians at the micro-level. Drawing on Afro-barometer 2008 data, the authors employs logistic and multiple regression techniques to explore the relative effect of ethnicity on…
ERIC Educational Resources Information Center
McDonnall, Michele Capella
2011-01-01
The study reported here identified factors that predict employment for transition-age youths with visual impairments. Logistic regression was used to predict employment at two levels. Significant variables were early and recent work experiences, completion of a postsecondary program, difficulty with transportation, independent travel skills, and…
ERIC Educational Resources Information Center
Curran, F. Chris
2017-01-01
Little research explores the relative influence of various stakeholders on school discipline policy. Using data from the SASS and ordered logistic regression, this study explores such influence while assessing variation across schools types and changes over time. Principals consistently rate themselves and teachers as the most influential…
Student Assistance Program Outcomes for Students at Risk for Suicide
ERIC Educational Resources Information Center
Biddle, Virginia Sue; Kern, John, III; Brent, David A.; Thurkettle, Mary Ann; Puskar, Kathryn R.; Sekula, L. Kathleen
2014-01-01
Pennsylvania's response to adolescent suicide is its Student Assistance Program (SAP). SAP has been funded for 27 years although no statewide outcome studies using case-level data have been conducted. This study used logistic regression to examine drug-/alcohol-related behaviors and suspensions of suicidal students who participated in SAP. Of the…
The Oklahoma's Promise Program: A National Model to Promote College Persistence
ERIC Educational Resources Information Center
Mendoza, Pilar; Mendez, Jesse P.
2013-01-01
Using a multi-method approach involving fixed effects and logistic regressions, this study examined the effect of the Oklahoma's Promise Program on student persistence in relation to the Pell and Stafford federal programs and according to socio-economic characteristics and class level. The Oklahoma's Promise is a hybrid state program that pays…
Mark Spencer; Kevin O' Hara
2007-01-01
Phytophthora ramorum attacks tanoak (Lithocarpus densiflorus) in California and Oregon. We present a stand-level study examining the presence of disease symptoms in individual stems. Working with data from four plots in redwood (Sequoia sempervirens)/tanoak forests in Marin County, and three plots in Mendocino...
Chang, Jianfang; Tse, Chi-Shing; Leung, Grace Tak Yu; Fung, Ada Wai Tung; Hau, Kit-Tai; Chiu, Helen Fung Kum; Lam, Linda Chiu Wa
2014-06-01
Education has a profound effect on older adults' cognitive performance. In Hong Kong, some dementia screening tasks were originally designed for developed population with, on average, higher education. We compared the screening power of these tasks for Chinese older adults with different levels of education. Community-dwelling older adults who were healthy (N = 383) and with very mild dementia (N = 405) performed the following tasks: Mini-Mental State Examination, Alzheimer's Disease Assessment Scale-Cognitive subscales, Verbal Fluency, Abstract Thinking, and Visual/Digit Span. Logistic regression was used to examine the power of these tasks to predict Clinical Dementia Rating (CDR 0.5 vs. 0). Logistic regression analysis showed that while the screening power of the total scores in all tasks was similar for high and low education groups, there were education biases in some items of these tasks. The differential screening power in high and low education groups was not identical across items in some tasks. Thus, in cognitive assessments, we should exercise great caution when using these potentially biased items for older adults with limited education.
Kim, Yi-Soon; Kim, Min-Za; Jeong, Ihn-Sook
2004-08-01
This study was aimed to identify the effect of self-foot reflexology on the relief of premenstrual syndrome and dysmenorrhea in high school girls. Study subjects was 236 women residing in the community, teachers and nurses who were older than 45 were recruited. Data was collected with self administered questionnaires from July 1st to August 31st, 2003 and analysed using SPSS/WIN 10.0 with Xtest, t-test, and stepwise multiple logistic regression at a significant level of =.05. The breast cancer screening rate was 57.2%, and repeat screening rate was 15.3%. With the multiple logistic regression analysis, factors associated with mammography screening were age and perceived barriers of action, and factors related to the repeat mammography screening were education level and other cancer screening experience. Based on the results, we recommend the development of an intervention program to decrease the perceived barrier of action, to regard mammography as an essential test in regular check-up, and to give active advertisement and education to the public to improve the rates of breast cancer screening and repeat screening.
[Metabolic syndrome in workers of a second level hospital].
Mathiew-Quirós, Alvaro; Salinas-Martínez, Ana María; Hernández-Herrera, Ricardo Jorge; Gallardo-Vela, José Alberto
2014-01-01
People with metabolic syndrome (20-25 % of the world population) are three times more likely to suffer a heart attack or stroke and twice as likely to die from this cause. The objective of this study was to assess the prevalence of metabolic syndrome in workers of a second level hospital. This was a cross-sectional study with 160 healthcare workers in Monterrey, México. Sociodemographic, anthropometric and biochemical data were obtained to assess the prevalence of metabolic syndrome. Bivariate and multiple logistic regression analysis were carried out in order to assess the relationship between metabolic syndrome and sociodemographic and occupational variables. The prevalence of metabolic syndrome among workers was 38.1 %. Nurses were more affected with 32.8 %. Overweight and obesity were prevalent in 78 %. In the logistic regression there was a significant association between metabolic syndrome and not having partner (OR 3.98, 95 % CI [1.54-10.25]) and obesity (OR 4.69, 95 % CI [1.73-12.73]). The prevalence of metabolic syndrome and obesity is alarming. Appropriate and prompt actions must be taken in order to reduce the risk of cardiovascular disease in this population.
Variational dynamic background model for keyword spotting in handwritten documents
NASA Astrophysics Data System (ADS)
Kumar, Gaurav; Wshah, Safwan; Govindaraju, Venu
2013-12-01
We propose a bayesian framework for keyword spotting in handwritten documents. This work is an extension to our previous work where we proposed dynamic background model, DBM for keyword spotting that takes into account the local character level scores and global word level scores to learn a logistic regression classifier to separate keywords from non-keywords. In this work, we add a bayesian layer on top of the DBM called the variational dynamic background model, VDBM. The logistic regression classifier uses the sigmoid function to separate keywords from non-keywords. The sigmoid function being neither convex nor concave, exact inference of VDBM becomes intractable. An expectation maximization step is proposed to do approximate inference. The advantage of VDBM over the DBM is multi-fold. Firstly, being bayesian, it prevents over-fitting of data. Secondly, it provides better modeling of data and an improved prediction of unseen data. VDBM is evaluated on the IAM dataset and the results prove that it outperforms our prior work and other state of the art line based word spotting system.
Rai, Rajesh Kumar; Unisa, Sayeed
2013-06-01
This study examines the reasons for not using any method of contraception as well as reasons for not using modern methods of contraception, and factors associated with the future intention to use different types of contraceptives in India and its selected states, namely Uttar Pradesh, Assam and West Bengal. Data from the third wave of District Level Household and Facility Survey, 2007-08 were used. Bivariate as well as logistic regression analyses were performed to fulfill the study objective. Postpartum amenorrhea and breastfeeding practices were reported as the foremost causes for not using any method of contraception. Opposition to use, health concerns and fear of side effects were reported to be major hurdles in the way of using modern methods of contraception. Results from logistic regression suggest considerable variation in explaining the factors associated with future intention to use contraceptives. Promotion of health education addressing the advantages of contraceptive methods and eliminating apprehension about the use of these methods through effective communication by community level workers is the need of the hour. Copyright © 2013 Elsevier B.V. All rights reserved.
Demand analysis of flood insurance by using logistic regression model and genetic algorithm
NASA Astrophysics Data System (ADS)
Sidi, P.; Mamat, M. B.; Sukono; Supian, S.; Putra, A. S.
2018-03-01
Citarum River floods in the area of South Bandung Indonesia, often resulting damage to some buildings belonging to the people living in the vicinity. One effort to alleviate the risk of building damage is to have flood insurance. The main obstacle is not all people in the Citarum basin decide to buy flood insurance. In this paper, we intend to analyse the decision to buy flood insurance. It is assumed that there are eight variables that influence the decision of purchasing flood assurance, include: income level, education level, house distance with river, building election with road, flood frequency experience, flood prediction, perception on insurance company, and perception towards government effort in handling flood. The analysis was done by using logistic regression model, and to estimate model parameters, it is done with genetic algorithm. The results of the analysis shows that eight variables analysed significantly influence the demand of flood insurance. These results are expected to be considered for insurance companies, to influence the decision of the community to be willing to buy flood insurance.
Wang, Ningjian; Han, Bing; Li, Qin; Chen, Yi; Chen, Yingchao; Xia, Fangzhen; Lin, Dongping; Jensen, Michael D; Lu, Yingli
2015-07-16
To date, no study has explored the association between androgen levels and 25-hydroxyvitamin D (25(OH)D) levels in Chinese men. We aimed to investigate the relationship between 25(OH)D levels and total and free testosterone (T), sex hormone binding globulin (SHBG), estradiol, and hypogonadism in Chinese men. Our data, which were based on the population, were collected from 16 sites in East China. There were 2,854 men enrolled in the study, with a mean (SD) age of 53.0 (13.5) years. Hypogonadism was defined as total T <11.3 nmol/L or free T <22.56 pmol/L. The 25(OH)D, follicle-stimulating hormone, luteinizing hormone, total T, estradiol and SHBG were measured using chemiluminescence and free T by enzyme-linked immune-sorbent assay. The associations between 25(OH)D and reproductive hormones and hypogonadism were analyzed using linear regression and binary logistic regression analyses, respectively. A total of 713 (25.0 %) men had hypogonadism with significantly lower 25(OH)D levels but greater BMI and HOMA-IR. Using linear regression, after fully adjusting for age, residence area, economic status, smoking, BMI, HOMA-IR, diabetes and systolic pressure, 25(OH)D was associated with total T and estradiol (P < 0.05). In the logistic regression analyses, increased quartiles of 25(OH)D were associated with significantly decreased odds ratios of hypogonadism (P for trend <0.01). This association, which was considerably attenuated by BMI and HOMA-IR, persisted in the fully adjusted model (P for trend <0.01) in which for the lowest compared with the highest quartile of 25(OH)D, the odds ratio of hypogonadism was 1.50 (95 % CI, 1.14, 1.97). A lower vitamin D level was associated with a higher prevalence of hypogonadism in Chinese men. This association might, in part, be explained by adiposity and insulin resistance and warrants additional investigation.
Kesselmeier, Miriam; Lorenzo Bermejo, Justo
2017-11-01
Logistic regression is the most common technique used for genetic case-control association studies. A disadvantage of standard maximum likelihood estimators of the genotype relative risk (GRR) is their strong dependence on outlier subjects, for example, patients diagnosed at unusually young age. Robust methods are available to constrain outlier influence, but they are scarcely used in genetic studies. This article provides a non-intimidating introduction to robust logistic regression, and investigates its benefits and limitations in genetic association studies. We applied the bounded Huber and extended the R package 'robustbase' with the re-descending Hampel functions to down-weight outlier influence. Computer simulations were carried out to assess the type I error rate, mean squared error (MSE) and statistical power according to major characteristics of the genetic study and investigated markers. Simulations were complemented with the analysis of real data. Both standard and robust estimation controlled type I error rates. Standard logistic regression showed the highest power but standard GRR estimates also showed the largest bias and MSE, in particular for associated rare and recessive variants. For illustration, a recessive variant with a true GRR=6.32 and a minor allele frequency=0.05 investigated in a 1000 case/1000 control study by standard logistic regression resulted in power=0.60 and MSE=16.5. The corresponding figures for Huber-based estimation were power=0.51 and MSE=0.53. Overall, Hampel- and Huber-based GRR estimates did not differ much. Robust logistic regression may represent a valuable alternative to standard maximum likelihood estimation when the focus lies on risk prediction rather than identification of susceptibility variants. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Li, Feiming; Gimpel, John R; Arenson, Ethan; Song, Hao; Bates, Bruce P; Ludwin, Fredric
2014-04-01
Few studies have investigated how well scores from the Comprehensive Osteopathic Medical Licensing Examination-USA (COMLEX-USA) series predict resident outcomes, such as performance on board certification examinations. To determine how well COMLEX-USA predicts performance on the American Osteopathic Board of Emergency Medicine (AOBEM) Part I certification examination. The target study population was first-time examinees who took AOBEM Part I in 2011 and 2012 with matched performances on COMLEX-USA Level 1, Level 2-Cognitive Evaluation (CE), and Level 3. Pearson correlations were computed between AOBEM Part I first-attempt scores and COMLEX-USA performances to measure the association between these examinations. Stepwise linear regression analysis was conducted to predict AOBEM Part I scores by the 3 COMLEX-USA scores. An independent t test was conducted to compare mean COMLEX-USA performances between candidates who passed and who failed AOBEM Part I, and a stepwise logistic regression analysis was used to predict the log-odds of passing AOBEM Part I on the basis of COMLEX-USA scores. Scores from AOBEM Part I had the highest correlation with COMLEX-USA Level 3 scores (.57) and slightly lower correlation with COMLEX-USA Level 2-CE scores (.53). The lowest correlation was between AOBEM Part I and COMLEX-USA Level 1 scores (.47). According to the stepwise regression model, COMLEX-USA Level 1 and Level 2-CE scores, which residency programs often use as selection criteria, together explained 30% of variance in AOBEM Part I scores. Adding Level 3 scores explained 37% of variance. The independent t test indicated that the 397 examinees passing AOBEM Part I performed significantly better than the 54 examinees failing AOBEM Part I in all 3 COMLEX-USA levels (P<.001 for all 3 levels). The logistic regression model showed that COMLEX-USA Level 1 and Level 3 scores predicted the log-odds of passing AOBEM Part I (P=.03 and P<.001, respectively). The present study empirically supported the predictive and discriminant validities of the COMLEX-USA series in relation to the AOBEM Part I certification examination. Although residency programs may use COMLEX-USA Level 1 and Level 2-CE scores as partial criteria in selecting residents, Level 3 scores, though typically not available at the time of application, are actually the most statistically related to performances on AOBEM Part I.
Sampson, Maureen L; Gounden, Verena; van Deventer, Hendrik E; Remaley, Alan T
2016-02-01
The main drawback of the periodic analysis of quality control (QC) material is that test performance is not monitored in time periods between QC analyses, potentially leading to the reporting of faulty test results. The objective of this study was to develop a patient based QC procedure for the more timely detection of test errors. Results from a Chem-14 panel measured on the Beckman LX20 analyzer were used to develop the model. Each test result was predicted from the other 13 members of the panel by multiple regression, which resulted in correlation coefficients between the predicted and measured result of >0.7 for 8 of the 14 tests. A logistic regression model, which utilized the measured test result, the predicted test result, the day of the week and time of day, was then developed for predicting test errors. The output of the logistic regression was tallied by a daily CUSUM approach and used to predict test errors, with a fixed specificity of 90%. The mean average run length (ARL) before error detection by CUSUM-Logistic Regression (CSLR) was 20 with a mean sensitivity of 97%, which was considerably shorter than the mean ARL of 53 (sensitivity 87.5%) for a simple prediction model that only used the measured result for error detection. A CUSUM-Logistic Regression analysis of patient laboratory data can be an effective approach for the rapid and sensitive detection of clinical laboratory errors. Published by Elsevier Inc.
Borgquist, Ola; Wise, Matt P; Nielsen, Niklas; Al-Subaie, Nawaf; Cranshaw, Julius; Cronberg, Tobias; Glover, Guy; Hassager, Christian; Kjaergaard, Jesper; Kuiper, Michael; Smid, Ondrej; Walden, Andrew; Friberg, Hans
2017-08-01
Dysglycemia and glycemic variability are associated with poor outcomes in critically ill patients. Targeted temperature management alters blood glucose homeostasis. We investigated the association between blood glucose concentrations and glycemic variability and the neurologic outcomes of patients randomized to targeted temperature management at 33°C or 36°C after cardiac arrest. Post hoc analysis of the multicenter TTM-trial. Primary outcome of this analysis was neurologic outcome after 6 months, referred to as "Cerebral Performance Category." Thirty-six sites in Europe and Australia. All 939 patients with out-of-hospital cardiac arrest of presumed cardiac cause that had been included in the TTM-trial. Targeted temperature management at 33°C or 36°C. Nonparametric tests as well as multiple logistic regression and mixed effects logistic regression models were used. Median glucose concentrations on hospital admission differed significantly between Cerebral Performance Category outcomes (p < 0.0001). Hyper- and hypoglycemia were associated with poor neurologic outcome (p = 0.001 and p = 0.054). In the multiple logistic regression models, the median glycemic level was an independent predictor of poor Cerebral Performance Category (Cerebral Performance Category, 3-5) with an odds ratio (OR) of 1.13 in the adjusted model (p = 0.008; 95% CI, 1.03-1.24). It was also a predictor in the mixed model, which served as a sensitivity analysis to adjust for the multiple time points. The proportion of hyperglycemia was higher in the 33°C group compared with the 36°C group. Higher blood glucose levels at admission and during the first 36 hours, and higher glycemic variability, were associated with poor neurologic outcome and death. More patients in the 33°C treatment arm had hyperglycemia.
Lanfredi, Mariangela; Candini, Valentina; Buizza, Chiara; Ferrari, Clarissa; Boero, Maria E; Giobbio, Gian M; Goldschmidt, Nicoletta; Greppo, Stefania; Iozzino, Laura; Maggi, Paolo; Melegari, Anna; Pasqualetti, Patrizio; Rossi, Giuseppe; de Girolamo, Giovanni
2014-05-15
Quality of life (QOL) has been considered an important outcome measure in psychiatric research and determinants of QOL have been widely investigated. We aimed at detecting predictors of QOL at baseline and at testing the longitudinal interrelations of the baseline predictors with QOL scores at a 1-year follow-up in a sample of patients living in Residential Facilities (RFs). Logistic regression models were adopted to evaluate the association between WHOQoL-Bref scores and potential determinants of QOL. In addition, all variables significantly associated with QOL domains in the final logistic regression model were included by using the Structural Equation Modeling (SEM). We included 139 patients with a diagnosis of schizophrenia spectrum. In the final logistic regression model level of activity, social support, age, service satisfaction, spiritual well-being and symptoms' severity were identified as predictors of QOL scores at baseline. Longitudinal analyses carried out by SEM showed that 40% of QOL follow-up variability was explained by QOL at baseline, and significant indirect effects toward QOL at follow-up were found for satisfaction with services and for social support. Rehabilitation plans for people with schizophrenia living in RFs should also consider mediators of change in subjective QOL such as satisfaction with mental health services. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Can shoulder dystocia be reliably predicted?
Dodd, Jodie M; Catcheside, Britt; Scheil, Wendy
2012-06-01
To evaluate factors reported to increase the risk of shoulder dystocia, and to evaluate their predictive value at a population level. The South Australian Pregnancy Outcome Unit's population database from 2005 to 2010 was accessed to determine the occurrence of shoulder dystocia in addition to reported risk factors, including age, parity, self-reported ethnicity, presence of diabetes and infant birth weight. Odds ratios (and 95% confidence interval) of shoulder dystocia was calculated for each risk factor, which were then incorporated into a logistic regression model. Test characteristics for each variable in predicting shoulder dystocia were calculated. As a proportion of all births, the reported rate of shoulder dystocia increased significantly from 0.95% in 2005 to 1.38% in 2010 (P = 0.0002). Using a logistic regression model, induction of labour and infant birth weight greater than both 4000 and 4500 g were identified as significant independent predictors of shoulder dystocia. The value of risk factors alone and when incorporated into the logistic regression model was poorly predictive of the occurrence of shoulder dystocia. While there are a number of factors associated with an increased risk of shoulder dystocia, none are of sufficient sensitivity or positive predictive value to allow their use clinically to reliably and accurately identify the occurrence of shoulder dystocia. © 2012 The Authors ANZJOG © 2012 The Royal Australian and New Zealand College of Obstetricians and Gynaecologists.
Machado-Carvalhais, Helenaura P; Ramos-Jorge, Maria L; Auad, Sheyla M; Martins, Laura H P M; Paiva, Saul M; Pordeus, Isabela A
2008-10-01
The aims of this cross-sectional study were to determine the prevalence of occupational accidents with exposure to biological material among undergraduate students of dentistry and to estimate potential risk factors associated with exposure to blood. Data were collected through a self-administered questionnaire (86.4 percent return rate), which was completed by a sample of 286 undergraduate dental students (mean age 22.4 +/-2.4 years). The students were enrolled in the clinical component of the curriculum, which corresponds to the final six semesters of study. Descriptive, bivariate, simple logistic regression and multiple logistic regression (Forward Stepwise Procedure) analyses were performed. The level of statistical significance was set at 5 percent. Percutaneous and mucous exposures to potentially infectious biological material were reported by 102 individuals (35.6 percent); 26.8 percent reported the occurrence of multiple episodes of exposure. The logistic regression analyses revealed that the incomplete use of individual protection equipment (OR=3.7; 95 percent CI 1.5-9.3), disciplines where surgical procedures are carried out (OR=16.3; 95 percent CI 7.1-37.2), and handling sharp instruments (OR=4.4; 95 percent CI 2.1-9.1), more specifically, hollow-bore needles (OR=6.8; 95 percent CI 2.1-19.0), were independently associated with exposure to blood. Policies of reviewing the procedures during clinical practice are recommended in order to reduce occupational exposure.
Kim, Yoonsang; Choi, Young-Ku; Emery, Sherry
2013-08-01
Several statistical packages are capable of estimating generalized linear mixed models and these packages provide one or more of three estimation methods: penalized quasi-likelihood, Laplace, and Gauss-Hermite. Many studies have investigated these methods' performance for the mixed-effects logistic regression model. However, the authors focused on models with one or two random effects and assumed a simple covariance structure between them, which may not be realistic. When there are multiple correlated random effects in a model, the computation becomes intensive, and often an algorithm fails to converge. Moreover, in our analysis of smoking status and exposure to anti-tobacco advertisements, we have observed that when a model included multiple random effects, parameter estimates varied considerably from one statistical package to another even when using the same estimation method. This article presents a comprehensive review of the advantages and disadvantages of each estimation method. In addition, we compare the performances of the three methods across statistical packages via simulation, which involves two- and three-level logistic regression models with at least three correlated random effects. We apply our findings to a real dataset. Our results suggest that two packages-SAS GLIMMIX Laplace and SuperMix Gaussian quadrature-perform well in terms of accuracy, precision, convergence rates, and computing speed. We also discuss the strengths and weaknesses of the two packages in regard to sample sizes.
Kim, Yoonsang; Emery, Sherry
2013-01-01
Several statistical packages are capable of estimating generalized linear mixed models and these packages provide one or more of three estimation methods: penalized quasi-likelihood, Laplace, and Gauss-Hermite. Many studies have investigated these methods’ performance for the mixed-effects logistic regression model. However, the authors focused on models with one or two random effects and assumed a simple covariance structure between them, which may not be realistic. When there are multiple correlated random effects in a model, the computation becomes intensive, and often an algorithm fails to converge. Moreover, in our analysis of smoking status and exposure to anti-tobacco advertisements, we have observed that when a model included multiple random effects, parameter estimates varied considerably from one statistical package to another even when using the same estimation method. This article presents a comprehensive review of the advantages and disadvantages of each estimation method. In addition, we compare the performances of the three methods across statistical packages via simulation, which involves two- and three-level logistic regression models with at least three correlated random effects. We apply our findings to a real dataset. Our results suggest that two packages—SAS GLIMMIX Laplace and SuperMix Gaussian quadrature—perform well in terms of accuracy, precision, convergence rates, and computing speed. We also discuss the strengths and weaknesses of the two packages in regard to sample sizes. PMID:24288415
Li, Yuan; Wu, Qun Hong; Jiao, Ming Li; Fan, Xiao Hong; Hu, Quan; Hao, Yan Hua; Liu, Ruo Hong; Zhang, Wei; Cui, Yu; Han, Li Yuan
2015-01-01
To evaluate whether the adiponectin gene is associated with diabetic retinopathy (DR) risk and interaction with environmental factors modifies the DR risk, and to investigate the relationship between serum adiponectin levels and DR. Four adiponectin polymorphisms were evaluated in 372 DR cases and 145 controls. Differences in environmental factors between cases and controls were evaluated by unconditional logistic regression analysis. The model-free multifactor dimensionality reduction method and traditional multiple regression models were applied to explore interactions between the polymorphisms and environmental factors. Using the Bonferroni method, we found no significant associations between four adiponectin polymorphisms and DR susceptibility. Multivariate logistic regression found that physical activity played a protective role in the progress of DR, whereas family history of diabetes (odds ratio 1.75) and insulin therapy (odds ratio 1.78) were associated with an increased risk for DR. The interaction between the C-11377 G (rs266729) polymorphism and insulin therapy might be associated with DR risk. Family history of diabetes combined with insulin therapy also increased the risk of DR. No adiponectin gene polymorphisms influenced the serum adiponectin levels. Serum adiponectin levels did not differ between the DR group and non-DR group. No significant association was identified between four adiponectin polymorphisms and DR susceptibility after stringent Bonferroni correction. The interaction between C-11377G (rs266729) polymorphism and insulin therapy, as well as the interaction between family history of diabetes and insulin therapy, might be associated with DR susceptibility.
Fertility desires of Yoruba couples of South-western Nigeria.
Oyediran, Kolawole Azeez
2006-09-01
Using the matched wife-husband (763) sample from the data collected from Ogbomoso and Iseyin towns in Oyo State, Nigeria, this paper examines factors associated with couples' fertility intention. The analysis used logistic regression models for predicting the effects of selected socioeconomic background characteristics on a couple's fertility intention. Results indicate high levels of concurrence among husbands and wives on fertility intention. Where differences exist, husbands are more pronatalists than their wives. About 87% of pairs of partners reported similar fertility preferences. Of these couples, 59.5% wanted more children while only 27.8% reported otherwise. The logistic regression models indicated that a couple's fertility intention was associated with age, education, place of residence, frequency of television-watching and number of living children. Therefore, programme interventions aimed at promoting fertility reduction in Nigeria should convey fertility regulation messages to both husbands and wives.
Cubbin, Catherine; Heck, Katherine; Powell, Tara; Marchi, Kristen; Braveman, Paula
2015-01-01
We examined racial/ethnic disparities in depressive symptoms during pregnancy among a population-based sample of childbearing women in California (N = 24,587). We hypothesized that these racial/ethnic disparities would be eliminated when comparing women with similar incomes and neighborhood poverty environments. Neighborhood poverty trajectory descriptions were linked with survey data measuring age, parity, race/ethnicity, marital status, education, income, and depressive symptoms. We constructed logistic regression models among the overall sample to examine both crude and adjusted racial/ethnic disparities in feeling depressed. Next, stratified adjusted logistic regression models were constructed to examine racial/ethnic disparities in feeling depressed among women of similar income levels living in similar neighborhood poverty environments. We found that racial/ethnic disparities in feeling depressed remained only among women who were not poor themselves and who lived in long-term moderate or low poverty neighborhoods.
Binary logistic regression modelling: Measuring the probability of relapse cases among drug addict
NASA Astrophysics Data System (ADS)
Ismail, Mohd Tahir; Alias, Siti Nor Shadila
2014-07-01
For many years Malaysia faced the drug addiction issues. The most serious case is relapse phenomenon among treated drug addict (drug addict who have under gone the rehabilitation programme at Narcotic Addiction Rehabilitation Centre, PUSPEN). Thus, the main objective of this study is to find the most significant factor that contributes to relapse to happen. The binary logistic regression analysis was employed to model the relationship between independent variables (predictors) and dependent variable. The dependent variable is the status of the drug addict either relapse, (Yes coded as 1) or not, (No coded as 0). Meanwhile the predictors involved are age, age at first taking drug, family history, education level, family crisis, community support and self motivation. The total of the sample is 200 which the data are provided by AADK (National Antidrug Agency). The finding of the study revealed that age and self motivation are statistically significant towards the relapse cases..
Bossola, Maurizio; Vulpio, Carlo; Colacicco, Luigi; Scribano, Donata; Zuppi, Cecilia; Tazza, Luigi
2012-02-11
The aim of our study was to measure reactive oxygen metabolites (ROMs) in chronic hemodialysis (HD) patients and evaluate the possible association with cardiovascular disease (CVD) and mortality. We measured ROMs in 76 HD patients and correlated with CVD, cardiovascular (CV) events in the follow-up and all-cause and CVD-related mortality. The levels of ROMs presented a median value of 270 (238.2-303.2) CARR U (interquartile range). We created a ROC curve (ROMs levels vs. CVD) and we identified a cut-off point of 273 CARR U. Patients with ROMs levels ≥273 CARR U were significantly older, had higher C-reactive protein levels and lower creatinine concentrations. The prevalence of CVD was higher in patients with ROMs levels ≥273 (87.1%) than in those with ROMs levels <273 CARR U (17.7%; p<0.0001). ROMs levels were significantly higher in patients with CVD (317±63.8) than in those without (242.7±49.1; p<0.0001). At multiple regression analysis, age, creatinine and C-reactive protein were independent factors associated with ROMs. At multiple logistic regression analysis the association between ROMs and CVD was independent (OR: 1.02, 95% CI: 1.00-1.05; p=0.03). Twenty six patients developed cardiovascular (CV) events during the follow-up. Of these, seven were in the group with ROMs levels <273 CARR U and 19 in the group with ROMs levels ≥273 CARR U. The logistic regression analysis showed that both age (OR: 1.06, 95% CI: 1.01-1.12; p=0.013) and ROMs levels (OR: 1.10, 95% CI: 1.00-1.02; p=0.045) were independently associated with CV events in the follow-up. ROMs are independently associated with CVD and predict CV events in chronic HD patients.
Mocellin, Simone; Ambrosi, Alessandro; Montesco, Maria Cristina; Foletto, Mirto; Zavagno, Giorgio; Nitti, Donato; Lise, Mario; Rossi, Carlo Riccardo
2006-08-01
Currently, approximately 80% of melanoma patients undergoing sentinel node biopsy (SNB) have negative sentinel lymph nodes (SLNs), and no prediction system is reliable enough to be implemented in the clinical setting to reduce the number of SNB procedures. In this study, the predictive power of support vector machine (SVM)-based statistical analysis was tested. The clinical records of 246 patients who underwent SNB at our institution were used for this analysis. The following clinicopathologic variables were considered: the patient's age and sex and the tumor's histological subtype, Breslow thickness, Clark level, ulceration, mitotic index, lymphocyte infiltration, regression, angiolymphatic invasion, microsatellitosis, and growth phase. The results of SVM-based prediction of SLN status were compared with those achieved with logistic regression. The SLN positivity rate was 22% (52 of 234). When the accuracy was > or = 80%, the negative predictive value, positive predictive value, specificity, and sensitivity were 98%, 54%, 94%, and 77% and 82%, 41%, 69%, and 93% by using SVM and logistic regression, respectively. Moreover, SVM and logistic regression were associated with a diagnostic error and an SNB percentage reduction of (1) 1% and 60% and (2) 15% and 73%, respectively. The results from this pilot study suggest that SVM-based prediction of SLN status might be evaluated as a prognostic method to avoid the SNB procedure in 60% of patients currently eligible, with a very low error rate. If validated in larger series, this strategy would lead to obvious advantages in terms of both patient quality of life and costs for the health care system.
Nonconvex Sparse Logistic Regression With Weakly Convex Regularization
NASA Astrophysics Data System (ADS)
Shen, Xinyue; Gu, Yuantao
2018-06-01
In this work we propose to fit a sparse logistic regression model by a weakly convex regularized nonconvex optimization problem. The idea is based on the finding that a weakly convex function as an approximation of the $\\ell_0$ pseudo norm is able to better induce sparsity than the commonly used $\\ell_1$ norm. For a class of weakly convex sparsity inducing functions, we prove the nonconvexity of the corresponding sparse logistic regression problem, and study its local optimality conditions and the choice of the regularization parameter to exclude trivial solutions. Despite the nonconvexity, a method based on proximal gradient descent is used to solve the general weakly convex sparse logistic regression, and its convergence behavior is studied theoretically. Then the general framework is applied to a specific weakly convex function, and a necessary and sufficient local optimality condition is provided. The solution method is instantiated in this case as an iterative firm-shrinkage algorithm, and its effectiveness is demonstrated in numerical experiments by both randomly generated and real datasets.
Campos-Filho, N; Franco, E L
1989-02-01
A frequent procedure in matched case-control studies is to report results from the multivariate unmatched analyses if they do not differ substantially from the ones obtained after conditioning on the matching variables. Although conceptually simple, this rule requires that an extensive series of logistic regression models be evaluated by both the conditional and unconditional maximum likelihood methods. Most computer programs for logistic regression employ only one maximum likelihood method, which requires that the analyses be performed in separate steps. This paper describes a Pascal microcomputer (IBM PC) program that performs multiple logistic regression by both maximum likelihood estimation methods, which obviates the need for switching between programs to obtain relative risk estimates from both matched and unmatched analyses. The program calculates most standard statistics and allows factoring of categorical or continuous variables by two distinct methods of contrast. A built-in, descriptive statistics option allows the user to inspect the distribution of cases and controls across categories of any given variable.
Comparison of cranial sex determination by discriminant analysis and logistic regression.
Amores-Ampuero, Anabel; Alemán, Inmaculada
2016-04-05
Various methods have been proposed for estimating dimorphism. The objective of this study was to compare sex determination results from cranial measurements using discriminant analysis or logistic regression. The study sample comprised 130 individuals (70 males) of known sex, age, and cause of death from San José cemetery in Granada (Spain). Measurements of 19 neurocranial dimensions and 11 splanchnocranial dimensions were subjected to discriminant analysis and logistic regression, and the percentages of correct classification were compared between the sex functions obtained with each method. The discriminant capacity of the selected variables was evaluated with a cross-validation procedure. The percentage accuracy with discriminant analysis was 78.2% for the neurocranium (82.4% in females and 74.6% in males) and 73.7% for the splanchnocranium (79.6% in females and 68.8% in males). These percentages were higher with logistic regression analysis: 85.7% for the neurocranium (in both sexes) and 94.1% for the splanchnocranium (100% in females and 91.7% in males).
Zhang, Dongdong; Chen, Ling; Yin, Dan; Miao, Jinping; Sun, Yehuan
2014-07-01
To explore the correlation between suicide ideation and family function & negative life events, as well as other influential factors in adolescents, thus present a theoretical base for clinicians and school staff to develop intervention for those problems. By adopting current situation random sampling method, Self-Rating Idea of Suicide Scale, Adolescent Self-Rating Life Events Check List and Family APGAR Index were used to assess adolescents at random in a hygiene vocational school in Changzhou City, Jiangsu Province and a collage in Wuhu City, Anhui Province. 3700 questionnaires were granted, 3675 questionnaires were collected, among which 3620 were valid. Chi-square test, t-test, and univariate logistic regression were employed in univariate analysis, multivariate logistic regression was used in multivariate analysis. The detection rate of suicide ideation is 7.0%, and the top five suicide ideation characteristics were: poor academic performance (33.6%), serious family functional impairment (25.8%), lower-middle academic performance (11.7%), bad economic conditions (10.8%) and study in Grade Three (9.9%). Multiple logistic regression showed that the following three high-level stress amount in negative life events are most crucial for suicide ideation. They are "relationships" (OR = 1.135, 95% CI 1.071 - 1. 202), "academic pressure" (OR = 1.169, 95% CI 1.101 - 1.241), and "external events" (OR = 1.278, 95% CI 1.187 - 1.376). What' s more, the stress of attending higher grades (OR = 1.980, 95% CI 1.302 - 3.008), poor academic performance (OR = 7.206, 95% CI 1.745 - 9.789), moderate family functional impairment (OR = 2.562, 95% CI 1.527 - 2.892) and its serious level (OR = 8.287, 95% CI 3.154 - 6.917) are also influential factors for suicide ideation. Severe family functional impairment and high-level stress amount of negative life events produced the main factors of suicide ideation. Therefore, necessary and sufficient support should be given to adolescents by families and schools.
Hill, Andrew; Loh, Po-Ru; Bharadwaj, Ragu B.; Pons, Pascal; Shang, Jingbo; Guinan, Eva; Lakhani, Karim; Kilty, Iain
2017-01-01
Abstract Background: The association of differing genotypes with disease-related phenotypic traits offers great potential to both help identify new therapeutic targets and support stratification of patients who would gain the greatest benefit from specific drug classes. Development of low-cost genotyping and sequencing has made collecting large-scale genotyping data routine in population and therapeutic intervention studies. In addition, a range of new technologies is being used to capture numerous new and complex phenotypic descriptors. As a result, genotype and phenotype datasets have grown exponentially. Genome-wide association studies associate genotypes and phenotypes using methods such as logistic regression. As existing tools for association analysis limit the efficiency by which value can be extracted from increasing volumes of data, there is a pressing need for new software tools that can accelerate association analyses on large genotype-phenotype datasets. Results: Using open innovation (OI) and contest-based crowdsourcing, the logistic regression analysis in a leading, community-standard genetics software package (PLINK 1.07) was substantially accelerated. OI allowed us to do this in <6 months by providing rapid access to highly skilled programmers with specialized, difficult-to-find skill sets. Through a crowd-based contest a combination of computational, numeric, and algorithmic approaches was identified that accelerated the logistic regression in PLINK 1.07 by 18- to 45-fold. Combining contest-derived logistic regression code with coarse-grained parallelization, multithreading, and associated changes to data initialization code further developed through distributed innovation, we achieved an end-to-end speedup of 591-fold for a data set size of 6678 subjects by 645 863 variants, compared to PLINK 1.07's logistic regression. This represents a reduction in run time from 4.8 hours to 29 seconds. Accelerated logistic regression code developed in this project has been incorporated into the PLINK2 project. Conclusions: Using iterative competition-based OI, we have developed a new, faster implementation of logistic regression for genome-wide association studies analysis. We present lessons learned and recommendations on running a successful OI process for bioinformatics. PMID:28327993
Hill, Andrew; Loh, Po-Ru; Bharadwaj, Ragu B; Pons, Pascal; Shang, Jingbo; Guinan, Eva; Lakhani, Karim; Kilty, Iain; Jelinsky, Scott A
2017-05-01
The association of differing genotypes with disease-related phenotypic traits offers great potential to both help identify new therapeutic targets and support stratification of patients who would gain the greatest benefit from specific drug classes. Development of low-cost genotyping and sequencing has made collecting large-scale genotyping data routine in population and therapeutic intervention studies. In addition, a range of new technologies is being used to capture numerous new and complex phenotypic descriptors. As a result, genotype and phenotype datasets have grown exponentially. Genome-wide association studies associate genotypes and phenotypes using methods such as logistic regression. As existing tools for association analysis limit the efficiency by which value can be extracted from increasing volumes of data, there is a pressing need for new software tools that can accelerate association analyses on large genotype-phenotype datasets. Using open innovation (OI) and contest-based crowdsourcing, the logistic regression analysis in a leading, community-standard genetics software package (PLINK 1.07) was substantially accelerated. OI allowed us to do this in <6 months by providing rapid access to highly skilled programmers with specialized, difficult-to-find skill sets. Through a crowd-based contest a combination of computational, numeric, and algorithmic approaches was identified that accelerated the logistic regression in PLINK 1.07 by 18- to 45-fold. Combining contest-derived logistic regression code with coarse-grained parallelization, multithreading, and associated changes to data initialization code further developed through distributed innovation, we achieved an end-to-end speedup of 591-fold for a data set size of 6678 subjects by 645 863 variants, compared to PLINK 1.07's logistic regression. This represents a reduction in run time from 4.8 hours to 29 seconds. Accelerated logistic regression code developed in this project has been incorporated into the PLINK2 project. Using iterative competition-based OI, we have developed a new, faster implementation of logistic regression for genome-wide association studies analysis. We present lessons learned and recommendations on running a successful OI process for bioinformatics. © The Author 2017. Published by Oxford University Press.
Lin, Chao-Cheng; Bai, Ya-Mei; Chen, Jen-Yeu; Hwang, Tzung-Jeng; Chen, Tzu-Ting; Chiu, Hung-Wen; Li, Yu-Chuan
2010-03-01
Metabolic syndrome (MetS) is an important side effect of second-generation antipsychotics (SGAs). However, many SGA-treated patients with MetS remain undetected. In this study, we trained and validated artificial neural network (ANN) and multiple logistic regression models without biochemical parameters to rapidly identify MetS in patients with SGA treatment. A total of 383 patients with a diagnosis of schizophrenia or schizoaffective disorder (DSM-IV criteria) with SGA treatment for more than 6 months were investigated to determine whether they met the MetS criteria according to the International Diabetes Federation. The data for these patients were collected between March 2005 and September 2005. The input variables of ANN and logistic regression were limited to demographic and anthropometric data only. All models were trained by randomly selecting two-thirds of the patient data and were internally validated with the remaining one-third of the data. The models were then externally validated with data from 69 patients from another hospital, collected between March 2008 and June 2008. The area under the receiver operating characteristic curve (AUC) was used to measure the performance of all models. Both the final ANN and logistic regression models had high accuracy (88.3% vs 83.6%), sensitivity (93.1% vs 86.2%), and specificity (86.9% vs 83.8%) to identify MetS in the internal validation set. The mean +/- SD AUC was high for both the ANN and logistic regression models (0.934 +/- 0.033 vs 0.922 +/- 0.035, P = .63). During external validation, high AUC was still obtained for both models. Waist circumference and diastolic blood pressure were the common variables that were left in the final ANN and logistic regression models. Our study developed accurate ANN and logistic regression models to detect MetS in patients with SGA treatment. The models are likely to provide a noninvasive tool for large-scale screening of MetS in this group of patients. (c) 2010 Physicians Postgraduate Press, Inc.
Bayesian logistic regression in detection of gene-steroid interaction for cancer at PDLIM5 locus.
Wang, Ke-Sheng; Owusu, Daniel; Pan, Yue; Xie, Changchun
2016-06-01
The PDZ and LIM domain 5 (PDLIM5) gene may play a role in cancer, bipolar disorder, major depression, alcohol dependence and schizophrenia; however, little is known about the interaction effect of steroid and PDLIM5 gene on cancer. This study examined 47 single-nucleotide polymorphisms (SNPs) within the PDLIM5 gene in the Marshfield sample with 716 cancer patients (any diagnosed cancer, excluding minor skin cancer) and 2848 noncancer controls. Multiple logistic regression model in PLINK software was used to examine the association of each SNP with cancer. Bayesian logistic regression in PROC GENMOD in SAS statistical software, ver. 9.4 was used to detect gene- steroid interactions influencing cancer. Single marker analysis using PLINK identified 12 SNPs associated with cancer (P< 0.05); especially, SNP rs6532496 revealed the strongest association with cancer (P = 6.84 × 10⁻³); while the next best signal was rs951613 (P = 7.46 × 10⁻³). Classic logistic regression in PROC GENMOD showed that both rs6532496 and rs951613 revealed strong gene-steroid interaction effects (OR=2.18, 95% CI=1.31-3.63 with P = 2.9 × 10⁻³ for rs6532496 and OR=2.07, 95% CI=1.24-3.45 with P = 5.43 × 10⁻³ for rs951613, respectively). Results from Bayesian logistic regression showed stronger interaction effects (OR=2.26, 95% CI=1.2-3.38 for rs6532496 and OR=2.14, 95% CI=1.14-3.2 for rs951613, respectively). All the 12 SNPs associated with cancer revealed significant gene-steroid interaction effects (P < 0.05); whereas 13 SNPs showed gene-steroid interaction effects without main effect on cancer. SNP rs4634230 revealed the strongest gene-steroid interaction effect (OR=2.49, 95% CI=1.5-4.13 with P = 4.0 × 10⁻⁴ based on the classic logistic regression and OR=2.59, 95% CI=1.4-3.97 from Bayesian logistic regression; respectively). This study provides evidence of common genetic variants within the PDLIM5 gene and interactions between PLDIM5 gene polymorphisms and steroid use influencing cancer.
Li, Baoyue; Lingsma, Hester F; Steyerberg, Ewout W; Lesaffre, Emmanuel
2011-05-23
Logistic random effects models are a popular tool to analyze multilevel also called hierarchical data with a binary or ordinal outcome. Here, we aim to compare different statistical software implementations of these models. We used individual patient data from 8509 patients in 231 centers with moderate and severe Traumatic Brain Injury (TBI) enrolled in eight Randomized Controlled Trials (RCTs) and three observational studies. We fitted logistic random effects regression models with the 5-point Glasgow Outcome Scale (GOS) as outcome, both dichotomized as well as ordinal, with center and/or trial as random effects, and as covariates age, motor score, pupil reactivity or trial. We then compared the implementations of frequentist and Bayesian methods to estimate the fixed and random effects. Frequentist approaches included R (lme4), Stata (GLLAMM), SAS (GLIMMIX and NLMIXED), MLwiN ([R]IGLS) and MIXOR, Bayesian approaches included WinBUGS, MLwiN (MCMC), R package MCMCglmm and SAS experimental procedure MCMC.Three data sets (the full data set and two sub-datasets) were analysed using basically two logistic random effects models with either one random effect for the center or two random effects for center and trial. For the ordinal outcome in the full data set also a proportional odds model with a random center effect was fitted. The packages gave similar parameter estimates for both the fixed and random effects and for the binary (and ordinal) models for the main study and when based on a relatively large number of level-1 (patient level) data compared to the number of level-2 (hospital level) data. However, when based on relatively sparse data set, i.e. when the numbers of level-1 and level-2 data units were about the same, the frequentist and Bayesian approaches showed somewhat different results. The software implementations differ considerably in flexibility, computation time, and usability. There are also differences in the availability of additional tools for model evaluation, such as diagnostic plots. The experimental SAS (version 9.2) procedure MCMC appeared to be inefficient. On relatively large data sets, the different software implementations of logistic random effects regression models produced similar results. Thus, for a large data set there seems to be no explicit preference (of course if there is no preference from a philosophical point of view) for either a frequentist or Bayesian approach (if based on vague priors). The choice for a particular implementation may largely depend on the desired flexibility, and the usability of the package. For small data sets the random effects variances are difficult to estimate. In the frequentist approaches the MLE of this variance was often estimated zero with a standard error that is either zero or could not be determined, while for Bayesian methods the estimates could depend on the chosen "non-informative" prior of the variance parameter. The starting value for the variance parameter may be also critical for the convergence of the Markov chain.
Deletion Diagnostics for Alternating Logistic Regressions
Preisser, John S.; By, Kunthel; Perin, Jamie; Qaqish, Bahjat F.
2013-01-01
Deletion diagnostics are introduced for the regression analysis of clustered binary outcomes estimated with alternating logistic regressions, an implementation of generalized estimating equations (GEE) that estimates regression coefficients in a marginal mean model and in a model for the intracluster association given by the log odds ratio. The diagnostics are developed within an estimating equations framework that recasts the estimating functions for association parameters based upon conditional residuals into equivalent functions based upon marginal residuals. Extensions of earlier work on GEE diagnostics follow directly, including computational formulae for one-step deletion diagnostics that measure the influence of a cluster of observations on the estimated regression parameters and on the overall marginal mean or association model fit. The diagnostic formulae are evaluated with simulations studies and with an application concerning an assessment of factors associated with health maintenance visits in primary care medical practices. The application and the simulations demonstrate that the proposed cluster-deletion diagnostics for alternating logistic regressions are good approximations of their exact fully iterated counterparts. PMID:22777960
Tobón-Arroyave, Sergio I; Isaza-Guzmán, Diana M; Restrepo-Cadavid, Eliana M; Zapata-Molina, Sandra M; Martínez-Pabón, María C
2012-12-01
To determine the variations in salivary concentrations of sRANKL, osteoprotegerin (OPG) and its ratio, regarding the periodontal status. Ninety-seven chronic periodontitis (CP) subjects and 43 healthy controls were selected. Periodontal status was assessed based on full-mouth clinical periodontal measurements. sRANKL and OPG salivary levels were analysed by ELISA. The association between these analytes and its ratio with CP was analysed individually and adjusted for confounding using a binary logistic regression model. sRANKL and sRANKL/OPG ratio were increased, whereas OPG was decreased in CP compared with healthy controls subjects. Although univariate analysis revealed a positive association of sRANKL salivary levels ≥6 pg/ml, OPG salivary levels ≤131 pg/ml and sRANKL/OPG ratio ≥0.062 with CP, after logistic regression analysis only the latter parameter was strongly and independently associated with disease status. Confounding and interaction effects of ageing and smoking habit on sRANKL and OPG levels could be noted. Although salivary concentrations of sRANKL, OPG and its ratio may act as indicators of the amount/extent of periodontal breakdown, the mutual confounding and synergistic biological interactive effects related to ageing and smoking habit of the susceptible host may also promote the tissue destruction in CP. © 2012 John Wiley & Sons A/S.
Knol, Mirjam J; van der Tweel, Ingeborg; Grobbee, Diederick E; Numans, Mattijs E; Geerlings, Mirjam I
2007-10-01
To determine the presence of interaction in epidemiologic research, typically a product term is added to the regression model. In linear regression, the regression coefficient of the product term reflects interaction as departure from additivity. However, in logistic regression it refers to interaction as departure from multiplicativity. Rothman has argued that interaction estimated as departure from additivity better reflects biologic interaction. So far, literature on estimating interaction on an additive scale using logistic regression only focused on dichotomous determinants. The objective of the present study was to provide the methods to estimate interaction between continuous determinants and to illustrate these methods with a clinical example. and results From the existing literature we derived the formulas to quantify interaction as departure from additivity between one continuous and one dichotomous determinant and between two continuous determinants using logistic regression. Bootstrapping was used to calculate the corresponding confidence intervals. To illustrate the theory with an empirical example, data from the Utrecht Health Project were used, with age and body mass index as risk factors for elevated diastolic blood pressure. The methods and formulas presented in this article are intended to assist epidemiologists to calculate interaction on an additive scale between two variables on a certain outcome. The proposed methods are included in a spreadsheet which is freely available at: http://www.juliuscenter.nl/additive-interaction.xls.
2010-01-01
relationship between Ad-36 exposure and (1) obesity, and (2) levels of serum cholesterol and triglycerides . In this study there was no association in...value 0.0075), female gender (P-value 0.036), and a lower frequency of high levels of low- density lipoproteins (P-value 0.013). Logistic regression...levels of / cholesterol and triglycerides . There was no association in either case. Unanticipated relationships between Ad-36 exposure and age, race
A nonparametric multiple imputation approach for missing categorical data.
Zhou, Muhan; He, Yulei; Yu, Mandi; Hsu, Chiu-Hsieh
2017-06-06
Incomplete categorical variables with more than two categories are common in public health data. However, most of the existing missing-data methods do not use the information from nonresponse (missingness) probabilities. We propose a nearest-neighbour multiple imputation approach to impute a missing at random categorical outcome and to estimate the proportion of each category. The donor set for imputation is formed by measuring distances between each missing value with other non-missing values. The distance function is calculated based on a predictive score, which is derived from two working models: one fits a multinomial logistic regression for predicting the missing categorical outcome (the outcome model) and the other fits a logistic regression for predicting missingness probabilities (the missingness model). A weighting scheme is used to accommodate contributions from two working models when generating the predictive score. A missing value is imputed by randomly selecting one of the non-missing values with the smallest distances. We conduct a simulation to evaluate the performance of the proposed method and compare it with several alternative methods. A real-data application is also presented. The simulation study suggests that the proposed method performs well when missingness probabilities are not extreme under some misspecifications of the working models. However, the calibration estimator, which is also based on two working models, can be highly unstable when missingness probabilities for some observations are extremely high. In this scenario, the proposed method produces more stable and better estimates. In addition, proper weights need to be chosen to balance the contributions from the two working models and achieve optimal results for the proposed method. We conclude that the proposed multiple imputation method is a reasonable approach to dealing with missing categorical outcome data with more than two levels for assessing the distribution of the outcome. In terms of the choices for the working models, we suggest a multinomial logistic regression for predicting the missing outcome and a binary logistic regression for predicting the missingness probability.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhong, H; Wang, J; Shen, L
Purpose: The purpose of this study is to investigate the relationship between computed tomographic (CT) texture features of primary lesions and metastasis-free survival for rectal cancer patients; and to develop a datamining prediction model using texture features. Methods: A total of 220 rectal cancer patients treated with neoadjuvant chemo-radiotherapy (CRT) were enrolled in this study. All patients underwent CT scans before CRT. The primary lesions on the CT images were delineated by two experienced oncologists. The CT images were filtered by Laplacian of Gaussian (LoG) filters with different filter values (1.0–2.5: from fine to coarse). Both filtered and unfiltered imagesmore » were analyzed using Gray-level Co-occurrence Matrix (GLCM) texture analysis with different directions (transversal, sagittal, and coronal). Totally, 270 texture features with different species, directions and filter values were extracted. Texture features were examined with Student’s t-test for selecting predictive features. Principal Component Analysis (PCA) was performed upon the selected features to reduce the feature collinearity. Artificial neural network (ANN) and logistic regression were applied to establish metastasis prediction models. Results: Forty-six of 220 patients developed metastasis with a follow-up time of more than 2 years. Sixtyseven texture features were significantly different in t-test (p<0.05) between patients with and without metastasis, and 12 of them were extremely significant (p<0.001). The Area-under-the-curve (AUC) of ANN was 0.72, and the concordance index (CI) of logistic regression was 0.71. The predictability of ANN was slightly better than logistic regression. Conclusion: CT texture features of primary lesions are related to metastasisfree survival of rectal cancer patients. Both ANN and logistic regression based models can be developed for prediction.« less
Olson, Scott A.; Brouillette, Michael C.
2006-01-01
A logistic regression equation was developed for estimating the probability of a stream flowing intermittently at unregulated, rural stream sites in Vermont. These determinations can be used for a wide variety of regulatory and planning efforts at the Federal, State, regional, county and town levels, including such applications as assessing fish and wildlife habitats, wetlands classifications, recreational opportunities, water-supply potential, waste-assimilation capacities, and sediment transport. The equation will be used to create a derived product for the Vermont Hydrography Dataset having the streamflow characteristic of 'intermittent' or 'perennial.' The Vermont Hydrography Dataset is Vermont's implementation of the National Hydrography Dataset and was created at a scale of 1:5,000 based on statewide digital orthophotos. The equation was developed by relating field-verified perennial or intermittent status of a stream site during normal summer low-streamflow conditions in the summer of 2005 to selected basin characteristics of naturally flowing streams in Vermont. The database used to develop the equation included 682 stream sites with drainage areas ranging from 0.05 to 5.0 square miles. When the 682 sites were observed, 126 were intermittent (had no flow at the time of the observation) and 556 were perennial (had flowing water at the time of the observation). The results of the logistic regression analysis indicate that the probability of a stream having intermittent flow in Vermont is a function of drainage area, elevation of the site, the ratio of basin relief to basin perimeter, and the areal percentage of well- and moderately well-drained soils in the basin. Using a probability cutpoint (a lower probability indicates the site has perennial flow and a higher probability indicates the site has intermittent flow) of 0.5, the logistic regression equation correctly predicted the perennial or intermittent status of 116 test sites 85 percent of the time.
ERIC Educational Resources Information Center
Osborne, Jason W.
2012-01-01
Logistic regression is slowly gaining acceptance in the social sciences, and fills an important niche in the researcher's toolkit: being able to predict important outcomes that are not continuous in nature. While OLS regression is a valuable tool, it cannot routinely be used to predict outcomes that are binary or categorical in nature. These…
Yao, Ming; Ni, Jun; Zhou, Lixin; Peng, Bin; Zhu, Yicheng; Cui, Liying
2016-01-01
Although increasing evidence suggests that hyperglycemia following acute stroke adversely affects clinical outcome, whether the association between glycaemia and functional outcome varies between stroke patients with\\without pre-diagnosed diabetes remains controversial. We aimed to investigate the relationship between the fasting blood glucose (FBG) and the 6-month functional outcome in a subgroup of SMART cohort and further to assess whether this association varied based on the status of pre-diagnosed diabetes. Data of 2862 patients with acute ischemic stroke (629 with pre-diagnosed diabetics) enrolled from SMART cohort were analyzed. Functional outcome at 6-month post-stroke was measured by modified Rankin Scale (mRS) and categorized as favorable (mRS:0-2) or poor (mRS:3-5). Binary logistic regression model, adjusting for age, gender, educational level, history of hypertension and stroke, baseline NIHSS and treatment group, was used in the whole cohort to evaluate the association between admission FBG and functional outcome. Stratified logistic regression analyses were further performed based on the presence/absence of pre-diabetes history. In the whole cohort, multivariable logistical regression showed that poor functional outcome was associated with elevated FBG (OR1.21 (95%CI 1.07-1.37), p = 0.002), older age (OR1.64 (95% CI1.38-1.94), p<0.001), higher NIHSS (OR2.90 (95%CI 2.52-3.33), p<0.001) and hypertension (OR1.42 (95%CI 1.13-1.98), p = 0.04). Stratified logistical regression analysis showed that the association between FBG and functional outcome remained significant only in patients without pre-diagnosed diabetes (OR1.26 (95%CI 1.03-1.55), p = 0.023), but not in those with premorbid diagnosis of diabetes (p = 0.885). The present results demonstrate a significant association between elevated FBG after stroke and poor functional outcome in patients without pre-diagnosed diabetes, but not in diabetics. This finding confirms the importance of glycemic control during acute phase of ischemic stroke especially in patients without pre-diagnosed diabetes. Further investigation for developing optimal strategies to control blood glucose level in hyperglycemic setting is therefore of great importance. ClinicalTrials.gov NCT00664846.
Predicting Social Trust with Binary Logistic Regression
ERIC Educational Resources Information Center
Adwere-Boamah, Joseph; Hufstedler, Shirley
2015-01-01
This study used binary logistic regression to predict social trust with five demographic variables from a national sample of adult individuals who participated in The General Social Survey (GSS) in 2012. The five predictor variables were respondents' highest degree earned, race, sex, general happiness and the importance of personally assisting…
A Statewide Study of Gang Membership in California Secondary Schools
ERIC Educational Resources Information Center
Estrada, Joey Nuñez, Jr.; Gilreath, Tamika D.; Astor, Ron Avi; Benbenishty, Rami
2016-01-01
To date, there is a paucity of empirical evidence that examines gang membership in schools. Using statewide data of 7th-, 9th-, and 11th-grade students from California, this study focuses on the prevalence of gang membership by county, region, ethnicity, and grade level. Bivariate and multivariate logistic regression analyses were employed with…
Seasonal Variation in Physical Activity among Preschool Children in a Northern Canadian City
ERIC Educational Resources Information Center
Carson, Valerie; Spence, John C.; Cutumisu, Nicoleta; Boule, Normand; Edwards, Joy
2010-01-01
Little research has examined seasonal differences in physical activity (PA) levels among children. Proxy reports of PA were completed by 1,715 parents on their children in Edmonton, Alberta, Canada. Total PA (TPA) minutes were calculated, and each participant was classified as active, somewhat active, or inactive. Logistic regression models were…
ERIC Educational Resources Information Center
Ozen, Hamit
2016-01-01
Experiencing social phobia is an important factor which can hinder academic success during university years. In this study, research of social phobia with several variables is conducted among university students. The research group of the study consists of total 736 students studying at various departments at universities in Turkey. Students are…
Knowledge of Millennium Development Goals among University Faculty in Uganda and Kenya
ERIC Educational Resources Information Center
Wamala, Robert; Nabachwa, Mary Sonko; Chamberlain, Jean; Nakalembe, Eva
2012-01-01
This article examines the level of knowledge of the Millennium Development Goals (MDGs) among university faculty. The assessment is based on data from 197 academic unit or faculty heads randomly selected from universities in Uganda and Kenya. Frequency distributions and logistic regression were used for analysis. Slightly more than one in three…
Exploring Person Fit with an Approach Based on Multilevel Logistic Regression
ERIC Educational Resources Information Center
Walker, A. Adrienne; Engelhard, George, Jr.
2015-01-01
The idea that test scores may not be valid representations of what students know, can do, and should learn next is well known. Person fit provides an important aspect of validity evidence. Person fit analyses at the individual student level are not typically conducted and person fit information is not communicated to educational stakeholders. In…
Who Stays and for How Long: Examining Attrition in Canadian Graduate Programs
ERIC Educational Resources Information Center
DeClou, Lindsay
2016-01-01
Attrition from Canadian graduate programs is a point of concern on a societal, institutional, and individual level. To improve retention in graduate school, a better understanding of what leads to withdrawal needs to be reached. This paper uses logistic regression and discrete-time survival analysis with time-varying covariates to analyze data…
Escaping Poverty: Rural Low-Income Mothers' Opportunity to Pursue Post-Secondary Education
ERIC Educational Resources Information Center
Woodford, Michelle; Mammen, Sheila
2010-01-01
Using human capital theory, this paper identifies the factors that may affect the opportunity for rural low-income mothers to pursue post-secondary education or training in order to escape poverty. Dependent variables used in the logistic regression model included micro-level household variables as well as the effects of state-wide welfare…
Clustering performance comparison using K-means and expectation maximization algorithms.
Jung, Yong Gyu; Kang, Min Soo; Heo, Jun
2014-11-14
Clustering is an important means of data mining based on separating data categories by similar features. Unlike the classification algorithm, clustering belongs to the unsupervised type of algorithms. Two representatives of the clustering algorithms are the K -means and the expectation maximization (EM) algorithm. Linear regression analysis was extended to the category-type dependent variable, while logistic regression was achieved using a linear combination of independent variables. To predict the possibility of occurrence of an event, a statistical approach is used. However, the classification of all data by means of logistic regression analysis cannot guarantee the accuracy of the results. In this paper, the logistic regression analysis is applied to EM clusters and the K -means clustering method for quality assessment of red wine, and a method is proposed for ensuring the accuracy of the classification results.
Zhang, Xinyan; Li, Bingzong; Han, Huiying; Song, Sha; Xu, Hongxia; Hong, Yating; Yi, Nengjun; Zhuang, Wenzhuo
2018-05-10
Multiple myeloma (MM), like other cancers, is caused by the accumulation of genetic abnormalities. Heterogeneity exists in the patients' response to treatments, for example, bortezomib. This urges efforts to identify biomarkers from numerous molecular features and build predictive models for identifying patients that can benefit from a certain treatment scheme. However, previous studies treated the multi-level ordinal drug response as a binary response where only responsive and non-responsive groups are considered. It is desirable to directly analyze the multi-level drug response, rather than combining the response to two groups. In this study, we present a novel method to identify significantly associated biomarkers and then develop ordinal genomic classifier using the hierarchical ordinal logistic model. The proposed hierarchical ordinal logistic model employs the heavy-tailed Cauchy prior on the coefficients and is fitted by an efficient quasi-Newton algorithm. We apply our hierarchical ordinal regression approach to analyze two publicly available datasets for MM with five-level drug response and numerous gene expression measures. Our results show that our method is able to identify genes associated with the multi-level drug response and to generate powerful predictive models for predicting the multi-level response. The proposed method allows us to jointly fit numerous correlated predictors and thus build efficient models for predicting the multi-level drug response. The predictive model for the multi-level drug response can be more informative than the previous approaches. Thus, the proposed approach provides a powerful tool for predicting multi-level drug response and has important impact on cancer studies.
Perioperative factors associated with pressure ulcer development after major surgery
2018-01-01
Background Postoperative pressure ulcers are important indicators of perioperative care quality, and are serious and expensive complications during critical care. This study aimed to identify perioperative risk factors for postoperative pressure ulcers. Methods This retrospective case-control study evaluated 2,498 patients who underwent major surgery. Forty-three patients developed postoperative pressure ulcers and were matched to 86 control patients based on age, sex, surgery, and comorbidities. Results The pressure ulcer group had lower baseline hemoglobin and albumin levels, compared to the control group. The pressure ulcer group also had higher values for lactate levels, blood loss, and number of packed red blood cell (pRBC) units. Univariate analysis revealed that pressure ulcer development was associated with preoperative hemoglobin levels, albumin levels, lactate levels, intraoperative blood loss, number of pRBC units, Acute Physiologic and Chronic Health Evaluation II score, Braden scale score, postoperative ventilator care, and patient restraint. In the multiple logistic regression analysis, only preoperative low albumin levels (odds ratio [OR]: 0.21, 95% CI: 0.05–0.82; P < 0.05) and high lactate levels (OR: 1.70, 95% CI: 1.07–2.71; P < 0.05) were independently associated with pressure ulcer development. A receiver operating characteristic curve was used to assess the predictive power of the logistic regression model, and the area under the curve was 0.88 (95% CI: 0.79–0.97; P < 0.001). Conclusions The present study revealed that preoperative low albumin levels and high lactate levels were significantly associated with pressure ulcer development after surgery. PMID:29441175
Carboxyhemoglobin and methemoglobin levels as prognostic markers in acute pulmonary embolism.
Kakavas, Sotirios; Papanikolaou, Aggeliki; Ballis, Evangelos; Tatsis, Nikolaos; Goga, Christina; Tatsis, Georgios
2015-04-01
Carboxyhemoglobin (COHb) and methemoglobin (MetHb) levels have been associated with a poor outcome in patients with various pathological conditions including cardiovascular diseases. Our aim was to retrospectively assess the prognostic value of arterial COHb and MetHb in patients with acute pulmonary embolism (PE). We conducted a retrospective study of 156 patients admitted in a pulmonary clinic due to acute PE. Measured variables during emergency department evaluation that were retrospectively analyzed included the ratio of the partial pressure of oxygen in arterial blood to the fraction of oxygen in inspired gas, Acute Physiology and Chronic Health Evaluation II score, risk stratification indices, and arterial blood gases. The association between arterial COHb and MetHb levels and disease severity or mortality was evaluated using bivariate tests and logistic regression analysis. Arterial COHb and MetHb levels correlated with Acute Physiology and Chronic Health Evaluation II and pulmonary severity index scores. Furthermore, arterial COHb and MetHb levels were associated with troponin T and N-terminal pro-B-type natriuretic peptide levels. In univariate logistic regression analysis, COHb and MetHb levels were both significantly associated with an increased risk of death. However, in multivariate analysis, only COHb remained significant as an independent predictor of in-hospital mortality. Our preliminary data suggest that arterial COHb and MetHb levels reflect the severity of acute PE, whereas COHb levels are independent predictors of in hospital death in patients in this clinical setting. These findings require further prospective validation. Copyright © 2015 Elsevier Inc. All rights reserved.
Sofi, Nighat Y; Jain, Monika; Kapil, Umesh; Seenu, Vuthaluru; R, Lakshmy; Yadav, Chander P; Pandey, Ravindra M; Sareen, Neha
2018-01-01
The study was conducted with an objective to investigate the association between reproductive factors, nutritional status and serum 25(OH)D levels among women diagnosed with breast cancer (BC). A total of 200 women with BC attending a tertiary healthcare institute of Delhi, India matched with 200 healthy women for age (±2years) and socio economic status were included in the study. Data was collected on socio-demographic profile, reproductive factors, physical activity and dietary intake (24h dietary recall and food frequency questionnaire) using interviewer administered structured questionnaires and standard tools. Non fasting blood samples (5ml) were collected for the biochemical estimation of serum 25(OH)D and calcium levels by chemiluminescent immunoassay and colorimetric assay technique. Data was analyzed by univariable conditional logistic regression and significant variables with (p<0.05), were analyzed in final model by conditional multivariable logistic regression analysis. The mean age of patients at diagnosis of BC was 45±10years. Results of multivariable conditional logistic regression analysis revealed significantly higher odds of BC for reproductive factors like age at marriage (more than 23 years), number of abortions, history or current use of oral contraceptive pills (OCP), with [OR (95% CI)] of [2.4 (1.2-4.9)], [4.0 (1.6-12.6)], [2.4 (1.2-5.0)]. Women with physically light activities and occasional consumption of eggs were found to have higher odds of BC [4.6 (1.6-13.0)] and [3.2 (1.6-6.3)]. Women with serum 25(OH)D levels less than 20ng/ml and calcium levels less than 10.5mg/dl had higher odds of having BC [2.4 (1.2-5.1)] and [3.7 (1.5-8.8)]. A protective effect of urban areas as place of residence and energy intake greater than 50% of Recommended Dietary Allowance (RDA) per day against BC was observed (p<0.05). The findings of the present study revealed a significant association of reproductive and dietary factors in addition to sedentary physical activity and low serum 25(OH)D levels in women diagnosed with BC. Copyright © 2017 Elsevier Ltd. All rights reserved.
The association between maternal antioxidant levels in midpregnancy and preeclampsia.
Cohen, Jacqueline M; Kramer, Michael S; Platt, Robert W; Basso, Olga; Evans, Rhobert W; Kahn, Susan R
2015-11-01
We sought to determine whether midpregnancy antioxidant levels are associated with preeclampsia, overall and by timing of onset. We carried out a case-control study, nested within a cohort of 5337 pregnant women in Montreal, Quebec, Canada. Blood samples obtained at 24-26 weeks were assayed for nonenzymatic antioxidant levels among cases of preeclampsia (n = 111) and unaffected controls (n = 441). We excluded women diagnosed with gestational hypertension only. We used logistic regression with the z-score of each antioxidant level as the main predictor variable for preeclampsia risk. We further stratified early-onset (<34 weeks) and late-onset preeclampsia and carried out multinomial logistic regression. Finally, we assessed associations between antioxidant biomarkers and timing of onset (in weeks) by Cox regression, with appropriate selection weights. We summed levels of correlated biomarkers (r(2) > 0.3) and log-transformed positively skewed distributions. We adjusted for body mass index, nulliparity, preexisting diabetes, hypertension, smoking, and proxies for ethnicity and socioeconomic status. The odds ratios for α-tocopherol, α-tocopherol:cholesterol, lycopene, lutein, and carotenoids (sum of α-carotene, β-carotene, anhydrolutein, α-cryptoxanthin, and β-cryptoxanthin) suggested an inverse association between antioxidant levels and overall preeclampsia risk; however, only lutein was significantly associated with overall preeclampsia in adjusted models (odds ratio, 0.60; 95% confidence interval, 0.46-0.77) per SD. In multinomial logistic models, the relative risk ratio (RRR) estimates for the early-onset subgroup were farther from the null than those for the late-onset subgroup. The ratio of α-tocopherol to cholesterol and retinol were significantly associated with early- but not late-onset preeclampsia: RRRs (95% confidence intervals) for early-onset preeclampsia 0.67 (0.46-0.99) and 1.61 (1.12-2.33), respectively. Lutein was significantly associated with both early- and late-onset subtypes in adjusted models; RRRs 0.53 (0.35-0.80) and 0.62 (0.47-0.82), respectively. Survival analyses confirmed these trends. Most antioxidants were more strongly associated with early-onset preeclampsia, suggesting that oxidative stress may play a greater role in the pathophysiology of early-onset preeclampsia. Alternatively, reverse causality may explain this pattern. Lutein was associated with both early- and late-onset preeclampsia and may be a promising nutrient to consider in preeclampsia prevention trials, if this finding is corroborated. Copyright © 2015 Elsevier Inc. All rights reserved.
Cunningham, Marc; Bock, Ariella; Brown, Niquelle; Sacher, Suzy; Hatch, Benjamin; Inglis, Andrew; Aronovich, Dana
2015-09-01
Contraceptive prevalence rate (CPR) is a vital indicator used by country governments, international donors, and other stakeholders for measuring progress in family planning programs against country targets and global initiatives as well as for estimating health outcomes. Because of the need for more frequent CPR estimates than population-based surveys currently provide, alternative approaches for estimating CPRs are being explored, including using contraceptive logistics data. Using data from the Demographic and Health Surveys (DHS) in 30 countries, population data from the United States Census Bureau International Database, and logistics data from the Procurement Planning and Monitoring Report (PPMR) and the Pipeline Monitoring and Procurement Planning System (PipeLine), we developed and evaluated 3 models to generate country-level, public-sector contraceptive prevalence estimates for injectable contraceptives, oral contraceptives, and male condoms. Models included: direct estimation through existing couple-years of protection (CYP) conversion factors, bivariate linear regression, and multivariate linear regression. Model evaluation consisted of comparing the referent DHS prevalence rates for each short-acting method with the model-generated prevalence rate using multiple metrics, including mean absolute error and proportion of countries where the modeled prevalence rate for each method was within 1, 2, or 5 percentage points of the DHS referent value. For the methods studied, family planning use estimates from public-sector logistics data were correlated with those from the DHS, validating the quality and accuracy of current public-sector logistics data. Logistics data for oral and injectable contraceptives were significantly associated (P<.05) with the referent DHS values for both bivariate and multivariate models. For condoms, however, that association was only significant for the bivariate model. With the exception of the CYP-based model for condoms, models were able to estimate public-sector prevalence rates for each short-acting method to within 2 percentage points in at least 85% of countries. Public-sector contraceptive logistics data are strongly correlated with public-sector prevalence rates for short-acting methods, demonstrating the quality of current logistics data and their ability to provide relatively accurate prevalence estimates. The models provide a starting point for generating interim estimates of contraceptive use when timely survey data are unavailable. All models except the condoms CYP model performed well; the regression models were most accurate but the CYP model offers the simplest calculation method. Future work extending the research to other modern methods, relating subnational logistics data with prevalence rates, and tracking that relationship over time is needed. © Cunningham et al.
Cunningham, Marc; Brown, Niquelle; Sacher, Suzy; Hatch, Benjamin; Inglis, Andrew; Aronovich, Dana
2015-01-01
Background: Contraceptive prevalence rate (CPR) is a vital indicator used by country governments, international donors, and other stakeholders for measuring progress in family planning programs against country targets and global initiatives as well as for estimating health outcomes. Because of the need for more frequent CPR estimates than population-based surveys currently provide, alternative approaches for estimating CPRs are being explored, including using contraceptive logistics data. Methods: Using data from the Demographic and Health Surveys (DHS) in 30 countries, population data from the United States Census Bureau International Database, and logistics data from the Procurement Planning and Monitoring Report (PPMR) and the Pipeline Monitoring and Procurement Planning System (PipeLine), we developed and evaluated 3 models to generate country-level, public-sector contraceptive prevalence estimates for injectable contraceptives, oral contraceptives, and male condoms. Models included: direct estimation through existing couple-years of protection (CYP) conversion factors, bivariate linear regression, and multivariate linear regression. Model evaluation consisted of comparing the referent DHS prevalence rates for each short-acting method with the model-generated prevalence rate using multiple metrics, including mean absolute error and proportion of countries where the modeled prevalence rate for each method was within 1, 2, or 5 percentage points of the DHS referent value. Results: For the methods studied, family planning use estimates from public-sector logistics data were correlated with those from the DHS, validating the quality and accuracy of current public-sector logistics data. Logistics data for oral and injectable contraceptives were significantly associated (P<.05) with the referent DHS values for both bivariate and multivariate models. For condoms, however, that association was only significant for the bivariate model. With the exception of the CYP-based model for condoms, models were able to estimate public-sector prevalence rates for each short-acting method to within 2 percentage points in at least 85% of countries. Conclusions: Public-sector contraceptive logistics data are strongly correlated with public-sector prevalence rates for short-acting methods, demonstrating the quality of current logistics data and their ability to provide relatively accurate prevalence estimates. The models provide a starting point for generating interim estimates of contraceptive use when timely survey data are unavailable. All models except the condoms CYP model performed well; the regression models were most accurate but the CYP model offers the simplest calculation method. Future work extending the research to other modern methods, relating subnational logistics data with prevalence rates, and tracking that relationship over time is needed. PMID:26374805
Delva, J; Spencer, M S; Lin, J K
2000-01-01
This article compares estimates of the relative odds of nitrite use obtained from weighted unconditional logistic regression with estimates obtained from conditional logistic regression after post-stratification and matching of cases with controls by neighborhood of residence. We illustrate these methods by comparing the odds associated with nitrite use among adults of four racial/ethnic groups, with and without a high school education. We used aggregated data from the 1994-B through 1996 National Household Survey on Drug Abuse (NHSDA). Difference between the methods and implications for analysis and inference are discussed.
Atteraya, Madhu Sudhan; Ebrahim, Nasser B; Gnawali, Shreejana
2018-02-01
We examined the prevalence of child maltreatment as measured by the level of physical (moderate to severe) and emotional abuse and child labor, and the associated household level determinants of child maltreatment in Nepal. We used a nationally representative data set from the fifth round of the Nepal Multiple Indicator Cluster Survey (the 2014 NMICS). The main independent variables were household level characteristics. Dependent variables included child experience of moderate to severe physical abuse, emotional abuse, and child labor (domestic work and economic activities). Bivariate analyses and logistic regressions were used to examine the associations between independent and dependent variables. The results showed that nearly half of the children (49.8%) had experienced moderate physical abuse, 21.5% experienced severe physical abuse, and 77.3% experienced emotional abuse. About 27% of the children had engaged in domestic work and 46.7% in various economic activities. At bivariate level, educational level of household's head and household wealth status had shown significant statistical association with child maltreatment (p<0.001). Results from multivariate logistic regressions showed that higher education levels and higher household wealth status protected children from moderate to severe physical abuse, emotional abuse and child labor. In general, child maltreatment is a neglected social issue in Nepal and the high rates of child maltreatment calls for mass awareness programs focusing on parents, and involving all stakeholders including governments, local, and international organizations. Copyright © 2017 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Guntur, R. D.; Lobo, M.
2017-02-01
A research has been carried out to investigate the characteristics of reasons for DOSC and to determine the statistical model explaining factors which influence on the DOSC in the age group 7 - 18 years in East Nusa Tenggara (ENT) Province. Primary data of out of school children had been collected throughout interviews using prepared questionnaires in three selected districts. Data was then analysed using descriptive and logistic regression method. The analysis shows that from the 341 samples, there were 194DOSC. The majority of them were males, lived in the countryside, had farmer parents, had family size of 5, and had mothers with only primary education level. The main reasons of children to drop out from the primary and junior education levels were the inabilities of paying the school fees and the willingness to work in the farms to help their parents. For senior education level, it was because of the unaffordable school tuitions and no desire of children in having good education. Both partial and simultaneous parameter tests in the logistic regression model show that children who lived in countryside, from poor families, males were the three factors that significantly affected the number of DOSC in the group age with odds ratio values 2.48; 2.37; 1.97 respectively.
Hepatotoxicity during Treatment for Tuberculosis in People Living with HIV/AIDS.
Araújo-Mariz, Carolline; Lopes, Edmundo Pessoa; Acioli-Santos, Bartolomeu; Maruza, Magda; Montarroyos, Ulisses Ramos; Ximenes, Ricardo Arraes de Alencar; Lacerda, Heloísa Ramos; Miranda-Filho, Demócrito de Barros; Albuquerque, Maria de Fátima P Militão de
2016-01-01
Hepatotoxicity is frequently reported as an adverse reaction during the treatment of tuberculosis. The aim of this study was to determine the incidence of hepatotoxicity and to identify predictive factors for developing hepatotoxicity after people living with HIV/AIDS (PLWHA) start treatment for tuberculosis. This was a prospective cohort study with PLWHA who were monitored during the first 60 days of tuberculosis treatment in Pernambuco, Brazil. Hepatotoxicity was considered increased levels of aminotransferase, namely those that rose to three times higher than the level before initiating tuberculosis treatment, these levels being associated with symptoms of hepatitis. We conducted a multivariate logistic regression analysis and the magnitude of the associations was expressed by the odds ratio with a confidence interval of 95%. Hepatotoxicity was observed in 53 (30.6%) of the 173 patients who started tuberculosis treatment. The final multivariate logistic regression model demonstrated that the use of fluconazole, malnutrition and the subject being classified as a phenotypically slow acetylator increased the risk of hepatotoxicity significantly. The incidence of hepatotoxicity during treatment for tuberculosis in PLWHA was high. Those classified as phenotypically slow acetylators and as malnourished should be targeted for specific care to reduce the risk of hepatotoxicity during treatment for tuberculosis. The use of fluconazole should be avoided during tuberculosis treatment in PLWHA.
Austin, Peter C; Lee, Douglas S; Steyerberg, Ewout W; Tu, Jack V
2012-01-01
In biomedical research, the logistic regression model is the most commonly used method for predicting the probability of a binary outcome. While many clinical researchers have expressed an enthusiasm for regression trees, this method may have limited accuracy for predicting health outcomes. We aimed to evaluate the improvement that is achieved by using ensemble-based methods, including bootstrap aggregation (bagging) of regression trees, random forests, and boosted regression trees. We analyzed 30-day mortality in two large cohorts of patients hospitalized with either acute myocardial infarction (N = 16,230) or congestive heart failure (N = 15,848) in two distinct eras (1999–2001 and 2004–2005). We found that both the in-sample and out-of-sample prediction of ensemble methods offered substantial improvement in predicting cardiovascular mortality compared to conventional regression trees. However, conventional logistic regression models that incorporated restricted cubic smoothing splines had even better performance. We conclude that ensemble methods from the data mining and machine learning literature increase the predictive performance of regression trees, but may not lead to clear advantages over conventional logistic regression models for predicting short-term mortality in population-based samples of subjects with cardiovascular disease. PMID:22777999
ERIC Educational Resources Information Center
Fidalgo, Angel M.; Alavi, Seyed Mohammad; Amirian, Seyed Mohammad Reza
2014-01-01
This study examines three controversial aspects in differential item functioning (DIF) detection by logistic regression (LR) models: first, the relative effectiveness of different analytical strategies for detecting DIF; second, the suitability of the Wald statistic for determining the statistical significance of the parameters of interest; and…
ERIC Educational Resources Information Center
French, Brian F.; Maller, Susan J.
2007-01-01
Two unresolved implementation issues with logistic regression (LR) for differential item functioning (DIF) detection include ability purification and effect size use. Purification is suggested to control inaccuracies in DIF detection as a result of DIF items in the ability estimate. Additionally, effect size use may be beneficial in controlling…
A Note on Three Statistical Tests in the Logistic Regression DIF Procedure
ERIC Educational Resources Information Center
Paek, Insu
2012-01-01
Although logistic regression became one of the well-known methods in detecting differential item functioning (DIF), its three statistical tests, the Wald, likelihood ratio (LR), and score tests, which are readily available under the maximum likelihood, do not seem to be consistently distinguished in DIF literature. This paper provides a clarifying…
Comparison of Two Approaches for Handling Missing Covariates in Logistic Regression
ERIC Educational Resources Information Center
Peng, Chao-Ying Joanne; Zhu, Jin
2008-01-01
For the past 25 years, methodological advances have been made in missing data treatment. Most published work has focused on missing data in dependent variables under various conditions. The present study seeks to fill the void by comparing two approaches for handling missing data in categorical covariates in logistic regression: the…
Comparison of IRT Likelihood Ratio Test and Logistic Regression DIF Detection Procedures
ERIC Educational Resources Information Center
Atar, Burcu; Kamata, Akihito
2011-01-01
The Type I error rates and the power of IRT likelihood ratio test and cumulative logit ordinal logistic regression procedures in detecting differential item functioning (DIF) for polytomously scored items were investigated in this Monte Carlo simulation study. For this purpose, 54 simulation conditions (combinations of 3 sample sizes, 2 sample…
Multiple Logistic Regression Analysis of Cigarette Use among High School Students
ERIC Educational Resources Information Center
Adwere-Boamah, Joseph
2011-01-01
A binary logistic regression analysis was performed to predict high school students' cigarette smoking behavior from selected predictors from 2009 CDC Youth Risk Behavior Surveillance Survey. The specific target student behavior of interest was frequent cigarette use. Five predictor variables included in the model were: a) race, b) frequency of…
ERIC Educational Resources Information Center
Anderson, Carolyn J.; Verkuilen, Jay; Peyton, Buddy L.
2010-01-01
Survey items with multiple response categories and multiple-choice test questions are ubiquitous in psychological and educational research. We illustrate the use of log-multiplicative association (LMA) models that are extensions of the well-known multinomial logistic regression model for multiple dependent outcome variables to reanalyze a set of…
Propensity Score Estimation with Data Mining Techniques: Alternatives to Logistic Regression
ERIC Educational Resources Information Center
Keller, Bryan S. B.; Kim, Jee-Seon; Steiner, Peter M.
2013-01-01
Propensity score analysis (PSA) is a methodological technique which may correct for selection bias in a quasi-experiment by modeling the selection process using observed covariates. Because logistic regression is well understood by researchers in a variety of fields and easy to implement in a number of popular software packages, it has…
Classifying machinery condition using oil samples and binary logistic regression
NASA Astrophysics Data System (ADS)
Phillips, J.; Cripps, E.; Lau, John W.; Hodkiewicz, M. R.
2015-08-01
The era of big data has resulted in an explosion of condition monitoring information. The result is an increasing motivation to automate the costly and time consuming human elements involved in the classification of machine health. When working with industry it is important to build an understanding and hence some trust in the classification scheme for those who use the analysis to initiate maintenance tasks. Typically "black box" approaches such as artificial neural networks (ANN) and support vector machines (SVM) can be difficult to provide ease of interpretability. In contrast, this paper argues that logistic regression offers easy interpretability to industry experts, providing insight to the drivers of the human classification process and to the ramifications of potential misclassification. Of course, accuracy is of foremost importance in any automated classification scheme, so we also provide a comparative study based on predictive performance of logistic regression, ANN and SVM. A real world oil analysis data set from engines on mining trucks is presented and using cross-validation we demonstrate that logistic regression out-performs the ANN and SVM approaches in terms of prediction for healthy/not healthy engines.
Length bias correction in gene ontology enrichment analysis using logistic regression.
Mi, Gu; Di, Yanming; Emerson, Sarah; Cumbie, Jason S; Chang, Jeff H
2012-01-01
When assessing differential gene expression from RNA sequencing data, commonly used statistical tests tend to have greater power to detect differential expression of genes encoding longer transcripts. This phenomenon, called "length bias", will influence subsequent analyses such as Gene Ontology enrichment analysis. In the presence of length bias, Gene Ontology categories that include longer genes are more likely to be identified as enriched. These categories, however, are not necessarily biologically more relevant. We show that one can effectively adjust for length bias in Gene Ontology analysis by including transcript length as a covariate in a logistic regression model. The logistic regression model makes the statistical issue underlying length bias more transparent: transcript length becomes a confounding factor when it correlates with both the Gene Ontology membership and the significance of the differential expression test. The inclusion of the transcript length as a covariate allows one to investigate the direct correlation between the Gene Ontology membership and the significance of testing differential expression, conditional on the transcript length. We present both real and simulated data examples to show that the logistic regression approach is simple, effective, and flexible.
Lee, Seokho; Shin, Hyejin; Lee, Sang Han
2016-12-01
Alzheimer's disease (AD) is usually diagnosed by clinicians through cognitive and functional performance test with a potential risk of misdiagnosis. Since the progression of AD is known to cause structural changes in the corpus callosum (CC), the CC thickness can be used as a functional covariate in AD classification problem for a diagnosis. However, misclassified class labels negatively impact the classification performance. Motivated by AD-CC association studies, we propose a logistic regression for functional data classification that is robust to misdiagnosis or label noise. Specifically, our logistic regression model is constructed by adopting individual intercepts to functional logistic regression model. This approach enables to indicate which observations are possibly mislabeled and also lead to a robust and efficient classifier. An effective algorithm using MM algorithm provides simple closed-form update formulas. We test our method using synthetic datasets to demonstrate its superiority over an existing method, and apply it to differentiating patients with AD from healthy normals based on CC from MRI. © 2016, The International Biometric Society.
Szekér, Szabolcs; Vathy-Fogarassy, Ágnes
2018-01-01
Logistic regression based propensity score matching is a widely used method in case-control studies to select the individuals of the control group. This method creates a suitable control group if all factors affecting the output variable are known. However, if relevant latent variables exist as well, which are not taken into account during the calculations, the quality of the control group is uncertain. In this paper, we present a statistics-based research in which we try to determine the relationship between the accuracy of the logistic regression model and the uncertainty of the dependent variable of the control group defined by propensity score matching. Our analyses show that there is a linear correlation between the fit of the logistic regression model and the uncertainty of the output variable. In certain cases, a latent binary explanatory variable can result in a relative error of up to 70% in the prediction of the outcome variable. The observed phenomenon calls the attention of analysts to an important point, which must be taken into account when deducting conclusions.
Subclinical Hypothyroidism after 131I-Treatment of Graves' Disease: A Risk Factor for Depression?
Yu, Jing; Tian, Ai-Juan; Yuan, Xin; Cheng, Xiao-Xin
2016-01-01
Although it is well accepted that there is a close relationship between hypothyroidism and depression, previous studies provided inconsistent or even opposite results in whether subclinical hypothyroidism (SCH) increased the risk of depression. One possible reason is that the etiology of SCH in these studies was not clearly distinguished. We therefore investigated the relationship between SCH resulting from 131I treatment of Graves' disease and depression. The incidence of depression among 95 patients with SCH and 121 euthyroid patients following 131I treatment of Graves' disease was studied. The risk factors of depression were determined with multivariate logistic regression analysis. Thyroid hormone replacement therapy was performed in patients with thyroid-stimulating hormone (TSH) levels exceeding 10 mIU/L. Patients with SCH had significantly higher Hamilton Depression Scale scores, serum TSH and thyroid peroxidase antibody (TPOAb) levels compared with euthyroid patients. Multivariate logistic regression analysis revealed SCH, Graves' eye syndrome and high serum TPO antibody level as risk factors for depression. L-thyroxine treatment is beneficial for SCH patients with serum TSH levels exceeding 10 mIU/L. The results of the present study demonstrated that SCH is prevalent among 131I treated Graves' patients. SCH might increase the risk of developing depression. L-thyroxine replacement therapy helps to resolve depressive disorders in SCH patients with TSH > 10mIU/L. These data provide insight into the relationship between SCH and depression.
Chen, Jing; Li, Jia; Qiu, Gang; Wei, Jingchao; Qiu, Yanfen; An, Yonghui; Shen, Yong
2016-09-20
The purpose of this study was to investigate whether uncovertebral joint ossification was a risk factor for axial symptoms (AS) after cervical disc arthroplasty (CDA). This retrospective study included 52 consecutive patients who underwent CDA for single-level cervical disc disease. To examine possible risk factors for AS after CDA, univariate and multivariate logistic regression analyses were conducted to compare data from the patients with and without AS (the AS and no-AS groups, respectively). Among the 52 patients examined, AS were observed in 24 patients (46.2 %), including a stiff neck (n = 11), neck pain and dullness (n = 10), and shoulder pain (n = 3). Uncovertebral joint ossification was detected in 22 (42.3 %) patients, including 17 patients in the AS group and 5 patients in the no-AS group. Clinical outcome improved during the follow-up period for the AS group. According to multivariate logistic regression analysis, uncovertebral joint ossification, cervical kyphosis, and range of motion (ROM) at the index level were identified as significant risk factors for AS after CDA. Satisfactory clinical outcomes were observed following CDA for the treatment of single-level cervical disc disease in the present cohort. In addition, uncovertebral joint ossification, cervical kyphosis, and ROM at the index level were found to affect the incidence of AS after CDA.
Lewis, Kristin Nicole; Heckman, Bernadette Davantes; Himawan, Lina
2011-08-01
Growth mixture modeling (GMM) identified latent groups based on treatment outcome trajectories of headache disability measures in patients in headache subspecialty treatment clinics. Using a longitudinal design, 219 patients in headache subspecialty clinics in 4 large cities throughout Ohio provided data on their headache disability at pretreatment and 3 follow-up assessments. GMM identified 3 treatment outcome trajectory groups: (1) patients who initiated treatment with elevated disability levels and who reported statistically significant reductions in headache disability (high-disability improvers; 11%); (2) patients who initiated treatment with elevated disability but who reported no reductions in disability (high-disability nonimprovers; 34%); and (3) patients who initiated treatment with moderate disability and who reported statistically significant reductions in headache disability (moderate-disability improvers; 55%). Based on the final multinomial logistic regression model, a dichotomized treatment appointment attendance variable was a statistically significant predictor for differentiating high-disability improvers from high-disability nonimprovers. Three-fourths of patients who initiated treatment with elevated disability levels did not report reductions in disability after 5 months of treatment with new preventive pharmacotherapies. Preventive headache agents may be most efficacious for patients with moderate levels of disability and for patients with high disability levels who attend all treatment appointments. Copyright © 2011 International Association for the Study of Pain. Published by Elsevier B.V. All rights reserved.
Logistic regression for circular data
NASA Astrophysics Data System (ADS)
Al-Daffaie, Kadhem; Khan, Shahjahan
2017-05-01
This paper considers the relationship between a binary response and a circular predictor. It develops the logistic regression model by employing the linear-circular regression approach. The maximum likelihood method is used to estimate the parameters. The Newton-Raphson numerical method is used to find the estimated values of the parameters. A data set from weather records of Toowoomba city is analysed by the proposed methods. Moreover, a simulation study is considered. The R software is used for all computations and simulations.
Naval Research Logistics Quarterly. Volume 28. Number 3,
1981-09-01
denotes component-wise maximum. f has antone (isotone) differences on C x D if for cl < c2 and d, < d2, NAVAL RESEARCH LOGISTICS QUARTERLY VOL. 28...or negative correlations and linear or nonlinear regressions. Given are the mo- ments to order two and, for special cases, (he regression function and...data sets. We designate this bnb distribution as G - B - N(a, 0, v). The distribution admits only of positive correlation and linear regressions
Bond, H S; Sullivan, S G; Cowling, B J
2016-06-01
Influenza vaccination is the most practical means available for preventing influenza virus infection and is widely used in many countries. Because vaccine components and circulating strains frequently change, it is important to continually monitor vaccine effectiveness (VE). The test-negative design is frequently used to estimate VE. In this design, patients meeting the same clinical case definition are recruited and tested for influenza; those who test positive are the cases and those who test negative form the comparison group. When determining VE in these studies, the typical approach has been to use logistic regression, adjusting for potential confounders. Because vaccine coverage and influenza incidence change throughout the season, time is included among these confounders. While most studies use unconditional logistic regression, adjusting for time, an alternative approach is to use conditional logistic regression, matching on time. Here, we used simulation data to examine the potential for both regression approaches to permit accurate and robust estimates of VE. In situations where vaccine coverage changed during the influenza season, the conditional model and unconditional models adjusting for categorical week and using a spline function for week provided more accurate estimates. We illustrated the two approaches on data from a test-negative study of influenza VE against hospitalization in children in Hong Kong which resulted in the conditional logistic regression model providing the best fit to the data.
Asghari, Mehdi Poursheikhali; Hayatshahi, Sayyed Hamed Sadat; Abdolmaleki, Parviz
2012-01-01
From both the structural and functional points of view, β-turns play important biological roles in proteins. In the present study, a novel two-stage hybrid procedure has been developed to identify β-turns in proteins. Binary logistic regression was initially used for the first time to select significant sequence parameters in identification of β-turns due to a re-substitution test procedure. Sequence parameters were consisted of 80 amino acid positional occurrences and 20 amino acid percentages in sequence. Among these parameters, the most significant ones which were selected by binary logistic regression model, were percentages of Gly, Ser and the occurrence of Asn in position i+2, respectively, in sequence. These significant parameters have the highest effect on the constitution of a β-turn sequence. A neural network model was then constructed and fed by the parameters selected by binary logistic regression to build a hybrid predictor. The networks have been trained and tested on a non-homologous dataset of 565 protein chains. With applying a nine fold cross-validation test on the dataset, the network reached an overall accuracy (Qtotal) of 74, which is comparable with results of the other β-turn prediction methods. In conclusion, this study proves that the parameter selection ability of binary logistic regression together with the prediction capability of neural networks lead to the development of more precise models for identifying β-turns in proteins. PMID:27418910
Asghari, Mehdi Poursheikhali; Hayatshahi, Sayyed Hamed Sadat; Abdolmaleki, Parviz
2012-01-01
From both the structural and functional points of view, β-turns play important biological roles in proteins. In the present study, a novel two-stage hybrid procedure has been developed to identify β-turns in proteins. Binary logistic regression was initially used for the first time to select significant sequence parameters in identification of β-turns due to a re-substitution test procedure. Sequence parameters were consisted of 80 amino acid positional occurrences and 20 amino acid percentages in sequence. Among these parameters, the most significant ones which were selected by binary logistic regression model, were percentages of Gly, Ser and the occurrence of Asn in position i+2, respectively, in sequence. These significant parameters have the highest effect on the constitution of a β-turn sequence. A neural network model was then constructed and fed by the parameters selected by binary logistic regression to build a hybrid predictor. The networks have been trained and tested on a non-homologous dataset of 565 protein chains. With applying a nine fold cross-validation test on the dataset, the network reached an overall accuracy (Qtotal) of 74, which is comparable with results of the other β-turn prediction methods. In conclusion, this study proves that the parameter selection ability of binary logistic regression together with the prediction capability of neural networks lead to the development of more precise models for identifying β-turns in proteins.
Crane, Paul K; Gibbons, Laura E; Jolley, Lance; van Belle, Gerald
2006-11-01
We present an ordinal logistic regression model for identification of items with differential item functioning (DIF) and apply this model to a Mini-Mental State Examination (MMSE) dataset. We employ item response theory ability estimation in our models. Three nested ordinal logistic regression models are applied to each item. Model testing begins with examination of the statistical significance of the interaction term between ability and the group indicator, consistent with nonuniform DIF. Then we turn our attention to the coefficient of the ability term in models with and without the group term. If including the group term has a marked effect on that coefficient, we declare that it has uniform DIF. We examined DIF related to language of test administration in addition to self-reported race, Hispanic ethnicity, age, years of education, and sex. We used PARSCALE for IRT analyses and STATA for ordinal logistic regression approaches. We used an iterative technique for adjusting IRT ability estimates on the basis of DIF findings. Five items were found to have DIF related to language. These same items also had DIF related to other covariates. The ordinal logistic regression approach to DIF detection, when combined with IRT ability estimates, provides a reasonable alternative for DIF detection. There appear to be several items with significant DIF related to language of test administration in the MMSE. More attention needs to be paid to the specific criteria used to determine whether an item has DIF, not just the technique used to identify DIF.
Conditional Poisson models: a flexible alternative to conditional logistic case cross-over analysis.
Armstrong, Ben G; Gasparrini, Antonio; Tobias, Aurelio
2014-11-24
The time stratified case cross-over approach is a popular alternative to conventional time series regression for analysing associations between time series of environmental exposures (air pollution, weather) and counts of health outcomes. These are almost always analyzed using conditional logistic regression on data expanded to case-control (case crossover) format, but this has some limitations. In particular adjusting for overdispersion and auto-correlation in the counts is not possible. It has been established that a Poisson model for counts with stratum indicators gives identical estimates to those from conditional logistic regression and does not have these limitations, but it is little used, probably because of the overheads in estimating many stratum parameters. The conditional Poisson model avoids estimating stratum parameters by conditioning on the total event count in each stratum, thus simplifying the computing and increasing the number of strata for which fitting is feasible compared with the standard unconditional Poisson model. Unlike the conditional logistic model, the conditional Poisson model does not require expanding the data, and can adjust for overdispersion and auto-correlation. It is available in Stata, R, and other packages. By applying to some real data and using simulations, we demonstrate that conditional Poisson models were simpler to code and shorter to run than are conditional logistic analyses and can be fitted to larger data sets than possible with standard Poisson models. Allowing for overdispersion or autocorrelation was possible with the conditional Poisson model but when not required this model gave identical estimates to those from conditional logistic regression. Conditional Poisson regression models provide an alternative to case crossover analysis of stratified time series data with some advantages. The conditional Poisson model can also be used in other contexts in which primary control for confounding is by fine stratification.
Fei, Y; Hu, J; Li, W-Q; Wang, W; Zong, G-Q
2017-03-01
Essentials Predicting the occurrence of portosplenomesenteric vein thrombosis (PSMVT) is difficult. We studied 72 patients with acute pancreatitis. Artificial neural networks modeling was more accurate than logistic regression in predicting PSMVT. Additional predictive factors may be incorporated into artificial neural networks. Objective To construct and validate artificial neural networks (ANNs) for predicting the occurrence of portosplenomesenteric venous thrombosis (PSMVT) and compare the predictive ability of the ANNs with that of logistic regression. Methods The ANNs and logistic regression modeling were constructed using simple clinical and laboratory data of 72 acute pancreatitis (AP) patients. The ANNs and logistic modeling were first trained on 48 randomly chosen patients and validated on the remaining 24 patients. The accuracy and the performance characteristics were compared between these two approaches by SPSS17.0 software. Results The training set and validation set did not differ on any of the 11 variables. After training, the back propagation network training error converged to 1 × 10 -20 , and it retained excellent pattern recognition ability. When the ANNs model was applied to the validation set, it revealed a sensitivity of 80%, specificity of 85.7%, a positive predictive value of 77.6% and negative predictive value of 90.7%. The accuracy was 83.3%. Differences could be found between ANNs modeling and logistic regression modeling in these parameters (10.0% [95% CI, -14.3 to 34.3%], 14.3% [95% CI, -8.6 to 37.2%], 15.7% [95% CI, -9.9 to 41.3%], 11.8% [95% CI, -8.2 to 31.8%], 22.6% [95% CI, -1.9 to 47.1%], respectively). When ANNs modeling was used to identify PSMVT, the area under receiver operating characteristic curve was 0.849 (95% CI, 0.807-0.901), which demonstrated better overall properties than logistic regression modeling (AUC = 0.716) (95% CI, 0.679-0.761). Conclusions ANNs modeling was a more accurate tool than logistic regression in predicting the occurrence of PSMVT following AP. More clinical factors or biomarkers may be incorporated into ANNs modeling to improve its predictive ability. © 2016 International Society on Thrombosis and Haemostasis.
Chen, Yimin; Zhao, Ying; Feng, Linmin; Zhang, Jie; Zhang, Juanwen; Feng, Guofang
2016-04-27
Metabolic syndrome is closely associated with an increased risk for fatty liver disease morbidity and mortality. Recently, studies have reported that participants with fatty liver disease have higher serum alpha-fetoprotein levels than those without. We investigated the association between alpha-fetoprotein levels and the prevalence of metabolic syndrome in a Chinese asymptomatic population. A cross-sectional study was performed with 7,755 participants who underwent individual health examinations. Clinical and anthropometric parameters were collected and serum alpha-fetoprotein levels and other clinical and laboratory parameters were measured. Logistic regression analysis was used to examine associations between alpha-fetoprotein and metabolic syndrome. Participants with metabolic syndrome had significantly higher (p < 0.001) alpha-fetoprotein levels than those without, though all alpha-fetoprotein levels were within the reference interval. The association between the components of metabolic syndrome (central obesity, elevated blood pressure, elevated triglycerides, reduced high-density lipoprotein cholesterol, and elevated fasting plasma glucose) and alpha-fetoprotein levels was evaluated. Alpha-fetoprotein levels in the elevated triglycerides, reduced high-density lipoprotein cholesterol, and elevated fasting plasma glucose groups were significantly different (p=0.002, p < 0.001, p=0.020) compared with alpha-fetoprotein in the normal triglycerides, high-density lipoprotein cholesterol, and fasting plasma glucose groups. Logistic regression analyses showed an association between alpha-fetoprotein levels and increased risk for metabolic syndrome, the presence of reduced high-density lipoprotein cholesterol, and elevated fasting plasma glucose, but not with obesity, elevated blood pressure, or triglycerides. These results suggest a significant association between alpha-fetoprotein and metabolic syndrome.
Ai, Zi-Sheng; Gao, You-Shui; Sun, Yuan; Liu, Yue; Zhang, Chang-Qing; Jiang, Cheng-Hua
2013-03-01
Risk factors for femoral neck fracture-induced avascular necrosis of the femoral head have not been elucidated clearly in middle-aged and elderly patients. Moreover, the high incidence of screw removal in China and its effect on the fate of the involved femoral head require statistical methods to reflect their intrinsic relationship. Ninety-nine patients older than 45 years with femoral neck fracture were treated by internal fixation between May 1999 and April 2004. Descriptive analysis, interaction analysis between associated factors, single factor logistic regression, multivariate logistic regression, and detailed interaction analysis were employed to explore potential relationships among associated factors. Avascular necrosis of the femoral head was found in 15 cases (15.2 %). Age × the status of implants (removal vs. maintenance) and gender × the timing of reduction were interactive according to two-factor interactive analysis. Age, the displacement of fractures, the quality of reduction, and the status of implants were found to be significant factors in single factor logistic regression analysis. Age, age × the status of implants, and the quality of reduction were found to be significant factors in multivariate logistic regression analysis. In fine interaction analysis after multivariate logistic regression analysis, implant removal was the most important risk factor for avascular necrosis in 56-to-85-year-old patients, with a risk ratio of 26.00 (95 % CI = 3.076-219.747). The middle-aged and elderly have less incidence of avascular necrosis of the femoral head following femoral neck fractures treated by cannulated screws. The removal of cannulated screws can induce a significantly high incidence of avascular necrosis of the femoral head in elderly patients, while a high-quality reduction is helpful to reduce avascular necrosis.
Zhou, Jinzhe; Zhou, Yanbing; Cao, Shougen; Li, Shikuan; Wang, Hao; Niu, Zhaojian; Chen, Dong; Wang, Dongsheng; Lv, Liang; Zhang, Jian; Li, Yu; Jiao, Xuelong; Tan, Xiaojie; Zhang, Jianli; Wang, Haibo; Zhang, Bingyuan; Lu, Yun; Sun, Zhenqing
2016-01-01
Reporting of surgical complications is common, but few provide information about the severity and estimate risk factors of complications. If have, but lack of specificity. We retrospectively analyzed data on 2795 gastric cancer patients underwent surgical procedure at the Affiliated Hospital of Qingdao University between June 2007 and June 2012, established multivariate logistic regression model to predictive risk factors related to the postoperative complications according to the Clavien-Dindo classification system. Twenty-four out of 86 variables were identified statistically significant in univariate logistic regression analysis, 11 significant variables entered multivariate analysis were employed to produce the risk model. Liver cirrhosis, diabetes mellitus, Child classification, invasion of neighboring organs, combined resection, introperative transfusion, Billroth II anastomosis of reconstruction, malnutrition, surgical volume of surgeons, operating time and age were independent risk factors for postoperative complications after gastrectomy. Based on logistic regression equation, p=Exp∑BiXi / (1+Exp∑BiXi), multivariate logistic regression predictive model that calculated the risk of postoperative morbidity was developed, p = 1/(1 + e((4.810-1.287X1-0.504X2-0.500X3-0.474X4-0.405X5-0.318X6-0.316X7-0.305X8-0.278X9-0.255X10-0.138X11))). The accuracy, sensitivity and specificity of the model to predict the postoperative complications were 86.7%, 76.2% and 88.6%, respectively. This risk model based on Clavien-Dindo grading severity of complications system and logistic regression analysis can predict severe morbidity specific to an individual patient's risk factors, estimate patients' risks and benefits of gastric surgery as an accurate decision-making tool and may serve as a template for the development of risk models for other surgical groups.
Rhodes, Darson L; Kirchofer, Gregg; Hammig, Bart J; Ogletree, Roberta J
2013-05-01
This study examined the impact of professional preparation and class structure on sexuality topics taught and use of practice-based instructional strategies in US middle and high school health classes. Data from the classroom-level file of the 2006 School Health Policies and Programs were used. A series of multivariable logistic regression models were employed to determine if sexuality content taught was dependent on professional preparation and /or class structure (HE only versus HE/another subject combined). Additional multivariable logistic regression models were employed to determine if use of practice-based instructional strategies was dependent upon professional preparation and/or class structure. Years of teaching health topics and size of the school district were included as covariates in the multivariable logistic regression models. Findings indicated professionally prepared health educators were significantly more likely to teach 7 of the 13 sexuality topics as compared to nonprofessionally prepared health educators. There was no statistically significant difference in the instructional strategies used by professionally prepared and nonprofessionally prepared health educators. Exclusively health education classes versus combined classes were significantly more likely to have included 6 of the 13 topics and to have incorporated practice-based instructional strategies in the curricula. This study indicated professional preparation and class structure impacted sexuality content taught. Class structure also impacted whether opportunities for students to practice skills were made available. Results support the need for continued advocacy for professionally prepared health educators and health only courses. © 2013, American School Health Association.
NASA Astrophysics Data System (ADS)
Wulandari, S. P.; Salamah, M.; Rositawati, A. F. D.
2018-04-01
Food security is the condition where the food fulfilment is managed well for the country till the individual. Indonesia is one of the country which has the commitment to create the food security becomes main priority. However, the food necessity becomes common thing means that it doesn’t care about nutrient standard and the health condition of family member, so in the fulfilment of food necessity also has to consider the disease suffered by the family member, one of them is pulmonary tuberculosa. From that reasons, this research is conducted to know the factors which influence on household food security status which suffered from pulmonary tuberculosis in the coastal area of Surabaya by using binary logistic regression method. The analysis result by using binary logistic regression shows that the variables wife latest education, house density and spacious house ventilation significantly affect on household food security status which suffered from pulmonary tuberculosis in the coastal area of Surabaya, where the wife education level is University/equivalent, the house density is eligible or 8 m2/person and spacious house ventilation 10% of the floor area has the opportunity to become food secure households amounted to 0.911089. While the chance of becoming food insecure households amounted to 0.088911. The model household food security status which suffered from pulmonary tuberculosis in the coastal area of Surabaya has been conformable, and the overall percentages of those classifications are at 71.8%.
Smoking media literacy in Vietnamese adolescents.
Page, Randy M; Huong, Nguyen T; Chi, Hoang K; Tien, Truong Q
2011-01-01
Smoking media literacy (SML) has been found to be independently associated with reduced current smoking and reduced susceptibility to future smoking in a sample of American adolescents, but not in other populations of adolescents. Thus, the purpose of this study was to assess SML in Vietnamese adolescents and to determine the association with smoking behavior and susceptibility to future smoking. A cross-sectional survey of 2000 high school students completed the SML scale, which is based on an integrated theoretical framework of media literacy, and items assessing cigarette use. Ordinal logistic regression was used to determine the association of SML with smoking and susceptibility to future smoking. Ordinal logistic regression was also to determine whether smoking in the past 30 days was associated with the 8 domains/core concepts of media literacy which comprise the SML. Smoking media literacy was lower among the Vietnamese adolescents than what has been previously reported in American adolescents. Ordinal logistic regression analysis results showed that in the total sample SML was associated with reduced smoking, but there was no association with susceptibility to future smoking. Further analysis showed that results differed according to school and grade level. There did not appear to be association of smoking with the specific domains/concepts that comprise the SML. The association of SML with reduced smoking suggests the need for further research involving SML, including the testing of media literacy training interventions, in Vietnamese adolescents and also other populations of adolescents. © 2011, American School Health Association.
Seligman, D A; Pullinger, A G
2000-01-01
Confusion about the relationship of occlusion to temporomandibular disorders (TMD) persists. This study attempted to identify occlusal and attrition factors plus age that would characterize asymptomatic normal female subjects. A total of 124 female patients with intracapsular TMD were compared with 47 asymptomatic female controls for associations to 9 occlusal factors, 3 attrition severity measures, and age using classification tree, multiple stepwise logistic regression, and univariate analyses. Models were tested for accuracy (sensitivity and specificity) and total contribution to the variance. The classification tree model had 4 terminal nodes that used only anterior attrition and age. "Normals" were mainly characterized by low attrition levels, whereas patients had higher attrition and tended to be younger. The tree model was only moderately useful (sensitivity 63%, specificity 94%) in predicting normals. The logistic regression model incorporated unilateral posterior crossbite and mediotrusive attrition severity in addition to the 2 factors in the tree, but was slightly less accurate than the tree (sensitivity 51%, specificity 90%). When only occlusal factors were considered in the analysis, normals were additionally characterized by a lack of anterior open bite, smaller overjet, and smaller RCP-ICP slides. The log likelihood accounted for was similar for both the tree (pseudo R(2) = 29.38%; mean deviance = 0.95) and the multiple logistic regression (Cox Snell R(2) = 30.3%, mean deviance = 0.84) models. The occlusal and attrition factors studied were only moderately useful in differentiating normals from TMD patients.
Borda, Alfredo; Sanz, Belén; Otero, Laura; Blasco, Teresa; García-Gómez, Francisco J; de Andrés, Fuencisla
2011-01-01
To analyze the association between travel time and participation in a breast cancer screening program adjusted for contextual variables in the province of Segovia (Spain). We performed an ecological study using the following data sources: the Breast Cancer Early Detection Program of the Primary Care Management of Segovia, the Population and Housing Census for 2001 and the municipal register for 2006-2007. The study period comprised January 2006 to December 2007. Dependent variables consisted of the municipal participation rate and the desired level of municipal participation (greater than or equal to 70%). The key independent variable was travel time from the municipality to the mammography unit. Covariables consisted of the municipalities' demographic and socioeconomic factors. We performed univariate and multivariate Poisson regression analyses of the participation rate, and logistic regression of the desired participation level. The sample was composed of 178 municipalities. The mean participation rate was 75.2%. The desired level of participation (≥ 70%) was achieved in 119 municipalities (67%). In the multivariate Poisson and logistic regression analyses, longer travel time was associated with a lower participation rate and with lower participation after adjustment was made for geographic density, age, socioeconomic status and dependency ratio, with a relative risk index of 0.88 (95% CI: 0.81-0.96) and an odds ratio of 0.22 (95% CI: 0.1-0.47), respectively. Travel time to the mammography unit may help to explain participation in breast cancer screening programs. Copyright © 2010 SESPAS. Published by Elsevier Espana. All rights reserved.
ERIC Educational Resources Information Center
Lichtenberger, Eric; George-Jackson, Casey
2013-01-01
This study examined how various individual, family, and school level contextual factors impact the likelihood of planning to major in one of the science, technology, engineering, or mathematics (STEM) fields for high school students. A binary logistic regression model was developed to determine the extent to which each of the covariates helped to…
Emergency Department Use by Nursing Home Residents: Effect of Severity of Cognitive Impairment
ERIC Educational Resources Information Center
Stephens, Caroline E.; Newcomer, Robert; Blegen, Mary; Miller, Bruce; Harrington, Charlene
2012-01-01
Purpose: To examine the 1-year prevalence and risk of emergency department (ED) use and ambulatory care-sensitive (ACS) ED use by nursing home (NH) residents with different levels of severity of cognitive impairment (CI). Design and Methods: We used multinomial logistic regression to estimate the effect of CI severity on the odds of any ED visit…
J.M. Menzel; W.M. Ford; J.W. Edwards; L.J. Ceperley; L.J. Ceperley
2006-01-01
The Virginia northern flying squirrel (Glaucomys sabrinus fuscus) is an endangered sciurid that occurs in the Allegheny Mountains of Virginia and West Virginia. Despite its status, few of its ecological requirements have been synthesized for landscape-level predictive distributions to facilitate habitat delineation efforts. Using logistic regression, we developed a GIS...
An Exploration of Teacher Attrition and Mobility in High Poverty Racially Segregated Schools
ERIC Educational Resources Information Center
Djonko-Moore, Cara M.
2016-01-01
The purpose of this study was to examine the mobility (movement to a new school) and attrition (quitting teaching) patterns of teachers in high poverty, racially segregated (HPRS) schools in the US. Using 2007-9 survey data from the National Center for Education Statistics, a multi-level multinomial logistic regression was performed to examine the…
Impact of School Violence on Youth Alcohol Abuse: Differences Based on Gender and Grade Level
ERIC Educational Resources Information Center
Vidourek, Rebecca A.; King, Keith A.; Merianos, Ashley L.
2016-01-01
The purpose of this study was to examine the impact of school violence on recent alcohol use and episodic heavy drinking among seventh- through 12th-grade students. A total of 54,631 students completed a survey assessing substance use and other risky behaviors. Logistic regression analyses were conducted to examine the research questions. Results…
ERIC Educational Resources Information Center
Obasaju, Mayowa A.; Palin, Frances L.; Jacobs, Carli; Anderson, Page; Kaslow, Nadine J.
2009-01-01
An ecological model is used to explore the moderating effects of community-level variables on the relation between childhood sexual, physical, and emotional abuse and adult intimate partner violence (IPV) within a sample of 98 African American women from low incomes. Results from hierarchical, binary logistics regressions analyses show that…
ERIC Educational Resources Information Center
Suvedi, Murari; Ghimire, Raju; Kaplowitz, Michael
2017-01-01
Purpose: This paper examines the factors affecting farmers' participation in extension programs and adoption of improved seed varieties in the hills of rural Nepal. Methodology/approach: Cross-sectional farm-level data were collected during July and August 2014. A sample of 198 farm households was selected for interviewing by using a multistage,…
Protective Families in High- and Low-Risk Environments: Implications for Adolescent Substance Use
ERIC Educational Resources Information Center
Cleveland, Michael J.; Feinberg, Mark E.; Greenberg, Mark T.
2010-01-01
This study used data from a sample of 6th to 12th grade students (N = 48,641, 51% female), nested in 192 schools, to determine if the influence of family-based protective factors varied across different school contexts. Hierarchical logistic regression models were used to examine the effects of individual-level family protective factors, relative…
Mak, Kwok-Kei; Kim, Dae-Hwan; Leigh, J Paul
2015-01-01
Few population-based studies have used an econometric approach to understand the association between two cancer risk factors, obesity and stress. This study investigated sociodemographic differences in the association between obesity and stress among Korean adults (6,546 men and 8,473 women). Data were drawn from the Korean National Health and Nutrition Examination Survey for 2008, 2009, and 2010. Ordered logistic regression models and propensity score matching methods were used to examine the associations between obesity and stress, stratified by gender and age groups. In women, the stress level of the obese group was found to be 27.6% higher than the nonobese group in the ordered logistic regression; the obesity effect on stress was statistically significant in the propensity score-matched analysis. Corresponding evidence for the effect of obesity on stress was lacking among men. Participants who were young, well-educated, and working were more likely to report stress. In Korea, obesity causes stress in women but not in men. Young women are susceptible to a disproportionate level of stress. More cancer prevention programs targeting young and obese women are encouraged in developed Asian countries.
Factors associated with self-medication in Spain: a cross-sectional study in different age groups.
Niclós, Gracia; Olivar, Teresa; Rodilla, Vicent
2018-06-01
The identification of factors which may influence a patient's decision to self-medicate. Descriptive, cross-sectional study of the adult population (at least 16 years old), using data from the 2009 European Health Interview Survey in Spain, which included 22 188 subjects. Logistic regression models enabled us to estimate the effect of each analysed variable on self-medication. In total, 14 863 (67%) individuals reported using medication (prescribed and non-prescribed) and 3274 (22.0%) of them self-medicated. Using logistic regression and stratifying by age, four different models have been constructed. Our results include different variables in each of the models to explain self-medication, but the one that appears on all four models is education level. Age is the other important factor which influences self-medication. Self-medication is strongly associated with factors related to socio-demographic, such as sex, educational level or age, as well as several health factors such as long-standing illness or physical activity. When our data are compared to those from previous Spanish surveys carried out in 2003 and 2006, we can conclude that self-medication is increasing in Spain. © 2017 Royal Pharmaceutical Society.
Organochlorine pesticides accumulation and breast cancer: A hospital-based case-control study.
He, Ting-Ting; Zuo, An-Jun; Wang, Ji-Gang; Zhao, Peng
2017-05-01
The aim of this study is to detect the accumulation status of organochlorine pesticides in breast cancer patients and to explore the relationship between organochlorine pesticides contamination and breast cancer development. We conducted a hospital-based case-control study in 56 patients with breast cancer and 46 patients with benign breast disease. We detected the accumulation level of several organochlorine pesticides products (β-hexachlorocyclohexane, γ-hexachlorocyclohexane, polychlorinated biphenyls-28, polychlorinated biphenyls-52, pentachlorothioanisole, and pp'-dichlorodiphenyldichloroethane) in breast adipose tissues of all 102 patients using gas chromatography. Thereafter, we examined the expression status of estrogen receptor, progesterone receptor, human epidermal growth factor receptor-2 (HER2), and Ki-67 in 56 breast cancer cases by immunohistochemistry. In addition, we analyzed the risk of breast cancer in those patients with organochlorine pesticides contamination using a logistic regression model. Our data showed that breast cancer patients suffered high accumulation levels of pp'-dichlorodiphenyldichloroethane and polychlorinated biphenyls-52. However, the concentrations of pp'-dichlorodiphenyldichloroethane and polychlorinated biphenyls-52 were not related to clinicopathologic parameters of breast cancer. Further logistic regression analysis showed polychlorinated biphenyls-52 and pp'-dichlorodiphenyldichloroethane were risk factors for breast cancer. Our results provide new evidence on etiology of breast cancer.
Zeng, Rong; Luo, Jiayou; Tan, Cai; DU, Qiyun; Zhang, Weimin; Li, Yanping
2012-11-01
To explore the relationship between caregivers' nutritional knowledge and children's dietary behavior in rural areas of China. A cross-sectional study was conducted. 3361 rural caregivers and their children, aged 2 to 7 years old, were selected randomly and surveyed by questionnaire. Logistic regression models were used to identify the relationship between caregivers' nutritional knowledge and the children's dietary behaviors. The awareness level of nutritional knowledge among rural caregivers was 57.9%; among the children surveyed, 79.6% did not like to drink milk, 66.0% were considered choosy of food, 84.1% regularly snacked, 24.4% frequently skipped breakfast, and 13.7% did not come to meals on time. Logistic regression models indicated that a caregiver with a low level of nutritional knowledge is a risk factor for a child's unhealth dietary behaviors (snacking excepted): the odds ratios (OR) of not liking to drink milk, being choosy about food, skipping breakfast or not having meals on time are 1.665, 1.338, 1.330 and 1.582, respectively. Caregivers' nutritional knowledge is strongly associated with children's dietary behavior. Nutrition education programs are urgently wanted to improve caregiver's knowledge and thus to improve children's dietary behavior in rural areas of China.
Exploring Audiologists' Language and Hearing Aid Uptake in Initial Rehabilitation Appointments.
Sciacca, Anna; Meyer, Carly; Ekberg, Katie; Barr, Caitlin; Hickson, Louise
2017-06-13
The study aimed (a) to profile audiologists' language during the diagnosis and management planning phase of hearing assessment appointments and (b) to explore associations between audiologists' language and patients' decisions to obtain hearing aids. Sixty-two audiologist-patient dyads participated. Patient participants were aged 55 years or older. Hearing assessment appointments were audiovisually recorded and transcribed for analysis. Audiologists' language was profiled using two measures: general language complexity and use of jargon. A binomial, multivariate logistic regression analysis was conducted to investigate the associations between these language measures and hearing aid uptake. The logistic regression model revealed that the Flesch-Kincaid reading grade level of audiologists' language was significantly associated with hearing aid uptake. Patients were less likely to obtain hearing aids when audiologists' language was at a higher reading grade level. No associations were found between audiologists' use of jargon and hearing aid uptake. Audiologists' use of complex language may present a barrier for patients to understand hearing rehabilitation recommendations. Reduced understanding may limit patient participation in the decision-making process and result in patients being less willing to trial hearing aids. Clear, concise language is recommended to facilitate shared decision making.
Rank-Optimized Logistic Matrix Regression toward Improved Matrix Data Classification.
Zhang, Jianguang; Jiang, Jianmin
2018-02-01
While existing logistic regression suffers from overfitting and often fails in considering structural information, we propose a novel matrix-based logistic regression to overcome the weakness. In the proposed method, 2D matrices are directly used to learn two groups of parameter vectors along each dimension without vectorization, which allows the proposed method to fully exploit the underlying structural information embedded inside the 2D matrices. Further, we add a joint [Formula: see text]-norm on two parameter matrices, which are organized by aligning each group of parameter vectors in columns. This added co-regularization term has two roles-enhancing the effect of regularization and optimizing the rank during the learning process. With our proposed fast iterative solution, we carried out extensive experiments. The results show that in comparison to both the traditional tensor-based methods and the vector-based regression methods, our proposed solution achieves better performance for matrix data classifications.
Detecting DIF in Polytomous Items Using MACS, IRT and Ordinal Logistic Regression
ERIC Educational Resources Information Center
Elosua, Paula; Wells, Craig
2013-01-01
The purpose of the present study was to compare the Type I error rate and power of two model-based procedures, the mean and covariance structure model (MACS) and the item response theory (IRT), and an observed-score based procedure, ordinal logistic regression, for detecting differential item functioning (DIF) in polytomous items. A simulation…
ERIC Educational Resources Information Center
Rudner, Lawrence
2016-01-01
In the machine learning literature, it is commonly accepted as fact that as calibration sample sizes increase, Naïve Bayes classifiers initially outperform Logistic Regression classifiers in terms of classification accuracy. Applied to subtests from an on-line final examination and from a highly regarded certification examination, this study shows…
ERIC Educational Resources Information Center
Fan, Xitao; Wang, Lin
The Monte Carlo study compared the performance of predictive discriminant analysis (PDA) and that of logistic regression (LR) for the two-group classification problem. Prior probabilities were used for classification, but the cost of misclassification was assumed to be equal. The study used a fully crossed three-factor experimental design (with…
School Exits in the Milwaukee Parental Choice Program: Evidence of a Marketplace?
ERIC Educational Resources Information Center
Ford, Michael
2011-01-01
This article examines whether the large number of school exits from the Milwaukee school voucher program is evidence of a marketplace. Two logistic regression and multinomial logistic regression models tested the relation between the inability to draw large numbers of voucher students and the ability for a private school to remain viable. Data on…
Hierarchical Bayesian Logistic Regression to forecast metabolic control in type 2 DM patients.
Dagliati, Arianna; Malovini, Alberto; Decata, Pasquale; Cogni, Giulia; Teliti, Marsida; Sacchi, Lucia; Cerra, Carlo; Chiovato, Luca; Bellazzi, Riccardo
2016-01-01
In this work we present our efforts in building a model able to forecast patients' changes in clinical conditions when repeated measurements are available. In this case the available risk calculators are typically not applicable. We propose a Hierarchical Bayesian Logistic Regression model, which allows taking into account individual and population variability in model parameters estimate. The model is used to predict metabolic control and its variation in type 2 diabetes mellitus. In particular we have analyzed a population of more than 1000 Italian type 2 diabetic patients, collected within the European project Mosaic. The results obtained in terms of Matthews Correlation Coefficient are significantly better than the ones gathered with standard logistic regression model, based on data pooling.
Model building strategy for logistic regression: purposeful selection.
Zhang, Zhongheng
2016-03-01
Logistic regression is one of the most commonly used models to account for confounders in medical literature. The article introduces how to perform purposeful selection model building strategy with R. I stress on the use of likelihood ratio test to see whether deleting a variable will have significant impact on model fit. A deleted variable should also be checked for whether it is an important adjustment of remaining covariates. Interaction should be checked to disentangle complex relationship between covariates and their synergistic effect on response variable. Model should be checked for the goodness-of-fit (GOF). In other words, how the fitted model reflects the real data. Hosmer-Lemeshow GOF test is the most widely used for logistic regression model.
Erkenekli, Kudret; Oztas, Efser; Kuscu, Elif; Keskin, Uğur; Kurt, Yasemin Gulcan; Tas, Ahmet; Yilmaz, Nafiye
2017-01-01
Dyslipidemia is common in women with polycystic ovary syndrome (PCOS) irrespective of age. Our aim was to investigate soluble tumor necrosis factor like weak inducer of apoptosis (sTWEAK), a cardiovascular risk marker in PCOS, and to determine if it is associated with dyslipidemia in youth. A prospective-observational study was carried out including 35 PCOS patients and 35 healthy controls. Serum sTWEAK levels were measured using commercially available kits. Multiple logistic regression analysis was then performed to verify the statistically significant differences in the possible predictors of dyslipidemia. Serum sTWEAK levels and the percentage of women with dyslipidemia were significantly higher in the PCOS group (p = 0.024 and p < 0.001, respectively). Participants were further divided into 2 subgroups based on the presence of dyslipidemia. The percentage of women with PCOS was significantly higher in the dyslipidemic group when compared with controls; 70.7 vs. 20.7%, respectively (p < 0.001). Multiple logistic regression analysis revealed that both the presence of PCOS (OR 7.924, 95% CI 2.117-29.657, p = 0.002) and increased levels of sTWEAK (>693 pg/ml; OR 3.810, 95% CI 1.075-13.501, p = 0.038) were independently associated with dyslipidemia. Increased levels of both sTWEAK and PCOS were found to be independently associated with dyslipidemia in youth. © 2016 S. Karger AG, Basel.
Hamer, Maria Andrada; Källén, Karin; Lidfeldt, Jonas; Samsioe, Göran; Teleman, Pia
2011-11-01
To outline serum estradiol levels in perimenopausal women with stress, mixed or urge incontinence. We believe the majority of urgency symptoms in perimenopausal women to be caused by a pelvic floor dysfunction and a hypermobility of the bladder neck. If this is the case, there would be no difference in estradiol levels between the groups. University hospital. In the observational Women's Health in the Lund Area study, a subset of 400/2221 women reporting urinary incontinence completed a detailed questionnaire regarding lower urinary tract symptoms and had their serum steroid hormone levels measured. Statistical analyses were made by Chi-square test, nonparametrical tests, ANOVA, multi- and univariate logistic regression analysis. Stress incontinence was reported by 196, mixed incontinence by 153 and urge incontinence by 43 women; in 369, serumestradiol values were available. Serum estradiol did not differ significantly between stress incontinent (median 49.5 pmo/l, range 2.63-875.4), urge incontinent (median 31.6 pmol/l, range 2.63-460.7) or mixed incontinent women (median 35.5 pmol/l, range 2.63-787.9, p=0.62). Logistic regression analysis correcting for age, parity, hormonal status, smoking, hysterectomy and BMI also failed to show any difference in estradiol levels between the groups (p=0.41-0.58). No significant differences in serum estradiol levels between stress, mixed or urge incontinent perimenopausal women could be demonstrated. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Tang, Li-Na; Ye, Xiao-Zhou; Yan, Qiu-Ge; Chang, Hong-Juan; Ma, Yu-Qiao; Liu, De-Bin; Li, Zhi-Gen; Yu, Yi-Zhen
2017-02-01
The risk factors of high trait anger of juvenile offenders were explored through questionnaire study in a youth correctional facility of Hubei province, China. A total of 1090 juvenile offenders in Hubei province were investigated by self-compiled social-demographic questionnaire, Childhood Trauma Questionnaire (CTQ), and State-Trait Anger Expression Inventory-II (STAXI-II). The risk factors were analyzed by chi-square tests, correlation analysis, and binary logistic regression analysis with SPSS 19.0. A total of 1082 copies of valid questionnaires were collected. High trait anger group (n=316) was defined as those who scored in the upper 27th percentile of STAXI-II trait anger scale (TAS), and the rest were defined as low trait anger group (n=766). The risk factors associated with high level of trait anger included: childhood emotional abuse, childhood sexual abuse, step family, frequent drug abuse, and frequent internet using (P<0.05 or P<0.01). Birth sequence, number of sibling, ranking in the family, identity of the main care-taker, the education level of care-taker, educational style of care-taker, family income, relationship between parents, social atmosphere of local area, frequent drinking, and frequent smoking did not predict to high level of trait anger (P>0.05). It was suggested that traumatic experience in childhood and unhealthy life style may significantly increase the level of trait anger in adulthood. The risk factors of high trait anger and their effects should be taken into consideration seriously.
Inequality in the hepatitis B awareness level in rural residents from 7 provinces in China.
Zheng, Juan; Li, Quan; Wang, Jian; Zhang, Guojie; Wangen, Knut R
2017-05-04
The hepatitis B (HB) awareness level is an important factor affecting the rates of HB virus vaccination. To better understand income-related inequalities in the HB awareness level, it is imperative to identify the sources of inequalities and assess the contribution rates of these influential factors. This study analyzed the unequal distribution of the HB awareness level and the contributions of various influential factors. We performed a cross-sectional household survey with questionnaire-based, face-to-face interviews in 7 Chinese provinces. Responses from 7271 respondents were used in this analysis. Multinomial logistic regression was used for the analysis of contributing factors, and the concentration index was used as a measure of HB awareness inequalities. The HB awareness level varied across participants with different characteristics. Multinomial logistic regression of the explanatory factors of the HB awareness level showed that several estimated coefficients and relative risk ratios were statistically significant for middle- and high-level awareness, except for sex, occupation, and household income. The concentration index of the HB knowledge score was 0.140, indicating inequality gradients disadvantageous to the poor. The contribution rate of socioeconomic factors was the largest (60.8%), followed by demographic characteristics (29.0%) and geographic factors (4.3%). Demographic, socioeconomic, and geographic factors are associated with the HB awareness inequality. Therefore, to reduce inequality, HB-related health education targeting individuals with low socioeconomic status should be performed. Less-developed provinces, especially with high proportions of poor residents, warrant particular attention. Our findings may be beneficial to improve the HB virus vaccination rate for individuals with low socioeconomic status.
2013-01-01
Background There are multiple adverse effects of anemia on human function, particularly on women. However, few researches are conducted on women anemia in rural Western China. This study mainly aims to investigate the levels and associated factors of maternal anemia between 2001 and 2005 in this region. Methods 6172 and 5372 mothers with children under three years old were selected from 8 provinces in 2001 and from 9 provinces in 2005 respectively in Western China by means of a multi-stage probability proportion to size sampling method (PPS). The blood samples were tested and related socio-demographic information was obtained through questionnaires. A two-level logistic regression model was employed to identify the determinants and provincial variations of women anemia in 2001 and 2005. Results The results indicated that the crude prevalence of women anemia in 2005 was higher than the rate in 2001(45.7% vs 33.6%). Based on the nationwide census data in 2000, the age-standardized prevalence of women anemia in the study were obtained as 38.0% in 2001 and 50.0% in 2005 respectively. Two-level logistic model analysis showed that compared to the average, women were more likely to be anemic in Guangxi and Qinghai in 2001 as well as in Chongqing and Qinghai in 2005; that women from Minority groups had higher odds of anemia in contrast with Han; that women with higher parity, longer breastfeeding duration and higher socioeconomic level had a lower rate of anemia, while age of women was positively associated with anemia. The positive correlation between women anemia and altitude was also observed. Conclusions The study demonstrated that the burden of maternal anemia in rural Western China increased considerably between 2001 and 2005. The Chinese government should conduct integrated interventions on anemia of mothers in this region. PMID:23597320
2011-01-01
Background Logistic random effects models are a popular tool to analyze multilevel also called hierarchical data with a binary or ordinal outcome. Here, we aim to compare different statistical software implementations of these models. Methods We used individual patient data from 8509 patients in 231 centers with moderate and severe Traumatic Brain Injury (TBI) enrolled in eight Randomized Controlled Trials (RCTs) and three observational studies. We fitted logistic random effects regression models with the 5-point Glasgow Outcome Scale (GOS) as outcome, both dichotomized as well as ordinal, with center and/or trial as random effects, and as covariates age, motor score, pupil reactivity or trial. We then compared the implementations of frequentist and Bayesian methods to estimate the fixed and random effects. Frequentist approaches included R (lme4), Stata (GLLAMM), SAS (GLIMMIX and NLMIXED), MLwiN ([R]IGLS) and MIXOR, Bayesian approaches included WinBUGS, MLwiN (MCMC), R package MCMCglmm and SAS experimental procedure MCMC. Three data sets (the full data set and two sub-datasets) were analysed using basically two logistic random effects models with either one random effect for the center or two random effects for center and trial. For the ordinal outcome in the full data set also a proportional odds model with a random center effect was fitted. Results The packages gave similar parameter estimates for both the fixed and random effects and for the binary (and ordinal) models for the main study and when based on a relatively large number of level-1 (patient level) data compared to the number of level-2 (hospital level) data. However, when based on relatively sparse data set, i.e. when the numbers of level-1 and level-2 data units were about the same, the frequentist and Bayesian approaches showed somewhat different results. The software implementations differ considerably in flexibility, computation time, and usability. There are also differences in the availability of additional tools for model evaluation, such as diagnostic plots. The experimental SAS (version 9.2) procedure MCMC appeared to be inefficient. Conclusions On relatively large data sets, the different software implementations of logistic random effects regression models produced similar results. Thus, for a large data set there seems to be no explicit preference (of course if there is no preference from a philosophical point of view) for either a frequentist or Bayesian approach (if based on vague priors). The choice for a particular implementation may largely depend on the desired flexibility, and the usability of the package. For small data sets the random effects variances are difficult to estimate. In the frequentist approaches the MLE of this variance was often estimated zero with a standard error that is either zero or could not be determined, while for Bayesian methods the estimates could depend on the chosen "non-informative" prior of the variance parameter. The starting value for the variance parameter may be also critical for the convergence of the Markov chain. PMID:21605357
Zhang, L L; Lu, Y H; Cheng, X L; Liu, M Y; Sun, B R; Li, C L
2016-08-01
To evaluate vitamin D status in middle-aged subjects in Beijing and explore the correlation between serum 25-hydroxyvitamin D[25(OH)D] levels and dyslipidemia. A total of 448 individuals over 40 years old were enrolled in the cross-sectional survey. The general information, blood biochemical and lipid profiles and serum 25(OH)D levels were collected. The subjects were either divided into two groups (the dyslipidemia group and the non-dyslipidemia group) based on the lipid levels, or four groups according to quartiles of 25(OH)D levels. The association between 25(OH)D levels and dyslipidemia risk was analyzed by a logistic regression analysis. A total of 234 cases were in dyslipidemia group, which accounted for 52.23% of the subjects. The serum 25(OH)D levels were significantly lower in the dyslipidemia group than in the non-dyslipidemia group both in men and in women (all P<0.05). The median serum 25(OH)D level in the total subjects was 15.7 (12.2, 20.1)μg/L with 91.1% subjects of serum 25(OH)D level<30 μg/L. The proportion of subjects with dyslipidemia (high TC, high TG, high LDL-C, or low HDL-C) increased with the decrease of 25(OH)D level quartiles (P<0.05). After adjustment of confounding factors, the logistic regression analysis showed that subjects in the lowest 25(OH) D quartile group had 143% higher risks for dyslipidemia than those in the highest quartile group. These findings indicate that 25(OH)D insufficiency is highly prevalent among middle-aged individuals and it may be associated with the risk of dyslipidemia.
Itani, Kamal M F; DePalma, Ralph G; Schifftner, Tracy; Sanders, Karen M; Chang, Barbara K; Henderson, William G; Khuri, Shukri F
2005-11-01
There has been concern that a reduced level of surgical resident supervision in the operating room (OR) is correlated with worse patient outcomes. Until September 2004, Veterans' Affairs (VA) hospitals entered in the surgical record level 3 supervision on every surgical case when the attending physician was available but not physically present in the OR or the OR suite. In this study, we assessed the impact of level 3 on risk-adjusted morbidity and mortality in the VA system. Surgical cases entered into the National Surgical Quality Improvement Program database between 1998 and 2004, from 99 VA teaching facilities, were included in a logistic regression analysis for each year. Level 3 versus all other levels of supervision were forced into the model, and patient characteristics then were selected stepwise to arrive at a final model. Confidence limits for the odds ratios were calculated by profile likelihood. A total of 610,660 cases were available for analysis. Thirty-day mortality and morbidity rates were reported in 14,441 (2.36%) and 63,079 (10.33%) cases, respectively. Level 3 supervision decreased from 8.72% in 1998 to 2.69% in 2004. In the logistic regression analysis, the odds ratios for mortality for level 3 ranged from .72 to 1.03. Only in the year 2000 were the odds ratio for mortality statistically significant at the .05 level (odds ratio, .72; 95% confidence interval, .594-.858). For morbidity, the odds ratios for level 3 supervision ranged from .66 to 1.01, and all odds ratios except for the year 2004 were statistically significant. Between 1998 and 2004, the level of resident supervision in the OR did not affect clinical outcomes adversely for surgical patients in the VA teaching hospitals.
Estimating Procurement Cost Growth Using Logistic and Multiple Regression
2003-03-01
Figure 4). The plots fail to pass the visual inspection for constant variance as well as the Breusch - Pagan test (Neter, 1996: 112) at an alpha level...plots fail to pass the visual inspection for constant variance as well as the Breusch - Pagan test at an alpha level of 0.05. Based on these findings...amount of cost growth a program will have 13 once model A deems that the program will incur cost growth. Sipple conducts validation testing on
Providing written language services in the schools: the time is now.
Fallon, Karen A; Katz, Lauren A
2011-01-01
The current study was conducted to investigate the provision of written language services by school-based speech-language pathologists (SLPs). Specifically, the study examined SLPs' knowledge, attitudes, and collaborative practices in the area of written language services as well as the variables that impact provision of these services. Public school-based SLPs from across the country were solicited for participation in an online, Web-based survey. Data from 645 full-time SLPs from 49 states were evaluated using descriptive statistics and logistic regression. Many school-based SLPs reported not providing any services in the area of written language to students with written language weaknesses. Knowledge, attitudes, and collaborative practices were mixed. A logistic regression revealed three variables likely to predict high levels of service provision in the area of written language. Data from the current study revealed that many struggling readers and writers on school-based SLPs' caseloads are not receiving services from their SLPs. Implications for SLPs' preservice preparation, continuing education, and doctoral preparation are discussed.
Sakado, K; Sakado, M; Seki, T; Kuwabara, H; Kojima, M; Sato, T; Someya, T
2001-06-01
Although a number of studies have reported on the association between obsessional personality features as measured by the Munich Personality Test (MPT) "Rigidity" scale and depression, there has been no examination of these relationships in a non-clinical sample. The dimensional scores on the MPT were compared between subjects with and without lifetime depression, using a sample of employed Japanese adults. The odds ratio for suffering from lifetime depression was estimated by multiple logistic regression analysis. To diagnose a lifetime history of depression, the Inventory to Diagnose Depression, Lifetime version (IDDL) was used. The subjects with lifetime depression scored significantly higher on the "Rigidity" scale than the subjects without lifetime depression. In our logistic regression analysis, three risk factors were identified as each independently increasing a person's risk for suffering from lifetime depression: higher levels of "Rigidity", being of the female gender, and suffering from current depressive symptoms. The MPT "Rigidity" scale is a sensitive measure of personality features that occur with depression.
Nham, Eric G; Pearl, David L; Slavic, Durda; Ouckama, Rachel; Ojkic, Davor; Guerin, Michele T
2017-08-01
Avian reovirus (ARV) is an economically significant pathogen of broiler chickens. Our objective was to determine the prevalence, geographical distribution, and seasonal variation of ARV infection among commercial broiler flocks in Ontario, Canada during grow-out. A cross-sectional study of 231 randomly selected flocks was conducted from July 2010 to January 2012. Fifteen blood samples, 15 whole intestines, and 15 cloacal swabs per flock were collected at slaughter; ELISA and PCR were used to determine a flock's ARV exposure status. Avian reovirus prevalence was 91% (95% CI: 87 to 94). District alone did not significantly explain the overall variation in the prevalence of ARV (univariable logistic regression; P = 0.073), although geographical differences were identified. The odds of ARV presence were significantly lower in the summer/autumn compared to the winter/spring (univariable exact logistic regression; P < 0.001). There was no association between flock mortality and flock ELISA mean titer or PCR status.
Nham, Eric G.; Pearl, David L.; Slavic, Durda; Ouckama, Rachel; Ojkic, Davor; Guerin, Michele T.
2017-01-01
Avian reovirus (ARV) is an economically significant pathogen of broiler chickens. Our objective was to determine the prevalence, geographical distribution, and seasonal variation of ARV infection among commercial broiler flocks in Ontario, Canada during grow-out. A cross-sectional study of 231 randomly selected flocks was conducted from July 2010 to January 2012. Fifteen blood samples, 15 whole intestines, and 15 cloacal swabs per flock were collected at slaughter; ELISA and PCR were used to determine a flock’s ARV exposure status. Avian reovirus prevalence was 91% (95% CI: 87 to 94). District alone did not significantly explain the overall variation in the prevalence of ARV (univariable logistic regression; P = 0.073), although geographical differences were identified. The odds of ARV presence were significantly lower in the summer/autumn compared to the winter/spring (univariable exact logistic regression; P < 0.001). There was no association between flock mortality and flock ELISA mean titer or PCR status. PMID:28761188
The Association between Unintended Pregnancy and Violence among Incarcerated Men and Women
Kelly, Patricia J.; Ramaswamy, Megha
2018-01-01
Background In this article, we examine the association between unintended pregnancy and individual and community level indicators of violence in a population of both women and men in the criminal justice system. Methods We conducted a cross-sectional survey with 290 women and 306 men in 3 correctional facilities in Kansas City and used logistic regression models to assess relationships between key independent variables and unintended pregnancy. Findings In gender-specific logistic regression models, women with a history of intimate partner violence were 2.02 times more likely (CI 1.15, 3.56), and those with a history of sexual abuse before age 16 were 1.23 times more likely (CI 1.02–1.49) to have experienced unintended pregnancy. Men or their family members who were victimized by neighborhood violence were 1.82 times more likely to have experienced unintended pregnancy (CI 1.01, 3.28). Discussion These findings suggest the need for gender and community-specific interventions that address the relationship between violence and unintended pregnancy. PMID:23136860
Determination of riverbank erosion probability using Locally Weighted Logistic Regression
NASA Astrophysics Data System (ADS)
Ioannidou, Elena; Flori, Aikaterini; Varouchakis, Emmanouil A.; Giannakis, Georgios; Vozinaki, Anthi Eirini K.; Karatzas, George P.; Nikolaidis, Nikolaos
2015-04-01
Riverbank erosion is a natural geomorphologic process that affects the fluvial environment. The most important issue concerning riverbank erosion is the identification of the vulnerable locations. An alternative to the usual hydrodynamic models to predict vulnerable locations is to quantify the probability of erosion occurrence. This can be achieved by identifying the underlying relations between riverbank erosion and the geomorphological or hydrological variables that prevent or stimulate erosion. Thus, riverbank erosion can be determined by a regression model using independent variables that are considered to affect the erosion process. The impact of such variables may vary spatially, therefore, a non-stationary regression model is preferred instead of a stationary equivalent. Locally Weighted Regression (LWR) is proposed as a suitable choice. This method can be extended to predict the binary presence or absence of erosion based on a series of independent local variables by using the logistic regression model. It is referred to as Locally Weighted Logistic Regression (LWLR). Logistic regression is a type of regression analysis used for predicting the outcome of a categorical dependent variable (e.g. binary response) based on one or more predictor variables. The method can be combined with LWR to assign weights to local independent variables of the dependent one. LWR allows model parameters to vary over space in order to reflect spatial heterogeneity. The probabilities of the possible outcomes are modelled as a function of the independent variables using a logistic function. Logistic regression measures the relationship between a categorical dependent variable and, usually, one or several continuous independent variables by converting the dependent variable to probability scores. Then, a logistic regression is formed, which predicts success or failure of a given binary variable (e.g. erosion presence or absence) for any value of the independent variables. The erosion occurrence probability can be calculated in conjunction with the model deviance regarding the independent variables tested. The most straightforward measure for goodness of fit is the G statistic. It is a simple and effective way to study and evaluate the Logistic Regression model efficiency and the reliability of each independent variable. The developed statistical model is applied to the Koiliaris River Basin on the island of Crete, Greece. Two datasets of river bank slope, river cross-section width and indications of erosion were available for the analysis (12 and 8 locations). Two different types of spatial dependence functions, exponential and tricubic, were examined to determine the local spatial dependence of the independent variables at the measurement locations. The results show a significant improvement when the tricubic function is applied as the erosion probability is accurately predicted at all eight validation locations. Results for the model deviance show that cross-section width is more important than bank slope in the estimation of erosion probability along the Koiliaris riverbanks. The proposed statistical model is a useful tool that quantifies the erosion probability along the riverbanks and can be used to assist managing erosion and flooding events. Acknowledgements This work is part of an on-going THALES project (CYBERSENSORS - High Frequency Monitoring System for Integrated Water Resources Management of Rivers). The project has been co-financed by the European Union (European Social Fund - ESF) and Greek national funds through the Operational Program "Education and Lifelong Learning" of the National Strategic Reference Framework (NSRF) - Research Funding Program: THALES. Investing in knowledge society through the European Social Fund.
Deciphering factors controlling groundwater arsenic spatial variability in Bangladesh
NASA Astrophysics Data System (ADS)
Tan, Z.; Yang, Q.; Zheng, C.; Zheng, Y.
2017-12-01
Elevated concentrations of geogenic arsenic in groundwater have been found in many countries to exceed 10 μg/L, the WHO's guideline value for drinking water. A common yet unexplained characteristic of groundwater arsenic spatial distribution is the extensive variability at various spatial scales. This study investigates factors influencing the spatial variability of groundwater arsenic in Bangladesh to improve the accuracy of models predicting arsenic exceedance rate spatially. A novel boosted regression tree method is used to establish a weak-learning ensemble model, which is compared to a linear model using a conventional stepwise logistic regression method. The boosted regression tree models offer the advantage of parametric interaction when big datasets are analyzed in comparison to the logistic regression. The point data set (n=3,538) of groundwater hydrochemistry with 19 parameters was obtained by the British Geological Survey in 2001. The spatial data sets of geological parameters (n=13) were from the Consortium for Spatial Information, Technical University of Denmark, University of East Anglia and the FAO, while the soil parameters (n=42) were from the Harmonized World Soil Database. The aforementioned parameters were regressed to categorical groundwater arsenic concentrations below or above three thresholds: 5 μg/L, 10 μg/L and 50 μg/L to identify respective controlling factors. Boosted regression tree method outperformed logistic regression methods in all three threshold levels in terms of accuracy, specificity and sensitivity, resulting in an improvement of spatial distribution map of probability of groundwater arsenic exceeding all three thresholds when compared to disjunctive-kriging interpolated spatial arsenic map using the same groundwater arsenic dataset. Boosted regression tree models also show that the most important controlling factors of groundwater arsenic distribution include groundwater iron content and well depth for all three thresholds. The probability of a well with iron content higher than 5mg/L to contain greater than 5 μg/L, 10 μg/L and 50 μg/L As is estimated to be more than 91%, 85% and 51%, respectively, while the probability of a well from depth more than 160m to contain more than 5 μg/L, 10 μg/L and 50 μg/L As is estimated to be less than 38%, 25% and 14%, respectively.
NASA Astrophysics Data System (ADS)
Yilmaz, Işık
2009-06-01
The purpose of this study is to compare the landslide susceptibility mapping methods of frequency ratio (FR), logistic regression and artificial neural networks (ANN) applied in the Kat County (Tokat—Turkey). Digital elevation model (DEM) was first constructed using GIS software. Landslide-related factors such as geology, faults, drainage system, topographical elevation, slope angle, slope aspect, topographic wetness index (TWI) and stream power index (SPI) were used in the landslide susceptibility analyses. Landslide susceptibility maps were produced from the frequency ratio, logistic regression and neural networks models, and they were then compared by means of their validations. The higher accuracies of the susceptibility maps for all three models were obtained from the comparison of the landslide susceptibility maps with the known landslide locations. However, respective area under curve (AUC) values of 0.826, 0.842 and 0.852 for frequency ratio, logistic regression and artificial neural networks showed that the map obtained from ANN model is more accurate than the other models, accuracies of all models can be evaluated relatively similar. The results obtained in this study also showed that the frequency ratio model can be used as a simple tool in assessment of landslide susceptibility when a sufficient number of data were obtained. Input process, calculations and output process are very simple and can be readily understood in the frequency ratio model, however logistic regression and neural networks require the conversion of data to ASCII or other formats. Moreover, it is also very hard to process the large amount of data in the statistical package.
ERIC Educational Resources Information Center
Schumacher, Phyllis; Olinsky, Alan; Quinn, John; Smith, Richard
2010-01-01
The authors extended previous research by 2 of the authors who conducted a study designed to predict the successful completion of students enrolled in an actuarial program. They used logistic regression to determine the probability of an actuarial student graduating in the major or dropping out. They compared the results of this study with those…
Carolyn B. Meyer; Sherri L. Miller; C. John Ralph
2004-01-01
The scale at which habitat variables are measured affects the accuracy of resource selection functions in predicting animal use of sites. We used logistic regression models for a wide-ranging species, the marbled murrelet, (Brachyramphus marmoratus) in a large region in California to address how much changing the spatial or temporal scale of...
ERIC Educational Resources Information Center
Monahan, Patrick O.; McHorney, Colleen A.; Stump, Timothy E.; Perkins, Anthony J.
2007-01-01
Previous methodological and applied studies that used binary logistic regression (LR) for detection of differential item functioning (DIF) in dichotomously scored items either did not report an effect size or did not employ several useful measures of DIF magnitude derived from the LR model. Equations are provided for these effect size indices.…
ERIC Educational Resources Information Center
Magis, David; Raiche, Gilles; Beland, Sebastien; Gerard, Paul
2011-01-01
We present an extension of the logistic regression procedure to identify dichotomous differential item functioning (DIF) in the presence of more than two groups of respondents. Starting from the usual framework of a single focal group, we propose a general approach to estimate the item response functions in each group and to test for the presence…
Risk Factors of Falls in Community-Dwelling Older Adults: Logistic Regression Tree Analysis
ERIC Educational Resources Information Center
Yamashita, Takashi; Noe, Douglas A.; Bailer, A. John
2012-01-01
Purpose of the Study: A novel logistic regression tree-based method was applied to identify fall risk factors and possible interaction effects of those risk factors. Design and Methods: A nationally representative sample of American older adults aged 65 years and older (N = 9,592) in the Health and Retirement Study 2004 and 2006 modules was used.…
ERIC Educational Resources Information Center
Gordovil-Merino, Amalia; Guardia-Olmos, Joan; Pero-Cebollero, Maribel
2012-01-01
In this paper, we used simulations to compare the performance of classical and Bayesian estimations in logistic regression models using small samples. In the performed simulations, conditions were varied, including the type of relationship between independent and dependent variable values (i.e., unrelated and related values), the type of variable…
Ohlmacher, G.C.; Davis, J.C.
2003-01-01
Landslides in the hilly terrain along the Kansas and Missouri rivers in northeastern Kansas have caused millions of dollars in property damage during the last decade. To address this problem, a statistical method called multiple logistic regression has been used to create a landslide-hazard map for Atchison, Kansas, and surrounding areas. Data included digitized geology, slopes, and landslides, manipulated using ArcView GIS. Logistic regression relates predictor variables to the occurrence or nonoccurrence of landslides within geographic cells and uses the relationship to produce a map showing the probability of future landslides, given local slopes and geologic units. Results indicated that slope is the most important variable for estimating landslide hazard in the study area. Geologic units consisting mostly of shale, siltstone, and sandstone were most susceptible to landslides. Soil type and aspect ratio were considered but excluded from the final analysis because these variables did not significantly add to the predictive power of the logistic regression. Soil types were highly correlated with the geologic units, and no significant relationships existed between landslides and slope aspect. ?? 2003 Elsevier Science B.V. All rights reserved.
A Method for Calculating the Probability of Successfully Completing a Rocket Propulsion Ground Test
NASA Technical Reports Server (NTRS)
Messer, Bradley
2007-01-01
Propulsion ground test facilities face the daily challenge of scheduling multiple customers into limited facility space and successfully completing their propulsion test projects. Over the last decade NASA s propulsion test facilities have performed hundreds of tests, collected thousands of seconds of test data, and exceeded the capabilities of numerous test facility and test article components. A logistic regression mathematical modeling technique has been developed to predict the probability of successfully completing a rocket propulsion test. A logistic regression model is a mathematical modeling approach that can be used to describe the relationship of several independent predictor variables X(sub 1), X(sub 2),.., X(sub k) to a binary or dichotomous dependent variable Y, where Y can only be one of two possible outcomes, in this case Success or Failure of accomplishing a full duration test. The use of logistic regression modeling is not new; however, modeling propulsion ground test facilities using logistic regression is both a new and unique application of the statistical technique. Results from this type of model provide project managers with insight and confidence into the effectiveness of rocket propulsion ground testing.
Fei, Yang; Hu, Jian; Gao, Kun; Tu, Jianfeng; Li, Wei-Qin; Wang, Wei
2017-06-01
To construct a radical basis function (RBF) artificial neural networks (ANNs) model to predict the incidence of acute pancreatitis (AP)-induced portal vein thrombosis. The analysis included 353 patients with AP who had admitted between January 2011 and December 2015. RBF ANNs model and logistic regression model were constructed based on eleven factors relevant to AP respectively. Statistical indexes were used to evaluate the value of the prediction in two models. The predict sensitivity, specificity, positive predictive value, negative predictive value and accuracy by RBF ANNs model for PVT were 73.3%, 91.4%, 68.8%, 93.0% and 87.7%, respectively. There were significant differences between the RBF ANNs and logistic regression models in these parameters (P<0.05). In addition, a comparison of the area under receiver operating characteristic curves of the two models showed a statistically significant difference (P<0.05). The RBF ANNs model is more likely to predict the occurrence of PVT induced by AP than logistic regression model. D-dimer, AMY, Hct and PT were important prediction factors of approval for AP-induced PVT. Copyright © 2017 Elsevier Inc. All rights reserved.
Ali Morowatisharifabad, Mohammad; Abdolkarimi, Mahdi; Asadpour, Mohammad; Fathollahi, Mahmood Sheikh; Balaee, Parisa
2018-04-15
Theory-based education tailored to target behaviour and group can be effective in promoting physical activity. The purpose of this study was to examine the predictive power of Protection Motivation Theory on intent and behaviour of Physical Activity in Patients with Type 2 Diabetes. This descriptive study was conducted on 250 patients in Rafsanjan, Iran. To examine the scores of protection motivation theory structures, a researcher-made questionnaire was used. Its validity and reliability were confirmed. The level of physical activity was also measured by the International Short - form Physical Activity Inventory. Its validity and reliability were also approved. Data were analysed by statistical tests including correlation coefficient, chi-square, logistic regression and linear regression. The results revealed that there was a significant correlation between all the protection motivation theory constructs and the intention to do physical activity. The results showed that the Theory structures were able to predict 60% of the variance of physical activity intention. The results of logistic regression demonstrated that increase in the score of physical activity intent and self - efficacy increased the chance of higher level of physical activity by 3.4 and 1.5 times, respectively OR = (3.39, 1.54). Considering the ability of protection motivation theory structures to explain the physical activity behaviour, interventional designs are suggested based on the structures of this theory, especially to improve self -efficacy as the most powerful factor in predicting physical activity intention and behaviour.
Gao, Yu; Shi, Lu
2015-08-21
To better understand the documented link between mindfulness and longevity, we examine the association between mindfulness and conscious avoidance of secondhand smoke (SHS), as well as the association between mindfulness and physical activity. In Shanghai University of Finance and Economics (SUFE) we surveyed a convenience sample of 1516 college freshmen. We measured mindfulness, weekly physical activity, and conscious avoidance of secondhand smoke, along with demographic and behavioral covariates. We used a multilevel logistic regression to test the association between mindfulness and conscious avoidance of secondhand smoke, and used a Tobit regression model to test the association between mindfulness and metabolic equivalent hours per week. In both models the home province of the student respondent was used as the cluster variable, and demographic and behavioral covariates, such as age, gender, smoking history, household registration status (urban vs. rural), the perceived smog frequency in their home towns, and the asthma diagnosis. The logistic regression of consciously avoiding SHS shows that a higher level of mindfulness was associated with an increase in the odds ratio of conscious SHS avoidance (logged odds: 0.22, standard error: 0.07, p < 0.01). The Tobit regression shows that a higher level of mindfulness was associated with more metabolic equivalent hours per week (Tobit coefficient: 4.09, standard error: 1.13, p < 0.001). This study is an innovative attempt to study the behavioral issue of secondhand smoke from the perspective of the potential victim, rather than the active smoker. The observed associational patterns here are consistent with previous findings that mindfulness is associated with healthier behaviors in obesity prevention and substance use. Research designs with interventions are needed to test the causal link between mindfulness and these healthy behaviors.
Gao, Yu; Shi, Lu
2015-01-01
Introduction: To better understand the documented link between mindfulness and longevity, we examine the association between mindfulness and conscious avoidance of secondhand smoke (SHS), as well as the association between mindfulness and physical activity. Method: In Shanghai University of Finance and Economics (SUFE) we surveyed a convenience sample of 1516 college freshmen. We measured mindfulness, weekly physical activity, and conscious avoidance of secondhand smoke, along with demographic and behavioral covariates. We used a multilevel logistic regression to test the association between mindfulness and conscious avoidance of secondhand smoke, and used a Tobit regression model to test the association between mindfulness and metabolic equivalent hours per week. In both models the home province of the student respondent was used as the cluster variable, and demographic and behavioral covariates, such as age, gender, smoking history, household registration status (urban vs. rural), the perceived smog frequency in their home towns, and the asthma diagnosis. Results: The logistic regression of consciously avoiding SHS shows that a higher level of mindfulness was associated with an increase in the odds ratio of conscious SHS avoidance (logged odds: 0.22, standard error: 0.07, p < 0.01). The Tobit regression shows that a higher level of mindfulness was associated with more metabolic equivalent hours per week (Tobit coefficient: 4.09, standard error: 1.13, p < 0.001). Discussion: This study is an innovative attempt to study the behavioral issue of secondhand smoke from the perspective of the potential victim, rather than the active smoker. The observed associational patterns here are consistent with previous findings that mindfulness is associated with healthier behaviors in obesity prevention and substance use. Research designs with interventions are needed to test the causal link between mindfulness and these healthy behaviors. PMID:26308029
Serum osteocalcin is significantly related to indices of obesity and lipid profile in Malaysian men.
Chin, Kok-Yong; Ima-Nirwana, Soelaiman; Mohamed, Isa Naina; Ahmad, Fairus; Ramli, Elvy Suhana Mohd; Aminuddin, Amilia; Ngah, Wan Zurinah Wan
2014-01-01
Recent studies revealed a possible reciprocal relationship between the skeletal system and obesity and lipid metabolism, mediated by osteocalcin, an osteoblast-specific protein. This study aimed to validate the relationship between serum osteocalcin and indices of obesity and lipid parameters in a group of Malaysian men. A total of 373 men from the Malaysian Aging Male Study were included in the analysis. Data on subjects' demography, body mass index (BMI), body fat (BF) mass, waist circumference (WC), serum osteocalcin and fasting lipid levels were collected. Bioelectrical impendence (BIA) method was used to estimate BF. Multiple linear and binary logistic regression analyses were performed to analyze the association between serum osteocalcin and the aforementioned variables, with adjustment for age, ethnicity and BMI. Multiple regression results indicated that weight, BMI, BF mass, BF %, WC were significantly and negatively associated with serum osteocalcin (p < 0.001). There was a significant positive association between serum osteocalcin and high density lipoprotein (HDL) cholesterol (p = 0.032). Binary logistic results indicated that subjects with low serum osteocalcin level were more likely to be associated with high BMI (obese and overweight), high BF%, high WC and low HDL cholesterol (p < 0.05). Subjects with high osteocalcin level also demonstrated high total cholesterol level (p < 0.05) but this association was probably driven by high HDL level. These variables were not associated with serum C-terminal of telopeptide crosslinks in the subjects (p > 0.05). Serum osteocalcin is associated with indices of obesity and HDL level in men. These relationships should be validated by a longitudinal study, with comprehensive hormone profile testing.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yahya, Noorazrul, E-mail: noorazrul.yahya@research.uwa.edu.au; Ebert, Martin A.; Bulsara, Max
Purpose: Given the paucity of available data concerning radiotherapy-induced urinary toxicity, it is important to ensure derivation of the most robust models with superior predictive performance. This work explores multiple statistical-learning strategies for prediction of urinary symptoms following external beam radiotherapy of the prostate. Methods: The performance of logistic regression, elastic-net, support-vector machine, random forest, neural network, and multivariate adaptive regression splines (MARS) to predict urinary symptoms was analyzed using data from 754 participants accrued by TROG03.04-RADAR. Predictive features included dose-surface data, comorbidities, and medication-intake. Four symptoms were analyzed: dysuria, haematuria, incontinence, and frequency, each with three definitions (grade ≥more » 1, grade ≥ 2 and longitudinal) with event rate between 2.3% and 76.1%. Repeated cross-validations producing matched models were implemented. A synthetic minority oversampling technique was utilized in endpoints with rare events. Parameter optimization was performed on the training data. Area under the receiver operating characteristic curve (AUROC) was used to compare performance using sample size to detect differences of ≥0.05 at the 95% confidence level. Results: Logistic regression, elastic-net, random forest, MARS, and support-vector machine were the highest-performing statistical-learning strategies in 3, 3, 3, 2, and 1 endpoints, respectively. Logistic regression, MARS, elastic-net, random forest, neural network, and support-vector machine were the best, or were not significantly worse than the best, in 7, 7, 5, 5, 3, and 1 endpoints. The best-performing statistical model was for dysuria grade ≥ 1 with AUROC ± standard deviation of 0.649 ± 0.074 using MARS. For longitudinal frequency and dysuria grade ≥ 1, all strategies produced AUROC>0.6 while all haematuria endpoints and longitudinal incontinence models produced AUROC<0.6. Conclusions: Logistic regression and MARS were most likely to be the best-performing strategy for the prediction of urinary symptoms with elastic-net and random forest producing competitive results. The predictive power of the models was modest and endpoint-dependent. New features, including spatial dose maps, may be necessary to achieve better models.« less
Predicting nest success from habitat features in aspen forests of the central Rocky Mountains
Heather M. Struempf; Deborah M. Finch; Gregory Hayward; Stanley Anderson
2001-01-01
We collected nesting data on bird use of aspen stands in the Routt and Medicine Bow National Forests between 1987 and 1989. We found active nest sites of 28 species of small nongame birds on nine study plots in undisturbed aspen forests. We compared logistic regression models predicting nest success (at least one nestling) from nest-site or stand-level habitat...
ERIC Educational Resources Information Center
Papanastasiou, Elena C.; Zembylas, Michalinos
2006-01-01
The data obtained from high-school seniors for the Third International Mathematics and Science Study (TIMSS) for the country of Cyprus appear to be contradictory. Although Cypriot students did not perform well in mathematics in elementary school, middle school, and in the non-advanced sectors of high school, students in advanced mathematics…
Jose E. Negron; Jill L. Wilson
2003-01-01
We examined attributes of pinon pine (Pinus edulis) associated with the probability of infestation by pinon ips (Ips confusus) in an outbreak in the Coconino National Forest, Arizona. We used data collected from 87 plots, 59 infested and 28 uninfested, and a logistic regression approach to estimate the probability ofinfestation based on plotand tree-level attributes....
ERIC Educational Resources Information Center
Bozpolat, Ebru
2016-01-01
The purpose of this study was to reveal whether the low, medium, and high level self-regulated learning strategies of third year students at the Education Faculty of Cumhuriyet University can be predicted by the variables of gender, academic self-efficacy, and general academic average. The study uses the Relational Screening Model. The dependent…
ERIC Educational Resources Information Center
Secolsky, Charles; Krishnan, Sathasivam; Judd, Thomas P.
2013-01-01
The community colleges in the state of New Jersey went through a process of establishing statewide cut-off scores for English and mathematics placement tests. The colleges wanted to communicate to secondary schools a consistent preparation that would be necessary for enrolling in Freshman Composition and College Algebra at the community college…
ERIC Educational Resources Information Center
Moses, Tim; Miao, Jing; Dorans, Neil
2010-01-01
This study compared the accuracies of four differential item functioning (DIF) estimation methods, where each method makes use of only one of the following: raw data, logistic regression, loglinear models, or kernel smoothing. The major focus was on the estimation strategies' potential for estimating score-level, conditional DIF. A secondary focus…
Zhao, Lei; Li, Weizheng; Su, Zhihong; Liu, Yong; Zhu, Liyong; Zhu, Shaihong
2018-05-29
This study investigated the role of preoperative fasting C-peptide (FCP) levels in predicting diabetic outcomes in low-BMI Chinese patients following Roux-en-Y gastric bypass (RYGB) by comparing the metabolic outcomes of patients with FCP > 1 ng/ml versus FCP ≤ 1 ng/ml. The study sample included 78 type 2 diabetes mellitus patients with an average BMI < 30 kg/m 2 at baseline. Patients' parameters were analyzed before and after surgery, with a 2-year follow-up. A univariate logistic regression analysis and multivariate analysis of variance between the remission and improvement group were performed to determine factors that were associated with type 2 diabetes remission after RYGB. Linear correlation analyses between FCP and metabolic parameters were performed. Patients were divided into two groups: FCP > 1 ng/ml and FCP ≤ 1 ng/ml, with measured parameters compared between the groups. Patients' fasting plasma glucose, 2-h postprandial plasma glucose, FCP, and HbA1c improved significantly after surgery (p < 0.05). Factors associated with type 2 diabetes remission were BMI, 2hINS, and FCP at the univariate logistic regression analysis (p < 0.05). Multivariate logistic regression analysis was performed then showed the results were more related to FCP (OR = 2.39). FCP showed a significant linear correlation with fasting insulin and BMI (p < 0.05). There was a significant difference in remission rate between the FCP > 1 ng/ml and FCP ≤ 1 ng/ml groups (p = 0.01). The parameters of patients with FCP > 1 ng/ml, including BMI, plasma glucose, HbA1c, and plasma insulin, decreased markedly after surgery (p < 0.05). FCP level is a significant predictor of diabetes outcomes after RYGB in low-BMI Chinese patients. An FCP level of 1 ng/ml may be a useful threshold for predicting surgical prognosis, with FCP > 1 ng/ml predicting better clinical outcomes following RYGB.
Smoking in young adolescents: an approach with multilevel discrete choice models
Pinilla, J; Gonzalez, B; Barber, P; Santana, Y
2002-01-01
Design: Cross sectional analysis performed by multilevel logistic regression with pupils at the first level and schools at the second level. The data came from a stratified sample of students surveyed on their own, their families' and their friends' smoking habits, their schools, and their awareness of cigarette prices and advertising. Setting: The study was performed in the Island of Gran Canaria, Spain. Participants: 1877 students from 30 secondary schools in spring of 2000 (model's effective sample sizes 1697 and 1738) . Main results: 14.2% of the young teenagers surveyed use tobacco, almost half of them (6.3% of the total surveyed) on a daily basis. According to the ordered logistic regression model, to have a smoker as the best friend increases significantly the probability of smoking (odds ratio: 6.96, 95% confidence intervals (CI) (4.93 to 9.84), and the same stands for one smoker living at home compared with a smoking free home (odds ratio: 2.03, 95% CI 1.22 to 3.36). Girls smoke more (odds ratio: 1.85, 95% CI 1.33 to 2.59). Experience with alcohol, and lack of interest in studies are also significant factors affecting smoking. Multilevel models of logistic regression showed that factors related to the school affect the smoking behaviour of young teenagers. More specifically, whether a school complies with antismoking rules or not is the main factor to predict smoking prevalence in schools. The remainder of the differences can be attributed to individual and family characteristics, tobacco consumption by parents or other close relatives, and peer group. Conclusions: A great deal of the individual differences in smoking are explained by factors at the school level, therefore the context is very relevant in this case. The most relevant predictors for smoking in young adolescents include some factors related to the schools they attend. One variable stood out in accounting for the school to school differences: how well they enforced the no smoking rule. Therefore we can prevent or delay tobacco smoking in adolescents not only by publicising health risks, but also by better enforcing no smoking rules in schools. PMID:11854347
Koo, Yong Seo; Song, Jin-Young; Joo, Eun-Yeon; Lee, Heon-Jeong; Lee, Eunil; Lee, Sang-kun; Jung, Ki-Young
2016-01-01
Obesity is a common disorder with many complications. Although chronodisruption plays a role in obesity, few epidemiological studies have investigated the association between artificial light at night (ALAN) and obesity. Since sleep health is related to both obesity and ALAN, we investigated the association between outdoor ALAN and obesity after adjusting for sleep health. We also investigated the association between outdoor ALAN and sleep health. This cross-sectional survey included 8526 adults, 39-70 years of age, who participated in the Korean Genome and Epidemiology Study. Outdoor ALAN data were obtained from satellite images provided by the US Defense Meteorological Satellite Program. We obtained individual data regarding outdoor ALAN; body mass index; depression; and sleep health including sleep duration, mid-sleep time, and insomnia; and other demographic data including age, sex, educational level, type of residential building, monthly household income, alcohol consumption, smoking status and consumption of caffeine or alcohol before sleep. A logistic regression model was used to investigate the association between outdoor ALAN and obesity. The prevalence of obesity differed significantly according to sex (women 47% versus men 39%, p < 0.001) and outdoor ALAN (high 55% versus low 40%, p < 0.001). Univariate logistic regression analysis revealed a significant association between high outdoor ALAN and obesity (odds ratio [OR] 1.24, 95% confidence interval [CI] 1.14-1.35, p < 0.001). Furthermore, multivariate logistic regression analyses showed that high outdoor ALAN was significantly associated with obesity after adjusting for age and sex (OR 1.25, 95% CI 1.14-1.37, p < 0.001) and even after controlling for various other confounding factors including age, sex, educational level, type of residential building, monthly household income, alcohol consumption, smoking, consumption of caffeine or alcohol before sleep, delayed sleep pattern, short sleep duration and habitual snoring (OR 1.20, 95% CI 1.06-1.36, p = 0.003). The findings of our study provide epidemiological evidence that outdoor ALAN is significantly related to obesity.
Didarloo, Alireza; Nabilou, Bahram; Khalkhali, Hamid Reza
2017-11-03
Breast cancer is a life-threatening condition affecting women around the world. The early detection of breast lumps using a breast self-examination (BSE) is important for the prevention and control of this disease. The aim of this study was to examine BSE behavior and its predictive factors among female university students using the Health Belief Model (HBM). This investigation was a cross-sectional survey carried out with 334 female students at Urmia University of Medical Sciences in the northwest of Iran. To collect the necessary data, researchers applied a valid and reliable three-part questionnaire. The data were analyzed using descriptive statistics and a chi-square test, in addition to multivariate logistic regression statistics in SPSS software version 16.0 (SPSS Inc., Chicago, IL, USA). The results indicated that 82 of the 334 participants (24.6%) reported practicing BSEs. Multivariate logistic regression analyses showed that high perceived severity [OR = 2.38, 95% CI = (1.02-5.54)], high perceived benefits [OR = 1.94, 95% CI = (1.09-3.46)], and high perceived self-efficacy [OR = 13.15, 95% CI = (3.64-47.51)] were better predictors of BSE behavior (P < 0.05) than low perceived severity, benefits, and self-efficacy. The findings also showed that a high level of knowledge compared to a low level of knowledge [OR = 5.51, 95% CI = (1.79-16.86)] and academic undergraduate and graduate degrees compared to doctoral degrees [OR = 2.90, 95% CI = (1.42-5.92)] of the participants were predictors of BSE performance (P < 0.05). The study revealed that the HBM constructs are able to predict BSE behavior. Among these constructs, self-efficacy was the most important predictor of the behavior. Interventions based on the constructs of perceived self-efficacy, benefits, and severity are recommended for increasing women's regular screening for breast cancer.
Domingueti, Caroline Pereira; Fóscolo, Rodrigo Bastos; Dusse, Luci Maria S; Reis, Janice Sepúlveda; Carvalho, Maria das Graças; Gomes, Karina Braga; Fernandes, Ana Paula
2018-02-01
Objective This study aimed to evaluate the association between different renal biomarkers with D-Dimer levels in diabetes mellitus (DM1) patients group classified as: low D-Dimer levels (< 318 ng/mL), which included first and second D-Dimer tertiles, and high D-Dimer levels (≥ 318 ng/mL), which included third D-Dimer tertile. Materials and methods D-Dimer and cystatin C were measured by ELISA. Creatinine and urea were determined by enzymatic method. Estimated glomerular filtration rate (eGFR) was calculated using CKD-EPI equation. Albuminuria was assessed by immunoturbidimetry. Presence of renal disease was evaluated using each renal biomarker: creatinine, urea, cystatin C, eGFR and albuminuria. Bivariate logistic regression analysis was performed to assess which renal biomarkers are associated with high D-Dimer levels and odds ratio was calculated. After, multivariate logistic regression analysis was performed to assess which renal biomarkers are associated with high D-Dimer levels (after adjusting for sex and age) and odds ratio was calculated. Results Cystatin C presented a better association [OR of 9.8 (3.8-25.5)] with high D-Dimer levels than albuminuria, creatinine, eGFR and urea [OR of 5.3 (2.2-12.9), 8.4 (2.5-25.4), 9.1 (2.6-31.4) and 3.5 (1.4-8.4), respectively] after adjusting for sex and age. All biomarkers showed a good association with D-Dimer levels, and consequently, with hypercoagulability status, and cystatin C showed the best association among them. Conclusion Therefore, cystatin C might be useful to detect patients with incipient diabetic kidney disease that present an increased risk of cardiovascular disease, contributing to an early adoption of reno and cardioprotective therapies.
Effect of environmental molds on risk of death from asthma during the pollen season.
Targonski, P V; Persky, V W; Ramekrishnan, V
1995-05-01
Many studies have noted an association of ambient aeroallergen levels with exacerbation of asthma. This study was undertaken to examine the relationship of aeroallergen levels with asthma-related mortality in Chicago. The association of environmental aeroallergen levels with death caused by asthma among 5- to 34-year-olds in Chicago was examined for the period of 1985 through 1989. Logistic regression analysis was used to compare the probability of a death caused by asthma occurring on the basis of environmental tree, grass, or ragweed pollen and mold spore levels. Mean mold spore levels but not tree, grass, or ragweed pollen levels were significantly higher for days on which asthma-related death occurred than for days on which no deaths occurred (z = 2.80, p < 0.005). The odds of a death caused by asthma occurring on days with mold spore counts of 1000 spores per cubic meter or greater was 2.16 times higher (95% confidence interval = 1.31, 3.56, p = 0.003) than on days on which mold spore counts were less than 1000 spores per cubic meter. The association with mold spore levels remained significant on multivariate logistic regression with mold spore counts measured as a continuous variable and controlling for pollens, with the odds of an asthma-related death occurring being 1.2 times higher (95% confidence interval = 1.07-1.34) for every increase of 1000 spores per cubic meter in daily mold spore levels. Although death caused by asthma also involves personal, social, and medical access factors, these data suggest that exposure to environmental molds may play a role in asthma-related mortality and should be considered in prevention strategies.
Orish, Verner N; Onyeabor, Onyekachi S; Boampong, Johnson N; Afoakwah, Richmond; Nwaefuna, Ekene; Acquah, Samuel; Orish, Esther O; Sanyaolu, Adekunle O; Iriemenam, Nnaemeka C
2014-08-01
This study investigated the influence of the level of education on HIV infection among pregnant women attending antenatal care in Sekondi-Takoradi, Ghana. A cross-sectional study was conducted at four hospitals in the Sekondi-Takoradi metropolis. The study group comprised 885 consenting pregnant women attending antenatal care clinics. Questionnaires were administered and venous blood samples were screened for HIV and other parameters. Multivariable logistic regression analyses were performed to determine the association between the level of education attained by the pregnant women and their HIV statuses. The data showed that 9.83% (87/885) of the pregnant women were HIV seropositive while 90.17% (798/885) were HIV seronegative. There were significant differences in mean age (years) between the HIV seropositive women (27.45 ± 5.5) and their HIV seronegative (26.02 ± 5.6) counterparts (p = .026) but the inference disappeared after adjustment (p = .22). Multivariable logistic regression analysis revealed that pregnant women with secondary/tertiary education were less likely to have HIV infection compared with those with none/primary education (adjusted OR, 0.53; 95% CI, 0.30-0.91; p = .022). Our data showed an association with higher level of education and HIV statuses of the pregnant women. It is imperative to encourage formal education among pregnant women in this region.
Proximity to sports facilities and sports participation for adolescents in Germany.
Reimers, Anne K; Wagner, Matthias; Alvanides, Seraphim; Steinmayr, Andreas; Reiner, Miriam; Schmidt, Steffen; Woll, Alexander
2014-01-01
To assess the relationship between proximity to specific sports facilities and participation in the corresponding sports activities for adolescents in Germany. A sample of 1,768 adolescents aged 11-17 years old and living in 161 German communities was examined. Distances to the nearest sports facilities were calculated as an indicator of proximity to sports facilities using Geographic Information Systems (GIS). Participation in specific leisure-time sports activities in sports clubs was assessed using a self-report questionnaire and individual-level socio-demographic variables were derived from a parent questionnaire. Community-level socio-demographics as covariates were selected from the INKAR database, in particular from indicators and maps on land development. Logistic regression analyses were conducted to examine associations between proximity to the nearest sports facilities and participation in the corresponding sports activities. The logistic regression analyses showed that girls residing longer distances from the nearest gym were less likely to engage in indoor sports activities; a significant interaction between distances to gyms and level of urbanization was identified. Decomposition of the interaction term showed that for adolescent girls living in rural areas participation in indoor sports activities was positively associated with gym proximity. Proximity to tennis courts and indoor pools was not associated with participation in tennis or water sports, respectively. Improved proximity to gyms is likely to be more important for female adolescents living in rural areas.
Carter, Janet M.; Moran, Michael J.; Zogorski, John S.; Price, Curtis V.
2012-01-01
Multiple lines of evidence for indicating factors associated with the sources, transport, and fate of chloroform and three other trihalomethanes (THMs) in untreated groundwater were revealed by evaluating low-level analytical results and logistic regression results for THMs. Samples of untreated groundwater from wells used for drinking water were collected from 1996-2007 from 2492 wells across the United States and analyzed for chloroform, bromodichloromethane, dibromochloromethane, and bromoform by a low-level analytical method implemented in April 1996. Using an assessment level of 0.02 μg/L, chloroform was detected in 36.5% of public-well samples and 17.6% of domestic-well samples, with most concentrations less than 1 μg/L. Brominated THMs occurred less frequently than chloroform but more frequently in public-well samples than domestic-well samples. For both public and domestic wells, THMs occurred most frequently in urban areas. Logistic regression analyses showed that the occurrence of THMs was related to nonpoint sources such as urban land use and to point sources like septic systems. The frequent occurrence and concentration distribution pattern of THMs, as well as their frequent co-occurrence with other organic compounds and nitrate, all known to have anthropogenic sources, and the positive associations between THM occurrence and dissolved oxygen and recharge indicate the recycling of water that contains THMs and other anthropogenic contaminants.
Sun, Hokeun; Wang, Shuang
2013-05-30
The matched case-control designs are commonly used to control for potential confounding factors in genetic epidemiology studies especially epigenetic studies with DNA methylation. Compared with unmatched case-control studies with high-dimensional genomic or epigenetic data, there have been few variable selection methods for matched sets. In an earlier paper, we proposed the penalized logistic regression model for the analysis of unmatched DNA methylation data using a network-based penalty. However, for popularly applied matched designs in epigenetic studies that compare DNA methylation between tumor and adjacent non-tumor tissues or between pre-treatment and post-treatment conditions, applying ordinary logistic regression ignoring matching is known to bring serious bias in estimation. In this paper, we developed a penalized conditional logistic model using the network-based penalty that encourages a grouping effect of (1) linked Cytosine-phosphate-Guanine (CpG) sites within a gene or (2) linked genes within a genetic pathway for analysis of matched DNA methylation data. In our simulation studies, we demonstrated the superiority of using conditional logistic model over unconditional logistic model in high-dimensional variable selection problems for matched case-control data. We further investigated the benefits of utilizing biological group or graph information for matched case-control data. We applied the proposed method to a genome-wide DNA methylation study on hepatocellular carcinoma (HCC) where we investigated the DNA methylation levels of tumor and adjacent non-tumor tissues from HCC patients by using the Illumina Infinium HumanMethylation27 Beadchip. Several new CpG sites and genes known to be related to HCC were identified but were missed by the standard method in the original paper. Copyright © 2012 John Wiley & Sons, Ltd.
Ahn, Borami; Kim, Shin-Hye; Park, Mi-Jung
2017-01-01
To assess blood cadmium levels in Korean adolescents with respect to demographic and lifestyle factors. We analyzed data from the Korea National Health and Nutrition Examination Survey from 2010 to 2013, totaling 1472 adolescents aged 10-18 years. Geometric means of blood cadmium were calculated using a complex samples general linear model to compare blood levels in different demographic and lifestyle groups. Multivariate logistic regression analyses were also used to find predictors for high blood cadmium (>90th percentile). The geometric mean of the blood cadmium concentrations was 0.30μg/L in Korean adolescents. Older age, type of housing (multifamily house and commercial building), smoking and alcohol consumption, and iron deficiency/iron deficiency anemia (IDA) were significantly associated with higher blood cadmium concentrations (P<0.05). Blood cadmium concentrations were not significantly affected by gender, region, body mass index status, or household income. In multivariate logistic regression analysis, independent predictors for higher blood cadmium levels included current smoker (OR=7.77), alcohol consumption (OR=4.31), living in a multifamily house or commercial building (OR=3.11-3.46), and IDA (OR=2.64). Possible associations between blood cadmium levels and type of housing or alcohol consumption in adolescents are suggested for the first time in this study. Further studies are needed to elucidate the mechanism of these findings. Copyright © 2016 Elsevier GmbH. All rights reserved.
Stages of syphilis in South China - a multilevel analysis of early diagnosis.
Wong, Ngai Sze; Huang, Shujie; Zheng, Heping; Chen, Lei; Zhao, Peizhen; Tucker, Joseph D; Yang, Li Gang; Goh, Beng Tin; Yang, Bin
2017-01-31
Early diagnosis of syphilis and timely treatment can effectively reduce ongoing syphilis transmission and morbidity. We examined the factors associated with the early diagnosis of syphilis to inform syphilis screening strategic planning. In an observational study, we analyzed reported syphilis cases in Guangdong Province, China (from 2014 to mid-2015) accessed from the national case-based surveillance system. We categorized primary and secondary syphilis cases as early diagnosis and categorized latent and tertiary syphilis as delayed diagnosis. Univariate analyses and multivariable logistic regressions were performed to identify the factors associated with early diagnosis. We also examined the factors associated with early diagnosis at the individual and city levels in multilevel logistic regression models with cases nested by city (n = 21), adjusted for age at diagnosis and gender. Among 83,944 diagnosed syphilis cases, 22% were early diagnoses. The city-level early diagnosis rate ranged from 7 to 46%, consistent with substantial geographic variation as shown in the multilevel model. Early diagnosis was associated with cases presenting to specialist clinics for screening, being male and attaining higher education level. Cases received syphilis testing in institutions and hospitals, and diagnosed in hospitals were less likely to be in early diagnosis. At the city-level, cases living in a city equipped with more hospitals per capita were less likely to be early diagnosis. To enhance early diagnosis of syphilis, city-specific syphilis screening strategies with a mix of passive and client/provider-initiated testing might be a useful approach.
Fibrinogen: cardiometabolic risk marker in obese or overweight children and adolescents.
Azevedo, Waldeneide F; Cantalice, Anajás S C; Gonzaga, Nathalia C; Simões, Mônica O da S; Guimarães, Anna Larissa V; Carvalho, Danielle F de; Medeiros, Carla Campos Muniz
2015-01-01
To determine the prevalence of increased serum fibrinogen levels and its association with cardiometabolic risk factors in overweight or obese children and adolescents. Cross-sectional study with 138 children and adolescents (overweight or obese) followed at a reference outpatient clinic of the public health care network. Fibrinogen concentration was divided into quartiles, and values above or equal to the third quartile were considered high. The association between high fibrinogen values and cardiometabolic risk factors was assessed using Pearson's chi-squared test or Fisher's exact test, as necessary. Logistic regression was used to adjust variables predictive of fibrinogen levels. Analyses were performed using SPSS version 22.0 and SAS software, considering a confidence interval of 95%. Serum fibrinogen levels were elevated in 28.3% of individuals, showing association with the presence of high CRP (p=0.003, PR: 2.41, 95% CI: 1.30-4.46) and the presence of four or more risk factors (p=0.042; PR: 1.78, 95% CI: 1.00-3.17). After a logistic regression, only elevated CRP remained associated with altered fibrinogen levels (p=0.024; PR: 1.32; 95% CI: 1.09-5.25). Increased fibrinogen was prevalent in the study population and was associated with ultrasensitive C-reactive protein and the presence of four or more cardiovascular risk factors; it should be included in the assessment of individuals at risk. Copyright © 2015 Sociedade Brasileira de Pediatria. Published by Elsevier Editora Ltda. All rights reserved.
Biocultural Predictors of Motor Coordination Among Prepubertal Boys and Girls.
Luz, Leonardo G O; Valente-Dos-Santos, João; Luz, Tatiana D D; Sousa-E-Silva, Paulo; Duarte, João P; Machado-Rodrigues, Aristides; Seabra, André; Santos, Rute; Cumming, Sean P; Coelho-E-Silva, Manuel J
2018-02-01
This study aimed to predict motor coordination from a matrix of biocultural factors for 173 children (89 boys, 84 girls) aged 7-9 years who were assessed with the Körperkoordinationtest für Kinder test battery. Socioeconomic variables included built environment, area of residence, mother's educational level, and mother's physical activity level (using the International Physical Activity Questionnaire [short version]). The behavioral domain was marked by participation in organized sports and habitual physical activity measured by accelerometers ( ActiGraph GT1M). Indicators of biological development included somatic maturation and body mass index. Among males, the best logistic regression model to explain motor coordination (Nagelkerke R 2 = 50.8; χ 2 = 41.166; p < .001) emerged from age-group (odds ratio [OR]: 0.007-0.065), late maturation (OR = 0.174), normal body weight status (OR = 0.116), mother's educational level (OR = 0.129), and urban area of residence (OR = 0.236). Among girls, the best logistic regression to explain motor coordination (Nagelkerke R 2 = 40.8; χ 2 = 29.933; p < .01) derived from age (OR: 0.091-0.384), normal body mass index (OR = 0.142), participation in organized sport (OR = 0.121), and mother's physical activity level (OR = 0.183). This sex-specific, ecological approach to motor coordination proficiency may help promote physical activity during prepubertal years through familiar determinants.
Yang, Tingzhong; Peng, Sihui; Barnett, Ross; Zhang, Chichen
2018-01-01
Ecological models have emphasized that short sleep duration (SSD) is influenced by both individual and environmental variables. However, few studies have considered the latter. The present study explores the influence of urban and regional contextual factors, net of individual characteristics, on the prevalence of SSD among university students in China. Participants were 11,954 students, who were identified through a multistage survey sampling process conducted in 50 universities. Individual data were obtained through a self-administered questionnaire, and contextual variables were retrieved from a national database. Multilevel logistic regression models were used to examine urban and regional variations in high and moderate levels of SSD. Overall the prevalence of high SSD (<6 hours sleep duration) was 2.8% (95% CI: 1.7%,3.9%) and moderate SSD (<7 hours) 24.7% (95% CI: 19.5%, 29.8%). Multilevel logistic regressions confirmed that home region gross domestic product (GDP) and the university regional unemployment rate were associated with SSD, net of other individual- and city-level covariates. Students attending high-level universities also recorded the highest levels of SSD. Of the individual characteristcs, only mother's occupation and student mental health status were related to SSD. The results of this study add important insights about the role of contextual factors affecting SSD among young adults and indicate the need to take into account both past, as well as present, environmental influences to control SSD.
Health care funding levels and patient outcomes: a national study.
Byrne, Margaret M; Pietz, Kenneth; Woodard, Lechauncy; Petersen, Laura A
2007-04-01
Health care funding levels differ significantly across geographic regions, but there is little correlation between regional funding levels and outcomes of elderly Medicare beneficiaries. Our goal was to determine whether this relationship holds true in a non-Medicare population cared for in a large integrated health care system with a capitated budget allocation system. We explored the association between health care funding and risk-adjusted mortality in the 22 Veterans Affairs (VA) geographic Networks over a six-year time period. Allocations to Networks were adjusted for illness burden using Diagnostic Cost Groups. To test the association between funding and risk-adjusted three-year mortality, we ran logistic regressions with single-year patient cohorts, as well as hierarchical regressions on a six year longitudinal data set, clustering on VA Network. A 1000 dollar increase in funding per unit of patient illness burden was associated with a 2-8% reduction in three-year mortality in cross sectional regressions. However, in longitudinal hierarchical regressions clustering on Network, the significant effect of funding level was eliminated. When longitudinal data are used, the significant cross sectional effect of funding levels on mortality disappear. Thus, the factors driving differences in mortality are Network effects, although part of the Network effect may be due to past levels of funding. Our results provide a caution for cross sectional examinations of the association between regional health care funding levels and health outcomes. Copyright (c) 2006 John Wiley & Sons, Ltd.
Oral Microbiota and Risk for Esophageal Squamous Cell Carcinoma in a High-Risk Area of China.
Chen, Xingdong; Winckler, Björn; Lu, Ming; Cheng, Hongwei; Yuan, Ziyu; Yang, Yajun; Jin, Li; Ye, Weimin
2015-01-01
Poor oral health has been linked with an increased risk of esophageal squamous cell carcinoma (ESCC). We investigated whether alteration of oral microbiota is associated with ESCC risk. Fasting saliva samples were collected from 87 incident and histopathologicallly diagnosed ESCC cases, 63 subjects with dysplasia and 85 healthy controls. All subjects were also interviewed with a questionnaire. V3-V4 region of 16S rRNA was amplified and sequenced by 454-pyrosequencing platform. Carriage of each genus was compared by means of multivariate-adjusted odds ratios derived from logistic regression model. Relative abundance was compared using Metastats method. Beta diversity was estimated using Unifrac and weighted Unifrac distances. Principal coordinate analysis (PCoA) was applied to ordinate dissimilarity matrices. Multinomial logistic regression was used to compare the coordinates between different groups. ESCC subjects had an overall decreased microbial diversity compared to control and dysplasia subjects (P<0.001). Decreased carriage of genera Lautropia, Bulleidia, Catonella, Corynebacterium, Moryella, Peptococcus and Cardiobacterium were found in ESCC subjects compared to non-ESCC subjects. Multinomial logistic regression analyses on PCoA coordinates also revealed that ESCC subjects had significantly different levels for several coordinates compared to non-ESCC subjects. In conclusion, we observed a correlation between altered salivary bacterial microbiota and ESCC risk. The results of our study on the saliva microbiome are of particular interest as it reflects the shift in microbial communities. Further studies are warranted to verify this finding, and if being verified, to explore the underlying mechanisms.
Guo, L W; Liu, S Z; Zhang, M; Chen, Q; Zhang, S K; Sun, X B
2017-12-10
Objective: To investigate the effect of fried food intake on the pathogenesis of esophageal cancer and precancerous lesions. Methods: From 2005 to 2013, all the residents aged 40-69 years from 11 counties (cities) where cancer screening of upper gastrointestinal cancer had been conducted in rural areas of Henan province, were recruited as the subjects of study. Information on demography and lifestyle was collected. The residents under study were screened with iodine staining endoscopic examination and biopsy samples were diagnosed pathologically, under standardized criteria. Subjects with high risk were divided into the groups based on their different pathological degrees. Multivariate ordinal logistic regression analysis was used to analyze the relationship between the frequency of fried food intake and esophageal cancer and precancerous lesions. Results: A total number of 8 792 cases with normal esophagus, 3 680 with mild hyperplasia, 972 with moderate hyperplasia, 413 with severe hyperplasia carcinoma in situ, and 336 cases of esophageal cancer were recruited. Results from multivariate logistic regression analysis showed that, when compared with those who did not eat fried food, the intake of fried food (<2 times/week: OR =1.60, 95% CI : 1.40-1.83; ≥2 times/week: OR =2.58, 95% CI : 1.98-3.37) appeared a risk factor for both esophageal cancer or precancerous lesions after adjustment for age, sex, marital status, educational level, body mass index, smoking and alcohol intake. Conclusion: The intake of fried food appeared a risk factor for both esophageal cancer and precancerous lesions.
Saleem, Taimur; Ishaque, Sidra; Habib, Nida; Hussain, Syedda Saadia; Jawed, Areeba; Khan, Aamir Ali; Ahmad, Muhammad Imran; Iftikhar, Mian Omer; Mughal, Hamza Pervez; Jehan, Imtiaz
2009-01-01
Background To determine the knowledge, attitudes and practices regarding organ donation in a selected adult population in Pakistan. Methods Convenience sampling was used to generate a sample of 440; 408 interviews were successfully completed and used for analysis. Data collection was carried out via a face to face interview based on a pre-tested questionnaire in selected public areas of Karachi, Pakistan. Data was analyzed using SPSS v.15 and associations were tested using the Pearson's Chi square test. Multiple logistic regression was used to find independent predictors of knowledge status and motivation of organ donation. Results Knowledge about organ donation was significantly associated with education (p = 0.000) and socioeconomic status (p = 0.038). 70/198 (35.3%) people expressed a high motivation to donate. Allowance of organ donation in religion was significantly associated with the motivation to donate (p = 0.000). Multiple logistic regression analysis revealed that higher level of education and higher socioeconomic status were significant (p < 0.05) independent predictors of knowledge status of organ donation. For motivation, multiple logistic regression revealed that higher socioeconomic status, adequate knowledge score and belief that organ donation is allowed in religion were significant (p < 0.05) independent predictors. Television emerged as the major source of information. Only 3.5% had themselves donated an organ; with only one person being an actual kidney donor. Conclusion Better knowledge may ultimately translate into the act of donation. Effective measures should be taken to educate people with relevant information with the involvement of media, doctors and religious scholars. PMID:19534793
Differentiating major depressive disorder in youths with attention deficit hyperactivity disorder.
Diler, Rasim Somer; Daviss, W Burleson; Lopez, Adriana; Axelson, David; Iyengar, Satish; Birmaher, Boris
2007-09-01
Youths with attention deficit hyperactivity disorders (ADHD) frequently have comorbid major depressive disorders (MDD) sharing overlapping symptoms. Our objective was to examine which depressive symptoms best discriminate MDD among youths with ADHD. One-hundred-eleven youths with ADHD (5.2-17.8 years old) and their parents completed interviews with the K-SADS-PL and respective versions of the child or the parent Mood and Feelings Questionnaire (MFQ-C, MFQ-P). Controlling for group differences, logistic regression was used to calculate odds ratios reflecting the accuracy with which various depressive symptoms on the MFQ-C or MFQ-P discriminated MDD. Stepwise logistic regression then identified depressive symptoms that best discriminated the groups with and without MDD, using cross-validated misclassification rate as the criterion. Symptoms that discriminated youths with MDD (n=18) from those without MDD (n=93) were 4 of 6 mood/anhedonia symptoms, all 14 depressed cognition symptoms, and only 3 of 11 physical/vegetative symptoms. Mild irritability, miserable/unhappy moods, and symptoms related to sleep, appetite, energy levels and concentration did not discriminate MDD. A stepwise logistic regression correctly classified 89% of the comorbid MDD subjects, with only age, anhedonia at school, thoughts about killing self, thoughts that bad things would happen, and talking more slowly remaining in the final model. Results of this study may not generalize to community samples because subjects were drawn largely from a university-based outpatient psychiatric clinic. These findings stress the importance of social withdrawal, anhedonia, depressive cognitions, suicidal thoughts, and psychomotor retardation when trying to identify MDD among ADHD youths.
Prehospital helicopter transport and survival of patients with traumatic brain injury.
Bekelis, Kimon; Missios, Symeon; Mackenzie, Todd A
2015-03-01
To investigate the association of helicopter transport with survival of patients with traumatic brain injury (TBI), in comparison with ground emergency medical services (EMS). Helicopter utilization and its effect on the outcomes of TBI remain controversial. We performed a retrospective cohort study involving patients with TBI who were registered in the National Trauma Data Bank between 2009 and 2011. Regression techniques with propensity score matching were used to investigate the association of helicopter transport with survival of patients with TBI, in comparison with ground EMS. During the study period, there were 209,529 patients with TBI who were registered in the National Trauma Data Bank and met the inclusion criteria. Of these patients, 35,334 were transported via helicopters and 174,195 via ground EMS. For patients transported to level I trauma centers, 2797 deaths (12%) were recorded after helicopter transport and 8161 (7.8%) after ground EMS. Multivariable logistic regression analysis demonstrated an association of helicopter transport with increased survival [OR (odds ratio), 1.95; 95% confidence interval (CI), 1.81-2.10; absolute risk reduction (ARR), 6.37%]. This persisted after propensity score matching (OR, 1.88; 95% CI, 1.74-2.03; ARR, 5.93%). For patients transported to level II trauma centers, 1282 deaths (10.6%) were recorded after helicopter transport and 5097 (7.3%) after ground EMS. Multivariable logistic regression analysis demonstrated an association of helicopter transport with increased survival (OR, 1.81; 95% CI, 1.64-2.00; ARR 5.17%). This again persisted after propensity score matching (OR, 1.73; 95% CI, 1.55-1.94; ARR, 4.69). Helicopter transport of patients with TBI to level I and II trauma centers was associated with improved survival, in comparison with ground EMS.
Prehospital Helicopter Transport and Survival of Patients With Traumatic Brain Injury
Mackenzie, Todd A.
2015-01-01
Objective To investigate the association of helicopter transport with survival of patients with traumatic brain injury (TBI), in comparison with ground emergency medical services (EMS). Background Helicopter utilization and its effect on the outcomes of TBI remain controversial. Methods We performed a retrospective cohort study involving patients with TBI who were registered in the National Trauma Data Bank between 2009 and 2011. Regression techniques with propensity score matching were used to investigate the association of helicopter transport with survival of patients with TBI, in comparison with ground EMS. Results During the study period, there were 209,529 patients with TBI who were registered in the National Trauma Data Bank and met the inclusion criteria. Of these patients, 35,334 were transported via helicopters and 174,195 via ground EMS. For patients transported to level I trauma centers, 2797 deaths (12%) were recorded after helicopter transport and 8161 (7.8%) after ground EMS. Multivariable logistic regression analysis demonstrated an association of helicopter transport with increased survival [OR (odds ratio), 1.95; 95% confidence interval (CI), 1.81–2.10; absolute risk reduction (ARR), 6.37%]. This persisted after propensity score matching (OR, 1.88; 95% CI, 1.74–2.03; ARR, 5.93%). For patients transported to level II trauma centers, 1282 deaths (10.6%) were recorded after helicopter transport and 5097 (7.3%) after ground EMS. Multivariable logistic regression analysis demonstrated an association of helicopter transport with increased survival (OR, 1.81; 95% CI, 1.64–2.00; ARR 5.17%). This again persisted after propensity score matching (OR, 1.73; 95% CI, 1.55–1.94; ARR, 4.69). Conclusions Helicopter transport of patients with TBI to level I and II trauma centers was associated with improved survival, in comparison with ground EMS. PMID:24743624
Wang, A; Liu, J; Meng, X; Li, J; Wang, H; Wang, Y; Su, Z; Zhang, N; Dai, L; Wang, Y; Wang, Y
2018-01-01
The association between oxidized low-density lipoprotein (oxLDL) and cognitive impairment is unclear. This study aimed to investigate the potential association between oxLDL and cognitive impairment among patients with acute ischemic stroke. We measured the levels of oxLDL and recorded the Mini-Mental State Examination (MMSE) score in patients with acute ischemic stroke who were recruited from the Study of Oxidative Stress in Patients with Acute Ischemic Stroke. Cognitive impairment was defined as an MMSE score of <24. The association between oxLDL and cognitive impairment was assessed by multivariate logistic or linear regression analysis. Other clinical variables of interest were also studied. A total of 3726 patients [1287 (34.54%) female] were included in this study, with a mean age of 63.62 ± 11.96 years. After adjusting for potential confounders in our logistic regression model, each SD increase in oxLDL was associated with a 26% increase in the prevalence of cognitive impairment (odds radio, 1.26; 95% confidence interval, 1.13-1.39; P < 0.0001). Similarly, higher oxLDL was associated with lower MMSE scores, with a 0.56-point decrease in MMSE score for every SD increase in oxLDL in a linear regression analysis (β = -0.56; 95% confidence interval, -0.81 to -0.32; P < 0.0001). There were no significant interactions between oxLDL and age, sex or education levels for cognitive impairment (all interactions, P > 0.05). Elevated levels of oxLDL were associated with a higher prevalence of cognitive impairment in patients with ischemic stroke. © 2017 EAN.
Content Coding of Psychotherapy Transcripts Using Labeled Topic Models.
Gaut, Garren; Steyvers, Mark; Imel, Zac E; Atkins, David C; Smyth, Padhraic
2017-03-01
Psychotherapy represents a broad class of medical interventions received by millions of patients each year. Unlike most medical treatments, its primary mechanisms are linguistic; i.e., the treatment relies directly on a conversation between a patient and provider. However, the evaluation of patient-provider conversation suffers from critical shortcomings, including intensive labor requirements, coder error, nonstandardized coding systems, and inability to scale up to larger data sets. To overcome these shortcomings, psychotherapy analysis needs a reliable and scalable method for summarizing the content of treatment encounters. We used a publicly available psychotherapy corpus from Alexander Street press comprising a large collection of transcripts of patient-provider conversations to compare coding performance for two machine learning methods. We used the labeled latent Dirichlet allocation (L-LDA) model to learn associations between text and codes, to predict codes in psychotherapy sessions, and to localize specific passages of within-session text representative of a session code. We compared the L-LDA model to a baseline lasso regression model using predictive accuracy and model generalizability (measured by calculating the area under the curve (AUC) from the receiver operating characteristic curve). The L-LDA model outperforms the lasso logistic regression model at predicting session-level codes with average AUC scores of 0.79, and 0.70, respectively. For fine-grained level coding, L-LDA and logistic regression are able to identify specific talk-turns representative of symptom codes. However, model performance for talk-turn identification is not yet as reliable as human coders. We conclude that the L-LDA model has the potential to be an objective, scalable method for accurate automated coding of psychotherapy sessions that perform better than comparable discriminative methods at session-level coding and can also predict fine-grained codes.
Content Coding of Psychotherapy Transcripts Using Labeled Topic Models
Gaut, Garren; Steyvers, Mark; Imel, Zac E; Atkins, David C; Smyth, Padhraic
2016-01-01
Psychotherapy represents a broad class of medical interventions received by millions of patients each year. Unlike most medical treatments, its primary mechanisms are linguistic; i.e., the treatment relies directly on a conversation between a patient and provider. However, the evaluation of patient-provider conversation suffers from critical shortcomings, including intensive labor requirements, coder error, non-standardized coding systems, and inability to scale up to larger data sets. To overcome these shortcomings, psychotherapy analysis needs a reliable and scalable method for summarizing the content of treatment encounters. We used a publicly-available psychotherapy corpus from Alexander Street press comprising a large collection of transcripts of patient-provider conversations to compare coding performance for two machine learning methods. We used the Labeled Latent Dirichlet Allocation (L-LDA) model to learn associations between text and codes, to predict codes in psychotherapy sessions, and to localize specific passages of within-session text representative of a session code. We compared the L-LDA model to a baseline lasso regression model using predictive accuracy and model generalizability (measured by calculating the area under the curve (AUC) from the receiver operating characteristic (ROC) curve). The L-LDA model outperforms the lasso logistic regression model at predicting session-level codes with average AUC scores of .79, and .70, respectively. For fine-grained level coding, L-LDA and logistic regression are able to identify specific talk-turns representative of symptom codes. However, model performance for talk-turn identification is not yet as reliable as human coders. We conclude that the L-LDA model has the potential to be an objective, scaleable method for accurate automated coding of psychotherapy sessions that performs better than comparable discriminative methods at session-level coding and can also predict fine-grained codes. PMID:26625437
ERIC Educational Resources Information Center
Guler, Nese; Penfield, Randall D.
2009-01-01
In this study, we investigate the logistic regression (LR), Mantel-Haenszel (MH), and Breslow-Day (BD) procedures for the simultaneous detection of both uniform and nonuniform differential item functioning (DIF). A simulation study was used to assess and compare the Type I error rate and power of a combined decision rule (CDR), which assesses DIF…
ERIC Educational Resources Information Center
Le, Huy; Marcus, Justin
2012-01-01
This study used Monte Carlo simulation to examine the properties of the overall odds ratio (OOR), which was recently introduced as an index for overall effect size in multiple logistic regression. It was found that the OOR was relatively independent of study base rate and performed better than most commonly used R-square analogs in indexing model…
Predicting Student Success on the Texas Chemistry STAAR Test: A Logistic Regression Analysis
ERIC Educational Resources Information Center
Johnson, William L.; Johnson, Annabel M.; Johnson, Jared
2012-01-01
Background: The context is the new Texas STAAR end-of-course testing program. Purpose: The authors developed a logistic regression model to predict who would pass-or-fail the new Texas chemistry STAAR end-of-course exam. Setting: Robert E. Lee High School (5A) with an enrollment of 2700 students, Tyler, Texas. Date of the study was the 2011-2012…
Susan L. King
2003-01-01
The performance of two classifiers, logistic regression and neural networks, are compared for modeling noncatastrophic individual tree mortality for 21 species of trees in West Virginia. The output of the classifier is usually a continuous number between 0 and 1. A threshold is selected between 0 and 1 and all of the trees below the threshold are classified as...
Relaxing the rule of ten events per variable in logistic and Cox regression.
Vittinghoff, Eric; McCulloch, Charles E
2007-03-15
The rule of thumb that logistic and Cox models should be used with a minimum of 10 outcome events per predictor variable (EPV), based on two simulation studies, may be too conservative. The authors conducted a large simulation study of other influences on confidence interval coverage, type I error, relative bias, and other model performance measures. They found a range of circumstances in which coverage and bias were within acceptable levels despite less than 10 EPV, as well as other factors that were as influential as or more influential than EPV. They conclude that this rule can be relaxed, in particular for sensitivity analyses undertaken to demonstrate adequate control of confounding.
Logistic regression trees for initial selection of interesting loci in case-control studies
Nickolov, Radoslav Z; Milanov, Valentin B
2007-01-01
Modern genetic epidemiology faces the challenge of dealing with hundreds of thousands of genetic markers. The selection of a small initial subset of interesting markers for further investigation can greatly facilitate genetic studies. In this contribution we suggest the use of a logistic regression tree algorithm known as logistic tree with unbiased selection. Using the simulated data provided for Genetic Analysis Workshop 15, we show how this algorithm, with incorporation of multifactor dimensionality reduction method, can reduce an initial large pool of markers to a small set that includes the interesting markers with high probability. PMID:18466557
Rupert, Michael G.; Cannon, Susan H.; Gartner, Joseph E.; Michael, John A.; Helsel, Dennis R.
2008-01-01
Logistic regression was used to develop statistical models that can be used to predict the probability of debris flows in areas recently burned by wildfires by using data from 14 wildfires that burned in southern California during 2003-2006. Twenty-eight independent variables describing the basin morphology, burn severity, rainfall, and soil properties of 306 drainage basins located within those burned areas were evaluated. The models were developed as follows: (1) Basins that did and did not produce debris flows soon after the 2003 to 2006 fires were delineated from data in the National Elevation Dataset using a geographic information system; (2) Data describing the basin morphology, burn severity, rainfall, and soil properties were compiled for each basin. These data were then input to a statistics software package for analysis using logistic regression; and (3) Relations between the occurrence or absence of debris flows and the basin morphology, burn severity, rainfall, and soil properties were evaluated, and five multivariate logistic regression models were constructed. All possible combinations of independent variables were evaluated to determine which combinations produced the most effective models, and the multivariate models that best predicted the occurrence of debris flows were identified. Percentage of high burn severity and 3-hour peak rainfall intensity were significant variables in all models. Soil organic matter content and soil clay content were significant variables in all models except Model 5. Soil slope was a significant variable in all models except Model 4. The most suitable model can be selected from these five models on the basis of the availability of independent variables in the particular area of interest and field checking of probability maps. The multivariate logistic regression models can be entered into a geographic information system, and maps showing the probability of debris flows can be constructed in recently burned areas of southern California. This study demonstrates that logistic regression is a valuable tool for developing models that predict the probability of debris flows occurring in recently burned landscapes.
Hein, R; Abbas, S; Seibold, P; Salazar, R; Flesch-Janys, D; Chang-Claude, J
2012-01-01
Menopausal hormone therapy (MHT) is associated with an increased breast cancer risk in postmenopausal women, with combined estrogen-progestagen therapy posing a greater risk than estrogen monotherapy. However, few studies focused on potential effect modification of MHT-associated breast cancer risk by genetic polymorphisms in the progesterone metabolism. We assessed effect modification of MHT use by five coding single nucleotide polymorphisms (SNPs) in the progesterone metabolizing enzymes AKR1C3 (rs7741), AKR1C4 (rs3829125, rs17134592), and SRD5A1 (rs248793, rs3736316) using a two-center population-based case-control study from Germany with 2,502 postmenopausal breast cancer patients and 4,833 matched controls. An empirical-Bayes procedure that tests for interaction using a weighted combination of the prospective and the retrospective case-control estimators as well as standard prospective logistic regression were applied to assess multiplicative statistical interaction between polymorphisms and duration of MHT use with regard to breast cancer risk assuming a log-additive mode of inheritance. No genetic marginal effects were observed. Breast cancer risk associated with duration of combined therapy was significantly modified by SRD5A1_rs3736316, showing a reduced risk elevation in carriers of the minor allele (p (interaction,empirical-Bayes) = 0.006 using the empirical-Bayes method, p (interaction,logistic regression) = 0.013 using logistic regression). The risk associated with duration of use of monotherapy was increased by AKR1C3_rs7741 in minor allele carriers (p (interaction,empirical-Bayes) = 0.083, p (interaction,logistic regression) = 0.029) and decreased in minor allele carriers of two SNPs in AKR1C4 (rs3829125: p (interaction,empirical-Bayes) = 0.07, p (interaction,logistic regression) = 0.021; rs17134592: p (interaction,empirical-Bayes) = 0.101, p (interaction,logistic regression) = 0.038). After Bonferroni correction for multiple testing only SRD5A1_rs3736316 assessed using the empirical-Bayes method remained significant. Postmenopausal breast cancer risk associated with combined therapy may be modified by genetic variation in SRD5A1. Further well-powered studies are, however, required to replicate our finding.
Applications of statistics to medical science, III. Correlation and regression.
Watanabe, Hiroshi
2012-01-01
In this third part of a series surveying medical statistics, the concepts of correlation and regression are reviewed. In particular, methods of linear regression and logistic regression are discussed. Arguments related to survival analysis will be made in a subsequent paper.
Schell, Greggory J; Lavieri, Mariel S; Stein, Joshua D; Musch, David C
2013-12-21
Open-angle glaucoma (OAG) is a prevalent, degenerate ocular disease which can lead to blindness without proper clinical management. The tests used to assess disease progression are susceptible to process and measurement noise. The aim of this study was to develop a methodology which accounts for the inherent noise in the data and improve significant disease progression identification. Longitudinal observations from the Collaborative Initial Glaucoma Treatment Study (CIGTS) were used to parameterize and validate a Kalman filter model and logistic regression function. The Kalman filter estimates the true value of biomarkers associated with OAG and forecasts future values of these variables. We develop two logistic regression models via generalized estimating equations (GEE) for calculating the probability of experiencing significant OAG progression: one model based on the raw measurements from CIGTS and another model based on the Kalman filter estimates of the CIGTS data. Receiver operating characteristic (ROC) curves and associated area under the ROC curve (AUC) estimates are calculated using cross-fold validation. The logistic regression model developed using Kalman filter estimates as data input achieves higher sensitivity and specificity than the model developed using raw measurements. The mean AUC for the Kalman filter-based model is 0.961 while the mean AUC for the raw measurements model is 0.889. Hence, using the probability function generated via Kalman filter estimates and GEE for logistic regression, we are able to more accurately classify patients and instances as experiencing significant OAG progression. A Kalman filter approach for estimating the true value of OAG biomarkers resulted in data input which improved the accuracy of a logistic regression classification model compared to a model using raw measurements as input. This methodology accounts for process and measurement noise to enable improved discrimination between progression and nonprogression in chronic diseases.
Computing group cardinality constraint solutions for logistic regression problems.
Zhang, Yong; Kwon, Dongjin; Pohl, Kilian M
2017-01-01
We derive an algorithm to directly solve logistic regression based on cardinality constraint, group sparsity and use it to classify intra-subject MRI sequences (e.g. cine MRIs) of healthy from diseased subjects. Group cardinality constraint models are often applied to medical images in order to avoid overfitting of the classifier to the training data. Solutions within these models are generally determined by relaxing the cardinality constraint to a weighted feature selection scheme. However, these solutions relate to the original sparse problem only under specific assumptions, which generally do not hold for medical image applications. In addition, inferring clinical meaning from features weighted by a classifier is an ongoing topic of discussion. Avoiding weighing features, we propose to directly solve the group cardinality constraint logistic regression problem by generalizing the Penalty Decomposition method. To do so, we assume that an intra-subject series of images represents repeated samples of the same disease patterns. We model this assumption by combining series of measurements created by a feature across time into a single group. Our algorithm then derives a solution within that model by decoupling the minimization of the logistic regression function from enforcing the group sparsity constraint. The minimum to the smooth and convex logistic regression problem is determined via gradient descent while we derive a closed form solution for finding a sparse approximation of that minimum. We apply our method to cine MRI of 38 healthy controls and 44 adult patients that received reconstructive surgery of Tetralogy of Fallot (TOF) during infancy. Our method correctly identifies regions impacted by TOF and generally obtains statistically significant higher classification accuracy than alternative solutions to this model, i.e., ones relaxing group cardinality constraints. Copyright © 2016 Elsevier B.V. All rights reserved.
Ren, Yilong; Wang, Yunpeng; Wu, Xinkai; Yu, Guizhen; Ding, Chuan
2016-10-01
Red light running (RLR) has become a major safety concern at signalized intersection. To prevent RLR related crashes, it is critical to identify the factors that significantly impact the drivers' behaviors of RLR, and to predict potential RLR in real time. In this research, 9-month's RLR events extracted from high-resolution traffic data collected by loop detectors from three signalized intersections were applied to identify the factors that significantly affect RLR behaviors. The data analysis indicated that occupancy time, time gap, used yellow time, time left to yellow start, whether the preceding vehicle runs through the intersection during yellow, and whether there is a vehicle passing through the intersection on the adjacent lane were significantly factors for RLR behaviors. Furthermore, due to the rare events nature of RLR, a modified rare events logistic regression model was developed for RLR prediction. The rare events logistic regression method has been applied in many fields for rare events studies and shows impressive performance, but so far none of previous research has applied this method to study RLR. The results showed that the rare events logistic regression model performed significantly better than the standard logistic regression model. More importantly, the proposed RLR prediction method is purely based on loop detector data collected from a single advance loop detector located 400 feet away from stop-bar. This brings great potential for future field applications of the proposed method since loops have been widely implemented in many intersections and can collect data in real time. This research is expected to contribute to the improvement of intersection safety significantly. Copyright © 2016 Elsevier Ltd. All rights reserved.
Engoren, Milo; Habib, Robert H; Dooner, John J; Schwann, Thomas A
2013-08-01
As many as 14 % of patients undergoing coronary artery bypass surgery are readmitted within 30 days. Readmission is usually the result of morbidity and may lead to death. The purpose of this study is to develop and compare statistical and genetic programming models to predict readmission. Patients were divided into separate Construction and Validation populations. Using 88 variables, logistic regression, genetic programs, and artificial neural nets were used to develop predictive models. Models were first constructed and tested on the Construction populations, then validated on the Validation population. Areas under the receiver operator characteristic curves (AU ROC) were used to compare the models. Two hundred and two patients (7.6 %) in the 2,644 patient Construction group and 216 (8.0 %) of the 2,711 patient Validation group were re-admitted within 30 days of CABG surgery. Logistic regression predicted readmission with AU ROC = .675 ± .021 in the Construction group. Genetic programs significantly improved the accuracy, AU ROC = .767 ± .001, p < .001). Artificial neural nets were less accurate with AU ROC = 0.597 ± .001 in the Construction group. Predictive accuracy of all three techniques fell in the Validation group. However, the accuracy of genetic programming (AU ROC = .654 ± .001) was still trivially but statistically non-significantly better than that of the logistic regression (AU ROC = .644 ± .020, p = .61). Genetic programming and logistic regression provide alternative methods to predict readmission that are similarly accurate.
Eken, Cenker; Bilge, Ugur; Kartal, Mutlu; Eray, Oktay
2009-06-03
Logistic regression is the most common statistical model for processing multivariate data in the medical literature. Artificial intelligence models like an artificial neural network (ANN) and genetic algorithm (GA) may also be useful to interpret medical data. The purpose of this study was to perform artificial intelligence models on a medical data sheet and compare to logistic regression. ANN, GA, and logistic regression analysis were carried out on a data sheet of a previously published article regarding patients presenting to an emergency department with flank pain suspicious for renal colic. The study population was composed of 227 patients: 176 patients had a diagnosis of urinary stone, while 51 ultimately had no calculus. The GA found two decision rules in predicting urinary stones. Rule 1 consisted of being male, pain not spreading to back, and no fever. In rule 2, pelvicaliceal dilatation on bedside ultrasonography replaced no fever. ANN, GA rule 1, GA rule 2, and logistic regression had a sensitivity of 94.9, 67.6, 56.8, and 95.5%, a specificity of 78.4, 76.47, 86.3, and 47.1%, a positive likelihood ratio of 4.4, 2.9, 4.1, and 1.8, and a negative likelihood ratio of 0.06, 0.42, 0.5, and 0.09, respectively. The area under the curve was found to be 0.867, 0.720, 0.715, and 0.713 for all applications, respectively. Data mining techniques such as ANN and GA can be used for predicting renal colic in emergency settings and to constitute clinical decision rules. They may be an alternative to conventional multivariate analysis applications used in biostatistics.
NASA Astrophysics Data System (ADS)
Duman, T. Y.; Can, T.; Gokceoglu, C.; Nefeslioglu, H. A.; Sonmez, H.
2006-11-01
As a result of industrialization, throughout the world, cities have been growing rapidly for the last century. One typical example of these growing cities is Istanbul, the population of which is over 10 million. Due to rapid urbanization, new areas suitable for settlement and engineering structures are necessary. The Cekmece area located west of the Istanbul metropolitan area is studied, because the landslide activity is extensive in this area. The purpose of this study is to develop a model that can be used to characterize landslide susceptibility in map form using logistic regression analysis of an extensive landslide database. A database of landslide activity was constructed using both aerial-photography and field studies. About 19.2% of the selected study area is covered by deep-seated landslides. The landslides that occur in the area are primarily located in sandstones with interbedded permeable and impermeable layers such as claystone, siltstone and mudstone. About 31.95% of the total landslide area is located at this unit. To apply logistic regression analyses, a data matrix including 37 variables was constructed. The variables used in the forwards stepwise analyses are different measures of slope, aspect, elevation, stream power index (SPI), plan curvature, profile curvature, geology, geomorphology and relative permeability of lithological units. A total of 25 variables were identified as exerting strong influence on landslide occurrence, and included by the logistic regression equation. Wald statistics values indicate that lithology, SPI and slope are more important than the other parameters in the equation. Beta coefficients of the 25 variables included the logistic regression equation provide a model for landslide susceptibility in the Cekmece area. This model is used to generate a landslide susceptibility map that correctly classified 83.8% of the landslide-prone areas.
DSM-5 Alcohol Use Disorder Severity in Puerto Rico: Prevalence, Criteria Profile, and Correlates.
Caetano, Raul; Gruenewald, Paul; Vaeth, Patrice A C; Canino, Glorisa
2018-02-01
Our aim was to examine lifetime criteria profiles and correlates of severity (mild, moderate, severe) of DSM-5 alcohol use disorders (AUD) in Puerto Rico. Data are from a household random sample of individuals 18 to 64 years of age in San Juan, Puerto Rico. The survey response rate was 83%. DSM-5 AUD was identified with the Spanish version of the World Health Organization's Composite International Diagnostic Interview. The analyses also identify correlates of each severity level using an ordered logistic regression model. The prevalence of lifetime DSM-5 AUD among men and women was 38 and 16%, respectively. Mild lifetime DSM-5 AUD was the most prevalent severity level among both men (18%) and women (9%). The most common criteria, independent of gender and severity level, were drinking larger quantities and for longer than planned (men range: 80 to 97%; women range: 78 to 91%) and hazardous use (men range: 56 to 91%; women range: 42 to 74%). Results from ordered logistic regression showed that the adjusted odds ratio for weekly drinking frequency, greater volume of alcohol consumed per drinking occasion, positive attitudes about drinking, drinking norms, and male gender invariantly increased risks across all DSM-5 AUD severity levels (mild, moderate, severe). Greater negative attitudes about drinking, low family cohesion, and Protestant religion were related to greater risks at higher AUD severity levels. AUD prevalence is high in San Juan, Puerto Rico. Prevalence rates for some criteria are equally high across severity levels and poorly differentiate between mild, moderate, or severe DSM-5 AUD. The sociodemographic and alcohol-related risks vary across DSM-5 severity levels. Copyright © 2018 by the Research Society on Alcoholism.
Robertson, Sam; Woods, Carl; Gastin, Paul
2015-09-01
To develop a physiological performance and anthropometric attribute model to predict Australian Football League draft selection. Cross-sectional observational. Data was obtained (n=4902) from three Under-18 Australian football competitions between 2010 and 2013. Players were allocated into one of the three groups, based on their highest level of selection in their final year of junior football (Australian Football League Drafted, n=292; National Championship, n=293; State-level club, n=4317). Physiological performance (vertical jumps, agility, speed and running endurance) and anthropometric (body mass and height) data were obtained. Hedge's effect sizes were calculated to assess the influence of selection-level and competition on these physical attributes, with logistic regression models constructed to discriminate Australian Football League Drafted and National Championship players. Rule induction analysis was undertaken to determine a set of rules for discriminating selection-level. Effect size comparisons revealed a range of small to moderate differences between State-level club players and both other groups for all attributes, with trivial to small differences between Australian Football League Drafted and National Championship players noted. Logistic regression models showed multistage fitness test, height and 20 m sprint time as the most important attributes in predicting Draft success. Rule induction analysis showed that players displaying multistage fitness test scores of >14.01 and/or 20 m sprint times of <2.99 s were most likely to be recruited. High levels of performance in aerobic and/or speed tests increase the likelihood of elite junior Australian football players being recruited to the highest level of the sport. Copyright © 2014 Sports Medicine Australia. Published by Elsevier Ltd. All rights reserved.
Inequality in the hepatitis B awareness level in rural residents from 7 provinces in China
Zheng, Juan; Li, Quan; Wang, Jian; Zhang, Guojie; Wangen, Knut R.
2017-01-01
ABSTRACT The hepatitis B (HB) awareness level is an important factor affecting the rates of HB virus vaccination. To better understand income-related inequalities in the HB awareness level, it is imperative to identify the sources of inequalities and assess the contribution rates of these influential factors. This study analyzed the unequal distribution of the HB awareness level and the contributions of various influential factors. We performed a cross-sectional household survey with questionnaire-based, face-to-face interviews in 7 Chinese provinces. Responses from 7271 respondents were used in this analysis. Multinomial logistic regression was used for the analysis of contributing factors, and the concentration index was used as a measure of HB awareness inequalities. The HB awareness level varied across participants with different characteristics. Multinomial logistic regression of the explanatory factors of the HB awareness level showed that several estimated coefficients and relative risk ratios were statistically significant for middle- and high-level awareness, except for sex, occupation, and household income. The concentration index of the HB knowledge score was 0.140, indicating inequality gradients disadvantageous to the poor. The contribution rate of socioeconomic factors was the largest (60.8%), followed by demographic characteristics (29.0%) and geographic factors (4.3%). Demographic, socioeconomic, and geographic factors are associated with the HB awareness inequality. Therefore, to reduce inequality, HB-related health education targeting individuals with low socioeconomic status should be performed. Less-developed provinces, especially with high proportions of poor residents, warrant particular attention. Our findings may be beneficial to improve the HB virus vaccination rate for individuals with low socioeconomic status. PMID:28277091
Linder, Gustav; Sandin, Fredrik; Johansson, Jan; Lindblad, Mats; Lundell, Lars; Hedberg, Jakob
2018-02-01
Low socioeconomic status and poor education elevate the risk of developing esophageal- and junctional cancer. High education level also increases survival after curative surgery. The present study aimed to investigate associations, if any, between patient education-level and treatment allocation after diagnosis of esophageal- and junctional cancer and its subsequent impact on survival. A nation-wide cohort study was undertaken. Data from a Swedish national quality register for esophageal cancer (NREV) was linked to the National Cancer Register, National Patient Register, Prescribed Drug Register, Cause of Death Register and educational data from Statistics Sweden. The effect of education level (low; ≤9 years, intermediate; 10-12 years and high >12 years) on the probability of allocation to curative treatment was analyzed with logistic regression. The Kaplan-Meier-method and Cox proportional hazard models were used to assess the effect of education on survival. A total of 4112 patients were included. In a multivariate logistic regression model, high education level was associated with greater probability of allocation to curative treatment (adjusted OR: 1.48, 95% CI: 1.08-2.03, p = 0,014) as was adherence to a multidisciplinary treatment-conference (adjusted OR: 3.13, 95% CI: 2.40-4.08, p < 0,001). High education level was associated with improved survival in the patients allocated to curative treatment (HR: 0.82, 95% CI: 0.69-0.99, p = 0,036). In this nation-wide cohort of esophageal- and junctional cancer patients, including data regarding many confounders, high education level was associated with greater probability of being offered curative treatment and improved survival. Copyright © 2017 Elsevier Ltd. All rights reserved.
Allameh, Farzad; Pourmand, Gholamreza; Bozorgi, Ali; Nekuie, Sepideh; Namdari, Farshad
2016-01-01
The aim of the study was to evaluate the relationship between the serum levels of androgens and Coronary Artery Disease (CAD) in an Iranian population. Male individuals admitted to Tehran Heart Center and Sina Hospital, Tehran, Iran from 2011-2012 were categorized into CAD and control groups based on selective coronary angiography. Baseline demographic data, including age, BMI, diabetes, and a history of hypertension were recorded. Patients were also assessed for their serum levels of total testosterone, free testosterone, estradiol, dehydroepi and rosterone sulfate (DHEA-S), and Sex Hormone Binding Globulin (SHBG). Data analysis was carried out chi-square and ANOVA tests as well as logistic regression analysis. Two hundred patients were in the CAD group and 135 individuals in control group. In the CAD group, 69 had single-vessel disease, 49 had two-vessel diseases, and 82 had three-vessel diseases. Statistically significant differences were observed between the individuals in the two groups with respect to age (P<0.0001), diabetes (P<0.0001), and a history of hypertension (P=0.018). The serum levels of free testosterone (P=0.048) and DHEA-S (P<0.0001) were significantly higher in the control group than in the CAD group; however, the serum level of SHBG was higher in the CAD group than in the control group (P=0.007). Results of the logistic regression analysis indicated that only age (P=0.042) and diabetes (P=0.003) had significant relationships with CAD. Although the serum levels of some of the androgens were significantly different between the two groups, no association was found between androgenic hormone levels and the risk of CAD, due mainly to the effect of age and diabetes.
Carboxyhemoglobin levels in medical intensive care patients: a retrospective, observational study
2012-01-01
Introduction Critical illness leads to increased endogenous production of carbon monoxide (CO) due to the induction of the stress-response enzyme, heme oxygenase-1 (HO-1). There is evidence for the cytoprotective and anti-inflammatory effects of CO based on animal studies. In critically ill patients after cardiothoracic surgery, low minimum and high maximum carboxyhemoglobin (COHb) levels were shown to be associated with increased mortality, which suggests that there is an 'optimal range' for HO-1 activity. Our study aimed to test whether this relationship between COHb and outcome exists in non-surgical ICU patients. Methods We conducted a retrospective, observational study in a medical ICU at a university hospital in Vienna, Austria involving 868 critically ill patients. No interventions were undertaken. Arterial COHb was measured on admission and during the course of treatment in the ICU. The association between arterial COHb levels and ICU mortality was evaluated using bivariate tests and a logistic regression model. Results Minimum COHb levels were slightly lower in non-survivors compared to survivors (0.9%, 0.7% to 1.2% versus 1.2%, 0.9% to 1.5%; P = 0.0001), and the average COHb levels were marginally lower in non-survivors compared to survivors (1.5%, 1.2% to 1.8% versus 1.6%, 1.4% to 1.9%, P = 0.003). The multivariate logistic regression analysis revealed that the association between a low minimum COHb level and increased mortality was independent of the severity of illness and the type of organ failure. Conclusions Critically ill patients surviving the admission to a medical ICU had slightly higher minimum and marginally higher average COHb levels when compared to non-survivors. Even though the observed differences are statistically significant, the minute margins would not qualify COHb as a predictive marker for ICU mortality. PMID:22236404
Carboxyhemoglobin levels in medical intensive care patients: a retrospective, observational study.
Fazekas, Andreas S; Wewalka, Marlene; Zauner, Christian; Funk, Georg-Christian
2012-01-11
Critical illness leads to increased endogenous production of carbon monoxide (CO) due to the induction of the stress-response enzyme, heme oxygenase-1 (HO-1). There is evidence for the cytoprotective and anti-inflammatory effects of CO based on animal studies. In critically ill patients after cardiothoracic surgery, low minimum and high maximum carboxyhemoglobin (COHb) levels were shown to be associated with increased mortality, which suggests that there is an 'optimal range' for HO-1 activity. Our study aimed to test whether this relationship between COHb and outcome exists in non-surgical ICU patients. We conducted a retrospective, observational study in a medical ICU at a university hospital in Vienna, Austria involving 868 critically ill patients. No interventions were undertaken. Arterial COHb was measured on admission and during the course of treatment in the ICU. The association between arterial COHb levels and ICU mortality was evaluated using bivariate tests and a logistic regression model. Minimum COHb levels were slightly lower in non-survivors compared to survivors (0.9%, 0.7% to 1.2% versus 1.2%, 0.9% to 1.5%; P=0.0001), and the average COHb levels were marginally lower in non-survivors compared to survivors (1.5%, 1.2% to 1.8% versus 1.6%, 1.4% to 1.9%, P=0.003). The multivariate logistic regression analysis revealed that the association between a low minimum COHb level and increased mortality was independent of the severity of illness and the type of organ failure. Critically ill patients surviving the admission to a medical ICU had slightly higher minimum and marginally higher average COHb levels when compared to non-survivors. Even though the observed differences are statistically significant, the minute margins would not qualify COHb as a predictive marker for ICU mortality.
Feizi, Awat; Aliyari, Roqayeh; Roohafza, Hamidreza
2012-01-01
Objective. The present paper aimed at investigating the association between perceived stress and major life events stressors in Iranian general population. Methods. In a cross-sectional large-scale community-based study, 4583 people aged 19 and older, living in Isfahan, Iran, were investigated. Logistic quantile regression was used for modeling perceived stress, measured by GHQ questionnaire, as the bounded outcome (dependent), variable, and as a function of most important stressful life events, as the predictor variables, controlling for major lifestyle and sociodemographic factors. This model provides empirical evidence of the predictors' effects heterogeneity depending on individual location on the distribution of perceived stress. Results. The results showed that among four stressful life events, family conflicts and social problems were more correlated with level of perceived stress. Higher levels of education were negatively associated with perceived stress and its coefficients monotonically decrease beyond the 30th percentile. Also, higher levels of physical activity were associated with perception of low levels of stress. The pattern of gender's coefficient over the majority of quantiles implied that females are more affected by stressors. Also high perceived stress was associated with low or middle levels of income. Conclusions. The results of current research suggested that in a developing society with high prevalence of stress, interventions targeted toward promoting financial and social equalities, social skills training, and healthy lifestyle may have the potential benefits for large parts of the population, most notably female and lower educated people. PMID:23091560
New robust statistical procedures for the polytomous logistic regression models.
Castilla, Elena; Ghosh, Abhik; Martin, Nirian; Pardo, Leandro
2018-05-17
This article derives a new family of estimators, namely the minimum density power divergence estimators, as a robust generalization of the maximum likelihood estimator for the polytomous logistic regression model. Based on these estimators, a family of Wald-type test statistics for linear hypotheses is introduced. Robustness properties of both the proposed estimators and the test statistics are theoretically studied through the classical influence function analysis. Appropriate real life examples are presented to justify the requirement of suitable robust statistical procedures in place of the likelihood based inference for the polytomous logistic regression model. The validity of the theoretical results established in the article are further confirmed empirically through suitable simulation studies. Finally, an approach for the data-driven selection of the robustness tuning parameter is proposed with empirical justifications. © 2018, The International Biometric Society.
Staley, Dennis M.; Negri, Jacquelyn A.; Kean, Jason W.; Laber, Jayme L.; Tillery, Anne C.; Youberg, Ann M.
2016-06-30
Wildfire can significantly alter the hydrologic response of a watershed to the extent that even modest rainstorms can generate dangerous flash floods and debris flows. To reduce public exposure to hazard, the U.S. Geological Survey produces post-fire debris-flow hazard assessments for select fires in the western United States. We use publicly available geospatial data describing basin morphology, burn severity, soil properties, and rainfall characteristics to estimate the statistical likelihood that debris flows will occur in response to a storm of a given rainfall intensity. Using an empirical database and refined geospatial analysis methods, we defined new equations for the prediction of debris-flow likelihood using logistic regression methods. We showed that the new logistic regression model outperformed previous models used to predict debris-flow likelihood.
NASA Astrophysics Data System (ADS)
Kneringer, Philipp; Dietz, Sebastian; Mayr, Georg J.; Zeileis, Achim
2017-04-01
Low-visibility conditions have a large impact on aviation safety and economic efficiency of airports and airlines. To support decision makers, we develop a statistical probabilistic nowcasting tool for the occurrence of capacity-reducing operations related to low visibility. The probabilities of four different low visibility classes are predicted with an ordered logistic regression model based on time series of meteorological point measurements. Potential predictor variables for the statistical models are visibility, humidity, temperature and wind measurements at several measurement sites. A stepwise variable selection method indicates that visibility and humidity measurements are the most important model inputs. The forecasts are tested with a 30 minute forecast interval up to two hours, which is a sufficient time span for tactical planning at Vienna Airport. The ordered logistic regression models outperform persistence and are competitive with human forecasters.
Dillon, Michael P; Major, Matthew J; Kaluf, Brian; Balasanov, Yuri; Fatone, Stefania
2018-04-01
While Amputee Mobility Predictor scores differ between Medicare Functional Classification Levels (K-level), this does not demonstrate that the Amputee Mobility Predictor can accurately predict K-level. To determine how accurately K-level could be predicted using the Amputee Mobility Predictor in combination with patient characteristics for persons with transtibial and transfemoral amputation. Prediction. A cumulative odds ordinal logistic regression was built to determine the effect that the Amputee Mobility Predictor, in combination with patient characteristics, had on the odds of being assigned to a particular K-level in 198 people with transtibial or transfemoral amputation. For people assigned to the K2 or K3 level by their clinician, the Amputee Mobility Predictor predicted the clinician-assigned K-level more than 80% of the time. For people assigned to the K1 or K4 level by their clinician, the prediction of clinician-assigned K-level was less accurate. The odds of being in a higher K-level improved with younger age and transfemoral amputation. Ordinal logistic regression can be used to predict the odds of being assigned to a particular K-level using the Amputee Mobility Predictor and patient characteristics. This pilot study highlighted critical method design issues, such as potential predictor variables and sample size requirements for future prospective research. Clinical relevance This pilot study demonstrated that the odds of being assigned a particular K-level could be predicted using the Amputee Mobility Predictor score and patient characteristics. While the model seemed sufficiently accurate to predict clinician assignment to the K2 or K3 level, further work is needed in larger and more representative samples, particularly for people with low (K1) and high (K4) levels of mobility, to be confident in the model's predictive value prior to use in clinical practice.
A computational approach to compare regression modelling strategies in prediction research.
Pajouheshnia, Romin; Pestman, Wiebe R; Teerenstra, Steven; Groenwold, Rolf H H
2016-08-25
It is often unclear which approach to fit, assess and adjust a model will yield the most accurate prediction model. We present an extension of an approach for comparing modelling strategies in linear regression to the setting of logistic regression and demonstrate its application in clinical prediction research. A framework for comparing logistic regression modelling strategies by their likelihoods was formulated using a wrapper approach. Five different strategies for modelling, including simple shrinkage methods, were compared in four empirical data sets to illustrate the concept of a priori strategy comparison. Simulations were performed in both randomly generated data and empirical data to investigate the influence of data characteristics on strategy performance. We applied the comparison framework in a case study setting. Optimal strategies were selected based on the results of a priori comparisons in a clinical data set and the performance of models built according to each strategy was assessed using the Brier score and calibration plots. The performance of modelling strategies was highly dependent on the characteristics of the development data in both linear and logistic regression settings. A priori comparisons in four empirical data sets found that no strategy consistently outperformed the others. The percentage of times that a model adjustment strategy outperformed a logistic model ranged from 3.9 to 94.9 %, depending on the strategy and data set. However, in our case study setting the a priori selection of optimal methods did not result in detectable improvement in model performance when assessed in an external data set. The performance of prediction modelling strategies is a data-dependent process and can be highly variable between data sets within the same clinical domain. A priori strategy comparison can be used to determine an optimal logistic regression modelling strategy for a given data set before selecting a final modelling approach.
Cakir, Ebru; Kucuk, Ulku; Pala, Emel Ebru; Sezer, Ozlem; Ekin, Rahmi Gokhan; Cakmak, Ozgur
2017-05-01
Conventional cytomorphologic assessment is the first step to establish an accurate diagnosis in urinary cytology. In cytologic preparations, the separation of low-grade urothelial carcinoma (LGUC) from reactive urothelial proliferation (RUP) can be exceedingly difficult. The bladder washing cytologies of 32 LGUC and 29 RUP were reviewed. The cytologic slides were examined for the presence or absence of the 28 cytologic features. The cytologic criteria showing statistical significance in LGUC were increased numbers of monotonous single (non-umbrella) cells, three-dimensional cellular papillary clusters without fibrovascular cores, irregular bordered clusters, atypical single cells, irregular nuclear overlap, cytoplasmic homogeneity, increased N/C ratio, pleomorphism, nuclear border irregularity, nuclear eccentricity, elongated nuclei, and hyperchromasia (p ˂ 0.05), and the cytologic criteria showing statistical significance in RUP were inflammatory background, mixture of small and large urothelial cells, loose monolayer aggregates, and vacuolated cytoplasm (p ˂ 0.05). When these variables were subjected to a stepwise logistic regression analysis, four features were selected to distinguish LGUC from RUP: increased numbers of monotonous single (non-umbrella) cells, increased nuclear cytoplasmic ratio, hyperchromasia, and presence of small and large urothelial cells (p = 0.0001). By this logistic model of the 32 cases with proven LGUC, the stepwise logistic regression analysis correctly predicted 31 (96.9%) patients with this diagnosis, and of the 29 patients with RUP, the logistic model correctly predicted 26 (89.7%) patients as having this disease. There are several cytologic features to separate LGUC from RUP. Stepwise logistic regression analysis is a valuable tool for determining the most useful cytologic criteria to distinguish these entities. © 2017 APMIS. Published by John Wiley & Sons Ltd.
ERIC Educational Resources Information Center
McAuliffe, Tomomi; Cordier, Reinie; Vaz, Sharmila; Thomas, Yvonne; Falkmer, Torbjorn
2017-01-01
This study aimed to examine the influence of differences in household status on the parental stress, coping, time use and quality of life (QoL) among mothers of children with autism spectrum disorders. Forty-three single and 164 coupled mothers completed the survey. Data were analysed using multivariate logistic regression. We found that single…
Kumar, Abhishek; Kumari, Divya; Singh, Aditya
2015-10-01
This article examines the trends and pattern in socioeconomic inequality in stunting, underweight and wasting among children aged <3 years in urban India over a 14-year period. We use three successive rounds of the National Family Health Survey data conducted during 1992-93, 1998-99 and 2005-06. The selected socioeconomic predictors are household wealth and mother's education level. We use principal component analysis to compute a separate wealth index for urban India for all three rounds of the survey. We have used descriptive statistics, concentration index and pooled logistic regression to analyse the data. The results show that between 1992-93 and 2005-06, the prevalence of childhood undernutrition has declined across household wealth quintiles and educational level of mothers. However, the pace of decline is much higher among the better-off socioeconomic groups than among the least-affluent groups. The result of pooled logistic regression analysis shows that the socioeconomic inequality in childhood undernutrition in urban India has increased over the study period. The salient findings of this study call for separate programmes targeting the children of lower socioeconomic groups in urban population of India. Published by Oxford University Press in association with The London School of Hygiene and Tropical Medicine © The Author 2014; all rights reserved.
Leitner, Lukas; Musser, Ewald; Kastner, Norbert; Friesenbichler, Jörg; Hirzberger, Daniela; Radl, Roman; Leithner, Andreas; Sadoghi, Patrick
2016-01-01
Red blood cell concentrates (RCC) substitution after total knee arthroplasty (TKA) is correlated with multifold of complications and an independent predictor for higher postoperative mortality. TKA is mainly performed in elderly patients with pre-existing polymorbidity, often requiring permanent preoperative antithrombotic therapy (PAT). The aim of this retrospective analysis was to investigate the impact of demand for PAT on inpatient blood management in patients undergoing TKA. In this study 200 patients were retrospectively evaluated after TKA for differences between PAT and non-PAT regarding demographic parameters, preoperative ASA score > 2, duration of operation, pre-, and intraoperative hemoglobin level, and postoperative parameters including amount of wound drainage, RCC requirement, and inpatient time. In a multivariate logistic regression analysis the independent influences of PAT, demographic parameters, ASA score > 2, and duration of the operation on RCC demand following TKA were analyzed. Patients with PAT were significantly older, more often had an ASA > 2 at surgery, needed a higher number of RCCs units and more frequently and had lower perioperative hemoglobin levels. Multivariate logistic regression revealed PAT was an independent predictor for RCC requirement. PAT patients are more likely to require RCC following TKA and should be accurately monitored with respect to postoperative blood loss. PMID:27488941
Leitner, Lukas; Musser, Ewald; Kastner, Norbert; Friesenbichler, Jörg; Hirzberger, Daniela; Radl, Roman; Leithner, Andreas; Sadoghi, Patrick
2016-08-04
Red blood cell concentrates (RCC) substitution after total knee arthroplasty (TKA) is correlated with multifold of complications and an independent predictor for higher postoperative mortality. TKA is mainly performed in elderly patients with pre-existing polymorbidity, often requiring permanent preoperative antithrombotic therapy (PAT). The aim of this retrospective analysis was to investigate the impact of demand for PAT on inpatient blood management in patients undergoing TKA. In this study 200 patients were retrospectively evaluated after TKA for differences between PAT and non-PAT regarding demographic parameters, preoperative ASA score > 2, duration of operation, pre-, and intraoperative hemoglobin level, and postoperative parameters including amount of wound drainage, RCC requirement, and inpatient time. In a multivariate logistic regression analysis the independent influences of PAT, demographic parameters, ASA score > 2, and duration of the operation on RCC demand following TKA were analyzed. Patients with PAT were significantly older, more often had an ASA > 2 at surgery, needed a higher number of RCCs units and more frequently and had lower perioperative hemoglobin levels. Multivariate logistic regression revealed PAT was an independent predictor for RCC requirement. PAT patients are more likely to require RCC following TKA and should be accurately monitored with respect to postoperative blood loss.
Physician job satisfaction in Saudi Arabia: insights from a tertiary hospital survey.
Aldrees, Turki; Al-Eissa, Sami; Badri, Motasim; Aljuhayman, Ahmed; Zamakhshary, Mohammed
2015-01-01
Job satisfaction refers to the extent to which people like or dislike their job. Job satisfaction varies across professions. Few studies have explored this issue among physicians in Saudi Arabia. The objective of this study is to determine the level and factors associated with job satisfaction among Saudi and non-Saudi physicians. In this cross-sectional study conducted in a major tertiary hospital in Riyadh, a 5-point Likert scale structured questionnaire was used to collect data on a wide range of socio-demographic, practice environment characteristics and level and consequences of job satisfaction from practicing physicians (consultants or residents) across different medical specialties. Logistic regression models were fitted to determine factors associated with job satisfaction. Of 344 participants, 300 (87.2%) were Saudis, 252 (73%) males, 255 (74%) married, 188 (54.7%) consultants and age [median (IQR)] was 32 (27-42.7) years. Overall, 104 (30%) respondents were dissatisfied with their jobs. Intensive care physicians were the most dissatisfied physicians (50%). In a multiple logistic regression model, income satisfaction (odds ratio [OR]=0.448 95% CI 0.278-0.723, P < .001) was the only factor independently associated with dissatisfaction. Factors adversely associated with physicians job satisfaction identified in this study should be addressed in governmental strategic planning aimed at improving the healthcare system and patient care.
Chowdhury, Md Rocky Khan; Rahman, Md Shafiur; Mondal, Md Nazrul Islam; Sayem, Abu; Billah, Baki
2015-01-01
Stigma, considered a social disease, is more apparent in developing societies which are driven by various social affairs, and influences adherence to treatment. The aim of the present study was to examine levels of social stigma related to tuberculosis (TB) in sociodemographic context and identify the effects of sociodemographic factors on stigma. The study sample consisted of 372 TB patients. Data were collected using stratified sampling with simple random sampling techniques. T tests, chi-square tests, and binary logistic regression analysis were performed to examine correlations between stigma and sociodemographic variables. Approximately 85.9% of patients had experienced stigma. The most frequent indicator of the stigma experienced by patients involved problems taking part in social programs (79.5%). Mean levels of stigma were significantly higher in women (55.5%), illiterate individuals (60.8%), and villagers (60.8%) relative to those of other groups. Chi-square tests revealed that education, monthly family income, and type of patient (pulmonary and extrapulmonary) were significantly associated with stigma. Binary logistic regression analysis demonstrated that stigma was influenced by sex, education, and type of patient. Stigma is one of the most important barriers to treatment adherence. Therefore, in interventions that aim to reduce stigma, strong collaboration between various institutions is essential.
Constructive thinking, rational intelligence and irritable bowel syndrome.
Rey, Enrique; Moreno Ortega, Marta; Garcia Alonso, Monica-Olga; Diaz-Rubio, Manuel
2009-07-07
To evaluate rational and experiential intelligence in irritable bowel syndrome (IBS) sufferers. We recruited 100 subjects with IBS as per Rome II criteria (50 consulters and 50 non-consulters) and 100 healthy controls, matched by age, sex and educational level. Cases and controls completed a clinical questionnaire (including symptom characteristics and medical consultation) and the following tests: rational-intelligence (Wechsler Adult Intelligence Scale, 3rd edition); experiential-intelligence (Constructive Thinking Inventory); personality (NEO personality inventory); psychopathology (MMPI-2), anxiety (state-trait anxiety inventory) and life events (social readjustment rating scale). Analysis of variance was used to compare the test results of IBS-sufferers and controls, and a logistic regression model was then constructed and adjusted for age, sex and educational level to evaluate any possible association with IBS. No differences were found between IBS cases and controls in terms of IQ (102.0 +/- 10.8 vs 102.8 +/- 12.6), but IBS sufferers scored significantly lower in global constructive thinking (43.7 +/- 9.4 vs 49.6 +/- 9.7). In the logistic regression model, global constructive thinking score was independently linked to suffering from IBS [OR 0.92 (0.87-0.97)], without significant OR for total IQ. IBS subjects do not show lower rational intelligence than controls, but lower experiential intelligence is nevertheless associated with IBS.
Female autonomy and reported abortion-seeking in Ghana, West Africa.
Rominski, Sarah D; Gupta, Mira; Aborigo, Raymond; Adongo, Phillip; Engman, Cyril; Hodgson, Abraham; Moyer, Cheryl
2014-09-01
To investigate factors associated with self-reported pregnancy termination in Ghana and thereby appreciate the correlates of abortion-seeking in order to understand safe abortion care provision. In a retrospective study, data from the Ghana 2008 Demographic and Health Survey were used to investigate factors associated with self-reported pregnancy termination. Variables on an individual and household level were examined by both bivariate analyses and multivariate logistic regression. A five-point autonomy scale was created to explore the role of female autonomy in reported abortion-seeking behavior. Among 4916 women included in the survey, 791 (16.1%) reported having an abortion. Factors associated with abortion-seeking included being older, having attended school, and living in an urban versus a rural area. When entered into a logistic regression model with demographic control variables, every step up the autonomy scale (i.e. increasing autonomy) was associated with a 14.0% increased likelihood of reporting the termination of a pregnancy (P < 0.05). Although health system barriers might play a role in preventing women from seeking safe abortion services, autonomy on an individual level is also important and needs to be addressed if women are to be empowered to seek safe abortion services. Copyright © 2014 International Federation of Gynecology and Obstetrics. Published by Elsevier Ireland Ltd. All rights reserved.
Saa, Luis Rodrigo; Perea, Anselmo; García-Bocanegra, Ignacio; Arenas, Antonio José; Jara, Diego Vinicio; Ramos, Raul; Carbonero, Alfonso
2012-03-01
A cross-sectional study was carried out to determine the seroprevalence and risk factors associated with Bovine viral diarrhea virus (BVDV) infection in non-vaccinated dairy and dual-purpose cattle herds from Ecuador. A total of 2,367 serum samples from 346 herds were collected from June 2008 through February 2009. A questionnaire, which included variables related to cattle, health, management measures, and the environment, was filled out in each herd. A commercial indirect enzyme-linked immunosorbent assay test was used to determine the seropositivity. A logistic regression model was used to determine risk factors at herd level. The individual seroprevalence for BVDV in non-vaccinated herds in Ecuador was 36.2% (857/2,367; CI(95%), 34.3-38.1%). The herd prevalence was 74% (256/346; CI(95%), 69.4-78.6%) and the intra-herd prevalence ranged between 11.1% and 100% (mean = 51.6%). The logistic regression model showed that the density of cattle farms in the area (more than 70%; OR, 1.94; CI(95%), 1.21-3.2) and the altitude (higher than 2,338 m above sea level; 2.33; CI(95%), 1.4-3.9) are potential risk factors associated with BVDV infection.
Xu, Wenjian; Zheng, Lijun; Xu, Yin; Zheng, Yong
2017-02-17
Social attitudes toward male homosexuality in China so far are still not optimistic. Sexual minorities in China have reported high levels of internalized homophobia. This Internet-based study examined the associations among internalized homophobia, mental health, sexual behaviors, and outness among 435 gay/bisexual men in Southwest China from 2014 to 2015. Latent profile analysis, confirmatory factor analysis, univariate logistic regression, and separate multivariate logistic regression analyses were conducted. This descriptive study found the Internalized Homophobia Scale to be suitable for use in China. The sample demonstrated a high prevalence of internalized homophobia. Latent profile analysis suggested a 2-class solution as optimal, and a high level of internalized homophobia was significantly associated with greater psychological distress (Wald = 6.49, AOR = 1.66), transactional sex during the previous 6 months (Wald = 5.23, AOR = 2.77), more sexual compulsions (Wald = 14.05, AOR = 2.12), and the concealment of sexual identity from others (Wald = 30.70, AOR = 0.30) and parents (Wald = 6.72, AOR = 0.49). These findings contribute to our understanding of internalized homophobia in China, and highlight the need to decrease gay-related psychological stress/distress and improve public health services.
Chou, Wen-Jiun; Liu, Tai-Ling; Hu, Huei-Fan; Yen, Cheng-Fang
2016-01-01
The aim of this study was to examine the prevalence rates of suicidal intent and its correlates among adolescents diagnosed with ADHD in Taiwan. A total of 287 adolescents aged 11-18 years and diagnosed with ADHD participated in this study. Their suicidal ideation and suicide attempts were assessed. Logistic regression analysis was used to examine the associations of suicide with individual, family, peer, ADHD, and psychopathology factors. A total of 12.2% of the participants reported suicidal ideation or a suicide attempt. A logistic regression analysis model showed that adolescents who were older, were bullying perpetrators, and reported high depression level were more likely to have suicidal intent. These three factors were also significantly correlated with suicidal ideation; however, only having high depression level was significantly correlated with suicidal attempts. The results of this study showed that a high proportion of adolescents with ADHD reported suicidal ideation or a suicide attempt. Multiple factors were significantly associated with suicidal intent among adolescents with ADHD. Clinicians, educational professionals, and parents of adolescents with ADHD should monitor the possibility of suicide in adolescents with ADHD who exhibit the correlates of suicidal intent identified in this study. Copyright © 2016 Elsevier Ltd. All rights reserved.
Pinna, Antonio; Zinellu, Angelo; Tendas, Donatella; Blasetti, Francesco; Carru, Ciriaco; Castiglia, Paolo
2016-01-01
To compare the plasma levels of homocysteine and asymmetrical dimethyl-l-arginine (ADMA) and the degree of whole blood DNA methylation in patients with early and neovascular age-related macular degeneration (AMD) and in controls without maculopathy of any sort. This observational case-control pilot study included 39 early AMD patients, 27 neovascular AMD patients and 132 sex- and age-matched controls without maculopathy. Plasma homocysteine and ADMA concentrations and the degree of whole blood DNA methylation were measured. Quantitative variables were compared by Student's t-test or Mann-Whitney test. Logistic regression models were used to investigate the significance of the association between early or wet AMD and some variables. There were no significant differences in mean plasma homocysteine and ADMA concentrations and in the degree of whole blood DNA methylation between patients with early or neovascular AMD and their controls. Similarly, logistic regression analysis disclosed that plasma homocysteine and ADMA levels were not associated with an increased risk for early or neovascular AMD. We failed to demonstrate an association between early or neovascular AMD and increased plasma homocysteine and/or ADMA. Results also suggest that the degree of whole blood DNA methylation is not a marker of AMD.
Lee, Jongin; Kim, Hyoung-Ryoul
2018-05-22
To show the association of hs-CRP level with working hours in different age groups. We used data from Korean National Health and Nutrition Survey. The odds ratios (ORs) and 95% confidence intervals (CIs) of variables for elevated hs-CRP (> 3.0 mg/L) were generated with logistic regression models. Significant variables were verified with an adjusted multivariate logistic model after stratification of age groups. Working for more than 55 hours per week was associated with elevated hs-CRP level in the old-ages group (≥ 60 years old: OR 2.18, 95% CI 1.07-4.45). Working for 40-55 hours per week was associated with decreased hs-CRP in the young-ages group (OR 0.58, 95% CI 0.37-0.93). Working hours appear to influence the levels of hs-CRP in individuals aged older than 60 years.
Science of Test Research Consortium: Year Two Final Report
2012-10-02
July 2012. Analysis of an Intervention for Small Unmanned Aerial System ( SUAS ) Accidents, submitted to Quality Engineering, LQEN-2012-0056. Stone... Systems Engineering. Wolf, S. E., R. R. Hill, and J. J. Pignatiello. June 2012. Using Neural Networks and Logistic Regression to Model Small Unmanned ...Human Retina. 6. Wolf, S. E. March 2012. Modeling Small Unmanned Aerial System Mishaps using Logistic Regression and Artificial Neural Networks. 7
ERIC Educational Resources Information Center
Hidalgo, Mª Dolores; Gómez-Benito, Juana; Zumbo, Bruno D.
2014-01-01
The authors analyze the effectiveness of the R[superscript 2] and delta log odds ratio effect size measures when using logistic regression analysis to detect differential item functioning (DIF) in dichotomous items. A simulation study was carried out, and the Type I error rate and power estimates under conditions in which only statistical testing…
Brian S. Cade; Barry R. Noon; Rick D. Scherer; John J. Keane
2017-01-01
Counts of avian fledglings, nestlings, or clutch size that are bounded below by zero and above by some small integer form a discrete random variable distribution that is not approximated well by conventional parametric count distributions such as the Poisson or negative binomial. We developed a logistic quantile regression model to provide estimates of the empirical...
Mohammed, Mohammed A; Manktelow, Bradley N; Hofer, Timothy P
2016-04-01
There is interest in deriving case-mix adjusted standardised mortality ratios so that comparisons between healthcare providers, such as hospitals, can be undertaken in the controversial belief that variability in standardised mortality ratios reflects quality of care. Typically standardised mortality ratios are derived using a fixed effects logistic regression model, without a hospital term in the model. This fails to account for the hierarchical structure of the data - patients nested within hospitals - and so a hierarchical logistic regression model is more appropriate. However, four methods have been advocated for deriving standardised mortality ratios from a hierarchical logistic regression model, but their agreement is not known and neither do we know which is to be preferred. We found significant differences between the four types of standardised mortality ratios because they reflect a range of underlying conceptual issues. The most subtle issue is the distinction between asking how an average patient fares in different hospitals versus how patients at a given hospital fare at an average hospital. Since the answers to these questions are not the same and since the choice between these two approaches is not obvious, the extent to which profiling hospitals on mortality can be undertaken safely and reliably, without resolving these methodological issues, remains questionable. © The Author(s) 2012.
Chan, Siew Foong; Deeks, Jonathan J; Macaskill, Petra; Irwig, Les
2008-01-01
To compare three predictive models based on logistic regression to estimate adjusted likelihood ratios allowing for interdependency between diagnostic variables (tests). This study was a review of the theoretical basis, assumptions, and limitations of published models; and a statistical extension of methods and application to a case study of the diagnosis of obstructive airways disease based on history and clinical examination. Albert's method includes an offset term to estimate an adjusted likelihood ratio for combinations of tests. Spiegelhalter and Knill-Jones method uses the unadjusted likelihood ratio for each test as a predictor and computes shrinkage factors to allow for interdependence. Knottnerus' method differs from the other methods because it requires sequencing of tests, which limits its application to situations where there are few tests and substantial data. Although parameter estimates differed between the models, predicted "posttest" probabilities were generally similar. Construction of predictive models using logistic regression is preferred to the independence Bayes' approach when it is important to adjust for dependency of tests errors. Methods to estimate adjusted likelihood ratios from predictive models should be considered in preference to a standard logistic regression model to facilitate ease of interpretation and application. Albert's method provides the most straightforward approach.
Cameron, Isobel M; Scott, Neil W; Adler, Mats; Reid, Ian C
2014-12-01
It is important for clinical practice and research that measurement scales of well-being and quality of life exhibit only minimal differential item functioning (DIF). DIF occurs where different groups of people endorse items in a scale to different extents after being matched by the intended scale attribute. We investigate the equivalence or otherwise of common methods of assessing DIF. Three methods of measuring age- and sex-related DIF (ordinal logistic regression, Rasch analysis and Mantel χ(2) procedure) were applied to Hospital Anxiety Depression Scale (HADS) data pertaining to a sample of 1,068 patients consulting primary care practitioners. Three items were flagged by all three approaches as having either age- or sex-related DIF with a consistent direction of effect; a further three items identified did not meet stricter criteria for important DIF using at least one method. When applying strict criteria for significant DIF, ordinal logistic regression was slightly less sensitive. Ordinal logistic regression, Rasch analysis and contingency table methods yielded consistent results when identifying DIF in the HADS depression and HADS anxiety scales. Regardless of methods applied, investigators should use a combination of statistical significance, magnitude of the DIF effect and investigator judgement when interpreting the results.
NASA Astrophysics Data System (ADS)
Cao, Faxian; Yang, Zhijing; Ren, Jinchang; Ling, Wing-Kuen; Zhao, Huimin; Marshall, Stephen
2017-12-01
Although the sparse multinomial logistic regression (SMLR) has provided a useful tool for sparse classification, it suffers from inefficacy in dealing with high dimensional features and manually set initial regressor values. This has significantly constrained its applications for hyperspectral image (HSI) classification. In order to tackle these two drawbacks, an extreme sparse multinomial logistic regression (ESMLR) is proposed for effective classification of HSI. First, the HSI dataset is projected to a new feature space with randomly generated weight and bias. Second, an optimization model is established by the Lagrange multiplier method and the dual principle to automatically determine a good initial regressor for SMLR via minimizing the training error and the regressor value. Furthermore, the extended multi-attribute profiles (EMAPs) are utilized for extracting both the spectral and spatial features. A combinational linear multiple features learning (MFL) method is proposed to further enhance the features extracted by ESMLR and EMAPs. Finally, the logistic regression via the variable splitting and the augmented Lagrangian (LORSAL) is adopted in the proposed framework for reducing the computational time. Experiments are conducted on two well-known HSI datasets, namely the Indian Pines dataset and the Pavia University dataset, which have shown the fast and robust performance of the proposed ESMLR framework.
Latin hypercube approach to estimate uncertainty in ground water vulnerability
Gurdak, J.J.; McCray, J.E.; Thyne, G.; Qi, S.L.
2007-01-01
A methodology is proposed to quantify prediction uncertainty associated with ground water vulnerability models that were developed through an approach that coupled multivariate logistic regression with a geographic information system (GIS). This method uses Latin hypercube sampling (LHS) to illustrate the propagation of input error and estimate uncertainty associated with the logistic regression predictions of ground water vulnerability. Central to the proposed method is the assumption that prediction uncertainty in ground water vulnerability models is a function of input error propagation from uncertainty in the estimated logistic regression model coefficients (model error) and the values of explanatory variables represented in the GIS (data error). Input probability distributions that represent both model and data error sources of uncertainty were simultaneously sampled using a Latin hypercube approach with logistic regression calculations of probability of elevated nonpoint source contaminants in ground water. The resulting probability distribution represents the prediction intervals and associated uncertainty of the ground water vulnerability predictions. The method is illustrated through a ground water vulnerability assessment of the High Plains regional aquifer. Results of the LHS simulations reveal significant prediction uncertainties that vary spatially across the regional aquifer. Additionally, the proposed method enables a spatial deconstruction of the prediction uncertainty that can lead to improved prediction of ground water vulnerability. ?? 2007 National Ground Water Association.
Who cares about health inequalities? Cross-country evidence from the World Health Survey
King, Nicholas B; Harper, Sam; Young, Meredith E
2013-01-01
Reduction of health inequalities within and between countries is a global health priority, but little is known about the determinants of popular support for this goal. We used data from the World Health Survey to assess individual preferences for prioritizing reductions in health and health care inequalities. We used descriptive tables and regression analysis to study the determinants of preferences for reducing health inequalities as the primary health system goal. Determinants included individual socio-demographic characteristics (age, sex, urban residence, education, marital status, household income, self-rated health, health care use, satisfaction with health care system) and country-level characteristics [gross domestic product (GDP) per capita, disability-free life expectancy, equality in child mortality, income inequality, health and public health expenditures]. We used logistic regression to assess the likelihood that individuals ranked minimizing inequalities first, and rank-ordered logistic regression to compare the ranking of other priorities against minimizing health inequalities. Individuals tended to prioritize health system goals related to overall improvement (improving population health and health care responsiveness) over those related to equality and fairness (minimizing inequalities in health and responsiveness, and promoting fairness of financial contribution). Individuals in countries with higher GDP per capita, life expectancy, and equality in child mortality were more likely to prioritize minimizing health inequalities. PMID:23059735
Holtschlag, David J.; Shively, Dawn; Whitman, Richard L.; Haack, Sheridan K.; Fogarty, Lisa R.
2008-01-01
Regression analyses and hydrodynamic modeling were used to identify environmental factors and flow paths associated with Escherichia coli (E. coli) concentrations at Memorial and Metropolitan Beaches on Lake St. Clair in Macomb County, Mich. Lake St. Clair is part of the binational waterway between the United States and Canada that connects Lake Huron with Lake Erie in the Great Lakes Basin. Linear regression, regression-tree, and logistic regression models were developed from E. coli concentration and ancillary environmental data. Linear regression models on log10 E. coli concentrations indicated that rainfall prior to sampling, water temperature, and turbidity were positively associated with bacteria concentrations at both beaches. Flow from Clinton River, changes in water levels, wind conditions, and log10 E. coli concentrations 2 days before or after the target bacteria concentrations were statistically significant at one or both beaches. In addition, various interaction terms were significant at Memorial Beach. Linear regression models for both beaches explained only about 30 percent of the variability in log10 E. coli concentrations. Regression-tree models were developed from data from both Memorial and Metropolitan Beaches but were found to have limited predictive capability in this study. The results indicate that too few observations were available to develop reliable regression-tree models. Linear logistic models were developed to estimate the probability of E. coli concentrations exceeding 300 most probable number (MPN) per 100 milliliters (mL). Rainfall amounts before bacteria sampling were positively associated with exceedance probabilities at both beaches. Flow of Clinton River, turbidity, and log10 E. coli concentrations measured before or after the target E. coli measurements were related to exceedances at one or both beaches. The linear logistic models were effective in estimating bacteria exceedances at both beaches. A receiver operating characteristic (ROC) analysis was used to determine cut points for maximizing the true positive rate prediction while minimizing the false positive rate. A two-dimensional hydrodynamic model was developed to simulate horizontal current patterns on Lake St. Clair in response to wind, flow, and water-level conditions at model boundaries. Simulated velocity fields were used to track hypothetical massless particles backward in time from the beaches along flow paths toward source areas. Reverse particle tracking for idealized steady-state conditions shows changes in expected flow paths and traveltimes with wind speeds and directions from 24 sectors. The results indicate that three to four sets of contiguous wind sectors have similar effects on flow paths in the vicinity of the beaches. In addition, reverse particle tracking was used for transient conditions to identify expected flow paths for 10 E. coli sampling events in 2004. These results demonstrate the ability to track hypothetical particles from the beaches, backward in time, to likely source areas. This ability, coupled with a greater frequency of bacteria sampling, may provide insight into changes in bacteria concentrations between source and sink areas.
Gante, Inês; Ferreira, Ana Carina; Pestana, Gonçalo; Pires, Daniela; Amaral, Njila; Dores, Jorge; do Céu Almeida, Maria; Sandoval, José Luis
2018-03-01
Gestational diabetes mellitus (GDM) occurs in 5-15% of pregnancies, and lower maternal educational attainment has been associated with higher risk of GDM. We aimed to determine if maternal education level is associated with persistent post-partum glucose metabolism disorders in women with GDM. Retrospective cohort study of women with GDM followed in 25 Portuguese health institutions between 2008 and 2012. Educational attainment was categorised into four levels. Prevalence of post-partum glucose metabolism disorders (type 2 diabetes mellitus, increased fasting plasma glucose or impaired glucose tolerance) was compared and adjusted odds ratios calculated controlling for confounders using logistic regression. We included 4490 women diagnosed with GDM. Educational level ranged as follows: 6.8% (n = 307) were at level 1 (≤ 6th grade), 34.6% (n = 1554) at level 2 (6-9th grade), 30.4% (n = 1364) at level 3 (10-12th grade) and 28.2% (n = 1265) at level 4 (≥ university degree). At 6 weeks post-partum re-evaluation, 10.9% (n = 491) had persistent glucose metabolism disorders. Educational levels 1 and 2 had a higher probability of persistent post-partum glucose metabolism disorders when compared to level 4 (OR = 2.37 [1.69;3.32], p < 0.001 and OR = 1.39 [1.09;1.76], p = 0.008, for level 1 and 2, respectively), an association that persisted in multivariable logistic regression adjusting for confounders (level 1 OR = 2.25 [1.53;3.33], p < 0.001; level 2 OR = 1.43 [1.09;1.89], p = 0.01). Persistent post-partum glucose metabolism disorders are frequent in women with GDM and associated with lower maternal educational level. Interventions aimed at this risk group may contribute towards a decrease in prevalence of post-partum glucose metabolism disorders.
Kupek, Emil
2006-03-15
Structural equation modelling (SEM) has been increasingly used in medical statistics for solving a system of related regression equations. However, a great obstacle for its wider use has been its difficulty in handling categorical variables within the framework of generalised linear models. A large data set with a known structure among two related outcomes and three independent variables was generated to investigate the use of Yule's transformation of odds ratio (OR) into Q-metric by (OR-1)/(OR+1) to approximate Pearson's correlation coefficients between binary variables whose covariance structure can be further analysed by SEM. Percent of correctly classified events and non-events was compared with the classification obtained by logistic regression. The performance of SEM based on Q-metric was also checked on a small (N = 100) random sample of the data generated and on a real data set. SEM successfully recovered the generated model structure. SEM of real data suggested a significant influence of a latent confounding variable which would have not been detectable by standard logistic regression. SEM classification performance was broadly similar to that of the logistic regression. The analysis of binary data can be greatly enhanced by Yule's transformation of odds ratios into estimated correlation matrix that can be further analysed by SEM. The interpretation of results is aided by expressing them as odds ratios which are the most frequently used measure of effect in medical statistics.
Suzuki, Taku; Iwamoto, Takuji; Shizu, Kanae; Suzuki, Katsuji; Yamada, Harumoto; Sato, Kazuki
2017-05-01
This retrospective study was designed to investigate prognostic factors for postoperative outcomes for cubital tunnel syndrome (CubTS) using multiple logistic regression analysis with a large number of patients. Eighty-three patients with CubTS who underwent surgeries were enrolled. The following potential prognostic factors for disease severity were selected according to previous reports: sex, age, type of surgery, disease duration, body mass index, cervical lesion, presence of diabetes mellitus, Workers' Compensation status, preoperative severity, and preoperative electrodiagnostic testing. Postoperative severity of disease was assessed 2 years after surgery by Messina's criteria which is an outcome measure specifically for CubTS. Bivariate analysis was performed to select candidate prognostic factors for multiple linear regression analyses. Multiple logistic regression analysis was conducted to identify the association between postoperative severity and selected prognostic factors. Both bivariate and multiple linear regression analysis revealed only preoperative severity as an independent risk factor for poor prognosis, while other factors did not show any significant association. Although conflicting results exist regarding prognosis of CubTS, this study supports evidence from previous studies and concludes early surgical intervention portends the most favorable prognosis. Copyright © 2017 The Japanese Orthopaedic Association. Published by Elsevier B.V. All rights reserved.
Unequal views of inequality: Cross-national support for redistribution 1985-2011.
VanHeuvelen, Tom
2017-05-01
This research examines public views on government responsibility to reduce income inequality, support for redistribution. While individual-level correlates of support for redistribution are relatively well understood, many questions remain at the country-level. Therefore, I examine how country-level characteristics affect aggregate support for redistribution. I test explanations of aggregate support using a unique dataset combining 18 waves of the International Social Survey Programme and European Social Survey. Results from mixed-effects logistic regression and fixed-effects linear regression models show two primary and contrasting effects. States that reduce inequality through bundles of tax and transfer policies are rewarded with more supportive publics. In contrast, economic development has a seemingly equivalent and dampening effect on public support. Importantly, the effect of economic development grows at higher levels of development, potentially overwhelming the amplifying effect of state redistribution. My results therefore suggest a fundamental challenge to proponents of egalitarian politics. Copyright © 2016 Elsevier Inc. All rights reserved.
Lopes-Virella, Maria F; Baker, Nathaniel L; Hunt, Kelly J; Cleary, Patricia A; Klein, Richard; Virella, Gabriel
2013-08-01
The current study aimed to determine in the Diabetes Control and Complications Trial (DCCT)/Epidemiology of Diabetes Interventions and Complications cohort whether or not abnormal levels of markers of inflammation and endothelial dysfunction measured in samples collected at DCCT baseline were able to predict the development of macroalbuminuria. Levels of inflammation and endothelial cell dysfunction biomarkers were measured in 1,237 of 1,441 patients enrolled in the DCCT study who were both free of albuminuria and cardiovascular disease at baseline. To test the association of log-transformed biomarkers with albuminuria, generalized logistic regression models were used to quantify the association of increased levels of biomarkers and development of abnormal albuminuria. Normal, micro-, and macroalbuminuria were the outcomes of interest. In the logistic regression models adjusted by DCCT treatment assignment, baseline albumin excretion rate, and use of ACE/angiotensin receptor blocker drugs, one unit increase in the standardized levels of soluble E-selectin (sE-selectin) was associated with an 87% increase in the odds to develop macroalbuminuria and one unit increase in the levels of interleukin-6 (IL-6), plasminogen activator inhibitor 1 (PAI-1; total and active), and soluble tumor necrosis factor receptors (TNFR)-1 and -2 lead to a 30-50% increase in the odds to develop macroalbuminuria. Following adjustment for DCCT baseline retinopathy status, age, sex, HbA1c, and duration of diabetes, significant associations remained for sE-selectin and TNFR-1 and -2 but not for IL-6 or PAI-1. Our study indicates that high levels of inflammatory markers, mainly E-selectin and sTNRF-1 and -2, are important predictors of macroalbuminuria in patients with type 1 diabetes.
Pieper, L; Godkin, A; Roesler, U; Polleichtner, A; Slavic, D; Leslie, K E; Kelton, D F
2012-10-01
Prototheca spp. are algae that cause incurable acute or chronic mastitis in dairy cows. The aim of this case-control study was the identification of cow- and herd-level risk factors for this unusual mastitis pathogen. Aseptically collected composite milk samples from 2,428 milking cows in 23 case and 23 control herds were collected between January and May 2011. A questionnaire was administered to the producers, and cow-level production and demographic data were gathered. In 58 of 64 isolates, Prototheca spp. and Prototheca zopfii genotypes were differentiated using PCR and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. All isolates were identified as Prototheca zopfii genotype 2. The mean within-herd prevalence for Prototheca spp. was 5.1% (range 0.0-12.5%). Case herds had a significantly lower herd-level prevalence of Staphylococcus aureus and a higher prevalence of yeasts than did control herds. The final logistic regression model for herd-level risk factors included use of intramammary injections of a non-intramammary drug [odds ratio (OR) = 136.8], the number of different injectable antibiotic products being used (OR = 2.82), the use of any dry cow teat sealant (external OR = 80.0; internal OR = 34.2), and having treated 3 or more displaced abomasums in the last 12 mo OR = 44.7). The final logistic regression model for cow-level risk factors included second or greater lactation (OR = 4.40) and the logarithm of the lactation-average somatic cell count (OR = 2.99). Unsanitary or repeated intramammary infusions, antibiotic treatment, and off-label use of injectable drugs in the udder might promote Prototheca udder infection. Copyright © 2012 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Farzaneh, N; Ghobaklou, M; Moghimi-Dehkordi, B; Naderi, N; Fadai, F
2013-01-01
Background: Irritable Bowel Syndrome (IBS) is a common functional gastrointestinal disorder. Aims: To identify demographic factors in patients with IBS. Subjects and Methods: One-hundred and fifty three IBS patients seen at Taleghani Hospital Gastroenterology Clinic and met the Rome III criteria and 163 peoples who did not meet IBS criteria were consecutively enrolled. Both groups were asked to complete a self-rating questionnaire containing information, which included questions about age, sex, monthly income, education level, marital status, height, weight, alcohol drinking and smoking habits. Student's t-test, Pearson's Chi-square and logistic regression were used to statistical analysis. Results: The mean (SD) age for IBS patients 36.3 (13.5) years and 33.1 (9.9) years in non-IBS group (P < 0.001). Frequency of IBS defined by Rome III criteria was higher in females and younger individuals. Univariate analysis showed that IBS in males was associated with a lower monthly income and educational level and in females younger age, single, lower monthly income and educational level, body mass index (BMI), and unemployment status. Multivariate logistic regression identified a low level of education in males (Odds ratio [OR] = 3.6, 95% Confidence interval [CI]: 1.4-9.6) and in females, lower education level (OR = 2.4, 95% CI: 1.1-5.2), lower BMI (OR = 0.94, 95% CI: 0.89-0.99), unemployed (OR = 0.31, 95% CI: 0.11-0.85) and smoking (OR = 6.2, 95% CI: 1.03-37.2). Conclusion: We identified demographic factors in IBS patients. Being single and having a lower educational level, income, lower BMI and being unemployed were the most important factors associated with IBS, particularly in females. PMID:24116320
Shi, Lei; Zhang, Danyang; Zhou, Chenyu; Yang, Libin; Sun, Tao; Hao, Tianjun; Peng, Xiangwen; Gao, Lei; Liu, Wenhui; Mu, Yi; Han, Yuzhen; Fan, Lihua
2017-01-01
Objectives The purpose of the present study was to explore the characteristics of workplace violence that Chinese nurses at tertiary and county–level hospitals encountered in the 12 months from December 2014 to January 2016, to identify and analyse risk factors for workplace violence, and to establish the basis for future preventive strategies. Design A cross–sectional study. Setting A total of 44 tertiary hospitals and 90 county–level hospitals in 16 provinces (municipalities or autonomous regions) in China. Methods We used stratified random sampling to collect data from December 2014 to January 2016. We distributed 21 360 questionnaires, and 15 970 participants provided valid data (effective response rate=74.77%). We conducted binary logistic regression analyses on the risk factors for workplace violence among the nurses in our sample and analysed the reasons for aggression. Results The prevalence of workplace violence was 65.8%; of this, 64.9% was verbal violence, and physical violence and sexual harassment accounted for 11.8% and 3.9%, respectively. Frequent workplace violence occurred primarily in emergency and paediatric departments. Respondents reported that patients’ relatives were the main perpetrators in tertiary and county–level hospitals. Logistic regression analysis showed that respondents’ age, department, years of experience and direct contact with patients were common risk factors at different levels of hospitals. Conclusions Workplace violence is frequent in China’s tertiary and county–level hospitals; its occurrence is especially frequent in the emergency and paediatric departments. It is necessary to cope with workplace violence by developing effective control strategies at individual, hospital and national levels. PMID:28647719
Shi, Lei; Zhang, Danyang; Zhou, Chenyu; Yang, Libin; Sun, Tao; Hao, Tianjun; Peng, Xiangwen; Gao, Lei; Liu, Wenhui; Mu, Yi; Han, Yuzhen; Fan, Lihua
2017-06-24
The purpose of the present study was to explore the characteristics of workplace violence that Chinese nurses at tertiary and county-level hospitals encountered in the 12 months from December 2014 to January 2016, to identify and analyse risk factors for workplace violence, and to establish the basis for future preventive strategies. A cross-sectional study. A total of 44 tertiary hospitals and 90 county-level hospitals in 16 provinces (municipalities or autonomous regions) in China. We used stratified random sampling to collect data from December 2014 to January 2016. We distributed 21 360 questionnaires, and 15 970 participants provided valid data (effective response rate=74.77%). We conducted binary logistic regression analyses on the risk factors for workplace violence among the nurses in our sample and analysed the reasons for aggression. The prevalence of workplace violence was 65.8%; of this, 64.9% was verbal violence, and physical violence and sexual harassment accounted for 11.8% and 3.9%, respectively. Frequent workplace violence occurred primarily in emergency and paediatric departments. Respondents reported that patients' relatives were the main perpetrators in tertiary and county-level hospitals. Logistic regression analysis showed that respondents' age, department, years of experience and direct contact with patients were common risk factors at different levels of hospitals. Workplace violence is frequent in China's tertiary and county-level hospitals; its occurrence is especially frequent in the emergency and paediatric departments. It is necessary to cope with workplace violence by developing effective control strategies at individual, hospital and national levels. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Lee, Chia Ee; Vincent-Chong, Vui King; Ramanathan, Anand; Kallarakkal, Thomas George; Karen-Ng, Lee Peng; Ghani, Wan Maria Nabillah; Rahman, Zainal Ariff Abdul; Ismail, Siti Mazlipah; Abraham, Mannil Thomas; Tay, Keng Kiong; Mustafa, Wan Mahadzir Wan; Cheong, Sok Ching; Zain, Rosnah Binti
2015-01-01
BACKGROUND: Collagen Triple Helix Repeat Containing 1 (CTHRC1) is a protein often found to be over-expressed in various types of human cancers. However, correlation between CTHRC1 expression level with clinico-pathological characteristics and prognosis in oral cancer remains unclear. Therefore, this study aimed to determine mRNA and protein expression of CTHRC1 in oral squamous cell carcinoma (OSCC) and to evaluate the clinical and prognostic impact of CTHRC1 in OSCC. METHODS: In this study, mRNA and protein expression of CTHRC1 in OSCCs were determined by quantitative PCR and immunohistochemistry, respectively. The association between CTHRC1 and clinico-pathological parameters were evaluated by univariate and multivariate binary logistic regression analyses. Correlation between CTHRC1 protein expressions with survival were analysed using Kaplan-Meier and Cox regression models. RESULTS: Current study demonstrated CTHRC1 was significantly overexpressed at the mRNA level in OSCC. Univariate analyses indicated a high-expression of CTHRC1 that was significantly associated with advanced stage pTNM staging, tumour size ≥ 4 cm and positive lymph node metastasis (LNM). However, only positive LNM remained significant after adjusting with other confounder factors in multivariate logistic regression analyses. Kaplan-Meier survival analyses and Cox model demonstrated that patients with high-expression of CTHRC1 protein were associated with poor prognosis and is an independent prognostic factor in OSCC. CONCLUSION: This study indicated that over-expression of CTHRC1 potentially as an independent predictor for positive LNM and poor prognosis in OSCC. PMID:26664254
Ali Morowatisharifabad, Mohammad; Abdolkarimi, Mahdi; Asadpour, Mohammad; Fathollahi, Mahmood Sheikh; Balaee, Parisa
2018-01-01
INTRODUCTION: Theory-based education tailored to target behaviour and group can be effective in promoting physical activity. AIM: The purpose of this study was to examine the predictive power of Protection Motivation Theory on intent and behaviour of Physical Activity in Patients with Type 2 Diabetes. METHODS: This descriptive study was conducted on 250 patients in Rafsanjan, Iran. To examine the scores of protection motivation theory structures, a researcher-made questionnaire was used. Its validity and reliability were confirmed. The level of physical activity was also measured by the International Short - form Physical Activity Inventory. Its validity and reliability were also approved. Data were analysed by statistical tests including correlation coefficient, chi-square, logistic regression and linear regression. RESULTS: The results revealed that there was a significant correlation between all the protection motivation theory constructs and the intention to do physical activity. The results showed that the Theory structures were able to predict 60% of the variance of physical activity intention. The results of logistic regression demonstrated that increase in the score of physical activity intent and self - efficacy increased the chance of higher level of physical activity by 3.4 and 1.5 times, respectively OR = (3.39, 1.54). CONCLUSION: Considering the ability of protection motivation theory structures to explain the physical activity behaviour, interventional designs are suggested based on the structures of this theory, especially to improve self -efficacy as the most powerful factor in predicting physical activity intention and behaviour. PMID:29731945
Prevalence and Geographic Variations of Polypharmacy Among West Virginia Medicaid Beneficiaries.
Feng, Xue; Tan, Xi; Riley, Brittany; Zheng, Tianyu; Bias, Thomas K; Becker, James B; Sambamoorthi, Usha
2017-11-01
West Virginia (WV) residents are at high risk for polypharmacy given its considerable chronic disease burdens. To evaluate the prevalence, correlates, outcomes, and geographic variations of polypharmacy among WV Medicaid beneficiaries. In this cross-sectional study, we analyzed 2009-2010 WV Medicaid fee-for-service (FFS) claims data for adults aged 18-64 (N=37,570). We defined polypharmacy as simultaneous use of drugs from five or more different drug classes on a daily basis for at least 60 consecutive days in one year. Multilevel logistic regression was used to explore the individual- and county-level factors associated with polypharmacy. Its relationship with healthcare utilization was assessed using negative binomial regression and logistic regression. The univariate local indicators of spatial association method was applied to explore spatial patterns of polypharmacy in WV. The prevalence of polypharmacy among WV Medicaid beneficiaries was 44.6%. High-high clusters of polypharmacy were identified in southern WV, indicating counties with above-average prevalence surrounded by counties with above-average prevalence. Polypharmacy was associated with being older, female, eligible for Medicaid due to cash assistance or medical eligibility, having any chronic conditions or more chronic conditions, and living in a county with lower levels of education. Polypharmacy was associated with more hospitalizations, emergency department visits, and outpatient visits, as well as higher non-drug medical expenditures. Polypharmacy was prevalent among WV Medicaid beneficiaries and was associated with substantial healthcare utilization and expenditures. The clustering of high prevalence of polypharmacy in southern WV may suggest targeted strategies to reduce polypharmacy burden in these areas.
Mearelli, Filippo; Fiotti, Nicola; Altamura, Nicola; Zanetti, Michela; Fernandes, Giovanni; Burekovic, Ismet; Occhipinti, Alessandro; Orso, Daniele; Giansante, Carlo; Casarsa, Chiara; Biolo, Gianni
2014-10-01
The objective of the study was to determine the accuracy of phospholipase A2 group II (PLA2-II), interferon-gamma-inducible protein 10 (IP-10), angiopoietin-2 (Ang-2), and procalcitonin (PCT) plasma levels in early ruling in/out of sepsis among systemic inflammatory response syndrome (SIRS) patients. Biomarker levels were determined in 80 SIRS patients during the first 4 h of admission to the medical ward. The final diagnosis of sepsis or non-infective SIRS was issued according to good clinical practice. Sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) for sepsis diagnosis were assessed. The optimal biomarker combinations with clinical variables were investigated by logistic regression and decision tree (CART). PLA2-II, IP-10 and PCT, but not Ang-2, were significantly higher in septic (n = 60) than in non-infective SIRS (n = 20) patients (P ≤ 0.001, 0.027, and 0.002, respectively). PLA2-II PPV and NPV were 88 and 86%, respectively. The corresponding figures were 100 and 31% for IP-10, and 93 and 35% for PCT. Binary logistic regression model had 100% PPV and NPV, while manual and software-generated CART reached an overall accuracy of 95 and 98%, respectively, both with 100% NPV. PLA2-II and IP-10 associated with clinical variables in regression or decision tree heterogeneous models may be valuable biomarkers for sepsis diagnosis in SIRS patients admitted to medical ward (MW). Further studies are needed to introduce them into clinical practice.
Over- and undersupply in home care: a representative multicenter correlational study.
Lahmann, Nils A; Suhr, Ralf; Kuntz, Simone; Kottner, Jan
2015-04-01
Quality assurance and funding of care become a major challenge against the background of demographic changes in western societies. The primary aim of the study was to identify possible misclassification, respectively over and undersupply of care by comparing the Barthel Index of clients of home care service with the level of care (Stage 0, I, II, III) according to the statutory German long-term care insurance. In 2012, a multi-center point prevalence study of 878 randomly selected clients of 100 randomly selected home care services across Germany was conducted. According to a standardized study protocol, demographics, the Barthel Index and the nurses' professional judgment-whether a client requires more nursing care-were assessed. Associations of the Barthel items and professional judgment were analyzed using univariate (Chi-square) and multivariate (logistic regression and classification-regression-tree-models) statistics. In each level of care, the Barthel Index showed large variability e.g. in level II ranging from 0 to 100 points. Multivariate logistic regression regarding possible under- and oversupply revealed occasionally fecal incontinence (2.1; 95 % CI 1.2-3.7), urinary incontinence (2.0; 95 % CI 1.1-3.6), feeding (1.7; 95 % CI 1.0-2.9), immobility (0.2; 95 % CI 0.1-0.6) and to be female (1.8; 95 % CI 1.2-2.6) to be statistically significantly associated. The variability in Barthel Index in each level of care found in this study indicated a large general misclassification of home care clients according to their actual need of care. Professional caregivers identified occasional incontinence, help with eating and drinking and mobility (especially in female clients) as areas of possible under- and oversupply of care. The statutory German long-term care insurance classification should be modified according to the above finding to increase the quality of care in home care clients.
Bushnik, Tracey; Levallois, Patrick; D'Amour, Monique; Anderson, Todd J; McAlister, Finlay A
2014-07-01
Hypertension is the leading risk factor for cardiovascular disease, but its cause is not always known. Interest is increasing in the potential role of environmental chemicals, including lead. Data are from the first two cycles of the Canadian Health Measures Survey. Lead in whole blood (PbB), and systolic (SBP) and diastolic (DBP) blood pressure were measured and hypertension status was derived for 4,550 respondents aged 40 to 79. Linear regression estimated associations between PbB and SBP and DBP. Logistic regression estimated associations between PbB and hypertension. Adjusted least squares geometric means of PbB were estimated for hypertensive versus non-hypertensive individuals. Compared with non-hypertensive individuals, those with hypertension had higher average PbB levels, were older, more likely to be male, and more likely to have other hypertension risk factors (diabetes, family history of high blood pressure). In adjusted regression models, a modest association emerged between PbB levels and SBP among 40- to 54-year-olds, and between PbB levels and DBP for the overall population. No association emerged between PbB levels and hypertension prevalence. A modest association was observed between blood lead levels and blood pressure, but not with hypertension, in Canadian adults aged 40 to 79.
ERIC Educational Resources Information Center
Kasapoglu, Koray
2014-01-01
This study aims to investigate which factors are associated with Turkey's 15-year-olds' scoring above the OECD average (493) on the PISA'09 reading assessment. Collected from a total of 4,996 15-year-old students from Turkey, data were analyzed by logistic regression analysis in order to model the data of students who were split into two: (1)…
Upgrade Summer Severe Weather Tool
NASA Technical Reports Server (NTRS)
Watson, Leela
2011-01-01
The goal of this task was to upgrade to the existing severe weather database by adding observations from the 2010 warm season, update the verification dataset with results from the 2010 warm season, use statistical logistic regression analysis on the database and develop a new forecast tool. The AMU analyzed 7 stability parameters that showed the possibility of providing guidance in forecasting severe weather, calculated verification statistics for the Total Threat Score (TTS), and calculated warm season verification statistics for the 2010 season. The AMU also performed statistical logistic regression analysis on the 22-year severe weather database. The results indicated that the logistic regression equation did not show an increase in skill over the previously developed TTS. The equation showed less accuracy than TTS at predicting severe weather, little ability to distinguish between severe and non-severe weather days, and worse standard categorical accuracy measures and skill scores over TTS.
Estimating the Probability of Rare Events Occurring Using a Local Model Averaging.
Chen, Jin-Hua; Chen, Chun-Shu; Huang, Meng-Fan; Lin, Hung-Chih
2016-10-01
In statistical applications, logistic regression is a popular method for analyzing binary data accompanied by explanatory variables. But when one of the two outcomes is rare, the estimation of model parameters has been shown to be severely biased and hence estimating the probability of rare events occurring based on a logistic regression model would be inaccurate. In this article, we focus on estimating the probability of rare events occurring based on logistic regression models. Instead of selecting a best model, we propose a local model averaging procedure based on a data perturbation technique applied to different information criteria to obtain different probability estimates of rare events occurring. Then an approximately unbiased estimator of Kullback-Leibler loss is used to choose the best one among them. We design complete simulations to show the effectiveness of our approach. For illustration, a necrotizing enterocolitis (NEC) data set is analyzed. © 2016 Society for Risk Analysis.
Evaluating the perennial stream using logistic regression in central Taiwan
NASA Astrophysics Data System (ADS)
Ruljigaljig, T.; Cheng, Y. S.; Lin, H. I.; Lee, C. H.; Yu, T. T.
2014-12-01
This study produces a perennial stream head potential map, based on a logistic regression method with a Geographic Information System (GIS). Perennial stream initiation locations, indicates the location of the groundwater and surface contact, were identified in the study area from field survey. The perennial stream potential map in central Taiwan was constructed using the relationship between perennial stream and their causative factors, such as Catchment area, slope gradient, aspect, elevation, groundwater recharge and precipitation. Here, the field surveys of 272 streams were determined in the study area. The areas under the curve for logistic regression methods were calculated as 0.87. The results illustrate the importance of catchment area and groundwater recharge as key factors within the model. The results obtained from the model within the GIS were then used to produce a map of perennial stream and estimate the location of perennial stream head.
Menditto, Anthony A; Linhorst, Donald M; Coleman, James C; Beck, Niels C
2006-04-01
Development of policies and procedures to contend with the risks presented by elopement, aggression, and suicidal behaviors are long-standing challenges for mental health administrators. Guidance in making such judgments can be obtained through the use of a multivariate statistical technique known as logistic regression. This procedure can be used to develop a predictive equation that is mathematically formulated to use the best combination of predictors, rather than considering just one factor at a time. This paper presents an overview of logistic regression and its utility in mental health administrative decision making. A case example of its application is presented using data on elopements from Missouri's long-term state psychiatric hospitals. Ultimately, the use of statistical prediction analyses tempered with differential qualitative weighting of classification errors can augment decision-making processes in a manner that provides guidance and flexibility while wrestling with the complex problem of risk assessment and decision making.
Lei, Yang; Nollen, Nikki; Ahluwahlia, Jasjit S; Yu, Qing; Mayo, Matthew S
2015-04-09
Other forms of tobacco use are increasing in prevalence, yet most tobacco control efforts are aimed at cigarettes. In light of this, it is important to identify individuals who are using both cigarettes and alternative tobacco products (ATPs). Most previous studies have used regression models. We conducted a traditional logistic regression model and a classification and regression tree (CART) model to illustrate and discuss the added advantages of using CART in the setting of identifying high-risk subgroups of ATP users among cigarettes smokers. The data were collected from an online cross-sectional survey administered by Survey Sampling International between July 5, 2012 and August 15, 2012. Eligible participants self-identified as current smokers, African American, White, or Latino (of any race), were English-speaking, and were at least 25 years old. The study sample included 2,376 participants and was divided into independent training and validation samples for a hold out validation. Logistic regression and CART models were used to examine the important predictors of cigarettes + ATP users. The logistic regression model identified nine important factors: gender, age, race, nicotine dependence, buying cigarettes or borrowing, whether the price of cigarettes influences the brand purchased, whether the participants set limits on cigarettes per day, alcohol use scores, and discrimination frequencies. The C-index of the logistic regression model was 0.74, indicating good discriminatory capability. The model performed well in the validation cohort also with good discrimination (c-index = 0.73) and excellent calibration (R-square = 0.96 in the calibration regression). The parsimonious CART model identified gender, age, alcohol use score, race, and discrimination frequencies to be the most important factors. It also revealed interesting partial interactions. The c-index is 0.70 for the training sample and 0.69 for the validation sample. The misclassification rate was 0.342 for the training sample and 0.346 for the validation sample. The CART model was easier to interpret and discovered target populations that possess clinical significance. This study suggests that the non-parametric CART model is parsimonious, potentially easier to interpret, and provides additional information in identifying the subgroups at high risk of ATP use among cigarette smokers.
Uhler, Kristin M; Baca, Rosalinda; Dudas, Emily; Fredrickson, Tammy
2015-01-01
Speech perception measures have long been considered an integral piece of the audiological assessment battery. Currently, a prelinguistic, standardized measure of speech perception is missing in the clinical assessment battery for infants and young toddlers. Such a measure would allow systematic assessment of speech perception abilities of infants as well as the potential to investigate the impact early identification of hearing loss and early fitting of amplification have on the auditory pathways. To investigate the impact of sensation level (SL) on the ability of infants with normal hearing (NH) to discriminate /a-i/ and /ba-da/ and to determine if performance on the two contrasts are significantly different in predicting the discrimination criterion. The design was based on a survival analysis model for event occurrence and a repeated measures logistic model for binary outcomes. The outcome for survival analysis was the minimum SL for criterion and the outcome for the logistic regression model was the presence/absence of achieving the criterion. Criterion achievement was designated when an infant's proportion correct score was >0.75 on the discrimination performance task. Twenty-two infants with NH sensitivity participated in this study. There were 9 males and 13 females, aged 6-14 mo. Testing took place over two to three sessions. The first session consisted of a hearing test, threshold assessment of the two speech sounds (/a/ and /i/), and if time and attention allowed, visual reinforcement infant speech discrimination (VRISD). The second session consisted of VRISD assessment for the two test contrasts (/a-i/ and /ba-da/). The presentation level started at 50 dBA. If the infant was unable to successfully achieve criterion (>0.75) at 50 dBA, the presentation level was increased to 70 dBA followed by 60 dBA. Data examination included an event analysis, which provided the probability of criterion distribution across SL. The second stage of the analysis was a repeated measures logistic regression where SL and contrast were used to predict the likelihood of speech discrimination criterion. Infants were able to reach criterion for the /a-i/ contrast at statistically lower SLs when compared to /ba-da/. There were six infants who never reached criterion for /ba-da/ and one never reached criterion for /a-i/. The conditional probability of not reaching criterion by 70 dB SL was 0% for /a-i/ and 21% for /ba-da/. The predictive logistic regression model showed that children were more likely to discriminate the /a-i/ even when controlling for SL. Nearly all normal-hearing infants can demonstrate discrimination criterion of a vowel contrast at 60 dB SL, while a level of ≥70 dB SL may be needed to allow all infants to demonstrate discrimination criterion of a difficult consonant contrast. American Academy of Audiology.
Wu, T-L; Tsai, C-C; Wang, Y-Y; Ho, K-Y; Wu, Y-M; Hung, H-C; Lin, Y-C
2015-12-01
The present study investigated the association between the RAGE G82S polymorphism, the plasma levels of sRAGE and chronic periodontitis in subjects with and without diabetes mellitus (DM). A total of 230 patients with DM and 264 non-DM participants were recruited for this study. Genotyping of the RAGE G82S polymorphism was accomplished using polymerase chain reaction-restriction fragment length polymorphism, and associations were analyzed with the chi-squared test and logistic regression analysis. In the non-DM group, the chi-squared test showed that the frequency distributions of the G82S polymorphism were significantly different between chronic periodontitis and non-chronic periodontitis subjects (χ(2) = 8.39, p = 0.02). A multivariate logistic regression model showed that the (G82S + S82S) genotypes were associated with a significantly increased risk of chronic periodontitis development compared to the G82G genotype (adjusted odds ratio = 2.06, 95% confidence interval: 1.08-4.07). In the DM group, there was no association between the G82S polymorphism and chronic periodontitis development when a multivariate logistic regression was performed. Plasma levels of sRAGE were significantly higher in subjects with the G82G genotype compared to those with the (G82S + S82S) genotypes in both the non-DM (856.6 ± 332.0 vs. 720.4 ± 311.4 pg/mL, p = 0.003) and DM groups (915.3 ± 497.1 vs. 603.5 ± 298.3 pg/mL, p < 0.0001). However, there was no difference in plasma sRAGE levels between chronic periodontitis and non-chronic periodontitis subjects in both the DM and non-DM groups. Moreover, when the subjects were further sub-divided by the G82S polymorphism, the difference in plasma levels of sRAGE between chronic periodontitis and non-chronic periodontitis subjects in the DM and non-DM groups remained statistically insignificant. The present study revealed that the RAGE G82S polymorphism was associated with chronic periodontitis in the non-DM group but not in the DM group. Our results also showed that the plasma levels of sRAGE were significantly higher in subjects with the RAGE G82G genotype, and this correlation was not affected by the presence of chronic periodontitis in the DM and non-DM groups. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Shi, K-Q; Zhou, Y-Y; Yan, H-D; Li, H; Wu, F-L; Xie, Y-Y; Braddock, M; Lin, X-Y; Zheng, M-H
2017-02-01
At present, there is no ideal model for predicting the short-term outcome of patients with acute-on-chronic hepatitis B liver failure (ACHBLF). This study aimed to establish and validate a prognostic model by using the classification and regression tree (CART) analysis. A total of 1047 patients from two separate medical centres with suspected ACHBLF were screened in the study, which were recognized as derivation cohort and validation cohort, respectively. CART analysis was applied to predict the 3-month mortality of patients with ACHBLF. The accuracy of the CART model was tested using the area under the receiver operating characteristic curve, which was compared with the model for end-stage liver disease (MELD) score and a new logistic regression model. CART analysis identified four variables as prognostic factors of ACHBLF: total bilirubin, age, serum sodium and INR, and three distinct risk groups: low risk (4.2%), intermediate risk (30.2%-53.2%) and high risk (81.4%-96.9%). The new logistic regression model was constructed with four independent factors, including age, total bilirubin, serum sodium and prothrombin activity by multivariate logistic regression analysis. The performances of the CART model (0.896), similar to the logistic regression model (0.914, P=.382), exceeded that of MELD score (0.667, P<.001). The results were confirmed in the validation cohort. We have developed and validated a novel CART model superior to MELD for predicting three-month mortality of patients with ACHBLF. Thus, the CART model could facilitate medical decision-making and provide clinicians with a validated practical bedside tool for ACHBLF risk stratification. © 2016 John Wiley & Sons Ltd.
NASA Astrophysics Data System (ADS)
Schaeben, Helmut; Semmler, Georg
2016-09-01
The objective of prospectivity modeling is prediction of the conditional probability of the presence T = 1 or absence T = 0 of a target T given favorable or prohibitive predictors B, or construction of a two classes 0,1 classification of T. A special case of logistic regression called weights-of-evidence (WofE) is geologists' favorite method of prospectivity modeling due to its apparent simplicity. However, the numerical simplicity is deceiving as it is implied by the severe mathematical modeling assumption of joint conditional independence of all predictors given the target. General weights of evidence are explicitly introduced which are as simple to estimate as conventional weights, i.e., by counting, but do not require conditional independence. Complementary to the regression view is the classification view on prospectivity modeling. Boosting is the construction of a strong classifier from a set of weak classifiers. From the regression point of view it is closely related to logistic regression. Boost weights-of-evidence (BoostWofE) was introduced into prospectivity modeling to counterbalance violations of the assumption of conditional independence even though relaxation of modeling assumptions with respect to weak classifiers was not the (initial) purpose of boosting. In the original publication of BoostWofE a fabricated dataset was used to "validate" this approach. Using the same fabricated dataset it is shown that BoostWofE cannot generally compensate lacking conditional independence whatever the consecutively processing order of predictors. Thus the alleged features of BoostWofE are disproved by way of counterexamples, while theoretical findings are confirmed that logistic regression including interaction terms can exactly compensate violations of joint conditional independence if the predictors are indicators.
Weight management behaviors in a sample of Iranian adolescent girls.
Garousi, S; Garrusi, B; Baneshi, Mohammad Reza; Sharifi, Z
2016-09-01
Attempts to obtain the ideal body shape portrayed in advertising can result in behaviors that lead to an unhealthy reduction in weight. This study was designed to identify contributing factors that may be effective in changing the behavior of a sample of Iranian adolescents. Three hundred fifty adolescent girls from high schools in Kerman, Iran participated in a cross-sectional study based on a self-administered questionnaire. Multifactorial logistic regression modeling was used to identify the factors influencing each of the contributing factors for body management methods, and a decision tree model was constructed to identify individuals who were more or less likely to change their body shape. Approximately one-third of the adolescent girls had attempted dieting, and 37 % of them had exercised to lose weight. The logistic regression model showed that pressure from their mother and the media; father's education level; and body mass index (BMI) were important factors in dieting. BMI and perceived pressure from the media were risk factors for attempting exercise. BMI and perceived pressure from relatives, particularly mothers, and the media were important factors in attempts by adolescent girls to lose weight.
Analyzing thresholds and efficiency with hierarchical Bayesian logistic regression.
Houpt, Joseph W; Bittner, Jennifer L
2018-07-01
Ideal observer analysis is a fundamental tool used widely in vision science for analyzing the efficiency with which a cognitive or perceptual system uses available information. The performance of an ideal observer provides a formal measure of the amount of information in a given experiment. The ratio of human to ideal performance is then used to compute efficiency, a construct that can be directly compared across experimental conditions while controlling for the differences due to the stimuli and/or task specific demands. In previous research using ideal observer analysis, the effects of varying experimental conditions on efficiency have been tested using ANOVAs and pairwise comparisons. In this work, we present a model that combines Bayesian estimates of psychometric functions with hierarchical logistic regression for inference about both unadjusted human performance metrics and efficiencies. Our approach improves upon the existing methods by constraining the statistical analysis using a standard model connecting stimulus intensity to human observer accuracy and by accounting for variability in the estimates of human and ideal observer performance scores. This allows for both individual and group level inferences. Copyright © 2018 Elsevier Ltd. All rights reserved.
Dai, Xiaoping; Han, Yuping; Zhang, Xiaohong; Hu, Wei; Huang, Liangji; Duan, Wenpei; Li, Siyi; Liu, Xiaolu; Wang, Qian
2017-09-01
A better understanding of willingness to separate waste and waste separation behaviour can aid the design and improvement of waste management policies. Based on the intercept questionnaire survey data of undergraduate students and residents in Zhengzhou City of China, this article compared factors affecting the willingness and behaviour of students and residents to participate in waste separation using two binary logistic regression models. Improvement opportunities for waste separation were also discussed. Binary logistic regression results indicate that knowledge of and attitude to waste separation and acceptance of waste education significantly affect the willingness of undergraduate students to separate waste, and demographic factors, such as gender, age, education level, and income, significantly affect the willingness of residents to do so. Presence of waste-specific bins and attitude to waste separation are drivers of waste separation behaviour for both students and residents. Improved education about waste separation and facilities are effective to stimulate waste separation, and charging on unsorted waste may be an effective way to improve it in Zhengzhou.
Andu, Eaden; Wagenaar, Brad H; Kemp, Chris G; Nevin, Paul E; Simoni, Jane M; Andrasik, Michele; Cohn, Susan E; French, Audrey L; Rao, Deepa
2018-04-26
We sought to examine risk and protective factors for Posttraumatic Stress Disorder (PTSD) among African American women living with HIV. This is a cross-sectional analysis of baseline data from a randomized trial of an HIV stigma reduction intervention. We examined data from two-hundred and thirty-nine African American women living with HIV. We examined whether age, marital status, level of education, internalized HIV-related stigma, and social support as potential protective and risk factors for PTSD symptoms using logistic regression. We analyzed bi-variate associations between each variable and PTSD symptoms, and constructed a multivariate logistic regression model adjusting for all variables. We found 67% reported clinically significant PTSD symptoms at baseline. Our results suggest that age, education, and internalized stigma were found to be associated with PTSD symptoms (p < 0.001), with older age and more education as protective factors and stigma as a risk factor for PTSD. Therefore, understanding this relationship may help improve assessment and treatment through evidence- based and trauma-informed strategies.
Association between domestic violence and women's quality of life 1
de Lucena, Kerle Dayana Tavares; Vianna, Rodrigo Pinheiro de Toledo; do Nascimento, João Agnaldo; Campos, Hemílio Fernandes Coelho; Oliveira, Elaine Cristina Tôrres
2017-01-01
ABSTRACT Objective: to analyze the association between domestic violence against women and quality of life. Method: a cross-sectional population-based household survey conducted with women 18 years and older, using a stratified sample by neighborhoods. For analysis, prevalence of domestic violence and quality of life index was verified and logistic regression was used to determine associations, with a significance level of 5%. Results: 424 women who had a prevalence of domestic violence of 54.4% and a quality of life index of 61.59 participated in this study. It was verified, through logistic regression, that domestic violence is associated with women's quality of life (p=0,017). The observed variables that influence the occurrence of domestic violence were in the social relations domain (p=0,000), provision of medical treatment for women (p=0,019) and safety (p=0,006). Conclusion: the study confirmed the evidence of an association between domestic violence against women and quality of life, a situation that reaffirms the importance of constructing public policies focused on gender emancipation. PMID:28591305
Williams, David R.
2009-01-01
Objectives. We examined whether perceived chronic discrimination was related to excess body fat accumulation in a random, multiethnic, population-based sample of US adults. Methods. We used multivariate multinomial logistic regression and logistic regression analyses to examine the relationship between interpersonal experiences of perceived chronic discrimination and body mass index and high-risk waist circumference. Results. Consistent with other studies, our analyses showed that perceived unfair treatment was associated with increased abdominal obesity. Compared with Irish, Jewish, Polish, and Italian Whites who did not experience perceived chronic discrimination, Irish, Jewish, Polish, and Italian Whites who perceived chronic discrimination were 2 to 6 times more likely to have a high-risk waist circumference. No significant relationship between perceived discrimination and the obesity measures was found among the other Whites, Blacks, or Hispanics. Conclusions. These findings are not completely unsupported. White ethnic groups including Polish, Italians, Jews, and Irish have historically been discriminated against in the United States, and other recent research suggests that they experience higher levels of perceived discrimination than do other Whites and that these experiences adversely affect their health. PMID:18923119
Hechter, Rulin C.; Budoff, Matthew; Hodis, Howard N.; Rinaldo, Charles R.; Jenkins, Frank J.; Jacobson, Lisa P.; Kingsley, Lawrence A.; Taiwo, Babafemi; Post, Wendy S.; Margolick, Joseph B.; Detels, Roger
2012-01-01
We assessed associations of herpes simplex virus types 1 and 2 (HSV-1 and -2), cytomegalovirus (CMV), and human herpesvirus 8 (HHV-8) infection with subclinical coronary atherosclerosis in 291 HIV-infected men in the Multicenter AIDS Cohort Study. Coronary artery calcium (CAC) was measured by non-contrast coronary CT imaging. Markers for herpesviruses infection were measured in frozen specimens collected 10-12 years prior to case identification. Multivariable logistic regression models and ordinal logistic regression models were performed. HSV-2 seropositivity was associated with coronary atherosclerosis (adjusted odds ratio [AOR] =4.12, 95% confidence interval [CI] =1.58-10.85) after adjustment for age, race/ethnicity, cardiovascular risk factors, and HIV infection related factors. Infection with a greater number of herpesviruses was associated with elevated CAC levels (AOR=1.58, 95% CI=1.06-2.36). Our findings suggest HSV-2 may be a risk factor for subclinical coronary atherosclerosis in HIV-infected men. Infection with multiple herpesviruses may contribute to the increased burden of atherosclerosis. PMID:22472456
Sampaolo, Letizia; Tommaso, Giulia; Gherardi, Bianca; Carrozzi, Giuliano; Freni Sterrantino, Anna; Ottone, Marta; Goldoni, Carlo Alberto; Bertozzi, Nicoletta; Scaringi, Meri; Bolognesi, Lara; Masocco, Maria; Salmaso, Stefania; Lauriola, Paolo
2017-01-01
"OBJECTIVES: to identify groups of people in relation to the perception of environmental risk and to assess the main characteristics using data collected in the environmental module of the surveillance network Italian Behavioral Risk Factor Surveillance System (PASSI). perceptive profiles were identified using a latent class analysis; later they were included as outcome in multinomial logistic regression models to assess the association between environmental risk perception and demographic, health, socio-economic and behavioural variables. the latent class analysis allowed to split the sample in "worried", "indifferent", and "positive" people. The multinomial logistic regression model showed that the "worried" profile typically includes people of Italian nationality, living in highly urbanized areas, with a high level of education, and with economic difficulties; they pay special attention to their own health and fitness, but they have a negative perception of their own psychophysical state. the application of advanced statistical analysis enable to appraise PASSI data in order to characterize the perception of environmental risk, making the planning of interventions related to risk communication possible. ".
Hechter, Rulin C; Budoff, Matthew; Hodis, Howard N; Rinaldo, Charles R; Jenkins, Frank J; Jacobson, Lisa P; Kingsley, Lawrence A; Taiwo, Babafemi; Post, Wendy S; Margolick, Joseph B; Detels, Roger
2012-08-01
We assessed associations of herpes simplex virus types 1 and 2 (HSV-1 and -2), cytomegalovirus (CMV), and human herpesvirus 8 (HHV-8) infection with subclinical coronary atherosclerosis in 291 HIV-infected men in the Multicenter AIDS Cohort Study. Coronary artery calcium (CAC) was measured by non-contrast coronary CT imaging. Markers for herpesviruses infection were measured in frozen specimens collected 10-12 years prior to case identification. Multivariable logistic regression models and ordinal logistic regression models were performed. HSV-2 seropositivity was associated with coronary atherosclerosis (adjusted odds ratio [AOR]=4.12, 95% confidence interval [CI]=1.58-10.85) after adjustment for age, race/ethnicity, cardiovascular risk factors, and HIV infection related factors. Infection with a greater number of herpesviruses was associated with elevated CAC levels (AOR=1.58, 95% CI=1.06-2.36). Our findings suggest HSV-2 may be a risk factor for subclinical coronary atherosclerosis in HIV-infected men. Infection with multiple herpesviruses may contribute to the increased burden of atherosclerosis. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
Correlates of HIV knowledge and Sexual risk behaviors among Female Military Personnel
Essien, E. James; Monjok, Emmanuel; Chen, Hua; Abughosh, Susan; Ekong, Ernest; Peters, Ronald J.; Holmes, Laurens; Holstad, Marcia M.; Mgbere, Osaro
2010-01-01
Objective Uniformed services personnel are at an increased risk of HIV infection. We examined the HIV/AIDS knowledge and sexual risk behaviors among female military personnel to determine the correlates of HIV risk behaviors in this population. Method The study used a cross-sectional design to examine HIV/AIDS knowledge and sexual risk behaviors in a sample of 346 females drawn from two military cantonments in Southwestern Nigeria. Data was collected between 2006 and 2008. Using bivariate analysis and multivariate logistic regression, HIV/AIDS knowledge and sexual behaviors were described in relation to socio-demographic characteristics of the participants. Results Multivariate logistic regression analysis revealed that level of education and knowing someone with HIV/AIDS were significant (p<0.05) predictors of HIV knowledge in this sample. HIV prevention self-efficacy was significantly (P<0.05) predicted by annual income and race/ethnicity. Condom use attitudes were also significantly (P<0.05) associated with number of children, annual income, and number of sexual partners. Conclusion Data indicates the importance of incorporating these predictor variables into intervention designs. PMID:20387111
NASA Astrophysics Data System (ADS)
Widyaningsih, Purnami; Retno Sari Saputro, Dewi; Nugrahani Putri, Aulia
2017-06-01
GWOLR model combines geographically weighted regression (GWR) and (ordinal logistic reression) OLR models. Its parameter estimation employs maximum likelihood estimation. Such parameter estimation, however, yields difficult-to-solve system of nonlinear equations, and therefore numerical approximation approach is required. The iterative approximation approach, in general, uses Newton-Raphson (NR) method. The NR method has a disadvantage—its Hessian matrix is always the second derivatives of each iteration so it does not always produce converging results. With regard to this matter, NR model is modified by substituting its Hessian matrix into Fisher information matrix, which is termed Fisher scoring (FS). The present research seeks to determine GWOLR model parameter estimation using Fisher scoring method and apply the estimation on data of the level of vulnerability to Dengue Hemorrhagic Fever (DHF) in Semarang. The research concludes that health facilities give the greatest contribution to the probability of the number of DHF sufferers in both villages. Based on the number of the sufferers, IR category of DHF in both villages can be determined.
Savary, Serge; Delbac, Lionel; Rochas, Amélie; Taisant, Guillaume; Willocquet, Laetitia
2009-08-01
Dual epidemics are defined as epidemics developing on two or several plant organs in the course of a cropping season. Agricultural pathosystems where such epidemics develop are often very important, because the harvestable part is one of the organs affected. These epidemics also are often difficult to manage, because the linkage between epidemiological components occurring on different organs is poorly understood, and because prediction of the risk toward the harvestable organs is difficult. In the case of downy mildew (DM) and powdery mildew (PM) of grapevine, nonlinear modeling and logistic regression indicated nonlinearity in the foliage-cluster relationships. Nonlinear modeling enabled the parameterization of a transmission coefficient that numerically links the two components, leaves and clusters, in DM and PM epidemics. Logistic regression analysis yielded a series of probabilistic models that enabled predicting preset levels of cluster infection risks based on DM and PM severities on the foliage at successive crop stages. The usefulness of this framework for tactical decision-making for disease control is discussed.
Einav, Sharon; Alon, Gady; Kaufman, Nechama; Braunstein, Rony; Carmel, Sara; Varon, Joseph; Hersch, Moshe
2012-09-01
To determine whether variables in physicians' backgrounds influenced their decision to forego resuscitating a patient they did not previously know. Questionnaire survey of a convenience sample of 204 physicians working in the departments of internal medicine, anaesthesiology and cardiology in 11 hospitals in Israel. Twenty per cent of the participants had elected to forego resuscitating a patient they did not previously know without additional consultation. Physicians who had more frequently elected to forego resuscitation had practised medicine for more than 5 years (p=0.013), estimated the number of resuscitations they had performed as being higher (p=0.009), and perceived their experience in resuscitation as sufficient (p=0.001). The variable that predicted the outcome of always performing resuscitation in the logistic regression model was less than 5 years of experience in medicine (OR 0.227, 95% CI 0.065 to 0.793; p=0.02). Physicians' level of experience may affect the probability of a patient's receiving resuscitation, whereas the physicians' personal beliefs and values did not seem to affect this outcome.
Hyperhomocysteinemia is a risk factor for Alzheimer's disease in an Algerian population.
Nazef, Khaled; Khelil, Malika; Chelouti, Hiba; Kacimi, Ghouti; Bendini, Mohamed; Tazir, Meriem; Belarbi, Soraya; El Hadi Cherifi, Mohamed; Djerdjouri, Bahia
2014-04-01
There is growing evidence that increased blood concentration of total homocysteine (tHcy) may be a risk factor for Alzheimer's disease (AD). The present study was conducted to evaluate the association of serum tHcy and other biochemical risk factors with AD. This is a case-control study including 41 individuals diagnosed with AD and 46 nondemented controls. Serum levels of all studied biochemical parameters were performed. Univariate logistic regression showed a significant increase of tHcy (p = 0.008), urea (p = 0.036) and a significant decrease of vitamin B12 (p = 0.012) in AD group vs. controls. Using multivariate logistic regression, tHcy (p = 0.007, OR = 1.376) appeared as an independent risk factor predictor of AD. There was a significant positive correlation between tHcy and creatinine (p <0.0001). A negative correlation was found between tHcy and vitamin B12 (p <0.0001). Our findings support that hyperhomocysteinemia is a risk factor for AD in an Algerian population and is also associated with vitamin B12 deficiency. Copyright © 2014 IMSS. Published by Elsevier Inc. All rights reserved.