Sample records for two-level logistic regression

  1. Two-factor logistic regression in pediatric liver transplantation

    NASA Astrophysics Data System (ADS)

    Uzunova, Yordanka; Prodanova, Krasimira; Spasov, Lyubomir

    2017-12-01

    Using a two-factor logistic regression analysis an estimate is derived for the probability of absence of infections in the early postoperative period after pediatric liver transplantation. The influence of both the bilirubin level and the international normalized ratio of prothrombin time of blood coagulation at the 5th postoperative day is studied.

  2. An empirical study of statistical properties of variance partition coefficients for multi-level logistic regression models

    USGS Publications Warehouse

    Li, Ji; Gray, B.R.; Bates, D.M.

    2008-01-01

    Partitioning the variance of a response by design levels is challenging for binomial and other discrete outcomes. Goldstein (2003) proposed four definitions for variance partitioning coefficients (VPC) under a two-level logistic regression model. In this study, we explicitly derived formulae for multi-level logistic regression model and subsequently studied the distributional properties of the calculated VPCs. Using simulations and a vegetation dataset, we demonstrated associations between different VPC definitions, the importance of methods for estimating VPCs (by comparing VPC obtained using Laplace and penalized quasilikehood methods), and bivariate dependence between VPCs calculated at different levels. Such an empirical study lends an immediate support to wider applications of VPC in scientific data analysis.

  3. The use of generalized estimating equations in the analysis of motor vehicle crash data.

    PubMed

    Hutchings, Caroline B; Knight, Stacey; Reading, James C

    2003-01-01

    The purpose of this study was to determine if it is necessary to use generalized estimating equations (GEEs) in the analysis of seat belt effectiveness in preventing injuries in motor vehicle crashes. The 1992 Utah crash dataset was used, excluding crash participants where seat belt use was not appropriate (n=93,633). The model used in the 1996 Report to Congress [Report to congress on benefits of safety belts and motorcycle helmets, based on data from the Crash Outcome Data Evaluation System (CODES). National Center for Statistics and Analysis, NHTSA, Washington, DC, February 1996] was analyzed for all occupants with logistic regression, one level of nesting (occupants within crashes), and two levels of nesting (occupants within vehicles within crashes) to compare the use of GEEs with logistic regression. When using one level of nesting compared to logistic regression, 13 of 16 variance estimates changed more than 10%, and eight of 16 parameter estimates changed more than 10%. In addition, three of the independent variables changed from significant to insignificant (alpha=0.05). With the use of two levels of nesting, two of 16 variance estimates and three of 16 parameter estimates changed more than 10% from the variance and parameter estimates in one level of nesting. One of the independent variables changed from insignificant to significant (alpha=0.05) in the two levels of nesting model; therefore, only two of the independent variables changed from significant to insignificant when the logistic regression model was compared to the two levels of nesting model. The odds ratio of seat belt effectiveness in preventing injuries was 12% lower when a one-level nested model was used. Based on these results, we stress the need to use a nested model and GEEs when analyzing motor vehicle crash data.

  4. Comparison of multinomial logistic regression and logistic regression: which is more efficient in allocating land use?

    NASA Astrophysics Data System (ADS)

    Lin, Yingzhi; Deng, Xiangzheng; Li, Xing; Ma, Enjun

    2014-12-01

    Spatially explicit simulation of land use change is the basis for estimating the effects of land use and cover change on energy fluxes, ecology and the environment. At the pixel level, logistic regression is one of the most common approaches used in spatially explicit land use allocation models to determine the relationship between land use and its causal factors in driving land use change, and thereby to evaluate land use suitability. However, these models have a drawback in that they do not determine/allocate land use based on the direct relationship between land use change and its driving factors. Consequently, a multinomial logistic regression method was introduced to address this flaw, and thereby, judge the suitability of a type of land use in any given pixel in a case study area of the Jiangxi Province, China. A comparison of the two regression methods indicated that the proportion of correctly allocated pixels using multinomial logistic regression was 92.98%, which was 8.47% higher than that obtained using logistic regression. Paired t-test results also showed that pixels were more clearly distinguished by multinomial logistic regression than by logistic regression. In conclusion, multinomial logistic regression is a more efficient and accurate method for the spatial allocation of land use changes. The application of this method in future land use change studies may improve the accuracy of predicting the effects of land use and cover change on energy fluxes, ecology, and environment.

  5. Individual and community risk factors and sexually transmitted diseases among arrested youths: a two level analysis.

    PubMed

    Dembo, Richard; Belenko, Steven; Childs, Kristina; Wareham, Jennifer; Schmeidler, James

    2009-08-01

    High rates of infection for chlamydia and gonorrhea have been noted among youths involved in the juvenile justice system. Although both individual and community-level factors have been found to be associated with sexually transmitted disease (STD) risk, their relative importance has not been tested in this population. A two-level logistic regression analysis was completed to assess the influence of individual-level and community-level predictors on STD test results among arrested youths processed at a centralized intake facility. Results from weighted two level logistic regression analyses (n = 1,368) indicated individual-level factors of gender (being female), age, race (being African American), and criminal history predicted the youths' positive STD status. For the community-level predictors, concentrated disadvantage significantly and positively predicted the youths' STD status. Implications of these findings for future research and public health policy are discussed.

  6. Predictors of course in obsessive-compulsive disorder: logistic regression versus Cox regression for recurrent events.

    PubMed

    Kempe, P T; van Oppen, P; de Haan, E; Twisk, J W R; Sluis, A; Smit, J H; van Dyck, R; van Balkom, A J L M

    2007-09-01

    Two methods for predicting remissions in obsessive-compulsive disorder (OCD) treatment are evaluated. Y-BOCS measurements of 88 patients with a primary OCD (DSM-III-R) diagnosis were performed over a 16-week treatment period, and during three follow-ups. Remission at any measurement was defined as a Y-BOCS score lower than thirteen combined with a reduction of seven points when compared with baseline. Logistic regression models were compared with a Cox regression for recurrent events model. Logistic regression yielded different models at different evaluation times. The recurrent events model remained stable when fewer measurements were used. Higher baseline levels of neuroticism and more severe OCD symptoms were associated with a lower chance of remission, early age of onset and more depressive symptoms with a higher chance. Choice of outcome time affects logistic regression prediction models. Recurrent events analysis uses all information on remissions and relapses. Short- and long-term predictors for OCD remission show overlap.

  7. Cancer prevalence and education by cancer site: logistic regression analysis.

    PubMed

    Johnson, Stephanie; Corsten, Martin J; McDonald, James T; Gupta, Michael

    2010-10-01

    Previously, using the American National Health Interview Survey (NHIS) and a logistic regression analysis, we found that upper aerodigestive tract (UADT) cancer is correlated with low socioeconomic status (SES). The objective of this study was to determine if this correlation between low SES and cancer prevalence exists for other cancers. We again used the NHIS and employed education level as our main measure of SES. We controlled for potentially confounding factors, including smoking status and alcohol consumption. We found that only two cancer subsites shared the pattern of increased prevalence with low education level and decreased prevalence with high education level: UADT cancer and cervical cancer. UADT cancer and cervical cancer were the only two cancers identified that had a link between prevalence and lower education level. This raises the possibility that an associated risk factor for the two cancers is causing the relationship between lower education level and prevalence.

  8. Comparing the efficiency of digital and conventional soil mapping to predict soil types in a semi-arid region in Iran

    NASA Astrophysics Data System (ADS)

    Zeraatpisheh, Mojtaba; Ayoubi, Shamsollah; Jafari, Azam; Finke, Peter

    2017-05-01

    The efficiency of different digital and conventional soil mapping approaches to produce categorical maps of soil types is determined by cost, sample size, accuracy and the selected taxonomic level. The efficiency of digital and conventional soil mapping approaches was examined in the semi-arid region of Borujen, central Iran. This research aimed to (i) compare two digital soil mapping approaches including Multinomial logistic regression and random forest, with the conventional soil mapping approach at four soil taxonomic levels (order, suborder, great group and subgroup levels), (ii) validate the predicted soil maps by the same validation data set to determine the best method for producing the soil maps, and (iii) select the best soil taxonomic level by different approaches at three sample sizes (100, 80, and 60 point observations), in two scenarios with and without a geomorphology map as a spatial covariate. In most predicted maps, using both digital soil mapping approaches, the best results were obtained using the combination of terrain attributes and the geomorphology map, although differences between the scenarios with and without the geomorphology map were not significant. Employing the geomorphology map increased map purity and the Kappa index, and led to a decrease in the 'noisiness' of soil maps. Multinomial logistic regression had better performance at higher taxonomic levels (order and suborder levels); however, random forest showed better performance at lower taxonomic levels (great group and subgroup levels). Multinomial logistic regression was less sensitive than random forest to a decrease in the number of training observations. The conventional soil mapping method produced a map with larger minimum polygon size because of traditional cartographic criteria used to make the geological map 1:100,000 (on which the conventional soil mapping map was largely based). Likewise, conventional soil mapping map had also a larger average polygon size that resulted in a lower level of detail. Multinomial logistic regression at the order level (map purity of 0.80), random forest at the suborder (map purity of 0.72) and great group level (map purity of 0.60), and conventional soil mapping at the subgroup level (map purity of 0.48) produced the most accurate maps in the study area. The multinomial logistic regression method was identified as the most effective approach based on a combined index of map purity, map information content, and map production cost. The combined index also showed that smaller sample size led to a preference for the order level, while a larger sample size led to a preference for the great group level.

  9. Robust mislabel logistic regression without modeling mislabel probabilities.

    PubMed

    Hung, Hung; Jou, Zhi-Yu; Huang, Su-Yun

    2018-03-01

    Logistic regression is among the most widely used statistical methods for linear discriminant analysis. In many applications, we only observe possibly mislabeled responses. Fitting a conventional logistic regression can then lead to biased estimation. One common resolution is to fit a mislabel logistic regression model, which takes into consideration of mislabeled responses. Another common method is to adopt a robust M-estimation by down-weighting suspected instances. In this work, we propose a new robust mislabel logistic regression based on γ-divergence. Our proposal possesses two advantageous features: (1) It does not need to model the mislabel probabilities. (2) The minimum γ-divergence estimation leads to a weighted estimating equation without the need to include any bias correction term, that is, it is automatically bias-corrected. These features make the proposed γ-logistic regression more robust in model fitting and more intuitive for model interpretation through a simple weighting scheme. Our method is also easy to implement, and two types of algorithms are included. Simulation studies and the Pima data application are presented to demonstrate the performance of γ-logistic regression. © 2017, The International Biometric Society.

  10. What Are the Odds of that? A Primer on Understanding Logistic Regression

    ERIC Educational Resources Information Center

    Huang, Francis L.; Moon, Tonya R.

    2013-01-01

    The purpose of this Methodological Brief is to present a brief primer on logistic regression, a commonly used technique when modeling dichotomous outcomes. Using data from the National Education Longitudinal Study of 1988 (NELS:88), logistic regression techniques were used to investigate student-level variables in eighth grade (i.e., enrolled in a…

  11. Individual relocation decisions after tornadoes: a multi-level analysis.

    PubMed

    Cong, Zhen; Nejat, Ali; Liang, Daan; Pei, Yaolin; Javid, Roxana J

    2018-04-01

    This study examines how multi-level factors affected individuals' relocation decisions after EF4 and EF5 (Enhanced Fujita Tornado Intensity Scale) tornadoes struck the United States in 2013. A telephone survey was conducted with 536 respondents, including oversampled older adults, one year after these two disaster events. Respondents' addresses were used to associate individual information with block group-level variables recorded by the American Community Survey. Logistic regression revealed that residential damage and homeownership are important predictors of relocation. There was also significant interaction between these two variables, indicating less difference between homeowners and renters at higher damage levels. Homeownership diminished the likelihood of relocation among younger respondents. Random effects logistic regression found that the percentage of homeownership and of higher income households in the community buffered the effect of damage on relocation; the percentage of older adults reduced the likelihood of this group relocating. The findings are assessed from the standpoint of age difference, policy implications, and social capital and vulnerability. © 2018 The Author(s). Disasters © Overseas Development Institute, 2018.

  12. Higher direct bilirubin levels during mid-pregnancy are associated with lower risk of gestational diabetes mellitus.

    PubMed

    Liu, Chaoqun; Zhong, Chunrong; Zhou, Xuezhen; Chen, Renjuan; Wu, Jiangyue; Wang, Weiye; Li, Xiating; Ding, Huisi; Guo, Yanfang; Gao, Qin; Hu, Xingwen; Xiong, Guoping; Yang, Xuefeng; Hao, Liping; Xiao, Mei; Yang, Nianhong

    2017-01-01

    Bilirubin concentrations have been recently reported to be negatively associated with type 2 diabetes mellitus. We examined the association between bilirubin concentrations and gestational diabetes mellitus. In a prospective cohort study, 2969 pregnant women were recruited prior to 16 weeks of gestation and were followed up until delivery. The value of bilirubin was tested and oral glucose tolerance test was conducted to screen gestational diabetes mellitus. The relationship between serum bilirubin concentration and gestational weeks was studied by two-piecewise linear regression. A subsample of 1135 participants with serum bilirubin test during 16-18 weeks gestation was conducted to research the association between serum bilirubin levels and risk of gestational diabetes mellitus by logistic regression. Gestational diabetes mellitus developed in 8.5 % of the participants (223 of 2969). Two-piecewise linear regression analyses demonstrated that the levels of bilirubin decreased with gestational week up to the turning point 23 and after that point, levels of bilirubin were increased slightly. In multiple logistic regression analysis, the relative risk of developing gestational diabetes mellitus was lower in the highest tertile of direct bilirubin than that in the lowest tertile (RR 0.60; 95 % CI, 0.35-0.89). The results suggested that women with higher serum direct bilirubin levels during the second trimester of pregnancy have lower risk for development of gestational diabetes mellitus.

  13. PARAMETRIC AND NON PARAMETRIC (MARS: MULTIVARIATE ADDITIVE REGRESSION SPLINES) LOGISTIC REGRESSIONS FOR PREDICTION OF A DICHOTOMOUS RESPONSE VARIABLE WITH AN EXAMPLE FOR PRESENCE/ABSENCE OF AMPHIBIANS

    EPA Science Inventory

    The purpose of this report is to provide a reference manual that could be used by investigators for making informed use of logistic regression using two methods (standard logistic regression and MARS). The details for analyses of relationships between a dependent binary response ...

  14. Parameters Estimation of Geographically Weighted Ordinal Logistic Regression (GWOLR) Model

    NASA Astrophysics Data System (ADS)

    Zuhdi, Shaifudin; Retno Sari Saputro, Dewi; Widyaningsih, Purnami

    2017-06-01

    A regression model is the representation of relationship between independent variable and dependent variable. The dependent variable has categories used in the logistic regression model to calculate odds on. The logistic regression model for dependent variable has levels in the logistics regression model is ordinal. GWOLR model is an ordinal logistic regression model influenced the geographical location of the observation site. Parameters estimation in the model needed to determine the value of a population based on sample. The purpose of this research is to parameters estimation of GWOLR model using R software. Parameter estimation uses the data amount of dengue fever patients in Semarang City. Observation units used are 144 villages in Semarang City. The results of research get GWOLR model locally for each village and to know probability of number dengue fever patient categories.

  15. A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design.

    PubMed

    Staley, James R; Jones, Edmund; Kaptoge, Stephen; Butterworth, Adam S; Sweeting, Michael J; Wood, Angela M; Howson, Joanna M M

    2017-06-01

    Logistic regression is often used instead of Cox regression to analyse genome-wide association studies (GWAS) of single-nucleotide polymorphisms (SNPs) and disease outcomes with cohort and case-cohort designs, as it is less computationally expensive. Although Cox and logistic regression models have been compared previously in cohort studies, this work does not completely cover the GWAS setting nor extend to the case-cohort study design. Here, we evaluated Cox and logistic regression applied to cohort and case-cohort genetic association studies using simulated data and genetic data from the EPIC-CVD study. In the cohort setting, there was a modest improvement in power to detect SNP-disease associations using Cox regression compared with logistic regression, which increased as the disease incidence increased. In contrast, logistic regression had more power than (Prentice weighted) Cox regression in the case-cohort setting. Logistic regression yielded inflated effect estimates (assuming the hazard ratio is the underlying measure of association) for both study designs, especially for SNPs with greater effect on disease. Given logistic regression is substantially more computationally efficient than Cox regression in both settings, we propose a two-step approach to GWAS in cohort and case-cohort studies. First to analyse all SNPs with logistic regression to identify associated variants below a pre-defined P-value threshold, and second to fit Cox regression (appropriately weighted in case-cohort studies) to those identified SNPs to ensure accurate estimation of association with disease.

  16. Predictors of Placement Stability at the State Level: The Use of Logistic Regression to Inform Practice

    ERIC Educational Resources Information Center

    Courtney, Jon R.; Prophet, Retta

    2011-01-01

    Placement instability is often associated with a number of negative outcomes for children. To gain state level contextual knowledge of factors associated with placement stability/instability, logistic regression was applied to selected variables from the New Mexico Adoption and Foster Care Administrative Reporting System dataset. Predictors…

  17. A deeper look at two concepts of measuring gene-gene interactions: logistic regression and interaction information revisited.

    PubMed

    Mielniczuk, Jan; Teisseyre, Paweł

    2018-03-01

    Detection of gene-gene interactions is one of the most important challenges in genome-wide case-control studies. Besides traditional logistic regression analysis, recently the entropy-based methods attracted a significant attention. Among entropy-based methods, interaction information is one of the most promising measures having many desirable properties. Although both logistic regression and interaction information have been used in several genome-wide association studies, the relationship between them has not been thoroughly investigated theoretically. The present paper attempts to fill this gap. We show that although certain connections between the two methods exist, in general they refer two different concepts of dependence and looking for interactions in those two senses leads to different approaches to interaction detection. We introduce ordering between interaction measures and specify conditions for independent and dependent genes under which interaction information is more discriminative measure than logistic regression. Moreover, we show that for so-called perfect distributions those measures are equivalent. The numerical experiments illustrate the theoretical findings indicating that interaction information and its modified version are more universal tools for detecting various types of interaction than logistic regression and linkage disequilibrium measures. © 2017 WILEY PERIODICALS, INC.

  18. A simple approach to power and sample size calculations in logistic regression and Cox regression models.

    PubMed

    Vaeth, Michael; Skovlund, Eva

    2004-06-15

    For a given regression problem it is possible to identify a suitably defined equivalent two-sample problem such that the power or sample size obtained for the two-sample problem also applies to the regression problem. For a standard linear regression model the equivalent two-sample problem is easily identified, but for generalized linear models and for Cox regression models the situation is more complicated. An approximately equivalent two-sample problem may, however, also be identified here. In particular, we show that for logistic regression and Cox regression models the equivalent two-sample problem is obtained by selecting two equally sized samples for which the parameters differ by a value equal to the slope times twice the standard deviation of the independent variable and further requiring that the overall expected number of events is unchanged. In a simulation study we examine the validity of this approach to power calculations in logistic regression and Cox regression models. Several different covariate distributions are considered for selected values of the overall response probability and a range of alternatives. For the Cox regression model we consider both constant and non-constant hazard rates. The results show that in general the approach is remarkably accurate even in relatively small samples. Some discrepancies are, however, found in small samples with few events and a highly skewed covariate distribution. Comparison with results based on alternative methods for logistic regression models with a single continuous covariate indicates that the proposed method is at least as good as its competitors. The method is easy to implement and therefore provides a simple way to extend the range of problems that can be covered by the usual formulas for power and sample size determination. Copyright 2004 John Wiley & Sons, Ltd.

  19. Remote sensing and GIS-based landslide hazard analysis and cross-validation using multivariate logistic regression model on three test areas in Malaysia

    NASA Astrophysics Data System (ADS)

    Pradhan, Biswajeet

    2010-05-01

    This paper presents the results of the cross-validation of a multivariate logistic regression model using remote sensing data and GIS for landslide hazard analysis on the Penang, Cameron, and Selangor areas in Malaysia. Landslide locations in the study areas were identified by interpreting aerial photographs and satellite images, supported by field surveys. SPOT 5 and Landsat TM satellite imagery were used to map landcover and vegetation index, respectively. Maps of topography, soil type, lineaments and land cover were constructed from the spatial datasets. Ten factors which influence landslide occurrence, i.e., slope, aspect, curvature, distance from drainage, lithology, distance from lineaments, soil type, landcover, rainfall precipitation, and normalized difference vegetation index (ndvi), were extracted from the spatial database and the logistic regression coefficient of each factor was computed. Then the landslide hazard was analysed using the multivariate logistic regression coefficients derived not only from the data for the respective area but also using the logistic regression coefficients calculated from each of the other two areas (nine hazard maps in all) as a cross-validation of the model. For verification of the model, the results of the analyses were then compared with the field-verified landslide locations. Among the three cases of the application of logistic regression coefficient in the same study area, the case of Selangor based on the Selangor logistic regression coefficients showed the highest accuracy (94%), where as Penang based on the Penang coefficients showed the lowest accuracy (86%). Similarly, among the six cases from the cross application of logistic regression coefficient in other two areas, the case of Selangor based on logistic coefficient of Cameron showed highest (90%) prediction accuracy where as the case of Penang based on the Selangor logistic regression coefficients showed the lowest accuracy (79%). Qualitatively, the cross application model yields reasonable results which can be used for preliminary landslide hazard mapping.

  20. Evaluation of logistic regression models and effect of covariates for case-control study in RNA-Seq analysis.

    PubMed

    Choi, Seung Hoan; Labadorf, Adam T; Myers, Richard H; Lunetta, Kathryn L; Dupuis, Josée; DeStefano, Anita L

    2017-02-06

    Next generation sequencing provides a count of RNA molecules in the form of short reads, yielding discrete, often highly non-normally distributed gene expression measurements. Although Negative Binomial (NB) regression has been generally accepted in the analysis of RNA sequencing (RNA-Seq) data, its appropriateness has not been exhaustively evaluated. We explore logistic regression as an alternative method for RNA-Seq studies designed to compare cases and controls, where disease status is modeled as a function of RNA-Seq reads using simulated and Huntington disease data. We evaluate the effect of adjusting for covariates that have an unknown relationship with gene expression. Finally, we incorporate the data adaptive method in order to compare false positive rates. When the sample size is small or the expression levels of a gene are highly dispersed, the NB regression shows inflated Type-I error rates but the Classical logistic and Bayes logistic (BL) regressions are conservative. Firth's logistic (FL) regression performs well or is slightly conservative. Large sample size and low dispersion generally make Type-I error rates of all methods close to nominal alpha levels of 0.05 and 0.01. However, Type-I error rates are controlled after applying the data adaptive method. The NB, BL, and FL regressions gain increased power with large sample size, large log2 fold-change, and low dispersion. The FL regression has comparable power to NB regression. We conclude that implementing the data adaptive method appropriately controls Type-I error rates in RNA-Seq analysis. Firth's logistic regression provides a concise statistical inference process and reduces spurious associations from inaccurately estimated dispersion parameters in the negative binomial framework.

  1. A Comparison of Individual-Level and Community-Level Predictors of Marijuana and Cocaine Use among a Sample of Newly Arrested Juvenile Offenders

    ERIC Educational Resources Information Center

    Childs, Kristina; Dembo, Richard; Belenko, Steven; Wareham, Jennifer; Schmeidler, James

    2011-01-01

    Variations in drug use have been found across individual-level factors and community characteristics, and by type of drug used. Relatively little research, however, has examined this variation among juvenile offenders. Based on a sample of 924 newly arrested juvenile offenders, two multilevel logistic regression models predicting marijuana test…

  2. Large unbalanced credit scoring using Lasso-logistic regression ensemble.

    PubMed

    Wang, Hong; Xu, Qingsong; Zhou, Lifeng

    2015-01-01

    Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plausibility of ensemble learning using regularized logistic regression as the base classifier to deal with credit scoring problems. In this research, the data is first balanced and diversified by clustering and bagging algorithms. Then we apply a Lasso-logistic regression learning ensemble to evaluate the credit risks. We show that the proposed algorithm outperforms popular credit scoring models such as decision tree, Lasso-logistic regression and random forests in terms of AUC and F-measure. We also provide two importance measures for the proposed model to identify important variables in the data.

  3. A nonparametric multiple imputation approach for missing categorical data.

    PubMed

    Zhou, Muhan; He, Yulei; Yu, Mandi; Hsu, Chiu-Hsieh

    2017-06-06

    Incomplete categorical variables with more than two categories are common in public health data. However, most of the existing missing-data methods do not use the information from nonresponse (missingness) probabilities. We propose a nearest-neighbour multiple imputation approach to impute a missing at random categorical outcome and to estimate the proportion of each category. The donor set for imputation is formed by measuring distances between each missing value with other non-missing values. The distance function is calculated based on a predictive score, which is derived from two working models: one fits a multinomial logistic regression for predicting the missing categorical outcome (the outcome model) and the other fits a logistic regression for predicting missingness probabilities (the missingness model). A weighting scheme is used to accommodate contributions from two working models when generating the predictive score. A missing value is imputed by randomly selecting one of the non-missing values with the smallest distances. We conduct a simulation to evaluate the performance of the proposed method and compare it with several alternative methods. A real-data application is also presented. The simulation study suggests that the proposed method performs well when missingness probabilities are not extreme under some misspecifications of the working models. However, the calibration estimator, which is also based on two working models, can be highly unstable when missingness probabilities for some observations are extremely high. In this scenario, the proposed method produces more stable and better estimates. In addition, proper weights need to be chosen to balance the contributions from the two working models and achieve optimal results for the proposed method. We conclude that the proposed multiple imputation method is a reasonable approach to dealing with missing categorical outcome data with more than two levels for assessing the distribution of the outcome. In terms of the choices for the working models, we suggest a multinomial logistic regression for predicting the missing outcome and a binary logistic regression for predicting the missingness probability.

  4. A Multilevel Study of Students' Motivations of Studying Accounting: Implications for Employers

    ERIC Educational Resources Information Center

    Law, Philip; Yuen, Desmond

    2012-01-01

    Purpose: The purpose of this study is to examine the influence of factors affecting students' choice of accounting as a study major in Hong Kong. Design/methodology/approach: Multinomial logistic regression and Hierarchical Generalized Linear Modeling (HGLM) are used to analyze the survey data for the level one and level two data, which is the…

  5. Methodologic considerations in the design and analysis of nested case-control studies: association between cytokines and postoperative delirium.

    PubMed

    Ngo, Long H; Inouye, Sharon K; Jones, Richard N; Travison, Thomas G; Libermann, Towia A; Dillon, Simon T; Kuchel, George A; Vasunilashorn, Sarinnapha M; Alsop, David C; Marcantonio, Edward R

    2017-06-06

    The nested case-control study (NCC) design within a prospective cohort study is used when outcome data are available for all subjects, but the exposure of interest has not been collected, and is difficult or prohibitively expensive to obtain for all subjects. A NCC analysis with good matching procedures yields estimates that are as efficient and unbiased as estimates from the full cohort study. We present methodological considerations in a matched NCC design and analysis, which include the choice of match algorithms, analysis methods to evaluate the association of exposures of interest with outcomes, and consideration of overmatching. Matched, NCC design within a longitudinal observational prospective cohort study in the setting of two academic hospitals. Study participants are patients aged over 70 years who underwent scheduled major non-cardiac surgery. The primary outcome was postoperative delirium from in-hospital interviews and medical record review. The main exposure was IL-6 concentration (pg/ml) from blood sampled at three time points before delirium occurred. We used nonparametric signed ranked test to test for the median of the paired differences. We used conditional logistic regression to model the risk of IL-6 on delirium incidence. Simulation was used to generate a sample of cohort data on which unconditional multivariable logistic regression was used, and the results were compared to those of the conditional logistic regression. Partial R-square was used to assess the level of overmatching. We found that the optimal match algorithm yielded more matched pairs than the greedy algorithm. The choice of analytic strategy-whether to consider measured cytokine levels as the predictor or outcome-- yielded inferences that have different clinical interpretations but similar levels of statistical significance. Estimation results from NCC design using conditional logistic regression, and from simulated cohort design using unconditional logistic regression, were similar. We found minimal evidence for overmatching. Using a matched NCC approach introduces methodological challenges into the study design and data analysis. Nonetheless, with careful selection of the match algorithm, match factors, and analysis methods, this design is cost effective and, for our study, yields estimates that are similar to those from a prospective cohort study design.

  6. Logistic Regression with Multiple Random Effects: A Simulation Study of Estimation Methods and Statistical Packages.

    PubMed

    Kim, Yoonsang; Choi, Young-Ku; Emery, Sherry

    2013-08-01

    Several statistical packages are capable of estimating generalized linear mixed models and these packages provide one or more of three estimation methods: penalized quasi-likelihood, Laplace, and Gauss-Hermite. Many studies have investigated these methods' performance for the mixed-effects logistic regression model. However, the authors focused on models with one or two random effects and assumed a simple covariance structure between them, which may not be realistic. When there are multiple correlated random effects in a model, the computation becomes intensive, and often an algorithm fails to converge. Moreover, in our analysis of smoking status and exposure to anti-tobacco advertisements, we have observed that when a model included multiple random effects, parameter estimates varied considerably from one statistical package to another even when using the same estimation method. This article presents a comprehensive review of the advantages and disadvantages of each estimation method. In addition, we compare the performances of the three methods across statistical packages via simulation, which involves two- and three-level logistic regression models with at least three correlated random effects. We apply our findings to a real dataset. Our results suggest that two packages-SAS GLIMMIX Laplace and SuperMix Gaussian quadrature-perform well in terms of accuracy, precision, convergence rates, and computing speed. We also discuss the strengths and weaknesses of the two packages in regard to sample sizes.

  7. Logistic Regression with Multiple Random Effects: A Simulation Study of Estimation Methods and Statistical Packages

    PubMed Central

    Kim, Yoonsang; Emery, Sherry

    2013-01-01

    Several statistical packages are capable of estimating generalized linear mixed models and these packages provide one or more of three estimation methods: penalized quasi-likelihood, Laplace, and Gauss-Hermite. Many studies have investigated these methods’ performance for the mixed-effects logistic regression model. However, the authors focused on models with one or two random effects and assumed a simple covariance structure between them, which may not be realistic. When there are multiple correlated random effects in a model, the computation becomes intensive, and often an algorithm fails to converge. Moreover, in our analysis of smoking status and exposure to anti-tobacco advertisements, we have observed that when a model included multiple random effects, parameter estimates varied considerably from one statistical package to another even when using the same estimation method. This article presents a comprehensive review of the advantages and disadvantages of each estimation method. In addition, we compare the performances of the three methods across statistical packages via simulation, which involves two- and three-level logistic regression models with at least three correlated random effects. We apply our findings to a real dataset. Our results suggest that two packages—SAS GLIMMIX Laplace and SuperMix Gaussian quadrature—perform well in terms of accuracy, precision, convergence rates, and computing speed. We also discuss the strengths and weaknesses of the two packages in regard to sample sizes. PMID:24288415

  8. Susceptibility assessment of earthquake-triggered landslides in El Salvador using logistic regression

    NASA Astrophysics Data System (ADS)

    García-Rodríguez, M. J.; Malpica, J. A.; Benito, B.; Díaz, M.

    2008-03-01

    This work has evaluated the probability of earthquake-triggered landslide occurrence in the whole of El Salvador, with a Geographic Information System (GIS) and a logistic regression model. Slope gradient, elevation, aspect, mean annual precipitation, lithology, land use, and terrain roughness are the predictor variables used to determine the dependent variable of occurrence or non-occurrence of landslides within an individual grid cell. The results illustrate the importance of terrain roughness and soil type as key factors within the model — using only these two variables the analysis returned a significance level of 89.4%. The results obtained from the model within the GIS were then used to produce a map of relative landslide susceptibility.

  9. Addressing data privacy in matched studies via virtual pooling.

    PubMed

    Saha-Chaudhuri, P; Weinberg, C R

    2017-09-07

    Data confidentiality and shared use of research data are two desirable but sometimes conflicting goals in research with multi-center studies and distributed data. While ideal for straightforward analysis, confidentiality restrictions forbid creation of a single dataset that includes covariate information of all participants. Current approaches such as aggregate data sharing, distributed regression, meta-analysis and score-based methods can have important limitations. We propose a novel application of an existing epidemiologic tool, specimen pooling, to enable confidentiality-preserving analysis of data arising from a matched case-control, multi-center design. Instead of pooling specimens prior to assay, we apply the methodology to virtually pool (aggregate) covariates within nodes. Such virtual pooling retains most of the information used in an analysis with individual data and since individual participant data is not shared externally, within-node virtual pooling preserves data confidentiality. We show that aggregated covariate levels can be used in a conditional logistic regression model to estimate individual-level odds ratios of interest. The parameter estimates from the standard conditional logistic regression are compared to the estimates based on a conditional logistic regression model with aggregated data. The parameter estimates are shown to be similar to those without pooling and to have comparable standard errors and confidence interval coverage. Virtual data pooling can be used to maintain confidentiality of data from multi-center study and can be particularly useful in research with large-scale distributed data.

  10. Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble

    PubMed Central

    Wang, Hong; Xu, Qingsong; Zhou, Lifeng

    2015-01-01

    Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plausibility of ensemble learning using regularized logistic regression as the base classifier to deal with credit scoring problems. In this research, the data is first balanced and diversified by clustering and bagging algorithms. Then we apply a Lasso-logistic regression learning ensemble to evaluate the credit risks. We show that the proposed algorithm outperforms popular credit scoring models such as decision tree, Lasso-logistic regression and random forests in terms of AUC and F-measure. We also provide two importance measures for the proposed model to identify important variables in the data. PMID:25706988

  11. Dental health services utilization and associated factors in children 6 to 12 years old in a low-income country.

    PubMed

    Medina-Solis, Carlo Eduardo; Maupomé, Gerardo; del Socorro, Herrera Miriam; Pérez-Núñez, Ricardo; Avila-Burgos, Leticia; Lamadrid-Figueroa, Hector

    2008-01-01

    To determine the factors associated with the dental health services utilization among children ages 6 to 12 in León, Nicaragua. A cross-sectional study was carried out in 1,400 schoolchildren. Using a questionnaire, we determined information related to utilization and independent variables in the previous year. Oral health needs were established by means of a dental examination. To identify the independent variables associated with dental health services utilization, two types of multivariate regression models were used, according to the measurement scale of the outcome variable: a) frequency of utilization as (0) none, (1) one, and (2) two or more, analyzed with the ordered logistic regression and b) the type of service utilized as (0) none, (1) preventive services, (2) curative services, and (3) both services, analyzed with the multinomial logistic regression. The proportion of children who received at least one dental service in the 12 months prior to the study was 27.7 percent. The variables associated with utilization in the two models were older age, female sex, more frequent toothbrushing, positive attitude of the mother toward the child's oral health, higher socioeconomic level, and higher oral health needs. Various predisposing, enabling, and oral health needs variables were associated with higher dental health services utilization. As in prior reports elsewhere, these results from Nicaragua confirmed that utilization inequalities exist between socioeconomic groups. The multinomial logistic regression model evidenced the association of different variables depending on the type of service used.

  12. Intermediate and advanced topics in multilevel logistic regression analysis

    PubMed Central

    Merlo, Juan

    2017-01-01

    Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher‐level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within‐cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population‐average effect of covariates measured at the subject and cluster level, in contrast to the within‐cluster or cluster‐specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster‐level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. PMID:28543517

  13. Intermediate and advanced topics in multilevel logistic regression analysis.

    PubMed

    Austin, Peter C; Merlo, Juan

    2017-09-10

    Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher-level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within-cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population-average effect of covariates measured at the subject and cluster level, in contrast to the within-cluster or cluster-specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster-level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.

  14. Comparison of Two Approaches for Handling Missing Covariates in Logistic Regression

    ERIC Educational Resources Information Center

    Peng, Chao-Ying Joanne; Zhu, Jin

    2008-01-01

    For the past 25 years, methodological advances have been made in missing data treatment. Most published work has focused on missing data in dependent variables under various conditions. The present study seeks to fill the void by comparing two approaches for handling missing data in categorical covariates in logistic regression: the…

  15. Logistic regression analysis of conventional ultrasonography, strain elastosonography, and contrast-enhanced ultrasound characteristics for the differentiation of benign and malignant thyroid nodules

    PubMed Central

    Deng, Yingyuan; Wang, Tianfu; Chen, Siping; Liu, Weixiang

    2017-01-01

    The aim of the study is to screen the significant sonographic features by logistic regression analysis and fit a model to diagnose thyroid nodules. A total of 525 pathological thyroid nodules were retrospectively analyzed. All the nodules underwent conventional ultrasonography (US), strain elastosonography (SE), and contrast -enhanced ultrasound (CEUS). Those nodules’ 12 suspicious sonographic features were used to assess thyroid nodules. The significant features of diagnosing thyroid nodules were picked out by logistic regression analysis. All variables that were statistically related to diagnosis of thyroid nodules, at a level of p < 0.05 were embodied in a logistic regression analysis model. The significant features in the logistic regression model of diagnosing thyroid nodules were calcification, suspected cervical lymph node metastasis, hypoenhancement pattern, margin, shape, vascularity, posterior acoustic, echogenicity, and elastography score. According to the results of logistic regression analysis, the formula that could predict whether or not thyroid nodules are malignant was established. The area under the receiver operating curve (ROC) was 0.930 and the sensitivity, specificity, accuracy, positive predictive value, and negative predictive value were 83.77%, 89.56%, 87.05%, 86.04%, and 87.79% respectively. PMID:29228030

  16. Logistic regression analysis of conventional ultrasonography, strain elastosonography, and contrast-enhanced ultrasound characteristics for the differentiation of benign and malignant thyroid nodules.

    PubMed

    Pang, Tiantian; Huang, Leidan; Deng, Yingyuan; Wang, Tianfu; Chen, Siping; Gong, Xuehao; Liu, Weixiang

    2017-01-01

    The aim of the study is to screen the significant sonographic features by logistic regression analysis and fit a model to diagnose thyroid nodules. A total of 525 pathological thyroid nodules were retrospectively analyzed. All the nodules underwent conventional ultrasonography (US), strain elastosonography (SE), and contrast -enhanced ultrasound (CEUS). Those nodules' 12 suspicious sonographic features were used to assess thyroid nodules. The significant features of diagnosing thyroid nodules were picked out by logistic regression analysis. All variables that were statistically related to diagnosis of thyroid nodules, at a level of p < 0.05 were embodied in a logistic regression analysis model. The significant features in the logistic regression model of diagnosing thyroid nodules were calcification, suspected cervical lymph node metastasis, hypoenhancement pattern, margin, shape, vascularity, posterior acoustic, echogenicity, and elastography score. According to the results of logistic regression analysis, the formula that could predict whether or not thyroid nodules are malignant was established. The area under the receiver operating curve (ROC) was 0.930 and the sensitivity, specificity, accuracy, positive predictive value, and negative predictive value were 83.77%, 89.56%, 87.05%, 86.04%, and 87.79% respectively.

  17. Comparing Linear Discriminant Function with Logistic Regression for the Two-Group Classification Problem.

    ERIC Educational Resources Information Center

    Fan, Xitao; Wang, Lin

    The Monte Carlo study compared the performance of predictive discriminant analysis (PDA) and that of logistic regression (LR) for the two-group classification problem. Prior probabilities were used for classification, but the cost of misclassification was assumed to be equal. The study used a fully crossed three-factor experimental design (with…

  18. Comparison of naïve Bayes and logistic regression for computer-aided diagnosis of breast masses using ultrasound imaging

    NASA Astrophysics Data System (ADS)

    Cary, Theodore W.; Cwanger, Alyssa; Venkatesh, Santosh S.; Conant, Emily F.; Sehgal, Chandra M.

    2012-03-01

    This study compares the performance of two proven but very different machine learners, Naïve Bayes and logistic regression, for differentiating malignant and benign breast masses using ultrasound imaging. Ultrasound images of 266 masses were analyzed quantitatively for shape, echogenicity, margin characteristics, and texture features. These features along with patient age, race, and mammographic BI-RADS category were used to train Naïve Bayes and logistic regression classifiers to diagnose lesions as malignant or benign. ROC analysis was performed using all of the features and using only a subset that maximized information gain. Performance was determined by the area under the ROC curve, Az, obtained from leave-one-out cross validation. Naïve Bayes showed significant variation (Az 0.733 +/- 0.035 to 0.840 +/- 0.029, P < 0.002) with the choice of features, but the performance of logistic regression was relatively unchanged under feature selection (Az 0.839 +/- 0.029 to 0.859 +/- 0.028, P = 0.605). Out of 34 features, a subset of 6 gave the highest information gain: brightness difference, margin sharpness, depth-to-width, mammographic BI-RADs, age, and race. The probabilities of malignancy determined by Naïve Bayes and logistic regression after feature selection showed significant correlation (R2= 0.87, P < 0.0001). The diagnostic performance of Naïve Bayes and logistic regression can be comparable, but logistic regression is more robust. Since probability of malignancy cannot be measured directly, high correlation between the probabilities derived from two basic but dissimilar models increases confidence in the predictive power of machine learning models for characterizing solid breast masses on ultrasound.

  19. Rank-Optimized Logistic Matrix Regression toward Improved Matrix Data Classification.

    PubMed

    Zhang, Jianguang; Jiang, Jianmin

    2018-02-01

    While existing logistic regression suffers from overfitting and often fails in considering structural information, we propose a novel matrix-based logistic regression to overcome the weakness. In the proposed method, 2D matrices are directly used to learn two groups of parameter vectors along each dimension without vectorization, which allows the proposed method to fully exploit the underlying structural information embedded inside the 2D matrices. Further, we add a joint [Formula: see text]-norm on two parameter matrices, which are organized by aligning each group of parameter vectors in columns. This added co-regularization term has two roles-enhancing the effect of regularization and optimizing the rank during the learning process. With our proposed fast iterative solution, we carried out extensive experiments. The results show that in comparison to both the traditional tensor-based methods and the vector-based regression methods, our proposed solution achieves better performance for matrix data classifications.

  20. "Let Me Count the Ways:" Fostering Reasons for Living among Low-Income, Suicidal, African American Women

    ERIC Educational Resources Information Center

    West, Lindsey M.; Davis, Telsie A.; Thompson, Martie P.; Kaslow, Nadine J.

    2011-01-01

    Protective factors for fostering reasons for living were examined among low-income, suicidal, African American women. Bivariate logistic regressions revealed that higher levels of optimism, spiritual well-being, and family social support predicted reasons for living. Multivariate logistic regressions indicated that spiritual well-being showed…

  1. Matched samples logistic regression in case-control studies with missing values: when to break the matches.

    PubMed

    Hansson, Lisbeth; Khamis, Harry J

    2008-12-01

    Simulated data sets are used to evaluate conditional and unconditional maximum likelihood estimation in an individual case-control design with continuous covariates when there are different rates of excluded cases and different levels of other design parameters. The effectiveness of the estimation procedures is measured by method bias, variance of the estimators, root mean square error (RMSE) for logistic regression and the percentage of explained variation. Conditional estimation leads to higher RMSE than unconditional estimation in the presence of missing observations, especially for 1:1 matching. The RMSE is higher for the smaller stratum size, especially for the 1:1 matching. The percentage of explained variation appears to be insensitive to missing data, but is generally higher for the conditional estimation than for the unconditional estimation. It is particularly good for the 1:2 matching design. For minimizing RMSE, a high matching ratio is recommended; in this case, conditional and unconditional logistic regression models yield comparable levels of effectiveness. For maximizing the percentage of explained variation, the 1:2 matching design with the conditional logistic regression model is recommended.

  2. London Measure of Unplanned Pregnancy: guidance for its use as an outcome measure

    PubMed Central

    Hall, Jennifer A; Barrett, Geraldine; Copas, Andrew; Stephenson, Judith

    2017-01-01

    Background The London Measure of Unplanned Pregnancy (LMUP) is a psychometrically validated measure of the degree of intention of a current or recent pregnancy. The LMUP is increasingly being used worldwide, and can be used to evaluate family planning or preconception care programs. However, beyond recommending the use of the full LMUP scale, there is no published guidance on how to use the LMUP as an outcome measure. Ordinal logistic regression has been recommended informally, but studies published to date have all used binary logistic regression and dichotomized the scale at different cut points. There is thus a need for evidence-based guidance to provide a standardized methodology for multivariate analysis and to enable comparison of results. This paper makes recommendations for the regression method for analysis of the LMUP as an outcome measure. Materials and methods Data collected from 4,244 pregnant women in Malawi were used to compare five regression methods: linear, logistic with two cut points, and ordinal logistic with either the full or grouped LMUP score. The recommendations were then tested on the original UK LMUP data. Results There were small but no important differences in the findings across the regression models. Logistic regression resulted in the largest loss of information, and assumptions were violated for the linear and ordinal logistic regression. Consequently, robust standard errors were used for linear regression and a partial proportional odds ordinal logistic regression model attempted. The latter could only be fitted for grouped LMUP score. Conclusion We recommend the linear regression model with robust standard errors to make full use of the LMUP score when analyzed as an outcome measure. Ordinal logistic regression could be considered, but a partial proportional odds model with grouped LMUP score may be required. Logistic regression is the least-favored option, due to the loss of information. For logistic regression, the cut point for un/planned pregnancy should be between nine and ten. These recommendations will standardize the analysis of LMUP data and enhance comparability of results across studies. PMID:28435343

  3. The Differential Effects of Preschool: Evidence from Virginia

    ERIC Educational Resources Information Center

    Huang, Francis L.; Invernizzi, Marcia A.; Drake, E. Allison

    2012-01-01

    This study investigated the differential and persistent effects of a state-funded pre-K program, the Virginia Preschool Initiative (VPI). We analyzed data from a cohort of over 60,000 students nested in approximately 1000 schools from the beginning of kindergarten to the end of first grade using two-level hierarchical logistic regression models.…

  4. Predictors of Employment for Youths with Visual Impairments: Findings from the Second National Longitudinal Transition Study

    ERIC Educational Resources Information Center

    McDonnall, Michele Capella

    2011-01-01

    The study reported here identified factors that predict employment for transition-age youths with visual impairments. Logistic regression was used to predict employment at two levels. Significant variables were early and recent work experiences, completion of a postsecondary program, difficulty with transportation, independent travel skills, and…

  5. Independent Prognostic Factors for Acute Organophosphorus Pesticide Poisoning.

    PubMed

    Tang, Weidong; Ruan, Feng; Chen, Qi; Chen, Suping; Shao, Xuebo; Gao, Jianbo; Zhang, Mao

    2016-07-01

    Acute organophosphorus pesticide poisoning (AOPP) is becoming a significant problem and a potential cause of human mortality because of the abuse of organophosphate compounds. This study aims to determine the independent prognostic factors of AOPP by using multivariate logistic regression analysis. The clinical data for 71 subjects with AOPP admitted to our hospital were retrospectively analyzed. This information included the Acute Physiology and Chronic Health Evaluation II (APACHE II) scores, 6-h post-admission blood lactate levels, post-admission 6-h lactate clearance rates, admission blood cholinesterase levels, 6-h post-admission blood cholinesterase levels, cholinesterase activity, blood pH, and other factors. Univariate analysis and multivariate logistic regression analyses were conducted to identify all prognostic factors and independent prognostic factors, respectively. A receiver operating characteristic curve was plotted to analyze the testing power of independent prognostic factors. Twelve of 71 subjects died. Admission blood lactate levels, 6-h post-admission blood lactate levels, post-admission 6-h lactate clearance rates, blood pH, and APACHE II scores were identified as prognostic factors for AOPP according to the univariate analysis, whereas only 6-h post-admission blood lactate levels, post-admission 6-h lactate clearance rates, and blood pH were independent prognostic factors identified by multivariate logistic regression analysis. The receiver operating characteristic analysis suggested that post-admission 6-h lactate clearance rates were of moderate diagnostic value. High 6-h post-admission blood lactate levels, low blood pH, and low post-admission 6-h lactate clearance rates were independent prognostic factors identified by multivariate logistic regression analysis. Copyright © 2016 by Daedalus Enterprises.

  6. Effects of Social Class and School Conditions on Educational Enrollment and Achievement of Boys and Girls in Rural Viet Nam

    ERIC Educational Resources Information Center

    Nguyen, Phuong L.

    2006-01-01

    This study examines the effects of parental SES, school quality, and community factors on children's enrollment and achievement in rural areas in Viet Nam, using logistic regression and ordered logistic regression. Multivariate analysis reveals significant differences in educational enrollment and outcomes by level of household expenditures and…

  7. Logistic regression for dichotomized counts.

    PubMed

    Preisser, John S; Das, Kalyan; Benecha, Habtamu; Stamm, John W

    2016-12-01

    Sometimes there is interest in a dichotomized outcome indicating whether a count variable is positive or zero. Under this scenario, the application of ordinary logistic regression may result in efficiency loss, which is quantifiable under an assumed model for the counts. In such situations, a shared-parameter hurdle model is investigated for more efficient estimation of regression parameters relating to overall effects of covariates on the dichotomous outcome, while handling count data with many zeroes. One model part provides a logistic regression containing marginal log odds ratio effects of primary interest, while an ancillary model part describes the mean count of a Poisson or negative binomial process in terms of nuisance regression parameters. Asymptotic efficiency of the logistic model parameter estimators of the two-part models is evaluated with respect to ordinary logistic regression. Simulations are used to assess the properties of the models with respect to power and Type I error, the latter investigated under both misspecified and correctly specified models. The methods are applied to data from a randomized clinical trial of three toothpaste formulations to prevent incident dental caries in a large population of Scottish schoolchildren. © The Author(s) 2014.

  8. Logistic LASSO regression for the diagnosis of breast cancer using clinical demographic data and the BI-RADS lexicon for ultrasonography.

    PubMed

    Kim, Sun Mi; Kim, Yongdai; Jeong, Kuhwan; Jeong, Heeyeong; Kim, Jiyoung

    2018-01-01

    The aim of this study was to compare the performance of image analysis for predicting breast cancer using two distinct regression models and to evaluate the usefulness of incorporating clinical and demographic data (CDD) into the image analysis in order to improve the diagnosis of breast cancer. This study included 139 solid masses from 139 patients who underwent a ultrasonography-guided core biopsy and had available CDD between June 2009 and April 2010. Three breast radiologists retrospectively reviewed 139 breast masses and described each lesion using the Breast Imaging Reporting and Data System (BI-RADS) lexicon. We applied and compared two regression methods-stepwise logistic (SL) regression and logistic least absolute shrinkage and selection operator (LASSO) regression-in which the BI-RADS descriptors and CDD were used as covariates. We investigated the performances of these regression methods and the agreement of radiologists in terms of test misclassification error and the area under the curve (AUC) of the tests. Logistic LASSO regression was superior (P<0.05) to SL regression, regardless of whether CDD was included in the covariates, in terms of test misclassification errors (0.234 vs. 0.253, without CDD; 0.196 vs. 0.258, with CDD) and AUC (0.785 vs. 0.759, without CDD; 0.873 vs. 0.735, with CDD). However, it was inferior (P<0.05) to the agreement of three radiologists in terms of test misclassification errors (0.234 vs. 0.168, without CDD; 0.196 vs. 0.088, with CDD) and the AUC without CDD (0.785 vs. 0.844, P<0.001), but was comparable to the AUC with CDD (0.873 vs. 0.880, P=0.141). Logistic LASSO regression based on BI-RADS descriptors and CDD showed better performance than SL in predicting the presence of breast cancer. The use of CDD as a supplement to the BI-RADS descriptors significantly improved the prediction of breast cancer using logistic LASSO regression.

  9. Assessing landslide susceptibility by statistical data analysis and GIS: the case of Daunia (Apulian Apennines, Italy)

    NASA Astrophysics Data System (ADS)

    Ceppi, C.; Mancini, F.; Ritrovato, G.

    2009-04-01

    This study aim at the landslide susceptibility mapping within an area of the Daunia (Apulian Apennines, Italy) by a multivariate statistical method and data manipulation in a Geographical Information System (GIS) environment. Among the variety of existing statistical data analysis techniques, the logistic regression was chosen to produce a susceptibility map all over an area where small settlements are historically threatened by landslide phenomena. By logistic regression a best fitting between the presence or absence of landslide (dependent variable) and the set of independent variables is performed on the basis of a maximum likelihood criterion, bringing to the estimation of regression coefficients. The reliability of such analysis is therefore due to the ability to quantify the proneness to landslide occurrences by the probability level produced by the analysis. The inventory of dependent and independent variables were managed in a GIS, where geometric properties and attributes have been translated into raster cells in order to proceed with the logistic regression by means of SPSS (Statistical Package for the Social Sciences) package. A landslide inventory was used to produce the bivariate dependent variable whereas the independent set of variable concerned with slope, aspect, elevation, curvature, drained area, lithology and land use after their reductions to dummy variables. The effect of independent parameters on landslide occurrence was assessed by the corresponding coefficient in the logistic regression function, highlighting a major role played by the land use variable in determining occurrence and distribution of phenomena. Once the outcomes of the logistic regression are determined, data are re-introduced in the GIS to produce a map reporting the proneness to landslide as predicted level of probability. As validation of results and regression model a cell-by-cell comparison between the susceptibility map and the initial inventory of landslide events was performed and an agreement at 75% level achieved.

  10. Discrete post-processing of total cloud cover ensemble forecasts

    NASA Astrophysics Data System (ADS)

    Hemri, Stephan; Haiden, Thomas; Pappenberger, Florian

    2017-04-01

    This contribution presents an approach to post-process ensemble forecasts for the discrete and bounded weather variable of total cloud cover. Two methods for discrete statistical post-processing of ensemble predictions are tested. The first approach is based on multinomial logistic regression, the second involves a proportional odds logistic regression model. Applying them to total cloud cover raw ensemble forecasts from the European Centre for Medium-Range Weather Forecasts improves forecast skill significantly. Based on station-wise post-processing of raw ensemble total cloud cover forecasts for a global set of 3330 stations over the period from 2007 to early 2014, the more parsimonious proportional odds logistic regression model proved to slightly outperform the multinomial logistic regression model. Reference Hemri, S., Haiden, T., & Pappenberger, F. (2016). Discrete post-processing of total cloud cover ensemble forecasts. Monthly Weather Review 144, 2565-2577.

  11. Determining factors influencing survival of breast cancer by fuzzy logistic regression model.

    PubMed

    Nikbakht, Roya; Bahrampour, Abbas

    2017-01-01

    Fuzzy logistic regression model can be used for determining influential factors of disease. This study explores the important factors of actual predictive survival factors of breast cancer's patients. We used breast cancer data which collected by cancer registry of Kerman University of Medical Sciences during the period of 2000-2007. The variables such as morphology, grade, age, and treatments (surgery, radiotherapy, and chemotherapy) were applied in the fuzzy logistic regression model. Performance of model was determined in terms of mean degree of membership (MDM). The study results showed that almost 41% of patients were in neoplasm and malignant group and more than two-third of them were still alive after 5-year follow-up. Based on the fuzzy logistic model, the most important factors influencing survival were chemotherapy, morphology, and radiotherapy, respectively. Furthermore, the MDM criteria show that the fuzzy logistic regression have a good fit on the data (MDM = 0.86). Fuzzy logistic regression model showed that chemotherapy is more important than radiotherapy in survival of patients with breast cancer. In addition, another ability of this model is calculating possibilistic odds of survival in cancer patients. The results of this study can be applied in clinical research. Furthermore, there are few studies which applied the fuzzy logistic models. Furthermore, we recommend using this model in various research areas.

  12. Estimating interaction on an additive scale between continuous determinants in a logistic regression model.

    PubMed

    Knol, Mirjam J; van der Tweel, Ingeborg; Grobbee, Diederick E; Numans, Mattijs E; Geerlings, Mirjam I

    2007-10-01

    To determine the presence of interaction in epidemiologic research, typically a product term is added to the regression model. In linear regression, the regression coefficient of the product term reflects interaction as departure from additivity. However, in logistic regression it refers to interaction as departure from multiplicativity. Rothman has argued that interaction estimated as departure from additivity better reflects biologic interaction. So far, literature on estimating interaction on an additive scale using logistic regression only focused on dichotomous determinants. The objective of the present study was to provide the methods to estimate interaction between continuous determinants and to illustrate these methods with a clinical example. and results From the existing literature we derived the formulas to quantify interaction as departure from additivity between one continuous and one dichotomous determinant and between two continuous determinants using logistic regression. Bootstrapping was used to calculate the corresponding confidence intervals. To illustrate the theory with an empirical example, data from the Utrecht Health Project were used, with age and body mass index as risk factors for elevated diastolic blood pressure. The methods and formulas presented in this article are intended to assist epidemiologists to calculate interaction on an additive scale between two variables on a certain outcome. The proposed methods are included in a spreadsheet which is freely available at: http://www.juliuscenter.nl/additive-interaction.xls.

  13. Iterative Purification and Effect Size Use with Logistic Regression for Differential Item Functioning Detection

    ERIC Educational Resources Information Center

    French, Brian F.; Maller, Susan J.

    2007-01-01

    Two unresolved implementation issues with logistic regression (LR) for differential item functioning (DIF) detection include ability purification and effect size use. Purification is suggested to control inaccuracies in DIF detection as a result of DIF items in the ability estimate. Additionally, effect size use may be beneficial in controlling…

  14. Use of generalized ordered logistic regression for the analysis of multidrug resistance data.

    PubMed

    Agga, Getahun E; Scott, H Morgan

    2015-10-01

    Statistical analysis of antimicrobial resistance data largely focuses on individual antimicrobial's binary outcome (susceptible or resistant). However, bacteria are becoming increasingly multidrug resistant (MDR). Statistical analysis of MDR data is mostly descriptive often with tabular or graphical presentations. Here we report the applicability of generalized ordinal logistic regression model for the analysis of MDR data. A total of 1,152 Escherichia coli, isolated from the feces of weaned pigs experimentally supplemented with chlortetracycline (CTC) and copper, were tested for susceptibilities against 15 antimicrobials and were binary classified into resistant or susceptible. The 15 antimicrobial agents tested were grouped into eight different antimicrobial classes. We defined MDR as the number of antimicrobial classes to which E. coli isolates were resistant ranging from 0 to 8. Proportionality of the odds assumption of the ordinal logistic regression model was violated only for the effect of treatment period (pre-treatment, during-treatment and post-treatment); but not for the effect of CTC or copper supplementation. Subsequently, a partially constrained generalized ordinal logistic model was built that allows for the effect of treatment period to vary while constraining the effects of treatment (CTC and copper supplementation) to be constant across the levels of MDR classes. Copper (Proportional Odds Ratio [Prop OR]=1.03; 95% CI=0.73-1.47) and CTC (Prop OR=1.1; 95% CI=0.78-1.56) supplementation were not significantly associated with the level of MDR adjusted for the effect of treatment period. MDR generally declined over the trial period. In conclusion, generalized ordered logistic regression can be used for the analysis of ordinal data such as MDR data when the proportionality assumptions for ordered logistic regression are violated. Published by Elsevier B.V.

  15. Modeling Governance KB with CATPCA to Overcome Multicollinearity in the Logistic Regression

    NASA Astrophysics Data System (ADS)

    Khikmah, L.; Wijayanto, H.; Syafitri, U. D.

    2017-04-01

    The problem often encounters in logistic regression modeling are multicollinearity problems. Data that have multicollinearity between explanatory variables with the result in the estimation of parameters to be bias. Besides, the multicollinearity will result in error in the classification. In general, to overcome multicollinearity in regression used stepwise regression. They are also another method to overcome multicollinearity which involves all variable for prediction. That is Principal Component Analysis (PCA). However, classical PCA in only for numeric data. Its data are categorical, one method to solve the problems is Categorical Principal Component Analysis (CATPCA). Data were used in this research were a part of data Demographic and Population Survey Indonesia (IDHS) 2012. This research focuses on the characteristic of women of using the contraceptive methods. Classification results evaluated using Area Under Curve (AUC) values. The higher the AUC value, the better. Based on AUC values, the classification of the contraceptive method using stepwise method (58.66%) is better than the logistic regression model (57.39%) and CATPCA (57.39%). Evaluation of the results of logistic regression using sensitivity, shows the opposite where CATPCA method (99.79%) is better than logistic regression method (92.43%) and stepwise (92.05%). Therefore in this study focuses on major class classification (using a contraceptive method), then the selected model is CATPCA because it can raise the level of the major class model accuracy.

  16. The Association between Androgenic Hormone Levels and the Risk of Developing Coronary Artery Disease (CAD).

    PubMed

    Allameh, Farzad; Pourmand, Gholamreza; Bozorgi, Ali; Nekuie, Sepideh; Namdari, Farshad

    2016-01-01

    The aim of the study was to evaluate the relationship between the serum levels of androgens and Coronary Artery Disease (CAD) in an Iranian population. Male individuals admitted to Tehran Heart Center and Sina Hospital, Tehran, Iran from 2011-2012 were categorized into CAD and control groups based on selective coronary angiography. Baseline demographic data, including age, BMI, diabetes, and a history of hypertension were recorded. Patients were also assessed for their serum levels of total testosterone, free testosterone, estradiol, dehydroepi and rosterone sulfate (DHEA-S), and Sex Hormone Binding Globulin (SHBG). Data analysis was carried out chi-square and ANOVA tests as well as logistic regression analysis. Two hundred patients were in the CAD group and 135 individuals in control group. In the CAD group, 69 had single-vessel disease, 49 had two-vessel diseases, and 82 had three-vessel diseases. Statistically significant differences were observed between the individuals in the two groups with respect to age (P<0.0001), diabetes (P<0.0001), and a history of hypertension (P=0.018). The serum levels of free testosterone (P=0.048) and DHEA-S (P<0.0001) were significantly higher in the control group than in the CAD group; however, the serum level of SHBG was higher in the CAD group than in the control group (P=0.007). Results of the logistic regression analysis indicated that only age (P=0.042) and diabetes (P=0.003) had significant relationships with CAD. Although the serum levels of some of the androgens were significantly different between the two groups, no association was found between androgenic hormone levels and the risk of CAD, due mainly to the effect of age and diabetes.

  17. Regression trees for predicting mortality in patients with cardiovascular disease: What improvement is achieved by using ensemble-based methods?

    PubMed Central

    Austin, Peter C; Lee, Douglas S; Steyerberg, Ewout W; Tu, Jack V

    2012-01-01

    In biomedical research, the logistic regression model is the most commonly used method for predicting the probability of a binary outcome. While many clinical researchers have expressed an enthusiasm for regression trees, this method may have limited accuracy for predicting health outcomes. We aimed to evaluate the improvement that is achieved by using ensemble-based methods, including bootstrap aggregation (bagging) of regression trees, random forests, and boosted regression trees. We analyzed 30-day mortality in two large cohorts of patients hospitalized with either acute myocardial infarction (N = 16,230) or congestive heart failure (N = 15,848) in two distinct eras (1999–2001 and 2004–2005). We found that both the in-sample and out-of-sample prediction of ensemble methods offered substantial improvement in predicting cardiovascular mortality compared to conventional regression trees. However, conventional logistic regression models that incorporated restricted cubic smoothing splines had even better performance. We conclude that ensemble methods from the data mining and machine learning literature increase the predictive performance of regression trees, but may not lead to clear advantages over conventional logistic regression models for predicting short-term mortality in population-based samples of subjects with cardiovascular disease. PMID:22777999

  18. A comparative study on entrepreneurial attitudes modeled with logistic regression and Bayes nets.

    PubMed

    López Puga, Jorge; García García, Juan

    2012-11-01

    Entrepreneurship research is receiving increasing attention in our context, as entrepreneurs are key social agents involved in economic development. We compare the success of the dichotomic logistic regression model and the Bayes simple classifier to predict entrepreneurship, after manipulating the percentage of missing data and the level of categorization in predictors. A sample of undergraduate university students (N = 1230) completed five scales (motivation, attitude towards business creation, obstacles, deficiencies, and training needs) and we found that each of them predicted different aspects of the tendency to business creation. Additionally, our results show that the receiver operating characteristic (ROC) curve is affected by the rate of missing data in both techniques, but logistic regression seems to be more vulnerable when faced with missing data, whereas Bayes nets underperform slightly when categorization has been manipulated. Our study sheds light on the potential entrepreneur profile and we propose to use Bayesian networks as an additional alternative to overcome the weaknesses of logistic regression when missing data are present in applied research.

  19. Assessment of maternal anemia in rural Western China between 2001 and 2005: a two-level logistic regression approach

    PubMed Central

    2013-01-01

    Background There are multiple adverse effects of anemia on human function, particularly on women. However, few researches are conducted on women anemia in rural Western China. This study mainly aims to investigate the levels and associated factors of maternal anemia between 2001 and 2005 in this region. Methods 6172 and 5372 mothers with children under three years old were selected from 8 provinces in 2001 and from 9 provinces in 2005 respectively in Western China by means of a multi-stage probability proportion to size sampling method (PPS). The blood samples were tested and related socio-demographic information was obtained through questionnaires. A two-level logistic regression model was employed to identify the determinants and provincial variations of women anemia in 2001 and 2005. Results The results indicated that the crude prevalence of women anemia in 2005 was higher than the rate in 2001(45.7% vs 33.6%). Based on the nationwide census data in 2000, the age-standardized prevalence of women anemia in the study were obtained as 38.0% in 2001 and 50.0% in 2005 respectively. Two-level logistic model analysis showed that compared to the average, women were more likely to be anemic in Guangxi and Qinghai in 2001 as well as in Chongqing and Qinghai in 2005; that women from Minority groups had higher odds of anemia in contrast with Han; that women with higher parity, longer breastfeeding duration and higher socioeconomic level had a lower rate of anemia, while age of women was positively associated with anemia. The positive correlation between women anemia and altitude was also observed. Conclusions The study demonstrated that the burden of maternal anemia in rural Western China increased considerably between 2001 and 2005. The Chinese government should conduct integrated interventions on anemia of mothers in this region. PMID:23597320

  20. Sperm Retrieval in Patients with Klinefelter Syndrome: A Skewed Regression Model Analysis.

    PubMed

    Chehrazi, Mohammad; Rahimiforoushani, Abbas; Sabbaghian, Marjan; Nourijelyani, Keramat; Sadighi Gilani, Mohammad Ali; Hoseini, Mostafa; Vesali, Samira; Yaseri, Mehdi; Alizadeh, Ahad; Mohammad, Kazem; Samani, Reza Omani

    2017-01-01

    The most common chromosomal abnormality due to non-obstructive azoospermia (NOA) is Klinefelter syndrome (KS) which occurs in 1-1.72 out of 500-1000 male infants. The probability of retrieving sperm as the outcome could be asymmetrically different between patients with and without KS, therefore logistic regression analysis is not a well-qualified test for this type of data. This study has been designed to evaluate skewed regression model analysis for data collected from microsurgical testicular sperm extraction (micro-TESE) among azoospermic patients with and without non-mosaic KS syndrome. This cohort study compared the micro-TESE outcome between 134 men with classic KS and 537 men with NOA and normal karyotype who were referred to Royan Institute between 2009 and 2011. In addition to our main outcome, which was sperm retrieval, we also used logistic and skewed regression analyses to compare the following demographic and hormonal factors: age, level of follicle stimulating hormone (FSH), luteinizing hormone (LH), and testosterone between the two groups. A comparison of the micro-TESE between the KS and control groups showed a success rate of 28.4% (38/134) for the KS group and 22.2% (119/537) for the control group. In the KS group, a significantly difference (P<0.001) existed between testosterone levels for the successful sperm retrieval group (3.4 ± 0.48 mg/mL) compared to the unsuccessful sperm retrieval group (2.33 ± 0.23 mg/mL). The index for quasi Akaike information criterion (QAIC) had a goodness of fit of 74 for the skewed model which was lower than logistic regression (QAIC=85). According to the results, skewed regression is more efficient in estimating sperm retrieval success when the data from patients with KS are analyzed. This finding should be investigated by conducting additional studies with different data structures.

  1. Detecting DIF in Polytomous Items Using MACS, IRT and Ordinal Logistic Regression

    ERIC Educational Resources Information Center

    Elosua, Paula; Wells, Craig

    2013-01-01

    The purpose of the present study was to compare the Type I error rate and power of two model-based procedures, the mean and covariance structure model (MACS) and the item response theory (IRT), and an observed-score based procedure, ordinal logistic regression, for detecting differential item functioning (DIF) in polytomous items. A simulation…

  2. School Exits in the Milwaukee Parental Choice Program: Evidence of a Marketplace?

    ERIC Educational Resources Information Center

    Ford, Michael

    2011-01-01

    This article examines whether the large number of school exits from the Milwaukee school voucher program is evidence of a marketplace. Two logistic regression and multinomial logistic regression models tested the relation between the inability to draw large numbers of voucher students and the ability for a private school to remain viable. Data on…

  3. Exploring Audiologists' Language and Hearing Aid Uptake in Initial Rehabilitation Appointments.

    PubMed

    Sciacca, Anna; Meyer, Carly; Ekberg, Katie; Barr, Caitlin; Hickson, Louise

    2017-06-13

    The study aimed (a) to profile audiologists' language during the diagnosis and management planning phase of hearing assessment appointments and (b) to explore associations between audiologists' language and patients' decisions to obtain hearing aids. Sixty-two audiologist-patient dyads participated. Patient participants were aged 55 years or older. Hearing assessment appointments were audiovisually recorded and transcribed for analysis. Audiologists' language was profiled using two measures: general language complexity and use of jargon. A binomial, multivariate logistic regression analysis was conducted to investigate the associations between these language measures and hearing aid uptake. The logistic regression model revealed that the Flesch-Kincaid reading grade level of audiologists' language was significantly associated with hearing aid uptake. Patients were less likely to obtain hearing aids when audiologists' language was at a higher reading grade level. No associations were found between audiologists' use of jargon and hearing aid uptake. Audiologists' use of complex language may present a barrier for patients to understand hearing rehabilitation recommendations. Reduced understanding may limit patient participation in the decision-making process and result in patients being less willing to trial hearing aids. Clear, concise language is recommended to facilitate shared decision making.

  4. Perception of tourists regarding the smoke-free policy at Suvarnabhumi International Airport, Bangkok, Thailand.

    PubMed

    Sirichotiratana, Nithat; Yogi, Subash; Prutipinyo, Chardsumon

    2013-08-30

    This study was conducted during February-March 2012 to determine the perception and support regarding smoke-free policy among tourists at Suvarnabhumi International Airport, Bangkok, Thailand. In this cross-sectional study, 200 tourists (n = 200) were enrolled by convenience sampling and interviewed by structured questionnaire. Descriptive statistics, chi-square, and multinomial logistic regression were adopted in the study. Results revealed that half (50%) of the tourists were current smokers and 55% had visited Thailand twice or more. Three quarter (76%) of tourists indicated that they would visit Thailand again even if it had a 100% smoke-free regulation. Almost all (99%) of the tourists had supported for the smoke-free policy (partial ban and total ban), and current smokers had higher percentage of support than non-smokers. Two factors, current smoking status and knowledge level, were significantly associated with perception level. After analysis with Multinomial Logistic Regression, it was found that perception, country group, and presence of designated smoking room (DSR) were associated with smoke-free policy. Recommendation is that, at institution level effective monitoring system is needed at the airport. At policy level, the recommendation is that effective comprehensive policy needed to be emphasized to ensure smoke-free airport environment.

  5. Comparison of Logistic Regression and Artificial Neural Network in Low Back Pain Prediction: Second National Health Survey

    PubMed Central

    Parsaeian, M; Mohammad, K; Mahmoudi, M; Zeraati, H

    2012-01-01

    Background: The purpose of this investigation was to compare empirically predictive ability of an artificial neural network with a logistic regression in prediction of low back pain. Methods: Data from the second national health survey were considered in this investigation. This data includes the information of low back pain and its associated risk factors among Iranian people aged 15 years and older. Artificial neural network and logistic regression models were developed using a set of 17294 data and they were validated in a test set of 17295 data. Hosmer and Lemeshow recommendation for model selection was used in fitting the logistic regression. A three-layer perceptron with 9 inputs, 3 hidden and 1 output neurons was employed. The efficiency of two models was compared by receiver operating characteristic analysis, root mean square and -2 Loglikelihood criteria. Results: The area under the ROC curve (SE), root mean square and -2Loglikelihood of the logistic regression was 0.752 (0.004), 0.3832 and 14769.2, respectively. The area under the ROC curve (SE), root mean square and -2Loglikelihood of the artificial neural network was 0.754 (0.004), 0.3770 and 14757.6, respectively. Conclusions: Based on these three criteria, artificial neural network would give better performance than logistic regression. Although, the difference is statistically significant, it does not seem to be clinically significant. PMID:23113198

  6. Comparison of logistic regression and artificial neural network in low back pain prediction: second national health survey.

    PubMed

    Parsaeian, M; Mohammad, K; Mahmoudi, M; Zeraati, H

    2012-01-01

    The purpose of this investigation was to compare empirically predictive ability of an artificial neural network with a logistic regression in prediction of low back pain. Data from the second national health survey were considered in this investigation. This data includes the information of low back pain and its associated risk factors among Iranian people aged 15 years and older. Artificial neural network and logistic regression models were developed using a set of 17294 data and they were validated in a test set of 17295 data. Hosmer and Lemeshow recommendation for model selection was used in fitting the logistic regression. A three-layer perceptron with 9 inputs, 3 hidden and 1 output neurons was employed. The efficiency of two models was compared by receiver operating characteristic analysis, root mean square and -2 Loglikelihood criteria. The area under the ROC curve (SE), root mean square and -2Loglikelihood of the logistic regression was 0.752 (0.004), 0.3832 and 14769.2, respectively. The area under the ROC curve (SE), root mean square and -2Loglikelihood of the artificial neural network was 0.754 (0.004), 0.3770 and 14757.6, respectively. Based on these three criteria, artificial neural network would give better performance than logistic regression. Although, the difference is statistically significant, it does not seem to be clinically significant.

  7. Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes.

    PubMed

    Li, Baoyue; Lingsma, Hester F; Steyerberg, Ewout W; Lesaffre, Emmanuel

    2011-05-23

    Logistic random effects models are a popular tool to analyze multilevel also called hierarchical data with a binary or ordinal outcome. Here, we aim to compare different statistical software implementations of these models. We used individual patient data from 8509 patients in 231 centers with moderate and severe Traumatic Brain Injury (TBI) enrolled in eight Randomized Controlled Trials (RCTs) and three observational studies. We fitted logistic random effects regression models with the 5-point Glasgow Outcome Scale (GOS) as outcome, both dichotomized as well as ordinal, with center and/or trial as random effects, and as covariates age, motor score, pupil reactivity or trial. We then compared the implementations of frequentist and Bayesian methods to estimate the fixed and random effects. Frequentist approaches included R (lme4), Stata (GLLAMM), SAS (GLIMMIX and NLMIXED), MLwiN ([R]IGLS) and MIXOR, Bayesian approaches included WinBUGS, MLwiN (MCMC), R package MCMCglmm and SAS experimental procedure MCMC.Three data sets (the full data set and two sub-datasets) were analysed using basically two logistic random effects models with either one random effect for the center or two random effects for center and trial. For the ordinal outcome in the full data set also a proportional odds model with a random center effect was fitted. The packages gave similar parameter estimates for both the fixed and random effects and for the binary (and ordinal) models for the main study and when based on a relatively large number of level-1 (patient level) data compared to the number of level-2 (hospital level) data. However, when based on relatively sparse data set, i.e. when the numbers of level-1 and level-2 data units were about the same, the frequentist and Bayesian approaches showed somewhat different results. The software implementations differ considerably in flexibility, computation time, and usability. There are also differences in the availability of additional tools for model evaluation, such as diagnostic plots. The experimental SAS (version 9.2) procedure MCMC appeared to be inefficient. On relatively large data sets, the different software implementations of logistic random effects regression models produced similar results. Thus, for a large data set there seems to be no explicit preference (of course if there is no preference from a philosophical point of view) for either a frequentist or Bayesian approach (if based on vague priors). The choice for a particular implementation may largely depend on the desired flexibility, and the usability of the package. For small data sets the random effects variances are difficult to estimate. In the frequentist approaches the MLE of this variance was often estimated zero with a standard error that is either zero or could not be determined, while for Bayesian methods the estimates could depend on the chosen "non-informative" prior of the variance parameter. The starting value for the variance parameter may be also critical for the convergence of the Markov chain.

  8. Naval Research Logistics Quarterly. Volume 28. Number 3,

    DTIC Science & Technology

    1981-09-01

    denotes component-wise maximum. f has antone (isotone) differences on C x D if for cl < c2 and d, < d2, NAVAL RESEARCH LOGISTICS QUARTERLY VOL. 28...or negative correlations and linear or nonlinear regressions. Given are the mo- ments to order two and, for special cases, (he regression function and...data sets. We designate this bnb distribution as G - B - N(a, 0, v). The distribution admits only of positive correlation and linear regressions

  9. PREDICTION OF MALIGNANT BREAST LESIONS FROM MRI FEATURES: A COMPARISON OF ARTIFICIAL NEURAL NETWORK AND LOGISTIC REGRESSION TECHNIQUES

    PubMed Central

    McLaren, Christine E.; Chen, Wen-Pin; Nie, Ke; Su, Min-Ying

    2009-01-01

    Rationale and Objectives Dynamic contrast enhanced MRI (DCE-MRI) is a clinical imaging modality for detection and diagnosis of breast lesions. Analytical methods were compared for diagnostic feature selection and performance of lesion classification to differentiate between malignant and benign lesions in patients. Materials and Methods The study included 43 malignant and 28 benign histologically-proven lesions. Eight morphological parameters, ten gray level co-occurrence matrices (GLCM) texture features, and fourteen Laws’ texture features were obtained using automated lesion segmentation and quantitative feature extraction. Artificial neural network (ANN) and logistic regression analysis were compared for selection of the best predictors of malignant lesions among the normalized features. Results Using ANN, the final four selected features were compactness, energy, homogeneity, and Law_LS, with area under the receiver operating characteristic curve (AUC) = 0.82, and accuracy = 0.76. The diagnostic performance of these 4-features computed on the basis of logistic regression yielded AUC = 0.80 (95% CI, 0.688 to 0.905), similar to that of ANN. The analysis also shows that the odds of a malignant lesion decreased by 48% (95% CI, 25% to 92%) for every increase of 1 SD in the Law_LS feature, adjusted for differences in compactness, energy, and homogeneity. Using logistic regression with z-score transformation, a model comprised of compactness, NRL entropy, and gray level sum average was selected, and it had the highest overall accuracy of 0.75 among all models, with AUC = 0.77 (95% CI, 0.660 to 0.880). When logistic modeling of transformations using the Box-Cox method was performed, the most parsimonious model with predictors, compactness and Law_LS, had an AUC of 0.79 (95% CI, 0.672 to 0.898). Conclusion The diagnostic performance of models selected by ANN and logistic regression was similar. The analytic methods were found to be roughly equivalent in terms of predictive ability when a small number of variables were chosen. The robust ANN methodology utilizes a sophisticated non-linear model, while logistic regression analysis provides insightful information to enhance interpretation of the model features. PMID:19409817

  10. Predicting multi-level drug response with gene expression profile in multiple myeloma using hierarchical ordinal regression.

    PubMed

    Zhang, Xinyan; Li, Bingzong; Han, Huiying; Song, Sha; Xu, Hongxia; Hong, Yating; Yi, Nengjun; Zhuang, Wenzhuo

    2018-05-10

    Multiple myeloma (MM), like other cancers, is caused by the accumulation of genetic abnormalities. Heterogeneity exists in the patients' response to treatments, for example, bortezomib. This urges efforts to identify biomarkers from numerous molecular features and build predictive models for identifying patients that can benefit from a certain treatment scheme. However, previous studies treated the multi-level ordinal drug response as a binary response where only responsive and non-responsive groups are considered. It is desirable to directly analyze the multi-level drug response, rather than combining the response to two groups. In this study, we present a novel method to identify significantly associated biomarkers and then develop ordinal genomic classifier using the hierarchical ordinal logistic model. The proposed hierarchical ordinal logistic model employs the heavy-tailed Cauchy prior on the coefficients and is fitted by an efficient quasi-Newton algorithm. We apply our hierarchical ordinal regression approach to analyze two publicly available datasets for MM with five-level drug response and numerous gene expression measures. Our results show that our method is able to identify genes associated with the multi-level drug response and to generate powerful predictive models for predicting the multi-level response. The proposed method allows us to jointly fit numerous correlated predictors and thus build efficient models for predicting the multi-level drug response. The predictive model for the multi-level drug response can be more informative than the previous approaches. Thus, the proposed approach provides a powerful tool for predicting multi-level drug response and has important impact on cancer studies.

  11. A Generalized Logistic Regression Procedure to Detect Differential Item Functioning among Multiple Groups

    ERIC Educational Resources Information Center

    Magis, David; Raiche, Gilles; Beland, Sebastien; Gerard, Paul

    2011-01-01

    We present an extension of the logistic regression procedure to identify dichotomous differential item functioning (DIF) in the presence of more than two groups of respondents. Starting from the usual framework of a single focal group, we propose a general approach to estimate the item response functions in each group and to test for the presence…

  12. Predicting risk for portal vein thrombosis in acute pancreatitis patients: A comparison of radical basis function artificial neural network and logistic regression models.

    PubMed

    Fei, Yang; Hu, Jian; Gao, Kun; Tu, Jianfeng; Li, Wei-Qin; Wang, Wei

    2017-06-01

    To construct a radical basis function (RBF) artificial neural networks (ANNs) model to predict the incidence of acute pancreatitis (AP)-induced portal vein thrombosis. The analysis included 353 patients with AP who had admitted between January 2011 and December 2015. RBF ANNs model and logistic regression model were constructed based on eleven factors relevant to AP respectively. Statistical indexes were used to evaluate the value of the prediction in two models. The predict sensitivity, specificity, positive predictive value, negative predictive value and accuracy by RBF ANNs model for PVT were 73.3%, 91.4%, 68.8%, 93.0% and 87.7%, respectively. There were significant differences between the RBF ANNs and logistic regression models in these parameters (P<0.05). In addition, a comparison of the area under receiver operating characteristic curves of the two models showed a statistically significant difference (P<0.05). The RBF ANNs model is more likely to predict the occurrence of PVT induced by AP than logistic regression model. D-dimer, AMY, Hct and PT were important prediction factors of approval for AP-induced PVT. Copyright © 2017 Elsevier Inc. All rights reserved.

  13. Serum magnesium but not calcium was associated with hemorrhagic transformation in stroke overall and stroke subtypes: a case-control study in China.

    PubMed

    Tan, Ge; Yuan, Ruozhen; Wei, ChenChen; Xu, Mangmang; Liu, Ming

    2018-05-26

    Association between serum calcium and magnesium versus hemorrhagic transformation (HT) remains to be identified. A total of 1212 non-thrombolysis patients with serum calcium and magnesium collected within 24 h from stroke onset were enrolled. Backward stepwise multivariate logistic regression analysis was conducted to investigate association between calcium and magnesium versus HT. Calcium and magnesium were entered into logistic regression analysis in two models, separately: model 1, as continuous variable (per 1-mmol/L increase), and model 2, as four-categorized variable (being collapsed into quartiles). HT occurred in 140 patients (11.6%). Serum calcium was slightly lower in patients with HT than in patient without HT (P = 0.273). But serum magnesium was significantly lower in patients with HT than in patients without HT (P = 0.007). In logistic regression analysis, calcium displayed no association with HT. Magnesium, as either continuous or four-categorized variable, was independently and inversely associated with HT in stroke overall and stroke of large-artery atherosclerosis (LAA). The results demonstrated that serum calcium had no association with HT in patients without thrombolysis after acute ischemic stroke. Serum magnesium in low level was independently associated with increasing HT in stroke overall and particularly in stroke of LAA.

  14. Binomial outcomes in dataset with some clusters of size two: can the dependence of twins be accounted for? A simulation study comparing the reliability of statistical methods based on a dataset of preterm infants.

    PubMed

    Sauzet, Odile; Peacock, Janet L

    2017-07-20

    The analysis of perinatal outcomes often involves datasets with some multiple births. These are datasets mostly formed of independent observations and a limited number of clusters of size two (twins) and maybe of size three or more. This non-independence needs to be accounted for in the statistical analysis. Using simulated data based on a dataset of preterm infants we have previously investigated the performance of several approaches to the analysis of continuous outcomes in the presence of some clusters of size two. Mixed models have been developed for binomial outcomes but very little is known about their reliability when only a limited number of small clusters are present. Using simulated data based on a dataset of preterm infants we investigated the performance of several approaches to the analysis of binomial outcomes in the presence of some clusters of size two. Logistic models, several methods of estimation for the logistic random intercept models and generalised estimating equations were compared. The presence of even a small percentage of twins means that a logistic regression model will underestimate all parameters but a logistic random intercept model fails to estimate the correlation between siblings if the percentage of twins is too small and will provide similar estimates to logistic regression. The method which seems to provide the best balance between estimation of the standard error and the parameter for any percentage of twins is the generalised estimating equations. This study has shown that the number of covariates or the level two variance do not necessarily affect the performance of the various methods used to analyse datasets containing twins but when the percentage of small clusters is too small, mixed models cannot capture the dependence between siblings.

  15. Simulating land-use changes by incorporating spatial autocorrelation and self-organization in CLUE-S modeling: a case study in Zengcheng District, Guangzhou, China

    NASA Astrophysics Data System (ADS)

    Mei, Zhixiong; Wu, Hao; Li, Shiyun

    2018-06-01

    The Conversion of Land Use and its Effects at Small regional extent (CLUE-S), which is a widely used model for land-use simulation, utilizes logistic regression to estimate the relationships between land use and its drivers, and thus, predict land-use change probabilities. However, logistic regression disregards possible spatial autocorrelation and self-organization in land-use data. Autologistic regression can depict spatial autocorrelation but cannot address self-organization, while logistic regression by considering only self-organization (NElogistic regression) fails to capture spatial autocorrelation. Therefore, this study developed a regression (NE-autologistic regression) method, which incorporated both spatial autocorrelation and self-organization, to improve CLUE-S. The Zengcheng District of Guangzhou, China was selected as the study area. The land-use data of 2001, 2005, and 2009, as well as 10 typical driving factors, were used to validate the proposed regression method and the improved CLUE-S model. Then, three future land-use scenarios in 2020: the natural growth scenario, ecological protection scenario, and economic development scenario, were simulated using the improved model. Validation results showed that NE-autologistic regression performed better than logistic regression, autologistic regression, and NE-logistic regression in predicting land-use change probabilities. The spatial allocation accuracy and kappa values of NE-autologistic-CLUE-S were higher than those of logistic-CLUE-S, autologistic-CLUE-S, and NE-logistic-CLUE-S for the simulations of two periods, 2001-2009 and 2005-2009, which proved that the improved CLUE-S model achieved the best simulation and was thereby effective to a certain extent. The scenario simulation results indicated that under all three scenarios, traffic land and residential/industrial land would increase, whereas arable land and unused land would decrease during 2009-2020. Apparent differences also existed in the simulated change sizes and locations of each land-use type under different scenarios. The results not only demonstrate the validity of the improved model but also provide a valuable reference for relevant policy-makers.

  16. Ordinal logistic regression analysis on the nutritional status of children in KarangKitri village

    NASA Astrophysics Data System (ADS)

    Ohyver, Margaretha; Yongharto, Kimmy Octavian

    2015-09-01

    Ordinal logistic regression is a statistical technique that can be used to describe the relationship between ordinal response variable with one or more independent variables. This method has been used in various fields including in the health field. In this research, ordinal logistic regression is used to describe the relationship between nutritional status of children with age, gender, height, and family status. Nutritional status of children in this research is divided into over nutrition, well nutrition, less nutrition, and malnutrition. The purpose for this research is to describe the characteristics of children in the KarangKitri Village and to determine the factors that influence the nutritional status of children in the KarangKitri village. There are three things that obtained from this research. First, there are still children who are not categorized as well nutritional status. Second, there are children who come from sufficient economic level which include in not normal status. Third, the factors that affect the nutritional level of children are age, family status, and height.

  17. Access disparities to Magnet hospitals for patients undergoing neurosurgical operations

    PubMed Central

    Missios, Symeon; Bekelis, Kimon

    2017-01-01

    Background Centers of excellence focusing on quality improvement have demonstrated superior outcomes for a variety of surgical interventions. We investigated the presence of access disparities to hospitals recognized by the Magnet Recognition Program of the American Nurses Credentialing Center (ANCC) for patients undergoing neurosurgical operations. Methods We performed a cohort study of all neurosurgery patients who were registered in the New York Statewide Planning and Research Cooperative System (SPARCS) database from 2009–2013. We examined the association of African-American race and lack of insurance with Magnet status hospitalization for neurosurgical procedures. A mixed effects propensity adjusted multivariable regression analysis was used to control for confounding. Results During the study period, 190,535 neurosurgical patients met the inclusion criteria. Using a multivariable logistic regression, we demonstrate that African-Americans had lower admission rates to Magnet institutions (OR 0.62; 95% CI, 0.58–0.67). This persisted in a mixed effects logistic regression model (OR 0.77; 95% CI, 0.70–0.83) to adjust for clustering at the patient county level, and a propensity score adjusted logistic regression model (OR 0.75; 95% CI, 0.69–0.82). Additionally, lack of insurance was associated with lower admission rates to Magnet institutions (OR 0.71; 95% CI, 0.68–0.73), in a multivariable logistic regression model. This persisted in a mixed effects logistic regression model (OR 0.72; 95% CI, 0.69–0.74), and a propensity score adjusted logistic regression model (OR 0.72; 95% CI, 0.69–0.75). Conclusions Using a comprehensive all-payer cohort of neurosurgery patients in New York State we identified an association of African-American race and lack of insurance with lower rates of admission to Magnet hospitals. PMID:28684152

  18. Adjusting for Confounding in Early Postlaunch Settings: Going Beyond Logistic Regression Models.

    PubMed

    Schmidt, Amand F; Klungel, Olaf H; Groenwold, Rolf H H

    2016-01-01

    Postlaunch data on medical treatments can be analyzed to explore adverse events or relative effectiveness in real-life settings. These analyses are often complicated by the number of potential confounders and the possibility of model misspecification. We conducted a simulation study to compare the performance of logistic regression, propensity score, disease risk score, and stabilized inverse probability weighting methods to adjust for confounding. Model misspecification was induced in the independent derivation dataset. We evaluated performance using relative bias confidence interval coverage of the true effect, among other metrics. At low events per coefficient (1.0 and 0.5), the logistic regression estimates had a large relative bias (greater than -100%). Bias of the disease risk score estimates was at most 13.48% and 18.83%. For the propensity score model, this was 8.74% and >100%, respectively. At events per coefficient of 1.0 and 0.5, inverse probability weighting frequently failed or reduced to a crude regression, resulting in biases of -8.49% and 24.55%. Coverage of logistic regression estimates became less than the nominal level at events per coefficient ≤5. For the disease risk score, inverse probability weighting, and propensity score, coverage became less than nominal at events per coefficient ≤2.5, ≤1.0, and ≤1.0, respectively. Bias of misspecified disease risk score models was 16.55%. In settings with low events/exposed subjects per coefficient, disease risk score methods can be useful alternatives to logistic regression models, especially when propensity score models cannot be used. Despite better performance of disease risk score methods than logistic regression and propensity score models in small events per coefficient settings, bias, and coverage still deviated from nominal.

  19. Refining Stimulus Parameters in Assessing Infant Speech Perception Using Visual Reinforcement Infant Speech Discrimination: Sensation Level.

    PubMed

    Uhler, Kristin M; Baca, Rosalinda; Dudas, Emily; Fredrickson, Tammy

    2015-01-01

    Speech perception measures have long been considered an integral piece of the audiological assessment battery. Currently, a prelinguistic, standardized measure of speech perception is missing in the clinical assessment battery for infants and young toddlers. Such a measure would allow systematic assessment of speech perception abilities of infants as well as the potential to investigate the impact early identification of hearing loss and early fitting of amplification have on the auditory pathways. To investigate the impact of sensation level (SL) on the ability of infants with normal hearing (NH) to discriminate /a-i/ and /ba-da/ and to determine if performance on the two contrasts are significantly different in predicting the discrimination criterion. The design was based on a survival analysis model for event occurrence and a repeated measures logistic model for binary outcomes. The outcome for survival analysis was the minimum SL for criterion and the outcome for the logistic regression model was the presence/absence of achieving the criterion. Criterion achievement was designated when an infant's proportion correct score was >0.75 on the discrimination performance task. Twenty-two infants with NH sensitivity participated in this study. There were 9 males and 13 females, aged 6-14 mo. Testing took place over two to three sessions. The first session consisted of a hearing test, threshold assessment of the two speech sounds (/a/ and /i/), and if time and attention allowed, visual reinforcement infant speech discrimination (VRISD). The second session consisted of VRISD assessment for the two test contrasts (/a-i/ and /ba-da/). The presentation level started at 50 dBA. If the infant was unable to successfully achieve criterion (>0.75) at 50 dBA, the presentation level was increased to 70 dBA followed by 60 dBA. Data examination included an event analysis, which provided the probability of criterion distribution across SL. The second stage of the analysis was a repeated measures logistic regression where SL and contrast were used to predict the likelihood of speech discrimination criterion. Infants were able to reach criterion for the /a-i/ contrast at statistically lower SLs when compared to /ba-da/. There were six infants who never reached criterion for /ba-da/ and one never reached criterion for /a-i/. The conditional probability of not reaching criterion by 70 dB SL was 0% for /a-i/ and 21% for /ba-da/. The predictive logistic regression model showed that children were more likely to discriminate the /a-i/ even when controlling for SL. Nearly all normal-hearing infants can demonstrate discrimination criterion of a vowel contrast at 60 dB SL, while a level of ≥70 dB SL may be needed to allow all infants to demonstrate discrimination criterion of a difficult consonant contrast. American Academy of Audiology.

  20. Using ROC curves to compare neural networks and logistic regression for modeling individual noncatastrophic tree mortality

    Treesearch

    Susan L. King

    2003-01-01

    The performance of two classifiers, logistic regression and neural networks, are compared for modeling noncatastrophic individual tree mortality for 21 species of trees in West Virginia. The output of the classifier is usually a continuous number between 0 and 1. A threshold is selected between 0 and 1 and all of the trees below the threshold are classified as...

  1. [Calculating Pearson residual in logistic regressions: a comparison between SPSS and SAS].

    PubMed

    Xu, Hao; Zhang, Tao; Li, Xiao-song; Liu, Yuan-yuan

    2015-01-01

    To compare the results of Pearson residual calculations in logistic regression models using SPSS and SAS. We reviewed Pearson residual calculation methods, and used two sets of data to test logistic models constructed by SPSS and STATA. One model contained a small number of covariates compared to the number of observed. The other contained a similar number of covariates as the number of observed. The two software packages produced similar Pearson residual estimates when the models contained a similar number of covariates as the number of observed, but the results differed when the number of observed was much greater than the number of covariates. The two software packages produce different results of Pearson residuals, especially when the models contain a small number of covariates. Further studies are warranted.

  2. Epidemiologic programs for computers and calculators. A microcomputer program for multiple logistic regression by unconditional and conditional maximum likelihood methods.

    PubMed

    Campos-Filho, N; Franco, E L

    1989-02-01

    A frequent procedure in matched case-control studies is to report results from the multivariate unmatched analyses if they do not differ substantially from the ones obtained after conditioning on the matching variables. Although conceptually simple, this rule requires that an extensive series of logistic regression models be evaluated by both the conditional and unconditional maximum likelihood methods. Most computer programs for logistic regression employ only one maximum likelihood method, which requires that the analyses be performed in separate steps. This paper describes a Pascal microcomputer (IBM PC) program that performs multiple logistic regression by both maximum likelihood estimation methods, which obviates the need for switching between programs to obtain relative risk estimates from both matched and unmatched analyses. The program calculates most standard statistics and allows factoring of categorical or continuous variables by two distinct methods of contrast. A built-in, descriptive statistics option allows the user to inspect the distribution of cases and controls across categories of any given variable.

  3. EXpectation Propagation LOgistic REgRession (EXPLORER): Distributed Privacy-Preserving Online Model Learning

    PubMed Central

    Wang, Shuang; Jiang, Xiaoqian; Wu, Yuan; Cui, Lijuan; Cheng, Samuel; Ohno-Machado, Lucila

    2013-01-01

    We developed an EXpectation Propagation LOgistic REgRession (EXPLORER) model for distributed privacy-preserving online learning. The proposed framework provides a high level guarantee for protecting sensitive information, since the information exchanged between the server and the client is the encrypted posterior distribution of coefficients. Through experimental results, EXPLORER shows the same performance (e.g., discrimination, calibration, feature selection etc.) as the traditional frequentist Logistic Regression model, but provides more flexibility in model updating. That is, EXPLORER can be updated one point at a time rather than having to retrain the entire data set when new observations are recorded. The proposed EXPLORER supports asynchronized communication, which relieves the participants from coordinating with one another, and prevents service breakdown from the absence of participants or interrupted communications. PMID:23562651

  4. Internal exposure levels of typical POPs and their associations with childhood asthma in Shanghai, China.

    PubMed

    Meng, Ge; Feng, Yan; Nie, Zhiqing; Wu, Xiaomeng; Wei, Hongying; Wu, Shaowei; Yin, Yong; Wang, Yan

    2016-04-01

    Polybrominated diphenyl ethers (PBDEs), polychlorinated biphenyls (PCBs) and organochlorine pesticides (OCPs) are common persistent organic pollutants (POPs) that may be associated with childhood asthma. The concentrations of PBDEs, PCBs and OCPs were analyzed in pooled serum samples from both asthmatic and non-asthmatic children. The differences in the internal exposure levels between the case and control groups were tested (p value <0.0012). The associations between the internal exposure concentrations of the POPs and childhood asthma were estimated based on the odds ratios (ORs) calculated using logistic regression models. There were significant differences in three PBDEs, 26 PCBs and seven OCPs between the two groups, with significantly higher levels in the cases. The multiple logistic regression models demonstrated that the internal exposure concentrations of a number of the POPs (23 PCBs, p,p'-DDE and α-HCH) were positively associated with childhood asthma. Some synergistic effects were observed when the children were co-exposed to the chemicals. BDE-209 was positively associated with asthma aggravation. This study indicates the potential relationships between the internal exposure concentrations of particular POPs and the development of childhood asthma. Copyright © 2015 Elsevier Inc. All rights reserved.

  5. Clustering performance comparison using K-means and expectation maximization algorithms.

    PubMed

    Jung, Yong Gyu; Kang, Min Soo; Heo, Jun

    2014-11-14

    Clustering is an important means of data mining based on separating data categories by similar features. Unlike the classification algorithm, clustering belongs to the unsupervised type of algorithms. Two representatives of the clustering algorithms are the K -means and the expectation maximization (EM) algorithm. Linear regression analysis was extended to the category-type dependent variable, while logistic regression was achieved using a linear combination of independent variables. To predict the possibility of occurrence of an event, a statistical approach is used. However, the classification of all data by means of logistic regression analysis cannot guarantee the accuracy of the results. In this paper, the logistic regression analysis is applied to EM clusters and the K -means clustering method for quality assessment of red wine, and a method is proposed for ensuring the accuracy of the classification results.

  6. Multinomial logistic regression modelling of obesity and overweight among primary school students in a rural area of Negeri Sembilan

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ghazali, Amirul Syafiq Mohd; Ali, Zalila; Noor, Norlida Mohd

    Multinomial logistic regression is widely used to model the outcomes of a polytomous response variable, a categorical dependent variable with more than two categories. The model assumes that the conditional mean of the dependent categorical variables is the logistic function of an affine combination of predictor variables. Its procedure gives a number of logistic regression models that make specific comparisons of the response categories. When there are q categories of the response variable, the model consists of q-1 logit equations which are fitted simultaneously. The model is validated by variable selection procedures, tests of regression coefficients, a significant test ofmore » the overall model, goodness-of-fit measures, and validation of predicted probabilities using odds ratio. This study used the multinomial logistic regression model to investigate obesity and overweight among primary school students in a rural area on the basis of their demographic profiles, lifestyles and on the diet and food intake. The results indicated that obesity and overweight of students are related to gender, religion, sleep duration, time spent on electronic games, breakfast intake in a week, with whom meals are taken, protein intake, and also, the interaction between breakfast intake in a week with sleep duration, and the interaction between gender and protein intake.« less

  7. Multinomial logistic regression modelling of obesity and overweight among primary school students in a rural area of Negeri Sembilan

    NASA Astrophysics Data System (ADS)

    Ghazali, Amirul Syafiq Mohd; Ali, Zalila; Noor, Norlida Mohd; Baharum, Adam

    2015-10-01

    Multinomial logistic regression is widely used to model the outcomes of a polytomous response variable, a categorical dependent variable with more than two categories. The model assumes that the conditional mean of the dependent categorical variables is the logistic function of an affine combination of predictor variables. Its procedure gives a number of logistic regression models that make specific comparisons of the response categories. When there are q categories of the response variable, the model consists of q-1 logit equations which are fitted simultaneously. The model is validated by variable selection procedures, tests of regression coefficients, a significant test of the overall model, goodness-of-fit measures, and validation of predicted probabilities using odds ratio. This study used the multinomial logistic regression model to investigate obesity and overweight among primary school students in a rural area on the basis of their demographic profiles, lifestyles and on the diet and food intake. The results indicated that obesity and overweight of students are related to gender, religion, sleep duration, time spent on electronic games, breakfast intake in a week, with whom meals are taken, protein intake, and also, the interaction between breakfast intake in a week with sleep duration, and the interaction between gender and protein intake.

  8. The relationship between serum and urinary Fetuin-A levels and kidney stone formation among kidney stone patients.

    PubMed

    Mehrsai, Abdolrasoul; Guitynavard, Fateme; Nikoobakht, Mohammad Reza; Gooran, Shahram; Ahmadi, Ayat

    2017-01-01

    Mineralization inhibitors are required to prevent the precipitation of minerals and inhibit the formation of kidney stones and other ectopic calcifications. In laboratory studies, Fetuin-A as a glycoprotein has inhibited hydroxyapatite precipitation in calcium and phosphate supersaturated solutions; however, information about patients with kidney stones is limited. The aim of this study was to investigate the association of serum and urinary Fetuin-A levels with calcium oxalate kidney stones. In this case-control study, 30 patients with kidney stones and 30 healthy individuals without any history of urolithiasis who were referred to the urology ward of Sina Hospital of Tehran, Iran, in 2015 were entered into the study. All patients underwent computerized tomography scans. After collecting demographic information, serum and urine levels of Fetuin-A and some other calcification inhibitors and promoters, were measured and compared using T-test, Mann-Whitney and logistic regression between the two study groups. Patients with kidney stones, on average, had lower levels of Serum Fetuin-A (1522.27 ±755.39 vs. 1914.64 ±733.76 μg/ml; P = 0.046) as well as lower levels of Urine Fetuin-A (944.62 ±188.5 vs. 1409.68 ±295.26 μg/ml; P <0.001). Multivariate logistic analysis showed that urinary calcium and serum creatinine are the risk factors and Fetuin-A is a urinary protective factor for kidney stones. PFC Our study showed that patients with kidney stones had lower serum and urinary levels of Fetuin-A. In the logistic regression model, urinary Fetuin-A was reported as a protective factor for kidney stones.

  9. Effect of folic acid on appetite in children: ordinal logistic and fuzzy logistic regressions.

    PubMed

    Namdari, Mahshid; Abadi, Alireza; Taheri, S Mahmoud; Rezaei, Mansour; Kalantari, Naser; Omidvar, Nasrin

    2014-03-01

    Reduced appetite and low food intake are often a concern in preschool children, since it can lead to malnutrition, a leading cause of impaired growth and mortality in childhood. It is occasionally considered that folic acid has a positive effect on appetite enhancement and consequently growth in children. The aim of this study was to assess the effect of folic acid on the appetite of preschool children 3 to 6 y old. The study sample included 127 children ages 3 to 6 who were randomly selected from 20 preschools in the city of Tehran in 2011. Since appetite was measured by linguistic terms, a fuzzy logistic regression was applied for modeling. The obtained results were compared with a statistical ordinal logistic model. After controlling for the potential confounders, in a statistical ordinal logistic model, serum folate showed a significantly positive effect on appetite. A small but positive effect of folate was detected by fuzzy logistic regression. Based on fuzzy regression, the risk for poor appetite in preschool children was related to the employment status of their mothers. In this study, a positive association was detected between the levels of serum folate and improved appetite. For further investigation, a randomized controlled, double-blind clinical trial could be helpful to address causality. Copyright © 2014 Elsevier Inc. All rights reserved.

  10. The effect of high leverage points on the logistic ridge regression estimator having multicollinearity

    NASA Astrophysics Data System (ADS)

    Ariffin, Syaiba Balqish; Midi, Habshah

    2014-06-01

    This article is concerned with the performance of logistic ridge regression estimation technique in the presence of multicollinearity and high leverage points. In logistic regression, multicollinearity exists among predictors and in the information matrix. The maximum likelihood estimator suffers a huge setback in the presence of multicollinearity which cause regression estimates to have unduly large standard errors. To remedy this problem, a logistic ridge regression estimator is put forward. It is evident that the logistic ridge regression estimator outperforms the maximum likelihood approach for handling multicollinearity. The effect of high leverage points are then investigated on the performance of the logistic ridge regression estimator through real data set and simulation study. The findings signify that logistic ridge regression estimator fails to provide better parameter estimates in the presence of both high leverage points and multicollinearity.

  11. Determination of osteoporosis risk factors using a multiple logistic regression model in postmenopausal Turkish women.

    PubMed

    Akkus, Zeki; Camdeviren, Handan; Celik, Fatma; Gur, Ali; Nas, Kemal

    2005-09-01

    To determine the risk factors of osteoporosis using a multiple binary logistic regression method and to assess the risk variables for osteoporosis, which is a major and growing health problem in many countries. We presented a case-control study, consisting of 126 postmenopausal healthy women as control group and 225 postmenopausal osteoporotic women as the case group. The study was carried out in the Department of Physical Medicine and Rehabilitation, Dicle University, Diyarbakir, Turkey between 1999-2002. The data from the 351 participants were collected using a standard questionnaire that contains 43 variables. A multiple logistic regression model was then used to evaluate the data and to find the best regression model. We classified 80.1% (281/351) of the participants using the regression model. Furthermore, the specificity value of the model was 67% (84/126) of the control group while the sensitivity value was 88% (197/225) of the case group. We found the distribution of residual values standardized for final model to be exponential using the Kolmogorow-Smirnow test (p=0.193). The receiver operating characteristic curve was found successful to predict patients with risk for osteoporosis. This study suggests that low levels of dietary calcium intake, physical activity, education, and longer duration of menopause are independent predictors of the risk of low bone density in our population. Adequate dietary calcium intake in combination with maintaining a daily physical activity, increasing educational level, decreasing birth rate, and duration of breast-feeding may contribute to healthy bones and play a role in practical prevention of osteoporosis in Southeast Anatolia. In addition, the findings of the present study indicate that the use of multivariate statistical method as a multiple logistic regression in osteoporosis, which maybe influenced by many variables, is better than univariate statistical evaluation.

  12. Perception of Tourists Regarding the Smoke-Free Policy at Suvarnabhumi International Airport, Bangkok, Thailand

    PubMed Central

    Sirichotiratana, Nithat; Yogi, Subash; Prutipinyo, Chardsumon

    2013-01-01

    This study was conducted during February-March 2012 to determine the perception and support regarding smoke-free policy among tourists at Suvarnabhumi International Airport, Bangkok, Thailand. In this cross-sectional study, 200 tourists (n = 200) were enrolled by convenience sampling and interviewed by structured questionnaire. Descriptive statistics, chi-square, and multinomial logistic regression were adopted in the study. Results revealed that half (50%) of the tourists were current smokers and 55% had visited Thailand twice or more. Three quarter (76%) of tourists indicated that they would visit Thailand again even if it had a 100% smoke-free regulation. Almost all (99%) of the tourists had supported for the smoke-free policy (partial ban and total ban), and current smokers had higher percentage of support than non-smokers. Two factors, current smoking status and knowledge level, were significantly associated with perception level. After analysis with Multinomial Logistic Regression, it was found that perception, country group, and presence of designated smoking room (DSR) were associated with smoke-free policy. Recommendation is that, at institution level effective monitoring system is needed at the airport. At policy level, the recommendation is that effective comprehensive policy needed to be emphasized to ensure smoke-free airport environment. PMID:23999549

  13. Analysis of nonlinear relationships in dual epidemics, and its application to the management of grapevine downy and powdery mildews.

    PubMed

    Savary, Serge; Delbac, Lionel; Rochas, Amélie; Taisant, Guillaume; Willocquet, Laetitia

    2009-08-01

    Dual epidemics are defined as epidemics developing on two or several plant organs in the course of a cropping season. Agricultural pathosystems where such epidemics develop are often very important, because the harvestable part is one of the organs affected. These epidemics also are often difficult to manage, because the linkage between epidemiological components occurring on different organs is poorly understood, and because prediction of the risk toward the harvestable organs is difficult. In the case of downy mildew (DM) and powdery mildew (PM) of grapevine, nonlinear modeling and logistic regression indicated nonlinearity in the foliage-cluster relationships. Nonlinear modeling enabled the parameterization of a transmission coefficient that numerically links the two components, leaves and clusters, in DM and PM epidemics. Logistic regression analysis yielded a series of probabilistic models that enabled predicting preset levels of cluster infection risks based on DM and PM severities on the foliage at successive crop stages. The usefulness of this framework for tactical decision-making for disease control is discussed.

  14. Science of Test Research Consortium: Year Two Final Report

    DTIC Science & Technology

    2012-10-02

    July 2012. Analysis of an Intervention for Small Unmanned Aerial System ( SUAS ) Accidents, submitted to Quality Engineering, LQEN-2012-0056. Stone... Systems Engineering. Wolf, S. E., R. R. Hill, and J. J. Pignatiello. June 2012. Using Neural Networks and Logistic Regression to Model Small Unmanned ...Human Retina. 6. Wolf, S. E. March 2012. Modeling Small Unmanned Aerial System Mishaps using Logistic Regression and Artificial Neural Networks. 7

  15. On the use and misuse of scalar scores of confounders in design and analysis of observational studies.

    PubMed

    Pfeiffer, R M; Riedl, R

    2015-08-15

    We assess the asymptotic bias of estimates of exposure effects conditional on covariates when summary scores of confounders, instead of the confounders themselves, are used to analyze observational data. First, we study regression models for cohort data that are adjusted for summary scores. Second, we derive the asymptotic bias for case-control studies when cases and controls are matched on a summary score, and then analyzed either using conditional logistic regression or by unconditional logistic regression adjusted for the summary score. Two scores, the propensity score (PS) and the disease risk score (DRS) are studied in detail. For cohort analysis, when regression models are adjusted for the PS, the estimated conditional treatment effect is unbiased only for linear models, or at the null for non-linear models. Adjustment of cohort data for DRS yields unbiased estimates only for linear regression; all other estimates of exposure effects are biased. Matching cases and controls on DRS and analyzing them using conditional logistic regression yields unbiased estimates of exposure effect, whereas adjusting for the DRS in unconditional logistic regression yields biased estimates, even under the null hypothesis of no association. Matching cases and controls on the PS yield unbiased estimates only under the null for both conditional and unconditional logistic regression, adjusted for the PS. We study the bias for various confounding scenarios and compare our asymptotic results with those from simulations with limited sample sizes. To create realistic correlations among multiple confounders, we also based simulations on a real dataset. Copyright © 2015 John Wiley & Sons, Ltd.

  16. Extreme Sparse Multinomial Logistic Regression: A Fast and Robust Framework for Hyperspectral Image Classification

    NASA Astrophysics Data System (ADS)

    Cao, Faxian; Yang, Zhijing; Ren, Jinchang; Ling, Wing-Kuen; Zhao, Huimin; Marshall, Stephen

    2017-12-01

    Although the sparse multinomial logistic regression (SMLR) has provided a useful tool for sparse classification, it suffers from inefficacy in dealing with high dimensional features and manually set initial regressor values. This has significantly constrained its applications for hyperspectral image (HSI) classification. In order to tackle these two drawbacks, an extreme sparse multinomial logistic regression (ESMLR) is proposed for effective classification of HSI. First, the HSI dataset is projected to a new feature space with randomly generated weight and bias. Second, an optimization model is established by the Lagrange multiplier method and the dual principle to automatically determine a good initial regressor for SMLR via minimizing the training error and the regressor value. Furthermore, the extended multi-attribute profiles (EMAPs) are utilized for extracting both the spectral and spatial features. A combinational linear multiple features learning (MFL) method is proposed to further enhance the features extracted by ESMLR and EMAPs. Finally, the logistic regression via the variable splitting and the augmented Lagrangian (LORSAL) is adopted in the proposed framework for reducing the computational time. Experiments are conducted on two well-known HSI datasets, namely the Indian Pines dataset and the Pavia University dataset, which have shown the fast and robust performance of the proposed ESMLR framework.

  17. Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes

    PubMed Central

    2011-01-01

    Background Logistic random effects models are a popular tool to analyze multilevel also called hierarchical data with a binary or ordinal outcome. Here, we aim to compare different statistical software implementations of these models. Methods We used individual patient data from 8509 patients in 231 centers with moderate and severe Traumatic Brain Injury (TBI) enrolled in eight Randomized Controlled Trials (RCTs) and three observational studies. We fitted logistic random effects regression models with the 5-point Glasgow Outcome Scale (GOS) as outcome, both dichotomized as well as ordinal, with center and/or trial as random effects, and as covariates age, motor score, pupil reactivity or trial. We then compared the implementations of frequentist and Bayesian methods to estimate the fixed and random effects. Frequentist approaches included R (lme4), Stata (GLLAMM), SAS (GLIMMIX and NLMIXED), MLwiN ([R]IGLS) and MIXOR, Bayesian approaches included WinBUGS, MLwiN (MCMC), R package MCMCglmm and SAS experimental procedure MCMC. Three data sets (the full data set and two sub-datasets) were analysed using basically two logistic random effects models with either one random effect for the center or two random effects for center and trial. For the ordinal outcome in the full data set also a proportional odds model with a random center effect was fitted. Results The packages gave similar parameter estimates for both the fixed and random effects and for the binary (and ordinal) models for the main study and when based on a relatively large number of level-1 (patient level) data compared to the number of level-2 (hospital level) data. However, when based on relatively sparse data set, i.e. when the numbers of level-1 and level-2 data units were about the same, the frequentist and Bayesian approaches showed somewhat different results. The software implementations differ considerably in flexibility, computation time, and usability. There are also differences in the availability of additional tools for model evaluation, such as diagnostic plots. The experimental SAS (version 9.2) procedure MCMC appeared to be inefficient. Conclusions On relatively large data sets, the different software implementations of logistic random effects regression models produced similar results. Thus, for a large data set there seems to be no explicit preference (of course if there is no preference from a philosophical point of view) for either a frequentist or Bayesian approach (if based on vague priors). The choice for a particular implementation may largely depend on the desired flexibility, and the usability of the package. For small data sets the random effects variances are difficult to estimate. In the frequentist approaches the MLE of this variance was often estimated zero with a standard error that is either zero or could not be determined, while for Bayesian methods the estimates could depend on the chosen "non-informative" prior of the variance parameter. The starting value for the variance parameter may be also critical for the convergence of the Markov chain. PMID:21605357

  18. Logistic Mixed Models to Investigate Implicit and Explicit Belief Tracking.

    PubMed

    Lages, Martin; Scheel, Anne

    2016-01-01

    We investigated the proposition of a two-systems Theory of Mind in adults' belief tracking. A sample of N = 45 participants predicted the choice of one of two opponent players after observing several rounds in an animated card game. Three matches of this card game were played and initial gaze direction on target and subsequent choice predictions were recorded for each belief task and participant. We conducted logistic regressions with mixed effects on the binary data and developed Bayesian logistic mixed models to infer implicit and explicit mentalizing in true belief and false belief tasks. Although logistic regressions with mixed effects predicted the data well a Bayesian logistic mixed model with latent task- and subject-specific parameters gave a better account of the data. As expected explicit choice predictions suggested a clear understanding of true and false beliefs (TB/FB). Surprisingly, however, model parameters for initial gaze direction also indicated belief tracking. We discuss why task-specific parameters for initial gaze directions are different from choice predictions yet reflect second-order perspective taking.

  19. Sample size determination for logistic regression on a logit-normal distribution.

    PubMed

    Kim, Seongho; Heath, Elisabeth; Heilbrun, Lance

    2017-06-01

    Although the sample size for simple logistic regression can be readily determined using currently available methods, the sample size calculation for multiple logistic regression requires some additional information, such as the coefficient of determination ([Formula: see text]) of a covariate of interest with other covariates, which is often unavailable in practice. The response variable of logistic regression follows a logit-normal distribution which can be generated from a logistic transformation of a normal distribution. Using this property of logistic regression, we propose new methods of determining the sample size for simple and multiple logistic regressions using a normal transformation of outcome measures. Simulation studies and a motivating example show several advantages of the proposed methods over the existing methods: (i) no need for [Formula: see text] for multiple logistic regression, (ii) available interim or group-sequential designs, and (iii) much smaller required sample size.

  20. 4D-Fingerprint Categorical QSAR Models for Skin Sensitization Based on Classification Local Lymph Node Assay Measures

    PubMed Central

    Li, Yi; Tseng, Yufeng J.; Pan, Dahua; Liu, Jianzhong; Kern, Petra S.; Gerberick, G. Frank; Hopfinger, Anton J.

    2008-01-01

    Currently, the only validated methods to identify skin sensitization effects are in vivo models, such as the Local Lymph Node Assay (LLNA) and guinea pig studies. There is a tremendous need, in particular due to novel legislation, to develop animal alternatives, eg. Quantitative Structure-Activity Relationship (QSAR) models. Here, QSAR models for skin sensitization using LLNA data have been constructed. The descriptors used to generate these models are derived from the 4D-molecular similarity paradigm and are referred to as universal 4D-fingerprints. A training set of 132 structurally diverse compounds and a test set of 15 structurally diverse compounds were used in this study. The statistical methodologies used to build the models are logistic regression (LR), and partial least square coupled logistic regression (PLS-LR), which prove to be effective tools for studying skin sensitization measures expressed in the two categorical terms of sensitizer and non-sensitizer. QSAR models with low values of the Hosmer-Lemeshow goodness-of-fit statistic, χHL2, are significant and predictive. For the training set, the cross-validated prediction accuracy of the logistic regression models ranges from 77.3% to 78.0%, while that of PLS-logistic regression models ranges from 87.1% to 89.4%. For the test set, the prediction accuracy of logistic regression models ranges from 80.0%-86.7%, while that of PLS-logistic regression models ranges from 73.3%-80.0%. The QSAR models are made up of 4D-fingerprints related to aromatic atoms, hydrogen bond acceptors and negatively partially charged atoms. PMID:17226934

  1. Association of Brain-Derived Neurotrophic Factor and Vitamin D with Depression and Obesity: A Population-Based Study.

    PubMed

    Goltz, Annemarie; Janowitz, Deborah; Hannemann, Anke; Nauck, Matthias; Hoffmann, Johanna; Seyfart, Tom; Völzke, Henry; Terock, Jan; Grabe, Hans Jörgen

    2018-06-19

    Depression and obesity are widespread and closely linked. Brain-derived neurotrophic factor (BDNF) and vitamin D are both assumed to be associated with depression and obesity. Little is known about the interplay between vitamin D and BDNF. We explored the putative associations and interactions between serum BDNF and vitamin D levels with depressive symptoms and abdominal obesity in a large population-based cohort. Data were obtained from the population-based Study of Health in Pomerania (SHIP)-Trend (n = 3,926). The associations of serum BDNF and vitamin D levels with depressive symptoms (measured using the Patient Health Questionnaire) were assessed with binary and multinomial logistic regression models. The associations of serum BDNF and vitamin D levels with obesity (measured by the waist-to-hip ratio [WHR]) were assessed with binary logistic and linear regression models with restricted cubic splines. Logistic regression models revealed inverse associations of vitamin D with depression (OR = 0.966; 95% CI 0.951-0.981) and obesity (OR = 0.976; 95% CI 0.967-0.985). No linear association of serum BDNF with depression or obesity was found. However, linear regression models revealed a U-shaped association of BDNF with WHR (p < 0.001). Vitamin D was inversely associated with depression and obesity. BDNF was associated with abdominal obesity, but not with depression. At the population level, our results support the relevant roles of vitamin D and BDNF in mental and physical health-related outcomes. © 2018 S. Karger AG, Basel.

  2. EXpectation Propagation LOgistic REgRession (EXPLORER): distributed privacy-preserving online model learning.

    PubMed

    Wang, Shuang; Jiang, Xiaoqian; Wu, Yuan; Cui, Lijuan; Cheng, Samuel; Ohno-Machado, Lucila

    2013-06-01

    We developed an EXpectation Propagation LOgistic REgRession (EXPLORER) model for distributed privacy-preserving online learning. The proposed framework provides a high level guarantee for protecting sensitive information, since the information exchanged between the server and the client is the encrypted posterior distribution of coefficients. Through experimental results, EXPLORER shows the same performance (e.g., discrimination, calibration, feature selection, etc.) as the traditional frequentist logistic regression model, but provides more flexibility in model updating. That is, EXPLORER can be updated one point at a time rather than having to retrain the entire data set when new observations are recorded. The proposed EXPLORER supports asynchronized communication, which relieves the participants from coordinating with one another, and prevents service breakdown from the absence of participants or interrupted communications. Copyright © 2013 Elsevier Inc. All rights reserved.

  3. Forecasting the probability of future groundwater levels declining below specified low thresholds in the conterminous U.S.

    USGS Publications Warehouse

    Dudley, Robert W.; Hodgkins, Glenn A.; Dickinson, Jesse

    2017-01-01

    We present a logistic regression approach for forecasting the probability of future groundwater levels declining or maintaining below specific groundwater-level thresholds. We tested our approach on 102 groundwater wells in different climatic regions and aquifers of the United States that are part of the U.S. Geological Survey Groundwater Climate Response Network. We evaluated the importance of current groundwater levels, precipitation, streamflow, seasonal variability, Palmer Drought Severity Index, and atmosphere/ocean indices for developing the logistic regression equations. Several diagnostics of model fit were used to evaluate the regression equations, including testing of autocorrelation of residuals, goodness-of-fit metrics, and bootstrap validation testing. The probabilistic predictions were most successful at wells with high persistence (low month-to-month variability) in their groundwater records and at wells where the groundwater level remained below the defined low threshold for sustained periods (generally three months or longer). The model fit was weakest at wells with strong seasonal variability in levels and with shorter duration low-threshold events. We identified challenges in deriving probabilistic-forecasting models and possible approaches for addressing those challenges.

  4. The crux of the method: assumptions in ordinary least squares and logistic regression.

    PubMed

    Long, Rebecca G

    2008-10-01

    Logistic regression has increasingly become the tool of choice when analyzing data with a binary dependent variable. While resources relating to the technique are widely available, clear discussions of why logistic regression should be used in place of ordinary least squares regression are difficult to find. The current paper compares and contrasts the assumptions of ordinary least squares with those of logistic regression and explains why logistic regression's looser assumptions make it adept at handling violations of the more important assumptions in ordinary least squares.

  5. Sparse Logistic Regression for Diagnosis of Liver Fibrosis in Rat by Using SCAD-Penalized Likelihood

    PubMed Central

    Yan, Fang-Rong; Lin, Jin-Guan; Liu, Yu

    2011-01-01

    The objective of the present study is to find out the quantitative relationship between progression of liver fibrosis and the levels of certain serum markers using mathematic model. We provide the sparse logistic regression by using smoothly clipped absolute deviation (SCAD) penalized function to diagnose the liver fibrosis in rats. Not only does it give a sparse solution with high accuracy, it also provides the users with the precise probabilities of classification with the class information. In the simulative case and the experiment case, the proposed method is comparable to the stepwise linear discriminant analysis (SLDA) and the sparse logistic regression with least absolute shrinkage and selection operator (LASSO) penalty, by using receiver operating characteristic (ROC) with bayesian bootstrap estimating area under the curve (AUC) diagnostic sensitivity for selected variable. Results show that the new approach provides a good correlation between the serum marker levels and the liver fibrosis induced by thioacetamide (TAA) in rats. Meanwhile, this approach might also be used in predicting the development of liver cirrhosis. PMID:21716672

  6. Using Dominance Analysis to Determine Predictor Importance in Logistic Regression

    ERIC Educational Resources Information Center

    Azen, Razia; Traxel, Nicole

    2009-01-01

    This article proposes an extension of dominance analysis that allows researchers to determine the relative importance of predictors in logistic regression models. Criteria for choosing logistic regression R[superscript 2] analogues were determined and measures were selected that can be used to perform dominance analysis in logistic regression. A…

  7. A Logistic Regression Analysis of Turkey's 15-Year-Olds' Scoring above the OECD Average on the PISA'09 Reading Assessment

    ERIC Educational Resources Information Center

    Kasapoglu, Koray

    2014-01-01

    This study aims to investigate which factors are associated with Turkey's 15-year-olds' scoring above the OECD average (493) on the PISA'09 reading assessment. Collected from a total of 4,996 15-year-old students from Turkey, data were analyzed by logistic regression analysis in order to model the data of students who were split into two: (1)…

  8. Use of genetic programming, logistic regression, and artificial neural nets to predict readmission after coronary artery bypass surgery.

    PubMed

    Engoren, Milo; Habib, Robert H; Dooner, John J; Schwann, Thomas A

    2013-08-01

    As many as 14 % of patients undergoing coronary artery bypass surgery are readmitted within 30 days. Readmission is usually the result of morbidity and may lead to death. The purpose of this study is to develop and compare statistical and genetic programming models to predict readmission. Patients were divided into separate Construction and Validation populations. Using 88 variables, logistic regression, genetic programs, and artificial neural nets were used to develop predictive models. Models were first constructed and tested on the Construction populations, then validated on the Validation population. Areas under the receiver operator characteristic curves (AU ROC) were used to compare the models. Two hundred and two patients (7.6 %) in the 2,644 patient Construction group and 216 (8.0 %) of the 2,711 patient Validation group were re-admitted within 30 days of CABG surgery. Logistic regression predicted readmission with AU ROC = .675 ± .021 in the Construction group. Genetic programs significantly improved the accuracy, AU ROC = .767 ± .001, p < .001). Artificial neural nets were less accurate with AU ROC = 0.597 ± .001 in the Construction group. Predictive accuracy of all three techniques fell in the Validation group. However, the accuracy of genetic programming (AU ROC = .654 ± .001) was still trivially but statistically non-significantly better than that of the logistic regression (AU ROC = .644 ± .020, p = .61). Genetic programming and logistic regression provide alternative methods to predict readmission that are similarly accurate.

  9. Relaxing the rule of ten events per variable in logistic and Cox regression.

    PubMed

    Vittinghoff, Eric; McCulloch, Charles E

    2007-03-15

    The rule of thumb that logistic and Cox models should be used with a minimum of 10 outcome events per predictor variable (EPV), based on two simulation studies, may be too conservative. The authors conducted a large simulation study of other influences on confidence interval coverage, type I error, relative bias, and other model performance measures. They found a range of circumstances in which coverage and bias were within acceptable levels despite less than 10 EPV, as well as other factors that were as influential as or more influential than EPV. They conclude that this rule can be relaxed, in particular for sensitivity analyses undertaken to demonstrate adequate control of confounding.

  10. Variation in the prevalence of chronic bronchitis among smokers: a cross-sectional study.

    PubMed

    Mahesh, P A; Jayaraj, B S; Chaya, S K; Lokesh, K S; McKay, A J; Prabhakar, A K; Pape, U J

    2014-07-01

    Given the wide variations in prevalence of chronic obstructive pulmonary disease observed between populations with similar levels of exposure to tobacco smoke, we aimed to investigate the possibility of variations in prevalence of chronic bronchitis (CB) between two geographically distinct smoking populations in rural Karnataka, India. The Burden of Obstructive Lung Disease (BOLD) questionnaire was administered to all men aged >30 years in a cross-sectional survey. The χ(2) and Fisher's exact tests were used to compare CB prevalence in the two populations. Logistic regression was used to analyse the impact of multiple variables on the occurrence of CB. Two samples of 2322 and 2182 subjects were included in the study. In non-smokers, CB prevalence did not differ between the populations. However, it was significantly different between smoking populations (44.79% vs. 2.13%, P < 0.0001). Logistic regression indicated that, in addition to smoking, region, age, occupational dust exposure and type of house were associated with higher likelihood of CB. An interaction between smoking and area of residence was found (P < 0.001) and appeared to explain the effect of region (without interaction). A significant difference in CB prevalence was observed between male populations from two areas of Karnataka state, including when stratified by smoking status. No significant difference was observed between non-smokers.

  11. Applying Kaplan-Meier to Item Response Data

    ERIC Educational Resources Information Center

    McNeish, Daniel

    2018-01-01

    Some IRT models can be equivalently modeled in alternative frameworks such as logistic regression. Logistic regression can also model time-to-event data, which concerns the probability of an event occurring over time. Using the relation between time-to-event models and logistic regression and the relation between logistic regression and IRT, this…

  12. Building interpretable predictive models for pediatric hospital readmission using Tree-Lasso logistic regression.

    PubMed

    Jovanovic, Milos; Radovanovic, Sandro; Vukicevic, Milan; Van Poucke, Sven; Delibasic, Boris

    2016-09-01

    Quantification and early identification of unplanned readmission risk have the potential to improve the quality of care during hospitalization and after discharge. However, high dimensionality, sparsity, and class imbalance of electronic health data and the complexity of risk quantification, challenge the development of accurate predictive models. Predictive models require a certain level of interpretability in order to be applicable in real settings and create actionable insights. This paper aims to develop accurate and interpretable predictive models for readmission in a general pediatric patient population, by integrating a data-driven model (sparse logistic regression) and domain knowledge based on the international classification of diseases 9th-revision clinical modification (ICD-9-CM) hierarchy of diseases. Additionally, we propose a way to quantify the interpretability of a model and inspect the stability of alternative solutions. The analysis was conducted on >66,000 pediatric hospital discharge records from California, State Inpatient Databases, Healthcare Cost and Utilization Project between 2009 and 2011. We incorporated domain knowledge based on the ICD-9-CM hierarchy in a data driven, Tree-Lasso regularized logistic regression model, providing the framework for model interpretation. This approach was compared with traditional Lasso logistic regression resulting in models that are easier to interpret by fewer high-level diagnoses, with comparable prediction accuracy. The results revealed that the use of a Tree-Lasso model was as competitive in terms of accuracy (measured by area under the receiver operating characteristic curve-AUC) as the traditional Lasso logistic regression, but integration with the ICD-9-CM hierarchy of diseases provided more interpretable models in terms of high-level diagnoses. Additionally, interpretations of models are in accordance with existing medical understanding of pediatric readmission. Best performing models have similar performances reaching AUC values 0.783 and 0.779 for traditional Lasso and Tree-Lasso, respectfully. However, information loss of Lasso models is 0.35 bits higher compared to Tree-Lasso model. We propose a method for building predictive models applicable for the detection of readmission risk based on Electronic Health records. Integration of domain knowledge (in the form of ICD-9-CM taxonomy) and a data-driven, sparse predictive algorithm (Tree-Lasso Logistic Regression) resulted in an increase of interpretability of the resulting model. The models are interpreted for the readmission prediction problem in general pediatric population in California, as well as several important subpopulations, and the interpretations of models comply with existing medical understanding of pediatric readmission. Finally, quantitative assessment of the interpretability of the models is given, that is beyond simple counts of selected low-level features. Copyright © 2016 Elsevier B.V. All rights reserved.

  13. Dietary consumption patterns and laryngeal cancer risk.

    PubMed

    Vlastarakos, Petros V; Vassileiou, Andrianna; Delicha, Evie; Kikidis, Dimitrios; Protopapas, Dimosthenis; Nikolopoulos, Thomas P

    2016-06-01

    We conducted a case-control study to investigate the effect of diet on laryngeal carcinogenesis. Our study population was made up of 140 participants-70 patients with laryngeal cancer (LC) and 70 controls with a non-neoplastic condition that was unrelated to diet, smoking, or alcohol. A food-frequency questionnaire determined the mean consumption of 113 different items during the 3 years prior to symptom onset. Total energy intake and cooking mode were also noted. The relative risk, odds ratio (OR), and 95% confidence interval (CI) were estimated by multiple logistic regression analysis. We found that the total energy intake was significantly higher in the LC group (p < 0.001), and that the difference remained statistically significant after logistic regression analysis (p < 0.001; OR: 118.70). Notably, meat consumption was higher in the LC group (p < 0.001), and the difference remained significant after logistic regression analysis (p = 0.029; OR: 1.16). LC patients also consumed significantly more fried food (p = 0.036); this difference also remained significant in the logistic regression model (p = 0.026; OR: 5.45). The LC group also consumed significantly more seafood (p = 0.012); the difference persisted after logistic regression analysis (p = 0.009; OR: 2.48), with the consumption of shrimp proving detrimental (p = 0.049; OR: 2.18). Finally, the intake of zinc was significantly higher in the LC group before and after logistic regression analysis (p = 0.034 and p = 0.011; OR: 30.15, respectively). Cereal consumption (including pastas) was also higher among the LC patients (p = 0.043), with logistic regression analysis showing that their negative effect was possibly associated with the sauces and dressings that traditionally accompany pasta dishes (p = 0.006; OR: 4.78). Conversely, a higher consumption of dairy products was found in controls (p < 0.05); logistic regression analysis showed that calcium appeared to be protective at the micronutrient level (p < 0.001; OR: 0.27). We found no difference in the overall consumption of fruits and vegetables between the LC patients and controls; however, the LC patients did have a greater consumption of cooked tomatoes and cooked root vegetables (p = 0.039 for both), and the controls had more consumption of leeks (p = 0.042) and, among controls younger than 65 years, cooked beans (p = 0.037). Lemon (p = 0.037), squeezed fruit juice (p = 0.032), and watermelon (p = 0.018) were also more frequently consumed by the controls. Other differences at the micronutrient level included greater consumption by the LC patients of retinol (p = 0.044), polyunsaturated fats (p = 0.041), and linoleic acid (p = 0.008); LC patients younger than 65 years also had greater intake of riboflavin (p = 0.045). We conclude that the differences in dietary consumption patterns between LC patients and controls indicate a possible role for lifestyle modifications involving nutritional factors as a means of decreasing the risk of laryngeal cancer.

  14. Polymorphism Thr160Thr in SRD5A1, involved in the progesterone metabolism, modifies postmenopausal breast cancer risk associated with menopausal hormone therapy.

    PubMed

    Hein, R; Abbas, S; Seibold, P; Salazar, R; Flesch-Janys, D; Chang-Claude, J

    2012-01-01

    Menopausal hormone therapy (MHT) is associated with an increased breast cancer risk in postmenopausal women, with combined estrogen-progestagen therapy posing a greater risk than estrogen monotherapy. However, few studies focused on potential effect modification of MHT-associated breast cancer risk by genetic polymorphisms in the progesterone metabolism. We assessed effect modification of MHT use by five coding single nucleotide polymorphisms (SNPs) in the progesterone metabolizing enzymes AKR1C3 (rs7741), AKR1C4 (rs3829125, rs17134592), and SRD5A1 (rs248793, rs3736316) using a two-center population-based case-control study from Germany with 2,502 postmenopausal breast cancer patients and 4,833 matched controls. An empirical-Bayes procedure that tests for interaction using a weighted combination of the prospective and the retrospective case-control estimators as well as standard prospective logistic regression were applied to assess multiplicative statistical interaction between polymorphisms and duration of MHT use with regard to breast cancer risk assuming a log-additive mode of inheritance. No genetic marginal effects were observed. Breast cancer risk associated with duration of combined therapy was significantly modified by SRD5A1_rs3736316, showing a reduced risk elevation in carriers of the minor allele (p (interaction,empirical-Bayes) = 0.006 using the empirical-Bayes method, p (interaction,logistic regression) = 0.013 using logistic regression). The risk associated with duration of use of monotherapy was increased by AKR1C3_rs7741 in minor allele carriers (p (interaction,empirical-Bayes) = 0.083, p (interaction,logistic regression) = 0.029) and decreased in minor allele carriers of two SNPs in AKR1C4 (rs3829125: p (interaction,empirical-Bayes) = 0.07, p (interaction,logistic regression) = 0.021; rs17134592: p (interaction,empirical-Bayes) = 0.101, p (interaction,logistic regression) = 0.038). After Bonferroni correction for multiple testing only SRD5A1_rs3736316 assessed using the empirical-Bayes method remained significant. Postmenopausal breast cancer risk associated with combined therapy may be modified by genetic variation in SRD5A1. Further well-powered studies are, however, required to replicate our finding.

  15. Sociodemographic Differences in the Association Between Obesity and Stress: A Propensity Score-Matched Analysis from the Korean National Health and Nutrition Examination Survey (KNHANES).

    PubMed

    Mak, Kwok-Kei; Kim, Dae-Hwan; Leigh, J Paul

    2015-01-01

    Few population-based studies have used an econometric approach to understand the association between two cancer risk factors, obesity and stress. This study investigated sociodemographic differences in the association between obesity and stress among Korean adults (6,546 men and 8,473 women). Data were drawn from the Korean National Health and Nutrition Examination Survey for 2008, 2009, and 2010. Ordered logistic regression models and propensity score matching methods were used to examine the associations between obesity and stress, stratified by gender and age groups. In women, the stress level of the obese group was found to be 27.6% higher than the nonobese group in the ordered logistic regression; the obesity effect on stress was statistically significant in the propensity score-matched analysis. Corresponding evidence for the effect of obesity on stress was lacking among men. Participants who were young, well-educated, and working were more likely to report stress. In Korea, obesity causes stress in women but not in men. Young women are susceptible to a disproportionate level of stress. More cancer prevention programs targeting young and obese women are encouraged in developed Asian countries.

  16. Logistic Mixed Models to Investigate Implicit and Explicit Belief Tracking

    PubMed Central

    Lages, Martin; Scheel, Anne

    2016-01-01

    We investigated the proposition of a two-systems Theory of Mind in adults’ belief tracking. A sample of N = 45 participants predicted the choice of one of two opponent players after observing several rounds in an animated card game. Three matches of this card game were played and initial gaze direction on target and subsequent choice predictions were recorded for each belief task and participant. We conducted logistic regressions with mixed effects on the binary data and developed Bayesian logistic mixed models to infer implicit and explicit mentalizing in true belief and false belief tasks. Although logistic regressions with mixed effects predicted the data well a Bayesian logistic mixed model with latent task- and subject-specific parameters gave a better account of the data. As expected explicit choice predictions suggested a clear understanding of true and false beliefs (TB/FB). Surprisingly, however, model parameters for initial gaze direction also indicated belief tracking. We discuss why task-specific parameters for initial gaze directions are different from choice predictions yet reflect second-order perspective taking. PMID:27853440

  17. Impact of Contextual Factors on Prostate Cancer Risk and Outcomes

    DTIC Science & Technology

    2013-07-01

    framework with random effects (“frailty models”) while the case-control analyses (Aim 4) will use multilevel unconditional logistic regression models...contextual-level SES on prostate cancer risk within racial/ethnic groups. The survival analyses (Aims 1-3) will utilize a proportional hazards regression

  18. Completing the Remedial Sequence and College-Level Credit-Bearing Math: Comparing Binary, Cumulative, and Continuation Ratio Logistic Regression Models

    ERIC Educational Resources Information Center

    Davidson, J. Cody

    2016-01-01

    Mathematics is the most common subject area of remedial need and the majority of remedial math students never pass a college-level credit-bearing math class. The majorities of studies that investigate this phenomenon are conducted at community colleges and use some type of regression model; however, none have used a continuation ratio model. The…

  19. America's Democracy Colleges: The Civic Engagement of Community College Students

    ERIC Educational Resources Information Center

    Angeli Newell, Mallory

    2014-01-01

    This study explored the civic engagement of current two- and four-year students to explore whether differences exist between the groups and what may explain the differences. Using binary logistic regression and Ordinary Least Squares regression it was found that community-based engagement was lower for two- than four-year students, though…

  20. Regression approaches in the test-negative study design for assessment of influenza vaccine effectiveness.

    PubMed

    Bond, H S; Sullivan, S G; Cowling, B J

    2016-06-01

    Influenza vaccination is the most practical means available for preventing influenza virus infection and is widely used in many countries. Because vaccine components and circulating strains frequently change, it is important to continually monitor vaccine effectiveness (VE). The test-negative design is frequently used to estimate VE. In this design, patients meeting the same clinical case definition are recruited and tested for influenza; those who test positive are the cases and those who test negative form the comparison group. When determining VE in these studies, the typical approach has been to use logistic regression, adjusting for potential confounders. Because vaccine coverage and influenza incidence change throughout the season, time is included among these confounders. While most studies use unconditional logistic regression, adjusting for time, an alternative approach is to use conditional logistic regression, matching on time. Here, we used simulation data to examine the potential for both regression approaches to permit accurate and robust estimates of VE. In situations where vaccine coverage changed during the influenza season, the conditional model and unconditional models adjusting for categorical week and using a spline function for week provided more accurate estimates. We illustrated the two approaches on data from a test-negative study of influenza VE against hospitalization in children in Hong Kong which resulted in the conditional logistic regression model providing the best fit to the data.

  1. Nowcasting of Low-Visibility Procedure States with Ordered Logistic Regression at Vienna International Airport

    NASA Astrophysics Data System (ADS)

    Kneringer, Philipp; Dietz, Sebastian; Mayr, Georg J.; Zeileis, Achim

    2017-04-01

    Low-visibility conditions have a large impact on aviation safety and economic efficiency of airports and airlines. To support decision makers, we develop a statistical probabilistic nowcasting tool for the occurrence of capacity-reducing operations related to low visibility. The probabilities of four different low visibility classes are predicted with an ordered logistic regression model based on time series of meteorological point measurements. Potential predictor variables for the statistical models are visibility, humidity, temperature and wind measurements at several measurement sites. A stepwise variable selection method indicates that visibility and humidity measurements are the most important model inputs. The forecasts are tested with a 30 minute forecast interval up to two hours, which is a sufficient time span for tactical planning at Vienna Airport. The ordered logistic regression models outperform persistence and are competitive with human forecasters.

  2. Nomogram for prediction of level 2 axillary lymph node metastasis in proven level 1 node-positive breast cancer patients.

    PubMed

    Jiang, Yanlin; Xu, Hong; Zhang, Hao; Ou, Xunyan; Xu, Zhen; Ai, Liping; Sun, Lisha; Liu, Caigang

    2017-09-22

    The current management of the axilla in level 1 node-positive breast cancer patients is axillary lymph node dissection regardless of the status of the level 2 axillary lymph nodes. The goal of this study was to develop a nomogram predicting the probability of level 2 axillary lymph node metastasis (L-2-ALNM) in patients with level 1 axillary node-positive breast cancer. We reviewed the records of 974 patients with pathology-confirmed level 1 node-positive breast cancer between 2010 and 2014 at the Liaoning Cancer Hospital and Institute. The patients were randomized 1:1 and divided into a modeling group and a validation group. Clinical and pathological features of the patients were assessed with uni- and multivariate logistic regression. A nomogram based on independent predictors for the L-2-ALNM identified by multivariate logistic regression was constructed. Independent predictors of L-2-ALNM by the multivariate logistic regression analysis included tumor size, Ki-67 status, histological grade, and number of positive level 1 axillary lymph nodes. The areas under the receiver operating characteristic curve of the modeling set and the validation set were 0.828 and 0.816, respectively. The false-negative rates of the L-2-ALNM nomogram were 1.82% and 7.41% for the predicted probability cut-off points of < 6% and < 10%, respectively, when applied to the validation group. Our nomogram could help predict L-2-ALNM in patients with level 1 axillary lymph node metastasis. Patients with a low probability of L-2-ALNM could be spared level 2 axillary lymph node dissection, thereby reducing postoperative morbidity.

  3. SU-E-J-256: Predicting Metastasis-Free Survival of Rectal Cancer Patients Treated with Neoadjuvant Chemo-Radiotherapy by Data-Mining of CT Texture Features of Primary Lesions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhong, H; Wang, J; Shen, L

    Purpose: The purpose of this study is to investigate the relationship between computed tomographic (CT) texture features of primary lesions and metastasis-free survival for rectal cancer patients; and to develop a datamining prediction model using texture features. Methods: A total of 220 rectal cancer patients treated with neoadjuvant chemo-radiotherapy (CRT) were enrolled in this study. All patients underwent CT scans before CRT. The primary lesions on the CT images were delineated by two experienced oncologists. The CT images were filtered by Laplacian of Gaussian (LoG) filters with different filter values (1.0–2.5: from fine to coarse). Both filtered and unfiltered imagesmore » were analyzed using Gray-level Co-occurrence Matrix (GLCM) texture analysis with different directions (transversal, sagittal, and coronal). Totally, 270 texture features with different species, directions and filter values were extracted. Texture features were examined with Student’s t-test for selecting predictive features. Principal Component Analysis (PCA) was performed upon the selected features to reduce the feature collinearity. Artificial neural network (ANN) and logistic regression were applied to establish metastasis prediction models. Results: Forty-six of 220 patients developed metastasis with a follow-up time of more than 2 years. Sixtyseven texture features were significantly different in t-test (p<0.05) between patients with and without metastasis, and 12 of them were extremely significant (p<0.001). The Area-under-the-curve (AUC) of ANN was 0.72, and the concordance index (CI) of logistic regression was 0.71. The predictability of ANN was slightly better than logistic regression. Conclusion: CT texture features of primary lesions are related to metastasisfree survival of rectal cancer patients. Both ANN and logistic regression based models can be developed for prediction.« less

  4. Factors associated with local public health agency participation in obesity prevention in southern States.

    PubMed

    Hatala, Jeffrey J; Fields, Tina T

    2015-05-01

    Obesity rates in the southern US states are higher than in other states. Historically, large-scale community-based interventions in the United States have not proven successful. With local public health agencies (LPHAs) tasked with prevention, their role in obesity prevention is important, yet little research exists regarding what predicts the participation of LPHAs. Cross-sectional data from the 2008 National Association of City and County Health Officials profile study and two public health conceptual frameworks were used to assess structural and environmental predictors of LPHA participation in obesity prevention. The predictors were compared between southern and nonsouthern states. Univariate and weighted logistic regressions were performed. Analysis revealed that more LPHAs in southern states were engaged in nearly all of the 10 essential public health functions related to obesity prevention compared with nonsouthern states. Presence of community-based organizations and staffing levels were the only significant variables in two of the six logistic regression models. This study provides insights into the success rates of the obesity prevention efforts of LPHAs in southern and nonsouthern states. Future research is needed to understand why and how certain structural elements and any additional factors influence LPHA participation in obesity prevention.

  5. A novel hybrid method of beta-turn identification in protein using binary logistic regression and neural network

    PubMed Central

    Asghari, Mehdi Poursheikhali; Hayatshahi, Sayyed Hamed Sadat; Abdolmaleki, Parviz

    2012-01-01

    From both the structural and functional points of view, β-turns play important biological roles in proteins. In the present study, a novel two-stage hybrid procedure has been developed to identify β-turns in proteins. Binary logistic regression was initially used for the first time to select significant sequence parameters in identification of β-turns due to a re-substitution test procedure. Sequence parameters were consisted of 80 amino acid positional occurrences and 20 amino acid percentages in sequence. Among these parameters, the most significant ones which were selected by binary logistic regression model, were percentages of Gly, Ser and the occurrence of Asn in position i+2, respectively, in sequence. These significant parameters have the highest effect on the constitution of a β-turn sequence. A neural network model was then constructed and fed by the parameters selected by binary logistic regression to build a hybrid predictor. The networks have been trained and tested on a non-homologous dataset of 565 protein chains. With applying a nine fold cross-validation test on the dataset, the network reached an overall accuracy (Qtotal) of 74, which is comparable with results of the other β-turn prediction methods. In conclusion, this study proves that the parameter selection ability of binary logistic regression together with the prediction capability of neural networks lead to the development of more precise models for identifying β-turns in proteins. PMID:27418910

  6. A novel hybrid method of beta-turn identification in protein using binary logistic regression and neural network.

    PubMed

    Asghari, Mehdi Poursheikhali; Hayatshahi, Sayyed Hamed Sadat; Abdolmaleki, Parviz

    2012-01-01

    From both the structural and functional points of view, β-turns play important biological roles in proteins. In the present study, a novel two-stage hybrid procedure has been developed to identify β-turns in proteins. Binary logistic regression was initially used for the first time to select significant sequence parameters in identification of β-turns due to a re-substitution test procedure. Sequence parameters were consisted of 80 amino acid positional occurrences and 20 amino acid percentages in sequence. Among these parameters, the most significant ones which were selected by binary logistic regression model, were percentages of Gly, Ser and the occurrence of Asn in position i+2, respectively, in sequence. These significant parameters have the highest effect on the constitution of a β-turn sequence. A neural network model was then constructed and fed by the parameters selected by binary logistic regression to build a hybrid predictor. The networks have been trained and tested on a non-homologous dataset of 565 protein chains. With applying a nine fold cross-validation test on the dataset, the network reached an overall accuracy (Qtotal) of 74, which is comparable with results of the other β-turn prediction methods. In conclusion, this study proves that the parameter selection ability of binary logistic regression together with the prediction capability of neural networks lead to the development of more precise models for identifying β-turns in proteins.

  7. Robust experimental design for optimizing the microbial inhibitor test for penicillin detection in milk.

    PubMed

    Nagel, O G; Molina, M P; Basílico, J C; Zapata, M L; Althaus, R L

    2009-06-01

    To use experimental design techniques and a multiple logistic regression model to optimize a microbiological inhibition test with dichotomous response for the detection of Penicillin G in milk. A 2(3) x 2(2) robust experimental design with two replications was used. The effects of three control factors (V: culture medium volume, S: spore concentration of Geobacillus stearothermophilus, I: indicator concentration), two noise factors (Dt: diffusion time, Ip: incubation period) and their interactions were studied. The V, S, Dt, Ip factors and V x S, V x Ip, S x Ip interactions showed significant effects. The use of 100 microl culture medium volume, 2 x 10(5) spores ml(-1), 60 min diffusion time and 3 h incubation period is recommended. In these elaboration conditions, the penicillin detection limit was of 3.9 microg l(-1), similar to the maximum residue limit (MRL). Of the two noise factors studied, the incubation period can be controlled by means of the culture medium volume and spore concentration. We were able to optimize bioassays of dichotomous response using an experimental design and logistic regression model for the detection of residues at the level of MRL, aiding in the avoidance of health problems in the consumer.

  8. Application of logistic regression to case-control association studies involving two causative loci.

    PubMed

    North, Bernard V; Curtis, David; Sham, Pak C

    2005-01-01

    Models in which two susceptibility loci jointly influence the risk of developing disease can be explored using logistic regression analysis. Comparison of likelihoods of models incorporating different sets of disease model parameters allows inferences to be drawn regarding the nature of the joint effect of the loci. We have simulated case-control samples generated assuming different two-locus models and then analysed them using logistic regression. We show that this method is practicable and that, for the models we have used, it can be expected to allow useful inferences to be drawn from sample sizes consisting of hundreds of subjects. Interactions between loci can be explored, but interactive effects do not exactly correspond with classical definitions of epistasis. We have particularly examined the issue of the extent to which it is helpful to utilise information from a previously identified locus when investigating a second, unknown locus. We show that for some models conditional analysis can have substantially greater power while for others unconditional analysis can be more powerful. Hence we conclude that in general both conditional and unconditional analyses should be performed when searching for additional loci.

  9. Clinical Outcome And Arginine Serum of Acute Ischemic Stroke Patients Supplemented by Snakehead Fish Extract

    NASA Astrophysics Data System (ADS)

    Pudjonarko, Dwi; Retnaningsih; Abidin, Zainal

    2018-02-01

    Background: Levels of arginine associated with clinical outcome in acute ischemic stroke (AIS). Arginine is a protein needed to synthesis nitric oxide (NO), a potential vasodilator and antioxidant. Snakehead fish is a source of protein which has antioxidant activity. Snakehead fish contains mineral, vitamin, and amino acids. One of the amino acids that were found quite high in snakehead fish extract is arginine. The aim of this study was done to determine the effect of snakehead fish extracts (SFE) on serum arginin levels and clinical outcome of AIS patients. Methods: It was double-blind randomized pretest-posttest control group design, with. AIS patients were divided into two groups i.e. snakehead fish extracts (SFE) and control. SFE group were administered 15 grams SFE for 7 days . Arginine serum levels and clinical outcome (measured by National Institute of Health Stroke Scale = NIHSS) were measured before and after treatment, other related factors were also analyzed in Logistic regression. Results: A total of 42 subjects who were performed random allocation as SFE or control group. There was no differences in subject characteristics between the two groups. There was a differences Δ arginine serum levels between SFE and control (33.6±19.95 μmol/L 0.3±2.51 μmol/L p<0.001). Change in NIHSS score in SFE improved significantly compared to the control group (4.14 ± 2.03; 2.52 ± 1.81;p=0.009 ). Logistic regression analysis showed only female gender factor that affected on improvement of NIHSS (OR=7; p=0,01). Conclusion: There is Clinical outcome improvement and enhancement of arginine serum levels in AIS patient with snakehead fish extract supplementation.

  10. A Method for Calculating the Probability of Successfully Completing a Rocket Propulsion Ground Test

    NASA Technical Reports Server (NTRS)

    Messer, Bradley

    2007-01-01

    Propulsion ground test facilities face the daily challenge of scheduling multiple customers into limited facility space and successfully completing their propulsion test projects. Over the last decade NASA s propulsion test facilities have performed hundreds of tests, collected thousands of seconds of test data, and exceeded the capabilities of numerous test facility and test article components. A logistic regression mathematical modeling technique has been developed to predict the probability of successfully completing a rocket propulsion test. A logistic regression model is a mathematical modeling approach that can be used to describe the relationship of several independent predictor variables X(sub 1), X(sub 2),.., X(sub k) to a binary or dichotomous dependent variable Y, where Y can only be one of two possible outcomes, in this case Success or Failure of accomplishing a full duration test. The use of logistic regression modeling is not new; however, modeling propulsion ground test facilities using logistic regression is both a new and unique application of the statistical technique. Results from this type of model provide project managers with insight and confidence into the effectiveness of rocket propulsion ground testing.

  11. Standards for Standardized Logistic Regression Coefficients

    ERIC Educational Resources Information Center

    Menard, Scott

    2011-01-01

    Standardized coefficients in logistic regression analysis have the same utility as standardized coefficients in linear regression analysis. Although there has been no consensus on the best way to construct standardized logistic regression coefficients, there is now sufficient evidence to suggest a single best approach to the construction of a…

  12. Easy and low-cost identification of metabolic syndrome in patients treated with second-generation antipsychotics: artificial neural network and logistic regression models.

    PubMed

    Lin, Chao-Cheng; Bai, Ya-Mei; Chen, Jen-Yeu; Hwang, Tzung-Jeng; Chen, Tzu-Ting; Chiu, Hung-Wen; Li, Yu-Chuan

    2010-03-01

    Metabolic syndrome (MetS) is an important side effect of second-generation antipsychotics (SGAs). However, many SGA-treated patients with MetS remain undetected. In this study, we trained and validated artificial neural network (ANN) and multiple logistic regression models without biochemical parameters to rapidly identify MetS in patients with SGA treatment. A total of 383 patients with a diagnosis of schizophrenia or schizoaffective disorder (DSM-IV criteria) with SGA treatment for more than 6 months were investigated to determine whether they met the MetS criteria according to the International Diabetes Federation. The data for these patients were collected between March 2005 and September 2005. The input variables of ANN and logistic regression were limited to demographic and anthropometric data only. All models were trained by randomly selecting two-thirds of the patient data and were internally validated with the remaining one-third of the data. The models were then externally validated with data from 69 patients from another hospital, collected between March 2008 and June 2008. The area under the receiver operating characteristic curve (AUC) was used to measure the performance of all models. Both the final ANN and logistic regression models had high accuracy (88.3% vs 83.6%), sensitivity (93.1% vs 86.2%), and specificity (86.9% vs 83.8%) to identify MetS in the internal validation set. The mean +/- SD AUC was high for both the ANN and logistic regression models (0.934 +/- 0.033 vs 0.922 +/- 0.035, P = .63). During external validation, high AUC was still obtained for both models. Waist circumference and diastolic blood pressure were the common variables that were left in the final ANN and logistic regression models. Our study developed accurate ANN and logistic regression models to detect MetS in patients with SGA treatment. The models are likely to provide a noninvasive tool for large-scale screening of MetS in this group of patients. (c) 2010 Physicians Postgraduate Press, Inc.

  13. A Bayesian goodness of fit test and semiparametric generalization of logistic regression with measurement data.

    PubMed

    Schörgendorfer, Angela; Branscum, Adam J; Hanson, Timothy E

    2013-06-01

    Logistic regression is a popular tool for risk analysis in medical and population health science. With continuous response data, it is common to create a dichotomous outcome for logistic regression analysis by specifying a threshold for positivity. Fitting a linear regression to the nondichotomized response variable assuming a logistic sampling model for the data has been empirically shown to yield more efficient estimates of odds ratios than ordinary logistic regression of the dichotomized endpoint. We illustrate that risk inference is not robust to departures from the parametric logistic distribution. Moreover, the model assumption of proportional odds is generally not satisfied when the condition of a logistic distribution for the data is violated, leading to biased inference from a parametric logistic analysis. We develop novel Bayesian semiparametric methodology for testing goodness of fit of parametric logistic regression with continuous measurement data. The testing procedures hold for any cutoff threshold and our approach simultaneously provides the ability to perform semiparametric risk estimation. Bayes factors are calculated using the Savage-Dickey ratio for testing the null hypothesis of logistic regression versus a semiparametric generalization. We propose a fully Bayesian and a computationally efficient empirical Bayesian approach to testing, and we present methods for semiparametric estimation of risks, relative risks, and odds ratios when parametric logistic regression fails. Theoretical results establish the consistency of the empirical Bayes test. Results from simulated data show that the proposed approach provides accurate inference irrespective of whether parametric assumptions hold or not. Evaluation of risk factors for obesity shows that different inferences are derived from an analysis of a real data set when deviations from a logistic distribution are permissible in a flexible semiparametric framework. © 2013, The International Biometric Society.

  14. Propensity score estimation: machine learning and classification methods as alternatives to logistic regression

    PubMed Central

    Westreich, Daniel; Lessler, Justin; Funk, Michele Jonsson

    2010-01-01

    Summary Objective Propensity scores for the analysis of observational data are typically estimated using logistic regression. Our objective in this Review was to assess machine learning alternatives to logistic regression which may accomplish the same goals but with fewer assumptions or greater accuracy. Study Design and Setting We identified alternative methods for propensity score estimation and/or classification from the public health, biostatistics, discrete mathematics, and computer science literature, and evaluated these algorithms for applicability to the problem of propensity score estimation, potential advantages over logistic regression, and ease of use. Results We identified four techniques as alternatives to logistic regression: neural networks, support vector machines, decision trees (CART), and meta-classifiers (in particular, boosting). Conclusion While the assumptions of logistic regression are well understood, those assumptions are frequently ignored. All four alternatives have advantages and disadvantages compared with logistic regression. Boosting (meta-classifiers) and to a lesser extent decision trees (particularly CART) appear to be most promising for use in the context of propensity score analysis, but extensive simulation studies are needed to establish their utility in practice. PMID:20630332

  15. Fungible weights in logistic regression.

    PubMed

    Jones, Jeff A; Waller, Niels G

    2016-06-01

    In this article we develop methods for assessing parameter sensitivity in logistic regression models. To set the stage for this work, we first review Waller's (2008) equations for computing fungible weights in linear regression. Next, we describe 2 methods for computing fungible weights in logistic regression. To demonstrate the utility of these methods, we compute fungible logistic regression weights using data from the Centers for Disease Control and Prevention's (2010) Youth Risk Behavior Surveillance Survey, and we illustrate how these alternate weights can be used to evaluate parameter sensitivity. To make our work accessible to the research community, we provide R code (R Core Team, 2015) that will generate both kinds of fungible logistic regression weights. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  16. A general equation to obtain multiple cut-off scores on a test from multinomial logistic regression.

    PubMed

    Bersabé, Rosa; Rivas, Teresa

    2010-05-01

    The authors derive a general equation to compute multiple cut-offs on a total test score in order to classify individuals into more than two ordinal categories. The equation is derived from the multinomial logistic regression (MLR) model, which is an extension of the binary logistic regression (BLR) model to accommodate polytomous outcome variables. From this analytical procedure, cut-off scores are established at the test score (the predictor variable) at which an individual is as likely to be in category j as in category j+1 of an ordinal outcome variable. The application of the complete procedure is illustrated by an example with data from an actual study on eating disorders. In this example, two cut-off scores on the Eating Attitudes Test (EAT-26) scores are obtained in order to classify individuals into three ordinal categories: asymptomatic, symptomatic and eating disorder. Diagnoses were made from the responses to a self-report (Q-EDD) that operationalises DSM-IV criteria for eating disorders. Alternatives to the MLR model to set multiple cut-off scores are discussed.

  17. Propensity score estimation: neural networks, support vector machines, decision trees (CART), and meta-classifiers as alternatives to logistic regression.

    PubMed

    Westreich, Daniel; Lessler, Justin; Funk, Michele Jonsson

    2010-08-01

    Propensity scores for the analysis of observational data are typically estimated using logistic regression. Our objective in this review was to assess machine learning alternatives to logistic regression, which may accomplish the same goals but with fewer assumptions or greater accuracy. We identified alternative methods for propensity score estimation and/or classification from the public health, biostatistics, discrete mathematics, and computer science literature, and evaluated these algorithms for applicability to the problem of propensity score estimation, potential advantages over logistic regression, and ease of use. We identified four techniques as alternatives to logistic regression: neural networks, support vector machines, decision trees (classification and regression trees [CART]), and meta-classifiers (in particular, boosting). Although the assumptions of logistic regression are well understood, those assumptions are frequently ignored. All four alternatives have advantages and disadvantages compared with logistic regression. Boosting (meta-classifiers) and, to a lesser extent, decision trees (particularly CART), appear to be most promising for use in the context of propensity score analysis, but extensive simulation studies are needed to establish their utility in practice. Copyright (c) 2010 Elsevier Inc. All rights reserved.

  18. Correlation and simple linear regression.

    PubMed

    Eberly, Lynn E

    2007-01-01

    This chapter highlights important steps in using correlation and simple linear regression to address scientific questions about the association of two continuous variables with each other. These steps include estimation and inference, assessing model fit, the connection between regression and ANOVA, and study design. Examples in microbiology are used throughout. This chapter provides a framework that is helpful in understanding more complex statistical techniques, such as multiple linear regression, linear mixed effects models, logistic regression, and proportional hazards regression.

  19. The logistic model for predicting the non-gonoactive Aedes aegypti females.

    PubMed

    Reyes-Villanueva, Filiberto; Rodríguez-Pérez, Mario A

    2004-01-01

    To estimate, using logistic regression, the likelihood of occurrence of a non-gonoactive Aedes aegypti female, previously fed human blood, with relation to body size and collection method. This study was conducted in Monterrey, Mexico, between 1994 and 1996. Ten samplings of 60 mosquitoes of Ae. aegypti females were carried out in three dengue endemic areas: six of biting females, two of emerging mosquitoes, and two of indoor resting females. Gravid females, as well as those with blood in the gut were removed. Mosquitoes were taken to the laboratory and engorged on human blood. After 48 hours, ovaries were dissected to register whether they were gonoactive or non-gonoactive. Wing-length in mm was an indicator for body size. The logistic regression model was used to assess the likelihood of non-gonoactivity, as a binary variable, in relation to wing-length and collection method. Of the 600 females, 164 (27%) remained non-gonoactive, with a wing-length range of 1.9-3.2 mm, almost equal to that of all females (1.8-3.3 mm). The logistic regression model showed a significant likelihood of a female remaining non-gonoactive (Y=1). The collection method did not influence the binary response, but there was an inverse relationship between non-gonoactivity and wing-length. Dengue vector populations from Monterrey, Mexico display a wide-range body size. Logistic regression was a useful tool to estimate the likelihood for an engorged female to remain non-gonoactive. The necessity for a second blood meal is present in any female, but small mosquitoes are more likely to bite again within a 2-day interval, in order to attain egg maturation. The English version of this paper is available too at: http://www.insp.mx/salud/index.html.

  20. Should metacognition be measured by logistic regression?

    PubMed

    Rausch, Manuel; Zehetleitner, Michael

    2017-03-01

    Are logistic regression slopes suitable to quantify metacognitive sensitivity, i.e. the efficiency with which subjective reports differentiate between correct and incorrect task responses? We analytically show that logistic regression slopes are independent from rating criteria in one specific model of metacognition, which assumes (i) that rating decisions are based on sensory evidence generated independently of the sensory evidence used for primary task responses and (ii) that the distributions of evidence are logistic. Given a hierarchical model of metacognition, logistic regression slopes depend on rating criteria. According to all considered models, regression slopes depend on the primary task criterion. A reanalysis of previous data revealed that massive numbers of trials are required to distinguish between hierarchical and independent models with tolerable accuracy. It is argued that researchers who wish to use logistic regression as measure of metacognitive sensitivity need to control the primary task criterion and rating criteria. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. A comparative analysis of predictive models of morbidity in intensive care unit after cardiac surgery - part II: an illustrative example.

    PubMed

    Cevenini, Gabriele; Barbini, Emanuela; Scolletta, Sabino; Biagioli, Bonizella; Giomarelli, Pierpaolo; Barbini, Paolo

    2007-11-22

    Popular predictive models for estimating morbidity probability after heart surgery are compared critically in a unitary framework. The study is divided into two parts. In the first part modelling techniques and intrinsic strengths and weaknesses of different approaches were discussed from a theoretical point of view. In this second part the performances of the same models are evaluated in an illustrative example. Eight models were developed: Bayes linear and quadratic models, k-nearest neighbour model, logistic regression model, Higgins and direct scoring systems and two feed-forward artificial neural networks with one and two layers. Cardiovascular, respiratory, neurological, renal, infectious and hemorrhagic complications were defined as morbidity. Training and testing sets each of 545 cases were used. The optimal set of predictors was chosen among a collection of 78 preoperative, intraoperative and postoperative variables by a stepwise procedure. Discrimination and calibration were evaluated by the area under the receiver operating characteristic curve and Hosmer-Lemeshow goodness-of-fit test, respectively. Scoring systems and the logistic regression model required the largest set of predictors, while Bayesian and k-nearest neighbour models were much more parsimonious. In testing data, all models showed acceptable discrimination capacities, however the Bayes quadratic model, using only three predictors, provided the best performance. All models showed satisfactory generalization ability: again the Bayes quadratic model exhibited the best generalization, while artificial neural networks and scoring systems gave the worst results. Finally, poor calibration was obtained when using scoring systems, k-nearest neighbour model and artificial neural networks, while Bayes (after recalibration) and logistic regression models gave adequate results. Although all the predictive models showed acceptable discrimination performance in the example considered, the Bayes and logistic regression models seemed better than the others, because they also had good generalization and calibration. The Bayes quadratic model seemed to be a convincing alternative to the much more usual Bayes linear and logistic regression models. It showed its capacity to identify a minimum core of predictors generally recognized as essential to pragmatically evaluate the risk of developing morbidity after heart surgery.

  2. Determination of riverbank erosion probability using Locally Weighted Logistic Regression

    NASA Astrophysics Data System (ADS)

    Ioannidou, Elena; Flori, Aikaterini; Varouchakis, Emmanouil A.; Giannakis, Georgios; Vozinaki, Anthi Eirini K.; Karatzas, George P.; Nikolaidis, Nikolaos

    2015-04-01

    Riverbank erosion is a natural geomorphologic process that affects the fluvial environment. The most important issue concerning riverbank erosion is the identification of the vulnerable locations. An alternative to the usual hydrodynamic models to predict vulnerable locations is to quantify the probability of erosion occurrence. This can be achieved by identifying the underlying relations between riverbank erosion and the geomorphological or hydrological variables that prevent or stimulate erosion. Thus, riverbank erosion can be determined by a regression model using independent variables that are considered to affect the erosion process. The impact of such variables may vary spatially, therefore, a non-stationary regression model is preferred instead of a stationary equivalent. Locally Weighted Regression (LWR) is proposed as a suitable choice. This method can be extended to predict the binary presence or absence of erosion based on a series of independent local variables by using the logistic regression model. It is referred to as Locally Weighted Logistic Regression (LWLR). Logistic regression is a type of regression analysis used for predicting the outcome of a categorical dependent variable (e.g. binary response) based on one or more predictor variables. The method can be combined with LWR to assign weights to local independent variables of the dependent one. LWR allows model parameters to vary over space in order to reflect spatial heterogeneity. The probabilities of the possible outcomes are modelled as a function of the independent variables using a logistic function. Logistic regression measures the relationship between a categorical dependent variable and, usually, one or several continuous independent variables by converting the dependent variable to probability scores. Then, a logistic regression is formed, which predicts success or failure of a given binary variable (e.g. erosion presence or absence) for any value of the independent variables. The erosion occurrence probability can be calculated in conjunction with the model deviance regarding the independent variables tested. The most straightforward measure for goodness of fit is the G statistic. It is a simple and effective way to study and evaluate the Logistic Regression model efficiency and the reliability of each independent variable. The developed statistical model is applied to the Koiliaris River Basin on the island of Crete, Greece. Two datasets of river bank slope, river cross-section width and indications of erosion were available for the analysis (12 and 8 locations). Two different types of spatial dependence functions, exponential and tricubic, were examined to determine the local spatial dependence of the independent variables at the measurement locations. The results show a significant improvement when the tricubic function is applied as the erosion probability is accurately predicted at all eight validation locations. Results for the model deviance show that cross-section width is more important than bank slope in the estimation of erosion probability along the Koiliaris riverbanks. The proposed statistical model is a useful tool that quantifies the erosion probability along the riverbanks and can be used to assist managing erosion and flooding events. Acknowledgements This work is part of an on-going THALES project (CYBERSENSORS - High Frequency Monitoring System for Integrated Water Resources Management of Rivers). The project has been co-financed by the European Union (European Social Fund - ESF) and Greek national funds through the Operational Program "Education and Lifelong Learning" of the National Strategic Reference Framework (NSRF) - Research Funding Program: THALES. Investing in knowledge society through the European Social Fund.

  3. Estimating the Probability of Rare Events Occurring Using a Local Model Averaging.

    PubMed

    Chen, Jin-Hua; Chen, Chun-Shu; Huang, Meng-Fan; Lin, Hung-Chih

    2016-10-01

    In statistical applications, logistic regression is a popular method for analyzing binary data accompanied by explanatory variables. But when one of the two outcomes is rare, the estimation of model parameters has been shown to be severely biased and hence estimating the probability of rare events occurring based on a logistic regression model would be inaccurate. In this article, we focus on estimating the probability of rare events occurring based on logistic regression models. Instead of selecting a best model, we propose a local model averaging procedure based on a data perturbation technique applied to different information criteria to obtain different probability estimates of rare events occurring. Then an approximately unbiased estimator of Kullback-Leibler loss is used to choose the best one among them. We design complete simulations to show the effectiveness of our approach. For illustration, a necrotizing enterocolitis (NEC) data set is analyzed. © 2016 Society for Risk Analysis.

  4. Logistic models--an odd(s) kind of regression.

    PubMed

    Jupiter, Daniel C

    2013-01-01

    The logistic regression model bears some similarity to the multivariable linear regression with which we are familiar. However, the differences are great enough to warrant a discussion of the need for and interpretation of logistic regression. Copyright © 2013 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.

  5. Unequal views of inequality: Cross-national support for redistribution 1985-2011.

    PubMed

    VanHeuvelen, Tom

    2017-05-01

    This research examines public views on government responsibility to reduce income inequality, support for redistribution. While individual-level correlates of support for redistribution are relatively well understood, many questions remain at the country-level. Therefore, I examine how country-level characteristics affect aggregate support for redistribution. I test explanations of aggregate support using a unique dataset combining 18 waves of the International Social Survey Programme and European Social Survey. Results from mixed-effects logistic regression and fixed-effects linear regression models show two primary and contrasting effects. States that reduce inequality through bundles of tax and transfer policies are rewarded with more supportive publics. In contrast, economic development has a seemingly equivalent and dampening effect on public support. Importantly, the effect of economic development grows at higher levels of development, potentially overwhelming the amplifying effect of state redistribution. My results therefore suggest a fundamental challenge to proponents of egalitarian politics. Copyright © 2016 Elsevier Inc. All rights reserved.

  6. Cross-national differences in the gender gap in subjective health in Europe: does country-level gender equality matter?

    PubMed

    Dahlin, Johanna; Härkönen, Juho

    2013-12-01

    Multiple studies have found that women report being in worse health despite living longer. Gender gaps vary cross-nationally, but relatively little is known about the causes of comparative differences. Existing literature is inconclusive as to whether gender gaps in health are smaller in more gender equal societies. We analyze gender gaps in self-rated health (SRH) and limiting longstanding illness (LLI) with five waves of European Social Survey data for 191,104 respondents from 28 countries. We use means, odds ratios, logistic regressions, and multilevel random slopes logistic regressions. Gender gaps in subjective health vary visibly across Europe. In many countries (especially in Eastern and Southern Europe), women report distinctly worse health, while in others (such as Estonia, Finland, and Great Britain) there are small or no differences. Logistic regressions ran separately for each country revealed that individual-level socioeconomic and demographic variables explain a majority of these gaps in some countries, but contribute little to their understanding in most countries. In yet other countries, men had worse health when these variables were controlled for. Cross-national variation in the gender gaps exists after accounting for individual-level factors. Against expectations, the remaining gaps are not systematically related to societal-level gender inequality in the multilevel analyses. Our findings stress persistent cross-national variability in gender gaps in health and call for further analysis. Copyright © 2013 Elsevier Ltd. All rights reserved.

  7. Sample size estimation for alternating logistic regressions analysis of multilevel randomized community trials of under-age drinking.

    PubMed

    Reboussin, Beth A; Preisser, John S; Song, Eun-Young; Wolfson, Mark

    2012-07-01

    Under-age drinking is an enormous public health issue in the USA. Evidence that community level structures may impact on under-age drinking has led to a proliferation of efforts to change the environment surrounding the use of alcohol. Although the focus of these efforts is to reduce drinking by individual youths, environmental interventions are typically implemented at the community level with entire communities randomized to the same intervention condition. A distinct feature of these trials is the tendency of the behaviours of individuals residing in the same community to be more alike than that of others residing in different communities, which is herein called 'clustering'. Statistical analyses and sample size calculations must account for this clustering to avoid type I errors and to ensure an appropriately powered trial. Clustering itself may also be of scientific interest. We consider the alternating logistic regressions procedure within the population-averaged modelling framework to estimate the effect of a law enforcement intervention on the prevalence of under-age drinking behaviours while modelling the clustering at multiple levels, e.g. within communities and within neighbourhoods nested within communities, by using pairwise odds ratios. We then derive sample size formulae for estimating intervention effects when planning a post-test-only or repeated cross-sectional community-randomized trial using the alternating logistic regressions procedure.

  8. Perceived Risk of Burglary and Fear of Crime: Individual- and Country-Level Mixed Modeling.

    PubMed

    Chon, Don Soo; Wilson, Mary

    2016-02-01

    Given the scarcity of prior studies, the current research introduced country-level variables, along with individual-level ones, to test how they are related to an individual's perceived risk of burglary (PRB) and fear of crime (FC), separately, by using mixed-level logistic regression analyses. The analyses of 104,218 individuals, residing in 50 countries, showed that country-level poverty was positively associated with FC only. However, individual-level variables, such as prior property crime victimization and female gender, had consistently positive relationships with both PRB and FC. However, age group and socioeconomic status were inconsistent between those two models, suggesting that PRB and FC are two different concepts. Finally, no significant difference in the pattern of PRB and FC was found between a highly developed group of countries and a less developed one. © The Author(s) 2014.

  9. Can Predictive Modeling Identify Head and Neck Oncology Patients at Risk for Readmission?

    PubMed

    Manning, Amy M; Casper, Keith A; Peter, Kay St; Wilson, Keith M; Mark, Jonathan R; Collar, Ryan M

    2018-05-01

    Objective Unplanned readmission within 30 days is a contributor to health care costs in the United States. The use of predictive modeling during hospitalization to identify patients at risk for readmission offers a novel approach to quality improvement and cost reduction. Study Design Two-phase study including retrospective analysis of prospectively collected data followed by prospective longitudinal study. Setting Tertiary academic medical center. Subjects and Methods Prospectively collected data for patients undergoing surgical treatment for head and neck cancer from January 2013 to January 2015 were used to build predictive models for readmission within 30 days of discharge using logistic regression, classification and regression tree (CART) analysis, and random forests. One model (logistic regression) was then placed prospectively into the discharge workflow from March 2016 to May 2016 to determine the model's ability to predict which patients would be readmitted within 30 days. Results In total, 174 admissions had descriptive data. Thirty-two were excluded due to incomplete data. Logistic regression, CART, and random forest predictive models were constructed using the remaining 142 admissions. When applied to 106 consecutive prospective head and neck oncology patients at the time of discharge, the logistic regression model predicted readmissions with a specificity of 94%, a sensitivity of 47%, a negative predictive value of 90%, and a positive predictive value of 62% (odds ratio, 14.9; 95% confidence interval, 4.02-55.45). Conclusion Prospectively collected head and neck cancer databases can be used to develop predictive models that can accurately predict which patients will be readmitted. This offers valuable support for quality improvement initiatives and readmission-related cost reduction in head and neck cancer care.

  10. Decreased levels of sRAGE in follicular fluid from patients with PCOS.

    PubMed

    Wang, BiJun; Li, Jing; Yang, QingLing; Zhang, FuLi; Hao, MengMeng; Guo, YiHong

    2017-03-01

    This study aimed to explore the association between soluble receptor for advanced glycation end products (sRAGE) levels in follicular fluid and the number of oocytes retrieved and to evaluate the effect of sRAGE on vascular endothelial growth factor (VEGF) in granulosa cells in patients with polycystic ovarian syndrome (PCOS). Two sets of experiments were performed in this study. In part one, sRAGE and VEGF protein levels in follicular fluid samples from 39 patients with PCOS and 35 non-PCOS patients were measured by ELISA. In part two, ovarian granulosa cells were isolated from an additional 10 patients with PCOS and cultured. VEGF and SP1 mRNA and protein levels, as well as pAKT levels, were detected by real-time PCR and Western blotting after cultured cells were treated with different concentrations of sRAGE. Compared with the non-PCOS patients, patients with PCOS had lower sRAGE levels in follicular fluid. Multi-adjusted regression analysis showed that high sRAGE levels in follicular fluid predicted a lower Gn dose, more oocytes retrieved, and a better IVF outcome in the non-PCOS group. Logistic regression analysis showed that higher sRAGE levels predicted favorably IVF outcomes in the non-PCOS group. Multi-adjusted regression analysis also showed that high sRAGE levels in follicular fluid predicted a lower Gn dose in the PCOS group. Treating granulosa cells isolated from patients with PCOS with recombinant sRAGE decreased VEGF and SP1 mRNA and protein expression and pAKT levels in a dose-dependent manner. © 2017 Society for Reproduction and Fertility.

  11. Logistic regression analysis of factors associated with avascular necrosis of the femoral head following femoral neck fractures in middle-aged and elderly patients.

    PubMed

    Ai, Zi-Sheng; Gao, You-Shui; Sun, Yuan; Liu, Yue; Zhang, Chang-Qing; Jiang, Cheng-Hua

    2013-03-01

    Risk factors for femoral neck fracture-induced avascular necrosis of the femoral head have not been elucidated clearly in middle-aged and elderly patients. Moreover, the high incidence of screw removal in China and its effect on the fate of the involved femoral head require statistical methods to reflect their intrinsic relationship. Ninety-nine patients older than 45 years with femoral neck fracture were treated by internal fixation between May 1999 and April 2004. Descriptive analysis, interaction analysis between associated factors, single factor logistic regression, multivariate logistic regression, and detailed interaction analysis were employed to explore potential relationships among associated factors. Avascular necrosis of the femoral head was found in 15 cases (15.2 %). Age × the status of implants (removal vs. maintenance) and gender × the timing of reduction were interactive according to two-factor interactive analysis. Age, the displacement of fractures, the quality of reduction, and the status of implants were found to be significant factors in single factor logistic regression analysis. Age, age × the status of implants, and the quality of reduction were found to be significant factors in multivariate logistic regression analysis. In fine interaction analysis after multivariate logistic regression analysis, implant removal was the most important risk factor for avascular necrosis in 56-to-85-year-old patients, with a risk ratio of 26.00 (95 % CI = 3.076-219.747). The middle-aged and elderly have less incidence of avascular necrosis of the femoral head following femoral neck fractures treated by cannulated screws. The removal of cannulated screws can induce a significantly high incidence of avascular necrosis of the femoral head in elderly patients, while a high-quality reduction is helpful to reduce avascular necrosis.

  12. Combining logistic regression with classification and regression tree to predict quality of care in a home health nursing data set.

    PubMed

    Guo, Huey-Ming; Shyu, Yea-Ing Lotus; Chang, Her-Kun

    2006-01-01

    In this article, the authors provide an overview of a research method to predict quality of care in home health nursing data set. The results of this study can be visualized through classification an regression tree (CART) graphs. The analysis was more effective, and the results were more informative since the home health nursing dataset was analyzed with a combination of the logistic regression and CART, these two techniques complete each other. And the results more informative that more patients' characters were related to quality of care in home care. The results contributed to home health nurse predict patient outcome in case management. Improved prediction is needed for interventions to be appropriately targeted for improved patient outcome and quality of care.

  13. Bidirectional relationship between renal function and periodontal disease in older Japanese women.

    PubMed

    Yoshihara, Akihiro; Iwasaki, Masanori; Miyazaki, Hideo; Nakamura, Kazutoshi

    2016-09-01

    The purpose of this study was to evaluate the reciprocal effects of chronic kidney disease (CKD) and periodontal disease. A total of 332 postmenopausal never smoking women were enrolled, and their serum high-sensitivity C-reactive protein, serum osteocalcin and serum cystatin C levels were measured. Poor renal function was defined as serum cystatin C > 0.91 mg/l. Periodontal disease markers, including clinical attachment level and the periodontal inflamed surface area (PISA), were also evaluated. Logistic regression analysis was conducted to evaluate the relationships between renal function and periodontal disease markers, serum osteocalcin level and hsCRP level. The prevalence-rate ratios (PRRs) on multiple Poisson regression analyses were determined to evaluate the relationships between periodontal disease markers and serum osteocalcin, serum cystatin C and serum hsCRP levels. On logistic regression analysis, PISA was significantly associated with serum cystatin C level. The odds ratio for serum cystatin C level was 2.44 (p = 0.011). The PRR between serum cystatin C level and periodontal disease markers such as number of sites with clinical attachment level ≥6 mm was significantly positive (3.12, p < 0.001). Similar tendencies were shown for serum osteocalcin level. This study suggests that CKD and periodontal disease can have reciprocal effects. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  14. The relationship of bone and blood lead to hypertension: Further analyses of the normative aging study data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hu, H.; Kim, Rokho; Korrick, S.

    1996-12-31

    In an earlier report based on participants in the Veterans Administration Normative Aging Study, we found a significant association between the risk of hypertension and lead levels in tibia. To examine the possible confounding effects of education and occupation, we considered in this study five levels of education and three levels of occupation as independent variables in the statistical model. Of 1,171 active subjects seen between August 1991 and December 1994, 563 provided complete data for this analysis. In the initial logistic regression model, acre and body mass index, family history of hypertension, and dietary sodium intake, but neither cumulativemore » smoking nor alcohol ingestion, conferred increased odds ratios for being hypertensive that were statistically significant. When the lead biomarkers were added separately to this initial logistic model, tibia lead and patella lead levels were associated with significantly elevated odds ratios for hypertension. In the final backward elimination logistic regression model that included categorical variables for education and occupation, the only variables retained were body mass index, family history of hypertension, and tibia lead level. We conclude that education and occupation variables were not confounding the association between the lead biomarkers and hypertension that we reported previously. 27 refs., 3 tabs.« less

  15. Transmission Risks of Schistosomiasis Japonica: Extraction from Back-propagation Artificial Neural Network and Logistic Regression Model

    PubMed Central

    Xu, Jun-Fang; Xu, Jing; Li, Shi-Zhu; Jia, Tia-Wu; Huang, Xi-Bao; Zhang, Hua-Ming; Chen, Mei; Yang, Guo-Jing; Gao, Shu-Jing; Wang, Qing-Yun; Zhou, Xiao-Nong

    2013-01-01

    Background The transmission of schistosomiasis japonica in a local setting is still poorly understood in the lake regions of the People's Republic of China (P. R. China), and its transmission patterns are closely related to human, social and economic factors. Methodology/Principal Findings We aimed to apply the integrated approach of artificial neural network (ANN) and logistic regression model in assessment of transmission risks of Schistosoma japonicum with epidemiological data collected from 2339 villagers from 1247 households in six villages of Jiangling County, P.R. China. By using the back-propagation (BP) of the ANN model, 16 factors out of 27 factors were screened, and the top five factors ranked by the absolute value of mean impact value (MIV) were mainly related to human behavior, i.e. integration of water contact history and infection history, family with past infection, history of water contact, infection history, and infection times. The top five factors screened by the logistic regression model were mainly related to the social economics, i.e. village level, economic conditions of family, age group, education level, and infection times. The risk of human infection with S. japonicum is higher in the population who are at age 15 or younger, or with lower education, or with the higher infection rate of the village, or with poor family, and in the population with more than one time to be infected. Conclusion/Significance Both BP artificial neural network and logistic regression model established in a small scale suggested that individual behavior and socioeconomic status are the most important risk factors in the transmission of schistosomiasis japonica. It was reviewed that the young population (≤15) in higher-risk areas was the main target to be intervened for the disease transmission control. PMID:23556015

  16. Multivariate logistic regression for predicting total culturable virus presence at the intake of a potable-water treatment plant: novel application of the atypical coliform/total coliform ratio.

    PubMed

    Black, L E; Brion, G M; Freitas, S J

    2007-06-01

    Predicting the presence of enteric viruses in surface waters is a complex modeling problem. Multiple water quality parameters that indicate the presence of human fecal material, the load of fecal material, and the amount of time fecal material has been in the environment are needed. This paper presents the results of a multiyear study of raw-water quality at the inlet of a potable-water plant that related 17 physical, chemical, and biological indices to the presence of enteric viruses as indicated by cytopathic changes in cell cultures. It was found that several simple, multivariate logistic regression models that could reliably identify observations of the presence or absence of total culturable virus could be fitted. The best models developed combined a fecal age indicator (the atypical coliform [AC]/total coliform [TC] ratio), the detectable presence of a human-associated sterol (epicoprostanol) to indicate the fecal source, and one of several fecal load indicators (the levels of Giardia species cysts, coliform bacteria, and coprostanol). The best fit to the data was found when the AC/TC ratio, the presence of epicoprostanol, and the density of fecal coliform bacteria were input into a simple, multivariate logistic regression equation, resulting in 84.5% and 78.6% accuracies for the identification of the presence and absence of total culturable virus, respectively. The AC/TC ratio was the most influential input variable in all of the models generated, but producing the best prediction required additional input related to the fecal source and the fecal load. The potential for replacing microbial indicators of fecal load with levels of coprostanol was proposed and evaluated by multivariate logistic regression modeling for the presence and absence of virus.

  17. [A survey of correlation between serum 25-hydroxyvitamin D levels and dyslipidemia rlsk among middle-aged individuals in Beijing].

    PubMed

    Zhang, L L; Lu, Y H; Cheng, X L; Liu, M Y; Sun, B R; Li, C L

    2016-08-01

    To evaluate vitamin D status in middle-aged subjects in Beijing and explore the correlation between serum 25-hydroxyvitamin D[25(OH)D] levels and dyslipidemia. A total of 448 individuals over 40 years old were enrolled in the cross-sectional survey. The general information, blood biochemical and lipid profiles and serum 25(OH)D levels were collected. The subjects were either divided into two groups (the dyslipidemia group and the non-dyslipidemia group) based on the lipid levels, or four groups according to quartiles of 25(OH)D levels. The association between 25(OH)D levels and dyslipidemia risk was analyzed by a logistic regression analysis. A total of 234 cases were in dyslipidemia group, which accounted for 52.23% of the subjects. The serum 25(OH)D levels were significantly lower in the dyslipidemia group than in the non-dyslipidemia group both in men and in women (all P<0.05). The median serum 25(OH)D level in the total subjects was 15.7 (12.2, 20.1)μg/L with 91.1% subjects of serum 25(OH)D level<30 μg/L. The proportion of subjects with dyslipidemia (high TC, high TG, high LDL-C, or low HDL-C) increased with the decrease of 25(OH)D level quartiles (P<0.05). After adjustment of confounding factors, the logistic regression analysis showed that subjects in the lowest 25(OH) D quartile group had 143% higher risks for dyslipidemia than those in the highest quartile group. These findings indicate that 25(OH)D insufficiency is highly prevalent among middle-aged individuals and it may be associated with the risk of dyslipidemia.

  18. Environmental factors and flow paths related to Escherichia coli concentrations at two beaches on Lake St. Clair, Michigan, 2002–2005

    USGS Publications Warehouse

    Holtschlag, David J.; Shively, Dawn; Whitman, Richard L.; Haack, Sheridan K.; Fogarty, Lisa R.

    2008-01-01

    Regression analyses and hydrodynamic modeling were used to identify environmental factors and flow paths associated with Escherichia coli (E. coli) concentrations at Memorial and Metropolitan Beaches on Lake St. Clair in Macomb County, Mich. Lake St. Clair is part of the binational waterway between the United States and Canada that connects Lake Huron with Lake Erie in the Great Lakes Basin. Linear regression, regression-tree, and logistic regression models were developed from E. coli concentration and ancillary environmental data. Linear regression models on log10 E. coli concentrations indicated that rainfall prior to sampling, water temperature, and turbidity were positively associated with bacteria concentrations at both beaches. Flow from Clinton River, changes in water levels, wind conditions, and log10 E. coli concentrations 2 days before or after the target bacteria concentrations were statistically significant at one or both beaches. In addition, various interaction terms were significant at Memorial Beach. Linear regression models for both beaches explained only about 30 percent of the variability in log10 E. coli concentrations. Regression-tree models were developed from data from both Memorial and Metropolitan Beaches but were found to have limited predictive capability in this study. The results indicate that too few observations were available to develop reliable regression-tree models. Linear logistic models were developed to estimate the probability of E. coli concentrations exceeding 300 most probable number (MPN) per 100 milliliters (mL). Rainfall amounts before bacteria sampling were positively associated with exceedance probabilities at both beaches. Flow of Clinton River, turbidity, and log10 E. coli concentrations measured before or after the target E. coli measurements were related to exceedances at one or both beaches. The linear logistic models were effective in estimating bacteria exceedances at both beaches. A receiver operating characteristic (ROC) analysis was used to determine cut points for maximizing the true positive rate prediction while minimizing the false positive rate. A two-dimensional hydrodynamic model was developed to simulate horizontal current patterns on Lake St. Clair in response to wind, flow, and water-level conditions at model boundaries. Simulated velocity fields were used to track hypothetical massless particles backward in time from the beaches along flow paths toward source areas. Reverse particle tracking for idealized steady-state conditions shows changes in expected flow paths and traveltimes with wind speeds and directions from 24 sectors. The results indicate that three to four sets of contiguous wind sectors have similar effects on flow paths in the vicinity of the beaches. In addition, reverse particle tracking was used for transient conditions to identify expected flow paths for 10 E. coli sampling events in 2004. These results demonstrate the ability to track hypothetical particles from the beaches, backward in time, to likely source areas. This ability, coupled with a greater frequency of bacteria sampling, may provide insight into changes in bacteria concentrations between source and sink areas.

  19. Artificial neural networks predict the incidence of portosplenomesenteric venous thrombosis in patients with acute pancreatitis.

    PubMed

    Fei, Y; Hu, J; Li, W-Q; Wang, W; Zong, G-Q

    2017-03-01

    Essentials Predicting the occurrence of portosplenomesenteric vein thrombosis (PSMVT) is difficult. We studied 72 patients with acute pancreatitis. Artificial neural networks modeling was more accurate than logistic regression in predicting PSMVT. Additional predictive factors may be incorporated into artificial neural networks. Objective To construct and validate artificial neural networks (ANNs) for predicting the occurrence of portosplenomesenteric venous thrombosis (PSMVT) and compare the predictive ability of the ANNs with that of logistic regression. Methods The ANNs and logistic regression modeling were constructed using simple clinical and laboratory data of 72 acute pancreatitis (AP) patients. The ANNs and logistic modeling were first trained on 48 randomly chosen patients and validated on the remaining 24 patients. The accuracy and the performance characteristics were compared between these two approaches by SPSS17.0 software. Results The training set and validation set did not differ on any of the 11 variables. After training, the back propagation network training error converged to 1 × 10 -20 , and it retained excellent pattern recognition ability. When the ANNs model was applied to the validation set, it revealed a sensitivity of 80%, specificity of 85.7%, a positive predictive value of 77.6% and negative predictive value of 90.7%. The accuracy was 83.3%. Differences could be found between ANNs modeling and logistic regression modeling in these parameters (10.0% [95% CI, -14.3 to 34.3%], 14.3% [95% CI, -8.6 to 37.2%], 15.7% [95% CI, -9.9 to 41.3%], 11.8% [95% CI, -8.2 to 31.8%], 22.6% [95% CI, -1.9 to 47.1%], respectively). When ANNs modeling was used to identify PSMVT, the area under receiver operating characteristic curve was 0.849 (95% CI, 0.807-0.901), which demonstrated better overall properties than logistic regression modeling (AUC = 0.716) (95% CI, 0.679-0.761). Conclusions ANNs modeling was a more accurate tool than logistic regression in predicting the occurrence of PSMVT following AP. More clinical factors or biomarkers may be incorporated into ANNs modeling to improve its predictive ability. © 2016 International Society on Thrombosis and Haemostasis.

  20. Comparison between students and residents on determinants of willingness to separate waste and waste separation behaviour in Zhengzhou, China.

    PubMed

    Dai, Xiaoping; Han, Yuping; Zhang, Xiaohong; Hu, Wei; Huang, Liangji; Duan, Wenpei; Li, Siyi; Liu, Xiaolu; Wang, Qian

    2017-09-01

    A better understanding of willingness to separate waste and waste separation behaviour can aid the design and improvement of waste management policies. Based on the intercept questionnaire survey data of undergraduate students and residents in Zhengzhou City of China, this article compared factors affecting the willingness and behaviour of students and residents to participate in waste separation using two binary logistic regression models. Improvement opportunities for waste separation were also discussed. Binary logistic regression results indicate that knowledge of and attitude to waste separation and acceptance of waste education significantly affect the willingness of undergraduate students to separate waste, and demographic factors, such as gender, age, education level, and income, significantly affect the willingness of residents to do so. Presence of waste-specific bins and attitude to waste separation are drivers of waste separation behaviour for both students and residents. Improved education about waste separation and facilities are effective to stimulate waste separation, and charging on unsorted waste may be an effective way to improve it in Zhengzhou.

  1. Risk and protective factors of posttraumatic stress disorder among African American women living with HIV.

    PubMed

    Andu, Eaden; Wagenaar, Brad H; Kemp, Chris G; Nevin, Paul E; Simoni, Jane M; Andrasik, Michele; Cohn, Susan E; French, Audrey L; Rao, Deepa

    2018-04-26

    We sought to examine risk and protective factors for Posttraumatic Stress Disorder (PTSD) among African American women living with HIV. This is a cross-sectional analysis of baseline data from a randomized trial of an HIV stigma reduction intervention. We examined data from two-hundred and thirty-nine African American women living with HIV. We examined whether age, marital status, level of education, internalized HIV-related stigma, and social support as potential protective and risk factors for PTSD symptoms using logistic regression. We analyzed bi-variate associations between each variable and PTSD symptoms, and constructed a multivariate logistic regression model adjusting for all variables. We found 67% reported clinically significant PTSD symptoms at baseline. Our results suggest that age, education, and internalized stigma were found to be associated with PTSD symptoms (p < 0.001), with older age and more education as protective factors and stigma as a risk factor for PTSD. Therefore, understanding this relationship may help improve assessment and treatment through evidence- based and trauma-informed strategies.

  2. Correlates of HIV knowledge and Sexual risk behaviors among Female Military Personnel

    PubMed Central

    Essien, E. James; Monjok, Emmanuel; Chen, Hua; Abughosh, Susan; Ekong, Ernest; Peters, Ronald J.; Holmes, Laurens; Holstad, Marcia M.; Mgbere, Osaro

    2010-01-01

    Objective Uniformed services personnel are at an increased risk of HIV infection. We examined the HIV/AIDS knowledge and sexual risk behaviors among female military personnel to determine the correlates of HIV risk behaviors in this population. Method The study used a cross-sectional design to examine HIV/AIDS knowledge and sexual risk behaviors in a sample of 346 females drawn from two military cantonments in Southwestern Nigeria. Data was collected between 2006 and 2008. Using bivariate analysis and multivariate logistic regression, HIV/AIDS knowledge and sexual behaviors were described in relation to socio-demographic characteristics of the participants. Results Multivariate logistic regression analysis revealed that level of education and knowing someone with HIV/AIDS were significant (p<0.05) predictors of HIV knowledge in this sample. HIV prevention self-efficacy was significantly (P<0.05) predicted by annual income and race/ethnicity. Condom use attitudes were also significantly (P<0.05) associated with number of children, annual income, and number of sexual partners. Conclusion Data indicates the importance of incorporating these predictor variables into intervention designs. PMID:20387111

  3. Comparison of four methods for deriving hospital standardised mortality ratios from a single hierarchical logistic regression model.

    PubMed

    Mohammed, Mohammed A; Manktelow, Bradley N; Hofer, Timothy P

    2016-04-01

    There is interest in deriving case-mix adjusted standardised mortality ratios so that comparisons between healthcare providers, such as hospitals, can be undertaken in the controversial belief that variability in standardised mortality ratios reflects quality of care. Typically standardised mortality ratios are derived using a fixed effects logistic regression model, without a hospital term in the model. This fails to account for the hierarchical structure of the data - patients nested within hospitals - and so a hierarchical logistic regression model is more appropriate. However, four methods have been advocated for deriving standardised mortality ratios from a hierarchical logistic regression model, but their agreement is not known and neither do we know which is to be preferred. We found significant differences between the four types of standardised mortality ratios because they reflect a range of underlying conceptual issues. The most subtle issue is the distinction between asking how an average patient fares in different hospitals versus how patients at a given hospital fare at an average hospital. Since the answers to these questions are not the same and since the choice between these two approaches is not obvious, the extent to which profiling hospitals on mortality can be undertaken safely and reliably, without resolving these methodological issues, remains questionable. © The Author(s) 2012.

  4. Radiomorphometric analysis of frontal sinus for sex determination.

    PubMed

    Verma, Saumya; Mahima, V G; Patil, Karthikeya

    2014-09-01

    Sex determination of unknown individuals carries crucial significance in forensic research, in cases where fragments of skull persist with no likelihood of identification based on dental arch. In these instances sex determination becomes important to rule out certain number of possibilities instantly and helps in establishing a biological profile of human remains. The aim of the study is to evaluate a mathematical method based on logistic regression analysis capable of ascertaining the sex of individuals in the South Indian population. The study was conducted in the department of Oral Medicine and Radiology. The right and left areas, maximum height, width of frontal sinus were determined in 100 Caldwell views of 50 women and 50 men aged 20 years and above, with the help of Vernier callipers and a square grid with 1 square measuring 1mm(2) in area. Student's t-test, logistic regression analysis. The mean values of variables were greater in men, based on Student's t-test at 5% level of significance. The mathematical model based on logistic regression analysis gave percentage agreement of total area to correctly predict the female gender as 55.2%, of right area as 60.9% and of left area as 55.2%. The areas of the frontal sinus and the logistic regression proved to be unreliable in sex determination. (Logit = 0.924 - 0.00217 × right area).

  5. Predicting U.S. Army Reserve Unit Manning Using Market Demographics

    DTIC Science & Technology

    2015-06-01

    develops linear regression , classification tree, and logistic regression models to determine the ability of the location to support manning requirements... logistic regression model delivers predictive results that allow decision-makers to identify locations with a high probability of meeting unit...manning requirements. The recommendation of this thesis is that the USAR implement the logistic regression model. 14. SUBJECT TERMS U.S

  6. Analyzing Student Learning Outcomes: Usefulness of Logistic and Cox Regression Models. IR Applications, Volume 5

    ERIC Educational Resources Information Center

    Chen, Chau-Kuang

    2005-01-01

    Logistic and Cox regression methods are practical tools used to model the relationships between certain student learning outcomes and their relevant explanatory variables. The logistic regression model fits an S-shaped curve into a binary outcome with data points of zero and one. The Cox regression model allows investigators to study the duration…

  7. [Two-level logistic modeling analysis on the factors that influence birth in hospitals in poor rural areas of Sichuan province].

    PubMed

    Yu, Chuan; Li, Xiao-song

    2008-11-01

    To identify the determinants of birth in hospitals in the poor rural areas. A questionnaire survey in eight poor counties in Sichuan province was conducted. Multilevel logistic regression analysis was performed to identify the factors that influenced birth in hospitals. Hospitals delivered 61.4% of babies in the selected counties. Education, eligibility to poverty relief, numbers of pre-natal examinations and abnormalities found in pre-natal examinations had a significant impact on birth in hospitals. Education of women and medical relief in the poor rural areas need to be strengthened to increase the proportion of babies delivered in hospitals in the poor rural areas. Systematic management of pregnant women and increased pre-natal examinations could also contribute to hospital delivery of babies.

  8. An appraisal of convergence failures in the application of logistic regression model in published manuscripts.

    PubMed

    Yusuf, O B; Bamgboye, E A; Afolabi, R F; Shodimu, M A

    2014-09-01

    Logistic regression model is widely used in health research for description and predictive purposes. Unfortunately, most researchers are sometimes not aware that the underlying principles of the techniques have failed when the algorithm for maximum likelihood does not converge. Young researchers particularly postgraduate students may not know why separation problem whether quasi or complete occurs, how to identify it and how to fix it. This study was designed to critically evaluate convergence issues in articles that employed logistic regression analysis published in an African Journal of Medicine and medical sciences between 2004 and 2013. Problems of quasi or complete separation were described and were illustrated with the National Demographic and Health Survey dataset. A critical evaluation of articles that employed logistic regression was conducted. A total of 581 articles was reviewed, of which 40 (6.9%) used binary logistic regression. Twenty-four (60.0%) stated the use of logistic regression model in the methodology while none of the articles assessed model fit. Only 3 (12.5%) properly described the procedures. Of the 40 that used the logistic regression model, the problem of convergence occurred in 6 (15.0%) of the articles. Logistic regression tends to be poorly reported in studies published between 2004 and 2013. Our findings showed that the procedure may not be well understood by researchers since very few described the process in their reports and may be totally unaware of the problem of convergence or how to deal with it.

  9. Estimating time-varying exposure-outcome associations using case-control data: logistic and case-cohort analyses.

    PubMed

    Keogh, Ruth H; Mangtani, Punam; Rodrigues, Laura; Nguipdop Djomo, Patrick

    2016-01-05

    Traditional analyses of standard case-control studies using logistic regression do not allow estimation of time-varying associations between exposures and the outcome. We present two approaches which allow this. The motivation is a study of vaccine efficacy as a function of time since vaccination. Our first approach is to estimate time-varying exposure-outcome associations by fitting a series of logistic regressions within successive time periods, reusing controls across periods. Our second approach treats the case-control sample as a case-cohort study, with the controls forming the subcohort. In the case-cohort analysis, controls contribute information at all times they are at risk. Extensions allow left truncation, frequency matching and, using the case-cohort analysis, time-varying exposures. Simulations are used to investigate the methods. The simulation results show that both methods give correct estimates of time-varying effects of exposures using standard case-control data. Using the logistic approach there are efficiency gains by reusing controls over time and care should be taken over the definition of controls within time periods. However, using the case-cohort analysis there is no ambiguity over the definition of controls. The performance of the two analyses is very similar when controls are used most efficiently under the logistic approach. Using our methods, case-control studies can be used to estimate time-varying exposure-outcome associations where they may not previously have been considered. The case-cohort analysis has several advantages, including that it allows estimation of time-varying associations as a continuous function of time, while the logistic regression approach is restricted to assuming a step function form for the time-varying association.

  10. Sociodemographic factors associated with pregnant women's level of knowledge about oral health

    PubMed Central

    Barbieri, Wander; Peres, Stela Verzinhasse; Pereira, Carla de Britto; Peres, João; de Sousa, Maria da Luz Rosário; Cortellazzi, Karine Laura

    2018-01-01

    ABSTRACT Objective To evaluate knowledge on oral health and associated sociodemographic factors in pregnant women. Methods A cross-sectional study with a sample of 195 pregnant women seen at the Primary Care Unit Paraisópolis I, in São Paulo (SP), Brazil. For statistical analysis, χ2 or Fisher's exact test and multiple logistic regression were used. A significance level of 5% was used in all analyses. Results Schooling level equal to or greater than 8 years and having one or two children were associated with an adequate knowledge about oral health. Conclusion Oral health promotion strategies during prenatal care should take into account sociodemographic aspects. PMID:29694612

  11. Modeling the dynamics of urban growth using multinomial logistic regression: a case study of Jiayu County, Hubei Province, China

    NASA Astrophysics Data System (ADS)

    Nong, Yu; Du, Qingyun; Wang, Kun; Miao, Lei; Zhang, Weiwei

    2008-10-01

    Urban growth modeling, one of the most important aspects of land use and land cover change study, has attracted substantial attention because it helps to comprehend the mechanisms of land use change thus helps relevant policies made. This study applied multinomial logistic regression to model urban growth in the Jiayu county of Hubei province, China to discover the relationship between urban growth and the driving forces of which biophysical and social-economic factors are selected as independent variables. This type of regression is similar to binary logistic regression, but it is more general because the dependent variable is not restricted to two categories, as those previous studies did. The multinomial one can simulate the process of multiple land use competition between urban land, bare land, cultivated land and orchard land. Taking the land use type of Urban as reference category, parameters could be estimated with odds ratio. A probability map is generated from the model to predict where urban growth will occur as a result of the computation.

  12. Logistic Regression: Concept and Application

    ERIC Educational Resources Information Center

    Cokluk, Omay

    2010-01-01

    The main focus of logistic regression analysis is classification of individuals in different groups. The aim of the present study is to explain basic concepts and processes of binary logistic regression analysis intended to determine the combination of independent variables which best explain the membership in certain groups called dichotomous…

  13. Classification of pregnancies of unknown location according to four different hCG-based protocols.

    PubMed

    Fistouris, J; Bergh, C; Strandell, A

    2016-10-01

    How do four protocols based on serial human chorionic gonadotropin (hCG) measurements perform when classifying pregnancies of unknown location (PULs) as low or high risk of being an ectopic pregnancy (EP)? The use of cut-offs in hCG level changes published by NICE, and a logistic regression model, M4, correctly classify more PULs as high risk, compared with two other protocols. A logistic regression model, M4, based on the mean of two consecutive hCG values and the hCG ratio (hCG 48 h/hCG 0 h) that classify PULs into low- and high-risk groups for triage purposes, identifies more EPs than a protocol using the cut-offs between a 13% decline and a 66% rise in hCG levels over 48 h. A retrospective comparative study of four different hCG-based protocols classifying PULs as low or high risk of being an EP was performed at a gynaecological emergency unit over 3 years. We identified 915 women with a PUL. Initial transvaginal ultrasonography (TVS) findings categorised 187 of the PULs as probable intrauterine pregnancies (IUPs) and 16 as probable EPs. The rate of change in hCG levels over 48 h was calculated for each patient and subjected to three different hCG threshold intervals and a logistic regression model for outcome prediction. Each PUL was subsequently dichotomised to either low-risk (i.e. failed PUL/IUP) or high-risk (i.e. EP) classification, which allowed us to compare the diagnostic performance. In 'Protocol A', a PUL was classified as low risk if >13% hCG level decline or >66% hCG level rise was achieved; otherwise, the PUL was classified as high risk of being an EP. 'Protocol B' classified a PUL as low or high risk using cut-offs of 35-50% declining hCG levels and of 53% rising hCG levels. Similarly, 'Protocol C' used hCG level cut-offs published by NICE, 50% for declining hCG levels and 63% for rising hCG levels. Finally, if a logistic regression model 'Protocol M4' calculated a ≥5% risk of the PUL being an EP, it was classified as high risk, and otherwise the PUL was classified as low risk. When the time interval between two hCG measurements failed to meet an exact 48 h, extrapolation and interpolation of hCG values was made, using log linear transformation. Protocols A, B, C and M4 classified 73, 66, 55 and 56% of PULs as low risk. The sensitivity for protocols A, B, C and M4 was 68% (95% confidence interval (CI) 61-75%), 81% (74-86%), 87% (82-92%) and 88% (83-93%), respectively. The specificity was 82% (80-85%), 77% (74-80%), 66% (62-69%) and 67% (63-70%) for protocols A, B, C and M4, respectively. All comparisons of sensitivity and specificity between the protocols were statistically significant except for protocol C versus protocol M4. In protocol C, 87% (66-97%) of misclassified EPs had rising hCG levels, compared with 19% (6-41%) for protocol M4 (P < 0.01). In a secondary analysis excluding probable IUPs and probable EPs, the results for 712 PULs were analysed. The sensitivity subsequently remained stable for all protocols. Protocol M4 reached a 78% (74-81%) specificity, which was significantly higher than 70% (66-74%) for protocol C (P = 0.01) and protocol M4 classified 63% of PULs as low risk compared with 58% for protocol C. The retrospective design of the study is a limitation. The results are derived from a population where laparoscopy played an important role in PUL management and diagnosis of EPs, although it did reflect real clinical practice. Although we tried to adhere to definitions of PUL and final outcomes as in previous studies and a recent consensus statement, potential differences in this regard must be acknowledged. Where the time interval between two serial hCG measurements deviated from 48 h we estimated 48 h hCG values. A logistic regression model, M4, classifies more PULs correctly as low risk in a selected PUL population without probable IUPs and EPs and identifies as many EPs, in comparison with the cut-offs available in the NICE guideline. This advantage for model M4 may result in a reduction of unnecessary follow-up visits, when fewer low-risk PULs are misclassified as high risk. These findings, however, ought to be clarified in a randomised controlled trial. The study was supported by LUA/ALF grant No. 70940. There are no competing interests. © The Author 2016. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  14. Prescription-drug-related risk in driving: comparing conventional and lasso shrinkage logistic regressions.

    PubMed

    Avalos, Marta; Adroher, Nuria Duran; Lagarde, Emmanuel; Thiessard, Frantz; Grandvalet, Yves; Contrand, Benjamin; Orriols, Ludivine

    2012-09-01

    Large data sets with many variables provide particular challenges when constructing analytic models. Lasso-related methods provide a useful tool, although one that remains unfamiliar to most epidemiologists. We illustrate the application of lasso methods in an analysis of the impact of prescribed drugs on the risk of a road traffic crash, using a large French nationwide database (PLoS Med 2010;7:e1000366). In the original case-control study, the authors analyzed each exposure separately. We use the lasso method, which can simultaneously perform estimation and variable selection in a single model. We compare point estimates and confidence intervals using (1) a separate logistic regression model for each drug with a Bonferroni correction and (2) lasso shrinkage logistic regression analysis. Shrinkage regression had little effect on (bias corrected) point estimates, but led to less conservative results, noticeably for drugs with moderate levels of exposure. Carbamates, carboxamide derivative and fatty acid derivative antiepileptics, drugs used in opioid dependence, and mineral supplements of potassium showed stronger associations. Lasso is a relevant method in the analysis of databases with large number of exposures and can be recommended as an alternative to conventional strategies.

  15. Data mining: Potential applications in research on nutrition and health.

    PubMed

    Batterham, Marijka; Neale, Elizabeth; Martin, Allison; Tapsell, Linda

    2017-02-01

    Data mining enables further insights from nutrition-related research, but caution is required. The aim of this analysis was to demonstrate and compare the utility of data mining methods in classifying a categorical outcome derived from a nutrition-related intervention. Baseline data (23 variables, 8 categorical) on participants (n = 295) in an intervention trial were used to classify participants in terms of meeting the criteria of achieving 10 000 steps per day. Results from classification and regression trees (CARTs), random forests, adaptive boosting, logistic regression, support vector machines and neural networks were compared using area under the curve (AUC) and error assessments. The CART produced the best model when considering the AUC (0.703), overall error (18%) and within class error (28%). Logistic regression also performed reasonably well compared to the other models (AUC 0.675, overall error 23%, within class error 36%). All the methods gave different rankings of variables' importance. CART found that body fat, quality of life using the SF-12 Physical Component Summary (PCS) and the cholesterol: HDL ratio were the most important predictors of meeting the 10 000 steps criteria, while logistic regression showed the SF-12PCS, glucose levels and level of education to be the most significant predictors (P ≤ 0.01). Differing outcomes suggest caution is required with a single data mining method, particularly in a dataset with nonlinear relationships and outliers and when exploring relationships that were not the primary outcomes of the research. © 2017 Dietitians Association of Australia.

  16. An Entropy-Based Measure for Assessing Fuzziness in Logistic Regression

    PubMed Central

    Weiss, Brandi A.; Dardick, William

    2015-01-01

    This article introduces an entropy-based measure of data–model fit that can be used to assess the quality of logistic regression models. Entropy has previously been used in mixture-modeling to quantify how well individuals are classified into latent classes. The current study proposes the use of entropy for logistic regression models to quantify the quality of classification and separation of group membership. Entropy complements preexisting measures of data–model fit and provides unique information not contained in other measures. Hypothetical data scenarios, an applied example, and Monte Carlo simulation results are used to demonstrate the application of entropy in logistic regression. Entropy should be used in conjunction with other measures of data–model fit to assess how well logistic regression models classify cases into observed categories. PMID:29795897

  17. Logistic regression applied to natural hazards: rare event logistic regression with replications

    NASA Astrophysics Data System (ADS)

    Guns, M.; Vanacker, V.

    2012-06-01

    Statistical analysis of natural hazards needs particular attention, as most of these phenomena are rare events. This study shows that the ordinary rare event logistic regression, as it is now commonly used in geomorphologic studies, does not always lead to a robust detection of controlling factors, as the results can be strongly sample-dependent. In this paper, we introduce some concepts of Monte Carlo simulations in rare event logistic regression. This technique, so-called rare event logistic regression with replications, combines the strength of probabilistic and statistical methods, and allows overcoming some of the limitations of previous developments through robust variable selection. This technique was here developed for the analyses of landslide controlling factors, but the concept is widely applicable for statistical analyses of natural hazards.

  18. An Entropy-Based Measure for Assessing Fuzziness in Logistic Regression.

    PubMed

    Weiss, Brandi A; Dardick, William

    2016-12-01

    This article introduces an entropy-based measure of data-model fit that can be used to assess the quality of logistic regression models. Entropy has previously been used in mixture-modeling to quantify how well individuals are classified into latent classes. The current study proposes the use of entropy for logistic regression models to quantify the quality of classification and separation of group membership. Entropy complements preexisting measures of data-model fit and provides unique information not contained in other measures. Hypothetical data scenarios, an applied example, and Monte Carlo simulation results are used to demonstrate the application of entropy in logistic regression. Entropy should be used in conjunction with other measures of data-model fit to assess how well logistic regression models classify cases into observed categories.

  19. Identification of immune correlates of protection in Shigella infection by application of machine learning.

    PubMed

    Arevalillo, Jorge M; Sztein, Marcelo B; Kotloff, Karen L; Levine, Myron M; Simon, Jakub K

    2017-10-01

    Immunologic correlates of protection are important in vaccine development because they give insight into mechanisms of protection, assist in the identification of promising vaccine candidates, and serve as endpoints in bridging clinical vaccine studies. Our goal is the development of a methodology to identify immunologic correlates of protection using the Shigella challenge as a model. The proposed methodology utilizes the Random Forests (RF) machine learning algorithm as well as Classification and Regression Trees (CART) to detect immune markers that predict protection, identify interactions between variables, and define optimal cutoffs. Logistic regression modeling is applied to estimate the probability of protection and the confidence interval (CI) for such a probability is computed by bootstrapping the logistic regression models. The results demonstrate that the combination of Classification and Regression Trees and Random Forests complements the standard logistic regression and uncovers subtle immune interactions. Specific levels of immunoglobulin IgG antibody in blood on the day of challenge predicted protection in 75% (95% CI 67-86). Of those subjects that did not have blood IgG at or above a defined threshold, 100% were protected if they had IgA antibody secreting cells above a defined threshold. Comparison with the results obtained by applying only logistic regression modeling with standard Akaike Information Criterion for model selection shows the usefulness of the proposed method. Given the complexity of the immune system, the use of machine learning methods may enhance traditional statistical approaches. When applied together, they offer a novel way to quantify important immune correlates of protection that may help the development of vaccines. Copyright © 2017 Elsevier Inc. All rights reserved.

  20. Association between blood lead and blood pressure: Results from the Canadian Health Measures Survey (2007 to 2011).

    PubMed

    Bushnik, Tracey; Levallois, Patrick; D'Amour, Monique; Anderson, Todd J; McAlister, Finlay A

    2014-07-01

    Hypertension is the leading risk factor for cardiovascular disease, but its cause is not always known. Interest is increasing in the potential role of environmental chemicals, including lead. Data are from the first two cycles of the Canadian Health Measures Survey. Lead in whole blood (PbB), and systolic (SBP) and diastolic (DBP) blood pressure were measured and hypertension status was derived for 4,550 respondents aged 40 to 79. Linear regression estimated associations between PbB and SBP and DBP. Logistic regression estimated associations between PbB and hypertension. Adjusted least squares geometric means of PbB were estimated for hypertensive versus non-hypertensive individuals. Compared with non-hypertensive individuals, those with hypertension had higher average PbB levels, were older, more likely to be male, and more likely to have other hypertension risk factors (diabetes, family history of high blood pressure). In adjusted regression models, a modest association emerged between PbB levels and SBP among 40- to 54-year-olds, and between PbB levels and DBP for the overall population. No association emerged between PbB levels and hypertension prevalence. A modest association was observed between blood lead levels and blood pressure, but not with hypertension, in Canadian adults aged 40 to 79.

  1. Content Coding of Psychotherapy Transcripts Using Labeled Topic Models.

    PubMed

    Gaut, Garren; Steyvers, Mark; Imel, Zac E; Atkins, David C; Smyth, Padhraic

    2017-03-01

    Psychotherapy represents a broad class of medical interventions received by millions of patients each year. Unlike most medical treatments, its primary mechanisms are linguistic; i.e., the treatment relies directly on a conversation between a patient and provider. However, the evaluation of patient-provider conversation suffers from critical shortcomings, including intensive labor requirements, coder error, nonstandardized coding systems, and inability to scale up to larger data sets. To overcome these shortcomings, psychotherapy analysis needs a reliable and scalable method for summarizing the content of treatment encounters. We used a publicly available psychotherapy corpus from Alexander Street press comprising a large collection of transcripts of patient-provider conversations to compare coding performance for two machine learning methods. We used the labeled latent Dirichlet allocation (L-LDA) model to learn associations between text and codes, to predict codes in psychotherapy sessions, and to localize specific passages of within-session text representative of a session code. We compared the L-LDA model to a baseline lasso regression model using predictive accuracy and model generalizability (measured by calculating the area under the curve (AUC) from the receiver operating characteristic curve). The L-LDA model outperforms the lasso logistic regression model at predicting session-level codes with average AUC scores of 0.79, and 0.70, respectively. For fine-grained level coding, L-LDA and logistic regression are able to identify specific talk-turns representative of symptom codes. However, model performance for talk-turn identification is not yet as reliable as human coders. We conclude that the L-LDA model has the potential to be an objective, scalable method for accurate automated coding of psychotherapy sessions that perform better than comparable discriminative methods at session-level coding and can also predict fine-grained codes.

  2. Content Coding of Psychotherapy Transcripts Using Labeled Topic Models

    PubMed Central

    Gaut, Garren; Steyvers, Mark; Imel, Zac E; Atkins, David C; Smyth, Padhraic

    2016-01-01

    Psychotherapy represents a broad class of medical interventions received by millions of patients each year. Unlike most medical treatments, its primary mechanisms are linguistic; i.e., the treatment relies directly on a conversation between a patient and provider. However, the evaluation of patient-provider conversation suffers from critical shortcomings, including intensive labor requirements, coder error, non-standardized coding systems, and inability to scale up to larger data sets. To overcome these shortcomings, psychotherapy analysis needs a reliable and scalable method for summarizing the content of treatment encounters. We used a publicly-available psychotherapy corpus from Alexander Street press comprising a large collection of transcripts of patient-provider conversations to compare coding performance for two machine learning methods. We used the Labeled Latent Dirichlet Allocation (L-LDA) model to learn associations between text and codes, to predict codes in psychotherapy sessions, and to localize specific passages of within-session text representative of a session code. We compared the L-LDA model to a baseline lasso regression model using predictive accuracy and model generalizability (measured by calculating the area under the curve (AUC) from the receiver operating characteristic (ROC) curve). The L-LDA model outperforms the lasso logistic regression model at predicting session-level codes with average AUC scores of .79, and .70, respectively. For fine-grained level coding, L-LDA and logistic regression are able to identify specific talk-turns representative of symptom codes. However, model performance for talk-turn identification is not yet as reliable as human coders. We conclude that the L-LDA model has the potential to be an objective, scaleable method for accurate automated coding of psychotherapy sessions that performs better than comparable discriminative methods at session-level coding and can also predict fine-grained codes. PMID:26625437

  3. Investigation of pajama properties on skin under mild cold conditions: the interaction between skin and clothing.

    PubMed

    Yao, Lei; Gohel, Mayur D I; Li, Yi; Chung, Waiyee J

    2011-07-01

    Clothing is considered the second skin of the human body. The aim of this study was to determine clothing-wearer interaction on skin physiology under mild cold conditions. Skin physiological parameters, subjective sensory response, stress level, and physical properties of clothing fabric from two longitude parallel-designed wear trials were studied. The wear trials involved four kinds of pajamas made from cotton or polyester material that had hydrophilic or hydrophobic treatment, conducted for three weeks under mild cold conditions. Statistical tools, factor analysis, hierarchical linear regression, and logistic regression were applied to analyze the strong predictors of skin physiological parameters, stress level, and sensory response. A framework was established to illustrate clothing-wearer interactions with clothing fabric properties, skin physiology, stress level, and sensory response under mild cold conditions. Fabric has various effects on the human body under mild cold conditions. A fabric's properties influence skin physiology, sensation, and psychological response. © 2011 The International Society of Dermatology.

  4. Exploring students' patterns of reasoning

    NASA Astrophysics Data System (ADS)

    Matloob Haghanikar, Mojgan

    As part of a collaborative study of the science preparation of elementary school teachers, we investigated the quality of students' reasoning and explored the relationship between sophistication of reasoning and the degree to which the courses were considered inquiry oriented. To probe students' reasoning, we developed open-ended written content questions with the distinguishing feature of applying recently learned concepts in a new context. We devised a protocol for developing written content questions that provided a common structure for probing and classifying students' sophistication level of reasoning. In designing our protocol, we considered several distinct criteria, and classified students' responses based on their performance for each criterion. First, we classified concepts into three types: Descriptive, Hypothetical, and Theoretical and categorized the abstraction levels of the responses in terms of the types of concepts and the inter-relationship between the concepts. Second, we devised a rubric based on Bloom's revised taxonomy with seven traits (both knowledge types and cognitive processes) and a defined set of criteria to evaluate each trait. Along with analyzing students' reasoning, we visited universities and observed the courses in which the students were enrolled. We used the Reformed Teaching Observation Protocol (RTOP) to rank the courses with respect to characteristics that are valued for the inquiry courses. We conducted logistic regression for a sample of 18courses with about 900 students and reported the results for performing logistic regression to estimate the relationship between traits of reasoning and RTOP score. In addition, we analyzed conceptual structure of students' responses, based on conceptual classification schemes, and clustered students' responses into six categories. We derived regression model, to estimate the relationship between the sophistication of the categories of conceptual structure and RTOP scores. However, the outcome variable with six categories required a more complicated regression model, known as multinomial logistic regression, generalized from binary logistic regression. With the large amount of collected data, we found that the likelihood of the higher cognitive processes were in favor of classes with higher measures on inquiry. However, the usage of more abstract concepts with higher order conceptual structures was less prevalent in higher RTOP courses.

  5. [Use of multiple regression models in observational studies (1970-2013) and requirements of the STROBE guidelines in Spanish scientific journals].

    PubMed

    Real, J; Cleries, R; Forné, C; Roso-Llorach, A; Martínez-Sánchez, J M

    In medicine and biomedical research, statistical techniques like logistic, linear, Cox and Poisson regression are widely known. The main objective is to describe the evolution of multivariate techniques used in observational studies indexed in PubMed (1970-2013), and to check the requirements of the STROBE guidelines in the author guidelines in Spanish journals indexed in PubMed. A targeted PubMed search was performed to identify papers that used logistic linear Cox and Poisson models. Furthermore, a review was also made of the author guidelines of journals published in Spain and indexed in PubMed and Web of Science. Only 6.1% of the indexed manuscripts included a term related to multivariate analysis, increasing from 0.14% in 1980 to 12.3% in 2013. In 2013, 6.7, 2.5, 3.5, and 0.31% of the manuscripts contained terms related to logistic, linear, Cox and Poisson regression, respectively. On the other hand, 12.8% of journals author guidelines explicitly recommend to follow the STROBE guidelines, and 35.9% recommend the CONSORT guideline. A low percentage of Spanish scientific journals indexed in PubMed include the STROBE statement requirement in the author guidelines. Multivariate regression models in published observational studies such as logistic regression, linear, Cox and Poisson are increasingly used both at international level, as well as in journals published in Spanish. Copyright © 2015 Sociedad Española de Médicos de Atención Primaria (SEMERGEN). Publicado por Elsevier España, S.L.U. All rights reserved.

  6. Prediction of siRNA potency using sparse logistic regression.

    PubMed

    Hu, Wei; Hu, John

    2014-06-01

    RNA interference (RNAi) can modulate gene expression at post-transcriptional as well as transcriptional levels. Short interfering RNA (siRNA) serves as a trigger for the RNAi gene inhibition mechanism, and therefore is a crucial intermediate step in RNAi. There have been extensive studies to identify the sequence characteristics of potent siRNAs. One such study built a linear model using LASSO (Least Absolute Shrinkage and Selection Operator) to measure the contribution of each siRNA sequence feature. This model is simple and interpretable, but it requires a large number of nonzero weights. We have introduced a novel technique, sparse logistic regression, to build a linear model using single-position specific nucleotide compositions which has the same prediction accuracy of the linear model based on LASSO. The weights in our new model share the same general trend as those in the previous model, but have only 25 nonzero weights out of a total 84 weights, a 54% reduction compared to the previous model. Contrary to the linear model based on LASSO, our model suggests that only a few positions are influential on the efficacy of the siRNA, which are the 5' and 3' ends and the seed region of siRNA sequences. We also employed sparse logistic regression to build a linear model using dual-position specific nucleotide compositions, a task LASSO is not able to accomplish well due to its high dimensional nature. Our results demonstrate the superiority of sparse logistic regression as a technique for both feature selection and regression over LASSO in the context of siRNA design.

  7. Accounting for Slipping and Other False Negatives in Logistic Models of Student Learning

    ERIC Educational Resources Information Center

    MacLellan, Christopher J.; Liu, Ran; Koedinger, Kenneth R.

    2015-01-01

    Additive Factors Model (AFM) and Performance Factors Analysis (PFA) are two popular models of student learning that employ logistic regression to estimate parameters and predict performance. This is in contrast to Bayesian Knowledge Tracing (BKT) which uses a Hidden Markov Model formalism. While all three models tend to make similar predictions,…

  8. Filtering data from the collaborative initial glaucoma treatment study for improved identification of glaucoma progression.

    PubMed

    Schell, Greggory J; Lavieri, Mariel S; Stein, Joshua D; Musch, David C

    2013-12-21

    Open-angle glaucoma (OAG) is a prevalent, degenerate ocular disease which can lead to blindness without proper clinical management. The tests used to assess disease progression are susceptible to process and measurement noise. The aim of this study was to develop a methodology which accounts for the inherent noise in the data and improve significant disease progression identification. Longitudinal observations from the Collaborative Initial Glaucoma Treatment Study (CIGTS) were used to parameterize and validate a Kalman filter model and logistic regression function. The Kalman filter estimates the true value of biomarkers associated with OAG and forecasts future values of these variables. We develop two logistic regression models via generalized estimating equations (GEE) for calculating the probability of experiencing significant OAG progression: one model based on the raw measurements from CIGTS and another model based on the Kalman filter estimates of the CIGTS data. Receiver operating characteristic (ROC) curves and associated area under the ROC curve (AUC) estimates are calculated using cross-fold validation. The logistic regression model developed using Kalman filter estimates as data input achieves higher sensitivity and specificity than the model developed using raw measurements. The mean AUC for the Kalman filter-based model is 0.961 while the mean AUC for the raw measurements model is 0.889. Hence, using the probability function generated via Kalman filter estimates and GEE for logistic regression, we are able to more accurately classify patients and instances as experiencing significant OAG progression. A Kalman filter approach for estimating the true value of OAG biomarkers resulted in data input which improved the accuracy of a logistic regression classification model compared to a model using raw measurements as input. This methodology accounts for process and measurement noise to enable improved discrimination between progression and nonprogression in chronic diseases.

  9. Artificial neural network, genetic algorithm, and logistic regression applications for predicting renal colic in emergency settings.

    PubMed

    Eken, Cenker; Bilge, Ugur; Kartal, Mutlu; Eray, Oktay

    2009-06-03

    Logistic regression is the most common statistical model for processing multivariate data in the medical literature. Artificial intelligence models like an artificial neural network (ANN) and genetic algorithm (GA) may also be useful to interpret medical data. The purpose of this study was to perform artificial intelligence models on a medical data sheet and compare to logistic regression. ANN, GA, and logistic regression analysis were carried out on a data sheet of a previously published article regarding patients presenting to an emergency department with flank pain suspicious for renal colic. The study population was composed of 227 patients: 176 patients had a diagnosis of urinary stone, while 51 ultimately had no calculus. The GA found two decision rules in predicting urinary stones. Rule 1 consisted of being male, pain not spreading to back, and no fever. In rule 2, pelvicaliceal dilatation on bedside ultrasonography replaced no fever. ANN, GA rule 1, GA rule 2, and logistic regression had a sensitivity of 94.9, 67.6, 56.8, and 95.5%, a specificity of 78.4, 76.47, 86.3, and 47.1%, a positive likelihood ratio of 4.4, 2.9, 4.1, and 1.8, and a negative likelihood ratio of 0.06, 0.42, 0.5, and 0.09, respectively. The area under the curve was found to be 0.867, 0.720, 0.715, and 0.713 for all applications, respectively. Data mining techniques such as ANN and GA can be used for predicting renal colic in emergency settings and to constitute clinical decision rules. They may be an alternative to conventional multivariate analysis applications used in biostatistics.

  10. Beyond logistic regression: structural equations modelling for binary variables and its application to investigating unobserved confounders.

    PubMed

    Kupek, Emil

    2006-03-15

    Structural equation modelling (SEM) has been increasingly used in medical statistics for solving a system of related regression equations. However, a great obstacle for its wider use has been its difficulty in handling categorical variables within the framework of generalised linear models. A large data set with a known structure among two related outcomes and three independent variables was generated to investigate the use of Yule's transformation of odds ratio (OR) into Q-metric by (OR-1)/(OR+1) to approximate Pearson's correlation coefficients between binary variables whose covariance structure can be further analysed by SEM. Percent of correctly classified events and non-events was compared with the classification obtained by logistic regression. The performance of SEM based on Q-metric was also checked on a small (N = 100) random sample of the data generated and on a real data set. SEM successfully recovered the generated model structure. SEM of real data suggested a significant influence of a latent confounding variable which would have not been detectable by standard logistic regression. SEM classification performance was broadly similar to that of the logistic regression. The analysis of binary data can be greatly enhanced by Yule's transformation of odds ratios into estimated correlation matrix that can be further analysed by SEM. The interpretation of results is aided by expressing them as odds ratios which are the most frequently used measure of effect in medical statistics.

  11. A comparative analysis of fertility differentials in Ghana and Nigeria.

    PubMed

    Olatoregun, Oluwaseun; Fagbamigbe, Adeniyi Francis; Akinyemi, Odunayo Joshua; Yusuf, Oyindamola Bidemi; Bamgboye, Elijah Afolabi

    2014-09-01

    Nigeria and Ghana are the most densely populated countries in the West African sub-region with fertility levels above world average. Our study compared the two countries' fertility levels and their determinants as well as the differentials in the effect of these factors across the two countries. We carried out a retrospective analysis of data from the Nigeria and Ghana Demographic Health Surveys, 2008. The sample of 33,385 and 4,916 women aged 15-49 years obtained in Nigeria and Ghana respectively was stratified into low, medium and high fertility using reported children ever born. Data was summarized using appropriate descriptive statistics. Factors influencing fertility were identified using ordinal logistic regression at 5% significance level. While unemployment significantly lowers fertility in Nigeria, it wasn't significant in Ghana. In both countries, education, age at first marriage, marital status, urban-rural residence, wealth index and use of oral contraception were the main factors influencing high fertility levels.

  12. Parathyroid hormone response to two levels of vitamin D deficiency is associated with high risk of medical problems during hospitalization in patients with hip fracture.

    PubMed

    Alarcón, T; González-Montalvo, J I; Hoyos, R; Diez-Sebastián, J; Otero, A; Mauleon, J L

    2015-10-01

    Vitamin D and the parathyroid hormone (PTH) response play an important role in hip fracture patients. This study was carried out to determine the factors associated with the PTH response to different levels of vitamin D deficiency during hospitalization. This was a cross-sectional study of patients over 64 years of age admitted with an acute fragility hip fracture between March 1st 2009 and November 30th 2012. Demographic, clinical, functional, and cognitive function were evaluated at admission and during hospitalization. Levels of 25-hydroxyvitamin D (25-OHD) and PTH were analyzed. Two 25-OHD cut-off points were considered, <12 ng/ml and 12-20 ng/ml. Multivariate logistic regression analysis was used. Mean age of the 607 patients included was 84.7 years (SD 7.10), and 81.9 % were women. The mean 25-OHD level in the total sample was 13.2 (SD 11.1) ng/ml. Levels of 25-OHD <12 ng/ml were present in 347 patients (57.2 %), of whom 158 (45.5 %) had secondary hyperparathyroidism (SHPT) (PTH >65 pg/ml). 25-OHD levels of 12-20 ng/ml were present in 168 (27.7 %) patients, of whom 47 (28 %) had SHPT. Following logistic regression, SHPT was associated in both groups (25-OHD <12 and 12-20 ng/ml) with a greater number of medical problems during hospitalization. In the 25-OHD group <12 ng/ml, SHPT was also associated with poorer glomerular filtration rates. The PTH response to vitamin D deficiency in hip fracture patients may be a marker for patients with higher risk of developing multiple medical problems, both when considering severe (<12 ng/ml) and moderate (12-20 ng/ml) vitamin D deficiency.

  13. Power and Sample Size Calculations for Logistic Regression Tests for Differential Item Functioning

    ERIC Educational Resources Information Center

    Li, Zhushan

    2014-01-01

    Logistic regression is a popular method for detecting uniform and nonuniform differential item functioning (DIF) effects. Theoretical formulas for the power and sample size calculations are derived for likelihood ratio tests and Wald tests based on the asymptotic distribution of the maximum likelihood estimators for the logistic regression model.…

  14. A Methodology for Generating Placement Rules that Utilizes Logistic Regression

    ERIC Educational Resources Information Center

    Wurtz, Keith

    2008-01-01

    The purpose of this article is to provide the necessary tools for institutional researchers to conduct a logistic regression analysis and interpret the results. Aspects of the logistic regression procedure that are necessary to evaluate models are presented and discussed with an emphasis on cutoff values and choosing the appropriate number of…

  15. Comparison of standard maximum likelihood classification and polytomous logistic regression used in remote sensing

    Treesearch

    John Hogland; Nedret Billor; Nathaniel Anderson

    2013-01-01

    Discriminant analysis, referred to as maximum likelihood classification within popular remote sensing software packages, is a common supervised technique used by analysts. Polytomous logistic regression (PLR), also referred to as multinomial logistic regression, is an alternative classification approach that is less restrictive, more flexible, and easy to interpret. To...

  16. Association between family structure, maternal education level, and maternal employment with sedentary lifestyle in primary school-age children.

    PubMed

    Vázquez-Nava, Francisco; Treviño-Garcia-Manzo, Norberto; Vázquez-Rodríguez, Carlos F; Vázquez-Rodríguez, Eliza M

    2013-01-01

    To determine the association between family structure, maternal education level, and maternal employment with sedentary lifestyle in primary school-age children. Data were obtained from 897 children aged 6 to 12 years. A questionnaire was used to collect information. Body mass index (BMI) was determined using the age- and gender-specific Centers for Disease Control and Prevention definition. Children were categorized as: normal weight (5(th) percentile≤BMI<85(th) percentile), at risk for overweight (85(th)≤BMI<95(th) percentile), overweight (≥ 95(th) percentile). For the analysis, overweight was defined as BMI at or above the 85(th) percentile for each gender. Adjusted odds ratios (adjusted ORs) for physical inactivity were determined using a logistic regression model. The prevalence of overweight was 40.7%, and of sedentary lifestyle, 57.2%. The percentage of non-intact families was 23.5%. Approximately 48.7% of the mothers had a non-acceptable educational level, and 38.8% of the mothers worked outside of the home. The logistic regression model showed that living in a non-intact family household (adjusted OR=1.67; 95% CI=1.04-2.66) is associated with sedentary lifestyle in overweight children. In the group of normal weight children, logistic regression analysis show that living in a non-intact family, having a mother with a non-acceptable education level, and having a mother who works outside of the home were not associated with sedentary lifestyle. Living in a non-intact family, more than low maternal educational level and having a working mother, appears to be associated with sedentary lifestyle in overweight primary school-age children. Copyright © 2013 Sociedade Brasileira de Pediatria. Published by Elsevier Editora Ltda. All rights reserved.

  17. Lithium and neuroleptics in combination: is there enhancement of neurotoxicity leading to permanent sequelae?

    PubMed

    Goldman, S A

    1996-10-01

    Neurotoxicity in relation to concomitant administration of lithium and neuroleptic drugs, particularly haloperidol, has been an ongoing issue. This study examined whether use of lithium with neuroleptic drugs enhances neurotoxicity leading to permanent sequelae. The Spontaneous Reporting System database of the United States Food and Drug Administration and extant literature were reviewed for spectrum cases of lithium/neuroleptic neurotoxicity. Groups taking lithium alone (Li), lithium/haloperidol (LiHal) and lithium/ nonhaloperidol neuroleptics (LiNeuro), each paired for recovery and sequelae, were established for 237 cases. Statistical analyses included pairwise comparisons of lithium levels using the Wilcoxon Rank Sum procedure and logistic regression to analyze the relationship between independent variables and development of sequelae. The Li and Li-Neuro groups showed significant statistical differences in median lithium levels between recovery and sequelae pairs, whereas the LiHal pair did not differ significantly. Lithium level was associated with sequelae development overall and within the Li and LiNeuro groups; no such association was evident in the LiHal group. On multivariable logistic regression analysis, lithium level and taking lithium/haloperidol were significant factors in the development of sequelae, with multiple possibly confounding factors (e.g., age, sex) not statistically significant. Multivariable logistic regression analyses with neuroleptic dose as five discrete dose ranges or actual dose did not show an association between development of sequelae and dose. Database limitations notwithstanding, the lack of apparent impact of serum lithium level on the development of sequelae in patients treated with haloperidol contrasts notably with results in the Li and LiNeuro groups. These findings may suggest a possible effect of pharmacodynamic factors in lithium/neuroleptic combination therapy.

  18. Measuring Productivity of Depot-Level Aircraft Maintenance in the Air Force Logistics Command.

    DTIC Science & Technology

    1985-09-01

    of Figures...... . . . . . . . . . . . . vi List of Tables . . . . . . . . . ............ vii Abstract . . . ...................... viii I...59 6. DEA Efficiency Values (Third DEA Model) . .... 62 7. DMU 5 Input Efficiencies ................ 64 vi F "-’ List of Tables Table Page I. DEA...Regression Results for 20 Months . . . ..... 68 V. Regression Results for 7 Quarters . . ..... 70 VI . Coefficients of Correlation (Using Quarterly Data

  19. Preoperative Fasting C-Peptide Predicts Type 2 Diabetes Mellitus Remission in Low-BMI Chinese Patients After Roux-en-Y Gastric Bypass.

    PubMed

    Zhao, Lei; Li, Weizheng; Su, Zhihong; Liu, Yong; Zhu, Liyong; Zhu, Shaihong

    2018-05-29

    This study investigated the role of preoperative fasting C-peptide (FCP) levels in predicting diabetic outcomes in low-BMI Chinese patients following Roux-en-Y gastric bypass (RYGB) by comparing the metabolic outcomes of patients with FCP > 1 ng/ml versus FCP ≤ 1 ng/ml. The study sample included 78 type 2 diabetes mellitus patients with an average BMI < 30 kg/m 2 at baseline. Patients' parameters were analyzed before and after surgery, with a 2-year follow-up. A univariate logistic regression analysis and multivariate analysis of variance between the remission and improvement group were performed to determine factors that were associated with type 2 diabetes remission after RYGB. Linear correlation analyses between FCP and metabolic parameters were performed. Patients were divided into two groups: FCP > 1 ng/ml and FCP ≤ 1 ng/ml, with measured parameters compared between the groups. Patients' fasting plasma glucose, 2-h postprandial plasma glucose, FCP, and HbA1c improved significantly after surgery (p < 0.05). Factors associated with type 2 diabetes remission were BMI, 2hINS, and FCP at the univariate logistic regression analysis (p < 0.05). Multivariate logistic regression analysis was performed then showed the results were more related to FCP (OR = 2.39). FCP showed a significant linear correlation with fasting insulin and BMI (p < 0.05). There was a significant difference in remission rate between the FCP > 1 ng/ml and FCP ≤ 1 ng/ml groups (p = 0.01). The parameters of patients with FCP > 1 ng/ml, including BMI, plasma glucose, HbA1c, and plasma insulin, decreased markedly after surgery (p < 0.05). FCP level is a significant predictor of diabetes outcomes after RYGB in low-BMI Chinese patients. An FCP level of 1 ng/ml may be a useful threshold for predicting surgical prognosis, with FCP > 1 ng/ml predicting better clinical outcomes following RYGB.

  20. Design, innovation, and rural creative places: Are the arts the cherry on top, or the secret sauce?

    PubMed

    Wojan, Timothy R; Nichols, Bonnie

    2018-01-01

    Creative class theory explains the positive relationship between the arts and commercial innovation as the mutual attraction of artists and other creative workers by an unobserved creative milieu. This study explores alternative theories for rural settings, by analyzing establishment-level survey data combined with data on the local arts scene. The study identifies the local contextual factors associated with a strong design orientation, and estimates the impact that a strong design orientation has on the local economy. Data on innovation and design come from a nationally representative sample of establishments in tradable industries. Latent class analysis allows identifying unobserved subpopulations comprised of establishments with different design and innovation orientations. Logistic regression allows estimating the association between an establishment's design orientation and local contextual factors. A quantile instrumental variable regression allows assessing the robustness of the logistic regression results with respect to endogeneity. An estimate of design orientation at the local level derived from the survey is used to examine variation in economic performance during the period of recovery from the Great Recession (2010-2014). Three distinct innovation (substantive, nominal, and non-innovators) and design orientations (design-integrated, "design last finish," and no systematic approach to design) are identified. Innovation- and design-intensive establishments were identified in both rural and urban areas. Rural design-integrated establishments tended to locate in counties with more highly educated workforces and containing at least one performing arts organization. A quantile instrumental variable regression confirmed that the logistic regression result is robust to endogeneity concerns. Finally, rural areas characterized by design-integrated establishments experienced faster growth in wages relative to rural areas characterized by establishments using no systematic approach to design.

  1. Design, innovation, and rural creative places: Are the arts the cherry on top, or the secret sauce?

    PubMed Central

    Nichols, Bonnie

    2018-01-01

    Objective Creative class theory explains the positive relationship between the arts and commercial innovation as the mutual attraction of artists and other creative workers by an unobserved creative milieu. This study explores alternative theories for rural settings, by analyzing establishment-level survey data combined with data on the local arts scene. The study identifies the local contextual factors associated with a strong design orientation, and estimates the impact that a strong design orientation has on the local economy. Method Data on innovation and design come from a nationally representative sample of establishments in tradable industries. Latent class analysis allows identifying unobserved subpopulations comprised of establishments with different design and innovation orientations. Logistic regression allows estimating the association between an establishment’s design orientation and local contextual factors. A quantile instrumental variable regression allows assessing the robustness of the logistic regression results with respect to endogeneity. An estimate of design orientation at the local level derived from the survey is used to examine variation in economic performance during the period of recovery from the Great Recession (2010–2014). Results Three distinct innovation (substantive, nominal, and non-innovators) and design orientations (design-integrated, “design last finish,” and no systematic approach to design) are identified. Innovation- and design-intensive establishments were identified in both rural and urban areas. Rural design-integrated establishments tended to locate in counties with more highly educated workforces and containing at least one performing arts organization. A quantile instrumental variable regression confirmed that the logistic regression result is robust to endogeneity concerns. Finally, rural areas characterized by design-integrated establishments experienced faster growth in wages relative to rural areas characterized by establishments using no systematic approach to design. PMID:29489884

  2. New machine-learning algorithms for prediction of Parkinson's disease

    NASA Astrophysics Data System (ADS)

    Mandal, Indrajit; Sairam, N.

    2014-03-01

    This article presents an enhanced prediction accuracy of diagnosis of Parkinson's disease (PD) to prevent the delay and misdiagnosis of patients using the proposed robust inference system. New machine-learning methods are proposed and performance comparisons are based on specificity, sensitivity, accuracy and other measurable parameters. The robust methods of treating Parkinson's disease (PD) includes sparse multinomial logistic regression, rotation forest ensemble with support vector machines and principal components analysis, artificial neural networks, boosting methods. A new ensemble method comprising of the Bayesian network optimised by Tabu search algorithm as classifier and Haar wavelets as projection filter is used for relevant feature selection and ranking. The highest accuracy obtained by linear logistic regression and sparse multinomial logistic regression is 100% and sensitivity, specificity of 0.983 and 0.996, respectively. All the experiments are conducted over 95% and 99% confidence levels and establish the results with corrected t-tests. This work shows a high degree of advancement in software reliability and quality of the computer-aided diagnosis system and experimentally shows best results with supportive statistical inference.

  3. Landslide Hazard Mapping in Rwanda Using Logistic Regression

    NASA Astrophysics Data System (ADS)

    Piller, A.; Anderson, E.; Ballard, H.

    2015-12-01

    Landslides in the United States cause more than $1 billion in damages and 50 deaths per year (USGS 2014). Globally, figures are much more grave, yet monitoring, mapping and forecasting of these hazards are less than adequate. Seventy-five percent of the population of Rwanda earns a living from farming, mostly subsistence. Loss of farmland, housing, or life, to landslides is a very real hazard. Landslides in Rwanda have an impact at the economic, social, and environmental level. In a developing nation that faces challenges in tracking, cataloging, and predicting the numerous landslides that occur each year, satellite imagery and spatial analysis allow for remote study. We have focused on the development of a landslide inventory and a statistical methodology for assessing landslide hazards. Using logistic regression on approximately 30 test variables (i.e. slope, soil type, land cover, etc.) and a sample of over 200 landslides, we determine which variables are statistically most relevant to landslide occurrence in Rwanda. A preliminary predictive hazard map for Rwanda has been produced, using the variables selected from the logistic regression analysis.

  4. Evaluating penalized logistic regression models to predict Heat-Related Electric grid stress days

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bramer, Lisa M.; Rounds, J.; Burleyson, C. D.

    Understanding the conditions associated with stress on the electricity grid is important in the development of contingency plans for maintaining reliability during periods when the grid is stressed. In this paper, heat-related grid stress and the relationship with weather conditions were examined using data from the eastern United States. Penalized logistic regression models were developed and applied to predict stress on the electric grid using weather data. The inclusion of other weather variables, such as precipitation, in addition to temperature improved model performance. Several candidate models and combinations of predictive variables were examined. A penalized logistic regression model which wasmore » fit at the operation-zone level was found to provide predictive value and interpretability. Additionally, the importance of different weather variables observed at various time scales were examined. Maximum temperature and precipitation were identified as important across all zones while the importance of other weather variables was zone specific. In conclusion, the methods presented in this work are extensible to other regions and can be used to aid in planning and development of the electrical grid.« less

  5. Evaluating penalized logistic regression models to predict Heat-Related Electric grid stress days

    DOE PAGES

    Bramer, Lisa M.; Rounds, J.; Burleyson, C. D.; ...

    2017-09-22

    Understanding the conditions associated with stress on the electricity grid is important in the development of contingency plans for maintaining reliability during periods when the grid is stressed. In this paper, heat-related grid stress and the relationship with weather conditions were examined using data from the eastern United States. Penalized logistic regression models were developed and applied to predict stress on the electric grid using weather data. The inclusion of other weather variables, such as precipitation, in addition to temperature improved model performance. Several candidate models and combinations of predictive variables were examined. A penalized logistic regression model which wasmore » fit at the operation-zone level was found to provide predictive value and interpretability. Additionally, the importance of different weather variables observed at various time scales were examined. Maximum temperature and precipitation were identified as important across all zones while the importance of other weather variables was zone specific. In conclusion, the methods presented in this work are extensible to other regions and can be used to aid in planning and development of the electrical grid.« less

  6. The comparison of landslide ratio-based and general logistic regression landslide susceptibility models in the Chishan watershed after 2009 Typhoon Morakot

    NASA Astrophysics Data System (ADS)

    WU, Chunhung

    2015-04-01

    The research built the original logistic regression landslide susceptibility model (abbreviated as or-LRLSM) and landslide ratio-based ogistic regression landslide susceptibility model (abbreviated as lr-LRLSM), compared the performance and explained the error source of two models. The research assumes that the performance of the logistic regression model can be better if the distribution of landslide ratio and weighted value of each variable is similar. Landslide ratio is the ratio of landslide area to total area in the specific area and an useful index to evaluate the seriousness of landslide disaster in Taiwan. The research adopted the landside inventory induced by 2009 Typhoon Morakot in the Chishan watershed, which was the most serious disaster event in the last decade, in Taiwan. The research adopted the 20 m grid as the basic unit in building the LRLSM, and six variables, including elevation, slope, aspect, geological formation, accumulated rainfall, and bank erosion, were included in the two models. The six variables were divided as continuous variables, including elevation, slope, and accumulated rainfall, and categorical variables, including aspect, geological formation and bank erosion in building the or-LRLSM, while all variables, which were classified based on landslide ratio, were categorical variables in building the lr-LRLSM. Because the count of whole basic unit in the Chishan watershed was too much to calculate by using commercial software, the research took random sampling instead of the whole basic units. The research adopted equal proportions of landslide unit and not landslide unit in logistic regression analysis. The research took 10 times random sampling and selected the group with the best Cox & Snell R2 value and Nagelkerker R2 value as the database for the following analysis. Based on the best result from 10 random sampling groups, the or-LRLSM (lr-LRLSM) is significant at the 1% level with Cox & Snell R2 = 0.190 (0.196) and Nagelkerke R2 = 0.253 (0.260). The unit with the landslide susceptibility value > 0.5 (≦ 0.5) will be classified as a predicted landslide unit (not landslide unit). The AUC, i.e. the area under the relative operating characteristic curve, of or-LRLSM in the Chishan watershed is 0.72, while that of lr-LRLSM is 0.77. Furthermore, the average correct ratio of lr-LRLSM (73.3%) is better than that of or-LRLSM (68.3%). The research analyzed in detail the error sources from the two models. In continuous variables, using the landslide ratio-based classification in building the lr-LRLSM can let the distribution of weighted value more similar to distribution of landslide ratio in the range of continuous variable than that in building the or-LRLSM. In categorical variables, the meaning of using the landslide ratio-based classification in building the lr-LRLSM is to gather the parameters with approximate landslide ratio together. The mean correct ratio in continuous variables (categorical variables) by using the lr-LRLSM is better than that in or-LRLSM by 0.6 ~ 2.6% (1.7% ~ 6.0%). Building the landslide susceptibility model by using landslide ratio-based classification is practical and of better performance than that by using the original logistic regression.

  7. Widen NomoGram for multinomial logistic regression: an application to staging liver fibrosis in chronic hepatitis C patients.

    PubMed

    Ardoino, Ilaria; Lanzoni, Monica; Marano, Giuseppe; Boracchi, Patrizia; Sagrini, Elisabetta; Gianstefani, Alice; Piscaglia, Fabio; Biganzoli, Elia M

    2017-04-01

    The interpretation of regression models results can often benefit from the generation of nomograms, 'user friendly' graphical devices especially useful for assisting the decision-making processes. However, in the case of multinomial regression models, whenever categorical responses with more than two classes are involved, nomograms cannot be drawn in the conventional way. Such a difficulty in managing and interpreting the outcome could often result in a limitation of the use of multinomial regression in decision-making support. In the present paper, we illustrate the derivation of a non-conventional nomogram for multinomial regression models, intended to overcome this issue. Although it may appear less straightforward at first sight, the proposed methodology allows an easy interpretation of the results of multinomial regression models and makes them more accessible for clinicians and general practitioners too. Development of prediction model based on multinomial logistic regression and of the pertinent graphical tool is illustrated by means of an example involving the prediction of the extent of liver fibrosis in hepatitis C patients by routinely available markers.

  8. Genetic and clinical risk factors of new-onset diabetes after transplantation in Hispanic kidney transplant recipients.

    PubMed

    Yang, Jaewook; Hutchinson, Ian I; Shah, Tariq; Min, David I

    2011-05-27

    New-onset diabetes after transplantation (NODAT) is one of the major complications after transplantation and is associated with reduced overall patient and graft survival. The objective of this study was to determine the genetic and clinical risk factors for NODAT in Hispanic kidney transplant recipients. Hispanic kidney allograft recipients without evidence of preexisting diabetes who developed NODAT (n=133) were studied using Hispanic kidney transplant recipients with no evidence of diabetes as a control group (n=170). NODAT was defined as fasting glucose levels ≥126 mg/dL on two or more occasions or patients taking any insulin or oral hypoglycemic agents 1 month or later after kidney transplantation. Fourteen alleles in nine genes were genotyped and other patients' clinical data with genotype data were analyzed by logistic regression. Among 14 alleles, hepatocyte nuclear factor 4 alpha (HNF4A) AA (rs2144908, odds ratio [OR]=1.96, confidence interval [CI]=1.08-3.50, P=0.010), HNF4A TT (rs1884614, OR=2.44, CI=1.42-4.48, P=0.002), and insulin receptor substrate 1 AA+AG (rs1801278, OR=2.71, CI=1.16-6.89, P=0.021) remained significant after logistic regression. Among the clinical factors, average age (OR=1.01, CI=1.00-1.08, P=0.048), sirolimus (OR=5.36, CI=3.02-10.4, P=0.001), deceased donor (OR=1.96, CI=1.16-2.94, P=0.015), and acute rejection (OR=2.92, CI=1.31-5.77, P=0.009) remained significant after logistic regression. This study indicates that polymorphism of two alleles of HNF-4A gene (rs2144908 and rs1884614) and insulin receptor substrate 1 (rs1801278) are significantly associated with NODAT in kidney transplant patients with Hispanic ethnicity. In the case of clinical factors, older age (>50 year), deceased donor type, acute rejection, and sirolimus use are associated with NODAT in Hispanic kidney transplant recipients.

  9. Evaluation of Cox's model and logistic regression for matched case-control data with time-dependent covariates: a simulation study.

    PubMed

    Leffondré, Karen; Abrahamowicz, Michal; Siemiatycki, Jack

    2003-12-30

    Case-control studies are typically analysed using the conventional logistic model, which does not directly account for changes in the covariate values over time. Yet, many exposures may vary over time. The most natural alternative to handle such exposures would be to use the Cox model with time-dependent covariates. However, its application to case-control data opens the question of how to manipulate the risk sets. Through a simulation study, we investigate how the accuracy of the estimates of Cox's model depends on the operational definition of risk sets and/or on some aspects of the time-varying exposure. We also assess the estimates obtained from conventional logistic regression. The lifetime experience of a hypothetical population is first generated, and a matched case-control study is then simulated from this population. We control the frequency, the age at initiation, and the total duration of exposure, as well as the strengths of their effects. All models considered include a fixed-in-time covariate and one or two time-dependent covariate(s): the indicator of current exposure and/or the exposure duration. Simulation results show that none of the models always performs well. The discrepancies between the odds ratios yielded by logistic regression and the 'true' hazard ratio depend on both the type of the covariate and the strength of its effect. In addition, it seems that logistic regression has difficulty separating the effects of inter-correlated time-dependent covariates. By contrast, each of the two versions of Cox's model systematically induces either a serious under-estimation or a moderate over-estimation bias. The magnitude of the latter bias is proportional to the true effect, suggesting that an improved manipulation of the risk sets may eliminate, or at least reduce, the bias. Copyright 2003 JohnWiley & Sons, Ltd.

  10. An Entropy-Based Measure for Assessing Fuzziness in Logistic Regression

    ERIC Educational Resources Information Center

    Weiss, Brandi A.; Dardick, William

    2016-01-01

    This article introduces an entropy-based measure of data-model fit that can be used to assess the quality of logistic regression models. Entropy has previously been used in mixture-modeling to quantify how well individuals are classified into latent classes. The current study proposes the use of entropy for logistic regression models to quantify…

  11. On the Usefulness of a Multilevel Logistic Regression Approach to Person-Fit Analysis

    ERIC Educational Resources Information Center

    Conijn, Judith M.; Emons, Wilco H. M.; van Assen, Marcel A. L. M.; Sijtsma, Klaas

    2011-01-01

    The logistic person response function (PRF) models the probability of a correct response as a function of the item locations. Reise (2000) proposed to use the slope parameter of the logistic PRF as a person-fit measure. He reformulated the logistic PRF model as a multilevel logistic regression model and estimated the PRF parameters from this…

  12. Mortality risk prediction in burn injury: Comparison of logistic regression with machine learning approaches.

    PubMed

    Stylianou, Neophytos; Akbarov, Artur; Kontopantelis, Evangelos; Buchan, Iain; Dunn, Ken W

    2015-08-01

    Predicting mortality from burn injury has traditionally employed logistic regression models. Alternative machine learning methods have been introduced in some areas of clinical prediction as the necessary software and computational facilities have become accessible. Here we compare logistic regression and machine learning predictions of mortality from burn. An established logistic mortality model was compared to machine learning methods (artificial neural network, support vector machine, random forests and naïve Bayes) using a population-based (England & Wales) case-cohort registry. Predictive evaluation used: area under the receiver operating characteristic curve; sensitivity; specificity; positive predictive value and Youden's index. All methods had comparable discriminatory abilities, similar sensitivities, specificities and positive predictive values. Although some machine learning methods performed marginally better than logistic regression the differences were seldom statistically significant and clinically insubstantial. Random forests were marginally better for high positive predictive value and reasonable sensitivity. Neural networks yielded slightly better prediction overall. Logistic regression gives an optimal mix of performance and interpretability. The established logistic regression model of burn mortality performs well against more complex alternatives. Clinical prediction with a small set of strong, stable, independent predictors is unlikely to gain much from machine learning outside specialist research contexts. Copyright © 2015 Elsevier Ltd and ISBI. All rights reserved.

  13. Bias in logistic regression due to imperfect diagnostic test results and practical correction approaches.

    PubMed

    Valle, Denis; Lima, Joanna M Tucker; Millar, Justin; Amratia, Punam; Haque, Ubydul

    2015-11-04

    Logistic regression is a statistical model widely used in cross-sectional and cohort studies to identify and quantify the effects of potential disease risk factors. However, the impact of imperfect tests on adjusted odds ratios (and thus on the identification of risk factors) is under-appreciated. The purpose of this article is to draw attention to the problem associated with modelling imperfect diagnostic tests, and propose simple Bayesian models to adequately address this issue. A systematic literature review was conducted to determine the proportion of malaria studies that appropriately accounted for false-negatives/false-positives in a logistic regression setting. Inference from the standard logistic regression was also compared with that from three proposed Bayesian models using simulations and malaria data from the western Brazilian Amazon. A systematic literature review suggests that malaria epidemiologists are largely unaware of the problem of using logistic regression to model imperfect diagnostic test results. Simulation results reveal that statistical inference can be substantially improved when using the proposed Bayesian models versus the standard logistic regression. Finally, analysis of original malaria data with one of the proposed Bayesian models reveals that microscopy sensitivity is strongly influenced by how long people have lived in the study region, and an important risk factor (i.e., participation in forest extractivism) is identified that would have been missed by standard logistic regression. Given the numerous diagnostic methods employed by malaria researchers and the ubiquitous use of logistic regression to model the results of these diagnostic tests, this paper provides critical guidelines to improve data analysis practice in the presence of misclassification error. Easy-to-use code that can be readily adapted to WinBUGS is provided, enabling straightforward implementation of the proposed Bayesian models.

  14. Modeling brook trout presence and absence from landscape variables using four different analytical methods

    USGS Publications Warehouse

    Steen, Paul J.; Passino-Reader, Dora R.; Wiley, Michael J.

    2006-01-01

    As a part of the Great Lakes Regional Aquatic Gap Analysis Project, we evaluated methodologies for modeling associations between fish species and habitat characteristics at a landscape scale. To do this, we created brook trout Salvelinus fontinalis presence and absence models based on four different techniques: multiple linear regression, logistic regression, neural networks, and classification trees. The models were tested in two ways: by application to an independent validation database and cross-validation using the training data, and by visual comparison of statewide distribution maps with historically recorded occurrences from the Michigan Fish Atlas. Although differences in the accuracy of our models were slight, the logistic regression model predicted with the least error, followed by multiple regression, then classification trees, then the neural networks. These models will provide natural resource managers a way to identify habitats requiring protection for the conservation of fish species.

  15. Physical Activity Level of Korean Adults with Chronic Diseases: The Korean National Health and Nutritional Examination Survey, 2010-2012.

    PubMed

    Jin, Ho-Seong; An, Ah-Reum; Choi, Ho-Chun; Lee, Sang-Hyun; Shin, Dong-Heon; Oh, Seung-Min; Seo, Young-Gyun; Cho, Be-Long

    2015-11-01

    Proper physical activities are known to be helpful in the prevention and management of chronic diseases. However, the physical activity level of patients with chronic diseases is low. Therefore, this study aimed to investigate the physical activity compliance of patients with hypertension, diabetes, and dyslipidemia in Korea. This study analyzed the 2010-2012 Fifth Korean National Health and Nutrition Examination Survey data. We included 13,873 individuals in the analysis. The level of physical activity compliance was measured by performing multivariate logistic regression analyses. In the univariate analysis, the subjects with hypertension or diabetes tended to comply with the physical activity guidelines less faithfully than their healthy counterparts. The proportion of subjects with hypertension who were insufficiently physically active was 65.4% among the men and 75.8% among the women. For diabetes, the proportions were 66.7% and 76.8%, respectively. No significant difference was found between the subjects with dyslipidemia and their healthy counterparts. In the multivariate logistic regression analysis, no significant difference in physical activity compliance was observed between the subjects with hypertension, diabetes, or dyslipidemia and their healthy counterparts for both sexes. The patients with hypertension or diabetes tended to have lower physical activity prevlaence than their healthy counterparts. However, for dyslipidemia, no significant difference was found between the two groups. Given the significance of physical activities in the management of chronic diseases, the physical activities of these patients need to be improved.

  16. Pregnancy outcome of patients following bariatric surgery as compared with obese women: a population-based study.

    PubMed

    Shai, Daniel; Shoham-Vardi, Ilana; Amsalem, Doron; Silverberg, Daniel; Levi, Isaac; Sheiner, Eyal

    2014-02-01

    To evaluate pregnancy outcome and rates of anemia in patients following bariatric operation in comparison with obese pregnant women. A retrospective population-based study comparing pregnancy outcome of patients following bariatric with the obese population was conducted. Multivariate logistic regression models were constructed to control for confounders. To evaluate the change in hemoglobin levels, we included women who had one pregnancy before the bariatric surgery and one following the surgery or two pregnancies for women with obesity. This study included 326 women who had one pregnancy before and after a bariatric surgery and 1612 obese women who had at least two consecutive deliveries. Using a multivariable logistic regression model, controlling for confounders such as maternal age, patients following bariatric surgery had lower rates of gestational diabetes mellitus (OR 0.7; 95% CI 0.5-0.9; p = 0.49) and macrosomia (OR 0.3; 95% CI 0.2-0.5; p < 0.001) as compared with obese parturients. Women post bariatric surgery were more likely to be anemic (hemoglobin <10 g/dL) as compared to obese parturients (48% versus 37%; OR, 1.5; 95% CI, 1.2-1.9; p < 0.001). A significant decline in hemoglobin level was noted in patients following bariatric surgery (a decline of 0.33 g/dL versus 0.18 g/dL between two consecutive pregnancies of obese women). Using another multivariable model with anemia as the outcome variable, bariatric was noted as a risk factor for anemia (adjusted OR = 1.45, 95%CI 1.13-1.86, p = 0.004). Women following bariatric surgery have lower risk for gestational diabetes mellitus and fetal macrosomia as compared with obese parturients. Nevertheless, bariatric surgery is a risk factor for anemia.

  17. Logistic regression for risk factor modelling in stuttering research.

    PubMed

    Reed, Phil; Wu, Yaqionq

    2013-06-01

    To outline the uses of logistic regression and other statistical methods for risk factor analysis in the context of research on stuttering. The principles underlying the application of a logistic regression are illustrated, and the types of questions to which such a technique has been applied in the stuttering field are outlined. The assumptions and limitations of the technique are discussed with respect to existing stuttering research, and with respect to formulating appropriate research strategies to accommodate these considerations. Finally, some alternatives to the approach are briefly discussed. The way the statistical procedures are employed are demonstrated with some hypothetical data. Research into several practical issues concerning stuttering could benefit if risk factor modelling were used. Important examples are early diagnosis, prognosis (whether a child will recover or persist) and assessment of treatment outcome. After reading this article you will: (a) Summarize the situations in which logistic regression can be applied to a range of issues about stuttering; (b) Follow the steps in performing a logistic regression analysis; (c) Describe the assumptions of the logistic regression technique and the precautions that need to be checked when it is employed; (d) Be able to summarize its advantages over other techniques like estimation of group differences and simple regression. Copyright © 2012 Elsevier Inc. All rights reserved.

  18. A Predictive Model for Readmissions Among Medicare Patients in a California Hospital.

    PubMed

    Duncan, Ian; Huynh, Nhan

    2017-11-17

    Predictive models for hospital readmission rates are in high demand because of the Centers for Medicare & Medicaid Services (CMS) Hospital Readmission Reduction Program (HRRP). The LACE index is one of the most popular predictive tools among hospitals in the United States. The LACE index is a simple tool with 4 parameters: Length of stay, Acuity of admission, Comorbidity, and Emergency visits in the previous 6 months. The authors applied logistic regression to develop a predictive model for a medium-sized not-for-profit community hospital in California using patient-level data with more specific patient information (including 13 explanatory variables). Specifically, the logistic regression is applied to 2 populations: a general population including all patients and the specific group of patients targeted by the CMS penalty (characterized as ages 65 or older with select conditions). The 2 resulting logistic regression models have a higher sensitivity rate compared to the sensitivity of the LACE index. The C statistic values of the model applied to both populations demonstrate moderate levels of predictive power. The authors also build an economic model to demonstrate the potential financial impact of the use of the model for targeting high-risk patients in a sample hospital and demonstrate that, on balance, whether the hospital gains or loses from reducing readmissions depends on its margin and the extent of its readmission penalties.

  19. Impact of low vision on employment.

    PubMed

    Mojon-Azzi, Stefania M; Sousa-Poza, Alfonso; Mojon, Daniel S

    2010-01-01

    We investigated the influence of self-reported corrected eyesight on several variables describing the perception by employees and self-employed persons of their employment. Our study was based on data from the Survey of Health, Ageing and Retirement in Europe (SHARE). SHARE is a multidisciplinary, cross-national database of microdata on health, socioeconomic status, social and family networks, collected on 31,115 individuals in 11 European countries and in Israel. With the help of ordered logistic regressions and binary logistic regressions, we analyzed the influence of perceived visual impairment--corrected by 19 covariates capturing socioeconomic and health-related factors--on 10 variables describing the respondents' employment situation. Based on data covering 10,340 working individuals, the results of the logistic and ordered regressions indicate that respondents with lower levels of self-reported general eyesight were significantly less satisfied with their jobs, felt they had less freedom to decide, less opportunity to develop new skills, less support in difficult situations, less recognition for their work, and an inadequate salary. Respondents with a lower eyesight level more frequently reported that they feared their health might limit their ability to work before regular retirement age and more often indicated that they were seeking early retirement. Analysis of this dataset from 12 countries demonstrates the strong impact of self-reported visual impairment on individual employment, and therefore on job satisfaction, productivity, and well-being. Copyright © 2010 S. Karger AG, Basel.

  20. Spatiotemporal variability of urban growth factors: A global and local perspective on the megacity of Mumbai

    NASA Astrophysics Data System (ADS)

    Shafizadeh-Moghadam, Hossein; Helbich, Marco

    2015-03-01

    The rapid growth of megacities requires special attention among urban planners worldwide, and particularly in Mumbai, India, where growth is very pronounced. To cope with the planning challenges this will bring, developing a retrospective understanding of urban land-use dynamics and the underlying driving-forces behind urban growth is a key prerequisite. This research uses regression-based land-use change models - and in particular non-spatial logistic regression models (LR) and auto-logistic regression models (ALR) - for the Mumbai region over the period 1973-2010, in order to determine the drivers behind spatiotemporal urban expansion. Both global models are complemented by a local, spatial model, the so-called geographically weighted logistic regression (GWLR) model, one that explicitly permits variations in driving-forces across space. The study comes to two main conclusions. First, both global models suggest similar driving-forces behind urban growth over time, revealing that LRs and ALRs result in estimated coefficients with comparable magnitudes. Second, all the local coefficients show distinctive temporal and spatial variations. It is therefore concluded that GWLR aids our understanding of urban growth processes, and so can assist context-related planning and policymaking activities when seeking to secure a sustainable urban future.

  1. Classification and regression tree analysis of acute-on-chronic hepatitis B liver failure: Seeing the forest for the trees.

    PubMed

    Shi, K-Q; Zhou, Y-Y; Yan, H-D; Li, H; Wu, F-L; Xie, Y-Y; Braddock, M; Lin, X-Y; Zheng, M-H

    2017-02-01

    At present, there is no ideal model for predicting the short-term outcome of patients with acute-on-chronic hepatitis B liver failure (ACHBLF). This study aimed to establish and validate a prognostic model by using the classification and regression tree (CART) analysis. A total of 1047 patients from two separate medical centres with suspected ACHBLF were screened in the study, which were recognized as derivation cohort and validation cohort, respectively. CART analysis was applied to predict the 3-month mortality of patients with ACHBLF. The accuracy of the CART model was tested using the area under the receiver operating characteristic curve, which was compared with the model for end-stage liver disease (MELD) score and a new logistic regression model. CART analysis identified four variables as prognostic factors of ACHBLF: total bilirubin, age, serum sodium and INR, and three distinct risk groups: low risk (4.2%), intermediate risk (30.2%-53.2%) and high risk (81.4%-96.9%). The new logistic regression model was constructed with four independent factors, including age, total bilirubin, serum sodium and prothrombin activity by multivariate logistic regression analysis. The performances of the CART model (0.896), similar to the logistic regression model (0.914, P=.382), exceeded that of MELD score (0.667, P<.001). The results were confirmed in the validation cohort. We have developed and validated a novel CART model superior to MELD for predicting three-month mortality of patients with ACHBLF. Thus, the CART model could facilitate medical decision-making and provide clinicians with a validated practical bedside tool for ACHBLF risk stratification. © 2016 John Wiley & Sons Ltd.

  2. The quest for conditional independence in prospectivity modeling: weights-of-evidence, boost weights-of-evidence, and logistic regression

    NASA Astrophysics Data System (ADS)

    Schaeben, Helmut; Semmler, Georg

    2016-09-01

    The objective of prospectivity modeling is prediction of the conditional probability of the presence T = 1 or absence T = 0 of a target T given favorable or prohibitive predictors B, or construction of a two classes 0,1 classification of T. A special case of logistic regression called weights-of-evidence (WofE) is geologists' favorite method of prospectivity modeling due to its apparent simplicity. However, the numerical simplicity is deceiving as it is implied by the severe mathematical modeling assumption of joint conditional independence of all predictors given the target. General weights of evidence are explicitly introduced which are as simple to estimate as conventional weights, i.e., by counting, but do not require conditional independence. Complementary to the regression view is the classification view on prospectivity modeling. Boosting is the construction of a strong classifier from a set of weak classifiers. From the regression point of view it is closely related to logistic regression. Boost weights-of-evidence (BoostWofE) was introduced into prospectivity modeling to counterbalance violations of the assumption of conditional independence even though relaxation of modeling assumptions with respect to weak classifiers was not the (initial) purpose of boosting. In the original publication of BoostWofE a fabricated dataset was used to "validate" this approach. Using the same fabricated dataset it is shown that BoostWofE cannot generally compensate lacking conditional independence whatever the consecutively processing order of predictors. Thus the alleged features of BoostWofE are disproved by way of counterexamples, while theoretical findings are confirmed that logistic regression including interaction terms can exactly compensate violations of joint conditional independence if the predictors are indicators.

  3. Predicting The Type Of Pregnancy Using Flexible Discriminate Analysis And Artificial Neural Networks: A Comparison Study

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hooman, A.; Mohammadzadeh, M

    Some medical and epidemiological surveys have been designed to predict a nominal response variable with several levels. With regard to the type of pregnancy there are four possible states: wanted, unwanted by wife, unwanted by husband and unwanted by couple. In this paper, we have predicted the type of pregnancy, as well as the factors influencing it using three different models and comparing them. Regarding the type of pregnancy with several levels, we developed a multinomial logistic regression, a neural network and a flexible discrimination based on the data and compared their results using tow statistical indices: Surface under curvemore » (ROC) and kappa coefficient. Based on these tow indices, flexible discrimination proved to be a better fit for prediction on data in comparison to other methods. When the relations among variables are complex, one can use flexible discrimination instead of multinomial logistic regression and neural network to predict the nominal response variables with several levels in order to gain more accurate predictions.« less

  4. Cholesteryl Ester Transfer Protein Intimately Involved in Dyslipidemia-Related Susceptibility to Cognitive Deficits in Type 2 Diabetic Patients.

    PubMed

    Sun, Jie; Cai, Rongrong; Huang, Rong; Wang, Pin; Tian, Sai; Sun, Haixia; Xia, Wenqing; Wang, Shaohua

    2016-08-01

    Cholesteryl ester transfer protein (CETP) is involved in diabetic dyslipidemia. We aim to test the hypothesis that CETP might be of importance in mediating dyslipidemia-related susceptibility to cognitive deficits in diabetic patients. We recruited 190 type 2 diabetic patients and divided them into two groups according to the Montreal Cognitive Assessment (MoCA) score. The association between CETP and cognitive decline was analyzed with logistic regression and stratification. There were 110 diabetic patients with mild cognition impairment (MCI) and 80 healthy cognition subjects as controls. Dyslipidemia is more common among diabetic patients with MCI; they had a significant increase of serum CETP concentrations, which was negatively correlated with MoCA (r = -0.638; p < 0.001). Negative correlations were also found between the serum CETP concentration with the Auditory Verbal Learning Test (r = -0.266; p = 0.008), indicating memory deficit. Logistic regression analysis revealed that CETP concentration was an independent factor of diabetic MCI (p < 0.001). Further stratification study showed that high serum levels of CETP was an independent risk factor of MCI in diabetic patients with a low density lipoproteins level ≥2.59 mmol/L, or high density lipoproteins level ≤1.0 mmol/L for men and ≤1.3 mmol/L for women, or TG level ≥1.7 mmol/L, after adjusting for age, sex, education, and glucose control (all ps < 0.05). CETP was intimately involved in dyslipidemia-related susceptibility to cognitive decline, especially memory function in type 2 diabetic patients.

  5. Dynamic Dimensionality Selection for Bayesian Classifier Ensembles

    DTIC Science & Technology

    2015-03-19

    learning of weights in an otherwise generatively learned naive Bayes classifier. WANBIA-C is very cometitive to Logistic Regression but much more...classifier, Generative learning, Discriminative learning, Naïve Bayes, Feature selection, Logistic regression , higher order attribute independence 16...discriminative learning of weights in an otherwise generatively learned naive Bayes classifier. WANBIA-C is very cometitive to Logistic Regression but

  6. A review of logistic regression models used to predict post-fire tree mortality of western North American conifers

    Treesearch

    Travis Woolley; David C. Shaw; Lisa M. Ganio; Stephen Fitzgerald

    2012-01-01

    Logistic regression models used to predict tree mortality are critical to post-fire management, planning prescribed bums and understanding disturbance ecology. We review literature concerning post-fire mortality prediction using logistic regression models for coniferous tree species in the western USA. We include synthesis and review of: methods to develop, evaluate...

  7. Preserving Institutional Privacy in Distributed binary Logistic Regression.

    PubMed

    Wu, Yuan; Jiang, Xiaoqian; Ohno-Machado, Lucila

    2012-01-01

    Privacy is becoming a major concern when sharing biomedical data across institutions. Although methods for protecting privacy of individual patients have been proposed, it is not clear how to protect the institutional privacy, which is many times a critical concern of data custodians. Built upon our previous work, Grid Binary LOgistic REgression (GLORE)1, we developed an Institutional Privacy-preserving Distributed binary Logistic Regression model (IPDLR) that considers both individual and institutional privacy for building a logistic regression model in a distributed manner. We tested our method using both simulated and clinical data, showing how it is possible to protect the privacy of individuals and of institutions using a distributed strategy.

  8. Covariate Imbalance and Adjustment for Logistic Regression Analysis of Clinical Trial Data

    PubMed Central

    Ciolino, Jody D.; Martin, Reneé H.; Zhao, Wenle; Jauch, Edward C.; Hill, Michael D.; Palesch, Yuko Y.

    2014-01-01

    In logistic regression analysis for binary clinical trial data, adjusted treatment effect estimates are often not equivalent to unadjusted estimates in the presence of influential covariates. This paper uses simulation to quantify the benefit of covariate adjustment in logistic regression. However, International Conference on Harmonization guidelines suggest that covariate adjustment be pre-specified. Unplanned adjusted analyses should be considered secondary. Results suggest that that if adjustment is not possible or unplanned in a logistic setting, balance in continuous covariates can alleviate some (but never all) of the shortcomings of unadjusted analyses. The case of log binomial regression is also explored. PMID:24138438

  9. Differentially private distributed logistic regression using private and public data.

    PubMed

    Ji, Zhanglong; Jiang, Xiaoqian; Wang, Shuang; Xiong, Li; Ohno-Machado, Lucila

    2014-01-01

    Privacy protecting is an important issue in medical informatics and differential privacy is a state-of-the-art framework for data privacy research. Differential privacy offers provable privacy against attackers who have auxiliary information, and can be applied to data mining models (for example, logistic regression). However, differentially private methods sometimes introduce too much noise and make outputs less useful. Given available public data in medical research (e.g. from patients who sign open-consent agreements), we can design algorithms that use both public and private data sets to decrease the amount of noise that is introduced. In this paper, we modify the update step in Newton-Raphson method to propose a differentially private distributed logistic regression model based on both public and private data. We try our algorithm on three different data sets, and show its advantage over: (1) a logistic regression model based solely on public data, and (2) a differentially private distributed logistic regression model based on private data under various scenarios. Logistic regression models built with our new algorithm based on both private and public datasets demonstrate better utility than models that trained on private or public datasets alone without sacrificing the rigorous privacy guarantee.

  10. Prevalence and Determinants of Preterm Birth in Tehran, Iran: A Comparison between Logistic Regression and Decision Tree Methods.

    PubMed

    Amini, Payam; Maroufizadeh, Saman; Samani, Reza Omani; Hamidi, Omid; Sepidarkish, Mahdi

    2017-06-01

    Preterm birth (PTB) is a leading cause of neonatal death and the second biggest cause of death in children under five years of age. The objective of this study was to determine the prevalence of PTB and its associated factors using logistic regression and decision tree classification methods. This cross-sectional study was conducted on 4,415 pregnant women in Tehran, Iran, from July 6-21, 2015. Data were collected by a researcher-developed questionnaire through interviews with mothers and review of their medical records. To evaluate the accuracy of the logistic regression and decision tree methods, several indices such as sensitivity, specificity, and the area under the curve were used. The PTB rate was 5.5% in this study. The logistic regression outperformed the decision tree for the classification of PTB based on risk factors. Logistic regression showed that multiple pregnancies, mothers with preeclampsia, and those who conceived with assisted reproductive technology had an increased risk for PTB ( p < 0.05). Identifying and training mothers at risk as well as improving prenatal care may reduce the PTB rate. We also recommend that statisticians utilize the logistic regression model for the classification of risk groups for PTB.

  11. Complementary nonparametric analysis of covariance for logistic regression in a randomized clinical trial setting.

    PubMed

    Tangen, C M; Koch, G G

    1999-03-01

    In the randomized clinical trial setting, controlling for covariates is expected to produce variance reduction for the treatment parameter estimate and to adjust for random imbalances of covariates between the treatment groups. However, for the logistic regression model, variance reduction is not obviously obtained. This can lead to concerns about the assumptions of the logistic model. We introduce a complementary nonparametric method for covariate adjustment. It provides results that are usually compatible with expectations for analysis of covariance. The only assumptions required are based on randomization and sampling arguments. The resulting treatment parameter is a (unconditional) population average log-odds ratio that has been adjusted for random imbalance of covariates. Data from a randomized clinical trial are used to compare results from the traditional maximum likelihood logistic method with those from the nonparametric logistic method. We examine treatment parameter estimates, corresponding standard errors, and significance levels in models with and without covariate adjustment. In addition, we discuss differences between unconditional population average treatment parameters and conditional subpopulation average treatment parameters. Additional features of the nonparametric method, including stratified (multicenter) and multivariate (multivisit) analyses, are illustrated. Extensions of this methodology to the proportional odds model are also made.

  12. Asthma exacerbation and proximity of residence to major roads: a population-based matched case-control study among the pediatric Medicaid population in Detroit, Michigan

    PubMed Central

    2011-01-01

    Background The relationship between asthma and traffic-related pollutants has received considerable attention. The use of individual-level exposure measures, such as residence location or proximity to emission sources, may avoid ecological biases. Method This study focused on the pediatric Medicaid population in Detroit, MI, a high-risk population for asthma-related events. A population-based matched case-control analysis was used to investigate associations between acute asthma outcomes and proximity of residence to major roads, including freeways. Asthma cases were identified as all children who made at least one asthma claim, including inpatient and emergency department visits, during the three-year study period, 2004-06. Individually matched controls were randomly selected from the rest of the Medicaid population on the basis of non-respiratory related illness. We used conditional logistic regression with distance as both categorical and continuous variables, and examined non-linear relationships with distance using polynomial splines. The conditional logistic regression models were then extended by considering multiple asthma states (based on the frequency of acute asthma outcomes) using polychotomous conditional logistic regression. Results Asthma events were associated with proximity to primary roads with an odds ratio of 0.97 (95% CI: 0.94, 0.99) for a 1 km increase in distance using conditional logistic regression, implying that asthma events are less likely as the distance between the residence and a primary road increases. Similar relationships and effect sizes were found using polychotomous conditional logistic regression. Another plausible exposure metric, a reduced form response surface model that represents atmospheric dispersion of pollutants from roads, was not associated under that exposure model. Conclusions There is moderately strong evidence of elevated risk of asthma close to major roads based on the results obtained in this population-based matched case-control study. PMID:21513554

  13. What Makes Super‐Aged Nations Happier? Exploring Critical Factors of Happiness Among Middle‐Aged Men and Women in Japan

    PubMed Central

    Tamiya, Nanako; Kawachi, Nobuyuki; Miyairi, Maya

    2018-01-01

    This study aimed to examine multiple factors associated with happiness from the perspective of gender difference among a middle‐aged Japanese population. A total of 865 participants (male = 344, female = 521) aged 40–64 years were divided into two groups (high and low) by their self‐reported level of happiness. Logistic regression analysis by gender was carried out. In men, high levels of happiness were significantly correlated with living with spouse, occupation, enough sleep, leading a normal life, and regular checkups; while low levels of happiness were significantly correlated with smoking and having two or more diseases. In women, low levels of happiness were significantly correlated with caring for a family member. Our data suggested that the factors relevant to happiness levels might vary between men and women among middle‐aged people in Japan. To increase the nation's level of happiness, the Japanese government must implement extended social services and policymaking, to alleviate caregivers’ burdens, especially among Japanese women. PMID:29610701

  14. Correlates of consistent condom use among men who have sex with men recruited through the Internet in Huzhou city: a cross-sectional survey.

    PubMed

    Jin, Meihua; Yang, Zhongrong; Dong, Zhengquan; Han, Jiankang

    2013-12-01

    There is growing evidence that men who have sex with men (MSM) are currently a group at high risk of HIV infection in China. Our study aims to know the factors affecting consistent condom use among MSM recruited through the internet in Huzhou city. An anonymous cross-sectional study was conducted by recruiting 410 MSM living in Huzhou city via the Internet. The socio-demographic profiles (age, education level, employment status, etc.) and sexual risk behaviors of the respondents were investigated. Bivariate logistic regression analyses were performed to compare the differences between consistent condom users and inconsistent condom users. Variables with significant bivariate between groups' differences were used as candidate variables in a stepwise multivariate logistic regression model. All statistical analyses were performed using SPSS for Windows 17.0, and a p value < 0.05 was considered to be statistically significant. According to their condom use, sixty-eight respondents were classified into two groups. One is consistent condom users, and the other is inconsistent condom users. Multivariate logistic regression showed that respondents who had a comprehensive knowledge of HIV (OR = 4.08, 95% CI: 1.85-8.99), who had sex with male sex workers (OR = 15.30, 95% CI: 5.89-39.75) and who had not drunk alcohol before sex (OR = 3.10, 95% CI: 1.38-6.95) were more likely to be consistent condom users. Consistent condom use among MSM was associated with comprehensive knowledge of HIV and a lack of alcohol use before sexual contact. As a result, reducing alcohol consumption and enhancing education regarding the risks of HIV among sexually active MSM would be effective in preventing of HIV transmission.

  15. Blood oxygen level dependent magnetic resonance imaging for detecting pathological patterns in lupus nephritis patients: a preliminary study using a decision tree model.

    PubMed

    Shi, Huilan; Jia, Junya; Li, Dong; Wei, Li; Shang, Wenya; Zheng, Zhenfeng

    2018-02-09

    Precise renal histopathological diagnosis will guide therapy strategy in patients with lupus nephritis. Blood oxygen level dependent (BOLD) magnetic resonance imaging (MRI) has been applicable noninvasive technique in renal disease. This current study was performed to explore whether BOLD MRI could contribute to diagnose renal pathological pattern. Adult patients with lupus nephritis renal pathological diagnosis were recruited for this study. Renal biopsy tissues were assessed based on the lupus nephritis ISN/RPS 2003 classification. The Blood oxygen level dependent magnetic resonance imaging (BOLD-MRI) was used to obtain functional magnetic resonance parameter, R2* values. Several functions of R2* values were calculated and used to construct algorithmic models for renal pathological patterns. In addition, the algorithmic models were compared as to their diagnostic capability. Both Histopathology and BOLD MRI were used to examine a total of twelve patients. Renal pathological patterns included five classes III (including 3 as class III + V) and seven classes IV (including 4 as class IV + V). Three algorithmic models, including decision tree, line discriminant, and logistic regression, were constructed to distinguish the renal pathological pattern of class III and class IV. The sensitivity of the decision tree model was better than that of the line discriminant model (71.87% vs 59.48%, P < 0.001) and inferior to that of the Logistic regression model (71.87% vs 78.71%, P < 0.001). The specificity of decision tree model was equivalent to that of the line discriminant model (63.87% vs 63.73%, P = 0.939) and higher than that of the logistic regression model (63.87% vs 38.0%, P < 0.001). The Area under the ROC curve (AUROCC) of the decision tree model was greater than that of the line discriminant model (0.765 vs 0.629, P < 0.001) and logistic regression model (0.765 vs 0.662, P < 0.001). BOLD MRI is a useful non-invasive imaging technique for the evaluation of lupus nephritis. Decision tree models constructed using functions of R2* values may facilitate the prediction of renal pathological patterns.

  16. Prevalence and Trends in Domestic Violence in South Korea: Findings From National Surveys.

    PubMed

    Kim, Jae Yop; Oh, Sehun; Nam, Seok In

    2016-05-01

    To examine trends in the prevalence of domestic violence since 1997, 1 year prior to the introduction of legislative countermeasures and accompanying services in South Korea, and to analyze what socio-demographic characteristics of perpetrators contribute to spousal violence and whether there were any changes in risk factors over time. This study used two sets of nationally representative household samples: married or cohabiting couples of 1,540 from the 1999 national survey and 3,269 from the 2010 National Survey of Domestic Violence. Frequency analysis was used to measure the prevalence of intimate partner violence (IPV), and cross-tabulation, correlation, and logistic regression analyses were used to look for socio-demographic risk factors of spousal physical violence and patterns of change over time. The frequency analysis showed that the IPV prevalence dropped by approximately 50%, from 34.1% in 1999 to 16.5% in 2010, though it was still higher than many other countries. The cross-tabulation and logistic regression analyses suggested that men with low socio-demographic characteristics were generally more violent, though this tendency did not apply to women. Instead, younger women seemed to be more violent than older women. Last, different levels of household income were associated with different levels of IPV in 2010, but no linear trend was detected. In this study, IPV prevalence trends and risk factors of two different time periods were discussed to provide implications for tackling the IPV problem. Future countermeasures must build on understanding about men with low socio-demographic status and younger women, who were more violent in marital relationships. © The Author(s) 2015.

  17. Analysis of association of clinical aspects and IL1B tagSNPs with severe preeclampsia.

    PubMed

    Leme Galvão, Larissa Paes; Menezes, Filipe Emanuel; Mendonca, Caio; Barreto, Ikaro; Alvim-Pereira, Claudia; Alvim-Pereira, Fabiano; Gurgel, Ricardo

    2016-01-01

    This study investigates the association between IL1B genotypes using a tag SNP (single polymorphism) approach, maternal and environmental factors in Brazilian women with severe preeclampsia. A case-control study with a total of 456 patients (169 preeclamptic women and 287 controls) was conducted in the two reference maternity hospitals of Sergipe state, Northeast Brazil. A questionnaire was administered and DNA was extracted to genotype the population for four tag SNPs of the IL1Beta: rs 1143643, rs 1143633, rs 1143634 and rs 1143630. Haplotype association analysis and p-values were calculated using the THESIAS test. Odds ratio (OR) estimation, confidence interval (CI) and multivariate logistic regression were performed. High pregestational body mass index (pre-BMI), first gestation, cesarean section, more than six medical visits, low level of consciousness on admission and TC and TT genotype in rs1143630 of IL1Beta showed association with the preeclamptic group in univariate analysis. After multivariate logistic regression pre-BMI, first gestation and low level of consciousness on admission remained associated. We identified an association between clinical variables and preeclampsia. Univariate analysis suggested that inflammatory process-related genes, such as IL1B, may be involved and should be targeted in further studies. The identification of the genetic background involved in preeclampsia host response modulation is mandatory in order to understand the preeclampsia process.

  18. An EM-based semi-parametric mixture model approach to the regression analysis of competing-risks data.

    PubMed

    Ng, S K; McLachlan, G J

    2003-04-15

    We consider a mixture model approach to the regression analysis of competing-risks data. Attention is focused on inference concerning the effects of factors on both the probability of occurrence and the hazard rate conditional on each of the failure types. These two quantities are specified in the mixture model using the logistic model and the proportional hazards model, respectively. We propose a semi-parametric mixture method to estimate the logistic and regression coefficients jointly, whereby the component-baseline hazard functions are completely unspecified. Estimation is based on maximum likelihood on the basis of the full likelihood, implemented via an expectation-conditional maximization (ECM) algorithm. Simulation studies are performed to compare the performance of the proposed semi-parametric method with a fully parametric mixture approach. The results show that when the component-baseline hazard is monotonic increasing, the semi-parametric and fully parametric mixture approaches are comparable for mildly and moderately censored samples. When the component-baseline hazard is not monotonic increasing, the semi-parametric method consistently provides less biased estimates than a fully parametric approach and is comparable in efficiency in the estimation of the parameters for all levels of censoring. The methods are illustrated using a real data set of prostate cancer patients treated with different dosages of the drug diethylstilbestrol. Copyright 2003 John Wiley & Sons, Ltd.

  19. An 8-year study of people with multiple sclerosis in Isfahan, Iran: Association between environmental air pollutants and severity of disease.

    PubMed

    Ashtari, Fereshte; Esmaeil, Nafiseh; Mansourian, Marjan; Poursafa, Parinaz; Mirmosayyeb, Omid; Barzegar, Mahdi; Pourgheisari, Hajar

    2018-06-15

    The evidence for an impact of ambient air pollution on the incidence and severity of multiple sclerosis (MS) is still limited. In the present study, we assessed the association between daily air pollution levels and MS prevalence and severity in Isfahan city, Iran. Data related to MS patients has been collected from 2008 to 2016 in a referral university clinic. The air quality index (AQI) data, were collected from 6 monitoring stations of Isfahan department of environment. The distribution map presenting the sites of air pollution monitoring stations as well as the residential address of MS patients was plotted on geographical information system (GIS). An increase in AQI level in four areas of the city (north, west, east and south) was associated with higher expanded disability status scale (EDSS) of MS patients[logistic regression odds ratio = 1.01 (95% CI = 1.008,1.012)]. Moreover, significant inverse association between the complete remission after the first attack with AQI level in total areas [logistic regression odds ratio = 0.987 (95% CI = 0.977, 0.997)] was found in crude model. However, after adjustment for confounding variables through multivariate logistic regression, AQI level was associated with degree of complete remission after first attack 1.005 (95% CI = 1.004, 1.006). The results of our study suggest that air pollution could play a role in the severity and remission of MS disease. However, more detailed studies with considering the complex involvement of different environmental factors including sunlight exposure, diet, depression and vitamin D are needed to determine the outcome of MS. Copyright © 2018 Elsevier B.V. All rights reserved.

  20. Estimating a Logistic Discrimination Functions When One of the Training Samples Is Subject to Misclassification: A Maximum Likelihood Approach.

    PubMed

    Nagelkerke, Nico; Fidler, Vaclav

    2015-01-01

    The problem of discrimination and classification is central to much of epidemiology. Here we consider the estimation of a logistic regression/discrimination function from training samples, when one of the training samples is subject to misclassification or mislabeling, e.g. diseased individuals are incorrectly classified/labeled as healthy controls. We show that this leads to zero-inflated binomial model with a defective logistic regression or discrimination function, whose parameters can be estimated using standard statistical methods such as maximum likelihood. These parameters can be used to estimate the probability of true group membership among those, possibly erroneously, classified as controls. Two examples are analyzed and discussed. A simulation study explores properties of the maximum likelihood parameter estimates and the estimates of the number of mislabeled observations.

  1. The microbiological profile and presence of bloodstream infection influence mortality rates in necrotizing fasciitis

    PubMed Central

    2011-01-01

    Introduction Necrotizing fasciitis (NF) is a life threatening infectious disease with a high mortality rate. We carried out a microbiological characterization of the causative pathogens. We investigated the correlation of mortality in NF with bloodstream infection and with the presence of co-morbidities. Methods In this retrospective study, we analyzed 323 patients who presented with necrotizing fasciitis at two different institutions. Bloodstream infection (BSI) was defined as a positive blood culture result. The patients were categorized as survivors and non-survivors. Eleven clinically important variables which were statistically significant by univariate analysis were selected for multivariate regression analysis and a stepwise logistic regression model was developed to determine the association between BSI and mortality. Results Univariate logistic regression analysis showed that patients with hypotension, heart disease, liver disease, presence of Vibrio spp. in wound cultures, presence of fungus in wound cultures, and presence of Streptococcus group A, Aeromonas spp. or Vibrio spp. in blood cultures, had a significantly higher risk of in-hospital mortality. Our multivariate logistic regression analysis showed a higher risk of mortality in patients with pre-existing conditions like hypotension, heart disease, and liver disease. Multivariate logistic regression analysis also showed that presence of Vibrio spp in wound cultures, and presence of Streptococcus Group A in blood cultures were associated with a high risk of mortality while debridement > = 3 was associated with improved survival. Conclusions Mortality in patients with necrotizing fasciitis was significantly associated with the presence of Vibrio in wound cultures and Streptococcus group A in blood cultures. PMID:21693053

  2. Ex-post-facto analysis of competitive employment outcomes for individuals with mental retardation: national perspective.

    PubMed

    Moore, Corey L; Harley, Debra A; Gamble, David

    2004-08-01

    Disparities in proportions of competitive job placements and provision of vocational rehabilitation services by level of mental retardation were identified for 28,565 individuals. Chi-square results reveal that consumers with mild mental retardation are significantly more likely to achieve competitive jobs compared to those with more severe levels. Logistic regression indicated that the odds of achieving a competitive job for consumers receiving job placement services, business/vocational training, and counseling were nearly two times that of individuals not receiving such services. Findings suggest that a significantly lower proportion of these services were provided to consumers with severe/profound mental retardation. Implications of findings for service, research, and policy are discussed.

  3. A Two-Stage Method to Determine Optimal Product Sampling considering Dynamic Potential Market

    PubMed Central

    Hu, Zhineng; Lu, Wei; Han, Bing

    2015-01-01

    This paper develops an optimization model for the diffusion effects of free samples under dynamic changes in potential market based on the characteristics of independent product and presents a two-stage method to figure out the sampling level. The impact analysis of the key factors on the sampling level shows that the increase of the external coefficient or internal coefficient has a negative influence on the sampling level. And the changing rate of the potential market has no significant influence on the sampling level whereas the repeat purchase has a positive one. Using logistic analysis and regression analysis, the global sensitivity analysis gives a whole analysis of the interaction of all parameters, which provides a two-stage method to estimate the impact of the relevant parameters in the case of inaccuracy of the parameters and to be able to construct a 95% confidence interval for the predicted sampling level. Finally, the paper provides the operational steps to improve the accuracy of the parameter estimation and an innovational way to estimate the sampling level. PMID:25821847

  4. Risk adjustment in the American College of Surgeons National Surgical Quality Improvement Program: a comparison of logistic versus hierarchical modeling.

    PubMed

    Cohen, Mark E; Dimick, Justin B; Bilimoria, Karl Y; Ko, Clifford Y; Richards, Karen; Hall, Bruce Lee

    2009-12-01

    Although logistic regression has commonly been used to adjust for risk differences in patient and case mix to permit quality comparisons across hospitals, hierarchical modeling has been advocated as the preferred methodology, because it accounts for clustering of patients within hospitals. It is unclear whether hierarchical models would yield important differences in quality assessments compared with logistic models when applied to American College of Surgeons (ACS) National Surgical Quality Improvement Program (NSQIP) data. Our objective was to evaluate differences in logistic versus hierarchical modeling for identifying hospitals with outlying outcomes in the ACS-NSQIP. Data from ACS-NSQIP patients who underwent colorectal operations in 2008 at hospitals that reported at least 100 operations were used to generate logistic and hierarchical prediction models for 30-day morbidity and mortality. Differences in risk-adjusted performance (ratio of observed-to-expected events) and outlier detections from the two models were compared. Logistic and hierarchical models identified the same 25 hospitals as morbidity outliers (14 low and 11 high outliers), but the hierarchical model identified 2 additional high outliers. Both models identified the same eight hospitals as mortality outliers (five low and three high outliers). The values of observed-to-expected events ratios and p values from the two models were highly correlated. Results were similar when data were permitted from hospitals providing < 100 patients. When applied to ACS-NSQIP data, logistic and hierarchical models provided nearly identical results with respect to identification of hospitals' observed-to-expected events ratio outliers. As hierarchical models are prone to implementation problems, logistic regression will remain an accurate and efficient method for performing risk adjustment of hospital quality comparisons.

  5. School Socioeconomic Composition and Adolescent Sexual Initiation in Malawi.

    PubMed

    Kim, Jinho

    2015-09-01

    Numerous studies have documented the determinants of sexual behavior among adolescents in less-developed countries, yet relatively little is known about the influence of social contexts such as school and neighborhood. Using two waves of data from a school-based longitudinal survey conducted in Malawi from 2011-13, this study advances our understanding of the relationship between school-level socioeconomic contexts and adolescents' sexual activity. The results from two-level multinomial logistic regression models suggest that high socioeconomic composition of the student body in school decreases the odds of initiation of sexual activity, independent of other important features of schools and individual-level characteristics. This study also finds that the association between school socioeconomic composition and sexual activity is statistically significant among male adolescents but not female adolescents, suggesting that schools' socioeconomic contexts may be more relevant to male adolescents' initiation of sexual activity. © 2015 The Population Council, Inc.

  6. School socioeconomic composition and adolescent sexual initiation in Malawi

    PubMed Central

    Kim, Jinho

    2015-01-01

    While numerous studies have documented the determinants of sexual behavior among adolescents in less developed countries, relatively little is known about the influence of social contexts such as school and neighborhood. Using two waves of data from a school-based longitudinal survey conducted in Malawi from 2011 to 2013, this study advances our understanding of the relationship between school-level socioeconomic contexts and adolescents’ sexual activity. The results from two-level multinomial logistic regression models suggest that high socioeconomic composition of the student body in school decreases the odds of initiating sexual activity, independently of other important features of schools as well as individual-level characteristics. This study also finds that the association between school socioeconomic composition and sexual activity is statistically significant only among males, but not females, suggesting that school’s socioeconomic contexts may be more relevant to male adolescents’ initiation of sexual activity. PMID:26347090

  7. Adherence in single-parent households in a long-term asthma clinical trial.

    PubMed

    Spicher, Mary; Bollers, Nancy; Chinn, Tamara; Hall, Anita; Plunkett, Anne; Rodgers, Denise; Sundström, D A; Wilson, Laura

    2012-01-01

    Adherence of participants in a long-term clinical trial is necessary to assure validity of findings. This article examines adherence differences between single-parent and two-parent families in the Childhood Asthma Management Program (CAMP). Adherence was defined as the percentage of completed daily diary cards and scheduled study visits during the course of the trial. Logistic regression and ordinal logistic regression analyses were used. Children from single-parent families had a lower percentage of completed diary cards (72% vs. 84%) than two-parent families. Single-parent families were also more likely to reschedule visits (62% vs. 45%) and miss more clinic visits (23% vs. 17%) than two-parent families. Suggestions are given for study coordinators to assist participants in completing a long-term clinical trial. Many suggestions may be adapted for nurses in inpatient or outpatient settings for assisting parents of patients with chronic diseases.

  8. Does substance misuse moderate the relationship between criminal thinking and recidivism?

    PubMed Central

    Caudy, Michael S.; Folk, Johanna B.; Stuewig, Jeffrey B.; Wooditch, Alese; Martinez, Andres; Maass, Stephanie; Tangney, June P.; Taxman, Faye S.

    2014-01-01

    Purpose Some differential intervention frameworks contend that substance use is less robustly related to recidivism outcomes than other criminogenic needs such as criminal thinking. The current study tested the hypothesis that substance use disorder severity moderates the relationship between criminal thinking and recidivism. Methods The study utilized two independent criminal justice samples. Study 1 included 226 drug-involved probationers. Study 2 included 337 jail inmates with varying levels of substance use disorder severity. Logistic regression was employed to test the main and interactive effects of criminal thinking and substance use on multiple dichotomous indicators of recidivism. Results Bivariate analyses revealed a significant correlation between criminal thinking and recidivism in the jail sample (r = .18, p < .05) but no significant relationship in the probation sample. Logistic regressions revealed that SUD symptoms moderated the relationship between criminal thinking and recidivism in the jail-based sample (B = −.58, p < .05). A significant moderation effect was not observed in the probation sample. Conclusions Study findings indicate that substance use disorder symptoms moderate the strength of the association between criminal thinking and recidivism. These findings demonstrate the need for further research into the interaction between various dynamic risk factors. PMID:25598559

  9. Predicting outcome in severe traumatic brain injury using a simple prognostic model.

    PubMed

    Sobuwa, Simpiwe; Hartzenberg, Henry Benjamin; Geduld, Heike; Uys, Corrie

    2014-06-17

    Several studies have made it possible to predict outcome in severe traumatic brain injury (TBI) making it beneficial as an aid for clinical decision-making in the emergency setting. However, reliable predictive models are lacking for resource-limited prehospital settings such as those in developing countries like South Africa. To develop a simple predictive model for severe TBI using clinical variables in a South African prehospital setting. All consecutive patients admitted at two level-one centres in Cape Town, South Africa, for severe TBI were included. A binary logistic regression model was used, which included three predictor variables: oxygen saturation (SpO₂), Glasgow Coma Scale (GCS) and pupil reactivity. The Glasgow Outcome Scale was used to assess outcome on hospital discharge. A total of 74.4% of the outcomes were correctly predicted by the logistic regression model. The model demonstrated SpO₂ (p=0.019), GCS (p=0.001) and pupil reactivity (p=0.002) as independently significant predictors of outcome in severe TBI. Odds ratios of a good outcome were 3.148 (SpO₂ ≥ 90%), 5.108 (GCS 6 - 8) and 4.405 (pupils bilaterally reactive). This model is potentially useful for effective predictions of outcome in severe TBI.

  10. Multivariate prediction of upper limb prosthesis acceptance or rejection.

    PubMed

    Biddiss, Elaine A; Chau, Tom T

    2008-07-01

    To develop a model for prediction of upper limb prosthesis use or rejection. A questionnaire exploring factors in prosthesis acceptance was distributed internationally to individuals with upper limb absence through community-based support groups and rehabilitation hospitals. A total of 191 participants (59 prosthesis rejecters and 132 prosthesis wearers) were included in this study. A logistic regression model, a C5.0 decision tree, and a radial basis function neural network were developed and compared in terms of sensitivity (prediction of prosthesis rejecters), specificity (prediction of prosthesis wearers), and overall cross-validation accuracy. The logistic regression and neural network provided comparable overall accuracies of approximately 84 +/- 3%, specificity of 93%, and sensitivity of 61%. Fitting time-frame emerged as the predominant predictor. Individuals fitted within two years of birth (congenital) or six months of amputation (acquired) were 16 times more likely to continue prosthesis use. To increase rates of prosthesis acceptance, clinical directives should focus on timely, client-centred fitting strategies and the development of improved prostheses and healthcare for individuals with high-level or bilateral limb absence. Multivariate analyses are useful in determining the relative importance of the many factors involved in prosthesis acceptance and rejection.

  11. Are Women More Likely to Be Hired or Promoted into Management Positions?

    ERIC Educational Resources Information Center

    Lyness, Karen S.; Judiesch, Michael K.

    1999-01-01

    In a three-year study of 30,996 financial-services managers, logistic regression analyses showed that women were more likely to be promoted rather than hired into management positions. Relative to men, women in higher-level positions received fewer promotions than women in lower-level positions. (63 references) (SK)

  12. A Combined Pathway and Regional Heritability Analysis Indicates NETRIN1 Pathway Is Associated With Major Depressive Disorder.

    PubMed

    Zeng, Yanni; Navarro, Pau; Fernandez-Pujals, Ana M; Hall, Lynsey S; Clarke, Toni-Kim; Thomson, Pippa A; Smith, Blair H; Hocking, Lynne J; Padmanabhan, Sandosh; Hayward, Caroline; MacIntyre, Donald J; Wray, Naomi R; Deary, Ian J; Porteous, David J; Haley, Chris S; McIntosh, Andrew M

    2017-02-15

    Genome-wide association studies (GWASs) of major depressive disorder (MDD) have identified few significant associations. Testing the aggregation of genetic variants, in particular biological pathways, may be more powerful. Regional heritability analysis can be used to detect genomic regions that contribute to disease risk. We integrated pathway analysis and multilevel regional heritability analyses in a pipeline designed to identify MDD-associated pathways. The pipeline was applied to two independent GWAS samples [Generation Scotland: The Scottish Family Health Study (GS:SFHS, N = 6455) and Psychiatric Genomics Consortium (PGC:MDD) (N = 18,759)]. A polygenic risk score (PRS) composed of single nucleotide polymorphisms from the pathway most consistently associated with MDD was created, and its accuracy to predict MDD, using area under the curve, logistic regression, and linear mixed model analyses, was tested. In GS:SFHS, four pathways were significantly associated with MDD, and two of these explained a significant amount of pathway-level regional heritability. In PGC:MDD, one pathway was significantly associated with MDD. Pathway-level regional heritability was significant in this pathway in one subset of PGC:MDD. For both samples the regional heritabilities were further localized to the gene and subregion levels. The NETRIN1 signaling pathway showed the most consistent association with MDD across the two samples. PRSs from this pathway showed competitive predictive accuracy compared with the whole-genome PRSs when using area under the curve statistics, logistic regression, and linear mixed model. These post-GWAS analyses highlight the value of combining multiple methods on multiple GWAS data for the identification of risk pathways for MDD. The NETRIN1 signaling pathway is identified as a candidate pathway for MDD and should be explored in further large population studies. Copyright © 2016 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.

  13. Predicting 30-day Hospital Readmission with Publicly Available Administrative Database. A Conditional Logistic Regression Modeling Approach.

    PubMed

    Zhu, K; Lou, Z; Zhou, J; Ballester, N; Kong, N; Parikh, P

    2015-01-01

    This article is part of the Focus Theme of Methods of Information in Medicine on "Big Data and Analytics in Healthcare". Hospital readmissions raise healthcare costs and cause significant distress to providers and patients. It is, therefore, of great interest to healthcare organizations to predict what patients are at risk to be readmitted to their hospitals. However, current logistic regression based risk prediction models have limited prediction power when applied to hospital administrative data. Meanwhile, although decision trees and random forests have been applied, they tend to be too complex to understand among the hospital practitioners. Explore the use of conditional logistic regression to increase the prediction accuracy. We analyzed an HCUP statewide inpatient discharge record dataset, which includes patient demographics, clinical and care utilization data from California. We extracted records of heart failure Medicare beneficiaries who had inpatient experience during an 11-month period. We corrected the data imbalance issue with under-sampling. In our study, we first applied standard logistic regression and decision tree to obtain influential variables and derive practically meaning decision rules. We then stratified the original data set accordingly and applied logistic regression on each data stratum. We further explored the effect of interacting variables in the logistic regression modeling. We conducted cross validation to assess the overall prediction performance of conditional logistic regression (CLR) and compared it with standard classification models. The developed CLR models outperformed several standard classification models (e.g., straightforward logistic regression, stepwise logistic regression, random forest, support vector machine). For example, the best CLR model improved the classification accuracy by nearly 20% over the straightforward logistic regression model. Furthermore, the developed CLR models tend to achieve better sensitivity of more than 10% over the standard classification models, which can be translated to correct labeling of additional 400 - 500 readmissions for heart failure patients in the state of California over a year. Lastly, several key predictor identified from the HCUP data include the disposition location from discharge, the number of chronic conditions, and the number of acute procedures. It would be beneficial to apply simple decision rules obtained from the decision tree in an ad-hoc manner to guide the cohort stratification. It could be potentially beneficial to explore the effect of pairwise interactions between influential predictors when building the logistic regression models for different data strata. Judicious use of the ad-hoc CLR models developed offers insights into future development of prediction models for hospital readmissions, which can lead to better intuition in identifying high-risk patients and developing effective post-discharge care strategies. Lastly, this paper is expected to raise the awareness of collecting data on additional markers and developing necessary database infrastructure for larger-scale exploratory studies on readmission risk prediction.

  14. The Association between Overweight and School Policies on Physical Activity: A Multilevel Analysis among Elementary School Youth in the PLAY-On Study

    ERIC Educational Resources Information Center

    Leatherdale, Scott T.

    2010-01-01

    The objective is to examine school-level program and policy characteristics and student-level behavioural characteristics associated with being overweight. Multilevel logistic regression analysis were used to examine the school- and student-level characteristics associated with the odds of a student being overweight among 1264 Grade 5-8 students…

  15. The Impact of Household Heads' Education Levels on the Poverty Risk: The Evidence from Turkey

    ERIC Educational Resources Information Center

    Bilenkisi, Fikret; Gungor, Mahmut Sami; Tapsin, Gulcin

    2015-01-01

    This study aims to analyze the relationship between the education levels of household heads and the poverty risk of households in Turkey. The logistic regression models have been estimated with the poverty risk of a household as a dependent variable and a set of educational levels as explanatory variables for all households. There are subgroups of…

  16. Prediction of spatially explicit rainfall intensity-duration thresholds for post-fire debris-flow generation in the western United States

    NASA Astrophysics Data System (ADS)

    Staley, Dennis; Negri, Jacquelyn; Kean, Jason

    2016-04-01

    Population expansion into fire-prone steeplands has resulted in an increase in post-fire debris-flow risk in the western United States. Logistic regression methods for determining debris-flow likelihood and the calculation of empirical rainfall intensity-duration thresholds for debris-flow initiation represent two common approaches for characterizing hazard and reducing risk. Logistic regression models are currently being used to rapidly assess debris-flow hazard in response to design storms of known intensities (e.g. a 10-year recurrence interval rainstorm). Empirical rainfall intensity-duration thresholds comprise a major component of the United States Geological Survey (USGS) and the National Weather Service (NWS) debris-flow early warning system at a regional scale in southern California. However, these two modeling approaches remain independent, with each approach having limitations that do not allow for synergistic local-scale (e.g. drainage-basin scale) characterization of debris-flow hazard during intense rainfall. The current logistic regression equations consider rainfall a unique independent variable, which prevents the direct calculation of the relation between rainfall intensity and debris-flow likelihood. Regional (e.g. mountain range or physiographic province scale) rainfall intensity-duration thresholds fail to provide insight into the basin-scale variability of post-fire debris-flow hazard and require an extensive database of historical debris-flow occurrence and rainfall characteristics. Here, we present a new approach that combines traditional logistic regression and intensity-duration threshold methodologies. This method allows for local characterization of both the likelihood that a debris-flow will occur at a given rainfall intensity, the direct calculation of the rainfall rates that will result in a given likelihood, and the ability to calculate spatially explicit rainfall intensity-duration thresholds for debris-flow generation in recently burned areas. Our approach synthesizes the two methods by incorporating measured rainfall intensity into each model variable (based on measures of topographic steepness, burn severity and surface properties) within the logistic regression equation. This approach provides a more realistic representation of the relation between rainfall intensity and debris-flow likelihood, as likelihood values asymptotically approach zero when rainfall intensity approaches 0 mm/h, and increase with more intense rainfall. Model performance was evaluated by comparing predictions to several existing regional thresholds. The model, based upon training data collected in southern California, USA, has proven to accurately predict rainfall intensity-duration thresholds for other areas in the western United States not included in the original training dataset. In addition, the improved logistic regression model shows promise for emergency planning purposes and real-time, site-specific early warning. With further validation, this model may permit the prediction of spatially-explicit intensity-duration thresholds for debris-flow generation in areas where empirically derived regional thresholds do not exist. This improvement would permit the expansion of the early-warning system into other regions susceptible to post-fire debris flow.

  17. Interpretation of commonly used statistical regression models.

    PubMed

    Kasza, Jessica; Wolfe, Rory

    2014-01-01

    A review of some regression models commonly used in respiratory health applications is provided in this article. Simple linear regression, multiple linear regression, logistic regression and ordinal logistic regression are considered. The focus of this article is on the interpretation of the regression coefficients of each model, which are illustrated through the application of these models to a respiratory health research study. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.

  18. Reducing false-positive incidental findings with ensemble genotyping and logistic regression based variant filtering methods.

    PubMed

    Hwang, Kyu-Baek; Lee, In-Hee; Park, Jin-Ho; Hambuch, Tina; Choe, Yongjoon; Kim, MinHyeok; Lee, Kyungjoon; Song, Taemin; Neu, Matthew B; Gupta, Neha; Kohane, Isaac S; Green, Robert C; Kong, Sek Won

    2014-08-01

    As whole genome sequencing (WGS) uncovers variants associated with rare and common diseases, an immediate challenge is to minimize false-positive findings due to sequencing and variant calling errors. False positives can be reduced by combining results from orthogonal sequencing methods, but costly. Here, we present variant filtering approaches using logistic regression (LR) and ensemble genotyping to minimize false positives without sacrificing sensitivity. We evaluated the methods using paired WGS datasets of an extended family prepared using two sequencing platforms and a validated set of variants in NA12878. Using LR or ensemble genotyping based filtering, false-negative rates were significantly reduced by 1.1- to 17.8-fold at the same levels of false discovery rates (5.4% for heterozygous and 4.5% for homozygous single nucleotide variants (SNVs); 30.0% for heterozygous and 18.7% for homozygous insertions; 25.2% for heterozygous and 16.6% for homozygous deletions) compared to the filtering based on genotype quality scores. Moreover, ensemble genotyping excluded > 98% (105,080 of 107,167) of false positives while retaining > 95% (897 of 937) of true positives in de novo mutation (DNM) discovery in NA12878, and performed better than a consensus method using two sequencing platforms. Our proposed methods were effective in prioritizing phenotype-associated variants, and an ensemble genotyping would be essential to minimize false-positive DNM candidates. © 2014 WILEY PERIODICALS, INC.

  19. Reducing false positive incidental findings with ensemble genotyping and logistic regression-based variant filtering methods

    PubMed Central

    Hwang, Kyu-Baek; Lee, In-Hee; Park, Jin-Ho; Hambuch, Tina; Choi, Yongjoon; Kim, MinHyeok; Lee, Kyungjoon; Song, Taemin; Neu, Matthew B.; Gupta, Neha; Kohane, Isaac S.; Green, Robert C.; Kong, Sek Won

    2014-01-01

    As whole genome sequencing (WGS) uncovers variants associated with rare and common diseases, an immediate challenge is to minimize false positive findings due to sequencing and variant calling errors. False positives can be reduced by combining results from orthogonal sequencing methods, but costly. Here we present variant filtering approaches using logistic regression (LR) and ensemble genotyping to minimize false positives without sacrificing sensitivity. We evaluated the methods using paired WGS datasets of an extended family prepared using two sequencing platforms and a validated set of variants in NA12878. Using LR or ensemble genotyping based filtering, false negative rates were significantly reduced by 1.1- to 17.8-fold at the same levels of false discovery rates (5.4% for heterozygous and 4.5% for homozygous SNVs; 30.0% for heterozygous and 18.7% for homozygous insertions; 25.2% for heterozygous and 16.6% for homozygous deletions) compared to the filtering based on genotype quality scores. Moreover, ensemble genotyping excluded > 98% (105,080 of 107,167) of false positives while retaining > 95% (897 of 937) of true positives in de novo mutation (DNM) discovery, and performed better than a consensus method using two sequencing platforms. Our proposed methods were effective in prioritizing phenotype-associated variants, and ensemble genotyping would be essential to minimize false positive DNM candidates. PMID:24829188

  20. Dentists' perspectives on caries-related treatment decisions.

    PubMed

    Gomez, J; Ellwood, R P; Martignon, S; Pretty, I A

    2014-06-01

    To assess the impact of patient risk status on Colombian dentists' caries related treatment decisions for early to intermediate caries lesions (ICDAS code 2 to 4). A web-based questionnaire assessed dentists' views on the management of early/intermediate lesions. The questionnaire included questions on demographic characteristics, five clinical scenarios with randomised levels of caries risk, and two questions on different clinical and radiographic sets of images with different thresholds of caries. Questionnaires were completed by 439 dentists. For the two scenarios describing occlusal lesions ICDAS code 2, dentists chose to provide a preventive option in 63% and 60% of the cases. For the approximal lesion ICDAS code 2, 81% of the dentists chose to restore. The main findings of the binary logistic regression analysis for the clinical scenarios suggest that for the ICDAS code 2 occlusal lesions, the odds of a high caries risk patient having restorations is higher than for a low caries risk patient. For the questions describing different clinical thresholds of caries, most dentists would restore at ICDAS code 2 (55%) and for the question showing different radiographic thresholds images, 65% of dentists would intervene operatively at the inner half of enamel. No significant differences with respect to risk were found for these questions with the logistic regression. The results of this study indicate that Colombian dentists have not yet fully adopted non-invasive treatment for early caries lesions.

  1. Minimal intervention dentistry for early childhood caries and child dental anxiety: a randomized controlled trial.

    PubMed

    Arrow, P; Klobas, E

    2017-06-01

    To compare changes in child dental anxiety after treatment for early childhood caries (ECC) using two treatment approaches. Children with ECC were randomized to test (atraumatic restorative treatment (ART)-based approach) or control (standard care approach) groups. Children aged 3 years or older completed a dental anxiety scale at baseline and follow up. Changes in child dental anxiety from baseline to follow up were tested using the chi-squared statistic, Wilcoxon rank sum test, McNemar's test and multinomial logistic regression. Two hundred and fifty-four children were randomized (N = 127 test, N = 127 control). At baseline, 193 children completed the dental anxiety scale, 211 at follow up and 170 completed the scale on both occasions. Children who were anxious at baseline (11%) were no longer anxious at follow up, and 11% non-anxious children became anxious. Multinomial logistic regression found each increment in the number of visits increased the odds of worsening dental anxiety (odds ratio (OR), 2.2; P < 0.05), whereas each increment in the number of treatments lowered the odds of worsening anxiety (OR, 0.50; P = 0.05). The ART-based approach to managing ECC resulted in similar levels of dental anxiety to the standard treatment approach and provides a valuable alternative approach to the management of ECC in a primary dental care setting. © 2016 Australian Dental Association.

  2. The relationship between hemoglobin level and the type 1 diabetic nephropathy in Anhui Han's patients.

    PubMed

    Jiang, Jun; Lei, Lan; Zhou, Xiaowan; Li, Peng; Wei, Ren

    2018-02-20

    Recent studies have shown that low hemoglobin (Hb) level promote the progression of chronic kidney disease. This study assessed the relationship between Hb level and type 1 diabetic nephropathy (DN) in Anhui Han's patients. There were a total of 236 patients diagnosed with type 1 diabetes mellitus and (T1DM) seen between January 2014 and December 2016 in our centre. Hemoglobin levels in patients with DN were compared with those without DN. The relationship between Hb level and the urinary albumin-creatinine ratio (ACR) was examined by Spearman's correlational analysis and multiple stepwise regression analysis. The binary logistic multivariate regression analysis was performed to analyze the correlated factors for type 1 DN, calculate the Odds Ratio (OR) and 95%confidence interval (CI). The predicting value of Hb level for DN was evaluated by area under receiver operation characteristic curve (AUROC) for discrimination and Hosmer-Lemeshow goodness-of-fit test for calibration. The average Hb levels in the DN group (116.1 ± 20.8 g/L) were significantly lower than the non-DN group (131.9 ± 14.4 g/L) , P < 0.001. Hb levels were independently correlated with the urinary ACR in multiple stepwise regression analysis. The logistic multivariate regression analysis showed that the Hb level (OR: 0.936, 95% CI: 0.910 to 0.963, P < 0.001) was inversely correlated with DN in patients with T1DM. In sub-analysis, low Hb level (Hb < 120g/L in female, Hb < 130g/L in male) was still negatively associated with DN in patients with T1DM. The AUROC was 0.721 (95% CI: 0.655 to 0.787) in assessing the discrimination of the Hb level for DN. The value of P was 0.593 in Hosmer-Lemeshow goodness-of-fit test. In Anhui Han's patients with T1DM, the Hb level is inversely correlated with urinary ACR and DN. This article is protected by copyright. All rights reserved.

  3. Ecological correlates of depression and self-esteem in rural youth.

    PubMed

    Smokowski, Paul R; Evans, Caroline B R; Cotter, Katie L; Guo, Shenyang

    2014-10-01

    The current study examines individual-, social-, and school-level characteristics influencing symptoms of depression and self-esteem among a large sample (N = 4,321) of U.S. youth living in two rural counties in the South. Survey data for this sample of middle-school students (Grade 6 to Grade 8) were part of the Rural Adaptation Project. Data were analyzed using ordered logistic regression. Results show that being female, having a low income, and having negative relationships with parents and peers are risk factors that increase the probability of reporting high levels of depressive symptoms and low levels of self-esteem. In contrast, supportive relationships with parents and peers, high religious orientation, ethnic identity, and school satisfaction increased the probability of reporting low levels of depressive symptoms and high levels of self-esteem. There were few school-level characteristics associated with levels of depressive symptoms and self-esteem. Implications are discussed.

  4. Differentially private distributed logistic regression using private and public data

    PubMed Central

    2014-01-01

    Background Privacy protecting is an important issue in medical informatics and differential privacy is a state-of-the-art framework for data privacy research. Differential privacy offers provable privacy against attackers who have auxiliary information, and can be applied to data mining models (for example, logistic regression). However, differentially private methods sometimes introduce too much noise and make outputs less useful. Given available public data in medical research (e.g. from patients who sign open-consent agreements), we can design algorithms that use both public and private data sets to decrease the amount of noise that is introduced. Methodology In this paper, we modify the update step in Newton-Raphson method to propose a differentially private distributed logistic regression model based on both public and private data. Experiments and results We try our algorithm on three different data sets, and show its advantage over: (1) a logistic regression model based solely on public data, and (2) a differentially private distributed logistic regression model based on private data under various scenarios. Conclusion Logistic regression models built with our new algorithm based on both private and public datasets demonstrate better utility than models that trained on private or public datasets alone without sacrificing the rigorous privacy guarantee. PMID:25079786

  5. A retrospective analysis to identify the factors affecting infection in patients undergoing chemotherapy.

    PubMed

    Park, Ji Hyun; Kim, Hyeon-Young; Lee, Hanna; Yun, Eun Kyoung

    2015-12-01

    This study compares the performance of the logistic regression and decision tree analysis methods for assessing the risk factors for infection in cancer patients undergoing chemotherapy. The subjects were 732 cancer patients who were receiving chemotherapy at K university hospital in Seoul, Korea. The data were collected between March 2011 and February 2013 and were processed for descriptive analysis, logistic regression and decision tree analysis using the IBM SPSS Statistics 19 and Modeler 15.1 programs. The most common risk factors for infection in cancer patients receiving chemotherapy were identified as alkylating agents, vinca alkaloid and underlying diabetes mellitus. The logistic regression explained 66.7% of the variation in the data in terms of sensitivity and 88.9% in terms of specificity. The decision tree analysis accounted for 55.0% of the variation in the data in terms of sensitivity and 89.0% in terms of specificity. As for the overall classification accuracy, the logistic regression explained 88.0% and the decision tree analysis explained 87.2%. The logistic regression analysis showed a higher degree of sensitivity and classification accuracy. Therefore, logistic regression analysis is concluded to be the more effective and useful method for establishing an infection prediction model for patients undergoing chemotherapy. Copyright © 2015 Elsevier Ltd. All rights reserved.

  6. Performance and strategy comparisons of human listeners and logistic regression in discriminating underwater targets.

    PubMed

    Yang, Lixue; Chen, Kean

    2015-11-01

    To improve the design of underwater target recognition systems based on auditory perception, this study compared human listeners with automatic classifiers. Performances measures and strategies in three discrimination experiments, including discriminations between man-made and natural targets, between ships and submarines, and among three types of ships, were used. In the experiments, the subjects were asked to assign a score to each sound based on how confident they were about the category to which it belonged, and logistic regression, which represents linear discriminative models, also completed three similar tasks by utilizing many auditory features. The results indicated that the performances of logistic regression improved as the ratio between inter- and intra-class differences became larger, whereas the performances of the human subjects were limited by their unfamiliarity with the targets. Logistic regression performed better than the human subjects in all tasks but the discrimination between man-made and natural targets, and the strategies employed by excellent human subjects were similar to that of logistic regression. Logistic regression and several human subjects demonstrated similar performances when discriminating man-made and natural targets, but in this case, their strategies were not similar. An appropriate fusion of their strategies led to further improvement in recognition accuracy.

  7. The Prevalence and Clinical Significance of Low Procalcitonin Levels Among Patients With Severe Sepsis or Septic Shock in the Emergency Department.

    PubMed

    Choe, Eun A; Shin, Tae Gun; Jo, Ik Joon; Hwang, Sung Yeon; Lee, Tae Rim; Cha, Won Chul; Sim, Min Seob

    2016-07-01

    The aims of this study were to evaluate the prevalence of low procalcitonin (PCT) levels among patients with severe sepsis or septic shock, and to investigate clinical characteristics and outcomes associated with low PCT levels. We analyzed data from the sepsis registry for patients with severe sepsis or septic shock in the emergency department. Based on a specific PCT cutoff value, patients were classified into two groups: a low PCT group, PCT <0.25 ng/mL; and a high PCT group, PCT ≥0.25 ng/mL. The primary endpoint was 28-day mortality. A multivariable logistic regression model was used to evaluate independent factors associated with low PCT and 28-day mortality. A total of 1,212 patients were included. Of the eligible patients, 154 (12.7%) were assigned to the low PCT group, and 1,058 (87.3%) to the high PCT group. The 28-day mortality was 4.6% in the low PCT group and 13.5% in the high PCT group (P < 0.01). The adjusted odds ratio of the low PCT group for 28-day mortality was 0.43 (95% CI 0.19-0.98; P = 0.04). There was no trend of increasing mortality among higher PCT level patients. In a logistic regression model, factors associated with low PCT were pneumonia, lower C-reactive protein levels, lower lactate levels, the absence of bacteremia, and the absence of organ failure. Intra-abdominal infection and obesity were associated with high PCT. Initial low PCT levels were common among patients diagnosed with severe sepsis or septic shock in the emergency department, suggesting favorable outcomes. The prevalence of low PCT levels was significantly different according to obesity, the source of infection, C-reactive protein levels, lactate levels, bacteremia, and organ failure.

  8. Prediction of cold and heat patterns using anthropometric measures based on machine learning.

    PubMed

    Lee, Bum Ju; Lee, Jae Chul; Nam, Jiho; Kim, Jong Yeol

    2018-01-01

    To examine the association of body shape with cold and heat patterns, to determine which anthropometric measure is the best indicator for discriminating between the two patterns, and to investigate whether using a combination of measures can improve the predictive power to diagnose these patterns. Based on a total of 4,859 subjects (3,000 women and 1,859 men), statistical analyses using binary logistic regression were performed to assess the significance of the difference and the predictive power of each anthropometric measure, and binary logistic regression and Naive Bayes with the variable selection technique were used to assess the improvement in the predictive power of the patterns using the combined measures. In women, the strongest indicators for determining the cold and heat patterns among anthropometric measures were body mass index (BMI) and rib circumference; in men, the best indicator was BMI. In experiments using a combination of measures, the values of the area under the receiver operating characteristic curve in women were 0.776 by Naive Bayes and 0.772 by logistic regression, and the values in men were 0.788 by Naive Bayes and 0.779 by logistic regression. Individuals with a higher BMI have a tendency toward a heat pattern in both women and men. The use of a combination of anthropometric measures can slightly improve the diagnostic accuracy. Our findings can provide fundamental information for the diagnosis of cold and heat patterns based on body shape for personalized medicine.

  9. Unitary Response Regression Models

    ERIC Educational Resources Information Center

    Lipovetsky, S.

    2007-01-01

    The dependent variable in a regular linear regression is a numerical variable, and in a logistic regression it is a binary or categorical variable. In these models the dependent variable has varying values. However, there are problems yielding an identity output of a constant value which can also be modelled in a linear or logistic regression with…

  10. Binary logistic regression-Instrument for assessing museum indoor air impact on exhibits.

    PubMed

    Bucur, Elena; Danet, Andrei Florin; Lehr, Carol Blaziu; Lehr, Elena; Nita-Lazar, Mihai

    2017-04-01

    This paper presents a new way to assess the environmental impact on historical artifacts using binary logistic regression. The prediction of the impact on the exhibits during certain pollution scenarios (environmental impact) was calculated by a mathematical model based on the binary logistic regression; it allows the identification of those environmental parameters from a multitude of possible parameters with a significant impact on exhibitions and ranks them according to their severity effect. Air quality (NO 2 , SO 2 , O 3 and PM 2.5 ) and microclimate parameters (temperature, humidity) monitoring data from a case study conducted within exhibition and storage spaces of the Romanian National Aviation Museum Bucharest have been used for developing and validating the binary logistic regression method and the mathematical model. The logistic regression analysis was used on 794 data combinations (715 to develop of the model and 79 to validate it) by a Statistical Package for Social Sciences (SPSS 20.0). The results from the binary logistic regression analysis demonstrated that from six parameters taken into consideration, four of them present a significant effect upon exhibits in the following order: O 3 >PM 2.5 >NO 2 >humidity followed at a significant distance by the effects of SO 2 and temperature. The mathematical model, developed in this study, correctly predicted 95.1 % of the cumulated effect of the environmental parameters upon the exhibits. Moreover, this model could also be used in the decisional process regarding the preventive preservation measures that should be implemented within the exhibition space. The paper presents a new way to assess the environmental impact on historical artifacts using binary logistic regression. The mathematical model developed on the environmental parameters analyzed by the binary logistic regression method could be useful in a decision-making process establishing the best measures for pollution reduction and preventive preservation of exhibits.

  11. Improving power and robustness for detecting genetic association with extreme-value sampling design.

    PubMed

    Chen, Hua Yun; Li, Mingyao

    2011-12-01

    Extreme-value sampling design that samples subjects with extremely large or small quantitative trait values is commonly used in genetic association studies. Samples in such designs are often treated as "cases" and "controls" and analyzed using logistic regression. Such a case-control analysis ignores the potential dose-response relationship between the quantitative trait and the underlying trait locus and thus may lead to loss of power in detecting genetic association. An alternative approach to analyzing such data is to model the dose-response relationship by a linear regression model. However, parameter estimation from this model can be biased, which may lead to inflated type I errors. We propose a robust and efficient approach that takes into consideration of both the biased sampling design and the potential dose-response relationship. Extensive simulations demonstrate that the proposed method is more powerful than the traditional logistic regression analysis and is more robust than the linear regression analysis. We applied our method to the analysis of a candidate gene association study on high-density lipoprotein cholesterol (HDL-C) which includes study subjects with extremely high or low HDL-C levels. Using our method, we identified several SNPs showing a stronger evidence of association with HDL-C than the traditional case-control logistic regression analysis. Our results suggest that it is important to appropriately model the quantitative traits and to adjust for the biased sampling when dose-response relationship exists in extreme-value sampling designs. © 2011 Wiley Periodicals, Inc.

  12. Supporting Regularized Logistic Regression Privately and Efficiently.

    PubMed

    Li, Wenfa; Liu, Hongzhe; Yang, Peng; Xie, Wei

    2016-01-01

    As one of the most popular statistical and machine learning models, logistic regression with regularization has found wide adoption in biomedicine, social sciences, information technology, and so on. These domains often involve data of human subjects that are contingent upon strict privacy regulations. Concerns over data privacy make it increasingly difficult to coordinate and conduct large-scale collaborative studies, which typically rely on cross-institution data sharing and joint analysis. Our work here focuses on safeguarding regularized logistic regression, a widely-used statistical model while at the same time has not been investigated from a data security and privacy perspective. We consider a common use scenario of multi-institution collaborative studies, such as in the form of research consortia or networks as widely seen in genetics, epidemiology, social sciences, etc. To make our privacy-enhancing solution practical, we demonstrate a non-conventional and computationally efficient method leveraging distributing computing and strong cryptography to provide comprehensive protection over individual-level and summary data. Extensive empirical evaluations on several studies validate the privacy guarantee, efficiency and scalability of our proposal. We also discuss the practical implications of our solution for large-scale studies and applications from various disciplines, including genetic and biomedical studies, smart grid, network analysis, etc.

  13. Supporting Regularized Logistic Regression Privately and Efficiently

    PubMed Central

    Li, Wenfa; Liu, Hongzhe; Yang, Peng; Xie, Wei

    2016-01-01

    As one of the most popular statistical and machine learning models, logistic regression with regularization has found wide adoption in biomedicine, social sciences, information technology, and so on. These domains often involve data of human subjects that are contingent upon strict privacy regulations. Concerns over data privacy make it increasingly difficult to coordinate and conduct large-scale collaborative studies, which typically rely on cross-institution data sharing and joint analysis. Our work here focuses on safeguarding regularized logistic regression, a widely-used statistical model while at the same time has not been investigated from a data security and privacy perspective. We consider a common use scenario of multi-institution collaborative studies, such as in the form of research consortia or networks as widely seen in genetics, epidemiology, social sciences, etc. To make our privacy-enhancing solution practical, we demonstrate a non-conventional and computationally efficient method leveraging distributing computing and strong cryptography to provide comprehensive protection over individual-level and summary data. Extensive empirical evaluations on several studies validate the privacy guarantee, efficiency and scalability of our proposal. We also discuss the practical implications of our solution for large-scale studies and applications from various disciplines, including genetic and biomedical studies, smart grid, network analysis, etc. PMID:27271738

  14. Classification of Effective Soil Depth by Using Multinomial Logistic Regression Analysis

    NASA Astrophysics Data System (ADS)

    Chang, C. H.; Chan, H. C.; Chen, B. A.

    2016-12-01

    Classification of effective soil depth is a task of determining the slopeland utilizable limitation in Taiwan. The "Slopeland Conservation and Utilization Act" categorizes the slopeland into agriculture and husbandry land, land suitable for forestry and land for enhanced conservation according to the factors including average slope, effective soil depth, soil erosion and parental rock. However, sit investigation of the effective soil depth requires a cost-effective field work. This research aimed to classify the effective soil depth by using multinomial logistic regression with the environmental factors. The Wen-Shui Watershed located at the central Taiwan was selected as the study areas. The analysis of multinomial logistic regression is performed by the assistance of a Geographic Information Systems (GIS). The effective soil depth was categorized into four levels including deeper, deep, shallow and shallower. The environmental factors of slope, aspect, digital elevation model (DEM), curvature and normalized difference vegetation index (NDVI) were selected for classifying the soil depth. An Error Matrix was then used to assess the model accuracy. The results showed an overall accuracy of 75%. At the end, a map of effective soil depth was produced to help planners and decision makers in determining the slopeland utilizable limitation in the study areas.

  15. Enzymatic Activity of Glutathione S-Transferase and Dental Fluorosis Among Children Receiving Two Different Levels of Naturally Fluoridated Water.

    PubMed

    Bonola-Gallardo, Irvin; Irigoyen-Camacho, María Esther; Vera-Robles, Liliana; Campero, Antonio; Gómez-Quiroz, Luis

    2017-03-01

    This study was conducted to measure the activity of the enzyme glutathione S-transferase (GST) in saliva and to compare the activity of this enzyme in children with and without dental fluorosis in communities with different concentrations of naturally fluoridated water. A total of 141 schoolchildren participated in this cross-sectional study. Children were selected from two communities: one with a low (0.4 ppm) and the other with a high (1.8 ppm) water fluoride concentration. Dental fluorosis was evaluated by applying the Thylstrup and Fejerskov Index (TFI) criteria. Stimulated saliva was obtained, and fluoride concentration and GST activity were measured. The GST activity was compared among children with different levels of dental fluorosis using multinomial logistic regression models and odds ratios (OR). The mean age of the children was 10.6 (±1.03) years. Approximately half of the children showed dental fluorosis (52.5 %). The average GST activity was 0.5678 (±0.1959) nmol/min/μg. A higher concentration of fluoride in the saliva was detected in children with a higher GST activity (p = 0.039). A multinomial logistic regression model used to evaluate the GST activity and the dental fluorosis score identified a strong association between TFI = 2-3 (OR = 15.44, p = 0.007) and TFI ≥ 4 (OR = 55.40, p = 0.026) and the GST activity level, compared with children showing TFI = 0-1, adjusted for age and sex. Schoolchildren with higher levels of dental fluorosis and a higher fluoride concentration in the saliva showed greater GST activity. The increased GST activity most likely was the result of the body's need to inactivate free radicals produced by exposure to fluoride.

  16. Does Group-Level Commitment Predict Employee Well-Being?: A Prospective Analysis.

    PubMed

    Clausen, Thomas; Christensen, Karl Bang; Nielsen, Karina

    2015-11-01

    To investigate the links between group-level affective organizational commitment (AOC) and individual-level psychological well-being, self-reported sickness absence, and sleep disturbances. A total of 5085 care workers from 301 workgroups in the Danish eldercare services participated in both waves of the study (T1 [2005] and T2 [2006]). The three outcomes were analyzed using linear multilevel regression analysis, multilevel Poisson regression analysis, and multilevel logistic regression analysis, respectively. Group-level AOC (T1) significantly predicted individual-level psychological well-being, self-reported sickness absence, and sleep disturbances (T2). The association between group-level AOC (T1) and psychological well-being (T2) was fully mediated by individual-level AOC (T1), and the associations between group-level AOC (T1) and self-reported sickness absence and sleep disturbances (T2) were partially mediated by individual-level AOC (T1). Group-level AOC is an important predictor of employee well-being in contemporary health care organizations.

  17. Cluster Analysis of Campylobacter jejuni Genotypes Isolated from Small and Medium-Sized Mammalian Wildlife and Bovine Livestock from Ontario Farms.

    PubMed

    Viswanathan, M; Pearl, D L; Taboada, E N; Parmley, E J; Mutschall, S K; Jardine, C M

    2017-05-01

    Using data collected from a cross-sectional study of 25 farms (eight beef, eight swine and nine dairy) in 2010, we assessed clustering of molecular subtypes of C. jejuni based on a Campylobacter-specific 40 gene comparative genomic fingerprinting assay (CGF40) subtypes, using unweighted pair-group method with arithmetic mean (UPGMA) analysis, and multiple correspondence analysis. Exact logistic regression was used to determine which genes differentiate wildlife and livestock subtypes in our study population. A total of 33 bovine livestock (17 beef and 16 dairy), 26 wildlife (20 raccoon (Procyon lotor), five skunk (Mephitis mephitis) and one mouse (Peromyscus spp.) C. jejuni isolates were subtyped using CGF40. Dendrogram analysis, based on UPGMA, showed distinct branches separating bovine livestock and mammalian wildlife isolates. Furthermore, two-dimensional multiple correspondence analysis was highly concordant with dendrogram analysis showing clear differentiation between livestock and wildlife CGF40 subtypes. Based on multilevel logistic regression models with a random intercept for farm of origin, we found that isolates in general, and raccoons more specifically, were significantly more likely to be part of the wildlife branch. Exact logistic regression conducted gene by gene revealed 15 genes that were predictive of whether an isolate was of wildlife or bovine livestock isolate origin. Both multiple correspondence analysis and exact logistic regression revealed that in most cases, the presence of a particular gene (13 of 15) was associated with an isolate being of livestock rather than wildlife origin. In conclusion, the evidence gained from dendrogram analysis, multiple correspondence analysis and exact logistic regression indicates that mammalian wildlife carry CGF40 subtypes of C. jejuni distinct from those carried by bovine livestock. Future studies focused on source attribution of C. jejuni in human infections will help determine whether wildlife transmit Campylobacter jejuni directly to humans. © 2016 Blackwell Verlag GmbH.

  18. Area-level poverty and preterm birth risk: A population-based multilevel analysis

    PubMed Central

    DeFranco, Emily A; Lian, Min; Muglia, Louis A; Schootman, Mario

    2008-01-01

    Background Preterm birth is a complex disease with etiologic influences from a variety of social, environmental, hormonal, genetic, and other factors. The purpose of this study was to utilize a large population-based birth registry to estimate the independent effect of county-level poverty on preterm birth risk. To accomplish this, we used a multilevel logistic regression approach to account for multiple co-existent individual-level variables and county-level poverty rate. Methods Population-based study utilizing Missouri's birth certificate database (1989–1997). We conducted a multilevel logistic regression analysis to estimate the effect of county-level poverty on PTB risk. Of 634,994 births nested within 115 counties in Missouri, two levels were considered. Individual-level variables included demographics factors, prenatal care, health-related behavioral risk factors, and medical risk factors. The area-level variable included the percentage of the population within each county living below the poverty line (US census data, 1990). Counties were divided into quartiles of poverty; the first quartile (lowest rate of poverty) was the reference group. Results PTB < 35 weeks occurred in 24,490 pregnancies (3.9%). The rate of PTB < 35 weeks was 2.8% in counties within the lowest quartile of poverty and increased through the 4th quartile (4.9%), p < 0.0001. High county-level poverty was significantly associated with PTB risk. PTB risk (< 35 weeks) was increased for women who resided in counties within the highest quartile of poverty, adjusted odds ratio (adjOR) 1.18 (95% CI 1.03, 1.35), with a similar effect at earlier gestational ages (< 32 weeks), adjOR 1.27 (95% CI 1.06, 1.52). Conclusion Women residing in socioeconomically deprived areas are at increased risk of preterm birth, above other underlying risk factors. Although the risk increase is modest, it affects a large number of pregnancies. PMID:18793437

  19. Area-level poverty and preterm birth risk: a population-based multilevel analysis.

    PubMed

    DeFranco, Emily A; Lian, Min; Muglia, Louis A; Schootman, Mario

    2008-09-15

    Preterm birth is a complex disease with etiologic influences from a variety of social, environmental, hormonal, genetic, and other factors. The purpose of this study was to utilize a large population-based birth registry to estimate the independent effect of county-level poverty on preterm birth risk. To accomplish this, we used a multilevel logistic regression approach to account for multiple co-existent individual-level variables and county-level poverty rate. Population-based study utilizing Missouri's birth certificate database (1989-1997). We conducted a multilevel logistic regression analysis to estimate the effect of county-level poverty on PTB risk. Of 634,994 births nested within 115 counties in Missouri, two levels were considered. Individual-level variables included demographics factors, prenatal care, health-related behavioral risk factors, and medical risk factors. The area-level variable included the percentage of the population within each county living below the poverty line (US census data, 1990). Counties were divided into quartiles of poverty; the first quartile (lowest rate of poverty) was the reference group. PTB < 35 weeks occurred in 24,490 pregnancies (3.9%). The rate of PTB < 35 weeks was 2.8% in counties within the lowest quartile of poverty and increased through the 4th quartile (4.9%), p < 0.0001. High county-level poverty was significantly associated with PTB risk. PTB risk (< 35 weeks) was increased for women who resided in counties within the highest quartile of poverty, adjusted odds ratio (adj OR) 1.18 (95% CI 1.03, 1.35), with a similar effect at earlier gestational ages (< 32 weeks), adj OR 1.27 (95% CI 1.06, 1.52). Women residing in socioeconomically deprived areas are at increased risk of preterm birth, above other underlying risk factors. Although the risk increase is modest, it affects a large number of pregnancies.

  20. Peripheral arterial stiffness is associated with higher baseline plasma uric acid: A prospective cohort study.

    PubMed

    Ding, Xiaohan; Ye, Ping; Wang, Xiaona; Cao, Ruihua; Yang, Xu; Xiao, Wenkai; Zhang, Yun; Bai, Yongyi; Wu, Hongmei

    2017-03-01

    This prospective cohort study aimed at identifying association between uric acid (UA) and peripheral arterial stiffness. A prospective cohort longitudinal study was performed according to an average of 4.8 years' follow-up. The demographic data, anthropometric parameters, peripheral arterial stiffness (carotid-radial pulse-wave velocity, cr-PWV) and biomarker variables including UA were examined at both baseline and follow-up. Pearson's correlations were used to identify the associations between UA and peripheral arterial stiffness. Further logistic regressions were employed to determine the associations between UA and arterial stiffness. At the end of follow-up, 1447 subjects were included in the analyses. At baseline, cr-PWV ( r  = 0.200, p  < 0.001) was closely associated with UA. Furthermore, the follow-up cr-PWV ( r  = 0.145, p  < 0.001) was also strongly correlated to baseline UA in Pearson's correlation analysis. Multiple regressions also indicated the association between follow-up cr-PWV ( β  = 0.493, p  = 0.013) and baseline UA level. Logistic regressions revealed that higher baseline UA level was an independent predictor of arterial stiffness severity assessed by cr-PWV at follow-up cross-section. Peripheral arterial stiffness is closely associated with higher baseline UA level. Furthermore, a higher baseline UA level is an independent risk factor and predictor for peripheral arterial stiffness.

  1. Mixed conditional logistic regression for habitat selection studies.

    PubMed

    Duchesne, Thierry; Fortin, Daniel; Courbin, Nicolas

    2010-05-01

    1. Resource selection functions (RSFs) are becoming a dominant tool in habitat selection studies. RSF coefficients can be estimated with unconditional (standard) and conditional logistic regressions. While the advantage of mixed-effects models is recognized for standard logistic regression, mixed conditional logistic regression remains largely overlooked in ecological studies. 2. We demonstrate the significance of mixed conditional logistic regression for habitat selection studies. First, we use spatially explicit models to illustrate how mixed-effects RSFs can be useful in the presence of inter-individual heterogeneity in selection and when the assumption of independence from irrelevant alternatives (IIA) is violated. The IIA hypothesis states that the strength of preference for habitat type A over habitat type B does not depend on the other habitat types also available. Secondly, we demonstrate the significance of mixed-effects models to evaluate habitat selection of free-ranging bison Bison bison. 3. When movement rules were homogeneous among individuals and the IIA assumption was respected, fixed-effects RSFs adequately described habitat selection by simulated animals. In situations violating the inter-individual homogeneity and IIA assumptions, however, RSFs were best estimated with mixed-effects regressions, and fixed-effects models could even provide faulty conclusions. 4. Mixed-effects models indicate that bison did not select farmlands, but exhibited strong inter-individual variations in their response to farmlands. Less than half of the bison preferred farmlands over forests. Conversely, the fixed-effect model simply suggested an overall selection for farmlands. 5. Conditional logistic regression is recognized as a powerful approach to evaluate habitat selection when resource availability changes. This regression is increasingly used in ecological studies, but almost exclusively in the context of fixed-effects models. Fitness maximization can imply differences in trade-offs among individuals, which can yield inter-individual differences in selection and lead to departure from IIA. These situations are best modelled with mixed-effects models. Mixed-effects conditional logistic regression should become a valuable tool for ecological research.

  2. Modeling individual tree survial

    Treesearch

    Quang V. Cao

    2016-01-01

    Information provided by growth and yield models is the basis for forest managers to make decisions on how to manage their forests. Among different types of growth models, whole-stand models offer predictions at stand level, whereas individual-tree models give detailed information at tree level. The well-known logistic regression is commonly used to predict tree...

  3. Is It Considered Violence? The Acceptability of Physical Punishment of Children in Europe

    ERIC Educational Resources Information Center

    Gracia, Enrique; Herrero, Juan

    2008-01-01

    This study analyzes correlates of the acceptability of physical punishment of children in Europe. The design was a three-level ordinal logistic regression of 10,812 people nested within 208 localities (cities), nested within 14 countries of the European Union. Results showed that higher levels of acceptability were reported by men, the older, the…

  4. Modeling ozone bioindicator injury with microscale and landscape-scale explanatory variables: A logistic regression approach

    Treesearch

    John W. Coulston

    2011-01-01

    Tropospheric ozone occurs at phytotoxic levels in the United States (Lefohn and Pinkerton 1988). Several plant species, including commercially important timber species, are sensitive to elevated ozone levels. Exposure to elevated ozone can cause growth reduction and foliar injury and make trees more susceptible to secondary stressors such as insects and pathogens (...

  5. Advanced colorectal neoplasia risk stratification by penalized logistic regression.

    PubMed

    Lin, Yunzhi; Yu, Menggang; Wang, Sijian; Chappell, Richard; Imperiale, Thomas F

    2016-08-01

    Colorectal cancer is the second leading cause of death from cancer in the United States. To facilitate the efficiency of colorectal cancer screening, there is a need to stratify risk for colorectal cancer among the 90% of US residents who are considered "average risk." In this article, we investigate such risk stratification rules for advanced colorectal neoplasia (colorectal cancer and advanced, precancerous polyps). We use a recently completed large cohort study of subjects who underwent a first screening colonoscopy. Logistic regression models have been used in the literature to estimate the risk of advanced colorectal neoplasia based on quantifiable risk factors. However, logistic regression may be prone to overfitting and instability in variable selection. Since most of the risk factors in our study have several categories, it was tempting to collapse these categories into fewer risk groups. We propose a penalized logistic regression method that automatically and simultaneously selects variables, groups categories, and estimates their coefficients by penalizing the [Formula: see text]-norm of both the coefficients and their differences. Hence, it encourages sparsity in the categories, i.e. grouping of the categories, and sparsity in the variables, i.e. variable selection. We apply the penalized logistic regression method to our data. The important variables are selected, with close categories simultaneously grouped, by penalized regression models with and without the interactions terms. The models are validated with 10-fold cross-validation. The receiver operating characteristic curves of the penalized regression models dominate the receiver operating characteristic curve of naive logistic regressions, indicating a superior discriminative performance. © The Author(s) 2013.

  6. Habitat features and predictive habitat modeling for the Colorado chipmunk in southern New Mexico

    USGS Publications Warehouse

    Rivieccio, M.; Thompson, B.C.; Gould, W.R.; Boykin, K.G.

    2003-01-01

    Two subspecies of Colorado chipmunk (state threatened and federal species of concern) occur in southern New Mexico: Tamias quadrivittatus australis in the Organ Mountains and T. q. oscuraensis in the Oscura Mountains. We developed a GIS model of potentially suitable habitat based on vegetation and elevation features, evaluated site classifications of the GIS model, and determined vegetation and terrain features associated with chipmunk occurrence. We compared GIS model classifications with actual vegetation and elevation features measured at 37 sites. At 60 sites we measured 18 habitat variables regarding slope, aspect, tree species, shrub species, and ground cover. We used logistic regression to analyze habitat variables associated with chipmunk presence/absence. All (100%) 37 sample sites (28 predicted suitable, 9 predicted unsuitable) were classified correctly by the GIS model regarding elevation and vegetation. For 28 sites predicted suitable by the GIS model, 18 sites (64%) appeared visually suitable based on habitat variables selected from logistic regression analyses, of which 10 sites (36%) were specifically predicted as suitable habitat via logistic regression. We detected chipmunks at 70% of sites deemed suitable via the logistic regression models. Shrub cover, tree density, plant proximity, presence of logs, and presence of rock outcrop were retained in the logistic model for the Oscura Mountains; litter, shrub cover, and grass cover were retained in the logistic model for the Organ Mountains. Evaluation of predictive models illustrates the need for multi-stage analyses to best judge performance. Microhabitat analyses indicate prospective needs for different management strategies between the subspecies. Sensitivities of each population of the Colorado chipmunk to natural and prescribed fire suggest that partial burnings of areas inhabited by Colorado chipmunks in southern New Mexico may be beneficial. These partial burnings may later help avoid a fire that could substantially reduce habitat of chipmunks over a mountain range.

  7. [Use of data display screens and ocular hypertension in local public sector workers].

    PubMed

    Abellán Torró, Rosana; Merelles Tormo, Antoni

    2014-01-01

    The main objective of this study is to examine the association between work with data display screens (DDS) and ocular hypertension (OHT). A cross-sectional study among local public sector workers (Diputación Provincial de Valencia). Data from 620 people were collected over 25 months, from periodic medical examinations performed at an occupational health unit. Intraocular pressure (IOP) was obtained with a portable puff tonometer validated for screening, establishing the cut-off point for OHT at 22 mmHg. Both biological characteristics and other work-related variables were taken into account as covariates. Descriptive statistics of the data were obtained, together with nonparametric tests with a level of significance of 95% and logistic regression with p 〈0.1 as the level of significance of the likelihood test. The average age of the study population is 52.8 years. The prevalence of OHT was 3.5% (5.1% among men and 1.2% among women; p=0.012). No significant associations were found between hours of DDS-related work and OHT were found (p=0.395). Logistic regression corroborated the association between gender and OHT, with women less affected (OR = 0.234; 95%CI: 0.068 - 0.799; p=0.020). In our study, no associations were found between time of exposure to data display screens and ocular hypertension. Logistic regression points to a certain association between ocular hypertension and gender, with men being more predisposed. Copyright belongs to the Societat Catalana de Salut Laboral.

  8. Using Logistic Regression To Predict the Probability of Debris Flows Occurring in Areas Recently Burned By Wildland Fires

    USGS Publications Warehouse

    Rupert, Michael G.; Cannon, Susan H.; Gartner, Joseph E.

    2003-01-01

    Logistic regression was used to predict the probability of debris flows occurring in areas recently burned by wildland fires. Multiple logistic regression is conceptually similar to multiple linear regression because statistical relations between one dependent variable and several independent variables are evaluated. In logistic regression, however, the dependent variable is transformed to a binary variable (debris flow did or did not occur), and the actual probability of the debris flow occurring is statistically modeled. Data from 399 basins located within 15 wildland fires that burned during 2000-2002 in Colorado, Idaho, Montana, and New Mexico were evaluated. More than 35 independent variables describing the burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated. The models were developed as follows: (1) Basins that did and did not produce debris flows were delineated from National Elevation Data using a Geographic Information System (GIS). (2) Data describing the burn severity, geology, land surface gradient, rainfall, and soil properties were determined for each basin. These data were then downloaded to a statistics software package for analysis using logistic regression. (3) Relations between the occurrence/non-occurrence of debris flows and burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated and several preliminary multivariate logistic regression models were constructed. All possible combinations of independent variables were evaluated to determine which combination produced the most effective model. The multivariate model that best predicted the occurrence of debris flows was selected. (4) The multivariate logistic regression model was entered into a GIS, and a map showing the probability of debris flows was constructed. The most effective model incorporates the percentage of each basin with slope greater than 30 percent, percentage of land burned at medium and high burn severity in each basin, particle size sorting, average storm intensity (millimeters per hour), soil organic matter content, soil permeability, and soil drainage. The results of this study demonstrate that logistic regression is a valuable tool for predicting the probability of debris flows occurring in recently-burned landscapes.

  9. Prediction of unwanted pregnancies using logistic regression, probit regression and discriminant analysis

    PubMed Central

    Ebrahimzadeh, Farzad; Hajizadeh, Ebrahim; Vahabi, Nasim; Almasian, Mohammad; Bakhteyar, Katayoon

    2015-01-01

    Background: Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. Methods: In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were selected by the stratified and cluster sampling; relevant variables were measured and for prediction of unwanted pregnancy, logistic regression, discriminant analysis, and probit regression models and SPSS software version 21 were used. To compare these models, indicators such as sensitivity, specificity, the area under the ROC curve, and the percentage of correct predictions were used. Results: The prevalence of unwanted pregnancies was 25.3%. The logistic and probit regression models indicated that parity and pregnancy spacing, contraceptive methods, household income and number of living male children were related to unwanted pregnancy. The performance of the models based on the area under the ROC curve was 0.735, 0.733, and 0.680 for logistic regression, probit regression, and linear discriminant analysis, respectively. Conclusion: Given the relatively high prevalence of unwanted pregnancies in Khorramabad, it seems necessary to revise family planning programs. Despite the similar accuracy of the models, if the researcher is interested in the interpretability of the results, the use of the logistic regression model is recommended. PMID:26793655

  10. Prediction of unwanted pregnancies using logistic regression, probit regression and discriminant analysis.

    PubMed

    Ebrahimzadeh, Farzad; Hajizadeh, Ebrahim; Vahabi, Nasim; Almasian, Mohammad; Bakhteyar, Katayoon

    2015-01-01

    Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were selected by the stratified and cluster sampling; relevant variables were measured and for prediction of unwanted pregnancy, logistic regression, discriminant analysis, and probit regression models and SPSS software version 21 were used. To compare these models, indicators such as sensitivity, specificity, the area under the ROC curve, and the percentage of correct predictions were used. The prevalence of unwanted pregnancies was 25.3%. The logistic and probit regression models indicated that parity and pregnancy spacing, contraceptive methods, household income and number of living male children were related to unwanted pregnancy. The performance of the models based on the area under the ROC curve was 0.735, 0.733, and 0.680 for logistic regression, probit regression, and linear discriminant analysis, respectively. Given the relatively high prevalence of unwanted pregnancies in Khorramabad, it seems necessary to revise family planning programs. Despite the similar accuracy of the models, if the researcher is interested in the interpretability of the results, the use of the logistic regression model is recommended.

  11. Mental health, employment and gender. Cross-sectional evidence in a sample of refugees from Bosnia-Herzegovina living in two Swedish regions.

    PubMed

    Blight, Karin Johansson; Ekblad, Solvig; Persson, Jan-Olov; Ekberg, Jan

    2006-04-01

    Large regional differences regarding access to employment have been observed amongst persons from Bosnia-Herzegovina coming to Sweden in 1993-1994. This has led to questions about the role of mental health. To explore this further, postal survey questionnaires were distributed to a community sample (N = 650) that was stratified and, within strata, randomly selected from a sampling frame of persons coming to Sweden from Bosnia-Herzegovina in 1993-1994. Four hundred and thirteen persons returned the questionnaire providing a response rate of 63.5%. The aim was to increase knowledge about the relationship between mental health and employment in the chosen population. The main mental health outcome measure was the Göteborg Quality of Life instrument from which 360 respondents were grouped according to low or high symptom levels. Data were cross tabulated (chi2-tested) against background variables such as age, gender and occupational status, and then tested using binary logistic regression. Binary logistic regression revealed unemployed men but not women, and women who had been working for longer periods during 1993-1999, to be associated with high levels of symptoms of poor mental health. Women living in the urban region were also overrepresented in the high symptom group. These findings indicate that, job occupancy is important to the health of men in the study. However, for the women, further understanding is needed, as job occupancy at some level as well as living in the urban region appear to be associated with poor mental health.

  12. Estimating the exceedance probability of rain rate by logistic regression

    NASA Technical Reports Server (NTRS)

    Chiu, Long S.; Kedem, Benjamin

    1990-01-01

    Recent studies have shown that the fraction of an area with rain intensity above a fixed threshold is highly correlated with the area-averaged rain rate. To estimate the fractional rainy area, a logistic regression model, which estimates the conditional probability that rain rate over an area exceeds a fixed threshold given the values of related covariates, is developed. The problem of dependency in the data in the estimation procedure is bypassed by the method of partial likelihood. Analyses of simulated scanning multichannel microwave radiometer and observed electrically scanning microwave radiometer data during the Global Atlantic Tropical Experiment period show that the use of logistic regression in pixel classification is superior to multiple regression in predicting whether rain rate at each pixel exceeds a given threshold, even in the presence of noisy data. The potential of the logistic regression technique in satellite rain rate estimation is discussed.

  13. The Efficacy of Two Improvement-over-Chance Effect Sizes for Two-Group Univariate Comparisons under Variance Heterogeneity and Nonnormality.

    ERIC Educational Resources Information Center

    Hess, Brian; Olejnik, Stephen; Huberty, Carl J.

    2001-01-01

    Studied the efficacy of two improvement-over-chance or "I" effect sizes derived from predictive discriminant analysis and logistic regression analysis for two-group univariate mean comparisons through simulation. Discusses the ways in which the usefulness of each of the indices depends on the population characteristics. (SLD)

  14. [Logistic regression model of noninvasive prediction for portal hypertensive gastropathy in patients with hepatitis B associated cirrhosis].

    PubMed

    Wang, Qingliang; Li, Xiaojie; Hu, Kunpeng; Zhao, Kun; Yang, Peisheng; Liu, Bo

    2015-05-12

    To explore the risk factors of portal hypertensive gastropathy (PHG) in patients with hepatitis B associated cirrhosis and establish a Logistic regression model of noninvasive prediction. The clinical data of 234 hospitalized patients with hepatitis B associated cirrhosis from March 2012 to March 2014 were analyzed retrospectively. The dependent variable was the occurrence of PHG while the independent variables were screened by binary Logistic analysis. Multivariate Logistic regression was used for further analysis of significant noninvasive independent variables. Logistic regression model was established and odds ratio was calculated for each factor. The accuracy, sensitivity and specificity of model were evaluated by the curve of receiver operating characteristic (ROC). According to univariate Logistic regression, the risk factors included hepatic dysfunction, albumin (ALB), bilirubin (TB), prothrombin time (PT), platelet (PLT), white blood cell (WBC), portal vein diameter, spleen index, splenic vein diameter, diameter ratio, PLT to spleen volume ratio, esophageal varices (EV) and gastric varices (GV). Multivariate analysis showed that hepatic dysfunction (X1), TB (X2), PLT (X3) and splenic vein diameter (X4) were the major occurring factors for PHG. The established regression model was Logit P=-2.667+2.186X1-2.167X2+0.725X3+0.976X4. The accuracy of model for PHG was 79.1% with a sensitivity of 77.2% and a specificity of 80.8%. Hepatic dysfunction, TB, PLT and splenic vein diameter are risk factors for PHG and the noninvasive predicted Logistic regression model was Logit P=-2.667+2.186X1-2.167X2+0.725X3+0.976X4.

  15. Incidence and risk factors of workplace violence against nurses in a Chinese top-level teaching hospital: A cross-sectional study.

    PubMed

    Chen, Xiaoming; Lv, Ming; Wang, Min; Wang, Xiufeng; Liu, Junyan; Zheng, Nan; Liu, Chunlan

    2018-04-01

    To investigate the incidence of workplace violence involving nurses and to identify related risk factors in a high-quality Chinese teaching hospital. A cross-sectional study design was used. The final sample comprised responses from 1831 registered nurses collected with a whole-hospital survey from June 1 to June 15, 2016. The demographic characteristics of the nurses who had experienced any form of violence were collected, and logistic regression analysis was applied to evaluate the risk factors for nurses related to workplace violence. Out of the total number of nurses surveyed, 904 (49.4%) nurses reported having experienced any type of violence in the past year. The frequencies of exposure to physical and non-physical violence were 6.3% (116) and 49.0% (897), respectively. All the incidence rates of violence were lower than those of other studies based on regional hospitals in China and were at the same level found in developed countries and districts. Binary logistic regression analysis revealed that nurses at levels 2 to 4 and female nurses in clinical departments were the most vulnerable to non-physical violence. For physical violence, the two independent risk factors were working in an emergency department and having 6-10 years of work experience. Workplace violence directly threatens nurses from high-quality Chinese teaching hospitals. However, the incidence of WPV against nurses in this teaching hospital was better than that in regional hospitals. This study also provides reference material to identify areas where nurses encounter relatively high levels of workplace violence in high-quality Chinese teaching hospitals. Copyright © 2018 Elsevier Inc. All rights reserved.

  16. Educational attainment among adult survivors of childhood cancer in Great Britain: a population-based cohort study.

    PubMed

    Lancashire, E R; Frobisher, C; Reulen, R C; Winter, D L; Glaser, A; Hawkins, M M

    2010-02-24

    Previous studies of educational attainment among childhood cancer survivors were small, had contradictory findings, and were not population based. This study investigated educational attainment in a large population-based cohort of survivors of all types of childhood cancer in Great Britain. Four levels of educational attainment among 10,183 cancer survivors--degree, teaching qualification, advanced (A') levels, and ordinary (O') levels--were compared with expected levels in the general population. A questionnaire was used to obtain educational attainment data for survivors, and comparable information for the general population was available from the General Household Survey. Factors associated with level of educational attainment achieved by cancer survivors were identified using multivariable logistic regression together with likelihood ratio tests. Logistic regression adjusting for age and sex was used for comparisons with the general population. All statistical tests were two-sided. Childhood cancer survivors had lower educational attainment than the general population (degree: odds ratio [OR] = 0.77, 99% confidence interval [CI] = 0.68 to 0.87; teaching qualification: OR = 0.85, 99% CI = 0.77 to 0.94; A'level: OR = 0.85, 99% CI = 0.78 to 0.93; O'level: OR = 0.81, 99% CI = 0.74 to 0.90; P < .001, all levels). Statistically significant deficits were restricted to central nervous system (CNS) neoplasm and leukemia survivors. For leukemia, only those treated with radiotherapy were considered. Odds ratios for achievement by irradiated CNS tumor survivors were 50%-74% of those for cranially irradiated leukemia or nonirradiated CNS tumor survivors. Survivors at greater risk of poorer educational outcomes included those treated with cranial irradiation, diagnosed with a CNS tumor, older at questionnaire completion, younger at diagnosis, diagnosed with epilepsy, and who were female. Specific groups of childhood cancer survivors achieve lower-than-expected educational attainment. Detailed educational support and implementation of regular cognitive assessment may be indicated for some groups to maximize long-term function.

  17. Predicting Salmonella populations from biological, chemical, and physical indicators in Florida surface waters.

    PubMed

    McEgan, Rachel; Mootian, Gabriel; Goodridge, Lawrence D; Schaffner, Donald W; Danyluk, Michelle D

    2013-07-01

    Coliforms, Escherichia coli, and various physicochemical water characteristics have been suggested as indicators of microbial water quality or index organisms for pathogen populations. The relationship between the presence and/or concentration of Salmonella and biological, physical, or chemical indicators in Central Florida surface water samples over 12 consecutive months was explored. Samples were taken monthly for 12 months from 18 locations throughout Central Florida (n = 202). Air and water temperature, pH, oxidation-reduction potential (ORP), turbidity, and conductivity were measured. Weather data were obtained from nearby weather stations. Aerobic plate counts and most probable numbers (MPN) for Salmonella, E. coli, and coliforms were performed. Weak linear relationships existed between biological indicators (E. coli/coliforms) and Salmonella levels (R(2) < 0.1) and between physicochemical indicators and Salmonella levels (R(2) < 0.1). The average rainfall (previous day, week, and month) before sampling did not correlate well with bacterial levels. Logistic regression analysis showed that E. coli concentration can predict the probability of enumerating selected Salmonella levels. The lack of good correlations between biological indicators and Salmonella levels and between physicochemical indicators and Salmonella levels shows that the relationship between pathogens and indicators is complex. However, Escherichia coli provides a reasonable way to predict Salmonella levels in Central Florida surface water through logistic regression.

  18. Predicting Salmonella Populations from Biological, Chemical, and Physical Indicators in Florida Surface Waters

    PubMed Central

    McEgan, Rachel; Mootian, Gabriel; Goodridge, Lawrence D.; Schaffner, Donald W.

    2013-01-01

    Coliforms, Escherichia coli, and various physicochemical water characteristics have been suggested as indicators of microbial water quality or index organisms for pathogen populations. The relationship between the presence and/or concentration of Salmonella and biological, physical, or chemical indicators in Central Florida surface water samples over 12 consecutive months was explored. Samples were taken monthly for 12 months from 18 locations throughout Central Florida (n = 202). Air and water temperature, pH, oxidation-reduction potential (ORP), turbidity, and conductivity were measured. Weather data were obtained from nearby weather stations. Aerobic plate counts and most probable numbers (MPN) for Salmonella, E. coli, and coliforms were performed. Weak linear relationships existed between biological indicators (E. coli/coliforms) and Salmonella levels (R2 < 0.1) and between physicochemical indicators and Salmonella levels (R2 < 0.1). The average rainfall (previous day, week, and month) before sampling did not correlate well with bacterial levels. Logistic regression analysis showed that E. coli concentration can predict the probability of enumerating selected Salmonella levels. The lack of good correlations between biological indicators and Salmonella levels and between physicochemical indicators and Salmonella levels shows that the relationship between pathogens and indicators is complex. However, Escherichia coli provides a reasonable way to predict Salmonella levels in Central Florida surface water through logistic regression. PMID:23624476

  19. Variable Selection in Logistic Regression.

    DTIC Science & Technology

    1987-06-01

    23 %. AUTIOR(.) S. CONTRACT OR GRANT NUMBE Rf.i %Z. D. Bai, P. R. Krishnaiah and . C. Zhao F49620-85- C-0008 " PERFORMING ORGANIZATION NAME AND AOORESS...d I7 IOK-TK- d 7 -I0 7’ VARIABLE SELECTION IN LOGISTIC REGRESSION Z. D. Bai, P. R. Krishnaiah and L. C. Zhao Center for Multivariate Analysis...University of Pittsburgh Center for Multivariate Analysis University of Pittsburgh Y !I VARIABLE SELECTION IN LOGISTIC REGRESSION Z- 0. Bai, P. R. Krishnaiah

  20. Multinomial Logistic Regression Predicted Probability Map To Visualize The Influence Of Socio-Economic Factors On Breast Cancer Occurrence in Southern Karnataka

    NASA Astrophysics Data System (ADS)

    Madhu, B.; Ashok, N. C.; Balasubramanian, S.

    2014-11-01

    Multinomial logistic regression analysis was used to develop statistical model that can predict the probability of breast cancer in Southern Karnataka using the breast cancer occurrence data during 2007-2011. Independent socio-economic variables describing the breast cancer occurrence like age, education, occupation, parity, type of family, health insurance coverage, residential locality and socioeconomic status of each case was obtained. The models were developed as follows: i) Spatial visualization of the Urban- rural distribution of breast cancer cases that were obtained from the Bharat Hospital and Institute of Oncology. ii) Socio-economic risk factors describing the breast cancer occurrences were complied for each case. These data were then analysed using multinomial logistic regression analysis in a SPSS statistical software and relations between the occurrence of breast cancer across the socio-economic status and the influence of other socio-economic variables were evaluated and multinomial logistic regression models were constructed. iii) the model that best predicted the occurrence of breast cancer were identified. This multivariate logistic regression model has been entered into a geographic information system and maps showing the predicted probability of breast cancer occurrence in Southern Karnataka was created. This study demonstrates that Multinomial logistic regression is a valuable tool for developing models that predict the probability of breast cancer Occurrence in Southern Karnataka.

  1. Modelling of binary logistic regression for obesity among secondary students in a rural area of Kedah

    NASA Astrophysics Data System (ADS)

    Kamaruddin, Ainur Amira; Ali, Zalila; Noor, Norlida Mohd.; Baharum, Adam; Ahmad, Wan Muhamad Amir W.

    2014-07-01

    Logistic regression analysis examines the influence of various factors on a dichotomous outcome by estimating the probability of the event's occurrence. Logistic regression, also called a logit model, is a statistical procedure used to model dichotomous outcomes. In the logit model the log odds of the dichotomous outcome is modeled as a linear combination of the predictor variables. The log odds ratio in logistic regression provides a description of the probabilistic relationship of the variables and the outcome. In conducting logistic regression, selection procedures are used in selecting important predictor variables, diagnostics are used to check that assumptions are valid which include independence of errors, linearity in the logit for continuous variables, absence of multicollinearity, and lack of strongly influential outliers and a test statistic is calculated to determine the aptness of the model. This study used the binary logistic regression model to investigate overweight and obesity among rural secondary school students on the basis of their demographics profile, medical history, diet and lifestyle. The results indicate that overweight and obesity of students are influenced by obesity in family and the interaction between a student's ethnicity and routine meals intake. The odds of a student being overweight and obese are higher for a student having a family history of obesity and for a non-Malay student who frequently takes routine meals as compared to a Malay student.

  2. Parenting styles, parenting practices, and physical activity in 10- to 11-year olds.

    PubMed

    Jago, Russell; Davison, Kirsten K; Brockman, Rowan; Page, Angie S; Thompson, Janice L; Fox, Kenneth R

    2011-01-01

    The objective of this study was to determine whether parenting styles and practices are associated with children's physical activity. Cross-sectional survey of seven hundred ninety-two 10- to 11-year-old UK children in Bristol (UK) in 2008-2009 was conducted. Accelerometer-assessed physical activity and mean minutes of moderate-to-vigorous physical activity (mean MVPA) and mean counts per minute (mean CPM) were obtained. Maternal parenting style and physical activity parenting practices were self-reported. In regression analyses, permissive parenting was associated with higher mean MVPA among girls (+6.0 min/day, p<0.001) and greater mean CPM (+98.9 accelerometer counts/min, p=0.014) among boys when compared to children with authoritative parents. Maternal logistic support was associated with mean CPM for girls (+36.2 counts/min, p=0.001), while paternal logistic support was associated with boys' mean MVPA (+4.0 min/day, p=0.049) and mean CPM (+55.7 counts/min, p=0.014). Maternal permissive parenting was associated with higher levels of physical activity than authoritative parenting, but associations differed by child gender and type of physical activity. Maternal logistic support was associated with girls' physical activity, while paternal logistic support was associated with boys' physical activity. Health professionals could encourage parents to increase logistic support for their children's physical activity. Copyright © 2010 Elsevier Inc. All rights reserved.

  3. Understanding logistic regression analysis.

    PubMed

    Sperandei, Sandro

    2014-01-01

    Logistic regression is used to obtain odds ratio in the presence of more than one explanatory variable. The procedure is quite similar to multiple linear regression, with the exception that the response variable is binomial. The result is the impact of each variable on the odds ratio of the observed event of interest. The main advantage is to avoid confounding effects by analyzing the association of all variables together. In this article, we explain the logistic regression procedure using examples to make it as simple as possible. After definition of the technique, the basic interpretation of the results is highlighted and then some special issues are discussed.

  4. Contributions of sociodemographic factors to criminal behavior

    PubMed Central

    Mundia, Lawrence; Matzin, Rohani; Mahalle, Salwa; Hamid, Malai Hayati; Osman, Ratna Suriani

    2016-01-01

    We explored the extent to which prisoner sociodemographic variables (age, education, marital status, employment, and whether their parents were married or not) influenced offending in 64 randomly selected Brunei inmates, comprising both sexes. A quantitative field survey design ideal for the type of participants used in a prison context was employed to investigate the problem. Hierarchical multiple regression analysis with backward elimination identified prisoner marital status and age groups as significantly related to offending. Furthermore, hierarchical multinomial logistic regression analysis with backward elimination indicated that prisoners’ age, primary level education, marital status, employment status, and parental marital status as significantly related to stealing offenses with high odds ratios. All 29 nonrecidivists were false negatives and predicted to reoffend upon release. Similarly, all 33 recidivists were projected to reoffend after release. Hierarchical binary logistic regression analysis revealed age groups (24–29 years and 30–35 years), employed prisoner, and primary level education as variables with high likelihood trends for reoffending. The results suggested that prisoner interventions (educational, counseling, and psychotherapy) in Brunei should treat not only antisocial personality, psychopathy, and mental health problems but also sociodemographic factors. The study generated offending patterns, trends, and norms that may inform subsequent investigations on Brunei prisoners. PMID:27382342

  5. Serum Irisin Predicts Mortality Risk in Acute Heart Failure Patients.

    PubMed

    Shen, Shutong; Gao, Rongrong; Bei, Yihua; Li, Jin; Zhang, Haifeng; Zhou, Yanli; Yao, Wenming; Xu, Dongjie; Zhou, Fang; Jin, Mengchao; Wei, Siqi; Wang, Kai; Xu, Xuejuan; Li, Yongqin; Xiao, Junjie; Li, Xinli

    2017-01-01

    Irisin is a peptide hormone cleaved from a plasma membrane protein fibronectin type III domain containing protein 5 (FNDC5). Emerging studies have indicated association between serum irisin and many major chronic diseases including cardiovascular diseases. However, the role of serum irisin as a predictor for mortality risk in acute heart failure (AHF) patients is not clear. AHF patients were enrolled and serum was collected at the admission and all patients were followed up for 1 year. Enzyme-linked immunosorbent assay was used to measure serum irisin levels. To explore predictors for AHF mortality, the univariate and multivariate logistic regression analysis, and receiver-operator characteristic (ROC) curve analysis were used. To determine the role of serum irisin levels in predicting survival, Kaplan-Meier survival analysis was used. In this study, 161 AHF patients were enrolled and serum irisin level was found to be significantly higher in patients deceased in 1-year follow-up. The univariate logistic regression analysis identified 18 variables associated with all-cause mortality in AHF patients, while the multivariate logistic regression analysis identified 2 variables namely blood urea nitrogen and serum irisin. ROC curve analysis indicated that blood urea nitrogen and the most commonly used biomarker, NT-pro-BNP, displayed poor prognostic value for AHF (AUCs ≤ 0.700) compared to serum irisin (AUC = 0.753). Kaplan-Meier survival analysis demonstrated that AHF patients with higher serum irisin had significantly higher mortality (P<0.001). Collectively, our study identified serum irisin as a predictive biomarker for 1-year all-cause mortality in AHF patients though large multicenter studies are highly needed. © 2017 The Author(s). Published by S. Karger AG, Basel.

  6. Calibrating random forests for probability estimation.

    PubMed

    Dankowski, Theresa; Ziegler, Andreas

    2016-09-30

    Probabilities can be consistently estimated using random forests. It is, however, unclear how random forests should be updated to make predictions for other centers or at different time points. In this work, we present two approaches for updating random forests for probability estimation. The first method has been proposed by Elkan and may be used for updating any machine learning approach yielding consistent probabilities, so-called probability machines. The second approach is a new strategy specifically developed for random forests. Using the terminal nodes, which represent conditional probabilities, the random forest is first translated to logistic regression models. These are, in turn, used for re-calibration. The two updating strategies were compared in a simulation study and are illustrated with data from the German Stroke Study Collaboration. In most simulation scenarios, both methods led to similar improvements. In the simulation scenario in which the stricter assumptions of Elkan's method were not met, the logistic regression-based re-calibration approach for random forests outperformed Elkan's method. It also performed better on the stroke data than Elkan's method. The strength of Elkan's method is its general applicability to any probability machine. However, if the strict assumptions underlying this approach are not met, the logistic regression-based approach is preferable for updating random forests for probability estimation. © 2016 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd. © 2016 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd.

  7. Ensemble learning of inverse probability weights for marginal structural modeling in large observational datasets.

    PubMed

    Gruber, Susan; Logan, Roger W; Jarrín, Inmaculada; Monge, Susana; Hernán, Miguel A

    2015-01-15

    Inverse probability weights used to fit marginal structural models are typically estimated using logistic regression. However, a data-adaptive procedure may be able to better exploit information available in measured covariates. By combining predictions from multiple algorithms, ensemble learning offers an alternative to logistic regression modeling to further reduce bias in estimated marginal structural model parameters. We describe the application of two ensemble learning approaches to estimating stabilized weights: super learning (SL), an ensemble machine learning approach that relies on V-fold cross validation, and an ensemble learner (EL) that creates a single partition of the data into training and validation sets. Longitudinal data from two multicenter cohort studies in Spain (CoRIS and CoRIS-MD) were analyzed to estimate the mortality hazard ratio for initiation versus no initiation of combined antiretroviral therapy among HIV positive subjects. Both ensemble approaches produced hazard ratio estimates further away from the null, and with tighter confidence intervals, than logistic regression modeling. Computation time for EL was less than half that of SL. We conclude that ensemble learning using a library of diverse candidate algorithms offers an alternative to parametric modeling of inverse probability weights when fitting marginal structural models. With large datasets, EL provides a rich search over the solution space in less time than SL with comparable results. Copyright © 2014 John Wiley & Sons, Ltd.

  8. Ensemble learning of inverse probability weights for marginal structural modeling in large observational datasets

    PubMed Central

    Gruber, Susan; Logan, Roger W.; Jarrín, Inmaculada; Monge, Susana; Hernán, Miguel A.

    2014-01-01

    Inverse probability weights used to fit marginal structural models are typically estimated using logistic regression. However a data-adaptive procedure may be able to better exploit information available in measured covariates. By combining predictions from multiple algorithms, ensemble learning offers an alternative to logistic regression modeling to further reduce bias in estimated marginal structural model parameters. We describe the application of two ensemble learning approaches to estimating stabilized weights: super learning (SL), an ensemble machine learning approach that relies on V -fold cross validation, and an ensemble learner (EL) that creates a single partition of the data into training and validation sets. Longitudinal data from two multicenter cohort studies in Spain (CoRIS and CoRIS-MD) were analyzed to estimate the mortality hazard ratio for initiation versus no initiation of combined antiretroviral therapy among HIV positive subjects. Both ensemble approaches produced hazard ratio estimates further away from the null, and with tighter confidence intervals, than logistic regression modeling. Computation time for EL was less than half that of SL. We conclude that ensemble learning using a library of diverse candidate algorithms offers an alternative to parametric modeling of inverse probability weights when fitting marginal structural models. With large datasets, EL provides a rich search over the solution space in less time than SL with comparable results. PMID:25316152

  9. Vitamin D levels and their associations with survival and major disease outcomes in a large cohort of patients with chronic graft-vs-host disease

    PubMed Central

    Katić, Mašenjka; Pirsl, Filip; Steinberg, Seth M.; Dobbin, Marnie; Curtis, Lauren M.; Pulanić, Dražen; Desnica, Lana; Titarenko, Irina; Pavletic, Steven Z.

    2016-01-01

    Aim To identify the factors associated with vitamin D status in patients with chronic graft-vs-host disease (cGVHD) and evaluate the association between serum vitamin D (25(OH)D) levels and cGVHD characteristics and clinical outcomes defined by the National Institutes of Health (NIH) criteria. Methods 310 cGVHD patients enrolled in the NIH cGVHD natural history study (clinicaltrials.gov: NCT00092235) were analyzed. Univariate analysis and multiple logistic regression were used to determine the associations between various parameters and 25(OH)D levels, dichotomized into categorical variables: ≤20 and >20 ng/mL, and as a continuous parameter. Multiple logistic regression was used to develop a predictive model for low vitamin D. Survival analysis and association between cGVHD outcomes and 25(OH)D as a continuous as well as categorical variable: ≤20 and >20 ng/mL; <50 and ≥50 ng/mL, and among three ordered categories: ≤20, 20-50, and ≥50 ng/mL, was performed. PMID:27374829

  10. Is physician adherence to prescription guidelines a general trait of health care practices or dependent on drug type?--a multilevel logistic regression analysis in South Sweden.

    PubMed

    Ohlsson, Henrik; Merlo, Juan

    2009-08-01

    Therapeutic traditions at health care practices (HCPs) influence physicians' adherence to prescription guidelines for specific drugs, however, it is not known if such traditions affect all kinds of prescriptions or only specific types of drug. Our goal was to determine whether adherence to prescription guidelines is a common trait of HCPs or dependent on drug type. We fitted separate multi-level logistic regression models to all patients in the Skåne region who received a prescription for a statin drug (ATC: C10AA, n = 6232), an agent acting on the renin-angiotensin system (ATC: C09, n = 7222) or a proton pump inhibitor (ATC: A02BC, n = 11 563) at 198 HCPs from July 2006 to December 2006. There was a high clustering of adherence to prescription guidelines at HCPs for the different drug types (MOR(agents acting on the renin-angiotensin system) = 4.72 [95% CI: 3.90-5.92], MOR(Statins) = 2.71 [95% CI: 2.23-3.39] and MOR(Proton pump inhibitors) = 2.16 [95% CI: 1.95-2.45]). Compared with HCPs with low adherence to guidelines in two drug types, those HCPs with the highest level of adherence for these two drug types also showed a higher probability of adherence for the third drug type. Physicians' decisions to follow prescription guidelines seem to be influenced by therapeutic traditions at the HCP. Moreover, these therapeutic traditions seem to affect all kinds of prescriptions. This information can be used as basis for interventions to support rational and cost-effective medication use. Copyright 2009 John Wiley & Sons, Ltd.

  11. Comparing Methodologies for Developing an Early Warning System: Classification and Regression Tree Model versus Logistic Regression. REL 2015-077

    ERIC Educational Resources Information Center

    Koon, Sharon; Petscher, Yaacov

    2015-01-01

    The purpose of this report was to explicate the use of logistic regression and classification and regression tree (CART) analysis in the development of early warning systems. It was motivated by state education leaders' interest in maintaining high classification accuracy while simultaneously improving practitioner understanding of the rules by…

  12. Temporal association between the influenza virus and respiratory syncytial virus (RSV): RSV as a predictor of seasonal influenza.

    PubMed

    Míguez, A; Iftimi, A; Montes, F

    2016-09-01

    Epidemiologists agree that there is a prevailing seasonality in the presentation of epidemic waves of respiratory syncytial virus (RSV) infections and influenza. The aim of this study is to quantify the potential relationship between the activity of RSV, with respect to the influenza virus, in order to use the RSV seasonal curve as a predictor of the evolution of an influenza virus epidemic wave. Two statistical tools, logistic regression and time series, are used for predicting the evolution of influenza. Both logistic models and time series of influenza consider RSV information from previous weeks. Data consist of influenza and confirmed RSV cases reported in Comunitat Valenciana (Spain) during the period from week 40 (2010) to week 8 (2014). Binomial logistic regression models used to predict the two states of influenza wave, basal or peak, result in a rate of correct classification higher than 92% with the validation set. When a finer three-states categorization is established, basal, increasing peak and decreasing peak, the multinomial logistic model performs well in 88% of cases of the validation set. The ARMAX model fits well for influenza waves and shows good performance for short-term forecasts up to 3 weeks. The seasonal evolution of influenza virus can be predicted a minimum of 4 weeks in advance using logistic models based on RSV. It would be necessary to study more inter-pandemic seasons to establish a stronger relationship between the epidemic waves of both viruses.

  13. Effects of road network on diversiform forest cover changes in the highest coverage region in China: An analysis of sampling strategies.

    PubMed

    Hu, Xisheng; Wu, Zhilong; Wu, Chengzhen; Ye, Limin; Lan, Chaofeng; Tang, Kun; Xu, Lu; Qiu, Rongzu

    2016-09-15

    Forest cover changes are of global concern due to their roles in global warming and biodiversity. However, many previous studies have ignored the fact that forest loss and forest gain are different processes that may respond to distinct factors by stressing forest loss more than gain or viewing forest cover change as a whole. It behooves us to carefully examine the patterns and drivers of the change by subdividing it into several categories. Our study includes areas of forest loss (4.8% of the study area), forest gain (1.3% of the study area) and forest loss and gain (2.0% of the study area) from 2000 to 2012 in Fujian Province, China. In the study area, approximately 65% and 90% of these changes occurred within 2000m of the nearest road and under road densities of 0.6km/km(2), respectively. We compared two sampling techniques (systematic sampling and random sampling) and four intensities for each technique to investigate the driving patterns underlying the changes using multinomial logistic regression. The results indicated the lack of pronounced differences in the regressions between the two sampling designs, although the sample size had a great impact on the regression outcome. The application of multi-model inference indicated that the low level road density had a negative significant association with forest loss and forest loss and gain, the expressway density had a positive significant impact on forest loss, and the road network was insignificantly related to forest gain. The model including socioeconomic and biophysical variables illuminated potentially different predictors of the different forest change categories. Moreover, the multiple comparisons tested by Fisher's least significant difference (LSD) were a good compensation for the multinomial logistic model to enrich the interpretation of the regression results. Copyright © 2016 Elsevier B.V. All rights reserved.

  14. Logistic regression of family data from retrospective study designs.

    PubMed

    Whittemore, Alice S; Halpern, Jerry

    2003-11-01

    We wish to study the effects of genetic and environmental factors on disease risk, using data from families ascertained because they contain multiple cases of the disease. To do so, we must account for the way participants were ascertained, and for within-family correlations in both disease occurrences and covariates. We model the joint probability distribution of the covariates of ascertained family members, given family disease occurrence and pedigree structure. We describe two such covariate models: the random effects model and the marginal model. Both models assume a logistic form for the distribution of one person's covariates that involves a vector beta of regression parameters. The components of beta in the two models have different interpretations, and they differ in magnitude when the covariates are correlated within families. We describe ascertainment assumptions needed to estimate consistently the parameters beta(RE) in the random effects model and the parameters beta(M) in the marginal model. Under the ascertainment assumptions for the random effects model, we show that conditional logistic regression (CLR) of matched family data gives a consistent estimate beta(RE) for beta(RE) and a consistent estimate for the covariance matrix of beta(RE). Under the ascertainment assumptions for the marginal model, we show that unconditional logistic regression (ULR) gives a consistent estimate for beta(M), and we give a consistent estimator for its covariance matrix. The random effects/CLR approach is simple to use and to interpret, but it can use data only from families containing both affected and unaffected members. The marginal/ULR approach uses data from all individuals, but its variance estimates require special computations. A C program to compute these variance estimates is available at http://www.stanford.edu/dept/HRP/epidemiology. We illustrate these pros and cons by application to data on the effects of parity on ovarian cancer risk in mother/daughter pairs, and use simulations to study the performance of the estimates. Copyright 2003 Wiley-Liss, Inc.

  15. The status of diabetes control in Kurdistan province, west of Iran.

    PubMed

    Esmailnasab, Nader; Afkhamzadeh, Abdorrahim; Roshani, Daem; Moradi, Ghobad

    2013-09-17

    Based on some estimation more than two million peoples in Iran are affected by Type 2 diabetes. The present study was designed to evaluate the status of diabetes control among Type 2 diabetes patients in Kurdistan, west of Iran and its associated factors. In our cross sectional study conducted in 2010, 411 Type 2 diabetes patients were randomly recruited from Sanandaj, Capital of Kurdistan. Chi square test was used in univariate analysis to address the association between HgAlc and FBS status and other variables. The significant results from Univariate analysis were entered in multivariate analysis and multinomial logistic regression model. In 38% of patients, FBS was in normal range (70-130) and in 47% HgA1c was <7% which is normal range for HgA1c. In univariate analysis, FBS level was associated with educational levels (P=0.001), referral style (P=0.001), referral time (P=0.009), and insulin injection (P=0.016). In addition, HgA1c had a relationship with sex (P=0.023), age (P=0.035), education (P=0.001), referral style (P=0.001), and insulin injection (P=0.008). After using multinomial logistic regression for significant results of univariate analysis, it was found that FBS was significantly associated with referral style. In addition HgA1c was significantly associated with referral style and Insulin injection. Although some of patients were under the coverage of specialized cares, but their diabetes were not properly controlled.

  16. Resilience model for parents of children with cancer in mainland China-An exploratory study.

    PubMed

    Ye, Zeng Jie; Qiu, Hong Zhong; Li, Peng Fei; Liang, Mu Zi; Wang, Shu Ni; Quan, Xiao Ming

    2017-04-01

    Parents have psychosocial functions that are critical for the entire family. Therefore, when their child is diagnosed with cancer, it is important that they exhibit resilience, which is the ability to preserve their emotional and physical well-being in the face of stress. The Resilience Model for Parents of Children with Cancer (RMP-CC) was developed to increase our understanding of how resilience is positively and negatively affected by protective and risk factors, respectively, in Chinese parents with children diagnosed with cancer. To evaluate the RMP-CC, the latent psychosocial variables and demographics of 229 parents were evaluated using exploratory structural equation modeling (SEM) and logistic regression. The majority of goodness-of-fit indices indicate that the SEM of RMP-CC was a good model with a high level of variance in resilience (58%). Logistic regression revealed that two demographics, educational level and clinical classification of cancer, accounted for 12% of this variance. Our results indicate that RMP-CC is an effective structure by which to develop mainland Chinese parent-focused interventions that are grounded in the experiences of the parents as caregivers of children who have been diagnosed with cancer. RMP-CC allows for a better understanding of what these parents experience while their children undergo treatment. Further studies will be needed to confirm the efficiency of the current structure, and would assist in further refinement of its clinical applications. Copyright © 2017 Elsevier Ltd. All rights reserved.

  17. Few items in the thyroid-related quality of life instrument ThyPRO exhibited differential item functioning.

    PubMed

    Watt, Torquil; Groenvold, Mogens; Hegedüs, Laszlo; Bonnema, Steen Joop; Rasmussen, Åse Krogh; Feldt-Rasmussen, Ulla; Bjorner, Jakob Bue

    2014-02-01

    To evaluate the extent of differential item functioning (DIF) within the thyroid-specific quality of life patient-reported outcome measure, ThyPRO, according to sex, age, education and thyroid diagnosis. A total of 838 patients with benign thyroid diseases completed the ThyPRO questionnaire (84 five-point items, 13 scales). Uniform and nonuniform DIF were investigated using ordinal logistic regression, testing for both statistical significance and magnitude (∆R(2) > 0.02). Scale level was estimated by the sum score, after purification. Twenty instances of DIF in 17 of the 84 items were found. Eight according to diagnosis, where the goiter scale was the one most affected, possibly due to differing perceptions in patients with auto-immune thyroid diseases compared to patients with simple goiter. Eight DIFs according to age were found, of which 5 were in positively worded items, which younger patients were more likely to endorse; one according to gender: women were more likely to report crying, and three according to educational level. The vast majority of DIF had only minor influence on the scale scores (0.1-2.3 points on the 0-100 scales), but two DIF corresponded to a difference of 4.6 and 9.8, respectively. Ordinal logistic regression identified DIF in 17 of 84 items. The potential impact of this on the present scales was low, but items displaying DIF could be avoided when developing abbreviated scales, where the potential impact of DIF (due to fewer items) will be larger.

  18. A Multilevel Assessment of Differential Item Functioning.

    ERIC Educational Resources Information Center

    Shen, Linjun

    A multilevel approach was proposed for the assessment of differential item functioning and compared with the traditional logistic regression approach. Data from the Comprehensive Osteopathic Medical Licensing Examination for 2,300 freshman osteopathic medical students were analyzed. The multilevel approach used three-level hierarchical generalized…

  19. The effects of sleep quality, physical activity, and environmental quality on the risk of falls in dementia.

    PubMed

    Eshkoor, Sima Ataollahi; Hamid, Tengku Aizan; Nudin, Siti Sa'adiah Hassan; Mun, Chan Yoke

    2013-06-01

    This study aimed to identify the effects of sleep quality, physical activity, environmental quality, age, ethnicity, sex differences, marital status, and educational level on the risk of falls in the elderly individuals with dementia. Data were derived from a group of 1210 Malaysian elderly individuals who were noninstitutionalized and demented. The multiple logistic regression model was applied to estimate the risk of falls in respondents. Approximately the prevalence of falls was 17% among the individuals. The results of multiple logistic regression analysis revealed that age (odds ratio [OR] = 1.03), ethnicity (OR = 1.76), sleep quality (OR = 1.46), and environmental quality (OR = 0.62) significantly affected the risk of falls in individuals (P < .05). Furthermore, sex differences, marital status, educational level, and physical activity were not significant predictors of falls in samples (P > .05). It was found that age, ethnic non-Malay, and sleep disruption increased the risk of falls in respondents, but high environmental quality reduced the risk of falls.

  20. Using Multiple and Logistic Regression to Estimate the Median WillCost and Probability of Cost and Schedule Overrun for Program Managers

    DTIC Science & Technology

    2017-03-23

    PUBLIC RELEASE; DISTRIBUTION UNLIMITED Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and... Cost and Probability of Cost and Schedule Overrun for Program Managers Ryan C. Trudelle Follow this and additional works at: https://scholar.afit.edu...afit.edu. Recommended Citation Trudelle, Ryan C., "Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and

  1. Expression of Proteins Involved in Epithelial-Mesenchymal Transition as Predictors of Metastasis and Survival in Breast Cancer Patients

    DTIC Science & Technology

    2013-11-01

    Ptrend 0.78 0.62 0.75 Unconditional logistic regression was used to estimate odds ratios (OR) and 95 % confidence intervals (CI) for risk of node...Ptrend 0.71 0.67 Unconditional logistic regression was used to estimate odds ratios (OR) and 95 % confidence intervals (CI) for risk of high-grade tumors... logistic regression was used to estimate odds ratios (OR) and 95 % confidence intervals (CI) for the associations between each of the seven SNPs and

  2. The alarming problems of confounding equivalence using logistic regression models in the perspective of causal diagrams.

    PubMed

    Yu, Yuanyuan; Li, Hongkai; Sun, Xiaoru; Su, Ping; Wang, Tingting; Liu, Yi; Yuan, Zhongshang; Liu, Yanxun; Xue, Fuzhong

    2017-12-28

    Confounders can produce spurious associations between exposure and outcome in observational studies. For majority of epidemiologists, adjusting for confounders using logistic regression model is their habitual method, though it has some problems in accuracy and precision. It is, therefore, important to highlight the problems of logistic regression and search the alternative method. Four causal diagram models were defined to summarize confounding equivalence. Both theoretical proofs and simulation studies were performed to verify whether conditioning on different confounding equivalence sets had the same bias-reducing potential and then to select the optimum adjusting strategy, in which logistic regression model and inverse probability weighting based marginal structural model (IPW-based-MSM) were compared. The "do-calculus" was used to calculate the true causal effect of exposure on outcome, then the bias and standard error were used to evaluate the performances of different strategies. Adjusting for different sets of confounding equivalence, as judged by identical Markov boundaries, produced different bias-reducing potential in the logistic regression model. For the sets satisfied G-admissibility, adjusting for the set including all the confounders reduced the equivalent bias to the one containing the parent nodes of the outcome, while the bias after adjusting for the parent nodes of exposure was not equivalent to them. In addition, all causal effect estimations through logistic regression were biased, although the estimation after adjusting for the parent nodes of exposure was nearest to the true causal effect. However, conditioning on different confounding equivalence sets had the same bias-reducing potential under IPW-based-MSM. Compared with logistic regression, the IPW-based-MSM could obtain unbiased causal effect estimation when the adjusted confounders satisfied G-admissibility and the optimal strategy was to adjust for the parent nodes of outcome, which obtained the highest precision. All adjustment strategies through logistic regression were biased for causal effect estimation, while IPW-based-MSM could always obtain unbiased estimation when the adjusted set satisfied G-admissibility. Thus, IPW-based-MSM was recommended to adjust for confounders set.

  3. Use and interpretation of logistic regression in habitat-selection studies

    USGS Publications Warehouse

    Keating, Kim A.; Cherry, Steve

    2004-01-01

     Logistic regression is an important tool for wildlife habitat-selection studies, but the method frequently has been misapplied due to an inadequate understanding of the logistic model, its interpretation, and the influence of sampling design. To promote better use of this method, we review its application and interpretation under 3 sampling designs: random, case-control, and use-availability. Logistic regression is appropriate for habitat use-nonuse studies employing random sampling and can be used to directly model the conditional probability of use in such cases. Logistic regression also is appropriate for studies employing case-control sampling designs, but careful attention is required to interpret results correctly. Unless bias can be estimated or probability of use is small for all habitats, results of case-control studies should be interpreted as odds ratios, rather than probability of use or relative probability of use. When data are gathered under a use-availability design, logistic regression can be used to estimate approximate odds ratios if probability of use is small, at least on average. More generally, however, logistic regression is inappropriate for modeling habitat selection in use-availability studies. In particular, using logistic regression to fit the exponential model of Manly et al. (2002:100) does not guarantee maximum-likelihood estimates, valid probabilities, or valid likelihoods. We show that the resource selection function (RSF) commonly used for the exponential model is proportional to a logistic discriminant function. Thus, it may be used to rank habitats with respect to probability of use and to identify important habitat characteristics or their surrogates, but it is not guaranteed to be proportional to probability of use. Other problems associated with the exponential model also are discussed. We describe an alternative model based on Lancaster and Imbens (1996) that offers a method for estimating conditional probability of use in use-availability studies. Although promising, this model fails to converge to a unique solution in some important situations. Further work is needed to obtain a robust method that is broadly applicable to use-availability studies.

  4. Relation between serum creatinine and postoperative results of open-heart surgery.

    PubMed

    Ezeldin, Tamer H

    2013-10-01

    To determine the impact of preoperative serum creatinine level in non-dialyzable patients on postoperative morbidity and mortality. This is a prospective study, where serum creatinine was used to give primary assessment on renal function status preoperatively. This study includes 1,033 patients, who underwent coronary artery bypass grafting, or valve(s) operations. The study took place at Al-Hada Military Hospital, Taif, Kingdom of Saudi between May 2008 and January 2012. Data were statistically analyzed using Chi square (x2) test and multivariable logistic regression, to evaluate the postoperative morbidity and mortality risks associated with low serum creatinine levels. Postoperative mortality increased with high serum creatinine level >1.8 mg/dL (p

  5. Vitamin D and Male Sexual Function: A Transversal and Longitudinal Study.

    PubMed

    Tirabassi, Giacomo; Sudano, Maurizio; Salvio, Gianmaria; Cutini, Melissa; Muscogiuri, Giovanna; Corona, Giovanni; Balercia, Giancarlo

    2018-01-01

    The effects of vitamin D on sexual function are very unclear. Therefore, we aimed at evaluating the possible association between vitamin D and sexual function and at assessing the influence of vitamin D administration on sexual function. We retrospectively studied 114 men by evaluating clinical, biochemical, and sexual parameters. A subsample ( n = 41) was also studied longitudinally before and after vitamin D replacement therapy. In the whole sample, after performing logistic regression models, higher levels of 25(OH) vitamin D were significantly associated with high values of total testosterone and of all the International Index of Erectile Function (IIEF) questionnaire parameters. On the other hand, higher levels of total testosterone were positively and significantly associated with high levels of erectile function and IIEF total score. After vitamin D replacement therapy, total and free testosterone increased and erectile function improved, whereas other sexual parameters did not change significantly. At logistic regression analysis, higher levels of vitamin D increase (Δ-) were significantly associated with high values of Δ-erectile function after adjustment for Δ-testosterone. Vitamin D is important for the wellness of male sexual function, and vitamin D administration improves sexual function.

  6. Comparative analysis on the probability of being a good payer

    NASA Astrophysics Data System (ADS)

    Mihova, V.; Pavlov, V.

    2017-10-01

    Credit risk assessment is crucial for the bank industry. The current practice uses various approaches for the calculation of credit risk. The core of these approaches is the use of multiple regression models, applied in order to assess the risk associated with the approval of people applying for certain products (loans, credit cards, etc.). Based on data from the past, these models try to predict what will happen in the future. Different data requires different type of models. This work studies the causal link between the conduct of an applicant upon payment of the loan and the data that he completed at the time of application. A database of 100 borrowers from a commercial bank is used for the purposes of the study. The available data includes information from the time of application and credit history while paying off the loan. Customers are divided into two groups, based on the credit history: Good and Bad payers. Linear and logistic regression are applied in parallel to the data in order to estimate the probability of being good for new borrowers. A variable, which contains value of 1 for Good borrowers and value of 0 for Bad candidates, is modeled as a dependent variable. To decide which of the variables listed in the database should be used in the modelling process (as independent variables), a correlation analysis is made. Due to the results of it, several combinations of independent variables are tested as initial models - both with linear and logistic regression. The best linear and logistic models are obtained after initial transformation of the data and following a set of standard and robust statistical criteria. A comparative analysis between the two final models is made and scorecards are obtained from both models to assess new customers at the time of application. A cut-off level of points, bellow which to reject the applications and above it - to accept them, has been suggested for both the models, applying the strategy to keep the same Accept Rate as in the current data.

  7. Barriers and benefits of a healthy diet in spain: comparison with other European member states.

    PubMed

    Holgado, B; de Irala-Estévez, J; Martínez-González, M A; Gibney, M; Kearney, J; Martínez, J A

    2000-06-01

    Our purpose was to identify the main barriers and benefits perceived by the European citizens in regard to following a healthy diet and to assess the differences in expected benefits and difficulties between Spain and the remaining countries of the European Union. A cross-sectional study in which quota-controlled, nationally representative samples of approximately 1000 adults from each country completed a questionnaire. The survey was carried out between October 1995 and February 1996 in the 15 member states of the European Union. Participants (aged 15 y and older) were selected and interviewed in their homes about their attitudes towards healthy diets. They were asked to select two options from a list of 22 potential barriers to achieve a healthy diet and the benefits derived from a healthy diet. The associations of the perceived benefits of barriers with the sociodemographic variables within Spain and the rest of the European Union were compared with the Pearson chi-squared test and the chi-squared linear trend test. Two multivariate logistic regression models were also fitted to assess the characteristics independently related to the selection of 'Resistance to change' among the main barriers and to the selection of 'Prevent disease/stay healthy' as the main perceived benefits. The barrier most frequently mentioned in Spain was 'Irregular work hours' (29.7%) in contrast with the rest of the European Union where 'Giving up foods that I like' was the barrier most often chosen (26.2%). In the multivariate logistic regression model studying resistance to change, Spaniards were less resistant to change than the rest of the European Union. The benefit more frequently mentioned across Europe was 'Prevent disease/stay healthy'. In the multivariate logistic regression model women, older individuals, and people with a higher educational level were more likely to choose this benefit. It is apparent that there are many barriers to achieve healthy eating, mostly lack of time. For this reason a higher availability of food in line with the nutrition guidelines could be helpful. The population could have a better knowledge of the benefits derived from a healthy diet.

  8. [Associations between dormitory environment/other factors and sleep quality of medical students].

    PubMed

    Zheng, Bang; Wang, Kailu; Pan, Ziqi; Li, Man; Pan, Yuting; Liu, Ting; Xu, Dan; Lyu, Jun

    2016-03-01

    To investigate the sleep quality and related factors among medical students in China, understand the association between dormitory environment and sleep quality, and provide evidence and recommendations for sleep hygiene intervention. A total of 555 undergraduate students were selected from a medical school of an university in Beijing through stratified-cluster random-sampling to conduct a questionnaire survey by using Chinese version of Pittsburgh Sleep Quality Index (PSQI) and self-designed questionnaire. Analyses were performed by using multiple logistic regression model as well as multilevel linear regression model. The prevalence of sleep disorder was 29.1%(149/512), and 39.1%(200/512) of the students reported that the sleep quality was influenced by dormitory environment. PSQI score was negatively correlated with self-reported rating of dormitory environment (γs=-0.310, P<0.001). Logistic regression analysis showed the related factors of sleep disorder included grade, sleep regularity, self-rated health status, pressures of school work and employment, as well as dormitory environment. RESULTS of multilevel regression analysis also indicated that perception on dormitory environment (individual level) was associated with sleep quality with the dormitory level random effects under control (b=-0.619, P<0.001). The prevalence of sleep disorder was high in medical students, which was associated with multiple factors. Dormitory environment should be taken into consideration when the interventions are taken to improve the sleep quality of students.

  9. Functional Data Analysis Applied to Modeling of Severe Acute Mucositis and Dysphagia Resulting From Head and Neck Radiation Therapy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dean, Jamie A., E-mail: jamie.dean@icr.ac.uk; Wong, Kee H.; Gay, Hiram

    Purpose: Current normal tissue complication probability modeling using logistic regression suffers from bias and high uncertainty in the presence of highly correlated radiation therapy (RT) dose data. This hinders robust estimates of dose-response associations and, hence, optimal normal tissue–sparing strategies from being elucidated. Using functional data analysis (FDA) to reduce the dimensionality of the dose data could overcome this limitation. Methods and Materials: FDA was applied to modeling of severe acute mucositis and dysphagia resulting from head and neck RT. Functional partial least squares regression (FPLS) and functional principal component analysis were used for dimensionality reduction of the dose-volume histogrammore » data. The reduced dose data were input into functional logistic regression models (functional partial least squares–logistic regression [FPLS-LR] and functional principal component–logistic regression [FPC-LR]) along with clinical data. This approach was compared with penalized logistic regression (PLR) in terms of predictive performance and the significance of treatment covariate–response associations, assessed using bootstrapping. Results: The area under the receiver operating characteristic curve for the PLR, FPC-LR, and FPLS-LR models was 0.65, 0.69, and 0.67, respectively, for mucositis (internal validation) and 0.81, 0.83, and 0.83, respectively, for dysphagia (external validation). The calibration slopes/intercepts for the PLR, FPC-LR, and FPLS-LR models were 1.6/−0.67, 0.45/0.47, and 0.40/0.49, respectively, for mucositis (internal validation) and 2.5/−0.96, 0.79/−0.04, and 0.79/0.00, respectively, for dysphagia (external validation). The bootstrapped odds ratios indicated significant associations between RT dose and severe toxicity in the mucositis and dysphagia FDA models. Cisplatin was significantly associated with severe dysphagia in the FDA models. None of the covariates was significantly associated with severe toxicity in the PLR models. Dose levels greater than approximately 1.0 Gy/fraction were most strongly associated with severe acute mucositis and dysphagia in the FDA models. Conclusions: FPLS and functional principal component analysis marginally improved predictive performance compared with PLR and provided robust dose-response associations. FDA is recommended for use in normal tissue complication probability modeling.« less

  10. Functional Data Analysis Applied to Modeling of Severe Acute Mucositis and Dysphagia Resulting From Head and Neck Radiation Therapy.

    PubMed

    Dean, Jamie A; Wong, Kee H; Gay, Hiram; Welsh, Liam C; Jones, Ann-Britt; Schick, Ulrike; Oh, Jung Hun; Apte, Aditya; Newbold, Kate L; Bhide, Shreerang A; Harrington, Kevin J; Deasy, Joseph O; Nutting, Christopher M; Gulliford, Sarah L

    2016-11-15

    Current normal tissue complication probability modeling using logistic regression suffers from bias and high uncertainty in the presence of highly correlated radiation therapy (RT) dose data. This hinders robust estimates of dose-response associations and, hence, optimal normal tissue-sparing strategies from being elucidated. Using functional data analysis (FDA) to reduce the dimensionality of the dose data could overcome this limitation. FDA was applied to modeling of severe acute mucositis and dysphagia resulting from head and neck RT. Functional partial least squares regression (FPLS) and functional principal component analysis were used for dimensionality reduction of the dose-volume histogram data. The reduced dose data were input into functional logistic regression models (functional partial least squares-logistic regression [FPLS-LR] and functional principal component-logistic regression [FPC-LR]) along with clinical data. This approach was compared with penalized logistic regression (PLR) in terms of predictive performance and the significance of treatment covariate-response associations, assessed using bootstrapping. The area under the receiver operating characteristic curve for the PLR, FPC-LR, and FPLS-LR models was 0.65, 0.69, and 0.67, respectively, for mucositis (internal validation) and 0.81, 0.83, and 0.83, respectively, for dysphagia (external validation). The calibration slopes/intercepts for the PLR, FPC-LR, and FPLS-LR models were 1.6/-0.67, 0.45/0.47, and 0.40/0.49, respectively, for mucositis (internal validation) and 2.5/-0.96, 0.79/-0.04, and 0.79/0.00, respectively, for dysphagia (external validation). The bootstrapped odds ratios indicated significant associations between RT dose and severe toxicity in the mucositis and dysphagia FDA models. Cisplatin was significantly associated with severe dysphagia in the FDA models. None of the covariates was significantly associated with severe toxicity in the PLR models. Dose levels greater than approximately 1.0 Gy/fraction were most strongly associated with severe acute mucositis and dysphagia in the FDA models. FPLS and functional principal component analysis marginally improved predictive performance compared with PLR and provided robust dose-response associations. FDA is recommended for use in normal tissue complication probability modeling. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.

  11. Serum total bilirubin levels are negatively correlated with metabolic syndrome in aged Chinese women: a community-based study.

    PubMed

    Zhong, P; Sun, D M; Wu, D H; Li, T M; Liu, X Y; Liu, H Y

    2017-01-26

    We evaluated serum total bilirubin levels as a predictor for metabolic syndrome (MetS) and investigated the relationship between serum total bilirubin levels and MetS prevalence. This cross-sectional study included 1728 participants over 65 years of age from Eastern China. Anthropometric data, lifestyle information, and previous medical history were collected. We then measured serum levels of fasting blood-glucose, total cholesterol, triglycerides, and total bilirubin, as well as alanine aminotransferase activity. The prevalence of MetS and each of its individual component were calculated per quartile of total bilirubin level. Logistic regression was used to assess the correlation between serum total bilirubin levels and MetS. Total bilirubin level in the women who did not have MetS was significantly higher than in those who had MetS (P<0.001). Serum total bilirubin quartiles were linearly and negatively correlated with MetS prevalence and hypertriglyceridemia (HTG) in females (P<0.005). Logistic regression showed that serum total bilirubin was an independent predictor of MetS for females (OR: 0.910, 95%CI: 0.863-0.960; P=0.001). The present study suggests that physiological levels of serum total bilirubin might be an independent risk factor for aged Chinese women, and the prevalence of MetS and HTG are negatively correlated to serum total bilirubin levels.

  12. Cognitive and Social Functioning Correlates of Employment Among People with Severe Mental Illness.

    PubMed

    Saavedra, Javier; López, Marcelino; González, Sergio; Arias, Samuel; Crawford, Paul

    2016-10-01

    We assess how social and cognitive functioning is associated to gaining employment for 213 people diagnosed with severe mental illness taking part in employment programs in Andalusia (Spain). We used the Repeatable Battery for the Assessment of Neuropsychological Status and the Social Functioning Scale and conducted two binary logistical regression analyses. Response variables were: having a job or not, in ordinary companies (OCs) and social enterprises, and working in an OC or not. There were two variables with significant adjusted odds ratios for having a job: "attention" and "Educational level". There were five variables with significant odds ratios for having a job in an OC: "Sex", "Educational level", "Attention", "Communication", and "Independence-competence". The study looks at the possible benefits of combining employment with support and social enterprises in employment programs for these people and underlines how both social and cognitive functioning are central to developing employment models.

  13. Confirming the validity of the CONUT system for early detection and monitoring of clinical undernutrition: comparison with two logistic regression models developed using SGA as the gold standard.

    PubMed

    González-Madroño, A; Mancha, A; Rodríguez, F J; Culebras, J; de Ulibarri, J I

    2012-01-01

    To ratify previous validations of the CONUT nutritional screening tool by the development of two probabilistic models using the parameters included in the CONUT, to see if the CONUT´s effectiveness could be improved. It is a two step prospective study. In Step 1, 101 patients were randomly selected, and SGA and CONUT was made. With data obtained an unconditional logistic regression model was developed, and two variants of CONUT were constructed: Model 1 was made by a method of logistic regression. Model 2 was made by dividing the probabilities of undernutrition obtained in model 1 in seven regular intervals. In step 2, 60 patients were selected and underwent the SGA, the original CONUT and the new models developed. The diagnostic efficacy of the original CONUT and the new models was tested by means of ROC curves. Both samples 1 and 2 were put together to measure the agreement degree between the original CONUT and SGA, and diagnostic efficacy parameters were calculated. No statistically significant differences were found between sample 1 and 2, regarding age, sex and medical/surgical distribution and undernutrition rates were similar (over 40%). The AUC for the ROC curves were 0.862 for the original CONUT, and 0.839 and 0.874, for model 1 and 2 respectively. The kappa index for the CONUT and SGA was 0.680. The CONUT, with the original scores assigned by the authors is equally good than mathematical models and thus is a valuable tool, highly useful and efficient for the purpose of Clinical Undernutrition screening.

  14. Logistic regression models of factors influencing the location of bioenergy and biofuels plants

    Treesearch

    T.M. Young; R.L. Zaretzki; J.H. Perdue; F.M. Guess; X. Liu

    2011-01-01

    Logistic regression models were developed to identify significant factors that influence the location of existing wood-using bioenergy/biofuels plants and traditional wood-using facilities. Logistic models provided quantitative insight for variables influencing the location of woody biomass-using facilities. Availability of "thinnings to a basal area of 31.7m2/ha...

  15. First World War and Mental Health: a retrospective comparative study of veterans admitted to a psychiatric hospital between 1915 and 1918.

    PubMed

    Lagonia, Paolo; Aloi, Matteo; Magliocco, Fabio; Cerminara, Gregorio; Segura-Garcia, Cristina; Del Vecchio, Valeria; Luciano, Mario; Fiorillo, Andrea; De Fazio, Pasquale

    2017-01-01

    The association between mental illness and war has been repeatedly investigated. Higher levels of depressive symptoms and an increased suicidal risk have been found in veterans. In this study we investigated the mental health conditions among Italian soldiers during the “Great War”, who were hospitalized in a mental health hospital in Italy. The study sample consists of 498 soldiers who were admitted during the World War I between 1915 and 1918, and 498 civilian patients admitted in two different periods (1898-1914, 1919- 1932). Psychiatric diagnoses have been recorded retrospectively by a detailed examination of clinical records. Socio-demographic informations, diagnosis at first admission, number of admissions, and deployment in war zones were collected. A logistic regression analysis was performed, the diagnosis of depression was considered as dependent variable while clinical and demographic variables as independent predictors. Soldiers deployed in war zones were more likely to have a diagnosis of depression compared to those not serving on the frontline. The logistic regression analysis showed that the diagnosis of depression is predicted by being a soldier and being deployed in a war area. Our data confirm that soldiers engaged in war are at higher risk of developing depression compared to non-deployed soldiers.

  16. Classification Models to Predict Survival of Kidney Transplant Recipients Using Two Intelligent Techniques of Data Mining and Logistic Regression.

    PubMed

    Nematollahi, M; Akbari, R; Nikeghbalian, S; Salehnasab, C

    2017-01-01

    Kidney transplantation is the treatment of choice for patients with end-stage renal disease (ESRD). Prediction of the transplant survival is of paramount importance. The objective of this study was to develop a model for predicting survival in kidney transplant recipients. In a cross-sectional study, 717 patients with ESRD admitted to Nemazee Hospital during 2008-2012 for renal transplantation were studied and the transplant survival was predicted for 5 years. The multilayer perceptron of artificial neural networks (MLP-ANN), logistic regression (LR), Support Vector Machine (SVM), and evaluation tools were used to verify the determinant models of the predictions and determine the independent predictors. The accuracy, area under curve (AUC), sensitivity, and specificity of SVM, MLP-ANN, and LR models were 90.4%, 86.5%, 98.2%, and 49.6%; 85.9%, 76.9%, 97.3%, and 26.1%; and 84.7%, 77.4%, 97.5%, and 17.4%, respectively. Meanwhile, the independent predictors were discharge time creatinine level, recipient age, donor age, donor blood group, cause of ESRD, recipient hypertension after transplantation, and duration of dialysis before transplantation. SVM and MLP-ANN models could efficiently be used for determining survival prediction in kidney transplant recipients.

  17. Fuzzy multinomial logistic regression analysis: A multi-objective programming approach

    NASA Astrophysics Data System (ADS)

    Abdalla, Hesham A.; El-Sayed, Amany A.; Hamed, Ramadan

    2017-05-01

    Parameter estimation for multinomial logistic regression is usually based on maximizing the likelihood function. For large well-balanced datasets, Maximum Likelihood (ML) estimation is a satisfactory approach. Unfortunately, ML can fail completely or at least produce poor results in terms of estimated probabilities and confidence intervals of parameters, specially for small datasets. In this study, a new approach based on fuzzy concepts is proposed to estimate parameters of the multinomial logistic regression. The study assumes that the parameters of multinomial logistic regression are fuzzy. Based on the extension principle stated by Zadeh and Bárdossy's proposition, a multi-objective programming approach is suggested to estimate these fuzzy parameters. A simulation study is used to evaluate the performance of the new approach versus Maximum likelihood (ML) approach. Results show that the new proposed model outperforms ML in cases of small datasets.

  18. Effect of plasma homocysteine level and urinary monomethylarsonic acid on the risk of arsenic-associated carotid atherosclerosis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wu, M.-M.; Graduate Institute of Medicine, College of Medicine, Fu-Jen Catholic University, Taipei, Taiwan; Chiou, H.-Y.

    2006-10-01

    Arsenic-contaminated well water has been shown to increase the risk of atherosclerosis. Because of involving S-adenosylmethionine, homocysteine may modify the risk by interfering with the biomethylation of ingested arsenic. In this study, we assessed the effect of plasma homocysteine level and urinary monomethylarsonic acid (MMA{sup V}) on the risk of atherosclerosis associated with arsenic. In total, 163 patients with carotid atherosclerosis and 163 controls were studied. Lifetime cumulative arsenic exposure from well water for study subjects was measured as index of arsenic exposure. Homocysteine level was determined by high-performance liquid chromatography (HPLC). Proportion of MMA{sup V} (MMA%) was calculated bymore » dividing with total arsenic species in urine, including arsenite, arsenate, MMA{sup V}, and dimethylarsinic acid (DMA{sup V}). Results of multiple linear regression analysis show a positive correlation of plasma homocysteine levels to the cumulative arsenic exposure after controlling for atherosclerosis status and nutritional factors (P < 0.05). This correlation, however, did not change substantially the effect of arsenic exposure on the risk of atherosclerosis as analyzed in a subsequent logistic regression model. Logistic regression analyses also show that elevated plasma homocysteine levels did not confer an independent risk for developing atherosclerosis in the study population. However, the risk of having atherosclerosis was increased to 5.4-fold (95% CI, 2.0-15.0) for the study subjects with high MMA% ({>=}16.5%) and high homocysteine levels ({>=}12.7 {mu}mol/l) as compared to those with low MMA% (<9.9%) and low homocysteine levels (<12.7 {mu}mol/l). Elevated homocysteinemia may exacerbate the formation of atherosclerosis related to arsenic exposure in individuals with high levels of MMA% in urine.« less

  19. Serum Vitamin D Levels and Markers of Severity of Childhood Asthma in Costa Rica

    PubMed Central

    Brehm, John M.; Celedón, Juan C.; Soto-Quiros, Manuel E.; Avila, Lydiana; Hunninghake, Gary M.; Forno, Erick; Laskey, Daniel; Sylvia, Jody S.; Hollis, Bruce W.; Weiss, Scott T.; Litonjua, Augusto A.

    2009-01-01

    Rationale: Maternal vitamin D intake during pregnancy has been inversely associated with asthma symptoms in early childhood. However, no study has examined the relationship between measured vitamin D levels and markers of asthma severity in childhood. Objectives: To determine the relationship between measured vitamin D levels and both markers of asthma severity and allergy in childhood. Methods: We examined the relation between 25-hydroxyvitamin D levels (the major circulating form of vitamin D) and markers of allergy and asthma severity in a cross-sectional study of 616 Costa Rican children between the ages of 6 and 14 years. Linear, logistic, and negative binomial regressions were used for the univariate and multivariate analyses. Measurements and Main Results: Of the 616 children with asthma, 175 (28%) had insufficient levels of vitamin D (<30 ng/ml). In multivariate linear regression models, vitamin D levels were significantly and inversely associated with total IgE and eosinophil count. In multivariate logistic regression models, a log10 unit increase in vitamin D levels was associated with reduced odds of any hospitalization in the previous year (odds ratio [OR], 0.05; 95% confidence interval [CI], 0.004–0.71; P = 0.03), any use of antiinflammatory medications in the previous year (OR, 0.18; 95% CI, 0.05–0.67; P = 0.01), and increased airway responsiveness (a ≤8.58-μmol provocative dose of methacholine producing a 20% fall in baseline FEV1 [OR, 0.15; 95% CI, 0.024–0.97; P = 0.05]). Conclusions: Our results suggest that vitamin D insufficiency is relatively frequent in an equatorial population of children with asthma. In these children, lower vitamin D levels are associated with increased markers of allergy and asthma severity. PMID:19179486

  20. PAKDD Data Mining Competition 2009: New Ways of Using Known Methods

    NASA Astrophysics Data System (ADS)

    Linhart, Chaim; Harari, Guy; Abramovich, Sharon; Buchris, Altina

    The PAKDD 2009 competition focuses on the problem of credit risk assessment. As required, we had to confront the problem of the robustness of the credit-scoring model against performance degradation caused by gradual market changes along a few years of business operation. We utilized the following standard models: logistic regression, KNN, SVM, GBM and decision tree. The novelty of our approach is two-fold: the integration of existing models, namely feeding the results of KNN as an input variable to the logistic regression, and re-coding categorical variables as numerical values that represent each category's statistical impact on the target label. The best solution we obtained reached 3rd place in the competition, with an AUC score of 0.655.

  1. A Primer on Logistic Regression.

    ERIC Educational Resources Information Center

    Woldbeck, Tanya

    This paper introduces logistic regression as a viable alternative when the researcher is faced with variables that are not continuous. If one is to use simple regression, the dependent variable must be measured on a continuous scale. In the behavioral sciences, it may not always be appropriate or possible to have a measured dependent variable on a…

  2. Geographic information systems and logistic regression for high-resolution malaria risk mapping in a rural settlement of the southern Brazilian Amazon.

    PubMed

    de Oliveira, Elaine Cristina; dos Santos, Emerson Soares; Zeilhofer, Peter; Souza-Santos, Reinaldo; Atanaka-Santos, Marina

    2013-11-15

    In Brazil, 99% of the cases of malaria are concentrated in the Amazon region, with high level of transmission. The objectives of the study were to use geographic information systems (GIS) analysis and logistic regression as a tool to identify and analyse the relative likelihood and its socio-environmental determinants of malaria infection in the Vale do Amanhecer rural settlement, Brazil. A GIS database of georeferenced malaria cases, recorded in 2005, and multiple explanatory data layers was built, based on a multispectral Landsat 5 TM image, digital map of the settlement blocks and a SRTM digital elevation model. Satellite imagery was used to map the spatial patterns of land use and cover (LUC) and to derive spectral indices of vegetation density (NDVI) and soil/vegetation humidity (VSHI). An Euclidian distance operator was applied to measure proximity of domiciles to potential mosquito breeding habitats and gold mining areas. The malaria risk model was generated by multiple logistic regression, in which environmental factors were considered as independent variables and the number of cases, binarized by a threshold value was the dependent variable. Out of a total of 336 cases of malaria, 133 positive slides were from inhabitants at Road 08, which corresponds to 37.60% of the notifications. The southern region of the settlement presented 276 cases and a greater number of domiciles in which more than ten cases/home were notified. From these, 102 (30.36%) cases were caused by Plasmodium falciparum and 174 (51.79%) cases by Plasmodium vivax. Malaria risk is the highest in the south of the settlement, associated with proximity to gold mining sites, intense land use, high levels of soil/vegetation humidity and low vegetation density. Mid-resolution, remote sensing data and GIS-derived distance measures can be successfully combined with digital maps of the housing location of (non-) infected inhabitants to predict relative likelihood of disease infection through the analysis by logistic regression. Obtained findings on the relation between malaria cases and environmental factors should be applied in the future for land use planning in rural settlements in the Southern Amazon to minimize risks of disease transmission.

  3. Mobile phone use during driving: Effects on speed and effectiveness of driver compensatory behaviour.

    PubMed

    Choudhary, Pushpa; Velaga, Nagendra R

    2017-09-01

    This study analysed and modelled the effects of conversation and texting (each with two difficulty levels) on driving performance of Indian drivers in terms of their mean speed and accident avoiding abilities; and further explored the relationship between speed reduction strategy of the drivers and their corresponding accident frequency. 100 drivers of three different age groups (young, mid-age and old-age) participated in the simulator study. Two sudden events of Indian context: unexpected crossing of pedestrians and joining of parked vehicles from road side, were simulated for estimating the accident probabilities. Generalized linear mixed models approach was used for developing linear regression models for mean speed and binary logistic regression models for accident probability. The results of the models showed that the drivers significantly compensated the increased workload by reducing their mean speed by 2.62m/s and 5.29m/s in the presence of conversation and texting tasks respectively. The logistic models for accident probabilities showed that the accident probabilities increased by 3 and 4 times respectively when the drivers were conversing or texting on a phone during driving. Further, the relationship between the speed reduction patterns and their corresponding accident frequencies showed that all the drivers compensated differently; but, among all the drivers, only few drivers, who compensated by reducing the speed by 30% or more, were able to fully offset the increased accident risk associated with the phone use. Copyright © 2017 Elsevier Ltd. All rights reserved.

  4. Development and validation of a mortality risk model for pediatric sepsis.

    PubMed

    Chen, Mengshi; Lu, Xiulan; Hu, Li; Liu, Pingping; Zhao, Wenjiao; Yan, Haipeng; Tang, Liang; Zhu, Yimin; Xiao, Zhenghui; Chen, Lizhang; Tan, Hongzhuan

    2017-05-01

    Pediatric sepsis is a burdensome public health problem. Assessing the mortality risk of pediatric sepsis patients, offering effective treatment guidance, and improving prognosis to reduce mortality rates, are crucial.We extracted data derived from electronic medical records of pediatric sepsis patients that were collected during the first 24 hours after admission to the pediatric intensive care unit (PICU) of the Hunan Children's hospital from January 2012 to June 2014. A total of 788 children were randomly divided into a training (592, 75%) and validation group (196, 25%). The risk factors for mortality among these patients were identified by conducting multivariate logistic regression in the training group. Based on the established logistic regression equation, the logit probabilities for all patients (in both groups) were calculated to verify the model's internal and external validities.According to the training group, 6 variables (brain natriuretic peptide, albumin, total bilirubin, D-dimer, lactate levels, and mechanical ventilation in 24 hours) were included in the final logistic regression model. The areas under the curves of the model were 0.854 (0.826, 0.881) and 0.844 (0.816, 0.873) in the training and validation groups, respectively.The Mortality Risk Model for Pediatric Sepsis we established in this study showed acceptable accuracy to predict the mortality risk in pediatric sepsis patients.

  5. Development and validation of a mortality risk model for pediatric sepsis

    PubMed Central

    Chen, Mengshi; Lu, Xiulan; Hu, Li; Liu, Pingping; Zhao, Wenjiao; Yan, Haipeng; Tang, Liang; Zhu, Yimin; Xiao, Zhenghui; Chen, Lizhang; Tan, Hongzhuan

    2017-01-01

    Abstract Pediatric sepsis is a burdensome public health problem. Assessing the mortality risk of pediatric sepsis patients, offering effective treatment guidance, and improving prognosis to reduce mortality rates, are crucial. We extracted data derived from electronic medical records of pediatric sepsis patients that were collected during the first 24 hours after admission to the pediatric intensive care unit (PICU) of the Hunan Children's hospital from January 2012 to June 2014. A total of 788 children were randomly divided into a training (592, 75%) and validation group (196, 25%). The risk factors for mortality among these patients were identified by conducting multivariate logistic regression in the training group. Based on the established logistic regression equation, the logit probabilities for all patients (in both groups) were calculated to verify the model's internal and external validities. According to the training group, 6 variables (brain natriuretic peptide, albumin, total bilirubin, D-dimer, lactate levels, and mechanical ventilation in 24 hours) were included in the final logistic regression model. The areas under the curves of the model were 0.854 (0.826, 0.881) and 0.844 (0.816, 0.873) in the training and validation groups, respectively. The Mortality Risk Model for Pediatric Sepsis we established in this study showed acceptable accuracy to predict the mortality risk in pediatric sepsis patients. PMID:28514310

  6. A Solution to Separation and Multicollinearity in Multiple Logistic Regression

    PubMed Central

    Shen, Jianzhao; Gao, Sujuan

    2010-01-01

    In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27–38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth’s penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study. PMID:20376286

  7. A Solution to Separation and Multicollinearity in Multiple Logistic Regression.

    PubMed

    Shen, Jianzhao; Gao, Sujuan

    2008-10-01

    In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27-38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth's penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study.

  8. Correlates of county-level nonviral sexually transmitted infection hot spots in the US: application of hot spot analysis and spatial logistic regression.

    PubMed

    Chang, Brian A; Pearson, William S; Owusu-Edusei, Kwame

    2017-04-01

    We used a combination of hot spot analysis (HSA) and spatial regression to examine county-level hot spot correlates for the most commonly reported nonviral sexually transmitted infections (STIs) in the 48 contiguous states in the United States (US). We obtained reported county-level total case rates of chlamydia, gonorrhea, and primary and secondary (P&S) syphilis in all counties in the 48 contiguous states from national surveillance data and computed temporally smoothed rates using 2008-2012 data. Covariates were obtained from county-level multiyear (2008-2012) American Community Surveys from the US census. We conducted HSA to identify hot spot counties for all three STIs. We then applied spatial logistic regression with the spatial error model to determine the association between the identified hot spots and the covariates. HSA indicated that ≥84% of hot spots for each STI were in the South. Spatial regression results indicated that, a 10-unit increase in the percentage of Black non-Hispanics was associated with ≈42% (P < 0.01) [≈22% (P < 0.01), for Hispanics] increase in the odds of being a hot spot county for chlamydia and gonorrhea, and ≈27% (P < 0.01) [≈11% (P < 0.01) for Hispanics] for P&S syphilis. Compared with the other regions (West, Midwest, and Northeast), counties in the South were 6.5 (P < 0.01; chlamydia), 9.6 (P < 0.01; gonorrhea), and 4.7 (P < 0.01; P&S syphilis) times more likely to be hot spots. Our study provides important information on hot spot clusters of nonviral STIs in the entire United States, including associations between hot spot counties and sociodemographic factors. Published by Elsevier Inc.

  9. Prevalence and correlates of cognitive impairment in kidney transplant recipients.

    PubMed

    Gupta, Aditi; Mahnken, Jonathan D; Johnson, David K; Thomas, Tashra S; Subramaniam, Dipti; Polshak, Tyler; Gani, Imran; John Chen, G; Burns, Jeffrey M; Sarnak, Mark J

    2017-05-12

    There is a high prevalence of cognitive impairment in dialysis patients. The prevalence of cognitive impairment after kidney transplantation is unknown. Study Design: Cross-sectional study. Single center study of prevalent kidney transplant recipients from a transplant clinic in a large academic center. Assessment of cognition using the Montreal Cognitive Assessment (MoCA). Demographic and clinical variables associated with cognitive impairment were also examined. Outcomes and Measurements: a) Prevalence of cognitive impairment defined by a MoCA score of <26. b) Multivariable linear and logistic regression to examine the association of demographic and clinical factors with cognitive impairment. Data from 226 patients were analyzed. Mean (SD) age was 54 (13.4) years, 73% were white, 60% were male, 37% had diabetes, 58% had an education level of college or above, and the mean (SD) time since kidney transplant was 3.4 (4.1) years. The prevalence of cognitive impairment was 58.0%. Multivariable linear regression demonstrated that older age, male gender and absence of diabetes were associated with lower MoCA scores (p < 0.01 for all). Estimated glomerular filtration rate (eGFR) was not associated with level of cognition. The logistic regression analysis confirmed the association of older age with cognitive impairment. Cognitive impairment is common in prevalent kidney transplant recipients, at a younger age compared to general population, and is associated with certain demographic variables, but not level of eGFR.

  10. Relationship of lead, mercury, mirex, dichlorodiphenyldichloroethylene, hexachlorobenzene, and polychlorinated biphenyls to timing of menarche among Akwesasne Mohawk girls.

    PubMed

    Denham, Melinda; Schell, Lawrence M; Deane, Glenn; Gallo, Mia V; Ravenscroft, Julia; DeCaprio, Anthony P

    2005-02-01

    Children are commonly exposed at background levels to several ubiquitous environmental pollutants, such as lead and persistent organic pollutants, that have been linked to neurologic and endocrine effects. These effects have prompted concern about alterations in human reproductive development. Few studies have examined the effects of these toxicants on human sexual maturation at levels commonly found in the general population, and none has been able to examine multiple toxicant exposures. The aim of the current investigation was to examine the relationship between attainment of menarche and levels of 6 environmental pollutants to which children are commonly exposed at low levels, ie, dichlorodiphenyldichloroethylene (p,p'-DDE), hexachlorobenzene (HCB), polychlorinated biphenyls (PCBs), mirex, lead, and mercury. This study was conducted with residents of the Akwesasne Mohawk Nation, a sovereign territory that spans the St Lawrence River and the boundaries of New York State and Ontario and Quebec, Canada. Since the 1950s, the St Lawrence River has been a site of substantial industrial development, and the Nation is currently adjacent to a US National Priority Superfund site. PCB, p,p'-DDE, HCB, and mirex levels exceeding the US Food and Drug Administration recommended tolerance limits for human consumption have been found in local animal species. The present analysis included 138 Akwesasne Mohawk Nation girls 10 to 16.9 years of age. Blood samples and sociodemographic data were collected by Akwesasne community members, without prior knowledge of participants' exposure status. Attainment of menses (menarche) was assessed as present or absent at the time of the interview. Congener-specific PCB analysis was available, and all 16 PCB congeners detected in >50% of the sample were included in analyses (International Union of Pure and Applied Chemistry numbers 52, 70, 74, 84, 87, 95, 99, 101 [+90], 105, 110, 118, 138 [+163 and 164], 149 [+123], 153, 180, and 187). Probit analysis was used to determine the median age at menarche for the sample. Binary logistic regression analysis was used to determine predictors of menarcheal status. Six toxicants (p,p'-DDE, HCB, PCBs, mirex, lead, and mercury) were entered into the logistic regression model. Age, socioeconomic status (SES), and BMI were tested as potential cofounders and were included in the model at P < .05. Interactions among toxicants were also evaluated. Toxicant levels were measured in blood for this sample and were consistent with long-term exposure to a variety of toxicants in multiple media. Mercury levels were at or below background levels, all lead levels were well below the Centers for Disease Control and Prevention action limit of 10 microg/dL, and PCB levels were consistent with a cumulative, continuing exposure pattern. The median age at menarche for the total sample was 12.2 years. The predicted age at menarche for girls with lead levels above the median (1.2 microg/dL) was 10.5 months later than that for girls with lead levels below the median. In the logistic regression analysis, age was the strongest predictor of menarcheal status and SES was also a significant predictor but BMI was not. The logistic regression analysis that corrected for age, SES, and other pollutants (p,p'-DDE, HCB, mirex, and mercury) indicated that, at their respective geometric means, lead (geometric mean: 0.49 microg/dL) was associated with a significantly lower probability of having reached menarche (beta = -1.29) and a group of 4 potentially estrogenic PCB congeners (E-PCB) (geometric mean: 0.12 ppb; International Union of Pure and Applied Chemistry numbers 52, 70, 101 [+90], and 187) was associated with a significantly greater probability of having reached menarche (beta = 2.13). Predicted probabilities at different levels of lead and PCBs were calculated on the basis of the logistic regression model. At the respective means of all toxicants and SES, 69% of 12-year-old girls were predicted to have reached menarche. However, at the 75th percentile of lead levels, only 10% of 12-year-old Mohawk girls were predicted to have reached menarche; at the 75th percentile of E-PCB levels, 86% of 12-year-old Mohawk girls were predicted to have reached menarche. No association was observed between mirex, p,p'-DDE, or HCB and menarcheal status. Although BMI was not a significant predictor, we tested BMI in the logistic regression model; it had little effect on the relationships between menarcheal status and either lead or E-PCB. In models testing toxicant interactions, age, SES, lead levels, and PCB levels continued to be significant predictors of menarcheal status. When each toxicant was tested in a logistic regression model correcting only for age and SES, we observed little change in the effects of lead or E-PCB on menarcheal status. The analysis of multichemical exposure among Akwesasne Mohawk Nation adolescent girls suggests that the attainment of menarche may be sensitive to relatively low levels of lead and certain PCB congeners. This study is distinguished by the ability to test many toxicants simultaneously and thus to exclude effects from unmeasured but coexisting exposures. By testing several PCB congener groupings, we were able to determine that specifically a group of potentially estrogenic PCB congeners affected the odds of reaching menarche. The lead and PCB findings are consistent with the literature and are biologically plausible. The sample size, cross-sectional study design, and possible occurrence of confounders beyond those tested suggest that results should be interpreted cautiously. Additional investigation to determine whether such low toxicant levels may affect reproduction and disorders of the reproductive system is warranted.

  11. Fire spread in chaparral – a comparison of laboratory data and model predictions in burning live fuels

    Treesearch

    David R. Weise; Eunmo Koo; Xiangyang Zhou; Shankar Mahalingam; Frédéric Morandini; Jacques-Henri Balbi

    2016-01-01

    Fire behaviour data from 240 laboratory fires in high-density live chaparral fuel beds were compared with model predictions. Logistic regression was used to develop a model to predict fire spread success in the fuel beds and linear regression was used to predict rate of spread. Predictions from the Rothermel equation and three proposed changes as well as two physically...

  12. Gender contentedness in aspirations to become engineers or medical doctors

    NASA Astrophysics Data System (ADS)

    Koul, Ravinder; Lerdpornkulrat, Thanita; Poondej, Chanut

    2017-11-01

    Medical doctor and engineer are highly esteemed STEM professions. This study investigates academic and motivational characteristics of a sample of high school students in Thailand who aspire to become medical doctors or engineers. We used logistic regression to compare maths performance, gender typicality, gender contentedness, and maths and physics self-concepts among students with aspirations for these two professions. We found that high levels of felt gender contentedness in men had positive association with aspirations for engineering irrespective of the levels of maths or physics self-concept. We found that high levels of felt gender contentedness combined with high levels of maths or physics self-concept in women had positive associations with aspirations to become a medical doctor. These findings are evidence that student views of self are associated with uneven gendered patterns in career aspirations and have implications for the potential for future participation.

  13. Modeling the pressure inactivation of Escherichia coli and Salmonella typhimurium in sapote mamey ( Pouteria sapota (Jacq.) H.E. Moore & Stearn) pulp.

    PubMed

    Saucedo-Reyes, Daniela; Carrillo-Salazar, José A; Román-Padilla, Lizbeth; Saucedo-Veloz, Crescenciano; Reyes-Santamaría, María I; Ramírez-Gilly, Mariana; Tecante, Alberto

    2018-03-01

    High hydrostatic pressure inactivation kinetics of Escherichia coli ATCC 25922 and Salmonella enterica subsp. enterica serovar Typhimurium ATCC 14028 ( S. typhimurium) in a low acid mamey pulp at four pressure levels (300, 350, 400, and 450 MPa), different exposure times (0-8 min), and temperature of 25 ± 2℃ were obtained. Survival curves showed deviations from linearity in the form of a tail (upward concavity). The primary models tested were the Weibull model, the modified Gompertz equation, and the biphasic model. The Weibull model gave the best goodness of fit ( R 2 adj  > 0.956, root mean square error < 0.290) in the modeling and the lowest Akaike information criterion value. Exponential-logistic and exponential decay models, and Bigelow-type and an empirical models for b'( P) and n( P) parameters, respectively, were tested as alternative secondary models. The process validation considered the two- and one-step nonlinear regressions for making predictions of the survival fraction; both regression types provided an adequate goodness of fit and the one-step nonlinear regression clearly reduced fitting errors. The best candidate model according to the Akaike theory information, with better accuracy and more reliable predictions was the Weibull model integrated by the exponential-logistic and exponential decay secondary models as a function of time and pressure (two-step procedure) or incorporated as one equation (one-step procedure). Both mathematical expressions were used to determine the t d parameter, where the desired reductions ( 5D) (considering d = 5 ( t 5 ) as the criterion of 5 Log 10 reduction (5 D)) in both microorganisms are attainable at 400 MPa for 5.487 ± 0.488 or 5.950 ± 0.329 min, respectively, for the one- or two-step nonlinear procedure.

  14. [Influences of environmental factors and interaction of several chemokines gene-environmental on systemic lupus erythematosus].

    PubMed

    Ye, Dong-qing; Hu, Yi-song; Li, Xiang-pei; Huang, Fen; Yang, Shi-gui; Hao, Jia-hu; Yin, Jing; Zhang, Guo-qing; Liu, Hui-hui

    2004-11-01

    To explore the impact of environmental factors, daily lifestyle, psycho-social factors and the interactions between environmental factors and chemokines genes on systemic lupus erythematosus (SLE). Case-control study was carried out and environmental factors for SLE were analyzed by univariate and multivariate unconditional logistic regression. Interactions between environmental factors and chemokines polymorphism contributing to systemic lupus erythematosus were also analyzed by logistic regression model. There were nineteen factors associated with SLE when univariate unconditional logistic regression was used. However, when multivariate unconditional logistic regression was used, only five factors showed having impacts on the disease, in which drinking well water (OR=0.099) was protective factor for SLE, and multiple drug allergy (OR=8.174), over-exposure to sunshine (OR=18.339), taking antibiotics (OR=9.630) and oral contraceptives were risk factors for SLE. When unconditional logistic regression model was used, results showed that there was interaction between eating irritable food and -2518MCP-1G/G genotype (OR=4.387). No interaction between environmental factors was found that contributing to SLE in this study. Many environmental factors were related to SLE, and there was an interaction between -2518MCP-1G/G genotype and eating irritable food.

  15. [Factors associated with activities of daily living (ADL) in independently living elderly persons in a community: a baseline examination of a large scale cohort study, Fujiwara-kyo study].

    PubMed

    Komatsu, Masayo; Nezu, Satoko; Tomioka, Kimiko; Hazaki, Kan; Harano, Akihiro; Morikawa, Masayuki; Takagi, Masahiro; Yamada, Masahiro; Matsumoto, Yoshitaka; Iwamoto, Junko; Ishizuka, Rika; Saeki, Keigo; Okamoto, Nozomi; Kurumatani, Norio

    2013-01-01

    To investigate factors associated with activities of daily living in independently living elderly persons in a community. The potential subjects were 4,472 individuals aged 65 years and older who voluntarily participated in a large cohort study, the Fujiwara-kyo study. We used self-administered questionnaires consisting of an activities of daily living (ADL) questionnaire with the Physical Fitness Test established by the Ministry of Education, Culture, Sports, Science and Technology (12 ADL items) to determine the index of higher-level physical independence, demographics, Geriatric Depression Scale, and so on. Mini-mental state examination, measurement of physical fitness, and blood tests were also carried out. A lower ADL level was defined as having a total score of the 12 ADL items (range, 12-36 points) that was below the first quartile of a total score for all the subjects. Factors associated with a low ADL level were examined by multiple logistic regression. A total of 4,198 remained as subjects for analysis. The male, female and 5-year-old groups showed significant differences in the median score of 12 ADL items between any two groups. The highest odds ratio among factors associated with lower ADL level by multiple logistic regression with mutually adjusted independent variables was 4.49 (95%CI: 2.82-7.17) in the groups of "very sharp pain" or "strong pain" during the last month. Low physical ability, self-awareness of limb weakness, a BMI of over 25, low physical activity, cerebrovascular disorder, depression, low cognitive function, unable "to see normally", unable "to hear someone", "muscle, bone and joint pain" were independently associated with lower ADL level. Multiple factors are associated with lower ADL level assessed on the basis of the 12 ADL items.

  16. Controlling Type I Error Rates in Assessing DIF for Logistic Regression Method Combined with SIBTEST Regression Correction Procedure and DIF-Free-Then-DIF Strategy

    ERIC Educational Resources Information Center

    Shih, Ching-Lin; Liu, Tien-Hsiang; Wang, Wen-Chung

    2014-01-01

    The simultaneous item bias test (SIBTEST) method regression procedure and the differential item functioning (DIF)-free-then-DIF strategy are applied to the logistic regression (LR) method simultaneously in this study. These procedures are used to adjust the effects of matching true score on observed score and to better control the Type I error…

  17. Academic Success for Student Veterans Enrolled in Two-Year Colleges

    ERIC Educational Resources Information Center

    Chan, Hsun-Yu

    2018-01-01

    This study examined the relationship between college readiness of student veterans and retention, graduation, or transfer. I analyzed transcript and administrative data for student veterans who used GI Bill benefits at a public two-year college in Wisconsin. Results from logistic regression show that successful course completion rate (earning a C…

  18. Comparison of the incidence of patient-reported post-operative dysphagia between ACDF with a traditional anterior plate and artificial cervical disc replacement.

    PubMed

    Yang, Yi; Ma, Litai; Liu, Hao; Liu, Yilian; Hong, Ying; Wang, Beiyu; Ding, Chen; Deng, Yuxiao; Song, Yueming; Liu, Limin

    2016-09-01

    Compared with anterior cervical discectomy and fusion (ACDF), cervical disc replacement (CDR) has provided satisfactory clinical results. The incidence of post-operative dysphagia between ACDF with a traditional anterior plate and CDR remains controversial. Considering the limited studies and knowledge in this area, a retrospective study focusing on post-operative dysphagia was conducted. The Bazaz grading system was used to assess the severity of dysphagia at post-operative intervals including 1 week, 1 month, 3 months, 6 months, 12 months and 24 months respectively. The Chi-square test, Student t-test, Mann-Whitney U tests and Ordinal Logistic regression were used for data analysis when appropriate. Statistical significance was accepted at a probability value of <0.05. Two hundred and thirty-one patients in the CDR group and one hundred and fifty-eight patients in Plate group were included in this study. The total incidences of dysphagia in the CDR and plate group were 36.58% and 60.43% at one week, 29.27% and 38.85% at one month, 21.95% and 31.65% at three months, 6.83% and 17.99% at six months, 5.85% and 14.39% at 12 months, and 4.39% and 10.07% at the final follow-up respectively (All P<0.05, Mann-Whitney U test). Ordinal Logistic regression analysis showed that female patients, two-level surgery, C4/5 surgery, and anterior cervical plating were significant risk factors for post-operative dysphagia (all P<0.05). Comparing ACDF with a plate, CDR with a Prestige LP can significantly reduce both transient and persistent post-operative dysphagia. Female patients, two-level surgery, C4/5 surgery and anterior cervical plating were associated with a higher incidence of dysphagia. Future prospective, randomized, controlled studies are needed to further validate these findings. Copyright © 2016 Elsevier B.V. All rights reserved.

  19. Administrative Climate and Novices' Intent to Remain Teaching

    ERIC Educational Resources Information Center

    Pogodzinski, Ben; Youngs, Peter; Frank, Kenneth A.; Belman, Dale

    2012-01-01

    Using survey data from novice teachers at the elementary and middle school level across 11 districts, multilevel logistic regressions were estimated to examine the association between novices' perceptions of the administrative climate and their desire to remain teaching within their schools. We find that the probability that a novice teacher…

  20. The Radius of Trust: Religion, Social Embeddedness and Trust in Strangers

    ERIC Educational Resources Information Center

    Welch, Michael R.; Sikkink, David; Loveland, Matthew T.

    2007-01-01

    Data from the 2002 Religion and Public Activism Survey were used to examine relationships among measures of religious orientation, embeddedness in social networks and the level of trust individuals direct toward others. Results from ordered logistic regression analysis demonstrate that Catholics and members of other denominations show…

  1. Do Basic Skills Predict Youth Unemployment (16- to 24-Year-Olds) Also when Controlled for Accomplished Upper-Secondary School? A Cross-Country Comparison

    ERIC Educational Resources Information Center

    Lundetrae, Kjersti; Gabrielsen, Egil; Mykletun, Reidar

    2010-01-01

    Basic skills and educational level are closely related, and both might affect employment. Data from the Adult Literacy and Life Skills Survey were used to examine whether basic skills in terms of literacy and numeracy predicted youth unemployment (16-24 years) while controlling for educational level. Stepwise logistic regression showed that in…

  2. Individual-Level Predictors of Nonparticipation and Dropout in a Life-Skills HIV Prevention Program for Adolescents in Foster Care

    ERIC Educational Resources Information Center

    Thompson, Ronald G., Jr.; Auslander, Wendy F.; Alonzo, Dana

    2012-01-01

    The purpose of this study is to identify individual-level characteristics of foster care adolescents who are more likely to not participate in, and drop out of, a life-skills HIV prevention program delivered over 8 months. Structured interviews were conducted with 320 foster care adolescents (15-18 years). Logistic regression and survival analyses…

  3. [Application of SAS macro to evaluated multiplicative and additive interaction in logistic and Cox regression in clinical practices].

    PubMed

    Nie, Z Q; Ou, Y Q; Zhuang, J; Qu, Y J; Mai, J Z; Chen, J M; Liu, X Q

    2016-05-01

    Conditional logistic regression analysis and unconditional logistic regression analysis are commonly used in case control study, but Cox proportional hazard model is often used in survival data analysis. Most literature only refer to main effect model, however, generalized linear model differs from general linear model, and the interaction was composed of multiplicative interaction and additive interaction. The former is only statistical significant, but the latter has biological significance. In this paper, macros was written by using SAS 9.4 and the contrast ratio, attributable proportion due to interaction and synergy index were calculated while calculating the items of logistic and Cox regression interactions, and the confidence intervals of Wald, delta and profile likelihood were used to evaluate additive interaction for the reference in big data analysis in clinical epidemiology and in analysis of genetic multiplicative and additive interactions.

  4. Excess adiposity, inflammation, and iron-deficiency in female adolescents.

    PubMed

    Tussing-Humphreys, Lisa M; Liang, Huifang; Nemeth, Elizabeta; Freels, Sally; Braunschweig, Carol A

    2009-02-01

    Iron deficiency is more prevalent in overweight children and adolescents but the mechanisms that underlie this condition remain unclear. The purpose of this cross-sectional study was to assess the relationship between iron status and excess adiposity, inflammation, menarche, diet, physical activity, and poverty status in female adolescents included in the National Health and Nutrition Examination Survey 2003-2004 dataset. Descriptive and simple comparative statistics (t test, chi(2)) were used to assess differences between normal-weight (5th < or = body mass index [BMI] percentile <85th) and heavier-weight girls (< or = 85th percentile for BMI) for demographic, biochemical, dietary, and physical activity variables. In addition, logistic regression analyses predicting iron deficiency and linear regression predicting serum iron levels were performed. Heavier-weight girls had an increased prevalence of iron deficiency compared to those with normal weight. Dietary iron, age of and time since first menarche, poverty status, and physical activity were similar between the two groups and were not independent predictors of iron deficiency or log serum iron levels. Logistic modeling predicting iron deficiency revealed having a BMI > or = 85th percentile and for each 1 mg/dL increase in C-reactive protein the odds ratio for iron deficiency more than doubled. The best-fit linear model to predict serum iron levels included both serum transferrin receptor and C-reactive protein following log-transformation for normalization of these variables. Findings indicate that heavier-weight female adolescents are at greater risk for iron deficiency and that inflammation stemming from excess adipose tissue contributes to this phenomenon. Food and nutrition professionals should consider elevated BMI as an additional risk factor for iron deficiency in female adolescents.

  5. Socio-demographic correlates of participation in mammography: a survey among women aged between 35- 69 in Tehran, Iran.

    PubMed

    Samah, Asnarulkhadi Abu; Ahmadian, Maryam

    2012-01-01

    The rates of breast cancer have increased over the past two decades, and this raises concern about physical, psychological and social well-being of women with breast cancer. Further, few women really want to do breast cancer screening. We here investigated the socio-demographic correlates of mammography participation among 400 asymptomatic Iranian women aged between 35 and 69. A cross-sectional survey was conducted at the four outpatient clinics of general hospitals in Tehran during the period from July through October, 2009. Bi-variate analyses and multi-variate binary logistic regression were employed to find the socio- demographic predictors of mammography utilization among participants. The rate of mammography participation was 21.5% and relatively high because of access to general hospital services. More women who had undergone mammography were graduates from university or college, had full-time or part-time employment, were insured whether public or private, reported a positive family history of breast cancer, and were in the middle income level (P <0.01).The largest number of participating women was in the age range of 41 to 50 years. The results of multivariate logistic regression further showed that education (95%CI: 0.131-0.622), monthly income (95%CI: 0.038-0.945), and family history of breast cancer (95%CI: 1.97-9.28) were significantly associated (all P <0.05)with mammography participation. The most important issue for a successful screening program is participation. Using a random sample, this study found that the potential predictor variables of mammography participation included a higher education level, a middle income level, and a positive family history of breast cancer for Iranian women after adjusting for all other demographic variables in the model.

  6. Association of atopic diseases and parvovirus B19 with acute lymphoblastic leukemia in childhood and adolescence in the northeast of Brazil.

    PubMed

    da Conceição Nunes, Joacilda; de Araujo, Georgia Véras; Viana, Marcelo Tavares; Sarinho, Emanuel Sávio Cavalcanti

    2016-10-01

    Several factors related to the immune system, such as a history of allergies and virus infections, may be associated with acute lymphoblastic leukemia (ALL). The purpose of this study was to analyze whether the presence of atopic diseases and previous infection with parvovirus B19 and Epstein-Barr virus (EBV) are associated with the development of ALL. This case-control study was performed in two tertiary hospitals located in northeastern Brazil. The study population included 60 patients who were diagnosed with non-T-cell ALL using myelogram and immunophenotyping and 120 patients in the control group. Atopy was evaluated via a parent questionnaire and medical records. Total immunoglobulin (Ig)E and IgG levels of parvovirus B19 and EBV were measured in the serum. Logistic regression was performed to assess the association between variables and odds of ALL. We identified a significant inverse association between rhinitis, urticaria and elevated IgE serum levels with ALL. A history of parvovirus B19 infection showed a significant association with this type of cancer [OR (95 % CI) 2.00 (1.94-4.26); P = 0.050]. In logistic regression, the presence of atopy was a protective factor [OR (95 % CI) 0.57 (0.38-0.83); P = 0.004], and the presence of IgG for parvovirus B19 was an important risk factor for ALL [OR (95 % CI) 2.20 (1.02-4.76); P = 0.043]. These results suggest that atopic diseases and elevated total IgE levels are associated with a potential protective effect on the development of ALL. Previous infection with parvovirus B19 contributed to ALL susceptibility.

  7. Plasma 25-hydroxyvitamin D3 is associated with decreased risk of postmenopausal breast cancer in whites: a nested case-control study in the multiethnic cohort study.

    PubMed

    Kim, Yeonju; Franke, Adrian A; Shvetsov, Yurii B; Wilkens, Lynne R; Cooney, Robert V; Lurie, Galina; Maskarinec, Gertraud; Hernandez, Brenda Y; Le Marchand, Loïc; Henderson, Brian E; Kolonel, Laurence N; Goodman, Marc T

    2014-01-17

    Higher sunlight exposure is correlated with lower incidence of breast cancer in ecological studies, but findings from prospective studies regarding the association of circulating levels of vitamin D with the risk of breast cancer have been null. The objective of this study was to examine the relation between plasma levels of vitamin D and the risk of postmenopausal breast cancer. We conducted a nested case-control study within the Multiethnic Cohort Study of five race/ethnic groups (white, African-American, Native Hawaiian, Japanese, and Latino) from Hawaii and Los Angeles between 2001 and 2006. Pre-diagnostic plasma levels of 25-hydroxyvitamin D2 [25(OH)D2], 25-hydroxyvitamin D3 [25(OH)D3] and 25(OH)D (sum of 25(OH)D2 and 25(OH)D3) were examined among 707 postmenopausal breast cancer cases and matched controls. Using conditional logistic regression models, 20 ng/mL increases of plasma 25(OH)D3 (odds ratio (OR) 0.28; 95% confidence interval (CI) 0.14-0.56) and 25(OH)D (OR 0.43; 95% CI 0.23-0.80) were inversely associated with breast cancer risk among white women, but not among women in other race/ethnic groups. Using two-segmented, piecewise-linear logistic regression models, the change-points of the ORs, either for 25(OH)D3 or for 25(OH)D, were detected as 20 ng/mL among whites. Circulating 25(OH)D3 and 25(OH)D were associated with a reduced risk of postmenopausal breast cancer among whites, but not in other ethnic groups, who reside in low latitude regions.

  8. Measurements of the talus in the assessment of population affinity.

    PubMed

    Bidmos, Mubarak A; Dayal, Manisha R; Adegboye, Oyelola A

    2018-06-01

    As part of their routine work, forensic anthropologists are expected to report population affinity as part of the biological profile of an individual. The skull is the most widely used bone for the estimation of population affinity but it is not always present in a forensic case. Thus, other bones that preserve well have been shown to give a good indication of either the sex or population affinity of an individual. In this study, the potential of measurements of the talus was investigated for the purpose of estimating population affinity in South Africans. Nine measurements from two hundred and twenty tali of South African Africans (SAA) and South African Whites (SAW) from the Raymond A. Dart Collection of Human Skeletons were used. Direct and step-wise discriminant function and logistic regression analyses were carried out using SPSS and SAS. Talar length was the best single variable for discriminating between these two groups for males while in females the head height was the best single predictor. Average accuracies for correct population affinity classification using logistic regression analysis were higher than those obtained from discriminant function analysis. This study was the first of its type to employ discriminant function analyses and logistic regression analyses to estimate the population affinity of an individual from the talus. Thus these equations can now be used by South African anthropologists when estimating the population affinity of dismembered or damaged or incomplete skeletal remains of SAA and SAW. Copyright © 2018 Elsevier B.V. All rights reserved.

  9. Modeling health survey data with excessive zero and K responses.

    PubMed

    Lin, Ting Hsiang; Tsai, Min-Hsiao

    2013-04-30

    Zero-inflated Poisson regression is a popular tool used to analyze data with excessive zeros. Although much work has already been performed to fit zero-inflated data, most models heavily depend on special features of the individual data. To be specific, this means that there is a sizable group of respondents who endorse the same answers making the data have peaks. In this paper, we propose a new model with the flexibility to model excessive counts other than zero, and the model is a mixture of multinomial logistic and Poisson regression, in which the multinomial logistic component models the occurrence of excessive counts, including zeros, K (where K is a positive integer) and all other values. The Poisson regression component models the counts that are assumed to follow a Poisson distribution. Two examples are provided to illustrate our models when the data have counts containing many ones and sixes. As a result, the zero-inflated and K-inflated models exhibit a better fit than the zero-inflated Poisson and standard Poisson regressions. Copyright © 2012 John Wiley & Sons, Ltd.

  10. No rationale for 1 variable per 10 events criterion for binary logistic regression analysis.

    PubMed

    van Smeden, Maarten; de Groot, Joris A H; Moons, Karel G M; Collins, Gary S; Altman, Douglas G; Eijkemans, Marinus J C; Reitsma, Johannes B

    2016-11-24

    Ten events per variable (EPV) is a widely advocated minimal criterion for sample size considerations in logistic regression analysis. Of three previous simulation studies that examined this minimal EPV criterion only one supports the use of a minimum of 10 EPV. In this paper, we examine the reasons for substantial differences between these extensive simulation studies. The current study uses Monte Carlo simulations to evaluate small sample bias, coverage of confidence intervals and mean square error of logit coefficients. Logistic regression models fitted by maximum likelihood and a modified estimation procedure, known as Firth's correction, are compared. The results show that besides EPV, the problems associated with low EPV depend on other factors such as the total sample size. It is also demonstrated that simulation results can be dominated by even a few simulated data sets for which the prediction of the outcome by the covariates is perfect ('separation'). We reveal that different approaches for identifying and handling separation leads to substantially different simulation results. We further show that Firth's correction can be used to improve the accuracy of regression coefficients and alleviate the problems associated with separation. The current evidence supporting EPV rules for binary logistic regression is weak. Given our findings, there is an urgent need for new research to provide guidance for supporting sample size considerations for binary logistic regression analysis.

  11. Prediction models for clustered data: comparison of a random intercept and standard regression model

    PubMed Central

    2013-01-01

    Background When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. Methods Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. Results The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. Conclusion The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters. PMID:23414436

  12. Prediction models for clustered data: comparison of a random intercept and standard regression model.

    PubMed

    Bouwmeester, Walter; Twisk, Jos W R; Kappen, Teus H; van Klei, Wilton A; Moons, Karel G M; Vergouwe, Yvonne

    2013-02-15

    When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters.

  13. Parenting styles, parenting practices, and physical activity in 10- to 11-year olds

    PubMed Central

    Jago, Russell; Davison, Kirsten K.; Brockman, Rowan; Page, Angie S.; Thompson, Janice L.; Fox, Kenneth R.

    2011-01-01

    Objective The objective of this study was to determine whether parenting styles and practices are associated with children's physical activity. Methods Cross-sectional survey of seven hundred ninety-two 10- to 11-year-old UK children in Bristol (UK) in 2008–2009 was conducted. Accelerometer-assessed physical activity and mean minutes of moderate-to-vigorous physical activity (mean MVPA) and mean counts per minute (mean CPM) were obtained. Maternal parenting style and physical activity parenting practices were self-reported. Results In regression analyses, permissive parenting was associated with higher mean MVPA among girls (+ 6.0 min/day, p < 0.001) and greater mean CPM (+ 98.9 accelerometer counts/min, p = 0.014) among boys when compared to children with authoritative parents. Maternal logistic support was associated with mean CPM for girls (+ 36.2 counts/min, p = 0.001), while paternal logistic support was associated with boys' mean MVPA (+ 4.0 min/day, p = 0.049) and mean CPM (+ 55.7 counts/min, p = 0.014). Conclusions Maternal permissive parenting was associated with higher levels of physical activity than authoritative parenting, but associations differed by child gender and type of physical activity. Maternal logistic support was associated with girls' physical activity, while paternal logistic support was associated with boys' physical activity. Health professionals could encourage parents to increase logistic support for their children's physical activity. PMID:21070805

  14. Why credit risk markets are predestined for exhibiting log-periodic power law structures

    NASA Astrophysics Data System (ADS)

    Wosnitza, Jan Henrik; Leker, Jens

    2014-01-01

    Recent research has established the existence of log-periodic power law (LPPL) patterns in financial institutions’ credit default swap (CDS) spreads. The main purpose of this paper is to clarify why credit risk markets are predestined for exhibiting LPPL structures. To this end, the credit risk prediction of two variants of logistic regression, i.e. polynomial logistic regression (PLR) and kernel logistic regression (KLR), are firstly compared to the standard logistic regression (SLR). In doing so, the question whether the performances of rating systems based on balance sheet ratios can be improved by nonlinear transformations of the explanatory variables is resolved. Building on the result that nonlinear balance sheet ratio transformations hardly improve the SLR’s predictive power in our case, we secondly compare the classification performance of a multivariate SLR to the discriminative powers of probabilities of default derived from three different capital market data, namely bonds, CDSs, and stocks. Benefiting from the prompt inclusion of relevant information, the capital market data in general and CDSs in particular increasingly outperform the SLR while approaching the time of the credit event. Due to the higher classification performances, it seems plausible for creditors to align their investment decisions with capital market-based default indicators, i.e., to imitate the aggregate opinion of the market participants. Since imitation is considered to be the source of LPPL structures in financial time series, it is highly plausible to scan CDS spread developments for LPPL patterns. By establishing LPPL patterns in governmental CDS spread trajectories of some European crisis countries, the LPPL’s application to credit risk markets is extended. This novel piece of evidence further strengthens the claim that credit risk markets are adequate breeding grounds for LPPL patterns.

  15. MODELING SNAKE MICROHABITAT FROM RADIOTELEMETRY STUDIES USING POLYTOMOUS LOGISTIC REGRESSION

    EPA Science Inventory

    Multivariate analysis of snake microhabitat has historically used techniques that were derived under assumptions of normality and common covariance structure (e.g., discriminant function analysis, MANOVA). In this study, polytomous logistic regression (PLR which does not require ...

  16. Incidence and player risk factors for injury in youth football.

    PubMed

    Malina, Robert M; Morano, Peter J; Barron, Mary; Miller, Susan J; Cumming, Sean P; Kontos, Anthony P

    2006-05-01

    To estimate the incidence of injuries in youth football and to assess the relationship between player-related risk factors (age, body size, biological maturity status) and the occurrence of injury in youth football. Prospective over two seasons. Two communities in central Michigan. Subjects were 678 youth, 9-14 years of age, who were members of 33 youth football teams in two central Michigan communities in the 2000 and 2001 seasons. Certified athletic trainers (ATCs) were on site to record the number of players at all practices and home games (exposures) and injuries as they occurred. A reportable injury (RI) was defined by the criteria used in the National Athletic Trainers' Association (NATA) survey of several high school sports. Estimated injury rates (95% confidence intervals) per athlete exposures (AE) and per number of athletes were calculated for practices and games by grade. Player risk factors included age, height, weight, BMI and estimated maturity status. Estimated injury rates and relative risks of injury during practices and games by grade; logistic regression to evaluate relationships between player-related risk factors and risk of injury. A total of 259 RIs, 178 in practice and 81 in games, were recorded during the two seasons. Practice injury rates increased with grade level, while game injury rates were similar among fourth through fifth grade and sixth grade players and about twice as high among seventh and eighth grade players. The majority of RIs during the two seasons was minor (64%); the remainder was moderate (18%) and major (13%). Injured fourth through fifth grade players were significantly lighter in weight and had a lower BMI; otherwise, injured and non-injured players within each grade did not differ in age, body size and estimated biological maturity status. Logistic regressions within grade revealed no significant associations between injury and age, height, BMI, and maturity status. Game injury rates are higher than practice injury rates, and the incidence of injury tends to increase with grade level. Age, height, BMI and maturity status were not related to the risk of injury in youth football players.

  17. Selecting risk factors: a comparison of discriminant analysis, logistic regression and Cox's regression model using data from the Tromsø Heart Study.

    PubMed

    Brenn, T; Arnesen, E

    1985-01-01

    For comparative evaluation, discriminant analysis, logistic regression and Cox's model were used to select risk factors for total and coronary deaths among 6595 men aged 20-49 followed for 9 years. Groups with mortality between 5 and 93 per 1000 were considered. Discriminant analysis selected variable sets only marginally different from the logistic and Cox methods which always selected the same sets. A time-saving option, offered for both the logistic and Cox selection, showed no advantage compared with discriminant analysis. Analysing more than 3800 subjects, the logistic and Cox methods consumed, respectively, 80 and 10 times more computer time than discriminant analysis. When including the same set of variables in non-stepwise analyses, all methods estimated coefficients that in most cases were almost identical. In conclusion, discriminant analysis is advocated for preliminary or stepwise analysis, otherwise Cox's method should be used.

  18. Maximally efficient two-stage screening: Determining intellectual disability in Taiwanese military conscripts.

    PubMed

    Chien, Chia-Chang; Huang, Shu-Fen; Lung, For-Wey

    2009-01-27

    The purpose of this study was to apply a two-stage screening method for the large-scale intelligence screening of military conscripts. We collected 99 conscripted soldiers whose educational levels were senior high school level or lower to be the participants. Every participant was required to take the Wisconsin Card Sorting Test (WCST) and the Wechsler Adult Intelligence Scale-Revised (WAIS-R) assessments. Logistic regression analysis showed the conceptual level responses (CLR) index of the WCST was the most significant index for determining intellectual disability (ID; FIQ ≤ 84). We used the receiver operating characteristic curve to determine the optimum cut-off point of CLR. The optimum one cut-off point of CLR was 66; the two cut-off points were 49 and 66. Comparing the two-stage window screening with the two-stage positive screening, the area under the curve and the positive predictive value increased. Moreover, the cost of the two-stage window screening decreased by 59%. The two-stage window screening is more accurate and economical than the two-stage positive screening. Our results provide an example for the use of two-stage screening and the possibility of the WCST to replace WAIS-R in large-scale screenings for ID in the future.

  19. Maximally efficient two-stage screening: Determining intellectual disability in Taiwanese military conscripts

    PubMed Central

    Chien, Chia-Chang; Huang, Shu-Fen; Lung, For-Wey

    2009-01-01

    Objective: The purpose of this study was to apply a two-stage screening method for the large-scale intelligence screening of military conscripts. Methods: We collected 99 conscripted soldiers whose educational levels were senior high school level or lower to be the participants. Every participant was required to take the Wisconsin Card Sorting Test (WCST) and the Wechsler Adult Intelligence Scale-Revised (WAIS-R) assessments. Results: Logistic regression analysis showed the conceptual level responses (CLR) index of the WCST was the most significant index for determining intellectual disability (ID; FIQ ≤ 84). We used the receiver operating characteristic curve to determine the optimum cut-off point of CLR. The optimum one cut-off point of CLR was 66; the two cut-off points were 49 and 66. Comparing the two-stage window screening with the two-stage positive screening, the area under the curve and the positive predictive value increased. Moreover, the cost of the two-stage window screening decreased by 59%. Conclusion: The two-stage window screening is more accurate and economical than the two-stage positive screening. Our results provide an example for the use of two-stage screening and the possibility of the WCST to replace WAIS-R in large-scale screenings for ID in the future. PMID:21197345

  20. Modification of the Mantel-Haenszel and Logistic Regression DIF Procedures to Incorporate the SIBTEST Regression Correction

    ERIC Educational Resources Information Center

    DeMars, Christine E.

    2009-01-01

    The Mantel-Haenszel (MH) and logistic regression (LR) differential item functioning (DIF) procedures have inflated Type I error rates when there are large mean group differences, short tests, and large sample sizes.When there are large group differences in mean score, groups matched on the observed number-correct score differ on true score,…

  1. The likelihood of achieving quantified road safety targets: a binary logistic regression model for possible factors.

    PubMed

    Sze, N N; Wong, S C; Lee, C Y

    2014-12-01

    In past several decades, many countries have set quantified road safety targets to motivate transport authorities to develop systematic road safety strategies and measures and facilitate the achievement of continuous road safety improvement. Studies have been conducted to evaluate the association between the setting of quantified road safety targets and road fatality reduction, in both the short and long run, by comparing road fatalities before and after the implementation of a quantified road safety target. However, not much work has been done to evaluate whether the quantified road safety targets are actually achieved. In this study, we used a binary logistic regression model to examine the factors - including vehicle ownership, fatality rate, and national income, in addition to level of ambition and duration of target - that contribute to a target's success. We analyzed 55 quantified road safety targets set by 29 countries from 1981 to 2009, and the results indicate that targets that are in progress and with lower level of ambitions had a higher likelihood of eventually being achieved. Moreover, possible interaction effects on the association between level of ambition and the likelihood of success are also revealed. Copyright © 2014 Elsevier Ltd. All rights reserved.

  2. Satellite rainfall retrieval by logistic regression

    NASA Technical Reports Server (NTRS)

    Chiu, Long S.

    1986-01-01

    The potential use of logistic regression in rainfall estimation from satellite measurements is investigated. Satellite measurements provide covariate information in terms of radiances from different remote sensors.The logistic regression technique can effectively accommodate many covariates and test their significance in the estimation. The outcome from the logistical model is the probability that the rainrate of a satellite pixel is above a certain threshold. By varying the thresholds, a rainrate histogram can be obtained, from which the mean and the variant can be estimated. A logistical model is developed and applied to rainfall data collected during GATE, using as covariates the fractional rain area and a radiance measurement which is deduced from a microwave temperature-rainrate relation. It is demonstrated that the fractional rain area is an important covariate in the model, consistent with the use of the so-called Area Time Integral in estimating total rain volume in other studies. To calibrate the logistical model, simulated rain fields generated by rainfield models with prescribed parameters are needed. A stringent test of the logistical model is its ability to recover the prescribed parameters of simulated rain fields. A rain field simulation model which preserves the fractional rain area and lognormality of rainrates as found in GATE is developed. A stochastic regression model of branching and immigration whose solutions are lognormally distributed in some asymptotic limits has also been developed.

  3. Comparison of Xenon-Enhanced Area-Detector CT and Krypton Ventilation SPECT/CT for Assessment of Pulmonary Functional Loss and Disease Severity in Smokers.

    PubMed

    Ohno, Yoshiharu; Fujisawa, Yasuko; Takenaka, Daisuke; Kaminaga, Shigeo; Seki, Shinichiro; Sugihara, Naoki; Yoshikawa, Takeshi

    2018-02-01

    The objective of this study was to compare the capability of xenon-enhanced area-detector CT (ADCT) performed with a subtraction technique and coregistered 81m Kr-ventilation SPECT/CT for the assessment of pulmonary functional loss and disease severity in smokers. Forty-six consecutive smokers (32 men and 14 women; mean age, 67.0 years) underwent prospective unenhanced and xenon-enhanced ADCT, 81m Kr-ventilation SPECT/CT, and pulmonary function tests. Disease severity was evaluated according to the Global Initiative for Chronic Obstructive Lung Disease (GOLD) classification. CT-based functional lung volume (FLV), the percentage of wall area to total airway area (WA%), and ventilated FLV on xenon-enhanced ADCT and SPECT/CT were calculated for each smoker. All indexes were correlated with percentage of forced expiratory volume in 1 second (%FEV 1 ) using step-wise regression analyses, and univariate and multivariate logistic regression analyses were performed. In addition, the diagnostic accuracy of the proposed model was compared with that of each radiologic index by means of McNemar analysis. Multivariate logistic regression showed that %FEV 1 was significantly affected (r = 0.77, r 2 = 0.59) by two factors: the first factor, ventilated FLV on xenon-enhanced ADCT (p < 0.0001); and the second factor, WA% (p = 0.004). Univariate logistic regression analyses indicated that all indexes significantly affected GOLD classification (p < 0.05). Multivariate logistic regression analyses revealed that ventilated FLV on xenon-enhanced ADCT and CT-based FLV significantly influenced GOLD classification (p < 0.0001). The diagnostic accuracy of the proposed model was significantly higher than that of ventilated FLV on SPECT/CT (p = 0.03) and WA% (p = 0.008). Xenon-enhanced ADCT is more effective than 81m Kr-ventilation SPECT/CT for the assessment of pulmonary functional loss and disease severity.

  4. Two-factor theory – at the intersection of health care management and patient satisfaction

    PubMed Central

    Bohm, Josef

    2012-01-01

    Using data obtained from the 2004 Joint Canadian/United States Survey of Health, an analytic model using principles derived from Herzberg’s motivational hygiene theory was developed for evaluating patient satisfaction with health care. The analysis sought to determine whether survey variables associated with consumer satisfaction act as Hertzberg factors and contribute to survey participants’ self-reported levels of health care satisfaction. To validate the technique, data from the survey were analyzed using logistic regression methods and then compared with results obtained from the two-factor model. The findings indicate a high degree of correlation between the two methods. The two-factor analytical methodology offers advantages due to its ability to identify whether a factor assumes a motivational or hygienic role and assesses the influence of a factor within select populations. Its ease of use makes this methodology well suited for assessment of multidimensional variables. PMID:23055755

  5. Two-factor theory - at the intersection of health care management and patient satisfaction.

    PubMed

    Bohm, Josef

    2012-01-01

    Using data obtained from the 2004 Joint Canadian/United States Survey of Health, an analytic model using principles derived from Herzberg's motivational hygiene theory was developed for evaluating patient satisfaction with health care. The analysis sought to determine whether survey variables associated with consumer satisfaction act as Hertzberg factors and contribute to survey participants' self-reported levels of health care satisfaction. To validate the technique, data from the survey were analyzed using logistic regression methods and then compared with results obtained from the two-factor model. The findings indicate a high degree of correlation between the two methods. The two-factor analytical methodology offers advantages due to its ability to identify whether a factor assumes a motivational or hygienic role and assesses the influence of a factor within select populations. Its ease of use makes this methodology well suited for assessment of multidimensional variables.

  6. Practical Session: Logistic Regression

    NASA Astrophysics Data System (ADS)

    Clausel, M.; Grégoire, G.

    2014-12-01

    An exercise is proposed to illustrate the logistic regression. One investigates the different risk factors in the apparition of coronary heart disease. It has been proposed in Chapter 5 of the book of D.G. Kleinbaum and M. Klein, "Logistic Regression", Statistics for Biology and Health, Springer Science Business Media, LLC (2010) and also by D. Chessel and A.B. Dufour in Lyon 1 (see Sect. 6 of http://pbil.univ-lyon1.fr/R/pdf/tdr341.pdf). This example is based on data given in the file evans.txt coming from http://www.sph.emory.edu/dkleinb/logreg3.htm#data.

  7. Potential habitat distribution for the freshwater diatom Didymosphenia geminata in the continental US

    USGS Publications Warehouse

    Kumar, S.; Spaulding, S.A.; Stohlgren, T.J.; Hermann, K.A.; Schmidt, T.S.; Bahls, L.L.

    2009-01-01

    The diatom Didymosphenia geminata is a single-celled alga found in lakes, streams, and rivers. Nuisance blooms of D geminata affect the diversity, abundance, and productivity of other aquatic organisms. Because D geminata can be transported by humans on waders and other gear, accurate spatial prediction of habitat suitability is urgently needed for early detection and rapid response, as well as for evaluation of monitoring and control programs. We compared four modeling methods to predict D geminata's habitat distribution; two methods use presence-absence data (logistic regression and classification and regression tree [CART]), and two involve presence data (maximum entropy model [Maxent] and genetic algorithm for rule-set production [GARP]). Using these methods, we evaluated spatially explicit, bioclimatic and environmental variables as predictors of diatom distribution. The Maxent model provided the most accurate predictions, followed by logistic regression, CART, and GARP. The most suitable habitats were predicted to occur in the western US, in relatively cool sites, and at high elevations with a high base-flow index. The results provide insights into the factors that affect the distribution of D geminata and a spatial basis for the prediction of nuisance blooms. ?? The Ecological Society of America.

  8. The cross-validated AUC for MCP-logistic regression with high-dimensional data.

    PubMed

    Jiang, Dingfeng; Huang, Jian; Zhang, Ying

    2013-10-01

    We propose a cross-validated area under the receiving operator characteristic (ROC) curve (CV-AUC) criterion for tuning parameter selection for penalized methods in sparse, high-dimensional logistic regression models. We use this criterion in combination with the minimax concave penalty (MCP) method for variable selection. The CV-AUC criterion is specifically designed for optimizing the classification performance for binary outcome data. To implement the proposed approach, we derive an efficient coordinate descent algorithm to compute the MCP-logistic regression solution surface. Simulation studies are conducted to evaluate the finite sample performance of the proposed method and its comparison with the existing methods including the Akaike information criterion (AIC), Bayesian information criterion (BIC) or Extended BIC (EBIC). The model selected based on the CV-AUC criterion tends to have a larger predictive AUC and smaller classification error than those with tuning parameters selected using the AIC, BIC or EBIC. We illustrate the application of the MCP-logistic regression with the CV-AUC criterion on three microarray datasets from the studies that attempt to identify genes related to cancers. Our simulation studies and data examples demonstrate that the CV-AUC is an attractive method for tuning parameter selection for penalized methods in high-dimensional logistic regression models.

  9. Extension of the Peters–Belson method to estimate health disparities among multiple groups using logistic regression with survey data

    PubMed Central

    Li, Y.; Graubard, B. I.; Huang, P.; Gastwirth, J. L.

    2015-01-01

    Determining the extent of a disparity, if any, between groups of people, for example, race or gender, is of interest in many fields, including public health for medical treatment and prevention of disease. An observed difference in the mean outcome between an advantaged group (AG) and disadvantaged group (DG) can be due to differences in the distribution of relevant covariates. The Peters–Belson (PB) method fits a regression model with covariates to the AG to predict, for each DG member, their outcome measure as if they had been from the AG. The difference between the mean predicted and the mean observed outcomes of DG members is the (unexplained) disparity of interest. We focus on applying the PB method to estimate the disparity based on binary/multinomial/proportional odds logistic regression models using data collected from complex surveys with more than one DG. Estimators of the unexplained disparity, an analytic variance–covariance estimator that is based on the Taylor linearization variance–covariance estimation method, as well as a Wald test for testing a joint null hypothesis of zero for unexplained disparities between two or more minority groups and a majority group, are provided. Simulation studies with data selected from simple random sampling and cluster sampling, as well as the analyses of disparity in body mass index in the National Health and Nutrition Examination Survey 1999–2004, are conducted. Empirical results indicate that the Taylor linearization variance–covariance estimation is accurate and that the proposed Wald test maintains the nominal level. PMID:25382235

  10. Forest/non-forest mapping using inventory data and satellite imagery

    Treesearch

    Ronald E. McRoberts

    2002-01-01

    For two study areas in Minnesota, USA, one heavily forested and one sparsely forested, maps of predicted proportion forest area were created using Landsat Thematic Mapper imagery, forest inventory plot data, and two prediction techniques, logistic regression and a k-Nearest Neighbours technique. The maps were used to increase the precision of forest area estimates by...

  11. Observer weighting strategies in interaural time-difference discrimination and monaural level discrimination for a multi-tone complex

    NASA Astrophysics Data System (ADS)

    Dye, Raymond H.; Stellmack, Mark A.; Jurcin, Noah F.

    2005-05-01

    Two experiments measured listeners' abilities to weight information from different components in a complex of 553, 753, and 953 Hz. The goal was to determine whether or not the ability to adjust perceptual weights generalized across tasks. Weights were measured by binary logistic regression between stimulus values that were sampled from Gaussian distributions and listeners' responses. The first task was interaural time discrimination in which listeners judged the laterality of the target component. The second task was monaural level discrimination in which listeners indicated whether the level of the target component decreased or increased across two intervals. For both experiments, each of the three components served as the target. Ten listeners participated in both experiments. The results showed that those individuals who adjusted perceptual weights in the interaural time experiment could also do so in the monaural level discrimination task. The fact that the same individuals appeared to be analytic in both tasks is an indication that the weights measure the ability to attend to a particular region of the spectrum while ignoring other spectral regions. .

  12. Reported community-level indoor residual spray coverage from two-stage cluster surveys in sub-Saharan Africa.

    PubMed

    Larsen, David A; Borrill, Lauren; Patel, Ryan; Fregosi, Lauren

    2017-06-13

    Malaria is an important cause of morbidity and mortality in malaria-endemic areas. Indoor residual spray is an effective intervention to control malaria, but high community-level coverage is needed to maximize its impact. Using thirty-four two-stage cluster surveys (e.g., demographic and health surveys) and lot quality assurance sampling, indoor residual spray was estimated at the community level (i.e. enumeration-area) across sub-Saharan Africa since 2010. For communities receiving indoor residual spray a logistic regression predicted whether community-level coverage exceeded 50% or not. Household-level coverage was equitable both in terms of wealth and urban/rural, with poorer and rural houses more likely to be sprayed than richer and urban houses. Coverage of indoor residual spray at the community level is poor across the continent, with 54% of communities receiving the intervention not reaching 50% coverage. Having >50% coverage at the community-level was not associated with increasing the number of houses sprayed in the country. Implementation and monitoring of indoor residual coverage at small geographical scales need to improve greatly to receive maximum benefit of the intervention.

  13. Principal component analysis-based pattern analysis of dose-volume histograms and influence on rectal toxicity.

    PubMed

    Söhn, Matthias; Alber, Markus; Yan, Di

    2007-09-01

    The variability of dose-volume histogram (DVH) shapes in a patient population can be quantified using principal component analysis (PCA). We applied this to rectal DVHs of prostate cancer patients and investigated the correlation of the PCA parameters with late bleeding. PCA was applied to the rectal wall DVHs of 262 patients, who had been treated with a four-field box, conformal adaptive radiotherapy technique. The correlated changes in the DVH pattern were revealed as "eigenmodes," which were ordered by their importance to represent data set variability. Each DVH is uniquely characterized by its principal components (PCs). The correlation of the first three PCs and chronic rectal bleeding of Grade 2 or greater was investigated with uni- and multivariate logistic regression analyses. Rectal wall DVHs in four-field conformal RT can primarily be represented by the first two or three PCs, which describe approximately 94% or 96% of the DVH shape variability, respectively. The first eigenmode models the total irradiated rectal volume; thus, PC1 correlates to the mean dose. Mode 2 describes the interpatient differences of the relative rectal volume in the two- or four-field overlap region. Mode 3 reveals correlations of volumes with intermediate doses ( approximately 40-45 Gy) and volumes with doses >70 Gy; thus, PC3 is associated with the maximal dose. According to univariate logistic regression analysis, only PC2 correlated significantly with toxicity. However, multivariate logistic regression analysis with the first two or three PCs revealed an increased probability of bleeding for DVHs with more than one large PC. PCA can reveal the correlation structure of DVHs for a patient population as imposed by the treatment technique and provide information about its relationship to toxicity. It proves useful for augmenting normal tissue complication probability modeling approaches.

  14. Perioperative factors associated with pressure ulcer development after major surgery.

    PubMed

    Kim, Jeong Min; Lee, Hyunjeong; Ha, Taehoon; Na, Sungwon

    2018-02-01

    Postoperative pressure ulcers are important indicators of perioperative care quality, and are serious and expensive complications during critical care. This study aimed to identify perioperative risk factors for postoperative pressure ulcers. This retrospective case-control study evaluated 2,498 patients who underwent major surgery. Forty-three patients developed postoperative pressure ulcers and were matched to 86 control patients based on age, sex, surgery, and comorbidities. The pressure ulcer group had lower baseline hemoglobin and albumin levels, compared to the control group. The pressure ulcer group also had higher values for lactate levels, blood loss, and number of packed red blood cell ( p RBC) units. Univariate analysis revealed that pressure ulcer development was associated with preoperative hemoglobin levels, albumin levels, lactate levels, intraoperative blood loss, number of p RBC units, Acute Physiologic and Chronic Health Evaluation II score, Braden scale score, postoperative ventilator care, and patient restraint. In the multiple logistic regression analysis, only preoperative low albumin levels (odds ratio [OR]: 0.21, 95% CI: 0.05-0.82; P < 0.05) and high lactate levels (OR: 1.70, 95% CI: 1.07-2.71; P < 0.05) were independently associated with pressure ulcer development. A receiver operating characteristic curve was used to assess the predictive power of the logistic regression model, and the area under the curve was 0.88 (95% CI: 0.79-0.97; P < 0.001). The present study revealed that preoperative low albumin levels and high lactate levels were significantly associated with pressure ulcer development after surgery.

  15. Nutrient intake and use of dietary supplements among US adults with disabilities.

    PubMed

    An, Ruopeng; Chiu, Chung-Yi; Andrade, Flavia

    2015-04-01

    Physical, mental, social, and financial hurdles in adults with disabilities may limit their access to adequate nutrition. To examine the impact of dietary supplement use on daily total nutrient intake levels among US adults 20 years and older with disabilities. Study sample came from 2007-2008 and 2009-2010 waves of the National Health and Nutrition Examination Survey, a nationally representative repeated cross-sectional survey. Disability was classified into 5 categories using standardized indices. Nutrient intakes from foods and dietary supplements were calculated from 2 nonconsecutive 24-hour dietary recalls. Two-sample proportion tests and multiple logistic regressions were used to examine the adherence rates to the recommended daily nutrient intake levels between dietary supplement users and nonusers in each disability category. The association between sociodemographic characteristics and dietary supplement use was assessed using multiple logistic regressions, accounting for complex survey design. A substantial proportion of the US adult population with disabilities failed to meet dietary guidelines, with insufficient intakes of multiple nutrients. Over half of the US adults with disabilities used dietary supplements. Dietary supplement use was associated with higher adherence rates for vitamin A, vitamin B1, vitamin B2, vitamin B6, vitamin B12, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, and zinc intake among adults with disabilities. Women, non-Hispanic Whites, older age, higher education, and higher household income were found to predict dietary supplement use. Proper use of dietary supplements under the guidance of health care providers may improve the nutritional status among adults with disabilities. Copyright © 2015 Elsevier Inc. All rights reserved.

  16. Relationship between attention deficit hyperactive disorder symptoms and perceived parenting practices of school-age children.

    PubMed

    Kim, Dong Hee; Yoo, Il Young

    2013-04-01

    To examine the relationship between the perception on parenting practices and attention deficit hyperactivity disorder (ADHD) symptoms in school-age children. Psychosocial attention deficit hyperactivity disorder intervention approaches emphasise environmental risk factors at the individual, family and community level. Parenting variables are strongly related to attention deficit hyperactivity disorder symptom severity. A cross-sectional questionnaire survey. The participants were 747 children and their parents in two elementary schools. The instruments used were Korean Conners Abbreviated Parent Questionnaire and Korean version Maternal Behavior Research Instrument (measuring four dimensions of parenting practices: affection, autonomy, rejection, control). Descriptive and logistic regression analyses were performed. The rejective parenting practice was statistically significant in logistic regression controlling gender and age of children, family structure, maternal education level and socio-economic status. The rejection parenting is associated with attention deficit hyperactivity disorder symptoms in children (OR=1.356). These results suggest the importance of specific parenting educational programmes for parents to prevent and decrease attention deficit hyperactivity disorder symptoms. It would be more effective rather than focusing only on the child's attention deficit hyperactivity disorder symptoms, developing educational programmes for parents to prevent rejection parenting practice and improve parenting skills in the family system. When developing a treatment programme for children with attention deficit hyperactivity disorder, healthcare providers should consider not only the child's attention deficit hyperactivity disorder symptoms, but also the parenting practices. Comprehensive interventions designed to prevent rejection and improve parenting skills may be helpful in mitigating attention deficit hyperactivity disorder symptoms. © 2012 Blackwell Publishing Ltd.

  17. A comparison between univariate probabilistic and multivariate (logistic regression) methods for landslide susceptibility analysis: the example of the Febbraro valley (Northern Alps, Italy)

    NASA Astrophysics Data System (ADS)

    Rossi, M.; Apuani, T.; Felletti, F.

    2009-04-01

    The aim of this paper is to compare the results of two statistical methods for landslide susceptibility analysis: 1) univariate probabilistic method based on landslide susceptibility index, 2) multivariate method (logistic regression). The study area is the Febbraro valley, located in the central Italian Alps, where different types of metamorphic rocks croup out. On the eastern part of the studied basin a quaternary cover represented by colluvial and secondarily, by glacial deposits, is dominant. In this study 110 earth flows, mainly located toward NE portion of the catchment, were analyzed. They involve only the colluvial deposits and their extension mainly ranges from 36 to 3173 m2. Both statistical methods require to establish a spatial database, in which each landslide is described by several parameters that can be assigned using a main scarp central point of landslide. The spatial database is constructed using a Geographical Information System (GIS). Each landslide is described by several parameters corresponding to the value of main scarp central point of the landslide. Based on bibliographic review a total of 15 predisposing factors were utilized. The width of the intervals, in which the maps of the predisposing factors have to be reclassified, has been defined assuming constant intervals to: elevation (100 m), slope (5 °), solar radiation (0.1 MJ/cm2/year), profile curvature (1.2 1/m), tangential curvature (2.2 1/m), drainage density (0.5), lineament density (0.00126). For the other parameters have been used the results of the probability-probability plots analysis and the statistical indexes of landslides site. In particular slope length (0 ÷ 2, 2 ÷ 5, 5 ÷ 10, 10 ÷ 20, 20 ÷ 35, 35 ÷ 260), accumulation flow (0 ÷ 1, 1 ÷ 2, 2 ÷ 5, 5 ÷ 12, 12 ÷ 60, 60 ÷27265), Topographic Wetness Index 0 ÷ 0.74, 0.74 ÷ 1.94, 1.94 ÷ 2.62, 2.62 ÷ 3.48, 3.48 ÷ 6,00, 6.00 ÷ 9.44), Stream Power Index (0 ÷ 0.64, 0.64 ÷ 1.28, 1.28 ÷ 1.81, 1.81 ÷ 4.20, 4.20 ÷ 9.40). Geological map and land use map were also used, considering geological and land use properties as categorical variables. Appling the univariate probabilistic method the Landslide Susceptibility Index (LSI) is defined as the sum of the ratio Ra/Rb calculated for each predisposing factor, where Ra is the ratio between number of pixel of class and the total number of pixel of the study area, and Rb is the ratio between number of landslides respect to the pixel number of the interval area. From the analysis of the Ra/Rb ratio the relationship between landslide occurrence and predisposing factors were defined. Then the equation of LSI was used in GIS to trace the landslide susceptibility maps. The multivariate method for landslide susceptibility analysis, based on logistic regression, was performed starting from the density maps of the predisposing factors, calculated with the intervals defined above using the equation Rb/Rbtot, where Rbtot is a sum of all Rb values. Using stepwise forward algorithms the logistic regression was performed in two successive steps: first a univariate logistic regression is used to choose the most significant predisposing factors, then the multivariate logistic regression can be performed. The univariate regression highlighted the importance of the following factors: elevation, accumulation flow, drainage density, lineament density, geology and land use. When the multivariate regression was applied the number of controlling factors was reduced neglecting the geological properties. The resulting final susceptibility equation is: P = 1 / (1 + exp-(6.46-22.34*elevation-5.33*accumulation flow-7.99* drainage density-4.47*lineament density-17.31*land use)) and using this equation the susceptibility maps were obtained. To easy compare the results of the two methodologies, the susceptibility maps were reclassified in five susceptibility intervals (very high, high, moderate, low and very low) using natural breaks. Then the maps were validated using two cumulative distribution curves, one related to the landslides (number of landslides in each susceptibility class) and one to the basin (number of pixel covering each class). Comparing the curves for each method, it results that the two approaches (univariate and multivariate) are appropriate, providing acceptable results. In both maps the distribution of high susceptibility condition is mainly localized on the left slope of the catchment in agreement with the field evidences. The comparison between the methods was obtained by subtraction of the two maps. This operation shows that about 40% of the basin is classified by the same class of susceptibility. In general the univariate probabilistic method tends to overestimate the areal extension of the high susceptibility class with respect to the maps obtained by the logistic regression method.

  18. Genomic-Enabled Prediction of Ordinal Data with Bayesian Logistic Ordinal Regression.

    PubMed

    Montesinos-López, Osval A; Montesinos-López, Abelardo; Crossa, José; Burgueño, Juan; Eskridge, Kent

    2015-08-18

    Most genomic-enabled prediction models developed so far assume that the response variable is continuous and normally distributed. The exception is the probit model, developed for ordered categorical phenotypes. In statistical applications, because of the easy implementation of the Bayesian probit ordinal regression (BPOR) model, Bayesian logistic ordinal regression (BLOR) is implemented rarely in the context of genomic-enabled prediction [sample size (n) is much smaller than the number of parameters (p)]. For this reason, in this paper we propose a BLOR model using the Pólya-Gamma data augmentation approach that produces a Gibbs sampler with similar full conditional distributions of the BPOR model and with the advantage that the BPOR model is a particular case of the BLOR model. We evaluated the proposed model by using simulation and two real data sets. Results indicate that our BLOR model is a good alternative for analyzing ordinal data in the context of genomic-enabled prediction with the probit or logit link. Copyright © 2015 Montesinos-López et al.

  19. Risk factors for persistent gestational trophoblastic neoplasia.

    PubMed

    Kuyumcuoglu, Umur; Guzel, Ali Irfan; Erdemoglu, Mahmut; Celik, Yusuf

    2011-01-01

    This retrospective study evaluated the risk factors for persistent gestational trophoblastic disease (GTN) and determined their odds ratios. This study included 100 cases with GTN admitted to our clinic. Possible risk factors recorded were age, gravidity, parity, size of the neoplasia, and beta-human chorionic gonadotropin levels (beta-hCG) before and after the procedure. Statistical analyses consisted of the independent sample t-test and logistic regression using the statistical package SPSS ver. 15.0 for Windows (SPSS, Chicago, IL, USA). Twenty of the cases had persistent GTN, and the differences between these and the others cases were evaluated. The size of the neoplasia and histopathological type of GTN had no statistical relationship with persistence, whereas age, gravidity, and beta-hCG levels were significant risk factors for persistent GTN (p < 0.05). The odds ratios (95% confidence interval (CI)) for age, gravidity, and pre- and post-evacuation beta-hCG levels determined using logistic regression were 4.678 (0.97-22.44), 7.315 (1.16-46.16), 2.637 (1.41-4.94), and 2.339 (1.52-3.60), respectively. Patient age, gravidity, and beta-hCG levels were risk factors for persistent GTN, whereas the size of the neoplasia and histopathological type of GTN were not significant risk factors.

  20. Serum antioxidant vitamins and the risk of oral cancer in patients seen at a tertiary institution in Nigeria.

    PubMed

    Lawal, A O; Kolude, B; Adeyemi, B F; Lawoyin, J O; Akang, E E

    2012-01-01

    Tobacco and alcohol are major risk factors of oral cancer, but nutritional deficiency may also contribute to development of oral cancer. This study compared serum antioxidant vitamin levels in oral cancer patients and controls in order to validate the role of vitamin deficiencies in the etiology of oral cancer. Serum vitamin A, C, and E levels of 33 oral cancer patients and 30 controls at University College Hospital, Ibadan, Nigeria, were determined using standard methods. The data obtained were analyzed using the Student t-test, odds ratio, and logistic regression. Mean vitamin A, C, and E levels were significantly lower in oral cancer patients (P=0.022, P=0.000, and P=0.013 respectively). Risk of oral cancer was 10.89, 11.35, and 5.6 times more in patients with low serum vitamins A, C, and E, respectively. However, on logistic regression analysis, only low serum vitamin E independently predicted occurrence of oral cancer. The lower serum vitamin A, C, and E levels in oral cancer patients could be either a cause or an effect of the oral cancer. Further studies using a larger sample size and cohort studies with long-term follow-up of subjects are desirable.

  1. Personality traits, level of anxiety and styles of coping with stressin people with asthma and chronic obstructive pulmonary disease - a comparative analysis.

    PubMed

    Tabała, Klaudia; Wrzesińska, Magdalena; Stecz, Patryk; Kocur, Józef

    2016-12-23

    Chronic obstructive pulmonary disease (COPD) and asthma are a challenge to public health, with the sufferers experiencing a range of psychological factors affecting their health and behavior. The aim of the present study was to determine the level of anxiety, personality traits and stress-coping ability of patients with obstructive lung disease and comparison with a group of healthy controls. The research was conducted on a group of 150 people with obstructive lung diseases (asthma and COPD) and healthy controls (mean age = 56.0 ± 16.00). Four surveys were used: a sociodemographic survey, NEO-FFI Personality Inventory, State-Trait Anxiety Inventory (STAI), and Brief Cope Inventory. Logistic regression was used to identify the investigated variables which best differentiated the healthy and sick individuals. Patients with asthma or COPD demonstrated a significantly lower level of conscientiousness, openness to experience, active coping and planning, as well as higher levels of neuroticism and a greater tendency to behavioral disengagement. Logistic regression found trait-anxiety, openness to experience, positive reframing, acceptance, humor and behavioral disengagement to be best at distinguishing people with lung diseases from healthy individuals. The results indicate the need for intervention in the psychological functioning of people with obstructive diseases.

  2. Factors predictive of critical value of hypocalcemia after total parathyroidectomy without autotransplantation in patients with secondary hyperparathyroidism.

    PubMed

    Yang, Meng; Zhang, Ling; Huang, Linping; Sun, Xiaoliang; Ji, Haoyang; Lu, Yao

    2016-09-01

    Severe hypocalcemia is the most dangerous complication occurring after total parathyroidectomy without autotransplantation (TPTX) for secondary hyperparathyroidism (SHPT). We aim to identify the prevalence and potential risk factors of very severe hypocalcemia in patients with SHPT undergoing TPTX. From April 2012 to August 2015, 157 patients with SHPT undergoing TPTX were reviewed. The critical value of hypocalcemia (CVH) was postoperative serum Ca(2+) levels of ≤1.5 mmol/L. Univariate analysis showed that patients in the CVH group were significantly younger than those in the non-CVH group. Sex ratio was significantly different between the two groups. The CVH group had significantly higher levels of preoperative PTH and ALP. Male sex and preoperative levels of PTH and ALP were significant independent risk factors by logistic regression analysis. Male sex, preoperative PTH and ALP were significantly associated with CVH in patients with SHPT undergoing TPTX.

  3. High stress, lack of sleep, low school performance, and suicide attempts are associated with high energy drink intake in adolescents.

    PubMed

    Kim, So Young; Sim, Songyong; Choi, Hyo Geun

    2017-01-01

    Although an association between energy drinks and suicide has been suggested, few prior studies have considered the role of emotional factors including stress, sleep, and school performance in adolescents. This study aimed to evaluate the association of energy drinks with suicide, independent of possible confounders including stress, sleep, and school performance. In total, 121,106 adolescents with 13-18 years olds from the 2014 and 2015 Korea Youth Risk Behavior Web-based Survey were surveyed for age, sex, region of residence, economic level, paternal and maternal education level, sleep time, stress level, school performance, frequency of energy drink intake, and suicide attempts. Subjective stress levels were classified into severe, moderate, mild, a little, and no stress. Sleep time was divided into 6 groups: < 6 h; 6 ≤ h < 7; 7 ≤ h < 8; 8 ≤ h < 9; and ≥ 9 h. School performance was classified into 5 levels: A (highest), B (middle, high), C (middle), D (middle, low), and E (lowest). Frequency of energy drink consumption was divided into 3 groups: ≥ 3, 1-2, and 0 times a week. The associations of sleep time, stress level, and school performance with suicide attempts and the frequency of energy drink intake were analyzed using multiple and ordinal logistic regression analysis, respectively, with complex sampling. The relationship between frequency of energy drink intake and suicide attempts was analyzed using multiple logistic regression analysis with complex sampling. Higher stress levels, lack of sleep, and low school performance were significantly associated with suicide attempts (each P < 0.001). These variables of high stress level, abnormal sleep time, and low school performance were also proportionally related with higher energy drink intake (P < 0.001). Frequent energy drink intake was significantly associated with suicide attempts in multiple logistic regression analyses (AOR for frequency of energy intake ≥ 3 times a week = 3.03, 95% CI = 2.64-3.49, P < 0.001). Severe stress, inadequate sleep, and low school performance were related with more energy drink intake and suicide attempts in Korean adolescents. Frequent energy drink intake was positively related with suicide attempts, even after adjusting for stress, sleep time, and school performance.

  4. High stress, lack of sleep, low school performance, and suicide attempts are associated with high energy drink intake in adolescents

    PubMed Central

    Kim, So Young; Sim, Songyong

    2017-01-01

    Objective Although an association between energy drinks and suicide has been suggested, few prior studies have considered the role of emotional factors including stress, sleep, and school performance in adolescents. This study aimed to evaluate the association of energy drinks with suicide, independent of possible confounders including stress, sleep, and school performance. Methods In total, 121,106 adolescents with 13–18 years olds from the 2014 and 2015 Korea Youth Risk Behavior Web-based Survey were surveyed for age, sex, region of residence, economic level, paternal and maternal education level, sleep time, stress level, school performance, frequency of energy drink intake, and suicide attempts. Subjective stress levels were classified into severe, moderate, mild, a little, and no stress. Sleep time was divided into 6 groups: < 6 h; 6 ≤ h < 7; 7 ≤ h < 8; 8 ≤ h < 9; and ≥ 9 h. School performance was classified into 5 levels: A (highest), B (middle, high), C (middle), D (middle, low), and E (lowest). Frequency of energy drink consumption was divided into 3 groups: ≥ 3, 1–2, and 0 times a week. The associations of sleep time, stress level, and school performance with suicide attempts and the frequency of energy drink intake were analyzed using multiple and ordinal logistic regression analysis, respectively, with complex sampling. The relationship between frequency of energy drink intake and suicide attempts was analyzed using multiple logistic regression analysis with complex sampling. Results Higher stress levels, lack of sleep, and low school performance were significantly associated with suicide attempts (each P < 0.001). These variables of high stress level, abnormal sleep time, and low school performance were also proportionally related with higher energy drink intake (P < 0.001). Frequent energy drink intake was significantly associated with suicide attempts in multiple logistic regression analyses (AOR for frequency of energy intake ≥ 3 times a week = 3.03, 95% CI = 2.64–3.49, P < 0.001). Conclusion Severe stress, inadequate sleep, and low school performance were related with more energy drink intake and suicide attempts in Korean adolescents. Frequent energy drink intake was positively related with suicide attempts, even after adjusting for stress, sleep time, and school performance. PMID:29135989

  5. An Investigation of the Variables Predicting Faculty of Education Students' Speaking Anxiety through Ordinal Logistic Regression Analysis

    ERIC Educational Resources Information Center

    Bozpolat, Ebru

    2017-01-01

    The purpose of this study is to determine whether Cumhuriyet University Faculty of Education students' levels of speaking anxiety are predicted by the variables of gender, department, grade, such sub-dimensions of "Speaking Self-Efficacy Scale for Pre-Service Teachers" as "public speaking," "effective speaking,"…

  6. Predicting and Managing Turnover in Human Service Agencies: A Case Study of an Organization in Crisis.

    ERIC Educational Resources Information Center

    Balfour, Danny L.; Neff, Donna M.

    1993-01-01

    A logistic regression model applied to data from 171 child service caseworkers identified variables determining job turnover during times of intense external criticism of the agency (length of service, professional commitment, level of education). A special training program did not significantly reduce the probability of turnover. (SK)

  7. Racial Threat and White Opposition to Bilingual Education in Texas

    ERIC Educational Resources Information Center

    Hempel, Lynn M.; Dowling, Julie A.; Boardman, Jason D.; Ellison, Christopher G.

    2013-01-01

    This study examines local contextual conditions that influence opposition to bilingual education among non-Hispanic Whites, net of individual-level characteristics. Data from the Texas Poll (N = 615) are used in conjunction with U.S. Census data to test five competing hypotheses using binomial and multinomial logistic regression models. Our…

  8. Analyzing Army Reserve Unsatisfactory Participants through Logistic Regression

    DTIC Science & Technology

    2012-06-08

    Unsatisfactory Participants by State/Territory ...............................45 Figure 14. Observed vs . Expected Unsatisfactory Participants by Grade...47 Figure 15. Observed vs . Expected Unsatisfactory Participants by Age ............................47 x TABLES Page Table...rank, and previous military experience. “A typical 1995-96 USAR Unsatisfactory Participant was a white, unmarried male whose highest level of

  9. The Impact of Teacher Collaboration on School Management in Canada

    ERIC Educational Resources Information Center

    Bouchamma, Yamina; Savoie, Andrea A.; Basque, Marc

    2012-01-01

    This study examined the level of collaboration between Francophone and Anglophone language teachers of 13- and 16- year-old Canadian students (N = 4,494) using data from the 2002 SAIP (School Achievement Indicators Program) of the Council of Ministers of Education of Canada. Among 32 factors, logistic regression identified six predictors of…

  10. Ethnicity and Economic Well-Being: The Case of Ghana

    ERIC Educational Resources Information Center

    Addai, Isaac; Pokimica, Jelena

    2010-01-01

    In the context of decades of successful economic reforms in Ghana, this study investigates whether ethnicity influences economic well-being (perceived and actual) among Ghanaians at the micro-level. Drawing on Afro-barometer 2008 data, the authors employs logistic and multiple regression techniques to explore the relative effect of ethnicity on…

  11. Influence over School Discipline Policy: Variation across Levels of Governance, School Contexts, and Time

    ERIC Educational Resources Information Center

    Curran, F. Chris

    2017-01-01

    Little research explores the relative influence of various stakeholders on school discipline policy. Using data from the SASS and ordered logistic regression, this study explores such influence while assessing variation across schools types and changes over time. Principals consistently rate themselves and teachers as the most influential…

  12. Student Assistance Program Outcomes for Students at Risk for Suicide

    ERIC Educational Resources Information Center

    Biddle, Virginia Sue; Kern, John, III; Brent, David A.; Thurkettle, Mary Ann; Puskar, Kathryn R.; Sekula, L. Kathleen

    2014-01-01

    Pennsylvania's response to adolescent suicide is its Student Assistance Program (SAP). SAP has been funded for 27 years although no statewide outcome studies using case-level data have been conducted. This study used logistic regression to examine drug-/alcohol-related behaviors and suspensions of suicidal students who participated in SAP. Of the…

  13. The Oklahoma's Promise Program: A National Model to Promote College Persistence

    ERIC Educational Resources Information Center

    Mendoza, Pilar; Mendez, Jesse P.

    2013-01-01

    Using a multi-method approach involving fixed effects and logistic regressions, this study examined the effect of the Oklahoma's Promise Program on student persistence in relation to the Pell and Stafford federal programs and according to socio-economic characteristics and class level. The Oklahoma's Promise is a hybrid state program that pays…

  14. A Multiple Logistic Regression Model for Predicting the Development of Phytophthora ramorum symptoms in Tanoak (Lithocarpus densiflorus)

    Treesearch

    Mark Spencer; Kevin O' Hara

    2007-01-01

    Phytophthora ramorum attacks tanoak (Lithocarpus densiflorus) in California and Oregon. We present a stand-level study examining the presence of disease symptoms in individual stems. Working with data from four plots in redwood (Sequoia sempervirens)/tanoak forests in Marin County, and three plots in Mendocino...

  15. Logistic Regression and Path Analysis Method to Analyze Factors influencing Students’ Achievement

    NASA Astrophysics Data System (ADS)

    Noeryanti, N.; Suryowati, K.; Setyawan, Y.; Aulia, R. R.

    2018-04-01

    Students' academic achievement cannot be separated from the influence of two factors namely internal and external factors. The first factors of the student (internal factors) consist of intelligence (X1), health (X2), interest (X3), and motivation of students (X4). The external factors consist of family environment (X5), school environment (X6), and society environment (X7). The objects of this research are eighth grade students of the school year 2016/2017 at SMPN 1 Jiwan Madiun sampled by using simple random sampling. Primary data are obtained by distributing questionnaires. The method used in this study is binary logistic regression analysis that aims to identify internal and external factors that affect student’s achievement and how the trends of them. Path Analysis was used to determine the factors that influence directly, indirectly or totally on student’s achievement. Based on the results of binary logistic regression, variables that affect student’s achievement are interest and motivation. And based on the results obtained by path analysis, factors that have a direct impact on student’s achievement are students’ interest (59%) and students’ motivation (27%). While the factors that have indirect influences on students’ achievement, are family environment (97%) and school environment (37).

  16. Bias in discriminating very mild dementia for older adults with different levels of education in Hong Kong.

    PubMed

    Chang, Jianfang; Tse, Chi-Shing; Leung, Grace Tak Yu; Fung, Ada Wai Tung; Hau, Kit-Tai; Chiu, Helen Fung Kum; Lam, Linda Chiu Wa

    2014-06-01

    Education has a profound effect on older adults' cognitive performance. In Hong Kong, some dementia screening tasks were originally designed for developed population with, on average, higher education. We compared the screening power of these tasks for Chinese older adults with different levels of education. Community-dwelling older adults who were healthy (N = 383) and with very mild dementia (N = 405) performed the following tasks: Mini-Mental State Examination, Alzheimer's Disease Assessment Scale-Cognitive subscales, Verbal Fluency, Abstract Thinking, and Visual/Digit Span. Logistic regression was used to examine the power of these tasks to predict Clinical Dementia Rating (CDR 0.5 vs. 0). Logistic regression analysis showed that while the screening power of the total scores in all tasks was similar for high and low education groups, there were education biases in some items of these tasks. The differential screening power in high and low education groups was not identical across items in some tasks. Thus, in cognitive assessments, we should exercise great caution when using these potentially biased items for older adults with limited education.

  17. [The effect of self-foot reflexology on the relief of premenstrual syndrome and dysmenorrhea in high school girls].

    PubMed

    Kim, Yi-Soon; Kim, Min-Za; Jeong, Ihn-Sook

    2004-08-01

    This study was aimed to identify the effect of self-foot reflexology on the relief of premenstrual syndrome and dysmenorrhea in high school girls. Study subjects was 236 women residing in the community, teachers and nurses who were older than 45 were recruited. Data was collected with self administered questionnaires from July 1st to August 31st, 2003 and analysed using SPSS/WIN 10.0 with Xtest, t-test, and stepwise multiple logistic regression at a significant level of =.05. The breast cancer screening rate was 57.2%, and repeat screening rate was 15.3%. With the multiple logistic regression analysis, factors associated with mammography screening were age and perceived barriers of action, and factors related to the repeat mammography screening were education level and other cancer screening experience. Based on the results, we recommend the development of an intervention program to decrease the perceived barrier of action, to regard mammography as an essential test in regular check-up, and to give active advertisement and education to the public to improve the rates of breast cancer screening and repeat screening.

  18. [Metabolic syndrome in workers of a second level hospital].

    PubMed

    Mathiew-Quirós, Alvaro; Salinas-Martínez, Ana María; Hernández-Herrera, Ricardo Jorge; Gallardo-Vela, José Alberto

    2014-01-01

    People with metabolic syndrome (20-25 % of the world population) are three times more likely to suffer a heart attack or stroke and twice as likely to die from this cause. The objective of this study was to assess the prevalence of metabolic syndrome in workers of a second level hospital. This was a cross-sectional study with 160 healthcare workers in Monterrey, México. Sociodemographic, anthropometric and biochemical data were obtained to assess the prevalence of metabolic syndrome. Bivariate and multiple logistic regression analysis were carried out in order to assess the relationship between metabolic syndrome and sociodemographic and occupational variables. The prevalence of metabolic syndrome among workers was 38.1 %. Nurses were more affected with 32.8 %. Overweight and obesity were prevalent in 78 %. In the logistic regression there was a significant association between metabolic syndrome and not having partner (OR 3.98, 95 % CI [1.54-10.25]) and obesity (OR 4.69, 95 % CI [1.73-12.73]). The prevalence of metabolic syndrome and obesity is alarming. Appropriate and prompt actions must be taken in order to reduce the risk of cardiovascular disease in this population.

  19. Variational dynamic background model for keyword spotting in handwritten documents

    NASA Astrophysics Data System (ADS)

    Kumar, Gaurav; Wshah, Safwan; Govindaraju, Venu

    2013-12-01

    We propose a bayesian framework for keyword spotting in handwritten documents. This work is an extension to our previous work where we proposed dynamic background model, DBM for keyword spotting that takes into account the local character level scores and global word level scores to learn a logistic regression classifier to separate keywords from non-keywords. In this work, we add a bayesian layer on top of the DBM called the variational dynamic background model, VDBM. The logistic regression classifier uses the sigmoid function to separate keywords from non-keywords. The sigmoid function being neither convex nor concave, exact inference of VDBM becomes intractable. An expectation maximization step is proposed to do approximate inference. The advantage of VDBM over the DBM is multi-fold. Firstly, being bayesian, it prevents over-fitting of data. Secondly, it provides better modeling of data and an improved prediction of unseen data. VDBM is evaluated on the IAM dataset and the results prove that it outperforms our prior work and other state of the art line based word spotting system.

  20. Dynamics of contraceptive use in India: apprehension versus future intention among non-users and traditional method users.

    PubMed

    Rai, Rajesh Kumar; Unisa, Sayeed

    2013-06-01

    This study examines the reasons for not using any method of contraception as well as reasons for not using modern methods of contraception, and factors associated with the future intention to use different types of contraceptives in India and its selected states, namely Uttar Pradesh, Assam and West Bengal. Data from the third wave of District Level Household and Facility Survey, 2007-08 were used. Bivariate as well as logistic regression analyses were performed to fulfill the study objective. Postpartum amenorrhea and breastfeeding practices were reported as the foremost causes for not using any method of contraception. Opposition to use, health concerns and fear of side effects were reported to be major hurdles in the way of using modern methods of contraception. Results from logistic regression suggest considerable variation in explaining the factors associated with future intention to use contraceptives. Promotion of health education addressing the advantages of contraceptive methods and eliminating apprehension about the use of these methods through effective communication by community level workers is the need of the hour. Copyright © 2013 Elsevier B.V. All rights reserved.

  1. Demand analysis of flood insurance by using logistic regression model and genetic algorithm

    NASA Astrophysics Data System (ADS)

    Sidi, P.; Mamat, M. B.; Sukono; Supian, S.; Putra, A. S.

    2018-03-01

    Citarum River floods in the area of South Bandung Indonesia, often resulting damage to some buildings belonging to the people living in the vicinity. One effort to alleviate the risk of building damage is to have flood insurance. The main obstacle is not all people in the Citarum basin decide to buy flood insurance. In this paper, we intend to analyse the decision to buy flood insurance. It is assumed that there are eight variables that influence the decision of purchasing flood assurance, include: income level, education level, house distance with river, building election with road, flood frequency experience, flood prediction, perception on insurance company, and perception towards government effort in handling flood. The analysis was done by using logistic regression model, and to estimate model parameters, it is done with genetic algorithm. The results of the analysis shows that eight variables analysed significantly influence the demand of flood insurance. These results are expected to be considered for insurance companies, to influence the decision of the community to be willing to buy flood insurance.

  2. Analysis of medical litigation among patients with medical disputes in cosmetic surgery in Taiwan.

    PubMed

    Lyu, Shu-Yu; Liao, Chuh-Kai; Chang, Kao-Ping; Tsai, Shang-Ta; Lee, Ming-Been; Tsai, Feng-Chou

    2011-10-01

    This study aimed to investigate the key factors in medical disputes (arguments) among female patients after cosmetic surgery in Taiwan and to explore the correlates of medical litigation. A total of 6,888 patients (3,210 patients from two hospitals and 3,678 patients from two clinics) received cosmetic surgery from January 2001 to December 2009. The inclusion criteria specified female patients with a medical dispute. Chi-square testing and multiple logistic regression analysis were used to analyze the data. Of the 43 patients who had a medical dispute (hospitals, 0.53%; clinics, 0.73%), 9 plaintiffs eventually filed suit against their plastic surgeons. Such an outcome exhibited a decreasing annual trend. The hospitals and clinics did not differ significantly in terms of patient profiles. The Chi-square test showed that most patients with a medical dispute (p < 0.05) were older than 30 years, were divorced or married, had received operations under general anesthesia, had no economic stress, had a history of medical litigation, and eventually did not sue the surgeons. The test results also showed that the surgeon's seniority and experience significantly influenced the possibility of medical dispute and nonlitigation. Multiple logistical regression analysis further showed that the patients who did decide to enter into litigation had two main related factors: marital stress (odds ratio [OR], 10.67; 95% confidence interval [CI], 1.20-94.73) and an education level below junior college (OR, 9.33; 95% CI, 1.01-86.36). The study findings suggest that the key characteristics of patients and surgeons should be taken into consideration not only in the search for ways to enhance pre- and postoperative communication but also as useful information for expert testimony in the inquisitorial law system.

  3. Vitamin D is associated with testosterone and hypogonadism in Chinese men: Results from a cross-sectional SPECT-China study.

    PubMed

    Wang, Ningjian; Han, Bing; Li, Qin; Chen, Yi; Chen, Yingchao; Xia, Fangzhen; Lin, Dongping; Jensen, Michael D; Lu, Yingli

    2015-07-16

    To date, no study has explored the association between androgen levels and 25-hydroxyvitamin D (25(OH)D) levels in Chinese men. We aimed to investigate the relationship between 25(OH)D levels and total and free testosterone (T), sex hormone binding globulin (SHBG), estradiol, and hypogonadism in Chinese men. Our data, which were based on the population, were collected from 16 sites in East China. There were 2,854 men enrolled in the study, with a mean (SD) age of 53.0 (13.5) years. Hypogonadism was defined as total T <11.3 nmol/L or free T <22.56 pmol/L. The 25(OH)D, follicle-stimulating hormone, luteinizing hormone, total T, estradiol and SHBG were measured using chemiluminescence and free T by enzyme-linked immune-sorbent assay. The associations between 25(OH)D and reproductive hormones and hypogonadism were analyzed using linear regression and binary logistic regression analyses, respectively. A total of 713 (25.0 %) men had hypogonadism with significantly lower 25(OH)D levels but greater BMI and HOMA-IR. Using linear regression, after fully adjusting for age, residence area, economic status, smoking, BMI, HOMA-IR, diabetes and systolic pressure, 25(OH)D was associated with total T and estradiol (P < 0.05). In the logistic regression analyses, increased quartiles of 25(OH)D were associated with significantly decreased odds ratios of hypogonadism (P for trend <0.01). This association, which was considerably attenuated by BMI and HOMA-IR, persisted in the fully adjusted model (P for trend <0.01) in which for the lowest compared with the highest quartile of 25(OH)D, the odds ratio of hypogonadism was 1.50 (95 % CI, 1.14, 1.97). A lower vitamin D level was associated with a higher prevalence of hypogonadism in Chinese men. This association might, in part, be explained by adiposity and insulin resistance and warrants additional investigation.

  4. Moderation analysis using a two-level regression model.

    PubMed

    Yuan, Ke-Hai; Cheng, Ying; Maxwell, Scott

    2014-10-01

    Moderation analysis is widely used in social and behavioral research. The most commonly used model for moderation analysis is moderated multiple regression (MMR) in which the explanatory variables of the regression model include product terms, and the model is typically estimated by least squares (LS). This paper argues for a two-level regression model in which the regression coefficients of a criterion variable on predictors are further regressed on moderator variables. An algorithm for estimating the parameters of the two-level model by normal-distribution-based maximum likelihood (NML) is developed. Formulas for the standard errors (SEs) of the parameter estimates are provided and studied. Results indicate that, when heteroscedasticity exists, NML with the two-level model gives more efficient and more accurate parameter estimates than the LS analysis of the MMR model. When error variances are homoscedastic, NML with the two-level model leads to essentially the same results as LS with the MMR model. Most importantly, the two-level regression model permits estimating the percentage of variance of each regression coefficient that is due to moderator variables. When applied to data from General Social Surveys 1991, NML with the two-level model identified a significant moderation effect of race on the regression of job prestige on years of education while LS with the MMR model did not. An R package is also developed and documented to facilitate the application of the two-level model.

  5. Impact of literacy and years of education on the diagnosis of dementia: A population-based study.

    PubMed

    Contador, Israel; Del Ser, Teodoro; Llamas, Sara; Villarejo, Alberto; Benito-León, Julián; Bermejo-Pareja, Félix

    2017-03-01

    The effect of different educational indices on clinical diagnosis of dementia requires more investigation. We compared the differential influence of two educational indices (EIs): years of schooling and level of education (i.e., null/low literacy, can read and write, primary school, and secondary school) on global cognition, functional performance, and the probability of having a dementia diagnosis. A total of 3,816 participants were selected from the population-based study of older adults "Neurological Disorders in Central Spain" (NEDICES). The 37-item version of the Mini-Mental State Examination (MMSE-37) and the Pfeffer's questionnaire were applied to assess cognitive and functional performance, respectively. The diagnosis of dementia was performed by expert neurologists according to Diagnostic and Statistical Manual of Mental Disorders-Fourth Edition (DSM-IV) criteria. Logistic regression models adjusted for potential confounders were carried out to test the association between the two EIs and dementia diagnosis. Both EIs were significantly associated with cognitive and functional scores, but individuals with null/low literacy performed significantly worse on MMSE-37 than literates when these groups were compared in terms of years of schooling. The two EIs were also related to an increased probability of dementia diagnosis in logistic models, but the association's strength was stronger for level of education than for years of schooling. Literacy predicted cognitive performance over and above the years of schooling. Lower education increases the probability of having a dementia diagnosis but the impact of different EIs is not uniform.

  6. Robust logistic regression to narrow down the winner's curse for rare and recessive susceptibility variants.

    PubMed

    Kesselmeier, Miriam; Lorenzo Bermejo, Justo

    2017-11-01

    Logistic regression is the most common technique used for genetic case-control association studies. A disadvantage of standard maximum likelihood estimators of the genotype relative risk (GRR) is their strong dependence on outlier subjects, for example, patients diagnosed at unusually young age. Robust methods are available to constrain outlier influence, but they are scarcely used in genetic studies. This article provides a non-intimidating introduction to robust logistic regression, and investigates its benefits and limitations in genetic association studies. We applied the bounded Huber and extended the R package 'robustbase' with the re-descending Hampel functions to down-weight outlier influence. Computer simulations were carried out to assess the type I error rate, mean squared error (MSE) and statistical power according to major characteristics of the genetic study and investigated markers. Simulations were complemented with the analysis of real data. Both standard and robust estimation controlled type I error rates. Standard logistic regression showed the highest power but standard GRR estimates also showed the largest bias and MSE, in particular for associated rare and recessive variants. For illustration, a recessive variant with a true GRR=6.32 and a minor allele frequency=0.05 investigated in a 1000 case/1000 control study by standard logistic regression resulted in power=0.60 and MSE=16.5. The corresponding figures for Huber-based estimation were power=0.51 and MSE=0.53. Overall, Hampel- and Huber-based GRR estimates did not differ much. Robust logistic regression may represent a valuable alternative to standard maximum likelihood estimation when the focus lies on risk prediction rather than identification of susceptibility variants. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  7. Polymorphisms within the FANCA gene associate with premature ovarian failure in Korean women.

    PubMed

    Pyun, Jung-A; Kim, Sunshin; Cha, Dong Hyun; Kwack, KyuBum

    2014-05-01

    This study investigated whether polymorphisms within the Fanconi anemia complementation group A (FANCA) gene contribute to the increased risk of premature ovarian failure (POF) in Korean women. Ninety-eight women with POF and 218 controls participated in this study. Genomic DNA from peripheral blood was isolated, and GoldenGate genotyping assay was used to identify single nucleotide polymorphisms (SNPs) within the FANCA gene. Two significant SNPs (rs1006547 and rs2239359; P < 0.05) were identified by logistic regression analysis, but results were insignificant after Bonferroni correction. Six SNPs formed a linkage disequilibrium block, and three main haplotypes were found. Two of three haplotypes (AAAGAA and GGGAGG) distributed highly in the POF group, whereas the remaining haplotype (GGAAGG) distributed highly in the control group by logistic regression analysis (highest odds ratio, 2.515; 95% CI, 1.515-4.175; P = 0.00036). Our observations suggest that genetic variations in the FANCA gene may increase the risk for POF in Korean women.

  8. Measurement equivalence of the KINDL questionnaire across child self-reports and parent proxy-reports: a comparison between item response theory and ordinal logistic regression.

    PubMed

    Jafari, Peyman; Sharafi, Zahra; Bagheri, Zahra; Shalileh, Sara

    2014-06-01

    Measurement equivalence is a necessary assumption for meaningful comparison of pediatric quality of life rated by children and parents. In this study, differential item functioning (DIF) analysis is used to examine whether children and their parents respond consistently to the items in the KINDer Lebensqualitätsfragebogen (KINDL; in German, Children Quality of Life Questionnaire). Two DIF detection methods, graded response model (GRM) and ordinal logistic regression (OLR), were applied for comparability. The KINDL was completed by 1,086 school children and 1,061 of their parents. While the GRM revealed that 12 out of the 24 items were flagged with DIF, the OLR identified 14 out of the 24 items with DIF. Seven items with DIF and five items without DIF were common across the two methods, yielding a total agreement rate of 50 %. This study revealed that parent proxy-reports cannot be used as a substitute for a child's ratings in the KINDL.

  9. Relationship between COMLEX-USA scores and performance on the American Osteopathic Board of Emergency Medicine Part I certifying examination.

    PubMed

    Li, Feiming; Gimpel, John R; Arenson, Ethan; Song, Hao; Bates, Bruce P; Ludwin, Fredric

    2014-04-01

    Few studies have investigated how well scores from the Comprehensive Osteopathic Medical Licensing Examination-USA (COMLEX-USA) series predict resident outcomes, such as performance on board certification examinations. To determine how well COMLEX-USA predicts performance on the American Osteopathic Board of Emergency Medicine (AOBEM) Part I certification examination. The target study population was first-time examinees who took AOBEM Part I in 2011 and 2012 with matched performances on COMLEX-USA Level 1, Level 2-Cognitive Evaluation (CE), and Level 3. Pearson correlations were computed between AOBEM Part I first-attempt scores and COMLEX-USA performances to measure the association between these examinations. Stepwise linear regression analysis was conducted to predict AOBEM Part I scores by the 3 COMLEX-USA scores. An independent t test was conducted to compare mean COMLEX-USA performances between candidates who passed and who failed AOBEM Part I, and a stepwise logistic regression analysis was used to predict the log-odds of passing AOBEM Part I on the basis of COMLEX-USA scores. Scores from AOBEM Part I had the highest correlation with COMLEX-USA Level 3 scores (.57) and slightly lower correlation with COMLEX-USA Level 2-CE scores (.53). The lowest correlation was between AOBEM Part I and COMLEX-USA Level 1 scores (.47). According to the stepwise regression model, COMLEX-USA Level 1 and Level 2-CE scores, which residency programs often use as selection criteria, together explained 30% of variance in AOBEM Part I scores. Adding Level 3 scores explained 37% of variance. The independent t test indicated that the 397 examinees passing AOBEM Part I performed significantly better than the 54 examinees failing AOBEM Part I in all 3 COMLEX-USA levels (P<.001 for all 3 levels). The logistic regression model showed that COMLEX-USA Level 1 and Level 3 scores predicted the log-odds of passing AOBEM Part I (P=.03 and P<.001, respectively). The present study empirically supported the predictive and discriminant validities of the COMLEX-USA series in relation to the AOBEM Part I certification examination. Although residency programs may use COMLEX-USA Level 1 and Level 2-CE scores as partial criteria in selecting residents, Level 3 scores, though typically not available at the time of application, are actually the most statistically related to performances on AOBEM Part I.

  10. CUSUM-Logistic Regression analysis for the rapid detection of errors in clinical laboratory test results.

    PubMed

    Sampson, Maureen L; Gounden, Verena; van Deventer, Hendrik E; Remaley, Alan T

    2016-02-01

    The main drawback of the periodic analysis of quality control (QC) material is that test performance is not monitored in time periods between QC analyses, potentially leading to the reporting of faulty test results. The objective of this study was to develop a patient based QC procedure for the more timely detection of test errors. Results from a Chem-14 panel measured on the Beckman LX20 analyzer were used to develop the model. Each test result was predicted from the other 13 members of the panel by multiple regression, which resulted in correlation coefficients between the predicted and measured result of >0.7 for 8 of the 14 tests. A logistic regression model, which utilized the measured test result, the predicted test result, the day of the week and time of day, was then developed for predicting test errors. The output of the logistic regression was tallied by a daily CUSUM approach and used to predict test errors, with a fixed specificity of 90%. The mean average run length (ARL) before error detection by CUSUM-Logistic Regression (CSLR) was 20 with a mean sensitivity of 97%, which was considerably shorter than the mean ARL of 53 (sensitivity 87.5%) for a simple prediction model that only used the measured result for error detection. A CUSUM-Logistic Regression analysis of patient laboratory data can be an effective approach for the rapid and sensitive detection of clinical laboratory errors. Published by Elsevier Inc.

  11. Dysglycemia, Glycemic Variability, and Outcome After Cardiac Arrest and Temperature Management at 33°C and 36°C.

    PubMed

    Borgquist, Ola; Wise, Matt P; Nielsen, Niklas; Al-Subaie, Nawaf; Cranshaw, Julius; Cronberg, Tobias; Glover, Guy; Hassager, Christian; Kjaergaard, Jesper; Kuiper, Michael; Smid, Ondrej; Walden, Andrew; Friberg, Hans

    2017-08-01

    Dysglycemia and glycemic variability are associated with poor outcomes in critically ill patients. Targeted temperature management alters blood glucose homeostasis. We investigated the association between blood glucose concentrations and glycemic variability and the neurologic outcomes of patients randomized to targeted temperature management at 33°C or 36°C after cardiac arrest. Post hoc analysis of the multicenter TTM-trial. Primary outcome of this analysis was neurologic outcome after 6 months, referred to as "Cerebral Performance Category." Thirty-six sites in Europe and Australia. All 939 patients with out-of-hospital cardiac arrest of presumed cardiac cause that had been included in the TTM-trial. Targeted temperature management at 33°C or 36°C. Nonparametric tests as well as multiple logistic regression and mixed effects logistic regression models were used. Median glucose concentrations on hospital admission differed significantly between Cerebral Performance Category outcomes (p < 0.0001). Hyper- and hypoglycemia were associated with poor neurologic outcome (p = 0.001 and p = 0.054). In the multiple logistic regression models, the median glycemic level was an independent predictor of poor Cerebral Performance Category (Cerebral Performance Category, 3-5) with an odds ratio (OR) of 1.13 in the adjusted model (p = 0.008; 95% CI, 1.03-1.24). It was also a predictor in the mixed model, which served as a sensitivity analysis to adjust for the multiple time points. The proportion of hyperglycemia was higher in the 33°C group compared with the 36°C group. Higher blood glucose levels at admission and during the first 36 hours, and higher glycemic variability, were associated with poor neurologic outcome and death. More patients in the 33°C treatment arm had hyperglycemia.

  12. Binary Logistic Regression Versus Boosted Regression Trees in Assessing Landslide Susceptibility for Multiple-Occurring Regional Landslide Events: Application to the 2009 Storm Event in Messina (Sicily, southern Italy).

    NASA Astrophysics Data System (ADS)

    Lombardo, L.; Cama, M.; Maerker, M.; Parisi, L.; Rotigliano, E.

    2014-12-01

    This study aims at comparing the performances of Binary Logistic Regression (BLR) and Boosted Regression Trees (BRT) methods in assessing landslide susceptibility for multiple-occurrence regional landslide events within the Mediterranean region. A test area was selected in the north-eastern sector of Sicily (southern Italy), corresponding to the catchments of the Briga and the Giampilieri streams both stretching for few kilometres from the Peloritan ridge (eastern Sicily, Italy) to the Ionian sea. This area was struck on the 1st October 2009 by an extreme climatic event resulting in thousands of rapid shallow landslides, mainly of debris flows and debris avalanches types involving the weathered layer of a low to high grade metamorphic bedrock. Exploiting the same set of predictors and the 2009 landslide archive, BLR- and BRT-based susceptibility models were obtained for the two catchments separately, adopting a random partition (RP) technique for validation; besides, the models trained in one of the two catchments (Briga) were tested in predicting the landslide distribution in the other (Giampilieri), adopting a spatial partition (SP) based validation procedure. All the validation procedures were based on multi-folds tests so to evaluate and compare the reliability of the fitting, the prediction skill, the coherence in the predictor selection and the precision of the susceptibility estimates. All the obtained models for the two methods produced very high predictive performances, with a general congruence between BLR and BRT in the predictor importance. In particular, the research highlighted that BRT-models reached a higher prediction performance with respect to BLR-models, for RP based modelling, whilst for the SP-based models the difference in predictive skills between the two methods dropped drastically, converging to an analogous excellent performance. However, when looking at the precision of the probability estimates, BLR demonstrated to produce more robust models in terms of selected predictors and coefficients, as well as of dispersion of the estimated probabilities around the mean value for each mapped pixel. The difference in the behaviour could be interpreted as the result of overfitting effects, which heavily affect decision tree classification more than logistic regression techniques.

  13. The effect of service satisfaction and spiritual well-being on the quality of life of patients with schizophrenia.

    PubMed

    Lanfredi, Mariangela; Candini, Valentina; Buizza, Chiara; Ferrari, Clarissa; Boero, Maria E; Giobbio, Gian M; Goldschmidt, Nicoletta; Greppo, Stefania; Iozzino, Laura; Maggi, Paolo; Melegari, Anna; Pasqualetti, Patrizio; Rossi, Giuseppe; de Girolamo, Giovanni

    2014-05-15

    Quality of life (QOL) has been considered an important outcome measure in psychiatric research and determinants of QOL have been widely investigated. We aimed at detecting predictors of QOL at baseline and at testing the longitudinal interrelations of the baseline predictors with QOL scores at a 1-year follow-up in a sample of patients living in Residential Facilities (RFs). Logistic regression models were adopted to evaluate the association between WHOQoL-Bref scores and potential determinants of QOL. In addition, all variables significantly associated with QOL domains in the final logistic regression model were included by using the Structural Equation Modeling (SEM). We included 139 patients with a diagnosis of schizophrenia spectrum. In the final logistic regression model level of activity, social support, age, service satisfaction, spiritual well-being and symptoms' severity were identified as predictors of QOL scores at baseline. Longitudinal analyses carried out by SEM showed that 40% of QOL follow-up variability was explained by QOL at baseline, and significant indirect effects toward QOL at follow-up were found for satisfaction with services and for social support. Rehabilitation plans for people with schizophrenia living in RFs should also consider mediators of change in subjective QOL such as satisfaction with mental health services. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  14. Can shoulder dystocia be reliably predicted?

    PubMed

    Dodd, Jodie M; Catcheside, Britt; Scheil, Wendy

    2012-06-01

    To evaluate factors reported to increase the risk of shoulder dystocia, and to evaluate their predictive value at a population level. The South Australian Pregnancy Outcome Unit's population database from 2005 to 2010 was accessed to determine the occurrence of shoulder dystocia in addition to reported risk factors, including age, parity, self-reported ethnicity, presence of diabetes and infant birth weight. Odds ratios (and 95% confidence interval) of shoulder dystocia was calculated for each risk factor, which were then incorporated into a logistic regression model. Test characteristics for each variable in predicting shoulder dystocia were calculated. As a proportion of all births, the reported rate of shoulder dystocia increased significantly from 0.95% in 2005 to 1.38% in 2010 (P = 0.0002). Using a logistic regression model, induction of labour and infant birth weight greater than both 4000 and 4500 g were identified as significant independent predictors of shoulder dystocia. The value of risk factors alone and when incorporated into the logistic regression model was poorly predictive of the occurrence of shoulder dystocia. While there are a number of factors associated with an increased risk of shoulder dystocia, none are of sufficient sensitivity or positive predictive value to allow their use clinically to reliably and accurately identify the occurrence of shoulder dystocia. © 2012 The Authors ANZJOG © 2012 The Royal Australian and New Zealand College of Obstetricians and Gynaecologists.

  15. Occupational exposure to potentially infectious biological material in a dental teaching environment.

    PubMed

    Machado-Carvalhais, Helenaura P; Ramos-Jorge, Maria L; Auad, Sheyla M; Martins, Laura H P M; Paiva, Saul M; Pordeus, Isabela A

    2008-10-01

    The aims of this cross-sectional study were to determine the prevalence of occupational accidents with exposure to biological material among undergraduate students of dentistry and to estimate potential risk factors associated with exposure to blood. Data were collected through a self-administered questionnaire (86.4 percent return rate), which was completed by a sample of 286 undergraduate dental students (mean age 22.4 +/-2.4 years). The students were enrolled in the clinical component of the curriculum, which corresponds to the final six semesters of study. Descriptive, bivariate, simple logistic regression and multiple logistic regression (Forward Stepwise Procedure) analyses were performed. The level of statistical significance was set at 5 percent. Percutaneous and mucous exposures to potentially infectious biological material were reported by 102 individuals (35.6 percent); 26.8 percent reported the occurrence of multiple episodes of exposure. The logistic regression analyses revealed that the incomplete use of individual protection equipment (OR=3.7; 95 percent CI 1.5-9.3), disciplines where surgical procedures are carried out (OR=16.3; 95 percent CI 7.1-37.2), and handling sharp instruments (OR=4.4; 95 percent CI 2.1-9.1), more specifically, hollow-bore needles (OR=6.8; 95 percent CI 2.1-19.0), were independently associated with exposure to blood. Policies of reviewing the procedures during clinical practice are recommended in order to reduce occupational exposure.

  16. An Optimal Hierarchical Decision Model for a Regional Logistics Network with Environmental Impact Consideration

    PubMed Central

    Zhang, Dezhi; Li, Shuangyan

    2014-01-01

    This paper proposes a new model of simultaneous optimization of three-level logistics decisions, for logistics authorities, logistics operators, and logistics users, for regional logistics network with environmental impact consideration. The proposed model addresses the interaction among the three logistics players in a complete competitive logistics service market with CO2 emission charges. We also explicitly incorporate the impacts of the scale economics of the logistics park and the logistics users' demand elasticity into the model. The logistics authorities aim to maximize the total social welfare of the system, considering the demand of green logistics development by two different methods: optimal location of logistics nodes and charging a CO2 emission tax. Logistics operators are assumed to compete with logistics service fare and frequency, while logistics users minimize their own perceived logistics disutility given logistics operators' service fare and frequency. A heuristic algorithm based on the multinomial logit model is presented for the three-level decision model, and a numerical example is given to illustrate the above optimal model and its algorithm. The proposed model provides a useful tool for modeling competitive logistics services and evaluating logistics policies at the strategic level. PMID:24977209

  17. An optimal hierarchical decision model for a regional logistics network with environmental impact consideration.

    PubMed

    Zhang, Dezhi; Li, Shuangyan; Qin, Jin

    2014-01-01

    This paper proposes a new model of simultaneous optimization of three-level logistics decisions, for logistics authorities, logistics operators, and logistics users, for regional logistics network with environmental impact consideration. The proposed model addresses the interaction among the three logistics players in a complete competitive logistics service market with CO2 emission charges. We also explicitly incorporate the impacts of the scale economics of the logistics park and the logistics users' demand elasticity into the model. The logistics authorities aim to maximize the total social welfare of the system, considering the demand of green logistics development by two different methods: optimal location of logistics nodes and charging a CO2 emission tax. Logistics operators are assumed to compete with logistics service fare and frequency, while logistics users minimize their own perceived logistics disutility given logistics operators' service fare and frequency. A heuristic algorithm based on the multinomial logit model is presented for the three-level decision model, and a numerical example is given to illustrate the above optimal model and its algorithm. The proposed model provides a useful tool for modeling competitive logistics services and evaluating logistics policies at the strategic level.

  18. Gene-environment interaction between adiponectin gene polymorphisms and environmental factors on the risk of diabetic retinopathy.

    PubMed

    Li, Yuan; Wu, Qun Hong; Jiao, Ming Li; Fan, Xiao Hong; Hu, Quan; Hao, Yan Hua; Liu, Ruo Hong; Zhang, Wei; Cui, Yu; Han, Li Yuan

    2015-01-01

    To evaluate whether the adiponectin gene is associated with diabetic retinopathy (DR) risk and interaction with environmental factors modifies the DR risk, and to investigate the relationship between serum adiponectin levels and DR. Four adiponectin polymorphisms were evaluated in 372 DR cases and 145 controls. Differences in environmental factors between cases and controls were evaluated by unconditional logistic regression analysis. The model-free multifactor dimensionality reduction method and traditional multiple regression models were applied to explore interactions between the polymorphisms and environmental factors. Using the Bonferroni method, we found no significant associations between four adiponectin polymorphisms and DR susceptibility. Multivariate logistic regression found that physical activity played a protective role in the progress of DR, whereas family history of diabetes (odds ratio 1.75) and insulin therapy (odds ratio 1.78) were associated with an increased risk for DR. The interaction between the C-11377 G (rs266729) polymorphism and insulin therapy might be associated with DR risk. Family history of diabetes combined with insulin therapy also increased the risk of DR. No adiponectin gene polymorphisms influenced the serum adiponectin levels. Serum adiponectin levels did not differ between the DR group and non-DR group. No significant association was identified between four adiponectin polymorphisms and DR susceptibility after stringent Bonferroni correction. The interaction between C-11377G (rs266729) polymorphism and insulin therapy, as well as the interaction between family history of diabetes and insulin therapy, might be associated with DR susceptibility.

  19. Fertility desires of Yoruba couples of South-western Nigeria.

    PubMed

    Oyediran, Kolawole Azeez

    2006-09-01

    Using the matched wife-husband (763) sample from the data collected from Ogbomoso and Iseyin towns in Oyo State, Nigeria, this paper examines factors associated with couples' fertility intention. The analysis used logistic regression models for predicting the effects of selected socioeconomic background characteristics on a couple's fertility intention. Results indicate high levels of concurrence among husbands and wives on fertility intention. Where differences exist, husbands are more pronatalists than their wives. About 87% of pairs of partners reported similar fertility preferences. Of these couples, 59.5% wanted more children while only 27.8% reported otherwise. The logistic regression models indicated that a couple's fertility intention was associated with age, education, place of residence, frequency of television-watching and number of living children. Therefore, programme interventions aimed at promoting fertility reduction in Nigeria should convey fertility regulation messages to both husbands and wives.

  20. Racial/Ethnic Disparities in Depressive Symptoms Among Pregnant Women Vary by Income and Neighborhood Poverty.

    PubMed

    Cubbin, Catherine; Heck, Katherine; Powell, Tara; Marchi, Kristen; Braveman, Paula

    2015-01-01

    We examined racial/ethnic disparities in depressive symptoms during pregnancy among a population-based sample of childbearing women in California (N = 24,587). We hypothesized that these racial/ethnic disparities would be eliminated when comparing women with similar incomes and neighborhood poverty environments. Neighborhood poverty trajectory descriptions were linked with survey data measuring age, parity, race/ethnicity, marital status, education, income, and depressive symptoms. We constructed logistic regression models among the overall sample to examine both crude and adjusted racial/ethnic disparities in feeling depressed. Next, stratified adjusted logistic regression models were constructed to examine racial/ethnic disparities in feeling depressed among women of similar income levels living in similar neighborhood poverty environments. We found that racial/ethnic disparities in feeling depressed remained only among women who were not poor themselves and who lived in long-term moderate or low poverty neighborhoods.

  1. Binary logistic regression modelling: Measuring the probability of relapse cases among drug addict

    NASA Astrophysics Data System (ADS)

    Ismail, Mohd Tahir; Alias, Siti Nor Shadila

    2014-07-01

    For many years Malaysia faced the drug addiction issues. The most serious case is relapse phenomenon among treated drug addict (drug addict who have under gone the rehabilitation programme at Narcotic Addiction Rehabilitation Centre, PUSPEN). Thus, the main objective of this study is to find the most significant factor that contributes to relapse to happen. The binary logistic regression analysis was employed to model the relationship between independent variables (predictors) and dependent variable. The dependent variable is the status of the drug addict either relapse, (Yes coded as 1) or not, (No coded as 0). Meanwhile the predictors involved are age, age at first taking drug, family history, education level, family crisis, community support and self motivation. The total of the sample is 200 which the data are provided by AADK (National Antidrug Agency). The finding of the study revealed that age and self motivation are statistically significant towards the relapse cases..

  2. Reactive oxygen metabolites (ROMs) are associated with cardiovascular disease in chronic hemodialysis patients.

    PubMed

    Bossola, Maurizio; Vulpio, Carlo; Colacicco, Luigi; Scribano, Donata; Zuppi, Cecilia; Tazza, Luigi

    2012-02-11

    The aim of our study was to measure reactive oxygen metabolites (ROMs) in chronic hemodialysis (HD) patients and evaluate the possible association with cardiovascular disease (CVD) and mortality. We measured ROMs in 76 HD patients and correlated with CVD, cardiovascular (CV) events in the follow-up and all-cause and CVD-related mortality. The levels of ROMs presented a median value of 270 (238.2-303.2) CARR U (interquartile range). We created a ROC curve (ROMs levels vs. CVD) and we identified a cut-off point of 273 CARR U. Patients with ROMs levels ≥273 CARR U were significantly older, had higher C-reactive protein levels and lower creatinine concentrations. The prevalence of CVD was higher in patients with ROMs levels ≥273 (87.1%) than in those with ROMs levels <273 CARR U (17.7%; p<0.0001). ROMs levels were significantly higher in patients with CVD (317±63.8) than in those without (242.7±49.1; p<0.0001). At multiple regression analysis, age, creatinine and C-reactive protein were independent factors associated with ROMs. At multiple logistic regression analysis the association between ROMs and CVD was independent (OR: 1.02, 95% CI: 1.00-1.05; p=0.03). Twenty six patients developed cardiovascular (CV) events during the follow-up. Of these, seven were in the group with ROMs levels <273 CARR U and 19 in the group with ROMs levels ≥273 CARR U. The logistic regression analysis showed that both age (OR: 1.06, 95% CI: 1.01-1.12; p=0.013) and ROMs levels (OR: 1.10, 95% CI: 1.00-1.02; p=0.045) were independently associated with CV events in the follow-up. ROMs are independently associated with CVD and predict CV events in chronic HD patients.

  3. Support vector machine learning model for the prediction of sentinel node status in patients with cutaneous melanoma.

    PubMed

    Mocellin, Simone; Ambrosi, Alessandro; Montesco, Maria Cristina; Foletto, Mirto; Zavagno, Giorgio; Nitti, Donato; Lise, Mario; Rossi, Carlo Riccardo

    2006-08-01

    Currently, approximately 80% of melanoma patients undergoing sentinel node biopsy (SNB) have negative sentinel lymph nodes (SLNs), and no prediction system is reliable enough to be implemented in the clinical setting to reduce the number of SNB procedures. In this study, the predictive power of support vector machine (SVM)-based statistical analysis was tested. The clinical records of 246 patients who underwent SNB at our institution were used for this analysis. The following clinicopathologic variables were considered: the patient's age and sex and the tumor's histological subtype, Breslow thickness, Clark level, ulceration, mitotic index, lymphocyte infiltration, regression, angiolymphatic invasion, microsatellitosis, and growth phase. The results of SVM-based prediction of SLN status were compared with those achieved with logistic regression. The SLN positivity rate was 22% (52 of 234). When the accuracy was > or = 80%, the negative predictive value, positive predictive value, specificity, and sensitivity were 98%, 54%, 94%, and 77% and 82%, 41%, 69%, and 93% by using SVM and logistic regression, respectively. Moreover, SVM and logistic regression were associated with a diagnostic error and an SNB percentage reduction of (1) 1% and 60% and (2) 15% and 73%, respectively. The results from this pilot study suggest that SVM-based prediction of SLN status might be evaluated as a prognostic method to avoid the SNB procedure in 60% of patients currently eligible, with a very low error rate. If validated in larger series, this strategy would lead to obvious advantages in terms of both patient quality of life and costs for the health care system.

  4. Nonconvex Sparse Logistic Regression With Weakly Convex Regularization

    NASA Astrophysics Data System (ADS)

    Shen, Xinyue; Gu, Yuantao

    2018-06-01

    In this work we propose to fit a sparse logistic regression model by a weakly convex regularized nonconvex optimization problem. The idea is based on the finding that a weakly convex function as an approximation of the $\\ell_0$ pseudo norm is able to better induce sparsity than the commonly used $\\ell_1$ norm. For a class of weakly convex sparsity inducing functions, we prove the nonconvexity of the corresponding sparse logistic regression problem, and study its local optimality conditions and the choice of the regularization parameter to exclude trivial solutions. Despite the nonconvexity, a method based on proximal gradient descent is used to solve the general weakly convex sparse logistic regression, and its convergence behavior is studied theoretically. Then the general framework is applied to a specific weakly convex function, and a necessary and sufficient local optimality condition is provided. The solution method is instantiated in this case as an iterative firm-shrinkage algorithm, and its effectiveness is demonstrated in numerical experiments by both randomly generated and real datasets.

  5. Comparison of cranial sex determination by discriminant analysis and logistic regression.

    PubMed

    Amores-Ampuero, Anabel; Alemán, Inmaculada

    2016-04-05

    Various methods have been proposed for estimating dimorphism. The objective of this study was to compare sex determination results from cranial measurements using discriminant analysis or logistic regression. The study sample comprised 130 individuals (70 males) of known sex, age, and cause of death from San José cemetery in Granada (Spain). Measurements of 19 neurocranial dimensions and 11 splanchnocranial dimensions were subjected to discriminant analysis and logistic regression, and the percentages of correct classification were compared between the sex functions obtained with each method. The discriminant capacity of the selected variables was evaluated with a cross-validation procedure. The percentage accuracy with discriminant analysis was 78.2% for the neurocranium (82.4% in females and 74.6% in males) and 73.7% for the splanchnocranium (79.6% in females and 68.8% in males). These percentages were higher with logistic regression analysis: 85.7% for the neurocranium (in both sexes) and 94.1% for the splanchnocranium (100% in females and 91.7% in males).

  6. Racial/ethnic variation in health care satisfaction: The role of acculturation.

    PubMed

    Han, Woojae; Lee, Sungkyu

    2016-10-01

    This study examined the role of acculturation and racial/ethnic variation in health care satisfaction among four different racial/ethnic groups. The study sample consisted of 41,560 adults from the 2011 California Health Interview Survey. Health care satisfaction was measured via two questions regarding doctors' listening and explanations. Guided by Andersen's behavioral model of health care use, multivariate logistic regressions were conducted. Hispanic and Asian respondents showed the lowest levels of satisfaction with their doctors' listening and explanations, respectively. Acculturation was found to be a significant predictor of health care satisfaction. Health care professionals should develop ways of expanding culturally competent health care professionals, who are aware of racial/ethnic variation in health care satisfaction.

  7. Prediction of early postoperative infections in pediatric liver transplantation by logistic regression

    NASA Astrophysics Data System (ADS)

    Uzunova, Yordanka; Prodanova, Krasimira; Spassov, Lubomir

    2016-12-01

    Orthotopic liver transplantation (OLT) is the only curative treatment for end-stage liver disease. Early diagnosis and treatment of infections after OLT are usually associated with improved outcomes. This study's objective is to identify reliable factors that can predict postoperative infectious morbidity. 27 children were included in the analysis. They underwent liver transplantation in our department. The correlation between two parameters (the level of blood glucose at 5th postoperative day and the duration of the anhepatic phase) and postoperative infections was analyzed, using univariate analysis. In this analysis, an independent predictive factor was derived which adequately identifies patients at risk of infectious complications after a liver transplantation.

  8. [Study on the correlation among adolescents' family function, negative life events stress amount and suicide ideation].

    PubMed

    Zhang, Dongdong; Chen, Ling; Yin, Dan; Miao, Jinping; Sun, Yehuan

    2014-07-01

    To explore the correlation between suicide ideation and family function & negative life events, as well as other influential factors in adolescents, thus present a theoretical base for clinicians and school staff to develop intervention for those problems. By adopting current situation random sampling method, Self-Rating Idea of Suicide Scale, Adolescent Self-Rating Life Events Check List and Family APGAR Index were used to assess adolescents at random in a hygiene vocational school in Changzhou City, Jiangsu Province and a collage in Wuhu City, Anhui Province. 3700 questionnaires were granted, 3675 questionnaires were collected, among which 3620 were valid. Chi-square test, t-test, and univariate logistic regression were employed in univariate analysis, multivariate logistic regression was used in multivariate analysis. The detection rate of suicide ideation is 7.0%, and the top five suicide ideation characteristics were: poor academic performance (33.6%), serious family functional impairment (25.8%), lower-middle academic performance (11.7%), bad economic conditions (10.8%) and study in Grade Three (9.9%). Multiple logistic regression showed that the following three high-level stress amount in negative life events are most crucial for suicide ideation. They are "relationships" (OR = 1.135, 95% CI 1.071 - 1. 202), "academic pressure" (OR = 1.169, 95% CI 1.101 - 1.241), and "external events" (OR = 1.278, 95% CI 1.187 - 1.376). What' s more, the stress of attending higher grades (OR = 1.980, 95% CI 1.302 - 3.008), poor academic performance (OR = 7.206, 95% CI 1.745 - 9.789), moderate family functional impairment (OR = 2.562, 95% CI 1.527 - 2.892) and its serious level (OR = 8.287, 95% CI 3.154 - 6.917) are also influential factors for suicide ideation. Severe family functional impairment and high-level stress amount of negative life events produced the main factors of suicide ideation. Therefore, necessary and sufficient support should be given to adolescents by families and schools.

  9. Stepwise Distributed Open Innovation Contests for Software Development: Acceleration of Genome-Wide Association Analysis

    PubMed Central

    Hill, Andrew; Loh, Po-Ru; Bharadwaj, Ragu B.; Pons, Pascal; Shang, Jingbo; Guinan, Eva; Lakhani, Karim; Kilty, Iain

    2017-01-01

    Abstract Background: The association of differing genotypes with disease-related phenotypic traits offers great potential to both help identify new therapeutic targets and support stratification of patients who would gain the greatest benefit from specific drug classes. Development of low-cost genotyping and sequencing has made collecting large-scale genotyping data routine in population and therapeutic intervention studies. In addition, a range of new technologies is being used to capture numerous new and complex phenotypic descriptors. As a result, genotype and phenotype datasets have grown exponentially. Genome-wide association studies associate genotypes and phenotypes using methods such as logistic regression. As existing tools for association analysis limit the efficiency by which value can be extracted from increasing volumes of data, there is a pressing need for new software tools that can accelerate association analyses on large genotype-phenotype datasets. Results: Using open innovation (OI) and contest-based crowdsourcing, the logistic regression analysis in a leading, community-standard genetics software package (PLINK 1.07) was substantially accelerated. OI allowed us to do this in <6 months by providing rapid access to highly skilled programmers with specialized, difficult-to-find skill sets. Through a crowd-based contest a combination of computational, numeric, and algorithmic approaches was identified that accelerated the logistic regression in PLINK 1.07 by 18- to 45-fold. Combining contest-derived logistic regression code with coarse-grained parallelization, multithreading, and associated changes to data initialization code further developed through distributed innovation, we achieved an end-to-end speedup of 591-fold for a data set size of 6678 subjects by 645 863 variants, compared to PLINK 1.07's logistic regression. This represents a reduction in run time from 4.8 hours to 29 seconds. Accelerated logistic regression code developed in this project has been incorporated into the PLINK2 project. Conclusions: Using iterative competition-based OI, we have developed a new, faster implementation of logistic regression for genome-wide association studies analysis. We present lessons learned and recommendations on running a successful OI process for bioinformatics. PMID:28327993

  10. Stepwise Distributed Open Innovation Contests for Software Development: Acceleration of Genome-Wide Association Analysis.

    PubMed

    Hill, Andrew; Loh, Po-Ru; Bharadwaj, Ragu B; Pons, Pascal; Shang, Jingbo; Guinan, Eva; Lakhani, Karim; Kilty, Iain; Jelinsky, Scott A

    2017-05-01

    The association of differing genotypes with disease-related phenotypic traits offers great potential to both help identify new therapeutic targets and support stratification of patients who would gain the greatest benefit from specific drug classes. Development of low-cost genotyping and sequencing has made collecting large-scale genotyping data routine in population and therapeutic intervention studies. In addition, a range of new technologies is being used to capture numerous new and complex phenotypic descriptors. As a result, genotype and phenotype datasets have grown exponentially. Genome-wide association studies associate genotypes and phenotypes using methods such as logistic regression. As existing tools for association analysis limit the efficiency by which value can be extracted from increasing volumes of data, there is a pressing need for new software tools that can accelerate association analyses on large genotype-phenotype datasets. Using open innovation (OI) and contest-based crowdsourcing, the logistic regression analysis in a leading, community-standard genetics software package (PLINK 1.07) was substantially accelerated. OI allowed us to do this in <6 months by providing rapid access to highly skilled programmers with specialized, difficult-to-find skill sets. Through a crowd-based contest a combination of computational, numeric, and algorithmic approaches was identified that accelerated the logistic regression in PLINK 1.07 by 18- to 45-fold. Combining contest-derived logistic regression code with coarse-grained parallelization, multithreading, and associated changes to data initialization code further developed through distributed innovation, we achieved an end-to-end speedup of 591-fold for a data set size of 6678 subjects by 645 863 variants, compared to PLINK 1.07's logistic regression. This represents a reduction in run time from 4.8 hours to 29 seconds. Accelerated logistic regression code developed in this project has been incorporated into the PLINK2 project. Using iterative competition-based OI, we have developed a new, faster implementation of logistic regression for genome-wide association studies analysis. We present lessons learned and recommendations on running a successful OI process for bioinformatics. © The Author 2017. Published by Oxford University Press.

  11. Bayesian logistic regression in detection of gene-steroid interaction for cancer at PDLIM5 locus.

    PubMed

    Wang, Ke-Sheng; Owusu, Daniel; Pan, Yue; Xie, Changchun

    2016-06-01

    The PDZ and LIM domain 5 (PDLIM5) gene may play a role in cancer, bipolar disorder, major depression, alcohol dependence and schizophrenia; however, little is known about the interaction effect of steroid and PDLIM5 gene on cancer. This study examined 47 single-nucleotide polymorphisms (SNPs) within the PDLIM5 gene in the Marshfield sample with 716 cancer patients (any diagnosed cancer, excluding minor skin cancer) and 2848 noncancer controls. Multiple logistic regression model in PLINK software was used to examine the association of each SNP with cancer. Bayesian logistic regression in PROC GENMOD in SAS statistical software, ver. 9.4 was used to detect gene- steroid interactions influencing cancer. Single marker analysis using PLINK identified 12 SNPs associated with cancer (P< 0.05); especially, SNP rs6532496 revealed the strongest association with cancer (P = 6.84 × 10⁻³); while the next best signal was rs951613 (P = 7.46 × 10⁻³). Classic logistic regression in PROC GENMOD showed that both rs6532496 and rs951613 revealed strong gene-steroid interaction effects (OR=2.18, 95% CI=1.31-3.63 with P = 2.9 × 10⁻³ for rs6532496 and OR=2.07, 95% CI=1.24-3.45 with P = 5.43 × 10⁻³ for rs951613, respectively). Results from Bayesian logistic regression showed stronger interaction effects (OR=2.26, 95% CI=1.2-3.38 for rs6532496 and OR=2.14, 95% CI=1.14-3.2 for rs951613, respectively). All the 12 SNPs associated with cancer revealed significant gene-steroid interaction effects (P < 0.05); whereas 13 SNPs showed gene-steroid interaction effects without main effect on cancer. SNP rs4634230 revealed the strongest gene-steroid interaction effect (OR=2.49, 95% CI=1.5-4.13 with P = 4.0 × 10⁻⁴ based on the classic logistic regression and OR=2.59, 95% CI=1.4-3.97 from Bayesian logistic regression; respectively). This study provides evidence of common genetic variants within the PDLIM5 gene and interactions between PLDIM5 gene polymorphisms and steroid use influencing cancer.

  12. Characteristics of insufficiently active participants that benefit from health-enhancing physical activity (HEPA) promotion programs implemented in the sports club setting.

    PubMed

    Ooms, Linda; Leemrijse, Chantal; Collard, Dorine; Schipper-van Veldhoven, Nicolette; Veenhof, Cindy

    2018-06-01

    Health-enhancing physical activity (HEPA) promotion programs are implemented in sports clubs. The purpose of this study was to examine the characteristics of the insufficiently active participants that benefit from these programs. Data of three sporting programs, developed for insufficiently active adults, were used for this study. These sporting programs were implemented in different sports clubs in the Netherlands. Participants completed an online questionnaire at baseline and after six months (n = 458). Of this sample, 35.1% (n = 161) was insufficiently active (i.e. not meeting HEPA levels) at baseline. Accordingly, two groups were compared: participants who were insufficiently active at baseline, but increased their physical activity to HEPA levels after six months (activated group, n = 86) versus participants who were insufficiently active both at baseline and after six months (non-activated group, n = 75). Potential associated characteristics (demographic, social, sport history, physical activity) were included as independent variables in bivariate and multivariate logistic regression analyses. The percentage of active participants increased significantly from baseline to six months (from 64.9 to 76.9%, p < 0.05). The bivariate logistic regression analyses showed that participants in the activated group were more likely to receive support from family members with regard to their sport participation (62.8% vs. 42.7%, p = 0.02) and spent more time in moderate-intensity physical activity (128 ± 191 min/week vs. 70 ± 106 min/week, p = 0.02) at baseline compared with participants in the non-activated group. These results were confirmed in the multivariate logistic regression analyses: when receiving support from most family members, there is a 216% increase in the odds of being in the activated group (OR = 2.155; 95% CI: 1.118-4.154, p = 0.02) and for each additional 1 min/week spent in moderate-intensity physical activity, the odds increases with 0.3% (OR = 1.003; 95% CI: 1.001-1.006, p = 0.02). The results suggest that HEPA sporting programs can be used to increase HEPA levels of insufficiently active people, but it seems a challenge to reach the least active ones. It is important that promotional strategies and channels are tailored to the target group. Furthermore, strategies that promote family support may enhance the impact of the programs.

  13. Lipoprotein lipase variants associated with an endophenotype of hypertension: hypertension combined with elevated triglycerides.

    PubMed

    Chen, Pei; Jou, Yuh-Shan; Fann, Cathy S J; Chen, Jaw-Wen; Chung, Chia-Min; Lin, Chin-Yu; Wu, Sheng-Yeu; Kang, Mei-Jyh; Chen, Ying-Chuang; Jong, Yuh-Shiun; Lo, Huey-Ming; Kang, Chih-Sen; Chen, Chien-Chung; Chang, Huan-Cheng; Huang, Nai-Kuei; Wu, Yi-Lin; Pan, Wen-Harn

    2009-01-01

    Previously, we observed that young-onset hypertension was independently associated with elevated plasma triglyceride(s) (TG) levels to a greater extent than other metabolic risk factors. Thus, focusing on the endophenotype--hypertension combined with elevated TG--we designed a family-based haplotype association study to explore its genetic connection with novel genetic variants of lipoprotein lipase gene (LPL), which encodes a major lipid metabolizing enzyme. Young-onset hypertension probands and their families were recruited, numbering 1,002 individuals from 345 families. Single-nucleotide polymorphism discovery for LPL, linkage disequilibrium (LD) analysis, transmission disequilibrium tests (TDT), bin construction, haplotype TDT association and logistic regression analysis were performed. We found that the CC- haplotype (i) spanning from intron 2 to intron 4 and the ACATT haplotype (ii) spanning from intron 5 to intron 6 were significantly associated with hypertension-related phenotypes: hypertension (ii, P=0.05), elevated TG (i, P=0.01), and hypertension combined with elevated TG (i, P=0.001; ii, P<0.0001), according to TDT. The risk of this hypertension subtype increased with the number of risk haplotypes in the two loci, using logistic regression model after adjusting within-family correlation. The relationships between LPL variants and hypertension-related disorders were also confirmed by an independent association study. Finally, we showed a trend that individuals with homozygous risk haplotypes had decreased LPL expression after a fatty meal, as opposed to those with protective haplotypes. In conclusion, this study strongly suggests that two LPL intronic variants may be associated with development of the hypertension endophenotype with elevated TG. Copyright 2008 Wiley-Liss, Inc.

  14. Deletion Diagnostics for Alternating Logistic Regressions

    PubMed Central

    Preisser, John S.; By, Kunthel; Perin, Jamie; Qaqish, Bahjat F.

    2013-01-01

    Deletion diagnostics are introduced for the regression analysis of clustered binary outcomes estimated with alternating logistic regressions, an implementation of generalized estimating equations (GEE) that estimates regression coefficients in a marginal mean model and in a model for the intracluster association given by the log odds ratio. The diagnostics are developed within an estimating equations framework that recasts the estimating functions for association parameters based upon conditional residuals into equivalent functions based upon marginal residuals. Extensions of earlier work on GEE diagnostics follow directly, including computational formulae for one-step deletion diagnostics that measure the influence of a cluster of observations on the estimated regression parameters and on the overall marginal mean or association model fit. The diagnostic formulae are evaluated with simulations studies and with an application concerning an assessment of factors associated with health maintenance visits in primary care medical practices. The application and the simulations demonstrate that the proposed cluster-deletion diagnostics for alternating logistic regressions are good approximations of their exact fully iterated counterparts. PMID:22777960

  15. Dietary protein intakes and risk of ulcerative colitis.

    PubMed

    Rashvand, Samaneh; Somi, Mohammad Hossein; Rashidkhani, Bahram; Hekmatdoost, Azita

    2015-01-01

    The incidence of ulcerative colitis (UC) is rising in populations with western-style diet, rich in fat and protein, and low in fruits and vegetables. In the present study, we aimed to evaluate the association between dietary protein intakes and the risk of developing incident UC. Sixty two cases of UC and 124 controls were studied using country-specific food frequency questionnaire (FFQ). Group comparisons by each factor were done using χ2 test, and significance level was set at α= 0.05. Logistic regression adjusted for potential confounding variables was carried out. Univariate analysis suggested positive associations between processed meat, red meat and organ meat with risk of ulcerative colitis. Comparing highest versus lowest categories of consumption, multivariate conditional logistic regression analysis accounting for potential confounding variables indicated that patients who consumed a higher amount of processed meat were at a higher risk for developing UC (P value for trend= 0.02). Similarly, patients who consumed higher amounts of red meat were at a higher risk for UC (P value for trend= 0.01). The highest tertile of intake of organ meat was associated with an increased risk of ulcerative colitis with a statistically significant trend across tertiles (P value for trend= 0.01) when adjusted. In this case-control study we observed that higher consumptions of processed meat, red meat and organ meat were associated with increased risk for UC.

  16. Contextual determinants of neonatal mortality using two analysis methods, Rio Grande do Sul, Brazil.

    PubMed

    Zanini, Roselaine Ruviaro; Moraes, Anaelena Bragança de; Giugliani, Elsa Regina Justo; Riboldi, João

    2011-02-01

    To analyze neonatal mortality determinants using multilevel logistic regression and classic hierarchical models. Cohort study including 138,407 live births with birth certificates and 1,134 neonatal deaths recorded in 2003, in the state of Rio Grande do Sul, Southern Brazil. The Information System on Live Births and mortality records were linked for gathering information on individual-level exposures. Sociodemographic data and information on the pregnancy, childbirth care and characteristics of the children at birth were collected. The associated factors were estimated and compared by traditional and multilevel logistic regression analysis. The neonatal mortality rate was 8.19 deaths per 1,000 live births. Low birth weight, 1- and 5-minute Apgar score below eight, congenital malformation, pre-term birth and previous fetal loss were associated with neonatal death in the traditional model. Elective cesarean section had a protective effect. Previous fetal loss did not remain significant in the multilevel model, but the inclusion of a contextual variable (poverty rate) showed that 15% of neonatal mortality variation can be explained by varying poverty rates in the microregions. The use of multilevel models showed a small effect of contextual determinants on the neonatal mortality rate. There was found a positive association with the poverty rate in the general model, and the proportion of households with water supply among preterm newborns.

  17. Exposure and effects of perfluoroalkyl substances in tree swallows nesting in Minnesota and Wisconsin, USA

    USGS Publications Warehouse

    Custer, Christine M.; Custer, Thomas W.; Dummer, Paul; Etterson, Matthew A.; Thogmartin, Wayne E.; Wu, Qian; Kannan, Kurunthachalam; Trowbridge, Annette; McKann, Patrick C.

    2013-01-01

    The exposure and effects of perfluoroalkyl substances (PFASs) were studied at eight locations in Minnesota and Wisconsin between 2007 and 2011 using tree swallows (Tachycineta bicolor). Concentrations of PFASs were quantified as were reproductive success end points. The sample egg method was used wherein an egg sample is collected, and the hatching success of the remaining eggs in the nest is assessed. The association between PFAS exposure and reproductive success was assessed by site comparisons, logistic regression analysis, and multistate modeling, a technique not previously used in this context. There was a negative association between concentrations of perfluorooctane sulfonate (PFOS) in eggs and hatching success. The concentration at which effects became evident (150–200 ng/g wet weight) was far lower than effect levels found in laboratory feeding trials or egg-injection studies of other avian species. This discrepancy was likely because behavioral effects and other extrinsic factors are not accounted for in these laboratory studies and the possibility that tree swallows are unusually sensitive to PFASs. The results from multistate modeling and simple logistic regression analyses were nearly identical. Multistate modeling provides a better method to examine possible effects of additional covariates and assessment of models using Akaike information criteria analyses. There was a credible association between PFOS concentrations in plasma and eggs, so extrapolation between these two commonly sampled tissues can be performed.

  18. Association of salivary levels of the bone remodelling regulators sRANKL and OPG with periodontal clinical status.

    PubMed

    Tobón-Arroyave, Sergio I; Isaza-Guzmán, Diana M; Restrepo-Cadavid, Eliana M; Zapata-Molina, Sandra M; Martínez-Pabón, María C

    2012-12-01

    To determine the variations in salivary concentrations of sRANKL, osteoprotegerin (OPG) and its ratio, regarding the periodontal status. Ninety-seven chronic periodontitis (CP) subjects and 43 healthy controls were selected. Periodontal status was assessed based on full-mouth clinical periodontal measurements. sRANKL and OPG salivary levels were analysed by ELISA. The association between these analytes and its ratio with CP was analysed individually and adjusted for confounding using a binary logistic regression model. sRANKL and sRANKL/OPG ratio were increased, whereas OPG was decreased in CP compared with healthy controls subjects. Although univariate analysis revealed a positive association of sRANKL salivary levels ≥6 pg/ml, OPG salivary levels ≤131 pg/ml and sRANKL/OPG ratio ≥0.062 with CP, after logistic regression analysis only the latter parameter was strongly and independently associated with disease status. Confounding and interaction effects of ageing and smoking habit on sRANKL and OPG levels could be noted. Although salivary concentrations of sRANKL, OPG and its ratio may act as indicators of the amount/extent of periodontal breakdown, the mutual confounding and synergistic biological interactive effects related to ageing and smoking habit of the susceptible host may also promote the tissue destruction in CP. © 2012 John Wiley & Sons A/S.

  19. Adenovirus 36 Seropositivity is Strongly Associated With Race and Gender, But Not Obesity, Among U.S. Military Personnel

    DTIC Science & Technology

    2010-01-01

    relationship between Ad-36 exposure and (1) obesity, and (2) levels of serum cholesterol and triglycerides . In this study there was no association in...value 0.0075), female gender (P-value 0.036), and a lower frequency of high levels of low- density lipoproteins (P-value 0.013). Logistic regression...levels of / cholesterol and triglycerides . There was no association in either case. Unanticipated relationships between Ad-36 exposure and age, race

  20. Process model comparison and transferability across bioreactor scales and modes of operation for a mammalian cell bioprocess.

    PubMed

    Craven, Stephen; Shirsat, Nishikant; Whelan, Jessica; Glennon, Brian

    2013-01-01

    A Monod kinetic model, logistic equation model, and statistical regression model were developed for a Chinese hamster ovary cell bioprocess operated under three different modes of operation (batch, bolus fed-batch, and continuous fed-batch) and grown on two different bioreactor scales (3 L bench-top and 15 L pilot-scale). The Monod kinetic model was developed for all modes of operation under study and predicted cell density, glucose glutamine, lactate, and ammonia concentrations well for the bioprocess. However, it was computationally demanding due to the large number of parameters necessary to produce a good model fit. The transferability of the Monod kinetic model structure and parameter set across bioreactor scales and modes of operation was investigated and a parameter sensitivity analysis performed. The experimentally determined parameters had the greatest influence on model performance. They changed with scale and mode of operation, but were easily calculated. The remaining parameters, which were fitted using a differential evolutionary algorithm, were not as crucial. Logistic equation and statistical regression models were investigated as alternatives to the Monod kinetic model. They were less computationally intensive to develop due to the absence of a large parameter set. However, modeling of the nutrient and metabolite concentrations proved to be troublesome due to the logistic equation model structure and the inability of both models to incorporate a feed. The complexity, computational load, and effort required for model development has to be balanced with the necessary level of model sophistication when choosing which model type to develop for a particular application. Copyright © 2012 American Institute of Chemical Engineers (AIChE).

  1. A logistic regression equation for estimating the probability of a stream in Vermont having intermittent flow

    USGS Publications Warehouse

    Olson, Scott A.; Brouillette, Michael C.

    2006-01-01

    A logistic regression equation was developed for estimating the probability of a stream flowing intermittently at unregulated, rural stream sites in Vermont. These determinations can be used for a wide variety of regulatory and planning efforts at the Federal, State, regional, county and town levels, including such applications as assessing fish and wildlife habitats, wetlands classifications, recreational opportunities, water-supply potential, waste-assimilation capacities, and sediment transport. The equation will be used to create a derived product for the Vermont Hydrography Dataset having the streamflow characteristic of 'intermittent' or 'perennial.' The Vermont Hydrography Dataset is Vermont's implementation of the National Hydrography Dataset and was created at a scale of 1:5,000 based on statewide digital orthophotos. The equation was developed by relating field-verified perennial or intermittent status of a stream site during normal summer low-streamflow conditions in the summer of 2005 to selected basin characteristics of naturally flowing streams in Vermont. The database used to develop the equation included 682 stream sites with drainage areas ranging from 0.05 to 5.0 square miles. When the 682 sites were observed, 126 were intermittent (had no flow at the time of the observation) and 556 were perennial (had flowing water at the time of the observation). The results of the logistic regression analysis indicate that the probability of a stream having intermittent flow in Vermont is a function of drainage area, elevation of the site, the ratio of basin relief to basin perimeter, and the areal percentage of well- and moderately well-drained soils in the basin. Using a probability cutpoint (a lower probability indicates the site has perennial flow and a higher probability indicates the site has intermittent flow) of 0.5, the logistic regression equation correctly predicted the perennial or intermittent status of 116 test sites 85 percent of the time.

  2. Logits and Tigers and Bears, Oh My! A Brief Look at the Simple Math of Logistic Regression and How It Can Improve Dissemination of Results

    ERIC Educational Resources Information Center

    Osborne, Jason W.

    2012-01-01

    Logistic regression is slowly gaining acceptance in the social sciences, and fills an important niche in the researcher's toolkit: being able to predict important outcomes that are not continuous in nature. While OLS regression is a valuable tool, it cannot routinely be used to predict outcomes that are binary or categorical in nature. These…

  3. Positive Parenting Practices Associated with Subsequent Childhood Weight Change

    ERIC Educational Resources Information Center

    Avula, Rasmi; Gonzalez, Wendy; Shapiro, Cheri J.; Fram, Maryah S.; Beets, Michael W.; Jones, Sonya J.; Blake, Christine E.; Frongillo, Edward A.

    2011-01-01

    We aimed to identify positive parenting practices that set children on differential weight-trajectories. Parenting practices studied were cognitively stimulating activities, limit-setting, disciplinary practices, and parent warmth. Data from two U.S. national longitudinal data sets and linear and logistic regression were used to examine…

  4. Men's Alcohol Expectancies at Selected Community Colleges

    ERIC Educational Resources Information Center

    Derby, Dustin C.

    2011-01-01

    Men's alcohol expectancies are an important cognitive-behavioral component of their consumption; yet, sparse research details such behaviors for men in two-year colleges. Selected for inclusion with the current study were 563 men from seven Illinois community colleges. Logistic regression analysis indicated four significant, positive relationships…

  5. Impact of Missing Data on the Detection of Differential Item Functioning: The Case of Mantel-Haenszel and Logistic Regression Analysis

    ERIC Educational Resources Information Center

    Robitzsch, Alexander; Rupp, Andre A.

    2009-01-01

    This article describes the results of a simulation study to investigate the impact of missing data on the detection of differential item functioning (DIF). Specifically, it investigates how four methods for dealing with missing data (listwise deletion, zero imputation, two-way imputation, response function imputation) interact with two methods of…

  6. Survival Data and Regression Models

    NASA Astrophysics Data System (ADS)

    Grégoire, G.

    2014-12-01

    We start this chapter by introducing some basic elements for the analysis of censored survival data. Then we focus on right censored data and develop two types of regression models. The first one concerns the so-called accelerated failure time models (AFT), which are parametric models where a function of a parameter depends linearly on the covariables. The second one is a semiparametric model, where the covariables enter in a multiplicative form in the expression of the hazard rate function. The main statistical tool for analysing these regression models is the maximum likelihood methodology and, in spite we recall some essential results about the ML theory, we refer to the chapter "Logistic Regression" for a more detailed presentation.

  7. Estimation of Recurrence of Colorectal Adenomas with Dependent Censoring Using Weighted Logistic Regression

    PubMed Central

    Hsu, Chiu-Hsieh; Li, Yisheng; Long, Qi; Zhao, Qiuhong; Lance, Peter

    2011-01-01

    In colorectal polyp prevention trials, estimation of the rate of recurrence of adenomas at the end of the trial may be complicated by dependent censoring, that is, time to follow-up colonoscopy and dropout may be dependent on time to recurrence. Assuming that the auxiliary variables capture the dependence between recurrence and censoring times, we propose to fit two working models with the auxiliary variables as covariates to define risk groups and then extend an existing weighted logistic regression method for independent censoring to each risk group to accommodate potential dependent censoring. In a simulation study, we show that the proposed method results in both a gain in efficiency and reduction in bias for estimating the recurrence rate. We illustrate the methodology by analyzing a recurrent adenoma dataset from a colorectal polyp prevention trial. PMID:22065985

  8. Elevated Fasting Blood Glucose Is Predictive of Poor Outcome in Non-Diabetic Stroke Patients: A Sub-Group Analysis of SMART.

    PubMed

    Yao, Ming; Ni, Jun; Zhou, Lixin; Peng, Bin; Zhu, Yicheng; Cui, Liying

    2016-01-01

    Although increasing evidence suggests that hyperglycemia following acute stroke adversely affects clinical outcome, whether the association between glycaemia and functional outcome varies between stroke patients with\\without pre-diagnosed diabetes remains controversial. We aimed to investigate the relationship between the fasting blood glucose (FBG) and the 6-month functional outcome in a subgroup of SMART cohort and further to assess whether this association varied based on the status of pre-diagnosed diabetes. Data of 2862 patients with acute ischemic stroke (629 with pre-diagnosed diabetics) enrolled from SMART cohort were analyzed. Functional outcome at 6-month post-stroke was measured by modified Rankin Scale (mRS) and categorized as favorable (mRS:0-2) or poor (mRS:3-5). Binary logistic regression model, adjusting for age, gender, educational level, history of hypertension and stroke, baseline NIHSS and treatment group, was used in the whole cohort to evaluate the association between admission FBG and functional outcome. Stratified logistic regression analyses were further performed based on the presence/absence of pre-diabetes history. In the whole cohort, multivariable logistical regression showed that poor functional outcome was associated with elevated FBG (OR1.21 (95%CI 1.07-1.37), p = 0.002), older age (OR1.64 (95% CI1.38-1.94), p<0.001), higher NIHSS (OR2.90 (95%CI 2.52-3.33), p<0.001) and hypertension (OR1.42 (95%CI 1.13-1.98), p = 0.04). Stratified logistical regression analysis showed that the association between FBG and functional outcome remained significant only in patients without pre-diagnosed diabetes (OR1.26 (95%CI 1.03-1.55), p = 0.023), but not in those with premorbid diagnosis of diabetes (p = 0.885). The present results demonstrate a significant association between elevated FBG after stroke and poor functional outcome in patients without pre-diagnosed diabetes, but not in diabetics. This finding confirms the importance of glycemic control during acute phase of ischemic stroke especially in patients without pre-diagnosed diabetes. Further investigation for developing optimal strategies to control blood glucose level in hyperglycemic setting is therefore of great importance. ClinicalTrials.gov NCT00664846.

  9. Suicide Risk at Young Adulthood: Continuities and Discontinuities From Adolescence

    PubMed Central

    Hooven, Carole; Snedker, Karen A.; Thompson, Elaine Adams

    2011-01-01

    Young adult suicide is an important social problem, yet little is known about how risk for young adult suicide develops from earlier life stages. In this study the authors report on 759 young adults who were potential high school dropouts as youth. At both adolescence and young adulthood, measures of suicide risk status and related suicide risk factors are collected. With a two-by-two classification on the basis of suicide risk status at both adolescence and young adulthood, the authors distinguish four mutually exclusive groups reflecting suicide risk at two life stages. Using ANOVA and logistic regression, both adolescent and young adult suicide risk factors are identified, with evidence of similarity between risk factors at adolescence and at young adulthood, for both individual-level and social-context factors. There is also support for both continuity and discontinuity of adolescent suicide risk. Implications for social policy are discussed. PMID:23129876

  10. Should waist circumference be used to identify metabolic disorders than BMI in South Korea?

    PubMed

    Lee, S-K

    2010-11-01

    Although indicators of central obesity have been suggested as a better alternative to body mass index (BMI), yet mixed results exist. This study examined whether waist circumference (WC) was better in identifying metabolic disorders than BMI at two time points. This study used nationally representative 1998 and 2005 Korea National Health and Nutrition Examination Survey data sets. Odds ratios from logistic regressions and area under the curves (AUC) were calculated. BMI and WC showed similar level of odds ratios (1.1-1.6) to diabetes, hypertension, dyslipidemia and having two or three metabolic syndrome criteria. The AUC comparison, however, indicated that, in only women, WC was a better discriminator for diabetes, hypertension and having two or three metabolic syndrome criteria. No meaningful differences were found between 1998 and 2005. Prospective studies to weigh practical and clinical relevance are needed to assert the use of WC over BMI in clinical and public health settings.

  11. Correlates of aortic stiffness progression in patients with type 2 diabetes: importance of glycemic control: the Rio de Janeiro type 2 diabetes cohort study.

    PubMed

    Ferreira, Marcel T; Leite, Nathalie C; Cardoso, Claudia R L; Salles, Gil F

    2015-05-01

    The correlates of serial changes in aortic stiffness in patients with diabetes have never been investigated. We aimed to examine the importance of glycemic control on progression/regression of carotid-femoral pulse wave velocity (cf-PWV) in type 2 diabetes. In a prospective study, two cf-PWV measurements were performed with the Complior equipment in 417 patients with type 2 diabetes over a mean follow-up of 4.2 years. Clinical laboratory data were obtained at baseline and throughout follow-up. Multivariable linear/logistic regressions assessed the independent correlates of changes in cf-PWV. Median cf-PWV increase was 0.11 m/s per year (1.1% per year). Overall, 212 patients (51%) increased/persisted with high cf-PWV, while 205 (49%) reduced/persisted with low cf-PWV. Multivariate linear regression demonstrated direct associations between cf-PWV changes and mean HbA1c during follow-up (partial correlation 0.14, P = 0.005). On logistic regression, a mean HbA1c ≥7.5% (58 mmol/mol) was associated with twofold higher odds of having increased/persistently high cf-PWV during follow-up. Furthermore, the rate of HbA1c reduction relative to baseline levels was inversely associated with cf-PWV changes (partial correlation -0.11, P = 0.011) and associated with reduced risk of having increased/persistently high aortic stiffness (odds ratio 0.82 [95% CI 0.69-0.96]; P = 0.017). Other independent correlates of progression in aortic stiffness were increases in systolic blood pressure and heart rate between the two cf-PWV measurements, older age, female sex, and presence of dyslipidemia and retinopathy. Better glycemic control, together with reductions in blood pressure and heart rate, was the most important correlate to attenuate/prevent progression of aortic stiffness in patients with type 2 diabetes. © 2015 by the American Diabetes Association. Readers may use this article as long as the work is properly cited, the use is educational and not for profit, and the work is not altered.

  12. Maternal education and age: inequalities in neonatal death.

    PubMed

    Fonseca, Sandra Costa; Flores, Patricia Viana Guimarães; Camargo, Kenneth Rochel; Pinheiro, Rejane Sobrino; Coeli, Claudia Medina

    2017-11-17

    Evaluate the interaction between maternal age and education level in neonatal mortality, as well as investigate the temporal evolution of neonatal mortality in each stratum formed by the combination of these two risk factors. A nonconcurrent cohort study, resulting from a probabilistic relationship between the Mortality Information System and the Live Birth Information System. To investigate the risk of neonatal death we performed a logistic regression, with an odds ratio estimate for the combined variable of maternal education and age, as well as the evaluation of additive and multiplicative interaction. The neonatal mortality rate time series, according to maternal education and age, was estimated by the Joinpoint Regression program. The neonatal mortality rate in the period was 8.09‰ and it was higher in newborns of mothers with low education levels: 12.7‰ (adolescent mothers) and 12.4‰ (mother 35 years old or older). Low level of education, without the age effect, increased the chance of neonatal death by 25% (OR = 1.25, 95%CI 1.14-1.36). The isolated effect of age on neonatal death was higher for adolescent mothers (OR = 1.39, 95%CI 1.33-1.46) than for mothers aged ≥ 35 years (OR = 1.16, 95%CI 1.09-1.23). In the time-trend analysis, no age group of women with low education levels presented a reduction in the neonatal mortality rate for the period, as opposed to women with intermediate or high levels of education, where the reduction was significant, around 4% annually. Two more vulnerable groups - adolescents with low levels of education and older women with low levels of education - were identified in relation to the risk of neonatal death and inequality in reducing the mortality rate.

  13. Maternal education and age: inequalities in neonatal death

    PubMed Central

    Fonseca, Sandra Costa; Flores, Patricia Viana Guimarães; Camargo, Kenneth Rochel; Pinheiro, Rejane Sobrino; Coeli, Claudia Medina

    2017-01-01

    ABSTRACT OBJECTIVE Evaluate the interaction between maternal age and education level in neonatal mortality, as well as investigate the temporal evolution of neonatal mortality in each stratum formed by the combination of these two risk factors. METHODS A nonconcurrent cohort study, resulting from a probabilistic relationship between the Mortality Information System and the Live Birth Information System. To investigate the risk of neonatal death we performed a logistic regression, with an odds ratio estimate for the combined variable of maternal education and age, as well as the evaluation of additive and multiplicative interaction. The neonatal mortality rate time series, according to maternal education and age, was estimated by the Joinpoint Regression program. RESULTS The neonatal mortality rate in the period was 8.09‰ and it was higher in newborns of mothers with low education levels: 12.7‰ (adolescent mothers) and 12.4‰ (mother 35 years old or older). Low level of education, without the age effect, increased the chance of neonatal death by 25% (OR = 1.25, 95%CI 1.14–1.36). The isolated effect of age on neonatal death was higher for adolescent mothers (OR = 1.39, 95%CI 1.33–1.46) than for mothers aged ≥ 35 years (OR = 1.16, 95%CI 1.09–1.23). In the time-trend analysis, no age group of women with low education levels presented a reduction in the neonatal mortality rate for the period, as opposed to women with intermediate or high levels of education, where the reduction was significant, around 4% annually. CONCLUSIONS Two more vulnerable groups – adolescents with low levels of education and older women with low levels of education – were identified in relation to the risk of neonatal death and inequality in reducing the mortality rate. PMID:29166446

  14. Semiparametric time varying coefficient model for matched case-crossover studies.

    PubMed

    Ortega-Villa, Ana Maria; Kim, Inyoung; Kim, H

    2017-03-15

    In matched case-crossover studies, it is generally accepted that the covariates on which a case and associated controls are matched cannot exert a confounding effect on independent predictors included in the conditional logistic regression model. This is because any stratum effect is removed by the conditioning on the fixed number of sets of the case and controls in the stratum. Hence, the conditional logistic regression model is not able to detect any effects associated with the matching covariates by stratum. However, some matching covariates such as time often play an important role as an effect modification leading to incorrect statistical estimation and prediction. Therefore, we propose three approaches to evaluate effect modification by time. The first is a parametric approach, the second is a semiparametric penalized approach, and the third is a semiparametric Bayesian approach. Our parametric approach is a two-stage method, which uses conditional logistic regression in the first stage and then estimates polynomial regression in the second stage. Our semiparametric penalized and Bayesian approaches are one-stage approaches developed by using regression splines. Our semiparametric one stage approach allows us to not only detect the parametric relationship between the predictor and binary outcomes, but also evaluate nonparametric relationships between the predictor and time. We demonstrate the advantage of our semiparametric one-stage approaches using both a simulation study and an epidemiological example of a 1-4 bi-directional case-crossover study of childhood aseptic meningitis with drinking water turbidity. We also provide statistical inference for the semiparametric Bayesian approach using Bayes Factors. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

  15. Predicting Social Trust with Binary Logistic Regression

    ERIC Educational Resources Information Center

    Adwere-Boamah, Joseph; Hufstedler, Shirley

    2015-01-01

    This study used binary logistic regression to predict social trust with five demographic variables from a national sample of adult individuals who participated in The General Social Survey (GSS) in 2012. The five predictor variables were respondents' highest degree earned, race, sex, general happiness and the importance of personally assisting…

  16. The implementation of rare events logistic regression to predict the distribution of mesophotic hard corals across the main Hawaiian Islands.

    PubMed

    Veazey, Lindsay M; Franklin, Erik C; Kelley, Christopher; Rooney, John; Frazer, L Neil; Toonen, Robert J

    2016-01-01

    Predictive habitat suitability models are powerful tools for cost-effective, statistically robust assessment of the environmental drivers of species distributions. The aim of this study was to develop predictive habitat suitability models for two genera of scleractinian corals (Leptoserisand Montipora) found within the mesophotic zone across the main Hawaiian Islands. The mesophotic zone (30-180 m) is challenging to reach, and therefore historically understudied, because it falls between the maximum limit of SCUBA divers and the minimum typical working depth of submersible vehicles. Here, we implement a logistic regression with rare events corrections to account for the scarcity of presence observations within the dataset. These corrections reduced the coefficient error and improved overall prediction success (73.6% and 74.3%) for both original regression models. The final models included depth, rugosity, slope, mean current velocity, and wave height as the best environmental covariates for predicting the occurrence of the two genera in the mesophotic zone. Using an objectively selected theta ("presence") threshold, the predicted presence probability values (average of 0.051 for Leptoseris and 0.040 for Montipora) were translated to spatially-explicit habitat suitability maps of the main Hawaiian Islands at 25 m grid cell resolution. Our maps are the first of their kind to use extant presence and absence data to examine the habitat preferences of these two dominant mesophotic coral genera across Hawai'i.

  17. A Statewide Study of Gang Membership in California Secondary Schools

    ERIC Educational Resources Information Center

    Estrada, Joey Nuñez, Jr.; Gilreath, Tamika D.; Astor, Ron Avi; Benbenishty, Rami

    2016-01-01

    To date, there is a paucity of empirical evidence that examines gang membership in schools. Using statewide data of 7th-, 9th-, and 11th-grade students from California, this study focuses on the prevalence of gang membership by county, region, ethnicity, and grade level. Bivariate and multivariate logistic regression analyses were employed with…

  18. Seasonal Variation in Physical Activity among Preschool Children in a Northern Canadian City

    ERIC Educational Resources Information Center

    Carson, Valerie; Spence, John C.; Cutumisu, Nicoleta; Boule, Normand; Edwards, Joy

    2010-01-01

    Little research has examined seasonal differences in physical activity (PA) levels among children. Proxy reports of PA were completed by 1,715 parents on their children in Edmonton, Alberta, Canada. Total PA (TPA) minutes were calculated, and each participant was classified as active, somewhat active, or inactive. Logistic regression models were…

  19. Determining the Factors of Social Phobia Levels of University Students: A Logistic Regression Analysis

    ERIC Educational Resources Information Center

    Ozen, Hamit

    2016-01-01

    Experiencing social phobia is an important factor which can hinder academic success during university years. In this study, research of social phobia with several variables is conducted among university students. The research group of the study consists of total 736 students studying at various departments at universities in Turkey. Students are…

  20. Knowledge of Millennium Development Goals among University Faculty in Uganda and Kenya

    ERIC Educational Resources Information Center

    Wamala, Robert; Nabachwa, Mary Sonko; Chamberlain, Jean; Nakalembe, Eva

    2012-01-01

    This article examines the level of knowledge of the Millennium Development Goals (MDGs) among university faculty. The assessment is based on data from 197 academic unit or faculty heads randomly selected from universities in Uganda and Kenya. Frequency distributions and logistic regression were used for analysis. Slightly more than one in three…

  1. Exploring Person Fit with an Approach Based on Multilevel Logistic Regression

    ERIC Educational Resources Information Center

    Walker, A. Adrienne; Engelhard, George, Jr.

    2015-01-01

    The idea that test scores may not be valid representations of what students know, can do, and should learn next is well known. Person fit provides an important aspect of validity evidence. Person fit analyses at the individual student level are not typically conducted and person fit information is not communicated to educational stakeholders. In…

  2. Who Stays and for How Long: Examining Attrition in Canadian Graduate Programs

    ERIC Educational Resources Information Center

    DeClou, Lindsay

    2016-01-01

    Attrition from Canadian graduate programs is a point of concern on a societal, institutional, and individual level. To improve retention in graduate school, a better understanding of what leads to withdrawal needs to be reached. This paper uses logistic regression and discrete-time survival analysis with time-varying covariates to analyze data…

  3. Escaping Poverty: Rural Low-Income Mothers' Opportunity to Pursue Post-Secondary Education

    ERIC Educational Resources Information Center

    Woodford, Michelle; Mammen, Sheila

    2010-01-01

    Using human capital theory, this paper identifies the factors that may affect the opportunity for rural low-income mothers to pursue post-secondary education or training in order to escape poverty. Dependent variables used in the logistic regression model included micro-level household variables as well as the effects of state-wide welfare…

  4. Genetic association of APOB polymorphisms with variation in serum lipid profile among the Kuwait population.

    PubMed

    Al-Bustan, Suzanne A; Alnaqeeb, Majed A; Annice, Babitha G; Ebrahim, Ghada A; Refai, Thanaa M

    2014-10-08

    Several studies have identified APOB as a candidate gene predisposing individuals to dyslipidemia. Polymorphisms including the signal peptide (rs11279109), codon 2488 XbaI (rs1042031), codon 3611 MspI (rs693), codon 4154 EcoRI (rs1801701) and the 3' variable number of tandem repeats have been reported to be associated with dyslipidemia in several populations. With limited studies on Arabs, this study aimed to investigate the genetic association of APOB polymorphisms and assess the potential influence of minor and rare alleles on serum lipid levels in the Kuwaiti population. A total of 795 Kuwaiti subjects, documented with phenotypic data and fasting serum lipid levels, were genotyped for the five polymorphisms using PCR, PCR-RFLP and gene fragment analysis. Genotype and allele association with variation in serum lipid levels as well as haplotypes were analyzed using chi-square test, univariate and logistic regression analysis. Analysis of the genotype and allele frequencies distribution revealed a significant positive association between the APOB signal peptide and 3611 MspI polymorphisms with increased levels of triglycerides (statistical power of 80%). Haplotype analysis further supported the findings by showing that carriers of haplotypes (IX-M-E+M) had significantly lower mean (SD) TG levels (0.86 ± 0.07) as compared to non-carriers (1.01 ± 0.02). Significance was also observed with regards to positive family history of hypercholesterolemia. The results imply a "protective role" for two alleles (rs11279109 and rs1801701) in which logistic regression analysis showed a significant half-fold decrease in the risk for heterozygotes of rs11279109 and an 8.8 fold decrease in the risk for homozygous M-M- of rs1801701 of having lower TG levels (<1.70 mmol/L) in individuals. This suggests that genetic interaction between various polymorphisms at different gene loci act in linkage disequilibrium to affect serum TG levels. Apo B genotyping may be a useful adjunct for the identification of individuals at risk of developing dyslipidemia in order to provide them with lifestyle modifications and/or pharmacological intervention to mitigate the effects of gene interaction and environmental influence.

  5. Isolated insular strokes and plasma MR-proANP levels are associated with newly diagnosed atrial fibrillation: a pilot study.

    PubMed

    Frontzek, Karl; Fluri, Felix; Siemerkus, Jakob; Müller, Beat; Gass, Achim; Christ-Crain, Mirjam; Katan, Mira

    2014-01-01

    In this study, we assessed the relationship of insular strokes and plasma MR-proANP levels with newly diagnosed atrial fibrillation (NDAF). This study is based on a prospective acute stroke cohort (http://www.clinicaltrials.gov, NCT00390962). Patient eligibility was dependent on the diagnosis of acute ischemic stroke, absence of previous stroke based on past medical history and MRI, no history of AF and congestive heart failure (cohort A) and, additionally, no stroke lesion size ≥ 20 mL (sub-cohort A*). AF, the primary endpoint, was detected on 24-hour electrocardiography and/or echocardiography. Involvement of the insula was assessed by two experienced readers on MRI blinded to clinical data. MR-proANP levels were obtained through a novel sandwich immunoassay. Logistic-regression-models were fitted to estimate odds ratios for the association of insular strokes and MR-proANP with NDAF. The discriminatory accuracy of insular strokes and MR-proANP was assessed by a model-wise comparison of the area under the receiver-operating-characteristics-curve (AUC) with known predictors of AF. 104 (cohort A) and 83 (cohort A*) patients fulfilled above-mentioned criteria. Patients with isolated insular strokes had a 10.7-fold higher odds of NDAF than patients with a small ischemic stroke at any other location. The AUC of multivariate logistic regression models for the prediction of NDAF improved significantly when adding stroke location and MR-proANP levels. Moreover, MR-proANP levels remained significantly elevated throughout the acute hospitalization period in patients with NDAF compared to those without. Isolated insular strokes and plasma MR-proANP levels on admission are independent predictors of NDAF and significantly improve the prediction accuracy of identifying patients with NDAF compared to known predictors including age, the NIHSS and lesion size. To accelerate accurate diagnosis and enhance secondary prevention in acute stroke, higher levels of MR-proANP and insular strokes may represent easily accessible indicators of AF if confirmed in an independent validation cohort.

  6. Comparing Revictimization in Two Groups of Marginalized Women

    ERIC Educational Resources Information Center

    Tusher, Chantal Poister; Cook, Sarah L.

    2010-01-01

    This study examines physical and sexual revictimization in a random sample of incarcerated and poor, urban, nonincarcerated women using multiple measures of physical and sexual child abuse. Researchers used hierarchical logistic regression to compare rates of revictimization and the strength of the association between child abuse and adult…

  7. Predictors of Employment Outcomes for State-Federal Vocational Rehabilitation Consumers with HIV/AIDS

    ERIC Educational Resources Information Center

    Jung, Youngoh; Schaller, James; Bellini, James

    2010-01-01

    In this study, the authors investigated the effects of demographic, medical, and vocational rehabilitation service variables on employment outcomes of persons living with HIV/AIDS. Binary logistic regression analyses were conducted to determine predictors of employment outcomes using two groups drawn from Rehabilitation Services Administration…

  8. Juvenile Offender Recidivism: An Examination of Risk Factors

    ERIC Educational Resources Information Center

    Calley, Nancy G.

    2012-01-01

    One hundred and seventy three male juvenile offenders were followed two years postrelease from a residential treatment facility to assess recidivism and factors related to recidivism. The overall recidivism rate was 23.9%. Logistic regression with stepwise and backward variable selection methods was used to examine the relationship between…

  9. Predicting the US Drought Monitor (USDM) using precipitation, soil noisture, and evapotranspiration anomalies, Part II: Intraseasonal drought intensification forecasts

    USDA-ARS?s Scientific Manuscript database

    Probabilistic forecasts of US Drought Monitor (USDM) intensification over two, four and eight week time periods are developed based on recent anomalies in precipitation, evapotranspiration and soil moisture. These statistical forecasts are computed using logistic regression with cross validation. Wh...

  10. EVALUATING THE ROLE OF HABITAT QUALITY ON ESTABLISHMENT OF GM AGROSTIS STOLONIFERA PLANTS IN NON-AGRONOMIC SETTINGS

    EPA Science Inventory

    We compared soil chemistry and plant community data at non-agronomic mesic locations that either did or did not contain genetically modified (GM) Agrostis stolonifera. The best two-variable logistic regression model included soil Mn content and A. stolonifera cover and explained...

  11. Modeling recreation participants' willingness to substitute using multi-attribute indicators

    Treesearch

    Yung-Ping (Emilio) Tseng; Robert B. Ditton

    2008-01-01

    A logistic regression was used to predict anglers' resource-substitution decisions based on three dimensions of recreation specialization (behavior, skill and knowledge, and commitment), two dimensions of place attachment (place identity and place dependence), and three demographic indicators. Results indicated that place dependence was the most effective...

  12. Visual grading characteristics and ordinal regression analysis during optimisation of CT head examinations.

    PubMed

    Zarb, Francis; McEntee, Mark F; Rainford, Louise

    2015-06-01

    To evaluate visual grading characteristics (VGC) and ordinal regression analysis during head CT optimisation as a potential alternative to visual grading assessment (VGA), traditionally employed to score anatomical visualisation. Patient images (n = 66) were obtained using current and optimised imaging protocols from two CT suites: a 16-slice scanner at the national Maltese centre for trauma and a 64-slice scanner in a private centre. Local resident radiologists (n = 6) performed VGA followed by VGC and ordinal regression analysis. VGC alone indicated that optimised protocols had similar image quality as current protocols. Ordinal logistic regression analysis provided an in-depth evaluation, criterion by criterion allowing the selective implementation of the protocols. The local radiology review panel supported the implementation of optimised protocols for brain CT examinations (including trauma) in one centre, achieving radiation dose reductions ranging from 24 % to 36 %. In the second centre a 29 % reduction in radiation dose was achieved for follow-up cases. The combined use of VGC and ordinal logistic regression analysis led to clinical decisions being taken on the implementation of the optimised protocols. This improved method of image quality analysis provided the evidence to support imaging protocol optimisation, resulting in significant radiation dose savings. • There is need for scientifically based image quality evaluation during CT optimisation. • VGC and ordinal regression analysis in combination led to better informed clinical decisions. • VGC and ordinal regression analysis led to dose reductions without compromising diagnostic efficacy.

  13. [Analysis for related factors of upper urinary tract deterioration in patients with spinal cord injury].

    PubMed

    Jing, Hua-fang; Liao, Li-min; Fu, Guang; Wu, Juan; Ju, Yan-he; Chen, Guo-qing

    2014-08-18

    To evaluate the related factors of upper urinary tract deterioration in spinal cord injured patients. Medical records of spinal cord injured patients from Jan.2002 to Sep.2009 were retrospectively reviewed. All the patients were divided into the upper urinary tract deterioration group and non-deterioration group according to the diagnostic criteria. Indexes such as demographic characteristic (gender, age), spinal cord injury information (cause, level, completeness), statuses of urinary tract system (bladder management, urine routine, urine culture, ultrasound, serum creatinine, fever caused by urinary tract infection) and urodynamics information(bladder compliance, bladder stability, bladder sensation, detrusor sphincter dyssynergia, detrusor leak point pressure, maximum cystometric capacity, relative safe bladder capacity, maximum flow rate, maximum urethra closure pressure) were compared between the two groups.Then Logistic regression analysis were performed. There was significantly difference between the two groups in spinal cord injury level(χ(2) = 8.840, P = 0.031),bladder management(χ(2) = 11.362, P = 0.045), urinary rutine(χ(2) = 17.983, P = 0.000), fever caused by urinary tract infection(χ(2)= 64.472, P = 0.000), bladder compliance(χ(2) = 6.531, P = 0.011), bladder sensation(χ(2) = 11.505, P = 0.009), maximum cystometric capacity(t = 2.209, P = 0.043), and detrusor-sphincter dyssynergia(χ(2) = 4.247, P = 0.039). The multiple-factor non-conditional Logistic regression analysis showed that bladder management (OR = 1.114, P = 0.006), fever caused by urinary tract infection(OR = 1.018,P = 0.000), bladder compliance (OR = 1.588, P = 0.040) and detrusor-sphincter dyssynergia(OR = 1.023, P = 0.034) were the key factors of upper urinary tract deterioration in spinal cord injured patients. Urinary tract infection, lower bladder compliance, detrusor-sphincter dyssynergia and unreasonable bladder management are the risk factors of upper urinary tract deterioration in spinal cord injured patients.

  14. A Cross-border Comparison of Hepatitis B Testing Among Chinese Residing in Canada and the United States

    PubMed Central

    Tu, Shin-Ping; Li, Lin; Tsai, Jenny Hsin-Chun; Yip, Mei-Po; Terasaki, Genji; Teh, Chong; Yasui, Yutaka; Hislop, T Gregory; Taylor, Vicky

    2013-01-01

    Background The Western Pacific region has the highest level of endemic hepatitis B virus (HBV) infection in the world, with the Chinese representing nearly one-third of infected persons globally. HBV carriers are potentially infectious to others and have an increased risk of chronic active hepatitis, cirrhosis, and hepatocellular carcinoma. Studies from the U.S. and Canada demonstrate that immigrants, particularly from Asia, are disproportionately affected by liver cancer. Purpose Given the different health care systems in Seattle and Vancouver, two geographically proximate cities, we examined HBV testing levels and factors associated with testing among Chinese residents of these cities. Methods We surveyed Chinese living in areas of Seattle and Vancouver with relatively high proportions of Chinese residents. In-person interviews were conducted in Cantonese, Mandarin, or English. Our bivariate analyses consisted of the chi-square test, with Fisher’s Exact test as necessary. We then performed unconditional logistic regression, first examining only the city effect as the sole explanatory variable of the model, then assessing the adjusted city effect in a final main-effects model that was constructed through backward selection to select statistically significant variables at alpha = 0.05. Results Survey cooperation rates for Seattle and Vancouver were 58% and 59%, respectively. In Seattle, 48% reported HBV testing, whereas in Vancouver, 55% reported testing. HBV testing in Seattle was lower than in Vancouver, with a crude odds ratio of 0.73 (95% CI = 0.56, 0.94). However after adjusting for demographic, health care access, knowledge, and social support variables, we found no significant differences in HBV testing between the two cities. In our logistic regression model, the odds of HBV testing were greatest when the doctor recommended the test, followed by when the employer asked for the test. Discussion Findings from this study support the need for additional research to examine the effectiveness of clinic-based and workplace interventions to promote HBV testing among immigrants to North America. PMID:19640196

  15. A cross-border comparison of hepatitis B testing among chinese residing in Canada and the United States.

    PubMed

    Tu R, Shin-Ping; Li, Lin; Tsai, Jenny Hsin-Chun; Yip, Mei-Po; Terasaki, Genji; Teh, Chong; Yasui, Yutaka; Hislop, T Gregory; Taylor, Vicky

    2009-01-01

    The Western Pacific region has the highest level of endemic hepatitis B virus (HBV) infection in the world, with the Chinese representing nearly one-third of infected persons globally. HBV carriers are potentially infectious to others and have an increased risk of chronic active hepatitis, cirrhosis, and hepatocellular carcinoma. Studies from the U.S. and Canada demonstrate that immigrants, particularly from Asia, are disproportionately affected by liver cancer. Given the different health care systems in Seattle and Vancouver, two geographically proximate cities, we examined HBV testing levels and factors associated with testing among Chinese residents of these cities. We surveyed Chinese living in areas of Seattle and Vancouver with relatively high proportions of Chinese residents. In-person interviews were conducted in Cantonese, Mandarin, or English. Our bivariate analyses consisted of the chi-square test, with Fisher's Exact test as necessary. We then performed unconditional logistic regression, first examining only the city effect as the sole explanatory variable of the model, then assessing the adjusted city effect in a final main-effects model that was constructed through backward selection to select statistically significant variables at alpha=0.05. Survey cooperation rates for Seattle and Vancouver were 58% and 59%, respectively. In Seattle, 48% reported HBV testing, whereas in Vancouver, 55% reported testing. HBV testing in Seattle was lower than in Vancouver, with a crude odds ratio of 0.73 (95% CI = 0.56, 0.94). However after adjusting for demographic, health care access, knowledge, and social support variables, we found no significant differences in HBV testing between the two cities. In our logistic regression model, the odds of HBV testing were greatest when the doctor recommended the test, followed by when the employer asked for the test. Findings from this study support the need for additional research to examine the effectiveness of clinic-based and workplace interventions to promote HBV testing among immigrants to North America.

  16. Sexual satisfaction in females with premenstrual symptoms.

    PubMed

    Nowosielski, Krzysztof; Drosdzol, Agnieszka; Skrzypulec, Violetta; Plinta, Ryszard

    2010-11-01

    The impact of premenstrual symptoms, such as the premenstrual syndrome (PMS) and the premenstrual dysphoric disorder (PMDD), on sexual satisfaction, sexual distress, and sexual behaviors has not yet been established. To assess the correlates and risk factors of sexual satisfaction and to evaluate sexual behaviors among Polish women with premenstrual symptoms. 2,500 females, aged 18 to 45 years, from the Upper Silesian region of Poland were eligible for the questionnaire-based, prospective population study. All the inclusion criteria were met by 1,540 women who constituted the final study group. The participants were further divided into two subgroups: PMS+ (749 females) and PMS- (791 healthy subjects). Two additional subgroups were created: PMDD+ encompassing 32 subjects diagnosed with PMDD, and PMDD- comprising 32 healthy women, matched to the PMDD+ females for age, marital status, education level, employment status, place of living, and body mass index. A multiple logistic regression analysis was performed to evaluate the influence of PMS on sexual satisfaction and adjust for potential confounders. To evaluate risk factors for sexual dissatisfaction in a population of Polish females of reproductive age, diagnosed with PMS and PMDD. Women from the PMS+ group were less sexually satisfied than PMS- (77.73% vs. 88.66%, P=0.001) and reported more sexual distress (28.65% vs. 15.24%, P=0.001). There were no significant differences in sexual satisfaction between PMDD- and PMDD+. Sexual satisfaction correlated positively with a higher frequency of sexual intercourses and a higher level of education. The presence of PMS correlated negatively with sexual satisfaction, even after adjusting for potential confounders in the multivariate logistic regression model (odds ratio=0.48; confidence interval: 0.26-0.89; P=0.02). The presence of PMS is a risk factor for sexual dissatisfaction in Polish women of reproductive age. © 2010 International Society for Sexual Medicine.

  17. THE CONSEQUENCES OF INDIA’S MALE SURPLUS FOR WOMEN’S PARTNERING AND SEXUAL EXPERIENCES*

    PubMed Central

    Trent, Katherine; South, Scott J.; Bose, Sunita

    2013-01-01

    Data from the third wave of India’s 2005–2006 National Family and Health Survey are used to examine the influence of the community-level sex ratio on several dimensions of women’s partnering behavior and sexual experiences. Multi-level logistic regression models that control for individual demographic attributes and community-level characteristics reveal that the local male-to-female sex ratio is positively and significantly associated with the likelihood that women marry prior to age 16 and have experienced forced sex. These associations are modest in magnitude. However, no significant associations are observed between the sex ratio and whether women have had two or more lifetime sexual partners or women’s risk of contracting a sexually-transmitted disease. Birth cohort, education, religion, caste, region, urban residence, and several community-level measures of women’s status also emerge as significant predictors of Indian women’s partnering and sexual experiences. The implications of our results for India’s growing surplus of adult men are discussed. PMID:26085706

  18. THE CONSEQUENCES OF INDIA'S MALE SURPLUS FOR WOMEN'S PARTNERING AND SEXUAL EXPERIENCES.

    PubMed

    Trent, Katherine; South, Scott J; Bose, Sunita

    2015-06-01

    Data from the third wave of India's 2005-2006 National Family and Health Survey are used to examine the influence of the community-level sex ratio on several dimensions of women's partnering behavior and sexual experiences. Multi-level logistic regression models that control for individual demographic attributes and community-level characteristics reveal that the local male-to-female sex ratio is positively and significantly associated with the likelihood that women marry prior to age 16 and have experienced forced sex. These associations are modest in magnitude. However, no significant associations are observed between the sex ratio and whether women have had two or more lifetime sexual partners or women's risk of contracting a sexually-transmitted disease. Birth cohort, education, religion, caste, region, urban residence, and several community-level measures of women's status also emerge as significant predictors of Indian women's partnering and sexual experiences. The implications of our results for India's growing surplus of adult men are discussed.

  19. A large-scale assessment of two-way SNP interactions in breast cancer susceptibility using 46 450 cases and 42 461 controls from the breast cancer association consortium

    PubMed Central

    Milne, Roger L.; Herranz, Jesús; Michailidou, Kyriaki; Dennis, Joe; Tyrer, Jonathan P.; Zamora, M. Pilar; Arias-Perez, José Ignacio; González-Neira, Anna; Pita, Guillermo; Alonso, M. Rosario; Wang, Qin; Bolla, Manjeet K.; Czene, Kamila; Eriksson, Mikael; Humphreys, Keith; Darabi, Hatef; Li, Jingmei; Anton-Culver, Hoda; Neuhausen, Susan L.; Ziogas, Argyrios; Clarke, Christina A.; Hopper, John L.; Dite, Gillian S.; Apicella, Carmel; Southey, Melissa C.; Chenevix-Trench, Georgia; Swerdlow, Anthony; Ashworth, Alan; Orr, Nicholas; Schoemaker, Minouk; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Mulligan, Anna Marie; Bojesen, Stig E.; Nordestgaard, Børge G.; Flyger, Henrik; Nevanlinna, Heli; Muranen, Taru A.; Aittomäki, Kristiina; Blomqvist, Carl; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Wang, Xianshu; Olson, Janet E.; Vachon, Celine; Purrington, Kristen; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Dunning, Alison M.; Shah, Mitul; Guénel, Pascal; Truong, Thérèse; Sanchez, Marie; Mulot, Claire; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Lindblom, Annika; Margolin, Sara; Hooning, Maartje J.; Hollestelle, Antoinette; Collée, J. Margriet; Jager, Agnes; Cox, Angela; Brock, Ian W.; Reed, Malcolm W.R.; Devilee, Peter; Tollenaar, Robert A.E.M.; Seynaeve, Caroline; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Dumont, Martine; Soucy, Penny; Dörk, Thilo; Bogdanova, Natalia V.; Hamann, Ute; Försti, Asta; Rüdiger, Thomas; Ulmer, Hans-Ulrich; Fasching, Peter A.; Häberle, Lothar; Ekici, Arif B.; Beckmann, Matthias W.; Fletcher, Olivia; Johnson, Nichola; dos Santos Silva, Isabel; Peto, Julian; Radice, Paolo; Peterlongo, Paolo; Peissel, Bernard; Mariani, Paolo; Giles, Graham G.; Severi, Gianluca; Baglietto, Laura; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael; Miller, Nicola; Marme, Federik; Burwinkel, Barbara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Lambrechts, Diether; Yesilyurt, Betul T.; Floris, Giuseppe; Leunen, Karin; Alnæs, Grethe Grenaker; Kristensen, Vessela; Børresen-Dale, Anne-Lise; García-Closas, Montserrat; Chanock, Stephen J.; Lissowska, Jolanta; Figueroa, Jonine D.; Schmidt, Marjanka K.; Broeks, Annegien; Verhoef, Senno; Rutgers, Emiel J.; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Couch, Fergus J.; Toland, Amanda E.; Yannoukakos, Drakoulis; Pharoah, Paul D.P.; Hall, Per; Benítez, Javier; Malats, Núria; Easton, Douglas F.

    2014-01-01

    Part of the substantial unexplained familial aggregation of breast cancer may be due to interactions between common variants, but few studies have had adequate statistical power to detect interactions of realistic magnitude. We aimed to assess all two-way interactions in breast cancer susceptibility between 70 917 single nucleotide polymorphisms (SNPs) selected primarily based on prior evidence of a marginal effect. Thirty-eight international studies contributed data for 46 450 breast cancer cases and 42 461 controls of European origin as part of a multi-consortium project (COGS). First, SNPs were preselected based on evidence (P < 0.01) of a per-allele main effect, and all two-way combinations of those were evaluated by a per-allele (1 d.f.) test for interaction using logistic regression. Second, all 2.5 billion possible two-SNP combinations were evaluated using Boolean operation-based screening and testing, and SNP pairs with the strongest evidence of interaction (P < 10−4) were selected for more careful assessment by logistic regression. Under the first approach, 3277 SNPs were preselected, but an evaluation of all possible two-SNP combinations (1 d.f.) identified no interactions at P < 10−8. Results from the second analytic approach were consistent with those from the first (P > 10−10). In summary, we observed little evidence of two-way SNP interactions in breast cancer susceptibility, despite the large number of SNPs with potential marginal effects considered and the very large sample size. This finding may have important implications for risk prediction, simplifying the modelling required. Further comprehensive, large-scale genome-wide interaction studies may identify novel interacting loci if the inherent logistic and computational challenges can be overcome. PMID:24242184

  20. A large-scale assessment of two-way SNP interactions in breast cancer susceptibility using 46,450 cases and 42,461 controls from the breast cancer association consortium.

    PubMed

    Milne, Roger L; Herranz, Jesús; Michailidou, Kyriaki; Dennis, Joe; Tyrer, Jonathan P; Zamora, M Pilar; Arias-Perez, José Ignacio; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Wang, Qin; Bolla, Manjeet K; Czene, Kamila; Eriksson, Mikael; Humphreys, Keith; Darabi, Hatef; Li, Jingmei; Anton-Culver, Hoda; Neuhausen, Susan L; Ziogas, Argyrios; Clarke, Christina A; Hopper, John L; Dite, Gillian S; Apicella, Carmel; Southey, Melissa C; Chenevix-Trench, Georgia; Swerdlow, Anthony; Ashworth, Alan; Orr, Nicholas; Schoemaker, Minouk; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Mulligan, Anna Marie; Bojesen, Stig E; Nordestgaard, Børge G; Flyger, Henrik; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Wang, Xianshu; Olson, Janet E; Vachon, Celine; Purrington, Kristen; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Dunning, Alison M; Shah, Mitul; Guénel, Pascal; Truong, Thérèse; Sanchez, Marie; Mulot, Claire; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Lindblom, Annika; Margolin, Sara; Hooning, Maartje J; Hollestelle, Antoinette; Collée, J Margriet; Jager, Agnes; Cox, Angela; Brock, Ian W; Reed, Malcolm W R; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Dumont, Martine; Soucy, Penny; Dörk, Thilo; Bogdanova, Natalia V; Hamann, Ute; Försti, Asta; Rüdiger, Thomas; Ulmer, Hans-Ulrich; Fasching, Peter A; Häberle, Lothar; Ekici, Arif B; Beckmann, Matthias W; Fletcher, Olivia; Johnson, Nichola; dos Santos Silva, Isabel; Peto, Julian; Radice, Paolo; Peterlongo, Paolo; Peissel, Bernard; Mariani, Paolo; Giles, Graham G; Severi, Gianluca; Baglietto, Laura; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael; Miller, Nicola; Marme, Federik; Burwinkel, Barbara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Lambrechts, Diether; Yesilyurt, Betul T; Floris, Giuseppe; Leunen, Karin; Alnæs, Grethe Grenaker; Kristensen, Vessela; Børresen-Dale, Anne-Lise; García-Closas, Montserrat; Chanock, Stephen J; Lissowska, Jolanta; Figueroa, Jonine D; Schmidt, Marjanka K; Broeks, Annegien; Verhoef, Senno; Rutgers, Emiel J; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Couch, Fergus J; Toland, Amanda E; Yannoukakos, Drakoulis; Pharoah, Paul D P; Hall, Per; Benítez, Javier; Malats, Núria; Easton, Douglas F

    2014-04-01

    Part of the substantial unexplained familial aggregation of breast cancer may be due to interactions between common variants, but few studies have had adequate statistical power to detect interactions of realistic magnitude. We aimed to assess all two-way interactions in breast cancer susceptibility between 70,917 single nucleotide polymorphisms (SNPs) selected primarily based on prior evidence of a marginal effect. Thirty-eight international studies contributed data for 46,450 breast cancer cases and 42,461 controls of European origin as part of a multi-consortium project (COGS). First, SNPs were preselected based on evidence (P < 0.01) of a per-allele main effect, and all two-way combinations of those were evaluated by a per-allele (1 d.f.) test for interaction using logistic regression. Second, all 2.5 billion possible two-SNP combinations were evaluated using Boolean operation-based screening and testing, and SNP pairs with the strongest evidence of interaction (P < 10(-4)) were selected for more careful assessment by logistic regression. Under the first approach, 3277 SNPs were preselected, but an evaluation of all possible two-SNP combinations (1 d.f.) identified no interactions at P < 10(-8). Results from the second analytic approach were consistent with those from the first (P > 10(-10)). In summary, we observed little evidence of two-way SNP interactions in breast cancer susceptibility, despite the large number of SNPs with potential marginal effects considered and the very large sample size. This finding may have important implications for risk prediction, simplifying the modelling required. Further comprehensive, large-scale genome-wide interaction studies may identify novel interacting loci if the inherent logistic and computational challenges can be overcome.

Top