Sample records for controls logistic regression

  1. Logistic regression applied to natural hazards: rare event logistic regression with replications

    NASA Astrophysics Data System (ADS)

    Guns, M.; Vanacker, V.

    2012-06-01

    Statistical analysis of natural hazards needs particular attention, as most of these phenomena are rare events. This study shows that the ordinary rare event logistic regression, as it is now commonly used in geomorphologic studies, does not always lead to a robust detection of controlling factors, as the results can be strongly sample-dependent. In this paper, we introduce some concepts of Monte Carlo simulations in rare event logistic regression. This technique, so-called rare event logistic regression with replications, combines the strength of probabilistic and statistical methods, and allows overcoming some of the limitations of previous developments through robust variable selection. This technique was here developed for the analyses of landslide controlling factors, but the concept is widely applicable for statistical analyses of natural hazards.

  2. Evaluation of logistic regression models and effect of covariates for case-control study in RNA-Seq analysis.

    PubMed

    Choi, Seung Hoan; Labadorf, Adam T; Myers, Richard H; Lunetta, Kathryn L; Dupuis, Josée; DeStefano, Anita L

    2017-02-06

    Next generation sequencing provides a count of RNA molecules in the form of short reads, yielding discrete, often highly non-normally distributed gene expression measurements. Although Negative Binomial (NB) regression has been generally accepted in the analysis of RNA sequencing (RNA-Seq) data, its appropriateness has not been exhaustively evaluated. We explore logistic regression as an alternative method for RNA-Seq studies designed to compare cases and controls, where disease status is modeled as a function of RNA-Seq reads using simulated and Huntington disease data. We evaluate the effect of adjusting for covariates that have an unknown relationship with gene expression. Finally, we incorporate the data adaptive method in order to compare false positive rates. When the sample size is small or the expression levels of a gene are highly dispersed, the NB regression shows inflated Type-I error rates but the Classical logistic and Bayes logistic (BL) regressions are conservative. Firth's logistic (FL) regression performs well or is slightly conservative. Large sample size and low dispersion generally make Type-I error rates of all methods close to nominal alpha levels of 0.05 and 0.01. However, Type-I error rates are controlled after applying the data adaptive method. The NB, BL, and FL regressions gain increased power with large sample size, large log2 fold-change, and low dispersion. The FL regression has comparable power to NB regression. We conclude that implementing the data adaptive method appropriately controls Type-I error rates in RNA-Seq analysis. Firth's logistic regression provides a concise statistical inference process and reduces spurious associations from inaccurately estimated dispersion parameters in the negative binomial framework.

  3. Controlling Type I Error Rates in Assessing DIF for Logistic Regression Method Combined with SIBTEST Regression Correction Procedure and DIF-Free-Then-DIF Strategy

    ERIC Educational Resources Information Center

    Shih, Ching-Lin; Liu, Tien-Hsiang; Wang, Wen-Chung

    2014-01-01

    The simultaneous item bias test (SIBTEST) method regression procedure and the differential item functioning (DIF)-free-then-DIF strategy are applied to the logistic regression (LR) method simultaneously in this study. These procedures are used to adjust the effects of matching true score on observed score and to better control the Type I error…

  4. Fungible weights in logistic regression.

    PubMed

    Jones, Jeff A; Waller, Niels G

    2016-06-01

    In this article we develop methods for assessing parameter sensitivity in logistic regression models. To set the stage for this work, we first review Waller's (2008) equations for computing fungible weights in linear regression. Next, we describe 2 methods for computing fungible weights in logistic regression. To demonstrate the utility of these methods, we compute fungible logistic regression weights using data from the Centers for Disease Control and Prevention's (2010) Youth Risk Behavior Surveillance Survey, and we illustrate how these alternate weights can be used to evaluate parameter sensitivity. To make our work accessible to the research community, we provide R code (R Core Team, 2015) that will generate both kinds of fungible logistic regression weights. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  5. Should metacognition be measured by logistic regression?

    PubMed

    Rausch, Manuel; Zehetleitner, Michael

    2017-03-01

    Are logistic regression slopes suitable to quantify metacognitive sensitivity, i.e. the efficiency with which subjective reports differentiate between correct and incorrect task responses? We analytically show that logistic regression slopes are independent from rating criteria in one specific model of metacognition, which assumes (i) that rating decisions are based on sensory evidence generated independently of the sensory evidence used for primary task responses and (ii) that the distributions of evidence are logistic. Given a hierarchical model of metacognition, logistic regression slopes depend on rating criteria. According to all considered models, regression slopes depend on the primary task criterion. A reanalysis of previous data revealed that massive numbers of trials are required to distinguish between hierarchical and independent models with tolerable accuracy. It is argued that researchers who wish to use logistic regression as measure of metacognitive sensitivity need to control the primary task criterion and rating criteria. Copyright © 2017 Elsevier Inc. All rights reserved.

  6. On the use and misuse of scalar scores of confounders in design and analysis of observational studies.

    PubMed

    Pfeiffer, R M; Riedl, R

    2015-08-15

    We assess the asymptotic bias of estimates of exposure effects conditional on covariates when summary scores of confounders, instead of the confounders themselves, are used to analyze observational data. First, we study regression models for cohort data that are adjusted for summary scores. Second, we derive the asymptotic bias for case-control studies when cases and controls are matched on a summary score, and then analyzed either using conditional logistic regression or by unconditional logistic regression adjusted for the summary score. Two scores, the propensity score (PS) and the disease risk score (DRS) are studied in detail. For cohort analysis, when regression models are adjusted for the PS, the estimated conditional treatment effect is unbiased only for linear models, or at the null for non-linear models. Adjustment of cohort data for DRS yields unbiased estimates only for linear regression; all other estimates of exposure effects are biased. Matching cases and controls on DRS and analyzing them using conditional logistic regression yields unbiased estimates of exposure effect, whereas adjusting for the DRS in unconditional logistic regression yields biased estimates, even under the null hypothesis of no association. Matching cases and controls on the PS yield unbiased estimates only under the null for both conditional and unconditional logistic regression, adjusted for the PS. We study the bias for various confounding scenarios and compare our asymptotic results with those from simulations with limited sample sizes. To create realistic correlations among multiple confounders, we also based simulations on a real dataset. Copyright © 2015 John Wiley & Sons, Ltd.

  7. The Effect of Latent Binary Variables on the Uncertainty of the Prediction of a Dichotomous Outcome Using Logistic Regression Based Propensity Score Matching.

    PubMed

    Szekér, Szabolcs; Vathy-Fogarassy, Ágnes

    2018-01-01

    Logistic regression based propensity score matching is a widely used method in case-control studies to select the individuals of the control group. This method creates a suitable control group if all factors affecting the output variable are known. However, if relevant latent variables exist as well, which are not taken into account during the calculations, the quality of the control group is uncertain. In this paper, we present a statistics-based research in which we try to determine the relationship between the accuracy of the logistic regression model and the uncertainty of the dependent variable of the control group defined by propensity score matching. Our analyses show that there is a linear correlation between the fit of the logistic regression model and the uncertainty of the output variable. In certain cases, a latent binary explanatory variable can result in a relative error of up to 70% in the prediction of the outcome variable. The observed phenomenon calls the attention of analysts to an important point, which must be taken into account when deducting conclusions.

  8. Use and interpretation of logistic regression in habitat-selection studies

    USGS Publications Warehouse

    Keating, Kim A.; Cherry, Steve

    2004-01-01

     Logistic regression is an important tool for wildlife habitat-selection studies, but the method frequently has been misapplied due to an inadequate understanding of the logistic model, its interpretation, and the influence of sampling design. To promote better use of this method, we review its application and interpretation under 3 sampling designs: random, case-control, and use-availability. Logistic regression is appropriate for habitat use-nonuse studies employing random sampling and can be used to directly model the conditional probability of use in such cases. Logistic regression also is appropriate for studies employing case-control sampling designs, but careful attention is required to interpret results correctly. Unless bias can be estimated or probability of use is small for all habitats, results of case-control studies should be interpreted as odds ratios, rather than probability of use or relative probability of use. When data are gathered under a use-availability design, logistic regression can be used to estimate approximate odds ratios if probability of use is small, at least on average. More generally, however, logistic regression is inappropriate for modeling habitat selection in use-availability studies. In particular, using logistic regression to fit the exponential model of Manly et al. (2002:100) does not guarantee maximum-likelihood estimates, valid probabilities, or valid likelihoods. We show that the resource selection function (RSF) commonly used for the exponential model is proportional to a logistic discriminant function. Thus, it may be used to rank habitats with respect to probability of use and to identify important habitat characteristics or their surrogates, but it is not guaranteed to be proportional to probability of use. Other problems associated with the exponential model also are discussed. We describe an alternative model based on Lancaster and Imbens (1996) that offers a method for estimating conditional probability of use in use-availability studies. Although promising, this model fails to converge to a unique solution in some important situations. Further work is needed to obtain a robust method that is broadly applicable to use-availability studies.

  9. Unconditional or Conditional Logistic Regression Model for Age-Matched Case-Control Data?

    PubMed

    Kuo, Chia-Ling; Duan, Yinghui; Grady, James

    2018-01-01

    Matching on demographic variables is commonly used in case-control studies to adjust for confounding at the design stage. There is a presumption that matched data need to be analyzed by matched methods. Conditional logistic regression has become a standard for matched case-control data to tackle the sparse data problem. The sparse data problem, however, may not be a concern for loose-matching data when the matching between cases and controls is not unique, and one case can be matched to other controls without substantially changing the association. Data matched on a few demographic variables are clearly loose-matching data, and we hypothesize that unconditional logistic regression is a proper method to perform. To address the hypothesis, we compare unconditional and conditional logistic regression models by precision in estimates and hypothesis testing using simulated matched case-control data. Our results support our hypothesis; however, the unconditional model is not as robust as the conditional model to the matching distortion that the matching process not only makes cases and controls similar for matching variables but also for the exposure status. When the study design involves other complex features or the computational burden is high, matching in loose-matching data can be ignored for negligible loss in testing and estimation if the distributions of matching variables are not extremely different between cases and controls.

  10. Unconditional or Conditional Logistic Regression Model for Age-Matched Case–Control Data?

    PubMed Central

    Kuo, Chia-Ling; Duan, Yinghui; Grady, James

    2018-01-01

    Matching on demographic variables is commonly used in case–control studies to adjust for confounding at the design stage. There is a presumption that matched data need to be analyzed by matched methods. Conditional logistic regression has become a standard for matched case–control data to tackle the sparse data problem. The sparse data problem, however, may not be a concern for loose-matching data when the matching between cases and controls is not unique, and one case can be matched to other controls without substantially changing the association. Data matched on a few demographic variables are clearly loose-matching data, and we hypothesize that unconditional logistic regression is a proper method to perform. To address the hypothesis, we compare unconditional and conditional logistic regression models by precision in estimates and hypothesis testing using simulated matched case–control data. Our results support our hypothesis; however, the unconditional model is not as robust as the conditional model to the matching distortion that the matching process not only makes cases and controls similar for matching variables but also for the exposure status. When the study design involves other complex features or the computational burden is high, matching in loose-matching data can be ignored for negligible loss in testing and estimation if the distributions of matching variables are not extremely different between cases and controls. PMID:29552553

  11. Dietary consumption patterns and laryngeal cancer risk.

    PubMed

    Vlastarakos, Petros V; Vassileiou, Andrianna; Delicha, Evie; Kikidis, Dimitrios; Protopapas, Dimosthenis; Nikolopoulos, Thomas P

    2016-06-01

    We conducted a case-control study to investigate the effect of diet on laryngeal carcinogenesis. Our study population was made up of 140 participants-70 patients with laryngeal cancer (LC) and 70 controls with a non-neoplastic condition that was unrelated to diet, smoking, or alcohol. A food-frequency questionnaire determined the mean consumption of 113 different items during the 3 years prior to symptom onset. Total energy intake and cooking mode were also noted. The relative risk, odds ratio (OR), and 95% confidence interval (CI) were estimated by multiple logistic regression analysis. We found that the total energy intake was significantly higher in the LC group (p < 0.001), and that the difference remained statistically significant after logistic regression analysis (p < 0.001; OR: 118.70). Notably, meat consumption was higher in the LC group (p < 0.001), and the difference remained significant after logistic regression analysis (p = 0.029; OR: 1.16). LC patients also consumed significantly more fried food (p = 0.036); this difference also remained significant in the logistic regression model (p = 0.026; OR: 5.45). The LC group also consumed significantly more seafood (p = 0.012); the difference persisted after logistic regression analysis (p = 0.009; OR: 2.48), with the consumption of shrimp proving detrimental (p = 0.049; OR: 2.18). Finally, the intake of zinc was significantly higher in the LC group before and after logistic regression analysis (p = 0.034 and p = 0.011; OR: 30.15, respectively). Cereal consumption (including pastas) was also higher among the LC patients (p = 0.043), with logistic regression analysis showing that their negative effect was possibly associated with the sauces and dressings that traditionally accompany pasta dishes (p = 0.006; OR: 4.78). Conversely, a higher consumption of dairy products was found in controls (p < 0.05); logistic regression analysis showed that calcium appeared to be protective at the micronutrient level (p < 0.001; OR: 0.27). We found no difference in the overall consumption of fruits and vegetables between the LC patients and controls; however, the LC patients did have a greater consumption of cooked tomatoes and cooked root vegetables (p = 0.039 for both), and the controls had more consumption of leeks (p = 0.042) and, among controls younger than 65 years, cooked beans (p = 0.037). Lemon (p = 0.037), squeezed fruit juice (p = 0.032), and watermelon (p = 0.018) were also more frequently consumed by the controls. Other differences at the micronutrient level included greater consumption by the LC patients of retinol (p = 0.044), polyunsaturated fats (p = 0.041), and linoleic acid (p = 0.008); LC patients younger than 65 years also had greater intake of riboflavin (p = 0.045). We conclude that the differences in dietary consumption patterns between LC patients and controls indicate a possible role for lifestyle modifications involving nutritional factors as a means of decreasing the risk of laryngeal cancer.

  12. Robust logistic regression to narrow down the winner's curse for rare and recessive susceptibility variants.

    PubMed

    Kesselmeier, Miriam; Lorenzo Bermejo, Justo

    2017-11-01

    Logistic regression is the most common technique used for genetic case-control association studies. A disadvantage of standard maximum likelihood estimators of the genotype relative risk (GRR) is their strong dependence on outlier subjects, for example, patients diagnosed at unusually young age. Robust methods are available to constrain outlier influence, but they are scarcely used in genetic studies. This article provides a non-intimidating introduction to robust logistic regression, and investigates its benefits and limitations in genetic association studies. We applied the bounded Huber and extended the R package 'robustbase' with the re-descending Hampel functions to down-weight outlier influence. Computer simulations were carried out to assess the type I error rate, mean squared error (MSE) and statistical power according to major characteristics of the genetic study and investigated markers. Simulations were complemented with the analysis of real data. Both standard and robust estimation controlled type I error rates. Standard logistic regression showed the highest power but standard GRR estimates also showed the largest bias and MSE, in particular for associated rare and recessive variants. For illustration, a recessive variant with a true GRR=6.32 and a minor allele frequency=0.05 investigated in a 1000 case/1000 control study by standard logistic regression resulted in power=0.60 and MSE=16.5. The corresponding figures for Huber-based estimation were power=0.51 and MSE=0.53. Overall, Hampel- and Huber-based GRR estimates did not differ much. Robust logistic regression may represent a valuable alternative to standard maximum likelihood estimation when the focus lies on risk prediction rather than identification of susceptibility variants. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  13. Epidemiologic programs for computers and calculators. A microcomputer program for multiple logistic regression by unconditional and conditional maximum likelihood methods.

    PubMed

    Campos-Filho, N; Franco, E L

    1989-02-01

    A frequent procedure in matched case-control studies is to report results from the multivariate unmatched analyses if they do not differ substantially from the ones obtained after conditioning on the matching variables. Although conceptually simple, this rule requires that an extensive series of logistic regression models be evaluated by both the conditional and unconditional maximum likelihood methods. Most computer programs for logistic regression employ only one maximum likelihood method, which requires that the analyses be performed in separate steps. This paper describes a Pascal microcomputer (IBM PC) program that performs multiple logistic regression by both maximum likelihood estimation methods, which obviates the need for switching between programs to obtain relative risk estimates from both matched and unmatched analyses. The program calculates most standard statistics and allows factoring of categorical or continuous variables by two distinct methods of contrast. A built-in, descriptive statistics option allows the user to inspect the distribution of cases and controls across categories of any given variable.

  14. Estimating time-varying exposure-outcome associations using case-control data: logistic and case-cohort analyses.

    PubMed

    Keogh, Ruth H; Mangtani, Punam; Rodrigues, Laura; Nguipdop Djomo, Patrick

    2016-01-05

    Traditional analyses of standard case-control studies using logistic regression do not allow estimation of time-varying associations between exposures and the outcome. We present two approaches which allow this. The motivation is a study of vaccine efficacy as a function of time since vaccination. Our first approach is to estimate time-varying exposure-outcome associations by fitting a series of logistic regressions within successive time periods, reusing controls across periods. Our second approach treats the case-control sample as a case-cohort study, with the controls forming the subcohort. In the case-cohort analysis, controls contribute information at all times they are at risk. Extensions allow left truncation, frequency matching and, using the case-cohort analysis, time-varying exposures. Simulations are used to investigate the methods. The simulation results show that both methods give correct estimates of time-varying effects of exposures using standard case-control data. Using the logistic approach there are efficiency gains by reusing controls over time and care should be taken over the definition of controls within time periods. However, using the case-cohort analysis there is no ambiguity over the definition of controls. The performance of the two analyses is very similar when controls are used most efficiently under the logistic approach. Using our methods, case-control studies can be used to estimate time-varying exposure-outcome associations where they may not previously have been considered. The case-cohort analysis has several advantages, including that it allows estimation of time-varying associations as a continuous function of time, while the logistic regression approach is restricted to assuming a step function form for the time-varying association.

  15. Hierarchical Bayesian Logistic Regression to forecast metabolic control in type 2 DM patients.

    PubMed

    Dagliati, Arianna; Malovini, Alberto; Decata, Pasquale; Cogni, Giulia; Teliti, Marsida; Sacchi, Lucia; Cerra, Carlo; Chiovato, Luca; Bellazzi, Riccardo

    2016-01-01

    In this work we present our efforts in building a model able to forecast patients' changes in clinical conditions when repeated measurements are available. In this case the available risk calculators are typically not applicable. We propose a Hierarchical Bayesian Logistic Regression model, which allows taking into account individual and population variability in model parameters estimate. The model is used to predict metabolic control and its variation in type 2 diabetes mellitus. In particular we have analyzed a population of more than 1000 Italian type 2 diabetic patients, collected within the European project Mosaic. The results obtained in terms of Matthews Correlation Coefficient are significantly better than the ones gathered with standard logistic regression model, based on data pooling.

  16. Iterative Purification and Effect Size Use with Logistic Regression for Differential Item Functioning Detection

    ERIC Educational Resources Information Center

    French, Brian F.; Maller, Susan J.

    2007-01-01

    Two unresolved implementation issues with logistic regression (LR) for differential item functioning (DIF) detection include ability purification and effect size use. Purification is suggested to control inaccuracies in DIF detection as a result of DIF items in the ability estimate. Additionally, effect size use may be beneficial in controlling…

  17. Matched samples logistic regression in case-control studies with missing values: when to break the matches.

    PubMed

    Hansson, Lisbeth; Khamis, Harry J

    2008-12-01

    Simulated data sets are used to evaluate conditional and unconditional maximum likelihood estimation in an individual case-control design with continuous covariates when there are different rates of excluded cases and different levels of other design parameters. The effectiveness of the estimation procedures is measured by method bias, variance of the estimators, root mean square error (RMSE) for logistic regression and the percentage of explained variation. Conditional estimation leads to higher RMSE than unconditional estimation in the presence of missing observations, especially for 1:1 matching. The RMSE is higher for the smaller stratum size, especially for the 1:1 matching. The percentage of explained variation appears to be insensitive to missing data, but is generally higher for the conditional estimation than for the unconditional estimation. It is particularly good for the 1:2 matching design. For minimizing RMSE, a high matching ratio is recommended; in this case, conditional and unconditional logistic regression models yield comparable levels of effectiveness. For maximizing the percentage of explained variation, the 1:2 matching design with the conditional logistic regression model is recommended.

  18. Effect of folic acid on appetite in children: ordinal logistic and fuzzy logistic regressions.

    PubMed

    Namdari, Mahshid; Abadi, Alireza; Taheri, S Mahmoud; Rezaei, Mansour; Kalantari, Naser; Omidvar, Nasrin

    2014-03-01

    Reduced appetite and low food intake are often a concern in preschool children, since it can lead to malnutrition, a leading cause of impaired growth and mortality in childhood. It is occasionally considered that folic acid has a positive effect on appetite enhancement and consequently growth in children. The aim of this study was to assess the effect of folic acid on the appetite of preschool children 3 to 6 y old. The study sample included 127 children ages 3 to 6 who were randomly selected from 20 preschools in the city of Tehran in 2011. Since appetite was measured by linguistic terms, a fuzzy logistic regression was applied for modeling. The obtained results were compared with a statistical ordinal logistic model. After controlling for the potential confounders, in a statistical ordinal logistic model, serum folate showed a significantly positive effect on appetite. A small but positive effect of folate was detected by fuzzy logistic regression. Based on fuzzy regression, the risk for poor appetite in preschool children was related to the employment status of their mothers. In this study, a positive association was detected between the levels of serum folate and improved appetite. For further investigation, a randomized controlled, double-blind clinical trial could be helpful to address causality. Copyright © 2014 Elsevier Inc. All rights reserved.

  19. Power and sample size for multivariate logistic modeling of unmatched case-control studies.

    PubMed

    Gail, Mitchell H; Haneuse, Sebastien

    2017-01-01

    Sample size calculations are needed to design and assess the feasibility of case-control studies. Although such calculations are readily available for simple case-control designs and univariate analyses, there is limited theory and software for multivariate unconditional logistic analysis of case-control data. Here we outline the theory needed to detect scalar exposure effects or scalar interactions while controlling for other covariates in logistic regression. Both analytical and simulation methods are presented, together with links to the corresponding software.

  20. [Application of SAS macro to evaluated multiplicative and additive interaction in logistic and Cox regression in clinical practices].

    PubMed

    Nie, Z Q; Ou, Y Q; Zhuang, J; Qu, Y J; Mai, J Z; Chen, J M; Liu, X Q

    2016-05-01

    Conditional logistic regression analysis and unconditional logistic regression analysis are commonly used in case control study, but Cox proportional hazard model is often used in survival data analysis. Most literature only refer to main effect model, however, generalized linear model differs from general linear model, and the interaction was composed of multiplicative interaction and additive interaction. The former is only statistical significant, but the latter has biological significance. In this paper, macros was written by using SAS 9.4 and the contrast ratio, attributable proportion due to interaction and synergy index were calculated while calculating the items of logistic and Cox regression interactions, and the confidence intervals of Wald, delta and profile likelihood were used to evaluate additive interaction for the reference in big data analysis in clinical epidemiology and in analysis of genetic multiplicative and additive interactions.

  1. [Influences of environmental factors and interaction of several chemokines gene-environmental on systemic lupus erythematosus].

    PubMed

    Ye, Dong-qing; Hu, Yi-song; Li, Xiang-pei; Huang, Fen; Yang, Shi-gui; Hao, Jia-hu; Yin, Jing; Zhang, Guo-qing; Liu, Hui-hui

    2004-11-01

    To explore the impact of environmental factors, daily lifestyle, psycho-social factors and the interactions between environmental factors and chemokines genes on systemic lupus erythematosus (SLE). Case-control study was carried out and environmental factors for SLE were analyzed by univariate and multivariate unconditional logistic regression. Interactions between environmental factors and chemokines polymorphism contributing to systemic lupus erythematosus were also analyzed by logistic regression model. There were nineteen factors associated with SLE when univariate unconditional logistic regression was used. However, when multivariate unconditional logistic regression was used, only five factors showed having impacts on the disease, in which drinking well water (OR=0.099) was protective factor for SLE, and multiple drug allergy (OR=8.174), over-exposure to sunshine (OR=18.339), taking antibiotics (OR=9.630) and oral contraceptives were risk factors for SLE. When unconditional logistic regression model was used, results showed that there was interaction between eating irritable food and -2518MCP-1G/G genotype (OR=4.387). No interaction between environmental factors was found that contributing to SLE in this study. Many environmental factors were related to SLE, and there was an interaction between -2518MCP-1G/G genotype and eating irritable food.

  2. A deeper look at two concepts of measuring gene-gene interactions: logistic regression and interaction information revisited.

    PubMed

    Mielniczuk, Jan; Teisseyre, Paweł

    2018-03-01

    Detection of gene-gene interactions is one of the most important challenges in genome-wide case-control studies. Besides traditional logistic regression analysis, recently the entropy-based methods attracted a significant attention. Among entropy-based methods, interaction information is one of the most promising measures having many desirable properties. Although both logistic regression and interaction information have been used in several genome-wide association studies, the relationship between them has not been thoroughly investigated theoretically. The present paper attempts to fill this gap. We show that although certain connections between the two methods exist, in general they refer two different concepts of dependence and looking for interactions in those two senses leads to different approaches to interaction detection. We introduce ordering between interaction measures and specify conditions for independent and dependent genes under which interaction information is more discriminative measure than logistic regression. Moreover, we show that for so-called perfect distributions those measures are equivalent. The numerical experiments illustrate the theoretical findings indicating that interaction information and its modified version are more universal tools for detecting various types of interaction than logistic regression and linkage disequilibrium measures. © 2017 WILEY PERIODICALS, INC.

  3. Asthma exacerbation and proximity of residence to major roads: a population-based matched case-control study among the pediatric Medicaid population in Detroit, Michigan

    PubMed Central

    2011-01-01

    Background The relationship between asthma and traffic-related pollutants has received considerable attention. The use of individual-level exposure measures, such as residence location or proximity to emission sources, may avoid ecological biases. Method This study focused on the pediatric Medicaid population in Detroit, MI, a high-risk population for asthma-related events. A population-based matched case-control analysis was used to investigate associations between acute asthma outcomes and proximity of residence to major roads, including freeways. Asthma cases were identified as all children who made at least one asthma claim, including inpatient and emergency department visits, during the three-year study period, 2004-06. Individually matched controls were randomly selected from the rest of the Medicaid population on the basis of non-respiratory related illness. We used conditional logistic regression with distance as both categorical and continuous variables, and examined non-linear relationships with distance using polynomial splines. The conditional logistic regression models were then extended by considering multiple asthma states (based on the frequency of acute asthma outcomes) using polychotomous conditional logistic regression. Results Asthma events were associated with proximity to primary roads with an odds ratio of 0.97 (95% CI: 0.94, 0.99) for a 1 km increase in distance using conditional logistic regression, implying that asthma events are less likely as the distance between the residence and a primary road increases. Similar relationships and effect sizes were found using polychotomous conditional logistic regression. Another plausible exposure metric, a reduced form response surface model that represents atmospheric dispersion of pollutants from roads, was not associated under that exposure model. Conclusions There is moderately strong evidence of elevated risk of asthma close to major roads based on the results obtained in this population-based matched case-control study. PMID:21513554

  4. Conditional Poisson models: a flexible alternative to conditional logistic case cross-over analysis.

    PubMed

    Armstrong, Ben G; Gasparrini, Antonio; Tobias, Aurelio

    2014-11-24

    The time stratified case cross-over approach is a popular alternative to conventional time series regression for analysing associations between time series of environmental exposures (air pollution, weather) and counts of health outcomes. These are almost always analyzed using conditional logistic regression on data expanded to case-control (case crossover) format, but this has some limitations. In particular adjusting for overdispersion and auto-correlation in the counts is not possible. It has been established that a Poisson model for counts with stratum indicators gives identical estimates to those from conditional logistic regression and does not have these limitations, but it is little used, probably because of the overheads in estimating many stratum parameters. The conditional Poisson model avoids estimating stratum parameters by conditioning on the total event count in each stratum, thus simplifying the computing and increasing the number of strata for which fitting is feasible compared with the standard unconditional Poisson model. Unlike the conditional logistic model, the conditional Poisson model does not require expanding the data, and can adjust for overdispersion and auto-correlation. It is available in Stata, R, and other packages. By applying to some real data and using simulations, we demonstrate that conditional Poisson models were simpler to code and shorter to run than are conditional logistic analyses and can be fitted to larger data sets than possible with standard Poisson models. Allowing for overdispersion or autocorrelation was possible with the conditional Poisson model but when not required this model gave identical estimates to those from conditional logistic regression. Conditional Poisson regression models provide an alternative to case crossover analysis of stratified time series data with some advantages. The conditional Poisson model can also be used in other contexts in which primary control for confounding is by fine stratification.

  5. Racial/ethnic and educational differences in the estimated odds of recent nitrite use among adult household residents in the United States: an illustration of matching and conditional logistic regression.

    PubMed

    Delva, J; Spencer, M S; Lin, J K

    2000-01-01

    This article compares estimates of the relative odds of nitrite use obtained from weighted unconditional logistic regression with estimates obtained from conditional logistic regression after post-stratification and matching of cases with controls by neighborhood of residence. We illustrate these methods by comparing the odds associated with nitrite use among adults of four racial/ethnic groups, with and without a high school education. We used aggregated data from the 1994-B through 1996 National Household Survey on Drug Abuse (NHSDA). Difference between the methods and implications for analysis and inference are discussed.

  6. Polymorphism Thr160Thr in SRD5A1, involved in the progesterone metabolism, modifies postmenopausal breast cancer risk associated with menopausal hormone therapy.

    PubMed

    Hein, R; Abbas, S; Seibold, P; Salazar, R; Flesch-Janys, D; Chang-Claude, J

    2012-01-01

    Menopausal hormone therapy (MHT) is associated with an increased breast cancer risk in postmenopausal women, with combined estrogen-progestagen therapy posing a greater risk than estrogen monotherapy. However, few studies focused on potential effect modification of MHT-associated breast cancer risk by genetic polymorphisms in the progesterone metabolism. We assessed effect modification of MHT use by five coding single nucleotide polymorphisms (SNPs) in the progesterone metabolizing enzymes AKR1C3 (rs7741), AKR1C4 (rs3829125, rs17134592), and SRD5A1 (rs248793, rs3736316) using a two-center population-based case-control study from Germany with 2,502 postmenopausal breast cancer patients and 4,833 matched controls. An empirical-Bayes procedure that tests for interaction using a weighted combination of the prospective and the retrospective case-control estimators as well as standard prospective logistic regression were applied to assess multiplicative statistical interaction between polymorphisms and duration of MHT use with regard to breast cancer risk assuming a log-additive mode of inheritance. No genetic marginal effects were observed. Breast cancer risk associated with duration of combined therapy was significantly modified by SRD5A1_rs3736316, showing a reduced risk elevation in carriers of the minor allele (p (interaction,empirical-Bayes) = 0.006 using the empirical-Bayes method, p (interaction,logistic regression) = 0.013 using logistic regression). The risk associated with duration of use of monotherapy was increased by AKR1C3_rs7741 in minor allele carriers (p (interaction,empirical-Bayes) = 0.083, p (interaction,logistic regression) = 0.029) and decreased in minor allele carriers of two SNPs in AKR1C4 (rs3829125: p (interaction,empirical-Bayes) = 0.07, p (interaction,logistic regression) = 0.021; rs17134592: p (interaction,empirical-Bayes) = 0.101, p (interaction,logistic regression) = 0.038). After Bonferroni correction for multiple testing only SRD5A1_rs3736316 assessed using the empirical-Bayes method remained significant. Postmenopausal breast cancer risk associated with combined therapy may be modified by genetic variation in SRD5A1. Further well-powered studies are, however, required to replicate our finding.

  7. Determination of osteoporosis risk factors using a multiple logistic regression model in postmenopausal Turkish women.

    PubMed

    Akkus, Zeki; Camdeviren, Handan; Celik, Fatma; Gur, Ali; Nas, Kemal

    2005-09-01

    To determine the risk factors of osteoporosis using a multiple binary logistic regression method and to assess the risk variables for osteoporosis, which is a major and growing health problem in many countries. We presented a case-control study, consisting of 126 postmenopausal healthy women as control group and 225 postmenopausal osteoporotic women as the case group. The study was carried out in the Department of Physical Medicine and Rehabilitation, Dicle University, Diyarbakir, Turkey between 1999-2002. The data from the 351 participants were collected using a standard questionnaire that contains 43 variables. A multiple logistic regression model was then used to evaluate the data and to find the best regression model. We classified 80.1% (281/351) of the participants using the regression model. Furthermore, the specificity value of the model was 67% (84/126) of the control group while the sensitivity value was 88% (197/225) of the case group. We found the distribution of residual values standardized for final model to be exponential using the Kolmogorow-Smirnow test (p=0.193). The receiver operating characteristic curve was found successful to predict patients with risk for osteoporosis. This study suggests that low levels of dietary calcium intake, physical activity, education, and longer duration of menopause are independent predictors of the risk of low bone density in our population. Adequate dietary calcium intake in combination with maintaining a daily physical activity, increasing educational level, decreasing birth rate, and duration of breast-feeding may contribute to healthy bones and play a role in practical prevention of osteoporosis in Southeast Anatolia. In addition, the findings of the present study indicate that the use of multivariate statistical method as a multiple logistic regression in osteoporosis, which maybe influenced by many variables, is better than univariate statistical evaluation.

  8. Evaluation of Cox's model and logistic regression for matched case-control data with time-dependent covariates: a simulation study.

    PubMed

    Leffondré, Karen; Abrahamowicz, Michal; Siemiatycki, Jack

    2003-12-30

    Case-control studies are typically analysed using the conventional logistic model, which does not directly account for changes in the covariate values over time. Yet, many exposures may vary over time. The most natural alternative to handle such exposures would be to use the Cox model with time-dependent covariates. However, its application to case-control data opens the question of how to manipulate the risk sets. Through a simulation study, we investigate how the accuracy of the estimates of Cox's model depends on the operational definition of risk sets and/or on some aspects of the time-varying exposure. We also assess the estimates obtained from conventional logistic regression. The lifetime experience of a hypothetical population is first generated, and a matched case-control study is then simulated from this population. We control the frequency, the age at initiation, and the total duration of exposure, as well as the strengths of their effects. All models considered include a fixed-in-time covariate and one or two time-dependent covariate(s): the indicator of current exposure and/or the exposure duration. Simulation results show that none of the models always performs well. The discrepancies between the odds ratios yielded by logistic regression and the 'true' hazard ratio depend on both the type of the covariate and the strength of its effect. In addition, it seems that logistic regression has difficulty separating the effects of inter-correlated time-dependent covariates. By contrast, each of the two versions of Cox's model systematically induces either a serious under-estimation or a moderate over-estimation bias. The magnitude of the latter bias is proportional to the true effect, suggesting that an improved manipulation of the risk sets may eliminate, or at least reduce, the bias. Copyright 2003 JohnWiley & Sons, Ltd.

  9. Separation in Logistic Regression: Causes, Consequences, and Control.

    PubMed

    Mansournia, Mohammad Ali; Geroldinger, Angelika; Greenland, Sander; Heinze, Georg

    2018-04-01

    Separation is encountered in regression models with a discrete outcome (such as logistic regression) where the covariates perfectly predict the outcome. It is most frequent under the same conditions that lead to small-sample and sparse-data bias, such as presence of a rare outcome, rare exposures, highly correlated covariates, or covariates with strong effects. In theory, separation will produce infinite estimates for some coefficients. In practice, however, separation may be unnoticed or mishandled because of software limits in recognizing and handling the problem and in notifying the user. We discuss causes of separation in logistic regression and describe how common software packages deal with it. We then describe methods that remove separation, focusing on the same penalized-likelihood techniques used to address more general sparse-data problems. These methods improve accuracy, avoid software problems, and allow interpretation as Bayesian analyses with weakly informative priors. We discuss likelihood penalties, including some that can be implemented easily with any software package, and their relative advantages and disadvantages. We provide an illustration of ideas and methods using data from a case-control study of contraceptive practices and urinary tract infection.

  10. Logistic regression trees for initial selection of interesting loci in case-control studies

    PubMed Central

    Nickolov, Radoslav Z; Milanov, Valentin B

    2007-01-01

    Modern genetic epidemiology faces the challenge of dealing with hundreds of thousands of genetic markers. The selection of a small initial subset of interesting markers for further investigation can greatly facilitate genetic studies. In this contribution we suggest the use of a logistic regression tree algorithm known as logistic tree with unbiased selection. Using the simulated data provided for Genetic Analysis Workshop 15, we show how this algorithm, with incorporation of multifactor dimensionality reduction method, can reduce an initial large pool of markers to a small set that includes the interesting markers with high probability. PMID:18466557

  11. Access disparities to Magnet hospitals for patients undergoing neurosurgical operations

    PubMed Central

    Missios, Symeon; Bekelis, Kimon

    2017-01-01

    Background Centers of excellence focusing on quality improvement have demonstrated superior outcomes for a variety of surgical interventions. We investigated the presence of access disparities to hospitals recognized by the Magnet Recognition Program of the American Nurses Credentialing Center (ANCC) for patients undergoing neurosurgical operations. Methods We performed a cohort study of all neurosurgery patients who were registered in the New York Statewide Planning and Research Cooperative System (SPARCS) database from 2009–2013. We examined the association of African-American race and lack of insurance with Magnet status hospitalization for neurosurgical procedures. A mixed effects propensity adjusted multivariable regression analysis was used to control for confounding. Results During the study period, 190,535 neurosurgical patients met the inclusion criteria. Using a multivariable logistic regression, we demonstrate that African-Americans had lower admission rates to Magnet institutions (OR 0.62; 95% CI, 0.58–0.67). This persisted in a mixed effects logistic regression model (OR 0.77; 95% CI, 0.70–0.83) to adjust for clustering at the patient county level, and a propensity score adjusted logistic regression model (OR 0.75; 95% CI, 0.69–0.82). Additionally, lack of insurance was associated with lower admission rates to Magnet institutions (OR 0.71; 95% CI, 0.68–0.73), in a multivariable logistic regression model. This persisted in a mixed effects logistic regression model (OR 0.72; 95% CI, 0.69–0.74), and a propensity score adjusted logistic regression model (OR 0.72; 95% CI, 0.69–0.75). Conclusions Using a comprehensive all-payer cohort of neurosurgery patients in New York State we identified an association of African-American race and lack of insurance with lower rates of admission to Magnet hospitals. PMID:28684152

  12. CUSUM-Logistic Regression analysis for the rapid detection of errors in clinical laboratory test results.

    PubMed

    Sampson, Maureen L; Gounden, Verena; van Deventer, Hendrik E; Remaley, Alan T

    2016-02-01

    The main drawback of the periodic analysis of quality control (QC) material is that test performance is not monitored in time periods between QC analyses, potentially leading to the reporting of faulty test results. The objective of this study was to develop a patient based QC procedure for the more timely detection of test errors. Results from a Chem-14 panel measured on the Beckman LX20 analyzer were used to develop the model. Each test result was predicted from the other 13 members of the panel by multiple regression, which resulted in correlation coefficients between the predicted and measured result of >0.7 for 8 of the 14 tests. A logistic regression model, which utilized the measured test result, the predicted test result, the day of the week and time of day, was then developed for predicting test errors. The output of the logistic regression was tallied by a daily CUSUM approach and used to predict test errors, with a fixed specificity of 90%. The mean average run length (ARL) before error detection by CUSUM-Logistic Regression (CSLR) was 20 with a mean sensitivity of 97%, which was considerably shorter than the mean ARL of 53 (sensitivity 87.5%) for a simple prediction model that only used the measured result for error detection. A CUSUM-Logistic Regression analysis of patient laboratory data can be an effective approach for the rapid and sensitive detection of clinical laboratory errors. Published by Elsevier Inc.

  13. Improving power and robustness for detecting genetic association with extreme-value sampling design.

    PubMed

    Chen, Hua Yun; Li, Mingyao

    2011-12-01

    Extreme-value sampling design that samples subjects with extremely large or small quantitative trait values is commonly used in genetic association studies. Samples in such designs are often treated as "cases" and "controls" and analyzed using logistic regression. Such a case-control analysis ignores the potential dose-response relationship between the quantitative trait and the underlying trait locus and thus may lead to loss of power in detecting genetic association. An alternative approach to analyzing such data is to model the dose-response relationship by a linear regression model. However, parameter estimation from this model can be biased, which may lead to inflated type I errors. We propose a robust and efficient approach that takes into consideration of both the biased sampling design and the potential dose-response relationship. Extensive simulations demonstrate that the proposed method is more powerful than the traditional logistic regression analysis and is more robust than the linear regression analysis. We applied our method to the analysis of a candidate gene association study on high-density lipoprotein cholesterol (HDL-C) which includes study subjects with extremely high or low HDL-C levels. Using our method, we identified several SNPs showing a stronger evidence of association with HDL-C than the traditional case-control logistic regression analysis. Our results suggest that it is important to appropriately model the quantitative traits and to adjust for the biased sampling when dose-response relationship exists in extreme-value sampling designs. © 2011 Wiley Periodicals, Inc.

  14. Estimating a Logistic Discrimination Functions When One of the Training Samples Is Subject to Misclassification: A Maximum Likelihood Approach.

    PubMed

    Nagelkerke, Nico; Fidler, Vaclav

    2015-01-01

    The problem of discrimination and classification is central to much of epidemiology. Here we consider the estimation of a logistic regression/discrimination function from training samples, when one of the training samples is subject to misclassification or mislabeling, e.g. diseased individuals are incorrectly classified/labeled as healthy controls. We show that this leads to zero-inflated binomial model with a defective logistic regression or discrimination function, whose parameters can be estimated using standard statistical methods such as maximum likelihood. These parameters can be used to estimate the probability of true group membership among those, possibly erroneously, classified as controls. Two examples are analyzed and discussed. A simulation study explores properties of the maximum likelihood parameter estimates and the estimates of the number of mislabeled observations.

  15. The effect of high leverage points on the logistic ridge regression estimator having multicollinearity

    NASA Astrophysics Data System (ADS)

    Ariffin, Syaiba Balqish; Midi, Habshah

    2014-06-01

    This article is concerned with the performance of logistic ridge regression estimation technique in the presence of multicollinearity and high leverage points. In logistic regression, multicollinearity exists among predictors and in the information matrix. The maximum likelihood estimator suffers a huge setback in the presence of multicollinearity which cause regression estimates to have unduly large standard errors. To remedy this problem, a logistic ridge regression estimator is put forward. It is evident that the logistic ridge regression estimator outperforms the maximum likelihood approach for handling multicollinearity. The effect of high leverage points are then investigated on the performance of the logistic ridge regression estimator through real data set and simulation study. The findings signify that logistic ridge regression estimator fails to provide better parameter estimates in the presence of both high leverage points and multicollinearity.

  16. Sample size determination for logistic regression on a logit-normal distribution.

    PubMed

    Kim, Seongho; Heath, Elisabeth; Heilbrun, Lance

    2017-06-01

    Although the sample size for simple logistic regression can be readily determined using currently available methods, the sample size calculation for multiple logistic regression requires some additional information, such as the coefficient of determination ([Formula: see text]) of a covariate of interest with other covariates, which is often unavailable in practice. The response variable of logistic regression follows a logit-normal distribution which can be generated from a logistic transformation of a normal distribution. Using this property of logistic regression, we propose new methods of determining the sample size for simple and multiple logistic regressions using a normal transformation of outcome measures. Simulation studies and a motivating example show several advantages of the proposed methods over the existing methods: (i) no need for [Formula: see text] for multiple logistic regression, (ii) available interim or group-sequential designs, and (iii) much smaller required sample size.

  17. A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design.

    PubMed

    Staley, James R; Jones, Edmund; Kaptoge, Stephen; Butterworth, Adam S; Sweeting, Michael J; Wood, Angela M; Howson, Joanna M M

    2017-06-01

    Logistic regression is often used instead of Cox regression to analyse genome-wide association studies (GWAS) of single-nucleotide polymorphisms (SNPs) and disease outcomes with cohort and case-cohort designs, as it is less computationally expensive. Although Cox and logistic regression models have been compared previously in cohort studies, this work does not completely cover the GWAS setting nor extend to the case-cohort study design. Here, we evaluated Cox and logistic regression applied to cohort and case-cohort genetic association studies using simulated data and genetic data from the EPIC-CVD study. In the cohort setting, there was a modest improvement in power to detect SNP-disease associations using Cox regression compared with logistic regression, which increased as the disease incidence increased. In contrast, logistic regression had more power than (Prentice weighted) Cox regression in the case-cohort setting. Logistic regression yielded inflated effect estimates (assuming the hazard ratio is the underlying measure of association) for both study designs, especially for SNPs with greater effect on disease. Given logistic regression is substantially more computationally efficient than Cox regression in both settings, we propose a two-step approach to GWAS in cohort and case-cohort studies. First to analyse all SNPs with logistic regression to identify associated variants below a pre-defined P-value threshold, and second to fit Cox regression (appropriately weighted in case-cohort studies) to those identified SNPs to ensure accurate estimation of association with disease.

  18. The crux of the method: assumptions in ordinary least squares and logistic regression.

    PubMed

    Long, Rebecca G

    2008-10-01

    Logistic regression has increasingly become the tool of choice when analyzing data with a binary dependent variable. While resources relating to the technique are widely available, clear discussions of why logistic regression should be used in place of ordinary least squares regression are difficult to find. The current paper compares and contrasts the assumptions of ordinary least squares with those of logistic regression and explains why logistic regression's looser assumptions make it adept at handling violations of the more important assumptions in ordinary least squares.

  19. Detecting Anomalies in Process Control Networks

    NASA Astrophysics Data System (ADS)

    Rrushi, Julian; Kang, Kyoung-Don

    This paper presents the estimation-inspection algorithm, a statistical algorithm for anomaly detection in process control networks. The algorithm determines if the payload of a network packet that is about to be processed by a control system is normal or abnormal based on the effect that the packet will have on a variable stored in control system memory. The estimation part of the algorithm uses logistic regression integrated with maximum likelihood estimation in an inductive machine learning process to estimate a series of statistical parameters; these parameters are used in conjunction with logistic regression formulas to form a probability mass function for each variable stored in control system memory. The inspection part of the algorithm uses the probability mass functions to estimate the normalcy probability of a specific value that a network packet writes to a variable. Experimental results demonstrate that the algorithm is very effective at detecting anomalies in process control networks.

  20. Using Dominance Analysis to Determine Predictor Importance in Logistic Regression

    ERIC Educational Resources Information Center

    Azen, Razia; Traxel, Nicole

    2009-01-01

    This article proposes an extension of dominance analysis that allows researchers to determine the relative importance of predictors in logistic regression models. Criteria for choosing logistic regression R[superscript 2] analogues were determined and measures were selected that can be used to perform dominance analysis in logistic regression. A…

  1. Applying Kaplan-Meier to Item Response Data

    ERIC Educational Resources Information Center

    McNeish, Daniel

    2018-01-01

    Some IRT models can be equivalently modeled in alternative frameworks such as logistic regression. Logistic regression can also model time-to-event data, which concerns the probability of an event occurring over time. Using the relation between time-to-event models and logistic regression and the relation between logistic regression and IRT, this…

  2. Bayesian logistic regression in detection of gene-steroid interaction for cancer at PDLIM5 locus.

    PubMed

    Wang, Ke-Sheng; Owusu, Daniel; Pan, Yue; Xie, Changchun

    2016-06-01

    The PDZ and LIM domain 5 (PDLIM5) gene may play a role in cancer, bipolar disorder, major depression, alcohol dependence and schizophrenia; however, little is known about the interaction effect of steroid and PDLIM5 gene on cancer. This study examined 47 single-nucleotide polymorphisms (SNPs) within the PDLIM5 gene in the Marshfield sample with 716 cancer patients (any diagnosed cancer, excluding minor skin cancer) and 2848 noncancer controls. Multiple logistic regression model in PLINK software was used to examine the association of each SNP with cancer. Bayesian logistic regression in PROC GENMOD in SAS statistical software, ver. 9.4 was used to detect gene- steroid interactions influencing cancer. Single marker analysis using PLINK identified 12 SNPs associated with cancer (P< 0.05); especially, SNP rs6532496 revealed the strongest association with cancer (P = 6.84 × 10⁻³); while the next best signal was rs951613 (P = 7.46 × 10⁻³). Classic logistic regression in PROC GENMOD showed that both rs6532496 and rs951613 revealed strong gene-steroid interaction effects (OR=2.18, 95% CI=1.31-3.63 with P = 2.9 × 10⁻³ for rs6532496 and OR=2.07, 95% CI=1.24-3.45 with P = 5.43 × 10⁻³ for rs951613, respectively). Results from Bayesian logistic regression showed stronger interaction effects (OR=2.26, 95% CI=1.2-3.38 for rs6532496 and OR=2.14, 95% CI=1.14-3.2 for rs951613, respectively). All the 12 SNPs associated with cancer revealed significant gene-steroid interaction effects (P < 0.05); whereas 13 SNPs showed gene-steroid interaction effects without main effect on cancer. SNP rs4634230 revealed the strongest gene-steroid interaction effect (OR=2.49, 95% CI=1.5-4.13 with P = 4.0 × 10⁻⁴ based on the classic logistic regression and OR=2.59, 95% CI=1.4-3.97 from Bayesian logistic regression; respectively). This study provides evidence of common genetic variants within the PDLIM5 gene and interactions between PLDIM5 gene polymorphisms and steroid use influencing cancer.

  3. Registered dietitian's personal beliefs and characteristics predict their teaching or intention to teach fresh vegetable food safety.

    PubMed

    Casagrande, Gina; LeJeune, Jeffery; Belury, Martha A; Medeiros, Lydia C

    2011-04-01

    The Theory of Planned Behavior was used to determine if dietitians personal characteristics and beliefs about fresh vegetable food safety predict whether they currently teach, intend to teach, or neither currently teach nor intend to teach food safety information to their clients. Dietitians who participated in direct client education responded to this web-based survey (n=327). The survey evaluated three independent belief variables: Subjective Norm, Attitudes, and Perceived Behavioral Control. Spearman rho correlations were completed to determine variables that correlated best with current teaching behavior. Multinomial logistical regression was conducted to determine if the belief variables significantly predicted dietitians teaching behavior. Binary logistic regression was used to determine which independent variable was the better predictor of whether dietitians currently taught. Controlling for age, income, education, and gender, the multinomial logistical regression was significant. Perceived behavioral control was the best predictor of whether a dietitian currently taught fresh vegetable food safety. Factors affecting whether dietitians currently taught were confidence in fresh vegetable food safety knowledge, being socially influenced, and a positive attitude toward the teaching behavior. These results validate the importance of teaching food safety effectively and may be used to create more informed food safety curriculum for dietitians. Copyright © 2011 Elsevier Ltd. All rights reserved.

  4. Early warnings for suicide attempt among Chinese rural population.

    PubMed

    Lyu, Juncheng; Wang, Yingying; Shi, Hong; Zhang, Jie

    2018-06-05

    This study was to explore the main influencing factors of attempted suicide and establish an early warning model, so as to put forward prevention strategies for attempted suicide. Data came from a large-scale case-control epidemiological survey. A sample of 659 serious suicide attempters was randomly recruited from 13 rural counties in China. Each case was matched by a community control for gender, age, and residence location. Face to face interviews were conducted for all the cases and controls with the same structured questionnaire. Univariate logistic regression was applied to screen the factors and multivariate logistic regression was used to excavate the predictors. There were no statistical differences between suicide attempters and the community controls in gender, age, and residence location. The Cronbach`s coefficients for all the scales used were above 0.675. The multivariate logistic regressions have revealed 12 statistically significant variables predicting attempted suicide, including less education, family history of suicide, poor health, mental problem, aspiration strain, hopelessness, impulsivity, depression, negative life events. On the other hand, social support, coping skills, and healthy community protected the rural residents from suicide attempt. The excavated warning predictors are significant clinical meaning for the clinical psychiatrist. Crisis intervention strategies in rural China should be informed by the findings from this research. Education, social support, healthy community, and strain reduction are all measures to decrease the likelihood of crises. Copyright © 2018. Published by Elsevier B.V.

  5. Impact of Contextual Factors on Prostate Cancer Risk and Outcomes

    DTIC Science & Technology

    2013-07-01

    framework with random effects (“frailty models”) while the case-control analyses (Aim 4) will use multilevel unconditional logistic regression models...contextual-level SES on prostate cancer risk within racial/ethnic groups. The survival analyses (Aims 1-3) will utilize a proportional hazards regression

  6. Comparison of multinomial logistic regression and logistic regression: which is more efficient in allocating land use?

    NASA Astrophysics Data System (ADS)

    Lin, Yingzhi; Deng, Xiangzheng; Li, Xing; Ma, Enjun

    2014-12-01

    Spatially explicit simulation of land use change is the basis for estimating the effects of land use and cover change on energy fluxes, ecology and the environment. At the pixel level, logistic regression is one of the most common approaches used in spatially explicit land use allocation models to determine the relationship between land use and its causal factors in driving land use change, and thereby to evaluate land use suitability. However, these models have a drawback in that they do not determine/allocate land use based on the direct relationship between land use change and its driving factors. Consequently, a multinomial logistic regression method was introduced to address this flaw, and thereby, judge the suitability of a type of land use in any given pixel in a case study area of the Jiangxi Province, China. A comparison of the two regression methods indicated that the proportion of correctly allocated pixels using multinomial logistic regression was 92.98%, which was 8.47% higher than that obtained using logistic regression. Paired t-test results also showed that pixels were more clearly distinguished by multinomial logistic regression than by logistic regression. In conclusion, multinomial logistic regression is a more efficient and accurate method for the spatial allocation of land use changes. The application of this method in future land use change studies may improve the accuracy of predicting the effects of land use and cover change on energy fluxes, ecology, and environment.

  7. Carotid artery intima-media complex thickening in patients with relatively long-surviving type 1 diabetes mellitus.

    PubMed

    Distiller, Larry A; Joffe, Barry I; Melville, Vanessa; Welman, Tania; Distiller, Greg B

    2006-01-01

    The factors responsible for premature coronary atherosclerosis in patients with type 1 diabetes are ill defined. We therefore assessed carotid intima-media complex thickness (IMT) in relatively long-surviving patients with type 1 diabetes as a marker of atherosclerosis and correlated this with traditional risk factors. Cross-sectional study of 148 patients with relatively long-surviving (>18 years) type 1 diabetes (76 men and 72 women) attending the Centre for Diabetes and Endocrinology, Johannesburg. The mean common carotid artery IMT and presence or absence of plaque was evaluated by high-resolution B-mode ultrasound. Their median age was 48 years and duration of diabetes 26 years (range 18-59 years). Traditional risk factors (age, duration of diabetes, glycemic control, hypertension, smoking and lipoprotein concentrations) were recorded. Three response variables were defined and modeled. Standard multiple regression was used for a continuous IMT variable, logistic regression for the presence/absence of plaque and ordinal logistic regression to model three categories of "risk." The median common carotid IMT was 0.62 mm (range 0.44-1.23 mm) with plaque detected in 28 cases. The multiple regression model found significant associations between IMT and current age (P=.001), duration of diabetes (P=.033), BMI (P=.008) and diagnosed hypertension (P=.046) with HDL showing a protective effect (P=.022). Current age (P=.001) and diagnosed hypertension (P=.004), smoking (P=.008) and retinopathy (P=.033) were significant in the logistic regression model. Current age was also significant in the ordinal logistic regression model (P<.001), as was total cholesterol/HDL ratio (P<.001) and mean HbA(1c) concentration (P=.073). The major factors influencing common carotid IMT in patients with relatively long-surviving type 1 diabetes are age, duration of diabetes, existing hypertension and HDL (protective) with a relatively minor role ascribed to relatively long-standing glycemic control.

  8. Application of logistic regression to case-control association studies involving two causative loci.

    PubMed

    North, Bernard V; Curtis, David; Sham, Pak C

    2005-01-01

    Models in which two susceptibility loci jointly influence the risk of developing disease can be explored using logistic regression analysis. Comparison of likelihoods of models incorporating different sets of disease model parameters allows inferences to be drawn regarding the nature of the joint effect of the loci. We have simulated case-control samples generated assuming different two-locus models and then analysed them using logistic regression. We show that this method is practicable and that, for the models we have used, it can be expected to allow useful inferences to be drawn from sample sizes consisting of hundreds of subjects. Interactions between loci can be explored, but interactive effects do not exactly correspond with classical definitions of epistasis. We have particularly examined the issue of the extent to which it is helpful to utilise information from a previously identified locus when investigating a second, unknown locus. We show that for some models conditional analysis can have substantially greater power while for others unconditional analysis can be more powerful. Hence we conclude that in general both conditional and unconditional analyses should be performed when searching for additional loci.

  9. Risk factors for highly pathogenic avian influenza in commercial layer chicken farms in bangladesh during 2011.

    PubMed

    Osmani, M G; Thornton, R N; Dhand, N K; Hoque, M A; Milon, Sk M A; Kalam, M A; Hossain, M; Yamage, M

    2014-12-01

    A case-control study conducted during 2011 involved 90 randomly selected commercial layer farms infected with highly pathogenic avian influenza type A subtype H5N1 (HPAI) and 175 control farms randomly selected from within 5 km of infected farms. A questionnaire was designed to obtain information about potential risk factors for contracting HPAI and was administered to farm owners or managers. Logistic regression analyses were conducted to identify significant risk factors. A total of 20 of 43 risk factors for contracting HPAI were identified after univariable logistic regression analysis. A multivariable logistic regression model was derived by forward stepwise selection. Both unmatched and matched analyses were performed. The key risk factors identified were numbers of staff, frequency of veterinary visits, presence of village chickens roaming on the farm and staff trading birds. Aggregating these findings with those from other studies resulted in a list of 16 key risk factors identified in Bangladesh. Most of these related to biosecurity. It is considered feasible for Bangladesh to achieve a very low incidence of HPAI. Using the cumulative list of risk factors to enhance biosecurity pertaining to commercial farms would facilitate this objective. © 2013 Blackwell Verlag GmbH.

  10. Network-based regularization for matched case-control analysis of high-dimensional DNA methylation data.

    PubMed

    Sun, Hokeun; Wang, Shuang

    2013-05-30

    The matched case-control designs are commonly used to control for potential confounding factors in genetic epidemiology studies especially epigenetic studies with DNA methylation. Compared with unmatched case-control studies with high-dimensional genomic or epigenetic data, there have been few variable selection methods for matched sets. In an earlier paper, we proposed the penalized logistic regression model for the analysis of unmatched DNA methylation data using a network-based penalty. However, for popularly applied matched designs in epigenetic studies that compare DNA methylation between tumor and adjacent non-tumor tissues or between pre-treatment and post-treatment conditions, applying ordinary logistic regression ignoring matching is known to bring serious bias in estimation. In this paper, we developed a penalized conditional logistic model using the network-based penalty that encourages a grouping effect of (1) linked Cytosine-phosphate-Guanine (CpG) sites within a gene or (2) linked genes within a genetic pathway for analysis of matched DNA methylation data. In our simulation studies, we demonstrated the superiority of using conditional logistic model over unconditional logistic model in high-dimensional variable selection problems for matched case-control data. We further investigated the benefits of utilizing biological group or graph information for matched case-control data. We applied the proposed method to a genome-wide DNA methylation study on hepatocellular carcinoma (HCC) where we investigated the DNA methylation levels of tumor and adjacent non-tumor tissues from HCC patients by using the Illumina Infinium HumanMethylation27 Beadchip. Several new CpG sites and genes known to be related to HCC were identified but were missed by the standard method in the original paper. Copyright © 2012 John Wiley & Sons, Ltd.

  11. Standards for Standardized Logistic Regression Coefficients

    ERIC Educational Resources Information Center

    Menard, Scott

    2011-01-01

    Standardized coefficients in logistic regression analysis have the same utility as standardized coefficients in linear regression analysis. Although there has been no consensus on the best way to construct standardized logistic regression coefficients, there is now sufficient evidence to suggest a single best approach to the construction of a…

  12. A Bayesian goodness of fit test and semiparametric generalization of logistic regression with measurement data.

    PubMed

    Schörgendorfer, Angela; Branscum, Adam J; Hanson, Timothy E

    2013-06-01

    Logistic regression is a popular tool for risk analysis in medical and population health science. With continuous response data, it is common to create a dichotomous outcome for logistic regression analysis by specifying a threshold for positivity. Fitting a linear regression to the nondichotomized response variable assuming a logistic sampling model for the data has been empirically shown to yield more efficient estimates of odds ratios than ordinary logistic regression of the dichotomized endpoint. We illustrate that risk inference is not robust to departures from the parametric logistic distribution. Moreover, the model assumption of proportional odds is generally not satisfied when the condition of a logistic distribution for the data is violated, leading to biased inference from a parametric logistic analysis. We develop novel Bayesian semiparametric methodology for testing goodness of fit of parametric logistic regression with continuous measurement data. The testing procedures hold for any cutoff threshold and our approach simultaneously provides the ability to perform semiparametric risk estimation. Bayes factors are calculated using the Savage-Dickey ratio for testing the null hypothesis of logistic regression versus a semiparametric generalization. We propose a fully Bayesian and a computationally efficient empirical Bayesian approach to testing, and we present methods for semiparametric estimation of risks, relative risks, and odds ratios when parametric logistic regression fails. Theoretical results establish the consistency of the empirical Bayes test. Results from simulated data show that the proposed approach provides accurate inference irrespective of whether parametric assumptions hold or not. Evaluation of risk factors for obesity shows that different inferences are derived from an analysis of a real data set when deviations from a logistic distribution are permissible in a flexible semiparametric framework. © 2013, The International Biometric Society.

  13. Propensity score estimation: machine learning and classification methods as alternatives to logistic regression

    PubMed Central

    Westreich, Daniel; Lessler, Justin; Funk, Michele Jonsson

    2010-01-01

    Summary Objective Propensity scores for the analysis of observational data are typically estimated using logistic regression. Our objective in this Review was to assess machine learning alternatives to logistic regression which may accomplish the same goals but with fewer assumptions or greater accuracy. Study Design and Setting We identified alternative methods for propensity score estimation and/or classification from the public health, biostatistics, discrete mathematics, and computer science literature, and evaluated these algorithms for applicability to the problem of propensity score estimation, potential advantages over logistic regression, and ease of use. Results We identified four techniques as alternatives to logistic regression: neural networks, support vector machines, decision trees (CART), and meta-classifiers (in particular, boosting). Conclusion While the assumptions of logistic regression are well understood, those assumptions are frequently ignored. All four alternatives have advantages and disadvantages compared with logistic regression. Boosting (meta-classifiers) and to a lesser extent decision trees (particularly CART) appear to be most promising for use in the context of propensity score analysis, but extensive simulation studies are needed to establish their utility in practice. PMID:20630332

  14. Robust mislabel logistic regression without modeling mislabel probabilities.

    PubMed

    Hung, Hung; Jou, Zhi-Yu; Huang, Su-Yun

    2018-03-01

    Logistic regression is among the most widely used statistical methods for linear discriminant analysis. In many applications, we only observe possibly mislabeled responses. Fitting a conventional logistic regression can then lead to biased estimation. One common resolution is to fit a mislabel logistic regression model, which takes into consideration of mislabeled responses. Another common method is to adopt a robust M-estimation by down-weighting suspected instances. In this work, we propose a new robust mislabel logistic regression based on γ-divergence. Our proposal possesses two advantageous features: (1) It does not need to model the mislabel probabilities. (2) The minimum γ-divergence estimation leads to a weighted estimating equation without the need to include any bias correction term, that is, it is automatically bias-corrected. These features make the proposed γ-logistic regression more robust in model fitting and more intuitive for model interpretation through a simple weighting scheme. Our method is also easy to implement, and two types of algorithms are included. Simulation studies and the Pima data application are presented to demonstrate the performance of γ-logistic regression. © 2017, The International Biometric Society.

  15. Propensity score estimation: neural networks, support vector machines, decision trees (CART), and meta-classifiers as alternatives to logistic regression.

    PubMed

    Westreich, Daniel; Lessler, Justin; Funk, Michele Jonsson

    2010-08-01

    Propensity scores for the analysis of observational data are typically estimated using logistic regression. Our objective in this review was to assess machine learning alternatives to logistic regression, which may accomplish the same goals but with fewer assumptions or greater accuracy. We identified alternative methods for propensity score estimation and/or classification from the public health, biostatistics, discrete mathematics, and computer science literature, and evaluated these algorithms for applicability to the problem of propensity score estimation, potential advantages over logistic regression, and ease of use. We identified four techniques as alternatives to logistic regression: neural networks, support vector machines, decision trees (classification and regression trees [CART]), and meta-classifiers (in particular, boosting). Although the assumptions of logistic regression are well understood, those assumptions are frequently ignored. All four alternatives have advantages and disadvantages compared with logistic regression. Boosting (meta-classifiers) and, to a lesser extent, decision trees (particularly CART), appear to be most promising for use in the context of propensity score analysis, but extensive simulation studies are needed to establish their utility in practice. Copyright (c) 2010 Elsevier Inc. All rights reserved.

  16. Genetic variation of the androgen receptor and risk of myocardial infarction and ischemic stroke in women.

    PubMed

    Rexrode, Kathryn M; Ridker, Paul M; Hegener, Hillary H; Buring, Julie E; Manson, JoAnn E; Zee, Robert Y L

    2008-05-01

    Androgen receptors (AR) are expressed in endothelial cells and vascular smooth-muscle cells. Some studies suggest an association between AR gene variation and risk of cardiovascular disease (CVD) in men; however, the relationship has not been examined in women. Six haplotype block-tagging single nucleotide polymorphisms (rs962458, rs6152, rs1204038, rs2361634, rs1337080, rs1337082), as well as the cysteine, adenine, guanine (CAG) microsatellite in exon 1, of the AR gene were evaluated among 300 white postmenopausal women who developed CVD (158 myocardial infarctions and 142 ischemic strokes) and an equal number of matched controls within the Women's Health Study. Genotype distributions were similar between cases and controls, and genotypes were not significantly related to risk of CVD, myocardial infarctions or ischemic stroke in conditional logistic regression models. Seven common haplotypes were observed, but distributions did not differ between cases and controls nor were significant associations observed in logistic regression analysis. The median CAG repeat length was 21. In conditional logistic regression, there was no association between the number of alleles with CAG repeat length >or=21 (or >or=22) and risk of CVD, myocardial infarctions or ischemic stroke. No association between AR genetic variation, as measured by haplotype-tagging single nucleotide polymorphisms and CAG repeat number, and risk of CVD was observed in women.

  17. London Measure of Unplanned Pregnancy: guidance for its use as an outcome measure

    PubMed Central

    Hall, Jennifer A; Barrett, Geraldine; Copas, Andrew; Stephenson, Judith

    2017-01-01

    Background The London Measure of Unplanned Pregnancy (LMUP) is a psychometrically validated measure of the degree of intention of a current or recent pregnancy. The LMUP is increasingly being used worldwide, and can be used to evaluate family planning or preconception care programs. However, beyond recommending the use of the full LMUP scale, there is no published guidance on how to use the LMUP as an outcome measure. Ordinal logistic regression has been recommended informally, but studies published to date have all used binary logistic regression and dichotomized the scale at different cut points. There is thus a need for evidence-based guidance to provide a standardized methodology for multivariate analysis and to enable comparison of results. This paper makes recommendations for the regression method for analysis of the LMUP as an outcome measure. Materials and methods Data collected from 4,244 pregnant women in Malawi were used to compare five regression methods: linear, logistic with two cut points, and ordinal logistic with either the full or grouped LMUP score. The recommendations were then tested on the original UK LMUP data. Results There were small but no important differences in the findings across the regression models. Logistic regression resulted in the largest loss of information, and assumptions were violated for the linear and ordinal logistic regression. Consequently, robust standard errors were used for linear regression and a partial proportional odds ordinal logistic regression model attempted. The latter could only be fitted for grouped LMUP score. Conclusion We recommend the linear regression model with robust standard errors to make full use of the LMUP score when analyzed as an outcome measure. Ordinal logistic regression could be considered, but a partial proportional odds model with grouped LMUP score may be required. Logistic regression is the least-favored option, due to the loss of information. For logistic regression, the cut point for un/planned pregnancy should be between nine and ten. These recommendations will standardize the analysis of LMUP data and enhance comparability of results across studies. PMID:28435343

  18. Logistic models--an odd(s) kind of regression.

    PubMed

    Jupiter, Daniel C

    2013-01-01

    The logistic regression model bears some similarity to the multivariable linear regression with which we are familiar. However, the differences are great enough to warrant a discussion of the need for and interpretation of logistic regression. Copyright © 2013 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.

  19. Computing group cardinality constraint solutions for logistic regression problems.

    PubMed

    Zhang, Yong; Kwon, Dongjin; Pohl, Kilian M

    2017-01-01

    We derive an algorithm to directly solve logistic regression based on cardinality constraint, group sparsity and use it to classify intra-subject MRI sequences (e.g. cine MRIs) of healthy from diseased subjects. Group cardinality constraint models are often applied to medical images in order to avoid overfitting of the classifier to the training data. Solutions within these models are generally determined by relaxing the cardinality constraint to a weighted feature selection scheme. However, these solutions relate to the original sparse problem only under specific assumptions, which generally do not hold for medical image applications. In addition, inferring clinical meaning from features weighted by a classifier is an ongoing topic of discussion. Avoiding weighing features, we propose to directly solve the group cardinality constraint logistic regression problem by generalizing the Penalty Decomposition method. To do so, we assume that an intra-subject series of images represents repeated samples of the same disease patterns. We model this assumption by combining series of measurements created by a feature across time into a single group. Our algorithm then derives a solution within that model by decoupling the minimization of the logistic regression function from enforcing the group sparsity constraint. The minimum to the smooth and convex logistic regression problem is determined via gradient descent while we derive a closed form solution for finding a sparse approximation of that minimum. We apply our method to cine MRI of 38 healthy controls and 44 adult patients that received reconstructive surgery of Tetralogy of Fallot (TOF) during infancy. Our method correctly identifies regions impacted by TOF and generally obtains statistically significant higher classification accuracy than alternative solutions to this model, i.e., ones relaxing group cardinality constraints. Copyright © 2016 Elsevier B.V. All rights reserved.

  20. Methodologic considerations in the design and analysis of nested case-control studies: association between cytokines and postoperative delirium.

    PubMed

    Ngo, Long H; Inouye, Sharon K; Jones, Richard N; Travison, Thomas G; Libermann, Towia A; Dillon, Simon T; Kuchel, George A; Vasunilashorn, Sarinnapha M; Alsop, David C; Marcantonio, Edward R

    2017-06-06

    The nested case-control study (NCC) design within a prospective cohort study is used when outcome data are available for all subjects, but the exposure of interest has not been collected, and is difficult or prohibitively expensive to obtain for all subjects. A NCC analysis with good matching procedures yields estimates that are as efficient and unbiased as estimates from the full cohort study. We present methodological considerations in a matched NCC design and analysis, which include the choice of match algorithms, analysis methods to evaluate the association of exposures of interest with outcomes, and consideration of overmatching. Matched, NCC design within a longitudinal observational prospective cohort study in the setting of two academic hospitals. Study participants are patients aged over 70 years who underwent scheduled major non-cardiac surgery. The primary outcome was postoperative delirium from in-hospital interviews and medical record review. The main exposure was IL-6 concentration (pg/ml) from blood sampled at three time points before delirium occurred. We used nonparametric signed ranked test to test for the median of the paired differences. We used conditional logistic regression to model the risk of IL-6 on delirium incidence. Simulation was used to generate a sample of cohort data on which unconditional multivariable logistic regression was used, and the results were compared to those of the conditional logistic regression. Partial R-square was used to assess the level of overmatching. We found that the optimal match algorithm yielded more matched pairs than the greedy algorithm. The choice of analytic strategy-whether to consider measured cytokine levels as the predictor or outcome-- yielded inferences that have different clinical interpretations but similar levels of statistical significance. Estimation results from NCC design using conditional logistic regression, and from simulated cohort design using unconditional logistic regression, were similar. We found minimal evidence for overmatching. Using a matched NCC approach introduces methodological challenges into the study design and data analysis. Nonetheless, with careful selection of the match algorithm, match factors, and analysis methods, this design is cost effective and, for our study, yields estimates that are similar to those from a prospective cohort study design.

  1. The association between short interpregnancy interval and preterm birth in Louisiana: a comparison of methods.

    PubMed

    Howard, Elizabeth J; Harville, Emily; Kissinger, Patricia; Xiong, Xu

    2013-07-01

    There is growing interest in the application of propensity scores (PS) in epidemiologic studies, especially within the field of reproductive epidemiology. This retrospective cohort study assesses the impact of a short interpregnancy interval (IPI) on preterm birth and compares the results of the conventional logistic regression analysis with analyses utilizing a PS. The study included 96,378 singleton infants from Louisiana birth certificate data (1995-2007). Five regression models designed for methods comparison are presented. Ten percent (10.17 %) of all births were preterm; 26.83 % of births were from a short IPI. The PS-adjusted model produced a more conservative estimate of the exposure variable compared to the conventional logistic regression method (β-coefficient: 0.21 vs. 0.43), as well as a smaller standard error (0.024 vs. 0.028), odds ratio and 95 % confidence intervals [1.15 (1.09, 1.20) vs. 1.23 (1.17, 1.30)]. The inclusion of more covariate and interaction terms in the PS did not change the estimates of the exposure variable. This analysis indicates that PS-adjusted regression may be appropriate for validation of conventional methods in a large dataset with a fairly common outcome. PS's may be beneficial in producing more precise estimates, especially for models with many confounders and effect modifiers and where conventional adjustment with logistic regression is unsatisfactory. Short intervals between pregnancies are associated with preterm birth in this population, according to either technique. Birth spacing is an issue that women have some control over. Educational interventions, including birth control, should be applied during prenatal visits and following delivery.

  2. Personality predicts time to remission and clinical status in hypochondriasis during a 6-year follow-up.

    PubMed

    Greeven, Anja; van Balkom, Anton J L M; Spinhoven, Philip

    2014-05-01

    We aimed to investigate whether personality characteristics predict time to remission and psychiatric status. The follow-up was at most 6 years and was performed within the scope of a randomized controlled trial that investigated the efficacy of cognitive behavioral therapy, paroxetine, and placebo in hypochondriasis. The Life Chart Interview was administered to investigate for each year if remission had occurred. Personality was assessed at pretest by the Abbreviated Dutch Temperament and Character Inventory. Cox's regression models for recurrent events were compared with logistic regression models. Sixteen (36.4%) of 44 patients achieved remission during the follow-up period. Cox's regression yielded approximately the same results as the logistic regression. Being less harm avoidant and more cooperative were associated with a shorter time to remission and a remitted state after the follow-up period. Personality variables seem to be relevant for describing patients with a more chronic course of hypochondriacal complaints.

  3. Parameters Estimation of Geographically Weighted Ordinal Logistic Regression (GWOLR) Model

    NASA Astrophysics Data System (ADS)

    Zuhdi, Shaifudin; Retno Sari Saputro, Dewi; Widyaningsih, Purnami

    2017-06-01

    A regression model is the representation of relationship between independent variable and dependent variable. The dependent variable has categories used in the logistic regression model to calculate odds on. The logistic regression model for dependent variable has levels in the logistics regression model is ordinal. GWOLR model is an ordinal logistic regression model influenced the geographical location of the observation site. Parameters estimation in the model needed to determine the value of a population based on sample. The purpose of this research is to parameters estimation of GWOLR model using R software. Parameter estimation uses the data amount of dengue fever patients in Semarang City. Observation units used are 144 villages in Semarang City. The results of research get GWOLR model locally for each village and to know probability of number dengue fever patient categories.

  4. Genetic prediction of type 2 diabetes using deep neural network.

    PubMed

    Kim, J; Kim, J; Kwak, M J; Bajaj, M

    2018-04-01

    Type 2 diabetes (T2DM) has strong heritability but genetic models to explain heritability have been challenging. We tested deep neural network (DNN) to predict T2DM using the nested case-control study of Nurses' Health Study (3326 females, 45.6% T2DM) and Health Professionals Follow-up Study (2502 males, 46.5% T2DM). We selected 96, 214, 399, and 678 single-nucleotide polymorphism (SNPs) through Fisher's exact test and L1-penalized logistic regression. We split each dataset randomly in 4:1 to train prediction models and test their performance. DNN and logistic regressions showed better area under the curve (AUC) of ROC curves than the clinical model when 399 or more SNPs included. DNN was superior than logistic regressions in AUC with 399 or more SNPs in male and 678 SNPs in female. Addition of clinical factors consistently increased AUC of DNN but failed to improve logistic regressions with 214 or more SNPs. In conclusion, we show that DNN can be a versatile tool to predict T2DM incorporating large numbers of SNPs and clinical information. Limitations include a relatively small number of the subjects mostly of European ethnicity. Further studies are warranted to confirm and improve performance of genetic prediction models using DNN in different ethnic groups. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  5. PARAMETRIC AND NON PARAMETRIC (MARS: MULTIVARIATE ADDITIVE REGRESSION SPLINES) LOGISTIC REGRESSIONS FOR PREDICTION OF A DICHOTOMOUS RESPONSE VARIABLE WITH AN EXAMPLE FOR PRESENCE/ABSENCE OF AMPHIBIANS

    EPA Science Inventory

    The purpose of this report is to provide a reference manual that could be used by investigators for making informed use of logistic regression using two methods (standard logistic regression and MARS). The details for analyses of relationships between a dependent binary response ...

  6. Predicting U.S. Army Reserve Unit Manning Using Market Demographics

    DTIC Science & Technology

    2015-06-01

    develops linear regression , classification tree, and logistic regression models to determine the ability of the location to support manning requirements... logistic regression model delivers predictive results that allow decision-makers to identify locations with a high probability of meeting unit...manning requirements. The recommendation of this thesis is that the USAR implement the logistic regression model. 14. SUBJECT TERMS U.S

  7. Analyzing Student Learning Outcomes: Usefulness of Logistic and Cox Regression Models. IR Applications, Volume 5

    ERIC Educational Resources Information Center

    Chen, Chau-Kuang

    2005-01-01

    Logistic and Cox regression methods are practical tools used to model the relationships between certain student learning outcomes and their relevant explanatory variables. The logistic regression model fits an S-shaped curve into a binary outcome with data points of zero and one. The Cox regression model allows investigators to study the duration…

  8. An appraisal of convergence failures in the application of logistic regression model in published manuscripts.

    PubMed

    Yusuf, O B; Bamgboye, E A; Afolabi, R F; Shodimu, M A

    2014-09-01

    Logistic regression model is widely used in health research for description and predictive purposes. Unfortunately, most researchers are sometimes not aware that the underlying principles of the techniques have failed when the algorithm for maximum likelihood does not converge. Young researchers particularly postgraduate students may not know why separation problem whether quasi or complete occurs, how to identify it and how to fix it. This study was designed to critically evaluate convergence issues in articles that employed logistic regression analysis published in an African Journal of Medicine and medical sciences between 2004 and 2013. Problems of quasi or complete separation were described and were illustrated with the National Demographic and Health Survey dataset. A critical evaluation of articles that employed logistic regression was conducted. A total of 581 articles was reviewed, of which 40 (6.9%) used binary logistic regression. Twenty-four (60.0%) stated the use of logistic regression model in the methodology while none of the articles assessed model fit. Only 3 (12.5%) properly described the procedures. Of the 40 that used the logistic regression model, the problem of convergence occurred in 6 (15.0%) of the articles. Logistic regression tends to be poorly reported in studies published between 2004 and 2013. Our findings showed that the procedure may not be well understood by researchers since very few described the process in their reports and may be totally unaware of the problem of convergence or how to deal with it.

  9. Prescription-drug-related risk in driving: comparing conventional and lasso shrinkage logistic regressions.

    PubMed

    Avalos, Marta; Adroher, Nuria Duran; Lagarde, Emmanuel; Thiessard, Frantz; Grandvalet, Yves; Contrand, Benjamin; Orriols, Ludivine

    2012-09-01

    Large data sets with many variables provide particular challenges when constructing analytic models. Lasso-related methods provide a useful tool, although one that remains unfamiliar to most epidemiologists. We illustrate the application of lasso methods in an analysis of the impact of prescribed drugs on the risk of a road traffic crash, using a large French nationwide database (PLoS Med 2010;7:e1000366). In the original case-control study, the authors analyzed each exposure separately. We use the lasso method, which can simultaneously perform estimation and variable selection in a single model. We compare point estimates and confidence intervals using (1) a separate logistic regression model for each drug with a Bonferroni correction and (2) lasso shrinkage logistic regression analysis. Shrinkage regression had little effect on (bias corrected) point estimates, but led to less conservative results, noticeably for drugs with moderate levels of exposure. Carbamates, carboxamide derivative and fatty acid derivative antiepileptics, drugs used in opioid dependence, and mineral supplements of potassium showed stronger associations. Lasso is a relevant method in the analysis of databases with large number of exposures and can be recommended as an alternative to conventional strategies.

  10. Logistic Regression: Concept and Application

    ERIC Educational Resources Information Center

    Cokluk, Omay

    2010-01-01

    The main focus of logistic regression analysis is classification of individuals in different groups. The aim of the present study is to explain basic concepts and processes of binary logistic regression analysis intended to determine the combination of independent variables which best explain the membership in certain groups called dichotomous…

  11. Remote sensing and GIS-based landslide hazard analysis and cross-validation using multivariate logistic regression model on three test areas in Malaysia

    NASA Astrophysics Data System (ADS)

    Pradhan, Biswajeet

    2010-05-01

    This paper presents the results of the cross-validation of a multivariate logistic regression model using remote sensing data and GIS for landslide hazard analysis on the Penang, Cameron, and Selangor areas in Malaysia. Landslide locations in the study areas were identified by interpreting aerial photographs and satellite images, supported by field surveys. SPOT 5 and Landsat TM satellite imagery were used to map landcover and vegetation index, respectively. Maps of topography, soil type, lineaments and land cover were constructed from the spatial datasets. Ten factors which influence landslide occurrence, i.e., slope, aspect, curvature, distance from drainage, lithology, distance from lineaments, soil type, landcover, rainfall precipitation, and normalized difference vegetation index (ndvi), were extracted from the spatial database and the logistic regression coefficient of each factor was computed. Then the landslide hazard was analysed using the multivariate logistic regression coefficients derived not only from the data for the respective area but also using the logistic regression coefficients calculated from each of the other two areas (nine hazard maps in all) as a cross-validation of the model. For verification of the model, the results of the analyses were then compared with the field-verified landslide locations. Among the three cases of the application of logistic regression coefficient in the same study area, the case of Selangor based on the Selangor logistic regression coefficients showed the highest accuracy (94%), where as Penang based on the Penang coefficients showed the lowest accuracy (86%). Similarly, among the six cases from the cross application of logistic regression coefficient in other two areas, the case of Selangor based on logistic coefficient of Cameron showed highest (90%) prediction accuracy where as the case of Penang based on the Selangor logistic regression coefficients showed the lowest accuracy (79%). Qualitatively, the cross application model yields reasonable results which can be used for preliminary landslide hazard mapping.

  12. An Entropy-Based Measure for Assessing Fuzziness in Logistic Regression

    PubMed Central

    Weiss, Brandi A.; Dardick, William

    2015-01-01

    This article introduces an entropy-based measure of data–model fit that can be used to assess the quality of logistic regression models. Entropy has previously been used in mixture-modeling to quantify how well individuals are classified into latent classes. The current study proposes the use of entropy for logistic regression models to quantify the quality of classification and separation of group membership. Entropy complements preexisting measures of data–model fit and provides unique information not contained in other measures. Hypothetical data scenarios, an applied example, and Monte Carlo simulation results are used to demonstrate the application of entropy in logistic regression. Entropy should be used in conjunction with other measures of data–model fit to assess how well logistic regression models classify cases into observed categories. PMID:29795897

  13. Large unbalanced credit scoring using Lasso-logistic regression ensemble.

    PubMed

    Wang, Hong; Xu, Qingsong; Zhou, Lifeng

    2015-01-01

    Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plausibility of ensemble learning using regularized logistic regression as the base classifier to deal with credit scoring problems. In this research, the data is first balanced and diversified by clustering and bagging algorithms. Then we apply a Lasso-logistic regression learning ensemble to evaluate the credit risks. We show that the proposed algorithm outperforms popular credit scoring models such as decision tree, Lasso-logistic regression and random forests in terms of AUC and F-measure. We also provide two importance measures for the proposed model to identify important variables in the data.

  14. An Entropy-Based Measure for Assessing Fuzziness in Logistic Regression.

    PubMed

    Weiss, Brandi A; Dardick, William

    2016-12-01

    This article introduces an entropy-based measure of data-model fit that can be used to assess the quality of logistic regression models. Entropy has previously been used in mixture-modeling to quantify how well individuals are classified into latent classes. The current study proposes the use of entropy for logistic regression models to quantify the quality of classification and separation of group membership. Entropy complements preexisting measures of data-model fit and provides unique information not contained in other measures. Hypothetical data scenarios, an applied example, and Monte Carlo simulation results are used to demonstrate the application of entropy in logistic regression. Entropy should be used in conjunction with other measures of data-model fit to assess how well logistic regression models classify cases into observed categories.

  15. [A case-control study on the risk factors of work-related acute pesticide poisoning among farmers from Jiangsu province].

    PubMed

    Tu, Zhi-bin; Cui, Meng-jing; Yao, Hong-yan; Hu, Guo-qing; Xiang, Hui-yun; Stallones, Lorann; Zhang, Xu-jun

    2012-04-01

    To explore the risk factors on cases regarding work-related acute pesticide poisoning among farmers of Jiangsu province. A population-based, 1:2 matched case-control study was carried out, with 121 patients as case-group paired by 242 persons with same gender, district and age less then difference of 3 years, as controls. Cases were the ones who had suffered from work-related acute pesticide poisoning. A unified questionnaire was used. Data base was established by EpiData 3.1, and SPSS 16.0 was used for both data single factor and multi-conditional logistics regression analysis. Results from the single factor logistic regression analysis showed that the related risk factors were: lack of safety guidance, lack of readable labels before praying pesticides, no regression during application, using hand to wipe sweat, using leaking knapsack, body contaminated during application and continuing to work when feeling ill after the contact of pesticides. Results from multi-conditional logistic regression analysis indicated that the lack of safety guidance (OR=2.25, 95%CI: 1.35-3.74), no readable labels before praying pesticides (OR=1.95, 95%CI: 1.19-3.18), wiping the sweat by hand during application (OR=1.97, 95%CI: 1.20-3.24) and using leaking knapsack during application (OR=1.82, 95%CI:1.10-3.01) were risk factors for the occurrence of work-related acute pesticide poisoning. The lack of safety guidance, no readable labels before praying pesticides, wiping the sweat by hand or using leaking knapsack during application were correlated to the occurrence of work-related acute pesticide poisoning.

  16. The Joint Effects of Lifestyle Factors and Comorbidities on the Risk of Colorectal Cancer: A Large Chinese Retrospective Case-Control Study

    PubMed Central

    Hu, Hai; Zhou, Yangyang; Ren, Shujuan; Wu, Jiajin; Zhu, Meiying; Chen, Donghui; Yang, Haiyan; Wang, Liwei

    2015-01-01

    Background Colorectal cancer (CRC) is a major cause of cancer morbidity and mortality. In previous epidemiologic studies, the respective correlation between lifestyle factors and comorbidity and CRC has been extensively studied. However, little is known about their joint effects on CRC. Methods We conducted a retrospective case-control study of 1,144 diagnosed CRC patients and 60,549 community controls. A structured questionnaire was administered to the participants about their socio-demographic factors, anthropometric measures, comorbidity history and lifestyle factors. Logistic regression model was used to calculate the odds ratio (ORs) and 95% confidence intervals (95%CIs) for each factor. According to the results from logistic regression model, we further developed healthy lifestyle index (HLI) and comorbidity history index (CHI) to investigate their independent and joint effects on CRC risk. Results Four lifestyle factors (including physical activities, sleep, red meat and vegetable consumption) and four types of comorbidity (including diabetes, hyperlipidemia, history of inflammatory bowel disease and polyps) were found to be independently associated with the risk of CRC in multivariant logistic regression model. Intriguingly, their combined pattern- HLI and CHI demonstrated significant correlation with CRC risk independently (ORHLI: 3.91, 95%CI: 3.13–4.88; ORCHI: 2.49, 95%CI: 2.11–2.93) and jointly (OR: 10.33, 95%CI: 6.59–16.18). Conclusions There are synergistic effects of lifestyle factors and comorbidity on the risk of colorectal cancer in the Chinese population. PMID:26710070

  17. Power and Sample Size Calculations for Logistic Regression Tests for Differential Item Functioning

    ERIC Educational Resources Information Center

    Li, Zhushan

    2014-01-01

    Logistic regression is a popular method for detecting uniform and nonuniform differential item functioning (DIF) effects. Theoretical formulas for the power and sample size calculations are derived for likelihood ratio tests and Wald tests based on the asymptotic distribution of the maximum likelihood estimators for the logistic regression model.…

  18. A Methodology for Generating Placement Rules that Utilizes Logistic Regression

    ERIC Educational Resources Information Center

    Wurtz, Keith

    2008-01-01

    The purpose of this article is to provide the necessary tools for institutional researchers to conduct a logistic regression analysis and interpret the results. Aspects of the logistic regression procedure that are necessary to evaluate models are presented and discussed with an emphasis on cutoff values and choosing the appropriate number of…

  19. Comparison of standard maximum likelihood classification and polytomous logistic regression used in remote sensing

    Treesearch

    John Hogland; Nedret Billor; Nathaniel Anderson

    2013-01-01

    Discriminant analysis, referred to as maximum likelihood classification within popular remote sensing software packages, is a common supervised technique used by analysts. Polytomous logistic regression (PLR), also referred to as multinomial logistic regression, is an alternative classification approach that is less restrictive, more flexible, and easy to interpret. To...

  20. Large Unbalanced Credit Scoring Using Lasso-Logistic Regression Ensemble

    PubMed Central

    Wang, Hong; Xu, Qingsong; Zhou, Lifeng

    2015-01-01

    Recently, various ensemble learning methods with different base classifiers have been proposed for credit scoring problems. However, for various reasons, there has been little research using logistic regression as the base classifier. In this paper, given large unbalanced data, we consider the plausibility of ensemble learning using regularized logistic regression as the base classifier to deal with credit scoring problems. In this research, the data is first balanced and diversified by clustering and bagging algorithms. Then we apply a Lasso-logistic regression learning ensemble to evaluate the credit risks. We show that the proposed algorithm outperforms popular credit scoring models such as decision tree, Lasso-logistic regression and random forests in terms of AUC and F-measure. We also provide two importance measures for the proposed model to identify important variables in the data. PMID:25706988

  1. Semiparametric time varying coefficient model for matched case-crossover studies.

    PubMed

    Ortega-Villa, Ana Maria; Kim, Inyoung; Kim, H

    2017-03-15

    In matched case-crossover studies, it is generally accepted that the covariates on which a case and associated controls are matched cannot exert a confounding effect on independent predictors included in the conditional logistic regression model. This is because any stratum effect is removed by the conditioning on the fixed number of sets of the case and controls in the stratum. Hence, the conditional logistic regression model is not able to detect any effects associated with the matching covariates by stratum. However, some matching covariates such as time often play an important role as an effect modification leading to incorrect statistical estimation and prediction. Therefore, we propose three approaches to evaluate effect modification by time. The first is a parametric approach, the second is a semiparametric penalized approach, and the third is a semiparametric Bayesian approach. Our parametric approach is a two-stage method, which uses conditional logistic regression in the first stage and then estimates polynomial regression in the second stage. Our semiparametric penalized and Bayesian approaches are one-stage approaches developed by using regression splines. Our semiparametric one stage approach allows us to not only detect the parametric relationship between the predictor and binary outcomes, but also evaluate nonparametric relationships between the predictor and time. We demonstrate the advantage of our semiparametric one-stage approaches using both a simulation study and an epidemiological example of a 1-4 bi-directional case-crossover study of childhood aseptic meningitis with drinking water turbidity. We also provide statistical inference for the semiparametric Bayesian approach using Bayes Factors. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

  2. Automobile age and risk : summary

    DOT National Transportation Integrated Search

    1998-03-01

    The partial relationship between automobile age and risk is studied by means of logistic regression as applied to a large insurance policy data set. Annual mileage and car owner's gender, age and county of residence are controlled for.

  3. An Entropy-Based Measure for Assessing Fuzziness in Logistic Regression

    ERIC Educational Resources Information Center

    Weiss, Brandi A.; Dardick, William

    2016-01-01

    This article introduces an entropy-based measure of data-model fit that can be used to assess the quality of logistic regression models. Entropy has previously been used in mixture-modeling to quantify how well individuals are classified into latent classes. The current study proposes the use of entropy for logistic regression models to quantify…

  4. What Are the Odds of that? A Primer on Understanding Logistic Regression

    ERIC Educational Resources Information Center

    Huang, Francis L.; Moon, Tonya R.

    2013-01-01

    The purpose of this Methodological Brief is to present a brief primer on logistic regression, a commonly used technique when modeling dichotomous outcomes. Using data from the National Education Longitudinal Study of 1988 (NELS:88), logistic regression techniques were used to investigate student-level variables in eighth grade (i.e., enrolled in a…

  5. A Randomized Controlled Study of the Virginia Student Threat Assessment Guidelines in Kindergarten through Grade 12

    ERIC Educational Resources Information Center

    Cornell, Dewey G.; Allen, Korrie; Fan, Xitao

    2012-01-01

    This randomized controlled study examined disciplinary outcomes for 201 students who made threats of violence at school. The students attended 40 schools randomly assigned to use the Virginia Student Threat Assessment Guidelines or follow a business-as-usual disciplinary approach in a control group. Logistic regression analyses found, after…

  6. On the Usefulness of a Multilevel Logistic Regression Approach to Person-Fit Analysis

    ERIC Educational Resources Information Center

    Conijn, Judith M.; Emons, Wilco H. M.; van Assen, Marcel A. L. M.; Sijtsma, Klaas

    2011-01-01

    The logistic person response function (PRF) models the probability of a correct response as a function of the item locations. Reise (2000) proposed to use the slope parameter of the logistic PRF as a person-fit measure. He reformulated the logistic PRF model as a multilevel logistic regression model and estimated the PRF parameters from this…

  7. Mortality risk prediction in burn injury: Comparison of logistic regression with machine learning approaches.

    PubMed

    Stylianou, Neophytos; Akbarov, Artur; Kontopantelis, Evangelos; Buchan, Iain; Dunn, Ken W

    2015-08-01

    Predicting mortality from burn injury has traditionally employed logistic regression models. Alternative machine learning methods have been introduced in some areas of clinical prediction as the necessary software and computational facilities have become accessible. Here we compare logistic regression and machine learning predictions of mortality from burn. An established logistic mortality model was compared to machine learning methods (artificial neural network, support vector machine, random forests and naïve Bayes) using a population-based (England & Wales) case-cohort registry. Predictive evaluation used: area under the receiver operating characteristic curve; sensitivity; specificity; positive predictive value and Youden's index. All methods had comparable discriminatory abilities, similar sensitivities, specificities and positive predictive values. Although some machine learning methods performed marginally better than logistic regression the differences were seldom statistically significant and clinically insubstantial. Random forests were marginally better for high positive predictive value and reasonable sensitivity. Neural networks yielded slightly better prediction overall. Logistic regression gives an optimal mix of performance and interpretability. The established logistic regression model of burn mortality performs well against more complex alternatives. Clinical prediction with a small set of strong, stable, independent predictors is unlikely to gain much from machine learning outside specialist research contexts. Copyright © 2015 Elsevier Ltd and ISBI. All rights reserved.

  8. Bias in logistic regression due to imperfect diagnostic test results and practical correction approaches.

    PubMed

    Valle, Denis; Lima, Joanna M Tucker; Millar, Justin; Amratia, Punam; Haque, Ubydul

    2015-11-04

    Logistic regression is a statistical model widely used in cross-sectional and cohort studies to identify and quantify the effects of potential disease risk factors. However, the impact of imperfect tests on adjusted odds ratios (and thus on the identification of risk factors) is under-appreciated. The purpose of this article is to draw attention to the problem associated with modelling imperfect diagnostic tests, and propose simple Bayesian models to adequately address this issue. A systematic literature review was conducted to determine the proportion of malaria studies that appropriately accounted for false-negatives/false-positives in a logistic regression setting. Inference from the standard logistic regression was also compared with that from three proposed Bayesian models using simulations and malaria data from the western Brazilian Amazon. A systematic literature review suggests that malaria epidemiologists are largely unaware of the problem of using logistic regression to model imperfect diagnostic test results. Simulation results reveal that statistical inference can be substantially improved when using the proposed Bayesian models versus the standard logistic regression. Finally, analysis of original malaria data with one of the proposed Bayesian models reveals that microscopy sensitivity is strongly influenced by how long people have lived in the study region, and an important risk factor (i.e., participation in forest extractivism) is identified that would have been missed by standard logistic regression. Given the numerous diagnostic methods employed by malaria researchers and the ubiquitous use of logistic regression to model the results of these diagnostic tests, this paper provides critical guidelines to improve data analysis practice in the presence of misclassification error. Easy-to-use code that can be readily adapted to WinBUGS is provided, enabling straightforward implementation of the proposed Bayesian models.

  9. Deciphering factors controlling groundwater arsenic spatial variability in Bangladesh

    NASA Astrophysics Data System (ADS)

    Tan, Z.; Yang, Q.; Zheng, C.; Zheng, Y.

    2017-12-01

    Elevated concentrations of geogenic arsenic in groundwater have been found in many countries to exceed 10 μg/L, the WHO's guideline value for drinking water. A common yet unexplained characteristic of groundwater arsenic spatial distribution is the extensive variability at various spatial scales. This study investigates factors influencing the spatial variability of groundwater arsenic in Bangladesh to improve the accuracy of models predicting arsenic exceedance rate spatially. A novel boosted regression tree method is used to establish a weak-learning ensemble model, which is compared to a linear model using a conventional stepwise logistic regression method. The boosted regression tree models offer the advantage of parametric interaction when big datasets are analyzed in comparison to the logistic regression. The point data set (n=3,538) of groundwater hydrochemistry with 19 parameters was obtained by the British Geological Survey in 2001. The spatial data sets of geological parameters (n=13) were from the Consortium for Spatial Information, Technical University of Denmark, University of East Anglia and the FAO, while the soil parameters (n=42) were from the Harmonized World Soil Database. The aforementioned parameters were regressed to categorical groundwater arsenic concentrations below or above three thresholds: 5 μg/L, 10 μg/L and 50 μg/L to identify respective controlling factors. Boosted regression tree method outperformed logistic regression methods in all three threshold levels in terms of accuracy, specificity and sensitivity, resulting in an improvement of spatial distribution map of probability of groundwater arsenic exceeding all three thresholds when compared to disjunctive-kriging interpolated spatial arsenic map using the same groundwater arsenic dataset. Boosted regression tree models also show that the most important controlling factors of groundwater arsenic distribution include groundwater iron content and well depth for all three thresholds. The probability of a well with iron content higher than 5mg/L to contain greater than 5 μg/L, 10 μg/L and 50 μg/L As is estimated to be more than 91%, 85% and 51%, respectively, while the probability of a well from depth more than 160m to contain more than 5 μg/L, 10 μg/L and 50 μg/L As is estimated to be less than 38%, 25% and 14%, respectively.

  10. [Is Mapuche ethnicity a risk factor for hip fracture in aged?].

    PubMed

    Sapunar, Jorge; Bravo, Paulina; Schneider, Hermann; Jiménez, Marcela

    2003-10-01

    Ethnic factors are involved in the risk for osteoporosis and hip fracture. To assess the effect of Mapuche ethnicity on the risk of hip fracture. A case control study. Cases were subjects over 55 years of age admitted, during one year, for hip fracture not associated to major trauma or tumors. Controls were randomly chosen from other hospital services and paired for age with cases. The magnitude of the association between ethnicity and hip fracture was expressed as odds ratio in a logistic regression model. In the study period, 156 cases with hip fracture were admitted. The proportion of subjects with Mapuche origin was significantly lower among cases than controls (11.8 and 26.5% respectively, p < 0.001). In the logistic regression model, Mapuche ethnicity was associated with hip fracture with an odds radio of 0.14 (p = 0.03, 95% CI 0.03-0.8). In this sample, Mapuche ethnicity is a protective factor for hip fracture.

  11. Logistic regression for risk factor modelling in stuttering research.

    PubMed

    Reed, Phil; Wu, Yaqionq

    2013-06-01

    To outline the uses of logistic regression and other statistical methods for risk factor analysis in the context of research on stuttering. The principles underlying the application of a logistic regression are illustrated, and the types of questions to which such a technique has been applied in the stuttering field are outlined. The assumptions and limitations of the technique are discussed with respect to existing stuttering research, and with respect to formulating appropriate research strategies to accommodate these considerations. Finally, some alternatives to the approach are briefly discussed. The way the statistical procedures are employed are demonstrated with some hypothetical data. Research into several practical issues concerning stuttering could benefit if risk factor modelling were used. Important examples are early diagnosis, prognosis (whether a child will recover or persist) and assessment of treatment outcome. After reading this article you will: (a) Summarize the situations in which logistic regression can be applied to a range of issues about stuttering; (b) Follow the steps in performing a logistic regression analysis; (c) Describe the assumptions of the logistic regression technique and the precautions that need to be checked when it is employed; (d) Be able to summarize its advantages over other techniques like estimation of group differences and simple regression. Copyright © 2012 Elsevier Inc. All rights reserved.

  12. Complementary nonparametric analysis of covariance for logistic regression in a randomized clinical trial setting.

    PubMed

    Tangen, C M; Koch, G G

    1999-03-01

    In the randomized clinical trial setting, controlling for covariates is expected to produce variance reduction for the treatment parameter estimate and to adjust for random imbalances of covariates between the treatment groups. However, for the logistic regression model, variance reduction is not obviously obtained. This can lead to concerns about the assumptions of the logistic model. We introduce a complementary nonparametric method for covariate adjustment. It provides results that are usually compatible with expectations for analysis of covariance. The only assumptions required are based on randomization and sampling arguments. The resulting treatment parameter is a (unconditional) population average log-odds ratio that has been adjusted for random imbalance of covariates. Data from a randomized clinical trial are used to compare results from the traditional maximum likelihood logistic method with those from the nonparametric logistic method. We examine treatment parameter estimates, corresponding standard errors, and significance levels in models with and without covariate adjustment. In addition, we discuss differences between unconditional population average treatment parameters and conditional subpopulation average treatment parameters. Additional features of the nonparametric method, including stratified (multicenter) and multivariate (multivisit) analyses, are illustrated. Extensions of this methodology to the proportional odds model are also made.

  13. Desistance from intimate partner violence: the role of legal cynicism, collective efficacy, and social disorganization in Chicago neighborhoods.

    PubMed

    Emery, Clifton R; Jolley, Jennifer M; Wu, Shali

    2011-12-01

    This paper examined the relationship between reported Intimate Partner Violence (IPV) desistance and neighborhood concentrated disadvantage, ethnic heterogeneity, residential instability, collective efficacy and legal cynicism. Data from the Project on Human Development in Chicago Neighborhoods (PHDCN) Longitudinal survey were used to identify 599 cases of IPV in Wave 1 eligible for reported desistance in Wave 2. A Generalized Boosting Model was used to determine the best proximal predictors of IPV desistance from the longitudinal data. Controlling for these predictors, logistic regression of neighborhood characteristics from the PHDCN community survey was used to predict reported IPV desistance in Wave 2. The paper finds that participants living in neighborhoods high in legal cynicism have lower odds of reporting IPV desistance, controlling for other variables in the logistic regression model. Analyses did not find that IPV desistance was related to neighborhood concentrated disadvantage, ethnic heterogeneity, residential instability and collective efficacy.

  14. Dynamic Dimensionality Selection for Bayesian Classifier Ensembles

    DTIC Science & Technology

    2015-03-19

    learning of weights in an otherwise generatively learned naive Bayes classifier. WANBIA-C is very cometitive to Logistic Regression but much more...classifier, Generative learning, Discriminative learning, Naïve Bayes, Feature selection, Logistic regression , higher order attribute independence 16...discriminative learning of weights in an otherwise generatively learned naive Bayes classifier. WANBIA-C is very cometitive to Logistic Regression but

  15. A review of logistic regression models used to predict post-fire tree mortality of western North American conifers

    Treesearch

    Travis Woolley; David C. Shaw; Lisa M. Ganio; Stephen Fitzgerald

    2012-01-01

    Logistic regression models used to predict tree mortality are critical to post-fire management, planning prescribed bums and understanding disturbance ecology. We review literature concerning post-fire mortality prediction using logistic regression models for coniferous tree species in the western USA. We include synthesis and review of: methods to develop, evaluate...

  16. Preserving Institutional Privacy in Distributed binary Logistic Regression.

    PubMed

    Wu, Yuan; Jiang, Xiaoqian; Ohno-Machado, Lucila

    2012-01-01

    Privacy is becoming a major concern when sharing biomedical data across institutions. Although methods for protecting privacy of individual patients have been proposed, it is not clear how to protect the institutional privacy, which is many times a critical concern of data custodians. Built upon our previous work, Grid Binary LOgistic REgression (GLORE)1, we developed an Institutional Privacy-preserving Distributed binary Logistic Regression model (IPDLR) that considers both individual and institutional privacy for building a logistic regression model in a distributed manner. We tested our method using both simulated and clinical data, showing how it is possible to protect the privacy of individuals and of institutions using a distributed strategy.

  17. Covariate Imbalance and Adjustment for Logistic Regression Analysis of Clinical Trial Data

    PubMed Central

    Ciolino, Jody D.; Martin, Reneé H.; Zhao, Wenle; Jauch, Edward C.; Hill, Michael D.; Palesch, Yuko Y.

    2014-01-01

    In logistic regression analysis for binary clinical trial data, adjusted treatment effect estimates are often not equivalent to unadjusted estimates in the presence of influential covariates. This paper uses simulation to quantify the benefit of covariate adjustment in logistic regression. However, International Conference on Harmonization guidelines suggest that covariate adjustment be pre-specified. Unplanned adjusted analyses should be considered secondary. Results suggest that that if adjustment is not possible or unplanned in a logistic setting, balance in continuous covariates can alleviate some (but never all) of the shortcomings of unadjusted analyses. The case of log binomial regression is also explored. PMID:24138438

  18. Differentially private distributed logistic regression using private and public data.

    PubMed

    Ji, Zhanglong; Jiang, Xiaoqian; Wang, Shuang; Xiong, Li; Ohno-Machado, Lucila

    2014-01-01

    Privacy protecting is an important issue in medical informatics and differential privacy is a state-of-the-art framework for data privacy research. Differential privacy offers provable privacy against attackers who have auxiliary information, and can be applied to data mining models (for example, logistic regression). However, differentially private methods sometimes introduce too much noise and make outputs less useful. Given available public data in medical research (e.g. from patients who sign open-consent agreements), we can design algorithms that use both public and private data sets to decrease the amount of noise that is introduced. In this paper, we modify the update step in Newton-Raphson method to propose a differentially private distributed logistic regression model based on both public and private data. We try our algorithm on three different data sets, and show its advantage over: (1) a logistic regression model based solely on public data, and (2) a differentially private distributed logistic regression model based on private data under various scenarios. Logistic regression models built with our new algorithm based on both private and public datasets demonstrate better utility than models that trained on private or public datasets alone without sacrificing the rigorous privacy guarantee.

  19. Logistic regression analysis of conventional ultrasonography, strain elastosonography, and contrast-enhanced ultrasound characteristics for the differentiation of benign and malignant thyroid nodules

    PubMed Central

    Deng, Yingyuan; Wang, Tianfu; Chen, Siping; Liu, Weixiang

    2017-01-01

    The aim of the study is to screen the significant sonographic features by logistic regression analysis and fit a model to diagnose thyroid nodules. A total of 525 pathological thyroid nodules were retrospectively analyzed. All the nodules underwent conventional ultrasonography (US), strain elastosonography (SE), and contrast -enhanced ultrasound (CEUS). Those nodules’ 12 suspicious sonographic features were used to assess thyroid nodules. The significant features of diagnosing thyroid nodules were picked out by logistic regression analysis. All variables that were statistically related to diagnosis of thyroid nodules, at a level of p < 0.05 were embodied in a logistic regression analysis model. The significant features in the logistic regression model of diagnosing thyroid nodules were calcification, suspected cervical lymph node metastasis, hypoenhancement pattern, margin, shape, vascularity, posterior acoustic, echogenicity, and elastography score. According to the results of logistic regression analysis, the formula that could predict whether or not thyroid nodules are malignant was established. The area under the receiver operating curve (ROC) was 0.930 and the sensitivity, specificity, accuracy, positive predictive value, and negative predictive value were 83.77%, 89.56%, 87.05%, 86.04%, and 87.79% respectively. PMID:29228030

  20. Logistic regression analysis of conventional ultrasonography, strain elastosonography, and contrast-enhanced ultrasound characteristics for the differentiation of benign and malignant thyroid nodules.

    PubMed

    Pang, Tiantian; Huang, Leidan; Deng, Yingyuan; Wang, Tianfu; Chen, Siping; Gong, Xuehao; Liu, Weixiang

    2017-01-01

    The aim of the study is to screen the significant sonographic features by logistic regression analysis and fit a model to diagnose thyroid nodules. A total of 525 pathological thyroid nodules were retrospectively analyzed. All the nodules underwent conventional ultrasonography (US), strain elastosonography (SE), and contrast -enhanced ultrasound (CEUS). Those nodules' 12 suspicious sonographic features were used to assess thyroid nodules. The significant features of diagnosing thyroid nodules were picked out by logistic regression analysis. All variables that were statistically related to diagnosis of thyroid nodules, at a level of p < 0.05 were embodied in a logistic regression analysis model. The significant features in the logistic regression model of diagnosing thyroid nodules were calcification, suspected cervical lymph node metastasis, hypoenhancement pattern, margin, shape, vascularity, posterior acoustic, echogenicity, and elastography score. According to the results of logistic regression analysis, the formula that could predict whether or not thyroid nodules are malignant was established. The area under the receiver operating curve (ROC) was 0.930 and the sensitivity, specificity, accuracy, positive predictive value, and negative predictive value were 83.77%, 89.56%, 87.05%, 86.04%, and 87.79% respectively.

  1. Prevalence and Determinants of Preterm Birth in Tehran, Iran: A Comparison between Logistic Regression and Decision Tree Methods.

    PubMed

    Amini, Payam; Maroufizadeh, Saman; Samani, Reza Omani; Hamidi, Omid; Sepidarkish, Mahdi

    2017-06-01

    Preterm birth (PTB) is a leading cause of neonatal death and the second biggest cause of death in children under five years of age. The objective of this study was to determine the prevalence of PTB and its associated factors using logistic regression and decision tree classification methods. This cross-sectional study was conducted on 4,415 pregnant women in Tehran, Iran, from July 6-21, 2015. Data were collected by a researcher-developed questionnaire through interviews with mothers and review of their medical records. To evaluate the accuracy of the logistic regression and decision tree methods, several indices such as sensitivity, specificity, and the area under the curve were used. The PTB rate was 5.5% in this study. The logistic regression outperformed the decision tree for the classification of PTB based on risk factors. Logistic regression showed that multiple pregnancies, mothers with preeclampsia, and those who conceived with assisted reproductive technology had an increased risk for PTB ( p < 0.05). Identifying and training mothers at risk as well as improving prenatal care may reduce the PTB rate. We also recommend that statisticians utilize the logistic regression model for the classification of risk groups for PTB.

  2. An application in identifying high-risk populations in alternative tobacco product use utilizing logistic regression and CART: a heuristic comparison.

    PubMed

    Lei, Yang; Nollen, Nikki; Ahluwahlia, Jasjit S; Yu, Qing; Mayo, Matthew S

    2015-04-09

    Other forms of tobacco use are increasing in prevalence, yet most tobacco control efforts are aimed at cigarettes. In light of this, it is important to identify individuals who are using both cigarettes and alternative tobacco products (ATPs). Most previous studies have used regression models. We conducted a traditional logistic regression model and a classification and regression tree (CART) model to illustrate and discuss the added advantages of using CART in the setting of identifying high-risk subgroups of ATP users among cigarettes smokers. The data were collected from an online cross-sectional survey administered by Survey Sampling International between July 5, 2012 and August 15, 2012. Eligible participants self-identified as current smokers, African American, White, or Latino (of any race), were English-speaking, and were at least 25 years old. The study sample included 2,376 participants and was divided into independent training and validation samples for a hold out validation. Logistic regression and CART models were used to examine the important predictors of cigarettes + ATP users. The logistic regression model identified nine important factors: gender, age, race, nicotine dependence, buying cigarettes or borrowing, whether the price of cigarettes influences the brand purchased, whether the participants set limits on cigarettes per day, alcohol use scores, and discrimination frequencies. The C-index of the logistic regression model was 0.74, indicating good discriminatory capability. The model performed well in the validation cohort also with good discrimination (c-index = 0.73) and excellent calibration (R-square = 0.96 in the calibration regression). The parsimonious CART model identified gender, age, alcohol use score, race, and discrimination frequencies to be the most important factors. It also revealed interesting partial interactions. The c-index is 0.70 for the training sample and 0.69 for the validation sample. The misclassification rate was 0.342 for the training sample and 0.346 for the validation sample. The CART model was easier to interpret and discovered target populations that possess clinical significance. This study suggests that the non-parametric CART model is parsimonious, potentially easier to interpret, and provides additional information in identifying the subgroups at high risk of ATP use among cigarette smokers.

  3. Glucose-6-phosphate dehydrogenase deficiency and diabetes mellitus with severe retinal complications in a Sardinian population, Italy.

    PubMed

    Pinna, Antonio; Contini, Emma Luigia; Carru, Ciriaco; Solinas, Giuliana

    2013-01-01

    Glucose-6-Phosphate Dehydrogenase (G6PD) deficiency is one of the most common human genetic abnormalities, with a high prevalence in Sardinia, Italy. Evidence indicates that G6PD-deficient patients are protected against vascular disease. Little is known about the relationship between G6PD deficiency and diabetes mellitus. The purpose of this study was to compare G6PD deficiency prevalence in Sardinian diabetic men with severe retinal vascular complications and in age-matched non-diabetic controls and ascertain whether G6PD deficiency may offer protection against this vascular disorder. Erythrocyte G6PD activity was determined using a quantitative assay in 390 diabetic men with proliferative diabetic retinopathy (PDR) and 390 male non-diabetic controls, both aged ≥50 years. Conditional logistic regression models were used to investigate the association between G6PD deficiency and diabetes with severe retinal complications. G6PD deficiency was found in 21 (5.4 %) diabetic patients and 33 (8.5 %) controls (P=0.09). In a univariate conditional logistic regression model, G6PD deficiency showed a trend for protection against diabetes with PDR, but the odds ratio (OR) fell short of statistical significance (OR=0.6, 95% confidence interval=0.35-1.08, P=0.09). In multivariate conditional logistic regression models, including as covariates G6PD deficiency, plasma glucose, and systemic hypertension or systolic or diastolic blood pressure, G6PD deficiency showed no statistically significant protection against diabetes with PDR. The prevalence of G6PD deficiency in diabetic men with PDR was lower than in age-matched non-diabetic controls. G6PD deficiency showed a trend for protection against diabetes with PDR, but results were not statistically significant.

  4. Work stress, asthma control and asthma-specific quality of life: Initial evidence from a cross-sectional study.

    PubMed

    Hartmann, Bettina; Leucht, Verena; Loerbroks, Adrian

    2017-03-01

    Research has suggested that psychological stress is positively associated with asthma morbidity. One major source of stress in adulthood is one's occupation. However, to date, potential links of work stress with asthma control or asthma-specific quality of life have not been examined. We aimed to address this knowledge gap. In 2014/2015, we conducted a cross-sectional study among adults with asthma in Germany (n = 362). For the current analyses that sample was restricted to participants in employment and reporting to have never been diagnosed with chronic obstructive pulmonary disease (n = 94). Work stress was operationalized by the 16-item effort-reward-imbalance (ERI) questionnaire, which measures the subcomponents "effort", "reward" and "overcommitment." Participants further completed the Asthma Control Test and the Asthma Quality of Life Questionnaire-Sydney. Multivariable associations were quantified by linear regression and logistic regression. Effort, reward and their ratio (i.e. ERI ratio) did not show meaningful associations with asthma morbidity. By contrast, increasing levels of overcommitment were associated with poorer asthma control and worse quality of life in both linear regression (ß = -0.26, p = 0.01 and ß = 0.44, p < 0.01, respectively) and logistic regression (odds ratio [OR] = 1.87, 95% confidence interval [CI] = 1.14-3.07 and OR = 2.34, 95% CI = 1.32-4.15, respectively). The present study provides initial evidence of a positive relationship of work-related overcommitment with asthma control and asthma-specific quality of life. Longitudinal studies with larger samples are needed to confirm our findings and to disentangle the potential causality of associations.

  5. Sperm Retrieval in Patients with Klinefelter Syndrome: A Skewed Regression Model Analysis.

    PubMed

    Chehrazi, Mohammad; Rahimiforoushani, Abbas; Sabbaghian, Marjan; Nourijelyani, Keramat; Sadighi Gilani, Mohammad Ali; Hoseini, Mostafa; Vesali, Samira; Yaseri, Mehdi; Alizadeh, Ahad; Mohammad, Kazem; Samani, Reza Omani

    2017-01-01

    The most common chromosomal abnormality due to non-obstructive azoospermia (NOA) is Klinefelter syndrome (KS) which occurs in 1-1.72 out of 500-1000 male infants. The probability of retrieving sperm as the outcome could be asymmetrically different between patients with and without KS, therefore logistic regression analysis is not a well-qualified test for this type of data. This study has been designed to evaluate skewed regression model analysis for data collected from microsurgical testicular sperm extraction (micro-TESE) among azoospermic patients with and without non-mosaic KS syndrome. This cohort study compared the micro-TESE outcome between 134 men with classic KS and 537 men with NOA and normal karyotype who were referred to Royan Institute between 2009 and 2011. In addition to our main outcome, which was sperm retrieval, we also used logistic and skewed regression analyses to compare the following demographic and hormonal factors: age, level of follicle stimulating hormone (FSH), luteinizing hormone (LH), and testosterone between the two groups. A comparison of the micro-TESE between the KS and control groups showed a success rate of 28.4% (38/134) for the KS group and 22.2% (119/537) for the control group. In the KS group, a significantly difference (P<0.001) existed between testosterone levels for the successful sperm retrieval group (3.4 ± 0.48 mg/mL) compared to the unsuccessful sperm retrieval group (2.33 ± 0.23 mg/mL). The index for quasi Akaike information criterion (QAIC) had a goodness of fit of 74 for the skewed model which was lower than logistic regression (QAIC=85). According to the results, skewed regression is more efficient in estimating sperm retrieval success when the data from patients with KS are analyzed. This finding should be investigated by conducting additional studies with different data structures.

  6. Comparison of Logistic Regression and Random Forests techniques for shallow landslide susceptibility assessment in Giampilieri (NE Sicily, Italy)

    NASA Astrophysics Data System (ADS)

    Trigila, Alessandro; Iadanza, Carla; Esposito, Carlo; Scarascia-Mugnozza, Gabriele

    2015-11-01

    The aim of this work is to define reliable susceptibility models for shallow landslides using Logistic Regression and Random Forests multivariate statistical techniques. The study area, located in North-East Sicily, was hit on October 1st 2009 by a severe rainstorm (225 mm of cumulative rainfall in 7 h) which caused flash floods and more than 1000 landslides. Several small villages, such as Giampilieri, were hit with 31 fatalities, 6 missing persons and damage to buildings and transportation infrastructures. Landslides, mainly types such as earth and debris translational slides evolving into debris flows, were triggered on steep slopes and involved colluvium and regolith materials which cover the underlying metamorphic bedrock. The work has been carried out with the following steps: i) realization of a detailed event landslide inventory map through field surveys coupled with observation of high resolution aerial colour orthophoto; ii) identification of landslide source areas; iii) data preparation of landslide controlling factors and descriptive statistics based on a bivariate method (Frequency Ratio) to get an initial overview on existing relationships between causative factors and shallow landslide source areas; iv) choice of criteria for the selection and sizing of the mapping unit; v) implementation of 5 multivariate statistical susceptibility models based on Logistic Regression and Random Forests techniques and focused on landslide source areas; vi) evaluation of the influence of sample size and type of sampling on results and performance of the models; vii) evaluation of the predictive capabilities of the models using ROC curve, AUC and contingency tables; viii) comparison of model results and obtained susceptibility maps; and ix) analysis of temporal variation of landslide susceptibility related to input parameter changes. Models based on Logistic Regression and Random Forests have demonstrated excellent predictive capabilities. Land use and wildfire variables were found to have a strong control on the occurrence of very rapid shallow landslides.

  7. Measurement of faculty anesthesiologists' quality of clinical supervision has greater reliability when controlling for the leniency of the rating anesthesia resident: a retrospective cohort study.

    PubMed

    Dexter, Franklin; Ledolter, Johannes; Hindman, Bradley J

    2017-06-01

    Our department monitors the quality of anesthesiologists' clinical supervision and provides each anesthesiologist with periodic feedback. We hypothesized that greater differentiation among anesthesiologists' supervision scores could be obtained by adjusting for leniency of the rating resident. From July 1, 2013 to December 31, 2015, our department has utilized the de Oliveira Filho unidimensional nine-item supervision scale to assess the quality of clinical supervision provided by faculty as rated by residents. We examined all 13,664 ratings of the 97 anesthesiologists (ratees) by the 65 residents (raters). Testing for internal consistency among answers to questions (large Cronbach's alpha > 0.90) was performed to rule out that one or two questions accounted for leniency. Mixed-effects logistic regression was used to compare ratees while controlling for rater leniency vs using Student t tests without rater leniency. The mean supervision scale score was calculated for each combination of the 65 raters and nine questions. The Cronbach's alpha was very large (0.977). The mean score was calculated for each of the 3,421 observed combinations of resident and anesthesiologist. The logits of the percentage of scores equal to the maximum value of 4.00 were normally distributed (residents, P = 0.24; anesthesiologists, P = 0.50). There were 20/97 anesthesiologists identified as significant outliers (13 with below average supervision scores and seven with better than average) using the mixed-effects logistic regression with rater leniency entered as a fixed effect but not by Student's t test. In contrast, there were three of 97 anesthesiologists identified as outliers (all three above average) using Student's t tests but not by logistic regression with leniency. The 20 vs 3 was significant (P < 0.001). Use of logistic regression with leniency results in greater detection of anesthesiologists with significantly better (or worse) clinical supervision scores than use of Student's t tests (i.e., without adjustment for rater leniency).

  8. Comparison of the Relationship between Women' Empowerment and Fertility between Single-child and Multi-child Families

    PubMed Central

    Saberi, Tahereh; Ehsanpour, Soheila; Mahaki, Behzad; Kohan, Shahnaz

    2018-01-01

    Background: The reduction in fertility and increase in the number of single-child families in Iran will result in an increased risk of population aging. One of the factors affecting fertility is women's empowerment. This study aimed to evaluate the relationship between women's empowerment and fertility in single-child and multi-child families. Materials and Methods: This case-control study was conducted among 350 women (120 who had only 1 child as case group and 230 who had 2 or more children as control group) of 15–49 years of age in Isfahan, Iran, in 2016. For data collection, a 2-part questionnaire was designed. Data were analyzed using independent t-test, Chi-square test, and logistic regression analysis. Results: The difference between average scores of women's empowerment in the case group 54.08 (9.88) and control group 51.47 (8.57) was significant (p = 0.002). Simple logistic regression analysis showed that under diploma education, compared to postgraduate education, (OR = 0.21, p = 0.001) and being a housewife, compared to being employed, (OR = 0.45, p = 0.004) decreased the odds of having only 1 child. Multiple logistic regression results showed that the relationship between women's empowerment and fertility was not significant (p = 0.265). Conclusions: Although women in single-child families were more empowered, this was not the main reason for their preference to have only 1 child. In fact, educated and employed women postpone marriage and childbearing and limit fertility to only 1 child despite their desire. PMID:29628961

  9. The combination of ovarian volume and outline has better diagnostic accuracy than prostate-specific antigen (PSA) concentrations in women with polycystic ovarian syndrome (PCOs).

    PubMed

    Bili, Eleni; Bili, Authors Eleni; Dampala, Kaliopi; Iakovou, Ioannis; Tsolakidis, Dimitrios; Giannakou, Anastasia; Tarlatzis, Basil C

    2014-08-01

    The aim of this study was to determine the performance of prostate specific antigen (PSA) and ultrasound parameters, such as ovarian volume and outline, in the diagnosis of polycystic ovary syndrome (PCOS). This prospective, observational, case-controlled study included 43 women with PCOS, and 40 controls. Between day 3 and 5 of the menstrual cycle, fasting serum samples were collected and transvaginal ultrasound was performed. The diagnostic performance of each parameter [total PSA (tPSA), total-to-free PSA ratio (tPSA:fPSA), ovarian volume, ovarian outline] was estimated by means of receiver operating characteristic (ROC) analysis, along with area under the curve (AUC), threshold, sensitivity, specificity as well as positive (+) and negative (-) likelihood ratios (LRs). Multivariate logistical regression models, using ovarian volume and ovarian outline, were constructed. The tPSA and tPSA:fPSA ratio resulted in AUC of 0.74 and 0.70, respectively, with moderate specificity/sensitivity and insufficient LR+/- values. In the multivariate logistic regression model, the combination of ovarian volume and outline had a sensitivity of 97.7% and a specificity of 97.5% in the diagnosis of PCOS, with +LR and -LR values of 39.1 and 0.02, respectively. In women with PCOS, tPSA and tPSA:fPSA ratio have similar diagnostic performance. The use of a multivariate logistic regression model, incorporating ovarian volume and outline, offers very good diagnostic accuracy in distinguishing women with PCOS patients from controls. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  10. Predicting Teacher Value-Added Results in Non-Tested Subjects Based on Confounding Variables: A Multinomial Logistic Regression

    ERIC Educational Resources Information Center

    Street, Nathan Lee

    2017-01-01

    Teacher value-added measures (VAM) are designed to provide information regarding teachers' causal impact on the academic growth of students while controlling for exogenous variables. While some researchers contend VAMs successfully and authentically measure teacher causality on learning, others suggest VAMs cannot adequately control for exogenous…

  11. Risk factors for displaced abomasum or ketosis in Swedish dairy herds.

    PubMed

    Stengärde, L; Hultgren, J; Tråvén, M; Holtenius, K; Emanuelson, U

    2012-03-01

    Risk factors associated with high or low long-term incidence of displaced abomasum (DA) or clinical ketosis were studied in 60 Swedish dairy herds, using multivariable logistic regression modelling. Forty high-incidence herds were included as cases and 20 low-incidence herds as controls. Incidence rates were calculated based on veterinary records of clinical diagnoses. During the 3-year period preceding the herd classification, herds with a high incidence had a disease incidence of DA or clinical ketosis above the 3rd quartile in a national database for disease recordings. Control herds had no cows with DA or clinical ketosis. All herds were visited during the housing period and herdsmen were interviewed about management routines, housing, feeding, milk yield, and herd health. Target groups were heifers in late gestation, dry cows, and cows in early lactation. Univariable logistic regression was used to screen for factors associated with being a high-incidence herd. A multivariable logistic regression model was built using stepwise regression. A higher maximum daily milk yield in multiparous cows and a large herd size (p=0.054 and p=0.066, respectively) tended to be associated with being a high-incidence herd. Not cleaning the heifer feeding platform daily increased the odds of having a high-incidence herd twelvefold (p<0.01). Keeping cows in only one group in the dry period increased the odds of having a high incidence herd eightfold (p=0.03). Herd size was confounded with housing system. Housing system was therefore added to the final logistic regression model. In conclusion, a large herd size, a high maximum daily milk yield, keeping dry cows in one group, and not cleaning the feeding platform daily appear to be important risk factors for a high incidence of DA or clinical ketosis in Swedish dairy herds. These results confirm the importance of housing, management and feeding in the prevention of metabolic disorders in dairy cows around parturition and in early lactation. Copyright © 2011 Elsevier B.V. All rights reserved.

  12. Misperception among rural diabetic residents: a cross-sectional descriptive study.

    PubMed

    Huang, Tzu-Ting; Guo, Su-Er; Chang, Chia-Hao; Huang, Jui-Chu; Lin, Ming-Shyan; Lee, Chia-Mou; Chen, Mei-Yen

    2013-04-01

    To evaluate the self-perception of diabetes control associated with physical indicators and with practicing exercise and a healthy diet, among rural residents. It remains unclear whether a subject's self-perception of diabetes control increases its deleterious effects. Cross-sectional, correlational. We recruited 715 participants from 18 primary healthcare centres in the rural regions of Chiayi County, Taiwan. Data were collected between 1 January 2009-30 June 2010. Logistic regression was conducted to identify the determinant factors associated with perceptions of diabetes control. A high percentage of participants overestimated their fasting blood glucose and HbA1 C status. Total cholesterol, triglyceride, low density lipoprotein cholesterol, blood pressure, and waist circumference exceeded the medical standard in the 'feel good' group, and many did not adopt a healthy diet and undertake physical activity. The final logistic regression model demonstrated that residents with diabetes who exercised frequently had normal fasting glucose, and normal HbA1 C tended to perceive 'feel good' control. Misperception and unawareness of diabetes control were prevalent among rural residents. Addressing misperceptions among rural residents with diabetes and increasing their knowledge of professional advice could be important steps in improving diabetes control in an elder population. © 2012 Blackwell Publishing Ltd.

  13. Logistic regression for dichotomized counts.

    PubMed

    Preisser, John S; Das, Kalyan; Benecha, Habtamu; Stamm, John W

    2016-12-01

    Sometimes there is interest in a dichotomized outcome indicating whether a count variable is positive or zero. Under this scenario, the application of ordinary logistic regression may result in efficiency loss, which is quantifiable under an assumed model for the counts. In such situations, a shared-parameter hurdle model is investigated for more efficient estimation of regression parameters relating to overall effects of covariates on the dichotomous outcome, while handling count data with many zeroes. One model part provides a logistic regression containing marginal log odds ratio effects of primary interest, while an ancillary model part describes the mean count of a Poisson or negative binomial process in terms of nuisance regression parameters. Asymptotic efficiency of the logistic model parameter estimators of the two-part models is evaluated with respect to ordinary logistic regression. Simulations are used to assess the properties of the models with respect to power and Type I error, the latter investigated under both misspecified and correctly specified models. The methods are applied to data from a randomized clinical trial of three toothpaste formulations to prevent incident dental caries in a large population of Scottish schoolchildren. © The Author(s) 2014.

  14. Predicting 30-day Hospital Readmission with Publicly Available Administrative Database. A Conditional Logistic Regression Modeling Approach.

    PubMed

    Zhu, K; Lou, Z; Zhou, J; Ballester, N; Kong, N; Parikh, P

    2015-01-01

    This article is part of the Focus Theme of Methods of Information in Medicine on "Big Data and Analytics in Healthcare". Hospital readmissions raise healthcare costs and cause significant distress to providers and patients. It is, therefore, of great interest to healthcare organizations to predict what patients are at risk to be readmitted to their hospitals. However, current logistic regression based risk prediction models have limited prediction power when applied to hospital administrative data. Meanwhile, although decision trees and random forests have been applied, they tend to be too complex to understand among the hospital practitioners. Explore the use of conditional logistic regression to increase the prediction accuracy. We analyzed an HCUP statewide inpatient discharge record dataset, which includes patient demographics, clinical and care utilization data from California. We extracted records of heart failure Medicare beneficiaries who had inpatient experience during an 11-month period. We corrected the data imbalance issue with under-sampling. In our study, we first applied standard logistic regression and decision tree to obtain influential variables and derive practically meaning decision rules. We then stratified the original data set accordingly and applied logistic regression on each data stratum. We further explored the effect of interacting variables in the logistic regression modeling. We conducted cross validation to assess the overall prediction performance of conditional logistic regression (CLR) and compared it with standard classification models. The developed CLR models outperformed several standard classification models (e.g., straightforward logistic regression, stepwise logistic regression, random forest, support vector machine). For example, the best CLR model improved the classification accuracy by nearly 20% over the straightforward logistic regression model. Furthermore, the developed CLR models tend to achieve better sensitivity of more than 10% over the standard classification models, which can be translated to correct labeling of additional 400 - 500 readmissions for heart failure patients in the state of California over a year. Lastly, several key predictor identified from the HCUP data include the disposition location from discharge, the number of chronic conditions, and the number of acute procedures. It would be beneficial to apply simple decision rules obtained from the decision tree in an ad-hoc manner to guide the cohort stratification. It could be potentially beneficial to explore the effect of pairwise interactions between influential predictors when building the logistic regression models for different data strata. Judicious use of the ad-hoc CLR models developed offers insights into future development of prediction models for hospital readmissions, which can lead to better intuition in identifying high-risk patients and developing effective post-discharge care strategies. Lastly, this paper is expected to raise the awareness of collecting data on additional markers and developing necessary database infrastructure for larger-scale exploratory studies on readmission risk prediction.

  15. Interpretation of commonly used statistical regression models.

    PubMed

    Kasza, Jessica; Wolfe, Rory

    2014-01-01

    A review of some regression models commonly used in respiratory health applications is provided in this article. Simple linear regression, multiple linear regression, logistic regression and ordinal logistic regression are considered. The focus of this article is on the interpretation of the regression coefficients of each model, which are illustrated through the application of these models to a respiratory health research study. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.

  16. Aging, not menopause, is associated with higher prevalence of hyperuricemia among older women.

    PubMed

    Krishnan, Eswar; Bennett, Mihoko; Chen, Linjun

    2014-11-01

    This work aims to study the associations, if any, of hyperuricemia, gout, and menopause status in the US population. Using multiyear data from the National Health and Nutrition Examination Survey, we performed unmatched comparisons and one to three age-matched comparisons of women aged 20 to 70 years with and without hyperuricemia (serum urate ≥6 mg/dL). Analyses were performed using survey-weighted multiple logistic regression and conditional logistic regression, respectively. Overall, there were 1,477 women with hyperuricemia. Age and serum urate were significantly correlated. In unmatched analyses (n = 9,573 controls), postmenopausal women were older, were heavier, and had higher prevalence of renal impairment, hypertension, diabetes, and hyperlipidemia. In multivariable regression, after accounting for age, body mass index, glomerular filtration rate, and diuretic use, menopause was associated with hyperuricemia (odds ratio, 1.36; 95% CI, 1.05-1.76; P = 0.002). In corresponding multivariable regression using age-matched data (n = 4,431 controls), the odds ratio for menopause was 0.94 (95% CI, 0.83-1.06). Current use of hormone therapy was not associated with prevalent hyperuricemia in both unmatched and matched analyses. Age is a better statistical explanation for the higher prevalence of hyperuricemia among older women than menopause status.

  17. Classification of sodium MRI data of cartilage using machine learning.

    PubMed

    Madelin, Guillaume; Poidevin, Frederick; Makrymallis, Antonios; Regatte, Ravinder R

    2015-11-01

    To assess the possible utility of machine learning for classifying subjects with and subjects without osteoarthritis using sodium magnetic resonance imaging data. Theory: Support vector machine, k-nearest neighbors, naïve Bayes, discriminant analysis, linear regression, logistic regression, neural networks, decision tree, and tree bagging were tested. Sodium magnetic resonance imaging with and without fluid suppression by inversion recovery was acquired on the knee cartilage of 19 controls and 28 osteoarthritis patients. Sodium concentrations were measured in regions of interests in the knee for both acquisitions. Mean (MEAN) and standard deviation (STD) of these concentrations were measured in each regions of interest, and the minimum, maximum, and mean of these two measurements were calculated over all regions of interests for each subject. The resulting 12 variables per subject were used as predictors for classification. Either Min [STD] alone, or in combination with Mean [MEAN] or Min [MEAN], all from fluid suppressed data, were the best predictors with an accuracy >74%, mainly with linear logistic regression and linear support vector machine. Other good classifiers include discriminant analysis, linear regression, and naïve Bayes. Machine learning is a promising technique for classifying osteoarthritis patients and controls from sodium magnetic resonance imaging data. © 2014 Wiley Periodicals, Inc.

  18. Differentially private distributed logistic regression using private and public data

    PubMed Central

    2014-01-01

    Background Privacy protecting is an important issue in medical informatics and differential privacy is a state-of-the-art framework for data privacy research. Differential privacy offers provable privacy against attackers who have auxiliary information, and can be applied to data mining models (for example, logistic regression). However, differentially private methods sometimes introduce too much noise and make outputs less useful. Given available public data in medical research (e.g. from patients who sign open-consent agreements), we can design algorithms that use both public and private data sets to decrease the amount of noise that is introduced. Methodology In this paper, we modify the update step in Newton-Raphson method to propose a differentially private distributed logistic regression model based on both public and private data. Experiments and results We try our algorithm on three different data sets, and show its advantage over: (1) a logistic regression model based solely on public data, and (2) a differentially private distributed logistic regression model based on private data under various scenarios. Conclusion Logistic regression models built with our new algorithm based on both private and public datasets demonstrate better utility than models that trained on private or public datasets alone without sacrificing the rigorous privacy guarantee. PMID:25079786

  19. A retrospective analysis to identify the factors affecting infection in patients undergoing chemotherapy.

    PubMed

    Park, Ji Hyun; Kim, Hyeon-Young; Lee, Hanna; Yun, Eun Kyoung

    2015-12-01

    This study compares the performance of the logistic regression and decision tree analysis methods for assessing the risk factors for infection in cancer patients undergoing chemotherapy. The subjects were 732 cancer patients who were receiving chemotherapy at K university hospital in Seoul, Korea. The data were collected between March 2011 and February 2013 and were processed for descriptive analysis, logistic regression and decision tree analysis using the IBM SPSS Statistics 19 and Modeler 15.1 programs. The most common risk factors for infection in cancer patients receiving chemotherapy were identified as alkylating agents, vinca alkaloid and underlying diabetes mellitus. The logistic regression explained 66.7% of the variation in the data in terms of sensitivity and 88.9% in terms of specificity. The decision tree analysis accounted for 55.0% of the variation in the data in terms of sensitivity and 89.0% in terms of specificity. As for the overall classification accuracy, the logistic regression explained 88.0% and the decision tree analysis explained 87.2%. The logistic regression analysis showed a higher degree of sensitivity and classification accuracy. Therefore, logistic regression analysis is concluded to be the more effective and useful method for establishing an infection prediction model for patients undergoing chemotherapy. Copyright © 2015 Elsevier Ltd. All rights reserved.

  20. Performance and strategy comparisons of human listeners and logistic regression in discriminating underwater targets.

    PubMed

    Yang, Lixue; Chen, Kean

    2015-11-01

    To improve the design of underwater target recognition systems based on auditory perception, this study compared human listeners with automatic classifiers. Performances measures and strategies in three discrimination experiments, including discriminations between man-made and natural targets, between ships and submarines, and among three types of ships, were used. In the experiments, the subjects were asked to assign a score to each sound based on how confident they were about the category to which it belonged, and logistic regression, which represents linear discriminative models, also completed three similar tasks by utilizing many auditory features. The results indicated that the performances of logistic regression improved as the ratio between inter- and intra-class differences became larger, whereas the performances of the human subjects were limited by their unfamiliarity with the targets. Logistic regression performed better than the human subjects in all tasks but the discrimination between man-made and natural targets, and the strategies employed by excellent human subjects were similar to that of logistic regression. Logistic regression and several human subjects demonstrated similar performances when discriminating man-made and natural targets, but in this case, their strategies were not similar. An appropriate fusion of their strategies led to further improvement in recognition accuracy.

  1. Simulating land-use changes by incorporating spatial autocorrelation and self-organization in CLUE-S modeling: a case study in Zengcheng District, Guangzhou, China

    NASA Astrophysics Data System (ADS)

    Mei, Zhixiong; Wu, Hao; Li, Shiyun

    2018-06-01

    The Conversion of Land Use and its Effects at Small regional extent (CLUE-S), which is a widely used model for land-use simulation, utilizes logistic regression to estimate the relationships between land use and its drivers, and thus, predict land-use change probabilities. However, logistic regression disregards possible spatial autocorrelation and self-organization in land-use data. Autologistic regression can depict spatial autocorrelation but cannot address self-organization, while logistic regression by considering only self-organization (NElogistic regression) fails to capture spatial autocorrelation. Therefore, this study developed a regression (NE-autologistic regression) method, which incorporated both spatial autocorrelation and self-organization, to improve CLUE-S. The Zengcheng District of Guangzhou, China was selected as the study area. The land-use data of 2001, 2005, and 2009, as well as 10 typical driving factors, were used to validate the proposed regression method and the improved CLUE-S model. Then, three future land-use scenarios in 2020: the natural growth scenario, ecological protection scenario, and economic development scenario, were simulated using the improved model. Validation results showed that NE-autologistic regression performed better than logistic regression, autologistic regression, and NE-logistic regression in predicting land-use change probabilities. The spatial allocation accuracy and kappa values of NE-autologistic-CLUE-S were higher than those of logistic-CLUE-S, autologistic-CLUE-S, and NE-logistic-CLUE-S for the simulations of two periods, 2001-2009 and 2005-2009, which proved that the improved CLUE-S model achieved the best simulation and was thereby effective to a certain extent. The scenario simulation results indicated that under all three scenarios, traffic land and residential/industrial land would increase, whereas arable land and unused land would decrease during 2009-2020. Apparent differences also existed in the simulated change sizes and locations of each land-use type under different scenarios. The results not only demonstrate the validity of the improved model but also provide a valuable reference for relevant policy-makers.

  2. Unitary Response Regression Models

    ERIC Educational Resources Information Center

    Lipovetsky, S.

    2007-01-01

    The dependent variable in a regular linear regression is a numerical variable, and in a logistic regression it is a binary or categorical variable. In these models the dependent variable has varying values. However, there are problems yielding an identity output of a constant value which can also be modelled in a linear or logistic regression with…

  3. Binary logistic regression-Instrument for assessing museum indoor air impact on exhibits.

    PubMed

    Bucur, Elena; Danet, Andrei Florin; Lehr, Carol Blaziu; Lehr, Elena; Nita-Lazar, Mihai

    2017-04-01

    This paper presents a new way to assess the environmental impact on historical artifacts using binary logistic regression. The prediction of the impact on the exhibits during certain pollution scenarios (environmental impact) was calculated by a mathematical model based on the binary logistic regression; it allows the identification of those environmental parameters from a multitude of possible parameters with a significant impact on exhibitions and ranks them according to their severity effect. Air quality (NO 2 , SO 2 , O 3 and PM 2.5 ) and microclimate parameters (temperature, humidity) monitoring data from a case study conducted within exhibition and storage spaces of the Romanian National Aviation Museum Bucharest have been used for developing and validating the binary logistic regression method and the mathematical model. The logistic regression analysis was used on 794 data combinations (715 to develop of the model and 79 to validate it) by a Statistical Package for Social Sciences (SPSS 20.0). The results from the binary logistic regression analysis demonstrated that from six parameters taken into consideration, four of them present a significant effect upon exhibits in the following order: O 3 >PM 2.5 >NO 2 >humidity followed at a significant distance by the effects of SO 2 and temperature. The mathematical model, developed in this study, correctly predicted 95.1 % of the cumulated effect of the environmental parameters upon the exhibits. Moreover, this model could also be used in the decisional process regarding the preventive preservation measures that should be implemented within the exhibition space. The paper presents a new way to assess the environmental impact on historical artifacts using binary logistic regression. The mathematical model developed on the environmental parameters analyzed by the binary logistic regression method could be useful in a decision-making process establishing the best measures for pollution reduction and preventive preservation of exhibits.

  4. Determining factors influencing survival of breast cancer by fuzzy logistic regression model.

    PubMed

    Nikbakht, Roya; Bahrampour, Abbas

    2017-01-01

    Fuzzy logistic regression model can be used for determining influential factors of disease. This study explores the important factors of actual predictive survival factors of breast cancer's patients. We used breast cancer data which collected by cancer registry of Kerman University of Medical Sciences during the period of 2000-2007. The variables such as morphology, grade, age, and treatments (surgery, radiotherapy, and chemotherapy) were applied in the fuzzy logistic regression model. Performance of model was determined in terms of mean degree of membership (MDM). The study results showed that almost 41% of patients were in neoplasm and malignant group and more than two-third of them were still alive after 5-year follow-up. Based on the fuzzy logistic model, the most important factors influencing survival were chemotherapy, morphology, and radiotherapy, respectively. Furthermore, the MDM criteria show that the fuzzy logistic regression have a good fit on the data (MDM = 0.86). Fuzzy logistic regression model showed that chemotherapy is more important than radiotherapy in survival of patients with breast cancer. In addition, another ability of this model is calculating possibilistic odds of survival in cancer patients. The results of this study can be applied in clinical research. Furthermore, there are few studies which applied the fuzzy logistic models. Furthermore, we recommend using this model in various research areas.

  5. Factors associated with active commuting to work among women.

    PubMed

    Bopp, Melissa; Child, Stephanie; Campbell, Matthew

    2014-01-01

    Active commuting (AC), the act of walking or biking to work, has notable health benefits though rates of AC remain low among women. This study used a social-ecological framework to examine the factors associated with AC among women. A convenience sample of employed, working women (n = 709) completed an online survey about their mode of travel to work. Individual, interpersonal, institutional, community, and environmental influences were assessed. Basic descriptive statistics and frequencies described the sample. Simple logistic regression models examined associations with the independent variables with AC participation and multiple logistic regression analysis determined the relative influence of social ecological factors on AC participation. The sample was primarily middle-aged (44.09±11.38 years) and non-Hispanic White (92%). Univariate analyses revealed several individual, interpersonal, institutional, community and environmental factors significantly associated with AC. The multivariable logistic regression analysis results indicated that significant factors associated with AC included number of children, income, perceived behavioral control, coworker AC, coworker AC normative beliefs, employer and community supports for AC, and traffic. The results of this study contribute to the limited body of knowledge on AC participation for women and may help to inform gender-tailored interventions to enhance AC behavior and improve health.

  6. Beyond Reading Alone: The Relationship Between Aural Literacy And Asthma Management

    PubMed Central

    Rosenfeld, Lindsay; Rudd, Rima; Emmons, Karen M.; Acevedo-García, Dolores; Martin, Laurie; Buka, Stephen

    2010-01-01

    Objectives To examine the relationship between literacy and asthma management with a focus on the oral exchange. Methods Study participants, all of whom reported asthma, were drawn from the New England Family Study (NEFS), an examination of links between education and health. NEFS data included reading, oral (speaking), and aural (listening) literacy measures. An additional survey was conducted with this group of study participants related to asthma issues, particularly asthma management. Data analysis focused on bivariate and multivariable logistic regression. Results In bivariate logistic regression models exploring aural literacy, there was a statistically significant association between those participants with lower aural literacy skills and less successful asthma management (OR:4.37, 95%CI:1.11, 17.32). In multivariable logistic regression analyses, controlling for gender, income, and race in separate models (one-at-a-time), there remained a statistically significant association between those participants with lower aural literacy skills and less successful asthma management. Conclusion Lower aural literacy skills seem to complicate asthma management capabilities. Practice Implications Greater attention to the oral exchange, in particular the listening skills highlighted by aural literacy, as well as other related literacy skills may help us develop strategies for clear communication related to asthma management. PMID:20399060

  7. Psychological trauma symptoms and Type 2 diabetes prevalence, glucose control, and treatment modality among American Indians in the Strong Heart Family Study

    PubMed Central

    Jacob, Michelle M.; Gonzales, Kelly L.; Calhoun, Darren; Beals, Janette; Muller, Clemma Jacobsen; Goldberg, Jack; Nelson, Lonnie; Welty, Thomas K.; Howard, Barbara V.

    2013-01-01

    Aims The aims of this paper are to examine the relationship between psychological trauma symptoms and Type 2 diabetes prevalence, glucose control, and treatment modality among 3,776 American Indians in Phase V of the Strong Heart Family Study. Methods This cross-sectional analysis measured psychological trauma symptoms using the National Anxiety Disorder Screening Day instrument, diabetes by American Diabetes Association criteria, and treatment modality by four categories: no medication, oral medication only, insulin only, or both oral medication and insulin. We used binary logistic regression to evaluate the association between psychological trauma symptoms and diabetes prevalence. We used ordinary least squares regression to evaluate the association between psychological trauma symptoms and glucose control. We used binary logistic regression to model the association of psychological trauma symptoms with treatment modality. Results Neither diabetes prevalence (22-31%; p = 0.19) nor control (8.0-8.6; p = 0.25) varied significantly by psychological trauma symptoms categories. However, diabetes treatment modality was associated with psychological trauma symptoms categories, as people with greater burden used either no medication, or both oral and insulin medications (odds ratio = 3.1, p < 0.001). Conclusions The positive relationship between treatment modality and psychological trauma symptoms suggests future research investigate patient and provider treatment decision making. PMID:24051029

  8. Association between age and high-risk human papilloma virus in Mexican oral cancer patients.

    PubMed

    González-Ramírez, I; Irigoyen-Camacho, M E; Ramírez-Amador, V; Lizano-Soberón, M; Carrillo-García, A; García-Carrancá, A; Sánchez-Pérez, Y; Méndez-Martínez, R; Granados-García, M; Ruíz-Godoy, Lm; García-Cuellar, Cm

    2013-11-01

    Studies reporting low prevalence of HPV in OSCC with declining age at presentation are increasing. The aim of this study was to determine the prevalence of HPV in a group of OSCC cases and controls in a Mexican population. The matched case-control study included 80 OSCC cases and 320 controls. HPV/DNA presence was evaluated through PCR amplification using three sets of consensus primers for the L1 gene. A conditional logistic regression analysis was carried out for the matched OSCC cases and controls. Interactions between risk factors and OCSS were tested in the construction process of the models. HPV prevalence was 5% in OSCC cases and 2.5% in controls. HPV-detected types were 16, 18 and 56. According to conditional logistics regression model, an association was detected between HR-HPV and OSCC. All HR-HPV-positive OSCC cases corresponded to young patients (<45 years), non-smokers and non-alcohol drinkers. The HR-HPV can be a contributing factor to oral carcinogenesis, especially in younger individuals without known risk factors such as tobacco and alcohol. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  9. Lack of Correlation between Metallic Elements Analyzed in Hair by ICP-MS and Autism

    ERIC Educational Resources Information Center

    De Palma, Giuseppe; Catalani, Simona; Franco, Anna; Brighenti, Maurizio; Apostoli, Pietro

    2012-01-01

    A cross-sectional case-control study was carried out to evaluate the concentrations of metallic elements in the hair of 44 children with diagnosis of autism and 61 age-balanced controls. Unadjusted comparisons showed higher concentrations of molybdenum, lithium and selenium in autistic children. Logistic regression analysis confirmed the role of…

  10. Compensatory Weight Control Behaviors of Women in Emerging Adulthood: Associations between Childhood Abuse Experiences and Adult Relationship Avoidance

    ERIC Educational Resources Information Center

    Bankoff, Sarah M.; Valentine, Sarah E.; Jackson, Michelle A.; Schacht, Rebecca L.; Pantalone, David W.

    2013-01-01

    Objective: To examine correlates of compensatory weight control behaviors among women in transition between adolescence and adulthood. Participants: The authors recruited a sample of undergraduate women ("N" = 759) at a large northwestern university during the 2009-2010 academic year. Methods: Logistic regression was used to assess…

  11. Analysis of occlusal variables, dental attrition, and age for distinguishing healthy controls from female patients with intracapsular temporomandibular disorders.

    PubMed

    Seligman, D A; Pullinger, A G

    2000-01-01

    Confusion about the relationship of occlusion to temporomandibular disorders (TMD) persists. This study attempted to identify occlusal and attrition factors plus age that would characterize asymptomatic normal female subjects. A total of 124 female patients with intracapsular TMD were compared with 47 asymptomatic female controls for associations to 9 occlusal factors, 3 attrition severity measures, and age using classification tree, multiple stepwise logistic regression, and univariate analyses. Models were tested for accuracy (sensitivity and specificity) and total contribution to the variance. The classification tree model had 4 terminal nodes that used only anterior attrition and age. "Normals" were mainly characterized by low attrition levels, whereas patients had higher attrition and tended to be younger. The tree model was only moderately useful (sensitivity 63%, specificity 94%) in predicting normals. The logistic regression model incorporated unilateral posterior crossbite and mediotrusive attrition severity in addition to the 2 factors in the tree, but was slightly less accurate than the tree (sensitivity 51%, specificity 90%). When only occlusal factors were considered in the analysis, normals were additionally characterized by a lack of anterior open bite, smaller overjet, and smaller RCP-ICP slides. The log likelihood accounted for was similar for both the tree (pseudo R(2) = 29.38%; mean deviance = 0.95) and the multiple logistic regression (Cox Snell R(2) = 30.3%, mean deviance = 0.84) models. The occlusal and attrition factors studied were only moderately useful in differentiating normals from TMD patients.

  12. Mixed conditional logistic regression for habitat selection studies.

    PubMed

    Duchesne, Thierry; Fortin, Daniel; Courbin, Nicolas

    2010-05-01

    1. Resource selection functions (RSFs) are becoming a dominant tool in habitat selection studies. RSF coefficients can be estimated with unconditional (standard) and conditional logistic regressions. While the advantage of mixed-effects models is recognized for standard logistic regression, mixed conditional logistic regression remains largely overlooked in ecological studies. 2. We demonstrate the significance of mixed conditional logistic regression for habitat selection studies. First, we use spatially explicit models to illustrate how mixed-effects RSFs can be useful in the presence of inter-individual heterogeneity in selection and when the assumption of independence from irrelevant alternatives (IIA) is violated. The IIA hypothesis states that the strength of preference for habitat type A over habitat type B does not depend on the other habitat types also available. Secondly, we demonstrate the significance of mixed-effects models to evaluate habitat selection of free-ranging bison Bison bison. 3. When movement rules were homogeneous among individuals and the IIA assumption was respected, fixed-effects RSFs adequately described habitat selection by simulated animals. In situations violating the inter-individual homogeneity and IIA assumptions, however, RSFs were best estimated with mixed-effects regressions, and fixed-effects models could even provide faulty conclusions. 4. Mixed-effects models indicate that bison did not select farmlands, but exhibited strong inter-individual variations in their response to farmlands. Less than half of the bison preferred farmlands over forests. Conversely, the fixed-effect model simply suggested an overall selection for farmlands. 5. Conditional logistic regression is recognized as a powerful approach to evaluate habitat selection when resource availability changes. This regression is increasingly used in ecological studies, but almost exclusively in the context of fixed-effects models. Fitness maximization can imply differences in trade-offs among individuals, which can yield inter-individual differences in selection and lead to departure from IIA. These situations are best modelled with mixed-effects models. Mixed-effects conditional logistic regression should become a valuable tool for ecological research.

  13. Advanced colorectal neoplasia risk stratification by penalized logistic regression.

    PubMed

    Lin, Yunzhi; Yu, Menggang; Wang, Sijian; Chappell, Richard; Imperiale, Thomas F

    2016-08-01

    Colorectal cancer is the second leading cause of death from cancer in the United States. To facilitate the efficiency of colorectal cancer screening, there is a need to stratify risk for colorectal cancer among the 90% of US residents who are considered "average risk." In this article, we investigate such risk stratification rules for advanced colorectal neoplasia (colorectal cancer and advanced, precancerous polyps). We use a recently completed large cohort study of subjects who underwent a first screening colonoscopy. Logistic regression models have been used in the literature to estimate the risk of advanced colorectal neoplasia based on quantifiable risk factors. However, logistic regression may be prone to overfitting and instability in variable selection. Since most of the risk factors in our study have several categories, it was tempting to collapse these categories into fewer risk groups. We propose a penalized logistic regression method that automatically and simultaneously selects variables, groups categories, and estimates their coefficients by penalizing the [Formula: see text]-norm of both the coefficients and their differences. Hence, it encourages sparsity in the categories, i.e. grouping of the categories, and sparsity in the variables, i.e. variable selection. We apply the penalized logistic regression method to our data. The important variables are selected, with close categories simultaneously grouped, by penalized regression models with and without the interactions terms. The models are validated with 10-fold cross-validation. The receiver operating characteristic curves of the penalized regression models dominate the receiver operating characteristic curve of naive logistic regressions, indicating a superior discriminative performance. © The Author(s) 2013.

  14. Using Logistic Regression To Predict the Probability of Debris Flows Occurring in Areas Recently Burned By Wildland Fires

    USGS Publications Warehouse

    Rupert, Michael G.; Cannon, Susan H.; Gartner, Joseph E.

    2003-01-01

    Logistic regression was used to predict the probability of debris flows occurring in areas recently burned by wildland fires. Multiple logistic regression is conceptually similar to multiple linear regression because statistical relations between one dependent variable and several independent variables are evaluated. In logistic regression, however, the dependent variable is transformed to a binary variable (debris flow did or did not occur), and the actual probability of the debris flow occurring is statistically modeled. Data from 399 basins located within 15 wildland fires that burned during 2000-2002 in Colorado, Idaho, Montana, and New Mexico were evaluated. More than 35 independent variables describing the burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated. The models were developed as follows: (1) Basins that did and did not produce debris flows were delineated from National Elevation Data using a Geographic Information System (GIS). (2) Data describing the burn severity, geology, land surface gradient, rainfall, and soil properties were determined for each basin. These data were then downloaded to a statistics software package for analysis using logistic regression. (3) Relations between the occurrence/non-occurrence of debris flows and burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated and several preliminary multivariate logistic regression models were constructed. All possible combinations of independent variables were evaluated to determine which combination produced the most effective model. The multivariate model that best predicted the occurrence of debris flows was selected. (4) The multivariate logistic regression model was entered into a GIS, and a map showing the probability of debris flows was constructed. The most effective model incorporates the percentage of each basin with slope greater than 30 percent, percentage of land burned at medium and high burn severity in each basin, particle size sorting, average storm intensity (millimeters per hour), soil organic matter content, soil permeability, and soil drainage. The results of this study demonstrate that logistic regression is a valuable tool for predicting the probability of debris flows occurring in recently-burned landscapes.

  15. Addressing data privacy in matched studies via virtual pooling.

    PubMed

    Saha-Chaudhuri, P; Weinberg, C R

    2017-09-07

    Data confidentiality and shared use of research data are two desirable but sometimes conflicting goals in research with multi-center studies and distributed data. While ideal for straightforward analysis, confidentiality restrictions forbid creation of a single dataset that includes covariate information of all participants. Current approaches such as aggregate data sharing, distributed regression, meta-analysis and score-based methods can have important limitations. We propose a novel application of an existing epidemiologic tool, specimen pooling, to enable confidentiality-preserving analysis of data arising from a matched case-control, multi-center design. Instead of pooling specimens prior to assay, we apply the methodology to virtually pool (aggregate) covariates within nodes. Such virtual pooling retains most of the information used in an analysis with individual data and since individual participant data is not shared externally, within-node virtual pooling preserves data confidentiality. We show that aggregated covariate levels can be used in a conditional logistic regression model to estimate individual-level odds ratios of interest. The parameter estimates from the standard conditional logistic regression are compared to the estimates based on a conditional logistic regression model with aggregated data. The parameter estimates are shown to be similar to those without pooling and to have comparable standard errors and confidence interval coverage. Virtual data pooling can be used to maintain confidentiality of data from multi-center study and can be particularly useful in research with large-scale distributed data.

  16. Identification of patients with gout: elaboration of a questionnaire for epidemiological studies.

    PubMed

    Richette, P; Clerson, P; Bouée, S; Chalès, G; Doherty, M; Flipo, R M; Lambert, C; Lioté, F; Poiraud, T; Schaeverbeke, T; Bardin, T

    2015-09-01

    In France, the prevalence of gout is currently unknown. We aimed to design a questionnaire to detect gout that would be suitable for use in a telephone survey by non-physicians and assessed its performance. We designed a 62-item questionnaire covering comorbidities, clinical features and treatment of gout. In a case-control study, we enrolled patients with a history of arthritis who had undergone arthrocentesis for synovial fluid analysis and crystal detection. Cases were patients with crystal-proven gout and controls were patients who had arthritis and effusion with no monosodium urate crystals in synovial fluid. The questionnaire was administered by phone to cases and controls by non-physicians who were unaware of the patient diagnosis. Logistic regression analysis and classification and regression trees were used to select items discriminating cases and controls. We interviewed 246 patients (102 cases and 142 controls). Two logistic regression models (sensitivity 88.0% and 87.5%; specificity 93.0% and 89.8%, respectively) and one classification and regression tree model (sensitivity 81.4%, specificity 93.7%) revealed 11 informative items that allowed for classifying 90.0%, 88.8% and 88.5% of patients, respectively. We developed a questionnaire to detect gout containing 11 items that is fast and suitable for use in a telephone survey by non-physicians. The questionnaire demonstrated good properties for discriminating patients with and without gout. It will be administered in a large sample of the general population to estimate the prevalence of gout in France. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  17. Prediction of unwanted pregnancies using logistic regression, probit regression and discriminant analysis

    PubMed Central

    Ebrahimzadeh, Farzad; Hajizadeh, Ebrahim; Vahabi, Nasim; Almasian, Mohammad; Bakhteyar, Katayoon

    2015-01-01

    Background: Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. Methods: In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were selected by the stratified and cluster sampling; relevant variables were measured and for prediction of unwanted pregnancy, logistic regression, discriminant analysis, and probit regression models and SPSS software version 21 were used. To compare these models, indicators such as sensitivity, specificity, the area under the ROC curve, and the percentage of correct predictions were used. Results: The prevalence of unwanted pregnancies was 25.3%. The logistic and probit regression models indicated that parity and pregnancy spacing, contraceptive methods, household income and number of living male children were related to unwanted pregnancy. The performance of the models based on the area under the ROC curve was 0.735, 0.733, and 0.680 for logistic regression, probit regression, and linear discriminant analysis, respectively. Conclusion: Given the relatively high prevalence of unwanted pregnancies in Khorramabad, it seems necessary to revise family planning programs. Despite the similar accuracy of the models, if the researcher is interested in the interpretability of the results, the use of the logistic regression model is recommended. PMID:26793655

  18. Prediction of unwanted pregnancies using logistic regression, probit regression and discriminant analysis.

    PubMed

    Ebrahimzadeh, Farzad; Hajizadeh, Ebrahim; Vahabi, Nasim; Almasian, Mohammad; Bakhteyar, Katayoon

    2015-01-01

    Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were selected by the stratified and cluster sampling; relevant variables were measured and for prediction of unwanted pregnancy, logistic regression, discriminant analysis, and probit regression models and SPSS software version 21 were used. To compare these models, indicators such as sensitivity, specificity, the area under the ROC curve, and the percentage of correct predictions were used. The prevalence of unwanted pregnancies was 25.3%. The logistic and probit regression models indicated that parity and pregnancy spacing, contraceptive methods, household income and number of living male children were related to unwanted pregnancy. The performance of the models based on the area under the ROC curve was 0.735, 0.733, and 0.680 for logistic regression, probit regression, and linear discriminant analysis, respectively. Given the relatively high prevalence of unwanted pregnancies in Khorramabad, it seems necessary to revise family planning programs. Despite the similar accuracy of the models, if the researcher is interested in the interpretability of the results, the use of the logistic regression model is recommended.

  19. Predictors of course in obsessive-compulsive disorder: logistic regression versus Cox regression for recurrent events.

    PubMed

    Kempe, P T; van Oppen, P; de Haan, E; Twisk, J W R; Sluis, A; Smit, J H; van Dyck, R; van Balkom, A J L M

    2007-09-01

    Two methods for predicting remissions in obsessive-compulsive disorder (OCD) treatment are evaluated. Y-BOCS measurements of 88 patients with a primary OCD (DSM-III-R) diagnosis were performed over a 16-week treatment period, and during three follow-ups. Remission at any measurement was defined as a Y-BOCS score lower than thirteen combined with a reduction of seven points when compared with baseline. Logistic regression models were compared with a Cox regression for recurrent events model. Logistic regression yielded different models at different evaluation times. The recurrent events model remained stable when fewer measurements were used. Higher baseline levels of neuroticism and more severe OCD symptoms were associated with a lower chance of remission, early age of onset and more depressive symptoms with a higher chance. Choice of outcome time affects logistic regression prediction models. Recurrent events analysis uses all information on remissions and relapses. Short- and long-term predictors for OCD remission show overlap.

  20. Estimating the exceedance probability of rain rate by logistic regression

    NASA Technical Reports Server (NTRS)

    Chiu, Long S.; Kedem, Benjamin

    1990-01-01

    Recent studies have shown that the fraction of an area with rain intensity above a fixed threshold is highly correlated with the area-averaged rain rate. To estimate the fractional rainy area, a logistic regression model, which estimates the conditional probability that rain rate over an area exceeds a fixed threshold given the values of related covariates, is developed. The problem of dependency in the data in the estimation procedure is bypassed by the method of partial likelihood. Analyses of simulated scanning multichannel microwave radiometer and observed electrically scanning microwave radiometer data during the Global Atlantic Tropical Experiment period show that the use of logistic regression in pixel classification is superior to multiple regression in predicting whether rain rate at each pixel exceeds a given threshold, even in the presence of noisy data. The potential of the logistic regression technique in satellite rain rate estimation is discussed.

  1. Comparison of naïve Bayes and logistic regression for computer-aided diagnosis of breast masses using ultrasound imaging

    NASA Astrophysics Data System (ADS)

    Cary, Theodore W.; Cwanger, Alyssa; Venkatesh, Santosh S.; Conant, Emily F.; Sehgal, Chandra M.

    2012-03-01

    This study compares the performance of two proven but very different machine learners, Naïve Bayes and logistic regression, for differentiating malignant and benign breast masses using ultrasound imaging. Ultrasound images of 266 masses were analyzed quantitatively for shape, echogenicity, margin characteristics, and texture features. These features along with patient age, race, and mammographic BI-RADS category were used to train Naïve Bayes and logistic regression classifiers to diagnose lesions as malignant or benign. ROC analysis was performed using all of the features and using only a subset that maximized information gain. Performance was determined by the area under the ROC curve, Az, obtained from leave-one-out cross validation. Naïve Bayes showed significant variation (Az 0.733 +/- 0.035 to 0.840 +/- 0.029, P < 0.002) with the choice of features, but the performance of logistic regression was relatively unchanged under feature selection (Az 0.839 +/- 0.029 to 0.859 +/- 0.028, P = 0.605). Out of 34 features, a subset of 6 gave the highest information gain: brightness difference, margin sharpness, depth-to-width, mammographic BI-RADs, age, and race. The probabilities of malignancy determined by Naïve Bayes and logistic regression after feature selection showed significant correlation (R2= 0.87, P < 0.0001). The diagnostic performance of Naïve Bayes and logistic regression can be comparable, but logistic regression is more robust. Since probability of malignancy cannot be measured directly, high correlation between the probabilities derived from two basic but dissimilar models increases confidence in the predictive power of machine learning models for characterizing solid breast masses on ultrasound.

  2. [Logistic regression model of noninvasive prediction for portal hypertensive gastropathy in patients with hepatitis B associated cirrhosis].

    PubMed

    Wang, Qingliang; Li, Xiaojie; Hu, Kunpeng; Zhao, Kun; Yang, Peisheng; Liu, Bo

    2015-05-12

    To explore the risk factors of portal hypertensive gastropathy (PHG) in patients with hepatitis B associated cirrhosis and establish a Logistic regression model of noninvasive prediction. The clinical data of 234 hospitalized patients with hepatitis B associated cirrhosis from March 2012 to March 2014 were analyzed retrospectively. The dependent variable was the occurrence of PHG while the independent variables were screened by binary Logistic analysis. Multivariate Logistic regression was used for further analysis of significant noninvasive independent variables. Logistic regression model was established and odds ratio was calculated for each factor. The accuracy, sensitivity and specificity of model were evaluated by the curve of receiver operating characteristic (ROC). According to univariate Logistic regression, the risk factors included hepatic dysfunction, albumin (ALB), bilirubin (TB), prothrombin time (PT), platelet (PLT), white blood cell (WBC), portal vein diameter, spleen index, splenic vein diameter, diameter ratio, PLT to spleen volume ratio, esophageal varices (EV) and gastric varices (GV). Multivariate analysis showed that hepatic dysfunction (X1), TB (X2), PLT (X3) and splenic vein diameter (X4) were the major occurring factors for PHG. The established regression model was Logit P=-2.667+2.186X1-2.167X2+0.725X3+0.976X4. The accuracy of model for PHG was 79.1% with a sensitivity of 77.2% and a specificity of 80.8%. Hepatic dysfunction, TB, PLT and splenic vein diameter are risk factors for PHG and the noninvasive predicted Logistic regression model was Logit P=-2.667+2.186X1-2.167X2+0.725X3+0.976X4.

  3. Impact of wearing fixed orthodontic appliances on quality of life among adolescents: Case-control study.

    PubMed

    Costa, Andréa A; Serra-Negra, Júnia M; Bendo, Cristiane B; Pordeus, Isabela A; Paiva, Saul M

    2016-01-01

    To investigate the impact of wearing a fixed orthodontic appliance on oral health-related quality of life (OHRQoL) among adolescents. A case-control study (1 ∶ 2) was carried out with a population-based randomized sample of 327 adolescents aged 11 to 14 years enrolled at public and private schools in the City of Brumadinho, southeast of Brazil. The case group (n  =  109) was made up of adolescents with a high negative impact on OHRQoL, and the control group (n  =  218) was made up of adolescents with a low negative impact. The outcome variable was the impact on OHRQoL measured by the Brazilian version of the Child Perceptions Questionnaire (CPQ 11-14) - Impact Short Form (ISF:16). The main independent variable was wearing fixed orthodontic appliances. Malocclusion and the type of school were identified as possible confounding variables. Bivariate and multiple conditional logistic regressions were employed in the statistical analysis. A multiple conditional logistic regression model demonstrated that adolescents wearing fixed orthodontic appliances had a 4.88-fold greater chance of presenting high negative impact on OHRQoL (95% CI: 2.93-8.13; P < .001) than those who did not wear fixed orthodontic appliances. A bivariate conditional logistic regression demonstrated that malocclusion was significantly associated with OHRQoL (P  =  .017), whereas no statistically significant association was found between the type of school and OHRQoL (P  =  .108). Adolescents who wore fixed orthodontic appliances had a greater chance of reporting a negative impact on OHRQoL than those who did not wear such appliances.

  4. Relationship between chemical structure and the occupational asthma hazard of low molecular weight organic compounds

    PubMed Central

    Jarvis, J; Seed, M; Elton, R; Sawyer, L; Agius, R

    2005-01-01

    Aims: To investigate quantitatively, relationships between chemical structure and reported occupational asthma hazard for low molecular weight (LMW) organic compounds; to develop and validate a model linking asthma hazard with chemical substructure; and to generate mechanistic hypotheses that might explain the relationships. Methods: A learning dataset used 78 LMW chemical asthmagens reported in the literature before 1995, and 301 control compounds with recognised occupational exposures and hazards other than respiratory sensitisation. The chemical structures of the asthmagens and control compounds were characterised by the presence of chemical substructure fragments. Odds ratios were calculated for these fragments to determine which were associated with a likelihood of being reported as an occupational asthmagen. Logistic regression modelling was used to identify the independent contribution of these substructures. A post-1995 set of 21 asthmagens and 77 controls were selected to externally validate the model. Results: Nitrogen or oxygen containing functional groups such as isocyanate, amine, acid anhydride, and carbonyl were associated with an occupational asthma hazard, particularly when the functional group was present twice or more in the same molecule. A logistic regression model using only statistically significant independent variables for occupational asthma hazard correctly assigned 90% of the model development set. The external validation showed a sensitivity of 86% and specificity of 99%. Conclusions: Although a wide variety of chemical structures are associated with occupational asthma, bifunctional reactivity is strongly associated with occupational asthma hazard across a range of chemical substructures. This suggests that chemical cross-linking is an important molecular mechanism leading to the development of occupational asthma. The logistic regression model is freely available on the internet and may offer a useful but inexpensive adjunct to the prediction of occupational asthma hazard. PMID:15778257

  5. An association between dietary habits and traffic accidents in patients with chronic liver disease: A data-mining analysis

    PubMed Central

    KAWAGUCHI, TAKUMI; SUETSUGU, TAKURO; OGATA, SHYOU; IMANAGA, MINAMI; ISHII, KUMIKO; ESAKI, NAO; SUGIMOTO, MASAKO; OTSUYAMA, JYURI; NAGAMATSU, AYU; TANIGUCHI, EITARO; ITOU, MINORU; ORIISHI, TETSUHARU; IWASAKI, SHOKO; MIURA, HIROKO; TORIMURA, TAKUJI

    2016-01-01

    The incidence of traffic accidents in patients with chronic liver disease (CLD) is high in the USA. However, the characteristics of patients, including dietary habits, differ between Japan and the USA. The present study investigated the incidence of traffic accidents in CLD patients and the clinical profiles associated with traffic accidents in Japan using a data-mining analysis. A cross-sectional study was performed and 256 subjects [148 CLD patients (CLD group) and 106 patients with other digestive diseases (disease control group)] were enrolled; 2 patients were excluded. The incidence of traffic accidents was compared between the two groups. Independent factors for traffic accidents were analyzed using logistic regression and decision-tree analyses. The incidence of traffic accidents did not differ between the CLD and disease control groups (8.8 vs. 11.3%). The results of the logistic regression analysis showed that yoghurt consumption was the only independent risk factor for traffic accidents (odds ratio, 0.37; 95% confidence interval, 0.16–0.85; P=0.0197). Similarly, the results of the decision-tree analysis showed that yoghurt consumption was the initial divergence variable. In patients who consumed yoghurt habitually, the incidence of traffic accidents was 6.6%, while that in patients who did not consume yoghurt was 16.0%. CLD was not identified as an independent factor in the logistic regression and decision-tree analyses. In conclusion, the difference in the incidence of traffic accidents in Japan between the CLD and disease control groups was insignificant. Furthermore, yoghurt consumption was an independent negative risk factor for traffic accidents in patients with digestive diseases, including CLD. PMID:27123257

  6. An association between dietary habits and traffic accidents in patients with chronic liver disease: A data-mining analysis.

    PubMed

    Kawaguchi, Takumi; Suetsugu, Takuro; Ogata, Shyou; Imanaga, Minami; Ishii, Kumiko; Esaki, Nao; Sugimoto, Masako; Otsuyama, Jyuri; Nagamatsu, Ayu; Taniguchi, Eitaro; Itou, Minoru; Oriishi, Tetsuharu; Iwasaki, Shoko; Miura, Hiroko; Torimura, Takuji

    2016-05-01

    The incidence of traffic accidents in patients with chronic liver disease (CLD) is high in the USA. However, the characteristics of patients, including dietary habits, differ between Japan and the USA. The present study investigated the incidence of traffic accidents in CLD patients and the clinical profiles associated with traffic accidents in Japan using a data-mining analysis. A cross-sectional study was performed and 256 subjects [148 CLD patients (CLD group) and 106 patients with other digestive diseases (disease control group)] were enrolled; 2 patients were excluded. The incidence of traffic accidents was compared between the two groups. Independent factors for traffic accidents were analyzed using logistic regression and decision-tree analyses. The incidence of traffic accidents did not differ between the CLD and disease control groups (8.8 vs. 11.3%). The results of the logistic regression analysis showed that yoghurt consumption was the only independent risk factor for traffic accidents (odds ratio, 0.37; 95% confidence interval, 0.16-0.85; P=0.0197). Similarly, the results of the decision-tree analysis showed that yoghurt consumption was the initial divergence variable. In patients who consumed yoghurt habitually, the incidence of traffic accidents was 6.6%, while that in patients who did not consume yoghurt was 16.0%. CLD was not identified as an independent factor in the logistic regression and decision-tree analyses. In conclusion, the difference in the incidence of traffic accidents in Japan between the CLD and disease control groups was insignificant. Furthermore, yoghurt consumption was an independent negative risk factor for traffic accidents in patients with digestive diseases, including CLD.

  7. Variable Selection in Logistic Regression.

    DTIC Science & Technology

    1987-06-01

    23 %. AUTIOR(.) S. CONTRACT OR GRANT NUMBE Rf.i %Z. D. Bai, P. R. Krishnaiah and . C. Zhao F49620-85- C-0008 " PERFORMING ORGANIZATION NAME AND AOORESS...d I7 IOK-TK- d 7 -I0 7’ VARIABLE SELECTION IN LOGISTIC REGRESSION Z. D. Bai, P. R. Krishnaiah and L. C. Zhao Center for Multivariate Analysis...University of Pittsburgh Center for Multivariate Analysis University of Pittsburgh Y !I VARIABLE SELECTION IN LOGISTIC REGRESSION Z- 0. Bai, P. R. Krishnaiah

  8. Multinomial Logistic Regression Predicted Probability Map To Visualize The Influence Of Socio-Economic Factors On Breast Cancer Occurrence in Southern Karnataka

    NASA Astrophysics Data System (ADS)

    Madhu, B.; Ashok, N. C.; Balasubramanian, S.

    2014-11-01

    Multinomial logistic regression analysis was used to develop statistical model that can predict the probability of breast cancer in Southern Karnataka using the breast cancer occurrence data during 2007-2011. Independent socio-economic variables describing the breast cancer occurrence like age, education, occupation, parity, type of family, health insurance coverage, residential locality and socioeconomic status of each case was obtained. The models were developed as follows: i) Spatial visualization of the Urban- rural distribution of breast cancer cases that were obtained from the Bharat Hospital and Institute of Oncology. ii) Socio-economic risk factors describing the breast cancer occurrences were complied for each case. These data were then analysed using multinomial logistic regression analysis in a SPSS statistical software and relations between the occurrence of breast cancer across the socio-economic status and the influence of other socio-economic variables were evaluated and multinomial logistic regression models were constructed. iii) the model that best predicted the occurrence of breast cancer were identified. This multivariate logistic regression model has been entered into a geographic information system and maps showing the predicted probability of breast cancer occurrence in Southern Karnataka was created. This study demonstrates that Multinomial logistic regression is a valuable tool for developing models that predict the probability of breast cancer Occurrence in Southern Karnataka.

  9. Comparison of Logistic Regression and Artificial Neural Network in Low Back Pain Prediction: Second National Health Survey

    PubMed Central

    Parsaeian, M; Mohammad, K; Mahmoudi, M; Zeraati, H

    2012-01-01

    Background: The purpose of this investigation was to compare empirically predictive ability of an artificial neural network with a logistic regression in prediction of low back pain. Methods: Data from the second national health survey were considered in this investigation. This data includes the information of low back pain and its associated risk factors among Iranian people aged 15 years and older. Artificial neural network and logistic regression models were developed using a set of 17294 data and they were validated in a test set of 17295 data. Hosmer and Lemeshow recommendation for model selection was used in fitting the logistic regression. A three-layer perceptron with 9 inputs, 3 hidden and 1 output neurons was employed. The efficiency of two models was compared by receiver operating characteristic analysis, root mean square and -2 Loglikelihood criteria. Results: The area under the ROC curve (SE), root mean square and -2Loglikelihood of the logistic regression was 0.752 (0.004), 0.3832 and 14769.2, respectively. The area under the ROC curve (SE), root mean square and -2Loglikelihood of the artificial neural network was 0.754 (0.004), 0.3770 and 14757.6, respectively. Conclusions: Based on these three criteria, artificial neural network would give better performance than logistic regression. Although, the difference is statistically significant, it does not seem to be clinically significant. PMID:23113198

  10. Comparison of logistic regression and artificial neural network in low back pain prediction: second national health survey.

    PubMed

    Parsaeian, M; Mohammad, K; Mahmoudi, M; Zeraati, H

    2012-01-01

    The purpose of this investigation was to compare empirically predictive ability of an artificial neural network with a logistic regression in prediction of low back pain. Data from the second national health survey were considered in this investigation. This data includes the information of low back pain and its associated risk factors among Iranian people aged 15 years and older. Artificial neural network and logistic regression models were developed using a set of 17294 data and they were validated in a test set of 17295 data. Hosmer and Lemeshow recommendation for model selection was used in fitting the logistic regression. A three-layer perceptron with 9 inputs, 3 hidden and 1 output neurons was employed. The efficiency of two models was compared by receiver operating characteristic analysis, root mean square and -2 Loglikelihood criteria. The area under the ROC curve (SE), root mean square and -2Loglikelihood of the logistic regression was 0.752 (0.004), 0.3832 and 14769.2, respectively. The area under the ROC curve (SE), root mean square and -2Loglikelihood of the artificial neural network was 0.754 (0.004), 0.3770 and 14757.6, respectively. Based on these three criteria, artificial neural network would give better performance than logistic regression. Although, the difference is statistically significant, it does not seem to be clinically significant.

  11. Modelling of binary logistic regression for obesity among secondary students in a rural area of Kedah

    NASA Astrophysics Data System (ADS)

    Kamaruddin, Ainur Amira; Ali, Zalila; Noor, Norlida Mohd.; Baharum, Adam; Ahmad, Wan Muhamad Amir W.

    2014-07-01

    Logistic regression analysis examines the influence of various factors on a dichotomous outcome by estimating the probability of the event's occurrence. Logistic regression, also called a logit model, is a statistical procedure used to model dichotomous outcomes. In the logit model the log odds of the dichotomous outcome is modeled as a linear combination of the predictor variables. The log odds ratio in logistic regression provides a description of the probabilistic relationship of the variables and the outcome. In conducting logistic regression, selection procedures are used in selecting important predictor variables, diagnostics are used to check that assumptions are valid which include independence of errors, linearity in the logit for continuous variables, absence of multicollinearity, and lack of strongly influential outliers and a test statistic is calculated to determine the aptness of the model. This study used the binary logistic regression model to investigate overweight and obesity among rural secondary school students on the basis of their demographics profile, medical history, diet and lifestyle. The results indicate that overweight and obesity of students are influenced by obesity in family and the interaction between a student's ethnicity and routine meals intake. The odds of a student being overweight and obese are higher for a student having a family history of obesity and for a non-Malay student who frequently takes routine meals as compared to a Malay student.

  12. Occupational exposures and non-Hodgkin's lymphoma: Canadian case-control study.

    PubMed

    Karunanayake, Chandima P; McDuffie, Helen H; Dosman, James A; Spinelli, John J; Pahwa, Punam

    2008-08-07

    The objective was to study the association between Non-Hodgkin's Lymphoma (NHL) and occupational exposures related to long held occupations among males in six provinces of Canada. A population based case-control study was conducted from 1991 to 1994. Males with newly diagnosed NHL (ICD-10) were stratified by province of residence and age group. A total of 513 incident cases and 1506 population based controls were included in the analysis. Conditional logistic regression was conducted to fit statistical models. Based on conditional logistic regression modeling, the following factors independently increased the risk of NHL: farmer and machinist as long held occupations; constant exposure to diesel exhaust fumes; constant exposure to ionizing radiation (radium); and personal history of another cancer. Men who had worked for 20 years or more as farmer and machinist were the most likely to develop NHL. An increased risk of developing NHL is associated with the following: long held occupations of faer and machinist; exposure to diesel fumes; and exposure to ionizing radiation (radium). The risk of NHL increased with the duration of employment as a farmer or machinist.

  13. Hyperhomocysteinemia is a risk factor for Alzheimer's disease in an Algerian population.

    PubMed

    Nazef, Khaled; Khelil, Malika; Chelouti, Hiba; Kacimi, Ghouti; Bendini, Mohamed; Tazir, Meriem; Belarbi, Soraya; El Hadi Cherifi, Mohamed; Djerdjouri, Bahia

    2014-04-01

    There is growing evidence that increased blood concentration of total homocysteine (tHcy) may be a risk factor for Alzheimer's disease (AD). The present study was conducted to evaluate the association of serum tHcy and other biochemical risk factors with AD. This is a case-control study including 41 individuals diagnosed with AD and 46 nondemented controls. Serum levels of all studied biochemical parameters were performed. Univariate logistic regression showed a significant increase of tHcy (p = 0.008), urea (p = 0.036) and a significant decrease of vitamin B12 (p = 0.012) in AD group vs. controls. Using multivariate logistic regression, tHcy (p = 0.007, OR = 1.376) appeared as an independent risk factor predictor of AD. There was a significant positive correlation between tHcy and creatinine (p <0.0001). A negative correlation was found between tHcy and vitamin B12 (p <0.0001). Our findings support that hyperhomocysteinemia is a risk factor for AD in an Algerian population and is also associated with vitamin B12 deficiency. Copyright © 2014 IMSS. Published by Elsevier Inc. All rights reserved.

  14. Understanding logistic regression analysis.

    PubMed

    Sperandei, Sandro

    2014-01-01

    Logistic regression is used to obtain odds ratio in the presence of more than one explanatory variable. The procedure is quite similar to multiple linear regression, with the exception that the response variable is binomial. The result is the impact of each variable on the odds ratio of the observed event of interest. The main advantage is to avoid confounding effects by analyzing the association of all variables together. In this article, we explain the logistic regression procedure using examples to make it as simple as possible. After definition of the technique, the basic interpretation of the results is highlighted and then some special issues are discussed.

  15. Regression discontinuity was a valid design for dichotomous outcomes in three randomized trials.

    PubMed

    van Leeuwen, Nikki; Lingsma, Hester F; Mooijaart, Simon P; Nieboer, Daan; Trompet, Stella; Steyerberg, Ewout W

    2018-06-01

    Regression discontinuity (RD) is a quasi-experimental design that may provide valid estimates of treatment effects in case of continuous outcomes. We aimed to evaluate validity and precision in the RD design for dichotomous outcomes. We performed validation studies in three large randomized controlled trials (RCTs) (Corticosteroid Randomization After Significant Head injury [CRASH], the Global Utilization of Streptokinase and Tissue Plasminogen Activator for Occluded Coronary Arteries [GUSTO], and PROspective Study of Pravastatin in elderly individuals at risk of vascular disease [PROSPER]). To mimic the RD design, we selected patients above and below a cutoff (e.g., age 75 years) randomized to treatment and control, respectively. Adjusted logistic regression models using restricted cubic splines (RCS) and polynomials and local logistic regression models estimated the odds ratio (OR) for treatment, with 95% confidence intervals (CIs) to indicate precision. In CRASH, treatment increased mortality with OR 1.22 [95% CI 1.06-1.40] in the RCT. The RD estimates were 1.42 (0.94-2.16) and 1.13 (0.90-1.40) with RCS adjustment and local regression, respectively. In GUSTO, treatment reduced mortality (OR 0.83 [0.72-0.95]), with more extreme estimates in the RD analysis (OR 0.57 [0.35; 0.92] and 0.67 [0.51; 0.86]). In PROSPER, similar RCT and RD estimates were found, again with less precision in RD designs. We conclude that the RD design provides similar but substantially less precise treatment effect estimates compared with an RCT, with local regression being the preferred method of analysis. Copyright © 2018 Elsevier Inc. All rights reserved.

  16. Comparing Methodologies for Developing an Early Warning System: Classification and Regression Tree Model versus Logistic Regression. REL 2015-077

    ERIC Educational Resources Information Center

    Koon, Sharon; Petscher, Yaacov

    2015-01-01

    The purpose of this report was to explicate the use of logistic regression and classification and regression tree (CART) analysis in the development of early warning systems. It was motivated by state education leaders' interest in maintaining high classification accuracy while simultaneously improving practitioner understanding of the rules by…

  17. Use of logistic regression for modelling risk factors: with application to non-melanoma skin cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Vitaliano, P.P.

    Logistic regression was used to estimate the relative risk of basal and squamous skin cancer for such factors as cumulative lifetime solar exposure, age, complexion, and tannability. In previous reports, a subject's exposure was estimated indirectly, by latitude, or by the number of sun days in a subject's habitat. In contrast, these results are based on interview data gathered for each subject. A relatively new technique was used to estimate relative risk by controlling for confounding and testing for effect modification. A linear effect for the relative risk of cancer versus exposure was found. Tannability was shown to be amore » more important risk factor than complexion. This result is consistent with the work of Silverstone and Searle.« less

  18. Polymorphisms within the FANCA gene associate with premature ovarian failure in Korean women.

    PubMed

    Pyun, Jung-A; Kim, Sunshin; Cha, Dong Hyun; Kwack, KyuBum

    2014-05-01

    This study investigated whether polymorphisms within the Fanconi anemia complementation group A (FANCA) gene contribute to the increased risk of premature ovarian failure (POF) in Korean women. Ninety-eight women with POF and 218 controls participated in this study. Genomic DNA from peripheral blood was isolated, and GoldenGate genotyping assay was used to identify single nucleotide polymorphisms (SNPs) within the FANCA gene. Two significant SNPs (rs1006547 and rs2239359; P < 0.05) were identified by logistic regression analysis, but results were insignificant after Bonferroni correction. Six SNPs formed a linkage disequilibrium block, and three main haplotypes were found. Two of three haplotypes (AAAGAA and GGGAGG) distributed highly in the POF group, whereas the remaining haplotype (GGAAGG) distributed highly in the control group by logistic regression analysis (highest odds ratio, 2.515; 95% CI, 1.515-4.175; P = 0.00036). Our observations suggest that genetic variations in the FANCA gene may increase the risk for POF in Korean women.

  19. [Risk factors for asthma in children in Hefei, China].

    PubMed

    Xiong, Mei; Ni, Chen; Pan, Jia-Hua; Wang, Qiang; Zheng, Li-Lin

    2013-05-01

    To investigate the risk factors for asthma in children in Hefei, China and to provide a strategy for asthma control in this region. A total of 400 children with a confirmed diagnosis of asthma, as well as 400 children of comparable age, sex, living environment, and family background, who had no respiratory diseases, were selected for a case-control study. A survey questionnaire survey was completed for all children. The obtained data were subjected to univariate and multivariate logistic regression analysis to determine the risk factors for asthma. The logistic regression analysis showed that a family history of allergy, allergic rhinitis, infantile eczema, no breastfeeding, air-conditioning and passive smoking were the risk factors for asthma in children, with odds ratios of 9.63, 7.56, 4.58, 2.16, 1.73, and 1.55 respectively. In order to reduce the incidence of asthma, we should advocate breast feeding, promote outdoor activities, keep ventilation natural, prevent passive smoking and cure allergic rhinitis.

  20. An assessment of the association between asset ownership and intimate partner violence in Pakistan.

    PubMed

    Murshid, N S

    2017-09-01

    This study assessed the association between women's reports of asset ownership (home and land) and experience of three types of intimate partner violence (IPV): physical violence, emotional violence, and husbands' controlling behaviors. Population-based secondary analysis. This cross-sectional study used data from a sub-sample of 658 women from the nationally representative Pakistan Demographic and Health Survey 2012-13. Logistic regression analyses were used to estimate the association between asset ownership and IPV. Results from logistic regressions indicated that when women owned assets their husbands were 2.3 times more likely to use controlling tactics (P < 0.001) which was mitigated only when women had a say in household decisions. Physical or emotional violence, however, was not significantly associated with women's asset ownership. The study findings highlight the importance of culture and context in policy implementation. Copyright © 2017 The Royal Society for Public Health. Published by Elsevier Ltd. All rights reserved.

  1. Reported gum disease as a cardiovascular risk factor in adults with intellectual disabilities.

    PubMed

    Hsieh, K; Murthy, S; Heller, T; Rimmer, J H; Yen, G

    2018-03-01

    Several risk factors for cardiovascular disease (CVD) have been identified among adults with intellectual disabilities (ID). Periodontitis has been reported to increase the risk of developing a CVD in the general population. Given that individuals with ID have been reported to have a higher prevalence of poor oral health than the general population, the purpose of this study was to determine whether adults with ID with informant reported gum disease present greater reported CVD than those who do not have reported gum disease and whether gum disease can be considered a risk factor for CVD. Using baseline data from the Longitudinal Health and Intellectual Disability Study from which informant survey data were collected, 128 participants with reported gum disease and 1252 subjects without reported gum disease were identified. A series of univariate logistic regressions was conducted to identify potential confounding factors for a multiple logistic regression. The series of univariate logistic regressions identified age, Down syndrome, hypercholesterolemia, hypertension, reported gum disease, daily consumption of fruits and vegetables and the addition of table salt as significant risk factors for reported CVD. When the significant factors from the univariate logistic regression were included in the multiple logistic analysis, reported gum disease remained as an independent risk factor for reported CVD after adjusting for the remaining risk factors. Compared with the adults with ID without reported gum disease, adults in the gum disease group demonstrated a significantly higher prevalence of reported CVD (19.5% vs. 9.7%; P = .001). After controlling for other risk factors, reported gum disease among adults with ID may be associated with a higher risk of CVD. However, further research that also includes clinical indices of periodontal disease and CVD for this population is needed to determine if there is a causal relationship between gum disease and CVD. © 2017 MENCAP and International Association of the Scientific Study of Intellectual and Developmental Disabilities and John Wiley & Sons Ltd.

  2. Application of Social Control Theory to Examine Parent, Teacher, and Close Friend Attachment and Substance Use Initiation among Korean Youth

    ERIC Educational Resources Information Center

    Han, Yoonsun; Kim, Heejoo; Lee, DongHun

    2016-01-01

    Based on Hirschi's social control theory (1969), this study examined the relationship between attachment (an element of social bonds) and the onset of substance use among South Korean adolescents. Using discrete-time logistic regression, the study investigated how attachment to parents, teachers, and close friends was associated with the timing of…

  3. A case-control study of the relationship between a passive second stage of labor and obstetric anal sphincter injuries.

    PubMed

    Gossett, Dana R; Deibel, Philip; Lewicky-Gaupp, Christina

    2016-02-01

    To estimate the relationship between a passive second stage of labor and obstetric anal sphincter injuries (OASIS). A retrospective, case-control study was undertaken of women who delivered at a tertiary-care center in Chicago, IL, USA, between November 2005 and December 2012. Cases had sustained OASIS and were matched on the basis of parity with controls who had no OASIS. Data were obtained from an electronic repository and chart review. Participants with a passive second stage of labor lasting 60 minutes or more were deemed to have "labored down." A logistic regression model to predict OASIS was created. Overall, 1629 cases were compared with 1312 controls. OASIS were recorded among 1452 (57.8%) of 2510 women who did not labor down compared with 169 (40.0%) of 423 women who labored down (P<0.001). However, in binary logistic regression, the addition of laboring down to the model only increased the predictive accuracy from 80.1% to 80.7%. When known risk factors for OASIS are accounted for, the effect of laboring down on perineal outcome is negligible. Copyright © 2015 International Federation of Gynecology and Obstetrics. Published by Elsevier Ireland Ltd. All rights reserved.

  4. Using Multiple and Logistic Regression to Estimate the Median WillCost and Probability of Cost and Schedule Overrun for Program Managers

    DTIC Science & Technology

    2017-03-23

    PUBLIC RELEASE; DISTRIBUTION UNLIMITED Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and... Cost and Probability of Cost and Schedule Overrun for Program Managers Ryan C. Trudelle Follow this and additional works at: https://scholar.afit.edu...afit.edu. Recommended Citation Trudelle, Ryan C., "Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and

  5. Expression of Proteins Involved in Epithelial-Mesenchymal Transition as Predictors of Metastasis and Survival in Breast Cancer Patients

    DTIC Science & Technology

    2013-11-01

    Ptrend 0.78 0.62 0.75 Unconditional logistic regression was used to estimate odds ratios (OR) and 95 % confidence intervals (CI) for risk of node...Ptrend 0.71 0.67 Unconditional logistic regression was used to estimate odds ratios (OR) and 95 % confidence intervals (CI) for risk of high-grade tumors... logistic regression was used to estimate odds ratios (OR) and 95 % confidence intervals (CI) for the associations between each of the seven SNPs and

  6. Transmission Risks of Schistosomiasis Japonica: Extraction from Back-propagation Artificial Neural Network and Logistic Regression Model

    PubMed Central

    Xu, Jun-Fang; Xu, Jing; Li, Shi-Zhu; Jia, Tia-Wu; Huang, Xi-Bao; Zhang, Hua-Ming; Chen, Mei; Yang, Guo-Jing; Gao, Shu-Jing; Wang, Qing-Yun; Zhou, Xiao-Nong

    2013-01-01

    Background The transmission of schistosomiasis japonica in a local setting is still poorly understood in the lake regions of the People's Republic of China (P. R. China), and its transmission patterns are closely related to human, social and economic factors. Methodology/Principal Findings We aimed to apply the integrated approach of artificial neural network (ANN) and logistic regression model in assessment of transmission risks of Schistosoma japonicum with epidemiological data collected from 2339 villagers from 1247 households in six villages of Jiangling County, P.R. China. By using the back-propagation (BP) of the ANN model, 16 factors out of 27 factors were screened, and the top five factors ranked by the absolute value of mean impact value (MIV) were mainly related to human behavior, i.e. integration of water contact history and infection history, family with past infection, history of water contact, infection history, and infection times. The top five factors screened by the logistic regression model were mainly related to the social economics, i.e. village level, economic conditions of family, age group, education level, and infection times. The risk of human infection with S. japonicum is higher in the population who are at age 15 or younger, or with lower education, or with the higher infection rate of the village, or with poor family, and in the population with more than one time to be infected. Conclusion/Significance Both BP artificial neural network and logistic regression model established in a small scale suggested that individual behavior and socioeconomic status are the most important risk factors in the transmission of schistosomiasis japonica. It was reviewed that the young population (≤15) in higher-risk areas was the main target to be intervened for the disease transmission control. PMID:23556015

  7. Logistic LASSO regression for the diagnosis of breast cancer using clinical demographic data and the BI-RADS lexicon for ultrasonography.

    PubMed

    Kim, Sun Mi; Kim, Yongdai; Jeong, Kuhwan; Jeong, Heeyeong; Kim, Jiyoung

    2018-01-01

    The aim of this study was to compare the performance of image analysis for predicting breast cancer using two distinct regression models and to evaluate the usefulness of incorporating clinical and demographic data (CDD) into the image analysis in order to improve the diagnosis of breast cancer. This study included 139 solid masses from 139 patients who underwent a ultrasonography-guided core biopsy and had available CDD between June 2009 and April 2010. Three breast radiologists retrospectively reviewed 139 breast masses and described each lesion using the Breast Imaging Reporting and Data System (BI-RADS) lexicon. We applied and compared two regression methods-stepwise logistic (SL) regression and logistic least absolute shrinkage and selection operator (LASSO) regression-in which the BI-RADS descriptors and CDD were used as covariates. We investigated the performances of these regression methods and the agreement of radiologists in terms of test misclassification error and the area under the curve (AUC) of the tests. Logistic LASSO regression was superior (P<0.05) to SL regression, regardless of whether CDD was included in the covariates, in terms of test misclassification errors (0.234 vs. 0.253, without CDD; 0.196 vs. 0.258, with CDD) and AUC (0.785 vs. 0.759, without CDD; 0.873 vs. 0.735, with CDD). However, it was inferior (P<0.05) to the agreement of three radiologists in terms of test misclassification errors (0.234 vs. 0.168, without CDD; 0.196 vs. 0.088, with CDD) and the AUC without CDD (0.785 vs. 0.844, P<0.001), but was comparable to the AUC with CDD (0.873 vs. 0.880, P=0.141). Logistic LASSO regression based on BI-RADS descriptors and CDD showed better performance than SL in predicting the presence of breast cancer. The use of CDD as a supplement to the BI-RADS descriptors significantly improved the prediction of breast cancer using logistic LASSO regression.

  8. Understanding the Personality and Behavioral Mechanisms Defining Hypersexuality in Men Who Have Sex with Men

    PubMed Central

    Miner, Michael H.; Romine, Rebecca Swinburne; Raymond, Nancy; Janssen, Erick; MacDonald, Angus; Coleman, Eli

    2016-01-01

    Objective The purpose of this study was to investigate personality factors and behavioral mechanisms that are relevant to hypersexuality in men who have sex with men. Method A sample of 242 men who have sex with men were recruited from various sites in a moderate size mid-western city. Participants were assigned to hypersexuality or control group using a SCID-type interview. Self-report inventories were administered that measured the broad band personality constructs of positive emotionality, negative emotionality and constraint, and more narrow constructs related to sexual behavioral control, behavioral activation, behavioral inhibition, sexual excitation, sexual inhibition, impulsivity, ADHD, and sexual behavior. Hierarchical logistic regression was used to determine the relationship between these personality and behavioral variables and group membership. Results A hierarchical logistic regression, controlling for age, revealed a significant positive relationship between hypersexuality and negative emotionality and a negative relationship with constraint. None of the behavioral mechanism variables entered this equation. However, a hierarchical multiple regression predicting sexual behavioral control indicated that lack of such control was positively related to sexual excitation and sexual inhibition due to the threat of performance failure and negatively related to sexual inhibition due to the threat of performance consequences and general behavioral inhibition Conclusions Hypersexuality was found to be related to two broad personality factors that are characterized by emotional reactivity, risk-taking, and impulsivity. The associated lack of sexual behavior control is influenced by both sexual excitatory and inhibitory mechanisms, but not general behavioral activation and inhibitory mechanisms. PMID:27486137

  9. The alarming problems of confounding equivalence using logistic regression models in the perspective of causal diagrams.

    PubMed

    Yu, Yuanyuan; Li, Hongkai; Sun, Xiaoru; Su, Ping; Wang, Tingting; Liu, Yi; Yuan, Zhongshang; Liu, Yanxun; Xue, Fuzhong

    2017-12-28

    Confounders can produce spurious associations between exposure and outcome in observational studies. For majority of epidemiologists, adjusting for confounders using logistic regression model is their habitual method, though it has some problems in accuracy and precision. It is, therefore, important to highlight the problems of logistic regression and search the alternative method. Four causal diagram models were defined to summarize confounding equivalence. Both theoretical proofs and simulation studies were performed to verify whether conditioning on different confounding equivalence sets had the same bias-reducing potential and then to select the optimum adjusting strategy, in which logistic regression model and inverse probability weighting based marginal structural model (IPW-based-MSM) were compared. The "do-calculus" was used to calculate the true causal effect of exposure on outcome, then the bias and standard error were used to evaluate the performances of different strategies. Adjusting for different sets of confounding equivalence, as judged by identical Markov boundaries, produced different bias-reducing potential in the logistic regression model. For the sets satisfied G-admissibility, adjusting for the set including all the confounders reduced the equivalent bias to the one containing the parent nodes of the outcome, while the bias after adjusting for the parent nodes of exposure was not equivalent to them. In addition, all causal effect estimations through logistic regression were biased, although the estimation after adjusting for the parent nodes of exposure was nearest to the true causal effect. However, conditioning on different confounding equivalence sets had the same bias-reducing potential under IPW-based-MSM. Compared with logistic regression, the IPW-based-MSM could obtain unbiased causal effect estimation when the adjusted confounders satisfied G-admissibility and the optimal strategy was to adjust for the parent nodes of outcome, which obtained the highest precision. All adjustment strategies through logistic regression were biased for causal effect estimation, while IPW-based-MSM could always obtain unbiased estimation when the adjusted set satisfied G-admissibility. Thus, IPW-based-MSM was recommended to adjust for confounders set.

  10. Modeling Governance KB with CATPCA to Overcome Multicollinearity in the Logistic Regression

    NASA Astrophysics Data System (ADS)

    Khikmah, L.; Wijayanto, H.; Syafitri, U. D.

    2017-04-01

    The problem often encounters in logistic regression modeling are multicollinearity problems. Data that have multicollinearity between explanatory variables with the result in the estimation of parameters to be bias. Besides, the multicollinearity will result in error in the classification. In general, to overcome multicollinearity in regression used stepwise regression. They are also another method to overcome multicollinearity which involves all variable for prediction. That is Principal Component Analysis (PCA). However, classical PCA in only for numeric data. Its data are categorical, one method to solve the problems is Categorical Principal Component Analysis (CATPCA). Data were used in this research were a part of data Demographic and Population Survey Indonesia (IDHS) 2012. This research focuses on the characteristic of women of using the contraceptive methods. Classification results evaluated using Area Under Curve (AUC) values. The higher the AUC value, the better. Based on AUC values, the classification of the contraceptive method using stepwise method (58.66%) is better than the logistic regression model (57.39%) and CATPCA (57.39%). Evaluation of the results of logistic regression using sensitivity, shows the opposite where CATPCA method (99.79%) is better than logistic regression method (92.43%) and stepwise (92.05%). Therefore in this study focuses on major class classification (using a contraceptive method), then the selected model is CATPCA because it can raise the level of the major class model accuracy.

  11. A general regression framework for a secondary outcome in case-control studies.

    PubMed

    Tchetgen Tchetgen, Eric J

    2014-01-01

    Modern case-control studies typically involve the collection of data on a large number of outcomes, often at considerable logistical and monetary expense. These data are of potentially great value to subsequent researchers, who, although not necessarily concerned with the disease that defined the case series in the original study, may want to use the available information for a regression analysis involving a secondary outcome. Because cases and controls are selected with unequal probability, regression analysis involving a secondary outcome generally must acknowledge the sampling design. In this paper, the author presents a new framework for the analysis of secondary outcomes in case-control studies. The approach is based on a careful re-parameterization of the conditional model for the secondary outcome given the case-control outcome and regression covariates, in terms of (a) the population regression of interest of the secondary outcome given covariates and (b) the population regression of the case-control outcome on covariates. The error distribution for the secondary outcome given covariates and case-control status is otherwise unrestricted. For a continuous outcome, the approach sometimes reduces to extending model (a) by including a residual of (b) as a covariate. However, the framework is general in the sense that models (a) and (b) can take any functional form, and the methodology allows for an identity, log or logit link function for model (a).

  12. Survival analysis of postoperative nausea and vomiting in patients receiving patient-controlled epidural analgesia.

    PubMed

    Lee, Shang-Yi; Hung, Chih-Jen; Chen, Chih-Chieh; Wu, Chih-Cheng

    2014-11-01

    Postoperative nausea and vomiting as well as postoperative pain are two major concerns when patients undergo surgery and receive anesthetics. Various models and predictive methods have been developed to investigate the risk factors of postoperative nausea and vomiting, and different types of preventive managements have subsequently been developed. However, there continues to be a wide variation in the previously reported incidence rates of postoperative nausea and vomiting. This may have occurred because patients were assessed at different time points, coupled with the overall limitation of the statistical methods used. However, using survival analysis with Cox regression, and thus factoring in these time effects, may solve this statistical limitation and reveal risk factors related to the occurrence of postoperative nausea and vomiting in the following period. In this retrospective, observational, uni-institutional study, we analyzed the results of 229 patients who received patient-controlled epidural analgesia following surgery from June 2007 to December 2007. We investigated the risk factors for the occurrence of postoperative nausea and vomiting, and also assessed the effect of evaluating patients at different time points using the Cox proportional hazards model. Furthermore, the results of this inquiry were compared with those results using logistic regression. The overall incidence of postoperative nausea and vomiting in our study was 35.4%. Using logistic regression, we found that only sex, but not the total doses and the average dose of opioids, had significant effects on the occurrence of postoperative nausea and vomiting at some time points. Cox regression showed that, when patients consumed a higher average dose of opioids, this correlated with a higher incidence of postoperative nausea and vomiting with a hazard ratio of 1.286. Survival analysis using Cox regression showed that the average consumption of opioids played an important role in postoperative nausea and vomiting, a result not found by logistic regression. Therefore, the incidence of postoperative nausea and vomiting in patients cannot be reliably determined on the basis of a single visit at one point in time. Copyright © 2014. Published by Elsevier Taiwan.

  13. Adolescent Violence: The Protective Effects of Youth Assets

    ERIC Educational Resources Information Center

    Aspy, Cheryl B.; Oman, Roy F.; Vesely, Sara K.; McLeroy, Kenneth; Rodine, Sharon; Marshall, LaDonna

    2004-01-01

    The authors explored adolescent physical fighting and weapon carrying, using in-home interviews with 1,098 middle-high school students and their parents. Logistic regression analyses examined the relationship between youth assets and the risk behaviors while controlling for demographic information. Both demographic factors and assets were…

  14. Logistic regression models of factors influencing the location of bioenergy and biofuels plants

    Treesearch

    T.M. Young; R.L. Zaretzki; J.H. Perdue; F.M. Guess; X. Liu

    2011-01-01

    Logistic regression models were developed to identify significant factors that influence the location of existing wood-using bioenergy/biofuels plants and traditional wood-using facilities. Logistic models provided quantitative insight for variables influencing the location of woody biomass-using facilities. Availability of "thinnings to a basal area of 31.7m2/ha...

  15. Discrete post-processing of total cloud cover ensemble forecasts

    NASA Astrophysics Data System (ADS)

    Hemri, Stephan; Haiden, Thomas; Pappenberger, Florian

    2017-04-01

    This contribution presents an approach to post-process ensemble forecasts for the discrete and bounded weather variable of total cloud cover. Two methods for discrete statistical post-processing of ensemble predictions are tested. The first approach is based on multinomial logistic regression, the second involves a proportional odds logistic regression model. Applying them to total cloud cover raw ensemble forecasts from the European Centre for Medium-Range Weather Forecasts improves forecast skill significantly. Based on station-wise post-processing of raw ensemble total cloud cover forecasts for a global set of 3330 stations over the period from 2007 to early 2014, the more parsimonious proportional odds logistic regression model proved to slightly outperform the multinomial logistic regression model. Reference Hemri, S., Haiden, T., & Pappenberger, F. (2016). Discrete post-processing of total cloud cover ensemble forecasts. Monthly Weather Review 144, 2565-2577.

  16. Fuzzy multinomial logistic regression analysis: A multi-objective programming approach

    NASA Astrophysics Data System (ADS)

    Abdalla, Hesham A.; El-Sayed, Amany A.; Hamed, Ramadan

    2017-05-01

    Parameter estimation for multinomial logistic regression is usually based on maximizing the likelihood function. For large well-balanced datasets, Maximum Likelihood (ML) estimation is a satisfactory approach. Unfortunately, ML can fail completely or at least produce poor results in terms of estimated probabilities and confidence intervals of parameters, specially for small datasets. In this study, a new approach based on fuzzy concepts is proposed to estimate parameters of the multinomial logistic regression. The study assumes that the parameters of multinomial logistic regression are fuzzy. Based on the extension principle stated by Zadeh and Bárdossy's proposition, a multi-objective programming approach is suggested to estimate these fuzzy parameters. A simulation study is used to evaluate the performance of the new approach versus Maximum likelihood (ML) approach. Results show that the new proposed model outperforms ML in cases of small datasets.

  17. Sleep problems and suicide attempts among adolescents: a case-control study.

    PubMed

    Koyawala, Neel; Stevens, Jack; McBee-Strayer, Sandra M; Cannon, Elizabeth A; Bridge, Jeffrey A

    2015-01-01

    This study used a case-control design to compare sleep disturbances in 40 adolescents who attempted suicide with 40 never-suicidal adolescents. Using hierarchical logistic regression analyses, we found that self-reported nighttime awakenings were significantly associated with attempted suicide, after controlling for antidepressant use, antipsychotic use, affective problems, and being bullied. In a separate regression analysis, the parent-reported total sleep problems score also predicted suicide attempt status, controlling for key covariates. No associations were found between suicide attempts and other distinct sleep problems, including falling asleep at bedtime, sleeping a lot during the day, trouble waking up in the morning, sleep duration, and parent-reported nightmares. Clinicians should be aware of sleep problems as potential risk factors for suicide attempts for adolescents.

  18. Serum magnesium but not calcium was associated with hemorrhagic transformation in stroke overall and stroke subtypes: a case-control study in China.

    PubMed

    Tan, Ge; Yuan, Ruozhen; Wei, ChenChen; Xu, Mangmang; Liu, Ming

    2018-05-26

    Association between serum calcium and magnesium versus hemorrhagic transformation (HT) remains to be identified. A total of 1212 non-thrombolysis patients with serum calcium and magnesium collected within 24 h from stroke onset were enrolled. Backward stepwise multivariate logistic regression analysis was conducted to investigate association between calcium and magnesium versus HT. Calcium and magnesium were entered into logistic regression analysis in two models, separately: model 1, as continuous variable (per 1-mmol/L increase), and model 2, as four-categorized variable (being collapsed into quartiles). HT occurred in 140 patients (11.6%). Serum calcium was slightly lower in patients with HT than in patient without HT (P = 0.273). But serum magnesium was significantly lower in patients with HT than in patients without HT (P = 0.007). In logistic regression analysis, calcium displayed no association with HT. Magnesium, as either continuous or four-categorized variable, was independently and inversely associated with HT in stroke overall and stroke of large-artery atherosclerosis (LAA). The results demonstrated that serum calcium had no association with HT in patients without thrombolysis after acute ischemic stroke. Serum magnesium in low level was independently associated with increasing HT in stroke overall and particularly in stroke of LAA.

  19. A Primer on Logistic Regression.

    ERIC Educational Resources Information Center

    Woldbeck, Tanya

    This paper introduces logistic regression as a viable alternative when the researcher is faced with variables that are not continuous. If one is to use simple regression, the dependent variable must be measured on a continuous scale. In the behavioral sciences, it may not always be appropriate or possible to have a measured dependent variable on a…

  20. Periodontal disease in Chinese patients with systemic lupus erythematosus.

    PubMed

    Zhang, Qiuxiang; Zhang, Xiaoli; Feng, Guijaun; Fu, Ting; Yin, Rulan; Zhang, Lijuan; Feng, Xingmei; Li, Liren; Gu, Zhifeng

    2017-08-01

    Disease of systemic lupus erythematosus (SLE) and periodontal disease (PD) shares the common multiple characteristics. The aims of the present study were to evaluate the prevalence and severity of periodontal disease in Chinese SLE patients and to determine the association between SLE features and periodontal parameters. A cross-sectional study of 108 SLE patients together with 108 age- and sex-matched healthy controls was made. Periodontal status was conducted by two dentists independently. Sociodemographic characteristics, lifestyle factors, medication use, and clinical parameters were also assessed. The periodontal status was significantly worse in SLE patients compared to controls. In univariate logistic regression, SLE had a significant 2.78-fold [95% confidence interval (CI) 1.60-4.82] increase in odds of periodontitis compared to healthy controls. Adjusted for potential risk factors, patients with SLE had 13.98-fold (95% CI 5.10-38.33) increased odds against controls. In multiple linear regression model, the independent variable negatively and significantly associated with gingival index was education (P = 0.005); conversely, disease activity (P < 0.001) and plaque index (P = 0.002) were positively associated; Age was the only variable independently associated with periodontitis of SLE in multivariate logistic regression (OR 1.348; 95% CI: 1.183-1.536, P < 0.001). Chinese SLE patients were likely to suffer from higher odds of PD. These findings confirmed the importance of early interventions in combination with medical therapy. It is necessary for a close collaboration between dentists and clinicians when treating those patients.

  1. Serum 25-hydroxyvitamin D and breast cancer in the military: a case-control study utilizing pre-diagnostic serum.

    PubMed

    Mohr, Sharif B; Gorham, Edward D; Alcaraz, John E; Kane, Christopher I; Macera, Caroline A; Parsons, J Kellogg; Wingard, Deborah L; Horst, Ronald; Garland, Cedric F

    2013-03-01

    The objective of this study was to ascertain whether a relationship exists between pre-diagnostic serum levels of 25-hydroxyvitamin D (25(OH)D) and risk of breast cancer in young women. About 600 incident cases of breast cancer were matched to 600 controls as part of a nested case-control study that utilized pre-diagnostic sera. Logistic regression was used to assess the relationship between serum 25(OH)D concentration and breast cancer risk, controlling for race and age. According to the conditional logistic regression for all subjects, odds ratios for breast cancer by quintile of serum 25(OH)D from lowest to highest were 1.2, 1.0, 0.9, 1.1, and 1.0 (reference) (p trend = 0.72). After multivariate regression for subjects whose blood had been collected within 90 days preceding diagnosis, odds ratios for breast cancer by quintile of serum 25(OH)D from lowest to highest were 3.3, 1.9, 1.7, 2.6, and 1.0 (reference) (p trend = 0.09). An inverse association between serum 25(OH)D concentration and risk of breast cancer was not present in the principal analysis, although an inverse association was present in a small subgroup analysis of subjects whose blood had been collected within 90 days preceding diagnosis. Further prospective studies of 25(OH)D and breast cancer risk are needed.

  2. Association of Parkinsonism or Parkinson Disease with Polypharmacy in the Year Preceding Diagnosis: A Nested Case-Control Study in South Korea.

    PubMed

    Park, Hae-Young; Park, Ji-Won; Sohn, Hyun Soon; Kwon, Jin-Won

    2017-11-01

    Published studies on the association between polypharmacy and parkinsonism or Parkinson disease are very limited. The objective of this study was to investigate whether polypharmacy is associated with parkinsonism or Parkinson disease in elderly patients. From a South Korean national health insurance sample cohort database for 2002-2013, we matched parkinsonism cases (defined by diagnosis codes for parkinsonism/Parkinson disease) and Parkinson disease cases (patients who had records for both Parkinson disease diagnosis and anti-Parkinson disease drug prescriptions) with controls. Logistic regression analysis evaluated the associations of parkinsonism/Parkinson disease with polypharmacy (i.e., five or more prescribed daily drugs) during the year preceding parkinsonism/Parkinson disease diagnosis, medications potentially associated with parkinsonism, and comorbidity status (using the Charlson Comorbidity Index score and hospitalization records). The study population included 6209 cases and 24,836 controls for parkinsonism and 1331 cases and 5324 controls for Parkinson disease. In univariate logistic regression, odds ratios for parkinsonism/Parkinson disease increased significantly with increased polypharmacy, medications potentially associated with parkinsonism, Charlson Comorbidity Index score, or prior hospitalizations. In multiple logistic regression, odds ratios for parkinsonism/Parkinson disease (adjusted for medications potentially associated with parkinsonism and comorbidities) also increased with increased polypharmacy. Odds ratios (95% confidence interval) for Parkinson disease were higher than those for parkinsonism with stronger statistical significance: 1.41 (1.28-1.55) and 2.17 (1.84-2.57) for parkinsonism and 2.87 (2.30-3.58) and 4.75 (3.39-6.66) for Parkinson disease for between five and ten prescribed daily drugs and ten or more drugs, respectively. Polypharmacy in the year preceding diagnosis may be associated with an increased risk for parkinsonism/Parkinson disease. Medications potentially associated with parkinsonism were assumed to increase the risk for parkinsonism/Parkinson disease, but more studies are required to confirm this relationship.

  3. A case-control study of determinants for high and low dental caries prevalence in Nevada youth

    PubMed Central

    2010-01-01

    Background The main purpose of this study was to compare the 30% of Nevada Youth who presented with the highest Decayed Missing and Filled Teeth (DMFT) index to a cohort who were caries free and to national NHANES data. Secondly, to explore the factors associated with higher caries prevalence in those with the highest DMFT scores compared to the caries-free group. Methods Over 4000 adolescents between ages 12 and 19 (Case Group: N = 2124; Control Group: N = 2045) received oral health screenings conducted in public/private middle and high schools in Nevada in 2008/2009 academic year. Caries prevalence was computed (Untreated decay scores [D-Score] and DMFT scores) for the 30% of Nevada Youth who presented with the highest DMFT score (case group) and compared to the control group (caries-free) and to national averages. Bivariate and multivariate logistic regression was used to analyze the relationship between selected variables and caries prevalence. Results A majority of the sample was non-Hispanic (62%), non-smokers (80%), and had dental insurance (70%). With the exception of gender, significant differences in mean D-scores were found in seven of the eight variables. All variables produced significant differences between the case and control groups in mean DMFT Scores. With the exception of smoking status, there were significant differences in seven of the eight variables in the bivariate logistic regression. All of the independent variables remained in the multivariate logistic regression model contributing significantly to over 40% of the variation in the increased DMFT status. The strongest predictors for the high DMFT status were racial background, age, fluoridated community, and applied sealants respectively. Gender, second hand smoke, insurance status, and tobacco use were significant, but to a lesser extent. Conclusions Findings from this study will aid in creating educational programs and other primary and secondary interventions to help promote oral health for Nevada youth, especially focusing on the subgroup that presents with the highest mean DMFT scores. PMID:21067620

  4. A Solution to Separation and Multicollinearity in Multiple Logistic Regression

    PubMed Central

    Shen, Jianzhao; Gao, Sujuan

    2010-01-01

    In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27–38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth’s penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study. PMID:20376286

  5. A Solution to Separation and Multicollinearity in Multiple Logistic Regression.

    PubMed

    Shen, Jianzhao; Gao, Sujuan

    2008-10-01

    In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27-38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth's penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study.

  6. Oral Microbiota and Risk for Esophageal Squamous Cell Carcinoma in a High-Risk Area of China.

    PubMed

    Chen, Xingdong; Winckler, Björn; Lu, Ming; Cheng, Hongwei; Yuan, Ziyu; Yang, Yajun; Jin, Li; Ye, Weimin

    2015-01-01

    Poor oral health has been linked with an increased risk of esophageal squamous cell carcinoma (ESCC). We investigated whether alteration of oral microbiota is associated with ESCC risk. Fasting saliva samples were collected from 87 incident and histopathologicallly diagnosed ESCC cases, 63 subjects with dysplasia and 85 healthy controls. All subjects were also interviewed with a questionnaire. V3-V4 region of 16S rRNA was amplified and sequenced by 454-pyrosequencing platform. Carriage of each genus was compared by means of multivariate-adjusted odds ratios derived from logistic regression model. Relative abundance was compared using Metastats method. Beta diversity was estimated using Unifrac and weighted Unifrac distances. Principal coordinate analysis (PCoA) was applied to ordinate dissimilarity matrices. Multinomial logistic regression was used to compare the coordinates between different groups. ESCC subjects had an overall decreased microbial diversity compared to control and dysplasia subjects (P<0.001). Decreased carriage of genera Lautropia, Bulleidia, Catonella, Corynebacterium, Moryella, Peptococcus and Cardiobacterium were found in ESCC subjects compared to non-ESCC subjects. Multinomial logistic regression analyses on PCoA coordinates also revealed that ESCC subjects had significantly different levels for several coordinates compared to non-ESCC subjects. In conclusion, we observed a correlation between altered salivary bacterial microbiota and ESCC risk. The results of our study on the saliva microbiome are of particular interest as it reflects the shift in microbial communities. Further studies are warranted to verify this finding, and if being verified, to explore the underlying mechanisms.

  7. Job stress models, depressive disorders and work performance of engineers in microelectronics industry.

    PubMed

    Chen, Sung-Wei; Wang, Po-Chuan; Hsin, Ping-Lung; Oates, Anthony; Sun, I-Wen; Liu, Shen-Ing

    2011-01-01

    Microelectronic engineers are considered valuable human capital contributing significantly toward economic development, but they may encounter stressful work conditions in the context of a globalized industry. The study aims at identifying risk factors of depressive disorders primarily based on job stress models, the Demand-Control-Support and Effort-Reward Imbalance models, and at evaluating whether depressive disorders impair work performance in microelectronics engineers in Taiwan. The case-control study was conducted among 678 microelectronics engineers, 452 controls and 226 cases with depressive disorders which were defined by a score 17 or more on the Beck Depression Inventory and a psychiatrist's diagnosis. The self-administered questionnaires included the Job Content Questionnaire, Effort-Reward Imbalance Questionnaire, demography, psychosocial factors, health behaviors and work performance. Hierarchical logistic regression was applied to identify risk factors of depressive disorders. Multivariate linear regressions were used to determine factors affecting work performance. By hierarchical logistic regression, risk factors of depressive disorders are high demands, low work social support, high effort/reward ratio and low frequency of physical exercise. Combining the two job stress models may have better predictive power for depressive disorders than adopting either model alone. Three multivariate linear regressions provide similar results indicating that depressive disorders are associated with impaired work performance in terms of absence, role limitation and social functioning limitation. The results may provide insight into the applicability of job stress models in a globalized high-tech industry considerably focused in non-Western countries, and the design of workplace preventive strategies for depressive disorders in Asian electronics engineering population.

  8. Constructive thinking, rational intelligence and irritable bowel syndrome.

    PubMed

    Rey, Enrique; Moreno Ortega, Marta; Garcia Alonso, Monica-Olga; Diaz-Rubio, Manuel

    2009-07-07

    To evaluate rational and experiential intelligence in irritable bowel syndrome (IBS) sufferers. We recruited 100 subjects with IBS as per Rome II criteria (50 consulters and 50 non-consulters) and 100 healthy controls, matched by age, sex and educational level. Cases and controls completed a clinical questionnaire (including symptom characteristics and medical consultation) and the following tests: rational-intelligence (Wechsler Adult Intelligence Scale, 3rd edition); experiential-intelligence (Constructive Thinking Inventory); personality (NEO personality inventory); psychopathology (MMPI-2), anxiety (state-trait anxiety inventory) and life events (social readjustment rating scale). Analysis of variance was used to compare the test results of IBS-sufferers and controls, and a logistic regression model was then constructed and adjusted for age, sex and educational level to evaluate any possible association with IBS. No differences were found between IBS cases and controls in terms of IQ (102.0 +/- 10.8 vs 102.8 +/- 12.6), but IBS sufferers scored significantly lower in global constructive thinking (43.7 +/- 9.4 vs 49.6 +/- 9.7). In the logistic regression model, global constructive thinking score was independently linked to suffering from IBS [OR 0.92 (0.87-0.97)], without significant OR for total IQ. IBS subjects do not show lower rational intelligence than controls, but lower experiential intelligence is nevertheless associated with IBS.

  9. Controlled pattern imputation for sensitivity analysis of longitudinal binary and ordinal outcomes with nonignorable dropout.

    PubMed

    Tang, Yongqiang

    2018-04-30

    The controlled imputation method refers to a class of pattern mixture models that have been commonly used as sensitivity analyses of longitudinal clinical trials with nonignorable dropout in recent years. These pattern mixture models assume that participants in the experimental arm after dropout have similar response profiles to the control participants or have worse outcomes than otherwise similar participants who remain on the experimental treatment. In spite of its popularity, the controlled imputation has not been formally developed for longitudinal binary and ordinal outcomes partially due to the lack of a natural multivariate distribution for such endpoints. In this paper, we propose 2 approaches for implementing the controlled imputation for binary and ordinal data based respectively on the sequential logistic regression and the multivariate probit model. Efficient Markov chain Monte Carlo algorithms are developed for missing data imputation by using the monotone data augmentation technique for the sequential logistic regression and a parameter-expanded monotone data augmentation scheme for the multivariate probit model. We assess the performance of the proposed procedures by simulation and the analysis of a schizophrenia clinical trial and compare them with the fully conditional specification, last observation carried forward, and baseline observation carried forward imputation methods. Copyright © 2018 John Wiley & Sons, Ltd.

  10. Do Basic Skills Predict Youth Unemployment (16- to 24-Year-Olds) Also when Controlled for Accomplished Upper-Secondary School? A Cross-Country Comparison

    ERIC Educational Resources Information Center

    Lundetrae, Kjersti; Gabrielsen, Egil; Mykletun, Reidar

    2010-01-01

    Basic skills and educational level are closely related, and both might affect employment. Data from the Adult Literacy and Life Skills Survey were used to examine whether basic skills in terms of literacy and numeracy predicted youth unemployment (16-24 years) while controlling for educational level. Stepwise logistic regression showed that in…

  11. Reduction of Racial Disparities in Prostate Cancer

    DTIC Science & Technology

    2007-12-01

    anti-inflammatory medication, COX-2 inhibitors, aspirin, anti-TNF medications), and other medications of interest (testosterone, finasteride , alpha...compared to control-patients (mean 123) P=0.01. There were 14 (7%) control-patients who had Finasteride use, with an average of 398.6 doses per...individual. None of the prosate cancer patients had prior finasteride use. In a multiple logistic regression model (Table 2), after adjustment for the

  12. Employment Hardship among Mexican-Origin Women

    ERIC Educational Resources Information Center

    De Anda, Roberto M.

    2005-01-01

    This study compares the prevalence and causes of employment hardship between Mexican-origin and White women. Data come from the March 1992, 1996, and 2000 Current Population Surveys. Using logistic regression, the author assesses whether there is a difference between Mexican-origin and White women in employment hardship, controlling for personal…

  13. Obesity, Physical Activity, and Sedentary Behavior of Youth with Learning Disabilities and ADHD

    ERIC Educational Resources Information Center

    Cook, Bryan G.; Li, Dongmei; Heinrich, Katie M.

    2015-01-01

    Obesity, physical activity, and sedentary behavior in childhood are important indicators of present and future health and are associated with school-related outcomes such as academic achievement, behavior, peer relationships, and self-esteem. Using logistic regression models that controlled for gender, age, ethnicity/race, and socioeconomic…

  14. Optimizing Treatment of Lung Cancer Patients with Comorbidities

    DTIC Science & Technology

    2017-10-01

    of treatment options, comorbid illness, age, sex , histology, and tumor size. We will simulate base case scenarios for stage I NSCLC for all possible...fitting adjusted logistic regression models controlling for age, sex and cancer stage. Results Overall, 5,644 (80.4%) and 1,377 (19.6%) patients

  15. Nonstandard Employment in the Nonmetropolitan United States

    ERIC Educational Resources Information Center

    McLaughlin, Diane K.; Coleman-Jensen, Alisha J.

    2008-01-01

    We examine the prevalence of nonstandard employment in the nonmetropolitan United States using the Current Population Survey Supplement on Contingent Work (1999 and 2001). We find that nonstandard work is more prevalent in nonmetropolitan than in central city or suburban areas. Logistic regression models controlling for sociodemographic and work…

  16. Social Context of Drinking and Alcohol Problems among College Students

    ERIC Educational Resources Information Center

    Beck, Kenneth H.; Arria, Amelia M.; Caldeira, Kimberly M.; Vincent, Kathryn B.; O'Grady, Kevin E.; Wish, Eric D.

    2008-01-01

    Objective: To examine how social contexts of drinking are related to alcohol use disorders, other alcohol-related problems, and depression among college students. Methods: Logistic regression models controlling for drinking frequency measured the association between social context and problems, among 728 current drinkers. Results: Drinking for…

  17. Clinical Utility of Cancellation on the WISC-IV

    ERIC Educational Resources Information Center

    Zhu, Jianjun; Chen, Hsinyi

    2013-01-01

    This study examined empirical evidence for clinical utility of the Wechsler Intelligence Scale for Children, fourth edition (WISC-IV) cancellation subtest by comparing data from 597 clinical and 597 matched control children. The results of dependent t and sequential logistic regression analyses demonstrated that (a) children with intellectual…

  18. Characterizing mammographic images by using generic texture features

    PubMed Central

    2012-01-01

    Introduction Although mammographic density is an established risk factor for breast cancer, its use is limited in clinical practice because of a lack of automated and standardized measurement methods. The aims of this study were to evaluate a variety of automated texture features in mammograms as risk factors for breast cancer and to compare them with the percentage mammographic density (PMD) by using a case-control study design. Methods A case-control study including 864 cases and 418 controls was analyzed automatically. Four hundred seventy features were explored as possible risk factors for breast cancer. These included statistical features, moment-based features, spectral-energy features, and form-based features. An elaborate variable selection process using logistic regression analyses was performed to identify those features that were associated with case-control status. In addition, PMD was assessed and included in the regression model. Results Of the 470 image-analysis features explored, 46 remained in the final logistic regression model. An area under the curve of 0.79, with an odds ratio per standard deviation change of 2.88 (95% CI, 2.28 to 3.65), was obtained with validation data. Adding the PMD did not improve the final model. Conclusions Using texture features to predict the risk of breast cancer appears feasible. PMD did not show any additional value in this study. With regard to the features assessed, most of the analysis tools appeared to reflect mammographic density, although some features did not correlate with PMD. It remains to be investigated in larger case-control studies whether these features can contribute to increased prediction accuracy. PMID:22490545

  19. Adjusting for Confounding in Early Postlaunch Settings: Going Beyond Logistic Regression Models.

    PubMed

    Schmidt, Amand F; Klungel, Olaf H; Groenwold, Rolf H H

    2016-01-01

    Postlaunch data on medical treatments can be analyzed to explore adverse events or relative effectiveness in real-life settings. These analyses are often complicated by the number of potential confounders and the possibility of model misspecification. We conducted a simulation study to compare the performance of logistic regression, propensity score, disease risk score, and stabilized inverse probability weighting methods to adjust for confounding. Model misspecification was induced in the independent derivation dataset. We evaluated performance using relative bias confidence interval coverage of the true effect, among other metrics. At low events per coefficient (1.0 and 0.5), the logistic regression estimates had a large relative bias (greater than -100%). Bias of the disease risk score estimates was at most 13.48% and 18.83%. For the propensity score model, this was 8.74% and >100%, respectively. At events per coefficient of 1.0 and 0.5, inverse probability weighting frequently failed or reduced to a crude regression, resulting in biases of -8.49% and 24.55%. Coverage of logistic regression estimates became less than the nominal level at events per coefficient ≤5. For the disease risk score, inverse probability weighting, and propensity score, coverage became less than nominal at events per coefficient ≤2.5, ≤1.0, and ≤1.0, respectively. Bias of misspecified disease risk score models was 16.55%. In settings with low events/exposed subjects per coefficient, disease risk score methods can be useful alternatives to logistic regression models, especially when propensity score models cannot be used. Despite better performance of disease risk score methods than logistic regression and propensity score models in small events per coefficient settings, bias, and coverage still deviated from nominal.

  20. Predictors and Neuropsychiatric Profile of Nucleus Basalis of Meynert Degeneration in Parkinson Disease

    DTIC Science & Technology

    2017-10-01

    baseline were available for 228 PD subjects. In a logistic regression model adjusted for age and sex , Ch4 density was associated with lower risk of...events, there were no significant differences in age or sex (p>0.05). PD subjects with 2 or more psychotic events had significantly lower baseline Ch4...Aim 1 and 2 include use of linear regression models to adjust for age, sex , and other significant covariates. Aim 3 is a cross-sectional controlled

  1. No rationale for 1 variable per 10 events criterion for binary logistic regression analysis.

    PubMed

    van Smeden, Maarten; de Groot, Joris A H; Moons, Karel G M; Collins, Gary S; Altman, Douglas G; Eijkemans, Marinus J C; Reitsma, Johannes B

    2016-11-24

    Ten events per variable (EPV) is a widely advocated minimal criterion for sample size considerations in logistic regression analysis. Of three previous simulation studies that examined this minimal EPV criterion only one supports the use of a minimum of 10 EPV. In this paper, we examine the reasons for substantial differences between these extensive simulation studies. The current study uses Monte Carlo simulations to evaluate small sample bias, coverage of confidence intervals and mean square error of logit coefficients. Logistic regression models fitted by maximum likelihood and a modified estimation procedure, known as Firth's correction, are compared. The results show that besides EPV, the problems associated with low EPV depend on other factors such as the total sample size. It is also demonstrated that simulation results can be dominated by even a few simulated data sets for which the prediction of the outcome by the covariates is perfect ('separation'). We reveal that different approaches for identifying and handling separation leads to substantially different simulation results. We further show that Firth's correction can be used to improve the accuracy of regression coefficients and alleviate the problems associated with separation. The current evidence supporting EPV rules for binary logistic regression is weak. Given our findings, there is an urgent need for new research to provide guidance for supporting sample size considerations for binary logistic regression analysis.

  2. 4D-Fingerprint Categorical QSAR Models for Skin Sensitization Based on Classification Local Lymph Node Assay Measures

    PubMed Central

    Li, Yi; Tseng, Yufeng J.; Pan, Dahua; Liu, Jianzhong; Kern, Petra S.; Gerberick, G. Frank; Hopfinger, Anton J.

    2008-01-01

    Currently, the only validated methods to identify skin sensitization effects are in vivo models, such as the Local Lymph Node Assay (LLNA) and guinea pig studies. There is a tremendous need, in particular due to novel legislation, to develop animal alternatives, eg. Quantitative Structure-Activity Relationship (QSAR) models. Here, QSAR models for skin sensitization using LLNA data have been constructed. The descriptors used to generate these models are derived from the 4D-molecular similarity paradigm and are referred to as universal 4D-fingerprints. A training set of 132 structurally diverse compounds and a test set of 15 structurally diverse compounds were used in this study. The statistical methodologies used to build the models are logistic regression (LR), and partial least square coupled logistic regression (PLS-LR), which prove to be effective tools for studying skin sensitization measures expressed in the two categorical terms of sensitizer and non-sensitizer. QSAR models with low values of the Hosmer-Lemeshow goodness-of-fit statistic, χHL2, are significant and predictive. For the training set, the cross-validated prediction accuracy of the logistic regression models ranges from 77.3% to 78.0%, while that of PLS-logistic regression models ranges from 87.1% to 89.4%. For the test set, the prediction accuracy of logistic regression models ranges from 80.0%-86.7%, while that of PLS-logistic regression models ranges from 73.3%-80.0%. The QSAR models are made up of 4D-fingerprints related to aromatic atoms, hydrogen bond acceptors and negatively partially charged atoms. PMID:17226934

  3. Logistic Regression Likelihood Ratio Test Analysis for Detecting Signals of Adverse Events in Post-market Safety Surveillance.

    PubMed

    Nam, Kijoeng; Henderson, Nicholas C; Rohan, Patricia; Woo, Emily Jane; Russek-Cohen, Estelle

    2017-01-01

    The Vaccine Adverse Event Reporting System (VAERS) and other product surveillance systems compile reports of product-associated adverse events (AEs), and these reports may include a wide range of information including age, gender, and concomitant vaccines. Controlling for possible confounding variables such as these is an important task when utilizing surveillance systems to monitor post-market product safety. A common method for handling possible confounders is to compare observed product-AE combinations with adjusted baseline frequencies where the adjustments are made by stratifying on observable characteristics. Though approaches such as these have proven to be useful, in this article we propose a more flexible logistic regression approach which allows for covariates of all types rather than relying solely on stratification. Indeed, a main advantage of our approach is that the general regression framework provides flexibility to incorporate additional information such as demographic factors and concomitant vaccines. As part of our covariate-adjusted method, we outline a procedure for signal detection that accounts for multiple comparisons and controls the overall Type 1 error rate. To demonstrate the effectiveness of our approach, we illustrate our method with an example involving febrile convulsion, and we further evaluate its performance in a series of simulation studies.

  4. Health related quality of life among myocardial infarction survivors in the United States: a propensity score matched analysis.

    PubMed

    Mollon, Lea; Bhattacharjee, Sandipan

    2017-12-04

    Little is known regarding the health-related quality of life among myocardial infarction (MI) survivors in the United States. The purpose of this population-based study was to identify differences in health-related quality of life domains between MI survivors and propensity score matched controls. This retrospective, cross-sectional matched case-control study examined differences in health-related quality of life (HRQoL) among MI survivors of myocardial infarction compared to propensity score matched controls using data from the 2015 Behavioral Risk Factor Surveillance System (BRFSS) survey. Propensity scores were generated via logistic regression for MI survivors and controls based on gender, race/ethnicity, age, body mass index (BMI), smoking status, and comorbidities. Chi-square tests were used to compare differences between MI survivors to controls for demographic variables. A multivariate analysis of HRQoL domains estimated odds ratios. Life satisfaction, sleep quality, and activity limitations were estimated using binary logistic regression. Social support, perceived general health, perceived physical health, and perceived mental health were estimated using multinomial logistic regression. Significance was set at p < 0.05. The final sample consisted of 16,729 MI survivors matched to 50,187 controls (n = 66,916). Survivors were approximately 2.7 times more likely to report fair/poor general health compared to control (AOR = 2.72, 95% CI: 2.43-3.05) and 1.5 times more likely to report limitations to daily activities (AOR = 1.46, 95% CI: 1.34-1.59). Survivors were more likely to report poor physical health >15 days in the month (AOR = 1.63, 95% CI: 1.46-1.83) and poor mental health >15 days in the month (AOR = 1.25, 95% CI: 1.07-1.46) compared to matched controls. There was no difference in survivors compared to controls in level of emotional support (rarely/never: AOR = 0.75, 95% CI: 0.48-1.18; sometimes: AOR = 0.73, 95% CI: 0.41-1.28), hours of recommended sleep (AOR = 1.14, 95% CI: 0.94-1.38), or life satisfaction (AOR = 1.62, 95% CI: 0.99-2.63). MI survivors experienced lower HRQoL on domains of general health, physical health, daily activity, and mental health compared to the general population.

  5. MODELING SNAKE MICROHABITAT FROM RADIOTELEMETRY STUDIES USING POLYTOMOUS LOGISTIC REGRESSION

    EPA Science Inventory

    Multivariate analysis of snake microhabitat has historically used techniques that were derived under assumptions of normality and common covariance structure (e.g., discriminant function analysis, MANOVA). In this study, polytomous logistic regression (PLR which does not require ...

  6. Socioeconomic factors affecting infant sleep-related deaths in St. Louis.

    PubMed

    Hogan, Cathy

    2014-01-01

    Though the Back to Sleep Campaign that began in 1994 caused an overall decrease in sudden infant death syndrome (SIDS) rates, racial disparity has continued to increase in St. Louis. Though researchers have analyzed and described various sociodemographic characteristics of SIDS and infant deaths by unintentional suffocation in St. Louis, they have not simultaneously controlled for contributory risk factors to racial disparity such as race, poverty, maternal education, and number of children born to each mother (parity). To determine whether there is a relationship between maternal socioeconomic factors and sleep-related infant death. This quantitative case-control study used secondary data collected by the Missouri Department of Health and Senior Services between 2005 and 2009. The sample includes matched birth/death certificates and living birth certificates of infants who were born/died within time frame. Descriptive analysis, Chi-square, and logistic regression. The controls were birth records of infants who lived more than 1 year. Chi-square and logistic regression analyses confirmed that race and poverty have significant relationships with infant sleep-related deaths. The social significance of this study is that the results may lead to population-specific modifications of prevention messages that will reduce infant sleep-related deaths. © 2013 Wiley Periodicals, Inc.

  7. Evaluation of Female Breast Cancer Risk Among the Betel Quid Chewer: A Bio-Statistical Assessment in Assam, India.

    PubMed

    Rajbongshi, Nijara; Mahanta, Lipi B; Nath, Dilip C

    2015-06-01

    Breast cancer is the most commonly diagnosed cancer among the female population of Assam, India. Chewing of betel quid with or without tobacco is common practice among female population of this region. Moreoverthe method of preparing the betel quid is different from other parts of the country.So matched case control study is conducted to analyse whetherbetel quid chewing plays a significant role in the high incidence of breast cancer occurrences in Assam. Here, controls are matched to the cases by age at diagnosis (±5 years), family income and place of residence with matching ratio 1:1. Conditional logistic regression models and odd ratios (OR) was used to draw conclusions. It is observed that cases are more habituated to chewing habits than the controls.Further the conditional logistic regression analysis reveals that betel quid chewer faces 2.353 times more risk having breast cancer than the non-chewer with p value 0.0003 (95% CI 1.334-4.150). Though the female population in Assam usually does not smoke, the addictive habits typical to this region have equal effect on the occurrence of breast cancer.

  8. Selecting risk factors: a comparison of discriminant analysis, logistic regression and Cox's regression model using data from the Tromsø Heart Study.

    PubMed

    Brenn, T; Arnesen, E

    1985-01-01

    For comparative evaluation, discriminant analysis, logistic regression and Cox's model were used to select risk factors for total and coronary deaths among 6595 men aged 20-49 followed for 9 years. Groups with mortality between 5 and 93 per 1000 were considered. Discriminant analysis selected variable sets only marginally different from the logistic and Cox methods which always selected the same sets. A time-saving option, offered for both the logistic and Cox selection, showed no advantage compared with discriminant analysis. Analysing more than 3800 subjects, the logistic and Cox methods consumed, respectively, 80 and 10 times more computer time than discriminant analysis. When including the same set of variables in non-stepwise analyses, all methods estimated coefficients that in most cases were almost identical. In conclusion, discriminant analysis is advocated for preliminary or stepwise analysis, otherwise Cox's method should be used.

  9. Modification of the Mantel-Haenszel and Logistic Regression DIF Procedures to Incorporate the SIBTEST Regression Correction

    ERIC Educational Resources Information Center

    DeMars, Christine E.

    2009-01-01

    The Mantel-Haenszel (MH) and logistic regression (LR) differential item functioning (DIF) procedures have inflated Type I error rates when there are large mean group differences, short tests, and large sample sizes.When there are large group differences in mean score, groups matched on the observed number-correct score differ on true score,…

  10. A Genome-Wide Investigation of Autozygosity and Breast Cancer Risk

    DTIC Science & Technology

    2011-07-01

    cases than in controls, using logistic regression methods. Using genome-wide SNP data (525,000 SNPs) on 1,647 non-Hispanic white, early-onset...premenopausal breast cancer cases and 1,556 matched controls we identified over 65,000 individual RoHs and 423 genomic regions harbor RoHs for at least 10...we hypothesize that germline autozygosity is more common in breast cancer cases than in controls. More specifically, we hypothesize that there are

  11. Satellite rainfall retrieval by logistic regression

    NASA Technical Reports Server (NTRS)

    Chiu, Long S.

    1986-01-01

    The potential use of logistic regression in rainfall estimation from satellite measurements is investigated. Satellite measurements provide covariate information in terms of radiances from different remote sensors.The logistic regression technique can effectively accommodate many covariates and test their significance in the estimation. The outcome from the logistical model is the probability that the rainrate of a satellite pixel is above a certain threshold. By varying the thresholds, a rainrate histogram can be obtained, from which the mean and the variant can be estimated. A logistical model is developed and applied to rainfall data collected during GATE, using as covariates the fractional rain area and a radiance measurement which is deduced from a microwave temperature-rainrate relation. It is demonstrated that the fractional rain area is an important covariate in the model, consistent with the use of the so-called Area Time Integral in estimating total rain volume in other studies. To calibrate the logistical model, simulated rain fields generated by rainfield models with prescribed parameters are needed. A stringent test of the logistical model is its ability to recover the prescribed parameters of simulated rain fields. A rain field simulation model which preserves the fractional rain area and lognormality of rainrates as found in GATE is developed. A stochastic regression model of branching and immigration whose solutions are lognormally distributed in some asymptotic limits has also been developed.

  12. Practical Session: Logistic Regression

    NASA Astrophysics Data System (ADS)

    Clausel, M.; Grégoire, G.

    2014-12-01

    An exercise is proposed to illustrate the logistic regression. One investigates the different risk factors in the apparition of coronary heart disease. It has been proposed in Chapter 5 of the book of D.G. Kleinbaum and M. Klein, "Logistic Regression", Statistics for Biology and Health, Springer Science Business Media, LLC (2010) and also by D. Chessel and A.B. Dufour in Lyon 1 (see Sect. 6 of http://pbil.univ-lyon1.fr/R/pdf/tdr341.pdf). This example is based on data given in the file evans.txt coming from http://www.sph.emory.edu/dkleinb/logreg3.htm#data.

  13. Multinomial logistic regression modelling of obesity and overweight among primary school students in a rural area of Negeri Sembilan

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ghazali, Amirul Syafiq Mohd; Ali, Zalila; Noor, Norlida Mohd

    Multinomial logistic regression is widely used to model the outcomes of a polytomous response variable, a categorical dependent variable with more than two categories. The model assumes that the conditional mean of the dependent categorical variables is the logistic function of an affine combination of predictor variables. Its procedure gives a number of logistic regression models that make specific comparisons of the response categories. When there are q categories of the response variable, the model consists of q-1 logit equations which are fitted simultaneously. The model is validated by variable selection procedures, tests of regression coefficients, a significant test ofmore » the overall model, goodness-of-fit measures, and validation of predicted probabilities using odds ratio. This study used the multinomial logistic regression model to investigate obesity and overweight among primary school students in a rural area on the basis of their demographic profiles, lifestyles and on the diet and food intake. The results indicated that obesity and overweight of students are related to gender, religion, sleep duration, time spent on electronic games, breakfast intake in a week, with whom meals are taken, protein intake, and also, the interaction between breakfast intake in a week with sleep duration, and the interaction between gender and protein intake.« less

  14. Multinomial logistic regression modelling of obesity and overweight among primary school students in a rural area of Negeri Sembilan

    NASA Astrophysics Data System (ADS)

    Ghazali, Amirul Syafiq Mohd; Ali, Zalila; Noor, Norlida Mohd; Baharum, Adam

    2015-10-01

    Multinomial logistic regression is widely used to model the outcomes of a polytomous response variable, a categorical dependent variable with more than two categories. The model assumes that the conditional mean of the dependent categorical variables is the logistic function of an affine combination of predictor variables. Its procedure gives a number of logistic regression models that make specific comparisons of the response categories. When there are q categories of the response variable, the model consists of q-1 logit equations which are fitted simultaneously. The model is validated by variable selection procedures, tests of regression coefficients, a significant test of the overall model, goodness-of-fit measures, and validation of predicted probabilities using odds ratio. This study used the multinomial logistic regression model to investigate obesity and overweight among primary school students in a rural area on the basis of their demographic profiles, lifestyles and on the diet and food intake. The results indicated that obesity and overweight of students are related to gender, religion, sleep duration, time spent on electronic games, breakfast intake in a week, with whom meals are taken, protein intake, and also, the interaction between breakfast intake in a week with sleep duration, and the interaction between gender and protein intake.

  15. The cross-validated AUC for MCP-logistic regression with high-dimensional data.

    PubMed

    Jiang, Dingfeng; Huang, Jian; Zhang, Ying

    2013-10-01

    We propose a cross-validated area under the receiving operator characteristic (ROC) curve (CV-AUC) criterion for tuning parameter selection for penalized methods in sparse, high-dimensional logistic regression models. We use this criterion in combination with the minimax concave penalty (MCP) method for variable selection. The CV-AUC criterion is specifically designed for optimizing the classification performance for binary outcome data. To implement the proposed approach, we derive an efficient coordinate descent algorithm to compute the MCP-logistic regression solution surface. Simulation studies are conducted to evaluate the finite sample performance of the proposed method and its comparison with the existing methods including the Akaike information criterion (AIC), Bayesian information criterion (BIC) or Extended BIC (EBIC). The model selected based on the CV-AUC criterion tends to have a larger predictive AUC and smaller classification error than those with tuning parameters selected using the AIC, BIC or EBIC. We illustrate the application of the MCP-logistic regression with the CV-AUC criterion on three microarray datasets from the studies that attempt to identify genes related to cancers. Our simulation studies and data examples demonstrate that the CV-AUC is an attractive method for tuning parameter selection for penalized methods in high-dimensional logistic regression models.

  16. A case-control study evaluating relative risk factors for decompression sickness: a research report.

    PubMed

    Suzuki, Naoko; Yagishita, Kazuyosi; Togawa, Seiichiro; Okazaki, Fumihiro; Shibayama, Masaharu; Yamamoto, Kazuo; Mano, Yoshihiro

    2014-01-01

    Factors contributing to the pathogenesis of decompression sickness (DCS) in divers have been described in many studies. However, relative importance of these factors has not been reported. In this case-control study, we compared the diving profiles of divers experiencing DCS with those of a control group. The DCS group comprised 35 recreational scuba divers who were diagnosed by physicians as having DCS. The control group consisted of 324 apparently healthy recreational divers. All divers conducted their dives from 2009 to 2011. The questionnaire consisted of 33 items about an individual's diving profile, physical condition and activities before, during and just after the dive. To simplify dive parameters, the dive site was limited to Izu Osezaki. Odds ratios and multiple logistic regression were used for the analysis. Odds ratios revealed several items as dive and health factors associated with DCS. The major items were as follows: shortness of breath after heavy exercise during the dive (OR = 12.12), dehydration (OR = 10.63), and maximum dive depth > 30 msw (OR = 7.18). Results of logistic regression were similar to those by odds ratio analysis. We assessed the relative weights of the surveyed dive and health factors associated with DCS. Because results of several factors conflict with previous studies, future studies are needed.

  17. Plasma Homocysteine and Asymmetrical Dimethyl-l-Arginine (ADMA) and Whole Blood DNA Methylation in Early and Neovascular Age-Related Macular Degeneration: A Pilot Study.

    PubMed

    Pinna, Antonio; Zinellu, Angelo; Tendas, Donatella; Blasetti, Francesco; Carru, Ciriaco; Castiglia, Paolo

    2016-01-01

    To compare the plasma levels of homocysteine and asymmetrical dimethyl-l-arginine (ADMA) and the degree of whole blood DNA methylation in patients with early and neovascular age-related macular degeneration (AMD) and in controls without maculopathy of any sort. This observational case-control pilot study included 39 early AMD patients, 27 neovascular AMD patients and 132 sex- and age-matched controls without maculopathy. Plasma homocysteine and ADMA concentrations and the degree of whole blood DNA methylation were measured. Quantitative variables were compared by Student's t-test or Mann-Whitney test. Logistic regression models were used to investigate the significance of the association between early or wet AMD and some variables. There were no significant differences in mean plasma homocysteine and ADMA concentrations and in the degree of whole blood DNA methylation between patients with early or neovascular AMD and their controls. Similarly, logistic regression analysis disclosed that plasma homocysteine and ADMA levels were not associated with an increased risk for early or neovascular AMD. We failed to demonstrate an association between early or neovascular AMD and increased plasma homocysteine and/or ADMA. Results also suggest that the degree of whole blood DNA methylation is not a marker of AMD.

  18. Association between coagulation function and patients with primary angle closure glaucoma: a 5-year retrospective case-control study.

    PubMed

    Li, Shengjie; Gao, Yanting; Shao, Mingxi; Tang, Binghua; Cao, Wenjun; Sun, Xinghuai

    2017-11-04

    To evaluate the association between coagulation function and patients with primary angle closure glaucoma (PACG). A retrospective, hospital-based, case-control study. Shanghai, China. A total of 1778 subjects were recruited from the Eye & ENT Hospital of Fudan University from January 2010 to December 2015, including patients with PACG (male=296; female=569) and control subjects (male=290; female=623). Sociodemographic data and clinical data were collected. The one-way analysis of variance test was used to compare the levels of laboratory parameters among the mild, moderate and severe PACG groups. Multivariate logistic regression analyses were performed to identify the independent risk factors for PACG. The nomogram was constructed based on the logistic regression model using the R project for statistical computing (R V.3.3.2). The activated partial thromboplastin time (APTT) of the PACG group was approximately 4% shorter (p<0.001) than that of the control group. The prothrombin time (PT) was approximately 2.40% shorter (p<0.001) in patients with PACG compared with the control group. The thrombin time was also approximately 2.14% shorter (p<0.001) in patients with PACG compared with the control group. The level of D-dimer was significantly higher (p=0.042) in patients with PACG. Moreover, the mean platelet volume (MPV) of the PACG group was significantly higher (p=0.013) than that of the control group. A similar trend was observed when coagulation parameters were compared between the PACG and control groups with respect to gender and/or age. Multiple logistic regression analyses revealed that APTT (OR=1.032, 95% CI 1.000 to 1.026), PT (OR=1.249, 95% CI 1.071 to 1.457) and MPV (OR=1.185, 95% CI 1.081 to 1.299) were independently associated with PACG. Patients with PACG had a shorter coagulation time. Our results suggest that coagulation function is significantly associated with patients with PACG and may play an important role in the onset and development of PACG. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  19. Emotional Self-Efficacy and Alcohol and Tobacco Use in Adolescents

    ERIC Educational Resources Information Center

    Zullig, Keith J.; Teoli, Dac A.; Valois, Robert F.

    2014-01-01

    This study examined relationships between emotional self-efficacy (ESE) and alcohol and tobacco use in a statewide sample of public high school adolescents (n?=?2,566). The Center for Disease Control Youth Risk Behavior Survey and an adolescent ESE scale were utilized. Logistic regression analyses indicated the presence of any significant race by…

  20. Predictors of Child Molestation: Adult Attachment, Cognitive Distortions, and Empathy

    ERIC Educational Resources Information Center

    Wood, Eric; Riggs, Shelley

    2008-01-01

    A conceptual model derived from attachment theory was tested by examining adult attachment style, cognitive distortions, and both general and victim empathy in a sample of 61 paroled child molesters and 51 community controls. Results of logistic multiple regression showed that attachment anxiety, cognitive distortions, high general empathy but low…

  1. Parental Youth Assets and Sexual Activity: Differences by Race/Ethnicity

    ERIC Educational Resources Information Center

    Tolma, Eleni L.; Oman, Roy F.; Vesely, Sara K.; Aspy, Cheryl B.; Beebe, Laura; Fluhr, Janene

    2011-01-01

    Objectives: To examine how the relationship between parental-related youth assets and youth sexual activity differed by race/ethnicity. Methods: A random sample of 976 youth and their parents living in a Midwestern city participated in the study. Multivariate logistic regression analyses were conducted for 3 major ethnic groups controlling for the…

  2. Self-Reported Weight Perceptions, Dieting Behavior, and Breakfast Eating among High School Adolescents

    ERIC Educational Resources Information Center

    Zullig, Keith; Ubbes, Valerie A.; Pyle, Jennifer; Valois, Robert F.

    2006-01-01

    This study explored the relationships among weight perceptions, dieting behavior, and breakfast eating in 4597 public high school adolescents using the Centers for Disease Control and Prevention Youth Risk Behavior Survey. Adjusted multiple logistic regression models were constructed separately for race and gender groups via SUDAAN (Survey Data…

  3. Design of Embedded System for Multivariate Classification of Finger and Thumb Movements Using EEG Signals for Control of Upper Limb Prosthesis.

    PubMed

    Rashid, Nasir; Iqbal, Javaid; Javed, Amna; Tiwana, Mohsin I; Khan, Umar Shahbaz

    2018-01-01

    Brain Computer Interface (BCI) determines the intent of the user from a variety of electrophysiological signals. These signals, Slow Cortical Potentials, are recorded from scalp, and cortical neuronal activity is recorded by implanted electrodes. This paper is focused on design of an embedded system that is used to control the finger movements of an upper limb prosthesis using Electroencephalogram (EEG) signals. This is a follow-up of our previous research which explored the best method to classify three movements of fingers (thumb movement, index finger movement, and first movement). Two-stage logistic regression classifier exhibited the highest classification accuracy while Power Spectral Density (PSD) was used as a feature of the filtered signal. The EEG signal data set was recorded using a 14-channel electrode headset (a noninvasive BCI system) from right-handed, neurologically intact volunteers. Mu (commonly known as alpha waves) and Beta Rhythms (8-30 Hz) containing most of the movement data were retained through filtering using "Arduino Uno" microcontroller followed by 2-stage logistic regression to obtain a mean classification accuracy of 70%.

  4. Cannabis, tobacco and domestic fumes intake are associated with nasopharyngeal carcinoma in North Africa.

    PubMed

    Feng, B-J; Khyatti, M; Ben-Ayoub, W; Dahmoul, S; Ayad, M; Maachi, F; Bedadra, W; Abdoun, M; Mesli, S; Bakkali, H; Jalbout, M; Hamdi-Cherif, M; Boualga, K; Bouaouina, N; Chouchane, L; Benider, A; Ben-Ayed, F; Goldgar, D E; Corbex, M

    2009-10-06

    The lifestyle risk factors for nasopharyngeal carcinoma (NPC) in North Africa are not known. From 2002 to 2005, we interviewed 636 patients and 615 controls from Algeria, Morocco and Tunisia, frequency-matched by centre, age, sex, and childhood household type (urban/rural). Conditional logistic regression was used to evaluate the association of lifestyles with NPC risk, controlling for socioeconomic status and dietary risk factors. Cigarette smoking and snuff (tobacco powder with additives) intake were significantly associated with differentiated NPC but not with undifferentiated carcinoma (UCNT), which is the major histological type of NPC in these populations. As demonstrated by a stratified permutation test and by conditional logistic regression, marijuana smoking significantly elevated NPC risk independently of cigarette smoking, suggesting dissimilar carcinogenic mechanisms between cannabis and tobacco. Domestic cooking fumes intake by using kanoun (compact charcoal oven) during childhood increased NPC risk, whereas exposure during adulthood had less effect. Neither alcohol nor shisha (water pipe) was associated with risk. Tobacco, cannabis and domestic cooking fumes intake are risk factors for NPC in western North Africa.

  5. Cannabis, tobacco and domestic fumes intake are associated with nasopharyngeal carcinoma in North Africa

    PubMed Central

    Feng, B-J; Khyatti, M; Ben-Ayoub, W; Dahmoul, S; Ayad, M; Maachi, F; Bedadra, W; Abdoun, M; Mesli, S; Bakkali, H; Jalbout, M; Hamdi-Cherif, M; Boualga, K; Bouaouina, N; Chouchane, L; Benider, A; Ben-Ayed, F; Goldgar, D E; Corbex, M

    2009-01-01

    Background: The lifestyle risk factors for nasopharyngeal carcinoma (NPC) in North Africa are not known. Methods: From 2002 to 2005, we interviewed 636 patients and 615 controls from Algeria, Morocco and Tunisia, frequency-matched by centre, age, sex, and childhood household type (urban/rural). Conditional logistic regression was used to evaluate the association of lifestyles with NPC risk, controlling for socioeconomic status and dietary risk factors. Results: Cigarette smoking and snuff (tobacco powder with additives) intake were significantly associated with differentiated NPC but not with undifferentiated carcinoma (UCNT), which is the major histological type of NPC in these populations. As demonstrated by a stratified permutation test and by conditional logistic regression, marijuana smoking significantly elevated NPC risk independently of cigarette smoking, suggesting dissimilar carcinogenic mechanisms between cannabis and tobacco. Domestic cooking fumes intake by using kanoun (compact charcoal oven) during childhood increased NPC risk, whereas exposure during adulthood had less effect. Neither alcohol nor shisha (water pipe) was associated with risk. Conclusion: Tobacco, cannabis and domestic cooking fumes intake are risk factors for NPC in western North Africa. PMID:19724280

  6. Myxomatosis in wild rabbit: design of control programs in Mediterranean ecosystems.

    PubMed

    García-Bocanegra, Ignacio; Astorga, Rafael Jesús; Napp, Sebastián; Casal, Jordi; Huerta, Belén; Borge, Carmen; Arenas, Antonio

    2010-01-01

    A cross-sectional study was carried out in natural wild rabbit (Oryctolagus cuniculus) populations from southern Spain to identify risk factors associated to myxoma virus infection. Blood samples from 619 wild rabbits were collected, and questionnaires which included variables related to host, disease, game management and environment were completed. A logistic regression analysis was conducted to investigate the associations between myxomatosis seropositivity (dependent variable) across 7 hunting estates and an extensive set of explanatory variables obtained from the questionnaires. The prevalence of antibodies against myxomatosis virus was 56.4% (95% CI: 52.5-60.3) and ranged between 21.4% (95% CI: 9.0-33.8) and 70.2% (95% CI: 58.3-82.1) among the different sampling areas. The logistic regression analysis showed that autumn (OR 9.0), high abundance of mosquitoes (OR 8.2), reproductive activity (OR 4.1), warren's insecticide treatment (OR 3.7), rabbit haemorrhagic disease (RHD) seropositivity (OR 2.6), high hunting pressure (OR 6.3) and sheep presence (OR 6.4) were associated with seropositivity to myxomatosis. Based on the results, diverse management measures for myxomatosis control are proposed.

  7. A simple approach to power and sample size calculations in logistic regression and Cox regression models.

    PubMed

    Vaeth, Michael; Skovlund, Eva

    2004-06-15

    For a given regression problem it is possible to identify a suitably defined equivalent two-sample problem such that the power or sample size obtained for the two-sample problem also applies to the regression problem. For a standard linear regression model the equivalent two-sample problem is easily identified, but for generalized linear models and for Cox regression models the situation is more complicated. An approximately equivalent two-sample problem may, however, also be identified here. In particular, we show that for logistic regression and Cox regression models the equivalent two-sample problem is obtained by selecting two equally sized samples for which the parameters differ by a value equal to the slope times twice the standard deviation of the independent variable and further requiring that the overall expected number of events is unchanged. In a simulation study we examine the validity of this approach to power calculations in logistic regression and Cox regression models. Several different covariate distributions are considered for selected values of the overall response probability and a range of alternatives. For the Cox regression model we consider both constant and non-constant hazard rates. The results show that in general the approach is remarkably accurate even in relatively small samples. Some discrepancies are, however, found in small samples with few events and a highly skewed covariate distribution. Comparison with results based on alternative methods for logistic regression models with a single continuous covariate indicates that the proposed method is at least as good as its competitors. The method is easy to implement and therefore provides a simple way to extend the range of problems that can be covered by the usual formulas for power and sample size determination. Copyright 2004 John Wiley & Sons, Ltd.

  8. Factors associated with obstructive sleep apnea among commercial motor vehicle drivers.

    PubMed

    Xie, Wen; Chakrabarty, Sangita; Levine, Robert; Johnson, Roy; Talmage, James B

    2011-02-01

    Identify factors associated with obstructive sleep apnea (OSA) risk during commercial driver medical examinations. A case-control study was conducted at an occupational health clinic by reviewing the commercial driver medical examinations medical records performed from January 2007 to December 2008. The magnitude of association with OSA was estimated with logistic regression. Among 1890 commercial motor vehicle drivers, 51 were confirmed positive for OSA by polysomnography after initial screening by Joint Task Force guidelines, yielding estimated positive predictive values of 78.5% for the screening criteria. Multivariable logistic regression showed that body mass index ≥ 30 (odds ratio: 26.86), hypertension (odds ratio: 2.57), and diabetes (odds ratio: 2.03) were independently associated with OSA. Medical examiners' use of objectively measurable risk factors, such as obesity, history of hypertension, and/or diabetes, rather than symptoms, may be more effective in identifying undiagnosed OSA in commercial drivers during the commercial driver medical examinations.

  9. Testing a model of research intention among U.K. clinical psychologists: a logistic regression analysis.

    PubMed

    Eke, Gemma; Holttum, Sue; Hayward, Mark

    2012-03-01

    Previous research highlights barriers to clinical psychologists conducting research, but has rarely examined U.K. clinical psychologists. The study investigated U.K. clinical psychologists' self-reported research output and tested part of a theoretical model of factors influencing their intention to conduct research. Questionnaires were mailed to 1,300 U.K. clinical psychologists. Three hundred and seventy-four questionnaires were returned (29% response-rate). This study replicated in a U.K. sample the finding that the modal number of publications was zero, highlighted in a number of U.K. and U.S. studies. Research intention was bimodally distributed, and logistic regression classified 78% of cases successfully. Outcome expectations, perceived behavioral control and normative beliefs mediated between research training environment and intention. Further research should explore how research is negotiated in clinical roles, and this issue should be incorporated into prequalification training. © 2012 Wiley Periodicals, Inc.

  10. Cancer prevalence and education by cancer site: logistic regression analysis.

    PubMed

    Johnson, Stephanie; Corsten, Martin J; McDonald, James T; Gupta, Michael

    2010-10-01

    Previously, using the American National Health Interview Survey (NHIS) and a logistic regression analysis, we found that upper aerodigestive tract (UADT) cancer is correlated with low socioeconomic status (SES). The objective of this study was to determine if this correlation between low SES and cancer prevalence exists for other cancers. We again used the NHIS and employed education level as our main measure of SES. We controlled for potentially confounding factors, including smoking status and alcohol consumption. We found that only two cancer subsites shared the pattern of increased prevalence with low education level and decreased prevalence with high education level: UADT cancer and cervical cancer. UADT cancer and cervical cancer were the only two cancers identified that had a link between prevalence and lower education level. This raises the possibility that an associated risk factor for the two cancers is causing the relationship between lower education level and prevalence.

  11. Nonconvex Sparse Logistic Regression With Weakly Convex Regularization

    NASA Astrophysics Data System (ADS)

    Shen, Xinyue; Gu, Yuantao

    2018-06-01

    In this work we propose to fit a sparse logistic regression model by a weakly convex regularized nonconvex optimization problem. The idea is based on the finding that a weakly convex function as an approximation of the $\\ell_0$ pseudo norm is able to better induce sparsity than the commonly used $\\ell_1$ norm. For a class of weakly convex sparsity inducing functions, we prove the nonconvexity of the corresponding sparse logistic regression problem, and study its local optimality conditions and the choice of the regularization parameter to exclude trivial solutions. Despite the nonconvexity, a method based on proximal gradient descent is used to solve the general weakly convex sparse logistic regression, and its convergence behavior is studied theoretically. Then the general framework is applied to a specific weakly convex function, and a necessary and sufficient local optimality condition is provided. The solution method is instantiated in this case as an iterative firm-shrinkage algorithm, and its effectiveness is demonstrated in numerical experiments by both randomly generated and real datasets.

  12. A comparative study on entrepreneurial attitudes modeled with logistic regression and Bayes nets.

    PubMed

    López Puga, Jorge; García García, Juan

    2012-11-01

    Entrepreneurship research is receiving increasing attention in our context, as entrepreneurs are key social agents involved in economic development. We compare the success of the dichotomic logistic regression model and the Bayes simple classifier to predict entrepreneurship, after manipulating the percentage of missing data and the level of categorization in predictors. A sample of undergraduate university students (N = 1230) completed five scales (motivation, attitude towards business creation, obstacles, deficiencies, and training needs) and we found that each of them predicted different aspects of the tendency to business creation. Additionally, our results show that the receiver operating characteristic (ROC) curve is affected by the rate of missing data in both techniques, but logistic regression seems to be more vulnerable when faced with missing data, whereas Bayes nets underperform slightly when categorization has been manipulated. Our study sheds light on the potential entrepreneur profile and we propose to use Bayesian networks as an additional alternative to overcome the weaknesses of logistic regression when missing data are present in applied research.

  13. Comparison of cranial sex determination by discriminant analysis and logistic regression.

    PubMed

    Amores-Ampuero, Anabel; Alemán, Inmaculada

    2016-04-05

    Various methods have been proposed for estimating dimorphism. The objective of this study was to compare sex determination results from cranial measurements using discriminant analysis or logistic regression. The study sample comprised 130 individuals (70 males) of known sex, age, and cause of death from San José cemetery in Granada (Spain). Measurements of 19 neurocranial dimensions and 11 splanchnocranial dimensions were subjected to discriminant analysis and logistic regression, and the percentages of correct classification were compared between the sex functions obtained with each method. The discriminant capacity of the selected variables was evaluated with a cross-validation procedure. The percentage accuracy with discriminant analysis was 78.2% for the neurocranium (82.4% in females and 74.6% in males) and 73.7% for the splanchnocranium (79.6% in females and 68.8% in males). These percentages were higher with logistic regression analysis: 85.7% for the neurocranium (in both sexes) and 94.1% for the splanchnocranium (100% in females and 91.7% in males).

  14. [Risk factors for surgical site infections in patients undergoing craniotomy].

    PubMed

    Cha, Kyeong-Sook; Cho, Ok-Hee; Yoo, So-Yeon

    2010-04-01

    The objectives of this study were to determine the prevalence, incidence, and risk factors for postoperative surgical site infections (SSIs) after craniotomy. This study was a retrospective case-control study of 103 patients who had craniotomies between March 2007 and December 2008. A retrospective review of prospectively collected databases of consecutive patients who underwent craniotomy was done. SSIs were defined by using the Centers for Disease Control criteria. Twenty-six cases (infection) and 77 controls (no infection) were matched for age, gender and time of surgery. Descriptive analysis, t-test, X(2)-test and logistic regression analyses were used for data analysis. The statistical difference between cases and controls was significant for hospital length of stay (>14 days), intensive care unit stay more than 15 days, Glasgrow Coma Scale (GCS) score (< or = 7 days), extra-ventricular drainage and coexistent infection. Risk factors were identified by logistic regression and included hospital length of stay of more than 14 days (odds ratio [OR]=23.39, 95% confidence interval [CI]=2.53-216.11) and GCS score (< or = 7 scores) (OR=4.71, 95% CI=1.64-13.50). The results of this study show that patients are at high risk for infection when they have a low level of consciousness or their length hospital stay is long term. Nurses have to take an active and continuous approach to infection control to help with patients having these risk factors.

  15. Constructive thinking, rational intelligence and irritable bowel syndrome

    PubMed Central

    Rey, Enrique; Ortega, Marta Moreno; Alonso, Monica Olga Garcia; Diaz-Rubio, Manuel

    2009-01-01

    AIM: To evaluate rational and experiential intelligence in irritable bowel syndrome (IBS) sufferers. METHODS: We recruited 100 subjects with IBS as per Rome II criteria (50 consulters and 50 non-consulters) and 100 healthy controls, matched by age, sex and educational level. Cases and controls completed a clinical questionnaire (including symptom characteristics and medical consultation) and the following tests: rational-intelligence (Wechsler Adult Intelligence Scale, 3rd edition); experiential-intelligence (Constructive Thinking Inventory); personality (NEO personality inventory); psychopathology (MMPI-2), anxiety (state-trait anxiety inventory) and life events (social readjustment rating scale). Analysis of variance was used to compare the test results of IBS-sufferers and controls, and a logistic regression model was then constructed and adjusted for age, sex and educational level to evaluate any possible association with IBS. RESULTS: No differences were found between IBS cases and controls in terms of IQ (102.0 ± 10.8 vs 102.8 ± 12.6), but IBS sufferers scored significantly lower in global constructive thinking (43.7 ± 9.4 vs 49.6 ± 9.7). In the logistic regression model, global constructive thinking score was independently linked to suffering from IBS [OR 0.92 (0.87-0.97)], without significant OR for total IQ. CONCLUSION: IBS subjects do not show lower rational intelligence than controls, but lower experiential intelligence is nevertheless associated with IBS. PMID:19575489

  16. Stepwise Distributed Open Innovation Contests for Software Development: Acceleration of Genome-Wide Association Analysis

    PubMed Central

    Hill, Andrew; Loh, Po-Ru; Bharadwaj, Ragu B.; Pons, Pascal; Shang, Jingbo; Guinan, Eva; Lakhani, Karim; Kilty, Iain

    2017-01-01

    Abstract Background: The association of differing genotypes with disease-related phenotypic traits offers great potential to both help identify new therapeutic targets and support stratification of patients who would gain the greatest benefit from specific drug classes. Development of low-cost genotyping and sequencing has made collecting large-scale genotyping data routine in population and therapeutic intervention studies. In addition, a range of new technologies is being used to capture numerous new and complex phenotypic descriptors. As a result, genotype and phenotype datasets have grown exponentially. Genome-wide association studies associate genotypes and phenotypes using methods such as logistic regression. As existing tools for association analysis limit the efficiency by which value can be extracted from increasing volumes of data, there is a pressing need for new software tools that can accelerate association analyses on large genotype-phenotype datasets. Results: Using open innovation (OI) and contest-based crowdsourcing, the logistic regression analysis in a leading, community-standard genetics software package (PLINK 1.07) was substantially accelerated. OI allowed us to do this in <6 months by providing rapid access to highly skilled programmers with specialized, difficult-to-find skill sets. Through a crowd-based contest a combination of computational, numeric, and algorithmic approaches was identified that accelerated the logistic regression in PLINK 1.07 by 18- to 45-fold. Combining contest-derived logistic regression code with coarse-grained parallelization, multithreading, and associated changes to data initialization code further developed through distributed innovation, we achieved an end-to-end speedup of 591-fold for a data set size of 6678 subjects by 645 863 variants, compared to PLINK 1.07's logistic regression. This represents a reduction in run time from 4.8 hours to 29 seconds. Accelerated logistic regression code developed in this project has been incorporated into the PLINK2 project. Conclusions: Using iterative competition-based OI, we have developed a new, faster implementation of logistic regression for genome-wide association studies analysis. We present lessons learned and recommendations on running a successful OI process for bioinformatics. PMID:28327993

  17. Stepwise Distributed Open Innovation Contests for Software Development: Acceleration of Genome-Wide Association Analysis.

    PubMed

    Hill, Andrew; Loh, Po-Ru; Bharadwaj, Ragu B; Pons, Pascal; Shang, Jingbo; Guinan, Eva; Lakhani, Karim; Kilty, Iain; Jelinsky, Scott A

    2017-05-01

    The association of differing genotypes with disease-related phenotypic traits offers great potential to both help identify new therapeutic targets and support stratification of patients who would gain the greatest benefit from specific drug classes. Development of low-cost genotyping and sequencing has made collecting large-scale genotyping data routine in population and therapeutic intervention studies. In addition, a range of new technologies is being used to capture numerous new and complex phenotypic descriptors. As a result, genotype and phenotype datasets have grown exponentially. Genome-wide association studies associate genotypes and phenotypes using methods such as logistic regression. As existing tools for association analysis limit the efficiency by which value can be extracted from increasing volumes of data, there is a pressing need for new software tools that can accelerate association analyses on large genotype-phenotype datasets. Using open innovation (OI) and contest-based crowdsourcing, the logistic regression analysis in a leading, community-standard genetics software package (PLINK 1.07) was substantially accelerated. OI allowed us to do this in <6 months by providing rapid access to highly skilled programmers with specialized, difficult-to-find skill sets. Through a crowd-based contest a combination of computational, numeric, and algorithmic approaches was identified that accelerated the logistic regression in PLINK 1.07 by 18- to 45-fold. Combining contest-derived logistic regression code with coarse-grained parallelization, multithreading, and associated changes to data initialization code further developed through distributed innovation, we achieved an end-to-end speedup of 591-fold for a data set size of 6678 subjects by 645 863 variants, compared to PLINK 1.07's logistic regression. This represents a reduction in run time from 4.8 hours to 29 seconds. Accelerated logistic regression code developed in this project has been incorporated into the PLINK2 project. Using iterative competition-based OI, we have developed a new, faster implementation of logistic regression for genome-wide association studies analysis. We present lessons learned and recommendations on running a successful OI process for bioinformatics. © The Author 2017. Published by Oxford University Press.

  18. Easy and low-cost identification of metabolic syndrome in patients treated with second-generation antipsychotics: artificial neural network and logistic regression models.

    PubMed

    Lin, Chao-Cheng; Bai, Ya-Mei; Chen, Jen-Yeu; Hwang, Tzung-Jeng; Chen, Tzu-Ting; Chiu, Hung-Wen; Li, Yu-Chuan

    2010-03-01

    Metabolic syndrome (MetS) is an important side effect of second-generation antipsychotics (SGAs). However, many SGA-treated patients with MetS remain undetected. In this study, we trained and validated artificial neural network (ANN) and multiple logistic regression models without biochemical parameters to rapidly identify MetS in patients with SGA treatment. A total of 383 patients with a diagnosis of schizophrenia or schizoaffective disorder (DSM-IV criteria) with SGA treatment for more than 6 months were investigated to determine whether they met the MetS criteria according to the International Diabetes Federation. The data for these patients were collected between March 2005 and September 2005. The input variables of ANN and logistic regression were limited to demographic and anthropometric data only. All models were trained by randomly selecting two-thirds of the patient data and were internally validated with the remaining one-third of the data. The models were then externally validated with data from 69 patients from another hospital, collected between March 2008 and June 2008. The area under the receiver operating characteristic curve (AUC) was used to measure the performance of all models. Both the final ANN and logistic regression models had high accuracy (88.3% vs 83.6%), sensitivity (93.1% vs 86.2%), and specificity (86.9% vs 83.8%) to identify MetS in the internal validation set. The mean +/- SD AUC was high for both the ANN and logistic regression models (0.934 +/- 0.033 vs 0.922 +/- 0.035, P = .63). During external validation, high AUC was still obtained for both models. Waist circumference and diastolic blood pressure were the common variables that were left in the final ANN and logistic regression models. Our study developed accurate ANN and logistic regression models to detect MetS in patients with SGA treatment. The models are likely to provide a noninvasive tool for large-scale screening of MetS in this group of patients. (c) 2010 Physicians Postgraduate Press, Inc.

  19. Deletion Diagnostics for Alternating Logistic Regressions

    PubMed Central

    Preisser, John S.; By, Kunthel; Perin, Jamie; Qaqish, Bahjat F.

    2013-01-01

    Deletion diagnostics are introduced for the regression analysis of clustered binary outcomes estimated with alternating logistic regressions, an implementation of generalized estimating equations (GEE) that estimates regression coefficients in a marginal mean model and in a model for the intracluster association given by the log odds ratio. The diagnostics are developed within an estimating equations framework that recasts the estimating functions for association parameters based upon conditional residuals into equivalent functions based upon marginal residuals. Extensions of earlier work on GEE diagnostics follow directly, including computational formulae for one-step deletion diagnostics that measure the influence of a cluster of observations on the estimated regression parameters and on the overall marginal mean or association model fit. The diagnostic formulae are evaluated with simulations studies and with an application concerning an assessment of factors associated with health maintenance visits in primary care medical practices. The application and the simulations demonstrate that the proposed cluster-deletion diagnostics for alternating logistic regressions are good approximations of their exact fully iterated counterparts. PMID:22777960

  20. Fusing Data Mining, Machine Learning and Traditional Statistics to Detect Biomarkers Associated with Depression

    PubMed Central

    Dipnall, Joanna F.

    2016-01-01

    Background Atheoretical large-scale data mining techniques using machine learning algorithms have promise in the analysis of large epidemiological datasets. This study illustrates the use of a hybrid methodology for variable selection that took account of missing data and complex survey design to identify key biomarkers associated with depression from a large epidemiological study. Methods The study used a three-step methodology amalgamating multiple imputation, a machine learning boosted regression algorithm and logistic regression, to identify key biomarkers associated with depression in the National Health and Nutrition Examination Study (2009–2010). Depression was measured using the Patient Health Questionnaire-9 and 67 biomarkers were analysed. Covariates in this study included gender, age, race, smoking, food security, Poverty Income Ratio, Body Mass Index, physical activity, alcohol use, medical conditions and medications. The final imputed weighted multiple logistic regression model included possible confounders and moderators. Results After the creation of 20 imputation data sets from multiple chained regression sequences, machine learning boosted regression initially identified 21 biomarkers associated with depression. Using traditional logistic regression methods, including controlling for possible confounders and moderators, a final set of three biomarkers were selected. The final three biomarkers from the novel hybrid variable selection methodology were red cell distribution width (OR 1.15; 95% CI 1.01, 1.30), serum glucose (OR 1.01; 95% CI 1.00, 1.01) and total bilirubin (OR 0.12; 95% CI 0.05, 0.28). Significant interactions were found between total bilirubin with Mexican American/Hispanic group (p = 0.016), and current smokers (p<0.001). Conclusion The systematic use of a hybrid methodology for variable selection, fusing data mining techniques using a machine learning algorithm with traditional statistical modelling, accounted for missing data and complex survey sampling methodology and was demonstrated to be a useful tool for detecting three biomarkers associated with depression for future hypothesis generation: red cell distribution width, serum glucose and total bilirubin. PMID:26848571

  1. Fusing Data Mining, Machine Learning and Traditional Statistics to Detect Biomarkers Associated with Depression.

    PubMed

    Dipnall, Joanna F; Pasco, Julie A; Berk, Michael; Williams, Lana J; Dodd, Seetal; Jacka, Felice N; Meyer, Denny

    2016-01-01

    Atheoretical large-scale data mining techniques using machine learning algorithms have promise in the analysis of large epidemiological datasets. This study illustrates the use of a hybrid methodology for variable selection that took account of missing data and complex survey design to identify key biomarkers associated with depression from a large epidemiological study. The study used a three-step methodology amalgamating multiple imputation, a machine learning boosted regression algorithm and logistic regression, to identify key biomarkers associated with depression in the National Health and Nutrition Examination Study (2009-2010). Depression was measured using the Patient Health Questionnaire-9 and 67 biomarkers were analysed. Covariates in this study included gender, age, race, smoking, food security, Poverty Income Ratio, Body Mass Index, physical activity, alcohol use, medical conditions and medications. The final imputed weighted multiple logistic regression model included possible confounders and moderators. After the creation of 20 imputation data sets from multiple chained regression sequences, machine learning boosted regression initially identified 21 biomarkers associated with depression. Using traditional logistic regression methods, including controlling for possible confounders and moderators, a final set of three biomarkers were selected. The final three biomarkers from the novel hybrid variable selection methodology were red cell distribution width (OR 1.15; 95% CI 1.01, 1.30), serum glucose (OR 1.01; 95% CI 1.00, 1.01) and total bilirubin (OR 0.12; 95% CI 0.05, 0.28). Significant interactions were found between total bilirubin with Mexican American/Hispanic group (p = 0.016), and current smokers (p<0.001). The systematic use of a hybrid methodology for variable selection, fusing data mining techniques using a machine learning algorithm with traditional statistical modelling, accounted for missing data and complex survey sampling methodology and was demonstrated to be a useful tool for detecting three biomarkers associated with depression for future hypothesis generation: red cell distribution width, serum glucose and total bilirubin.

  2. Evaluating risk factors for endemic human Salmonella Enteritidis infections with different phage types in Ontario, Canada using multinomial logistic regression and a case-case study approach

    PubMed Central

    2012-01-01

    Background Identifying risk factors for Salmonella Enteritidis (SE) infections in Ontario will assist public health authorities to design effective control and prevention programs to reduce the burden of SE infections. Our research objective was to identify risk factors for acquiring SE infections with various phage types (PT) in Ontario, Canada. We hypothesized that certain PTs (e.g., PT8 and PT13a) have specific risk factors for infection. Methods Our study included endemic SE cases with various PTs whose isolates were submitted to the Public Health Laboratory-Toronto from January 20th to August 12th, 2011. Cases were interviewed using a standardized questionnaire that included questions pertaining to demographics, travel history, clinical symptoms, contact with animals, and food exposures. A multinomial logistic regression method using the Generalized Linear Latent and Mixed Model procedure and a case-case study design were used to identify risk factors for acquiring SE infections with various PTs in Ontario, Canada. In the multinomial logistic regression model, the outcome variable had three categories representing human infections caused by SE PT8, PT13a, and all other SE PTs (i.e., non-PT8/non-PT13a) as a referent category to which the other two categories were compared. Results In the multivariable model, SE PT8 was positively associated with contact with dogs (OR=2.17, 95% CI 1.01-4.68) and negatively associated with pepper consumption (OR=0.35, 95% CI 0.13-0.94), after adjusting for age categories and gender, and using exposure periods and health regions as random effects to account for clustering. Conclusions Our study findings offer interesting hypotheses about the role of phage type-specific risk factors. Multinomial logistic regression analysis and the case-case study approach are novel methodologies to evaluate associations among SE infections with different PTs and various risk factors. PMID:23057531

  3. Estimating interaction on an additive scale between continuous determinants in a logistic regression model.

    PubMed

    Knol, Mirjam J; van der Tweel, Ingeborg; Grobbee, Diederick E; Numans, Mattijs E; Geerlings, Mirjam I

    2007-10-01

    To determine the presence of interaction in epidemiologic research, typically a product term is added to the regression model. In linear regression, the regression coefficient of the product term reflects interaction as departure from additivity. However, in logistic regression it refers to interaction as departure from multiplicativity. Rothman has argued that interaction estimated as departure from additivity better reflects biologic interaction. So far, literature on estimating interaction on an additive scale using logistic regression only focused on dichotomous determinants. The objective of the present study was to provide the methods to estimate interaction between continuous determinants and to illustrate these methods with a clinical example. and results From the existing literature we derived the formulas to quantify interaction as departure from additivity between one continuous and one dichotomous determinant and between two continuous determinants using logistic regression. Bootstrapping was used to calculate the corresponding confidence intervals. To illustrate the theory with an empirical example, data from the Utrecht Health Project were used, with age and body mass index as risk factors for elevated diastolic blood pressure. The methods and formulas presented in this article are intended to assist epidemiologists to calculate interaction on an additive scale between two variables on a certain outcome. The proposed methods are included in a spreadsheet which is freely available at: http://www.juliuscenter.nl/additive-interaction.xls.

  4. Logits and Tigers and Bears, Oh My! A Brief Look at the Simple Math of Logistic Regression and How It Can Improve Dissemination of Results

    ERIC Educational Resources Information Center

    Osborne, Jason W.

    2012-01-01

    Logistic regression is slowly gaining acceptance in the social sciences, and fills an important niche in the researcher's toolkit: being able to predict important outcomes that are not continuous in nature. While OLS regression is a valuable tool, it cannot routinely be used to predict outcomes that are binary or categorical in nature. These…

  5. Accounting for center in the Early External Cephalic Version trials: an empirical comparison of statistical methods to adjust for center in a multicenter trial with binary outcomes.

    PubMed

    Reitsma, Angela; Chu, Rong; Thorpe, Julia; McDonald, Sarah; Thabane, Lehana; Hutton, Eileen

    2014-09-26

    Clustering of outcomes at centers involved in multicenter trials is a type of center effect. The Consolidated Standards of Reporting Trials Statement recommends that multicenter randomized controlled trials (RCTs) should account for center effects in their analysis, however most do not. The Early External Cephalic Version (EECV) trials published in 2003 and 2011 stratified by center at randomization, but did not account for center in the analyses, and due to the nature of the intervention and number of centers, may have been prone to center effects. Using data from the EECV trials, we undertook an empirical study to compare various statistical approaches to account for center effect while estimating the impact of external cephalic version timing (early or delayed) on the outcomes of cesarean section, preterm birth, and non-cephalic presentation at the time of birth. The data from the EECV pilot trial and the EECV2 trial were merged into one dataset. Fisher's exact method was used to test the overall effect of external cephalic version timing unadjusted for center effects. Seven statistical models that accounted for center effects were applied to the data. The models included: i) the Mantel-Haenszel test, ii) logistic regression with fixed center effect and fixed treatment effect, iii) center-size weighted and iv) un-weighted logistic regression with fixed center effect and fixed treatment-by-center interaction, iv) logistic regression with random center effect and fixed treatment effect, v) logistic regression with random center effect and random treatment-by-center interaction, and vi) generalized estimating equations. For each of the three outcomes of interest approaches to account for center effect did not alter the overall findings of the trial. The results were similar for the majority of the methods used to adjust for center, illustrating the robustness of the findings. Despite literature that suggests center effect can change the estimate of effect in multicenter trials, this empirical study does not show a difference in the outcomes of the EECV trials when accounting for center effect. The EECV2 trial was registered on 30 July 30 2005 with Current Controlled Trials: ISRCTN 56498577.

  6. Elevated Fasting Blood Glucose Is Predictive of Poor Outcome in Non-Diabetic Stroke Patients: A Sub-Group Analysis of SMART.

    PubMed

    Yao, Ming; Ni, Jun; Zhou, Lixin; Peng, Bin; Zhu, Yicheng; Cui, Liying

    2016-01-01

    Although increasing evidence suggests that hyperglycemia following acute stroke adversely affects clinical outcome, whether the association between glycaemia and functional outcome varies between stroke patients with\\without pre-diagnosed diabetes remains controversial. We aimed to investigate the relationship between the fasting blood glucose (FBG) and the 6-month functional outcome in a subgroup of SMART cohort and further to assess whether this association varied based on the status of pre-diagnosed diabetes. Data of 2862 patients with acute ischemic stroke (629 with pre-diagnosed diabetics) enrolled from SMART cohort were analyzed. Functional outcome at 6-month post-stroke was measured by modified Rankin Scale (mRS) and categorized as favorable (mRS:0-2) or poor (mRS:3-5). Binary logistic regression model, adjusting for age, gender, educational level, history of hypertension and stroke, baseline NIHSS and treatment group, was used in the whole cohort to evaluate the association between admission FBG and functional outcome. Stratified logistic regression analyses were further performed based on the presence/absence of pre-diabetes history. In the whole cohort, multivariable logistical regression showed that poor functional outcome was associated with elevated FBG (OR1.21 (95%CI 1.07-1.37), p = 0.002), older age (OR1.64 (95% CI1.38-1.94), p<0.001), higher NIHSS (OR2.90 (95%CI 2.52-3.33), p<0.001) and hypertension (OR1.42 (95%CI 1.13-1.98), p = 0.04). Stratified logistical regression analysis showed that the association between FBG and functional outcome remained significant only in patients without pre-diagnosed diabetes (OR1.26 (95%CI 1.03-1.55), p = 0.023), but not in those with premorbid diagnosis of diabetes (p = 0.885). The present results demonstrate a significant association between elevated FBG after stroke and poor functional outcome in patients without pre-diagnosed diabetes, but not in diabetics. This finding confirms the importance of glycemic control during acute phase of ischemic stroke especially in patients without pre-diagnosed diabetes. Further investigation for developing optimal strategies to control blood glucose level in hyperglycemic setting is therefore of great importance. ClinicalTrials.gov NCT00664846.

  7. Intermediate and advanced topics in multilevel logistic regression analysis

    PubMed Central

    Merlo, Juan

    2017-01-01

    Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher‐level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within‐cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population‐average effect of covariates measured at the subject and cluster level, in contrast to the within‐cluster or cluster‐specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster‐level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. PMID:28543517

  8. Intermediate and advanced topics in multilevel logistic regression analysis.

    PubMed

    Austin, Peter C; Merlo, Juan

    2017-09-10

    Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher-level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within-cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population-average effect of covariates measured at the subject and cluster level, in contrast to the within-cluster or cluster-specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster-level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.

  9. Relationship between Type of Trauma Exposure and Posttraumatic Stress Disorder among Urban Children and Adolescents

    ERIC Educational Resources Information Center

    Luthra, Rohini; Abramovitz, Robert; Greenberg, Rick; Schoor, Alan; Newcorn, Jeffrey; Schmeidler, James; Levine, Paul; Nomura, Yoko; Chemtob, Claude M.

    2009-01-01

    This study examines the association between trauma exposure and posttraumatic stress disorder (PTSD) among 157 help-seeking children (aged 8-17). Structured clinical interviews are carried out, and linear and logistic regression analyses are conducted to examine the relationship between PTSD and type of trauma exposure controlling for age, gender,…

  10. Predicting Social Trust with Binary Logistic Regression

    ERIC Educational Resources Information Center

    Adwere-Boamah, Joseph; Hufstedler, Shirley

    2015-01-01

    This study used binary logistic regression to predict social trust with five demographic variables from a national sample of adult individuals who participated in The General Social Survey (GSS) in 2012. The five predictor variables were respondents' highest degree earned, race, sex, general happiness and the importance of personally assisting…

  11. Examining the Link Between Public Transit Use and Active Commuting

    PubMed Central

    Bopp, Melissa; Gayah, Vikash V.; Campbell, Matthew E.

    2015-01-01

    Background: An established relationship exists between public transportation (PT) use and physical activity. However, there is limited literature that examines the link between PT use and active commuting (AC) behavior. This study examines this link to determine if PT users commute more by active modes. Methods: A volunteer, convenience sample of adults (n = 748) completed an online survey about AC/PT patterns, demographic, psychosocial, community and environmental factors. t-test compared differences between PT riders and non-PT riders. Binary logistic regression analyses examined the effect of multiple factors on AC and a full logistic regression model was conducted to examine AC. Results: Non-PT riders (n = 596) reported less AC than PT riders. There were several significant relationships with AC for demographic, interpersonal, worksite, community and environmental factors when considering PT use. The logistic multivariate analysis for included age, number of children and perceived distance to work as negative predictors and PT use, feelings of bad weather and lack of on-street bike lanes as a barrier to AC, perceived behavioral control and spouse AC were positive predictors. Conclusions: This study revealed the complex relationship between AC and PT use. Further research should investigate how AC and public transit use are related. PMID:25898405

  12. Examining the link between public transit use and active commuting.

    PubMed

    Bopp, Melissa; Gayah, Vikash V; Campbell, Matthew E

    2015-04-17

    An established relationship exists between public transportation (PT) use and physical activity. However, there is limited literature that examines the link between PT use and active commuting (AC) behavior. This study examines this link to determine if PT users commute more by active modes. A volunteer, convenience sample of adults (n = 748) completed an online survey about AC/PT patterns, demographic, psychosocial, community and environmental factors. t-test compared differences between PT riders and non-PT riders. Binary logistic regression analyses examined the effect of multiple factors on AC and a full logistic regression model was conducted to examine AC. Non-PT riders (n = 596) reported less AC than PT riders. There were several significant relationships with AC for demographic, interpersonal, worksite, community and environmental factors when considering PT use. The logistic multivariate analysis for included age, number of children and perceived distance to work as negative predictors and PT use, feelings of bad weather and lack of on-street bike lanes as a barrier to AC, perceived behavioral control and spouse AC were positive predictors. This study revealed the complex relationship between AC and PT use. Further research should investigate how AC and public transit use are related.

  13. Atrial Fibrillation Management Strategies in Routine Clinical Practice: Insights from the International RealiseAF Survey

    PubMed Central

    Chiang, Chern-En; Naditch-Brûlé, Lisa; Brette, Sandrine; Silva-Cardoso, José; Gamra, Habib; Murin, Jan; Zharinov, Oleg J.; Steg, Philippe Gabriel

    2016-01-01

    Background Atrial fibrillation (AF) can be managed with rhythm- or rate-control strategies. There are few data from routine clinical practice on the frequency with which each strategy is used and their correlates in terms of patients’ clinical characteristics, AF control, and symptom burden. Methods RealiseAF was an international, cross-sectional, observational survey of 11,198 patients with AF. The aim of this analysis was to describe patient profiles and symptoms according to the AF management strategy used. A multivariate logistic regression identified factors associated with AF management strategy at the end of the visit. Results Among 10,497 eligible patients, 53.7% used a rate-control strategy, compared with 34.5% who used a rhythm-control strategy. In 11.8% of patients, no clear strategy was stated. The proportion of patients with AF-related symptoms (EHRA Class > = II) was 78.1% (n = 4396/5630) for those using a rate-control strategy vs. 67.8% for those using a rhythm-control strategy (p<0.001). Multivariate logistic regression analysis revealed that age <75 years or the paroxysmal or persistent form of AF favored the choice of a rhythm-control strategy. A change in strategy was infrequent, even in patients with European Heart Rhythm Association (EHRA) Class > = II. Conclusions In the RealiseAF routine clinical practice survey, rate control was more commonly used than rhythm control, and a change in strategy was uncommon, even in symptomatic patients. In almost 12% of patients, no clear strategy was stated. Physician awareness regarding optimal management strategies for AF may be improved. PMID:26800084

  14. Quantitative evaluation of infection control models in the prevention of nosocomial transmission of SARS virus to healthcare workers: implication to nosocomial viral infection control for healthcare workers.

    PubMed

    Yen, Muh-Yong; Lu, Yun-Ching; Huang, Pi-Hsiang; Chen, Chen-Ming; Chen, Yee-Chun; Lin, Yusen E

    2010-07-01

    Healthcare workers (HCWs) are at high risk of acquiring emerging infections while caring for patients, as has been shown in the recent SARS and swine flu epidemics. Using SARS as an example, we determined the effectiveness of infection control measures (ICMs) by logistic regression and structural equation modelling (SEM), a quantitative methodology that can test a hypothetical model and validates causal relationships among ICMs. Logistic regression showed that installing hand wash stations in the emergency room (p = 0.012, odds ratio = 1.07) was the only ICM significantly associated with the protection of HCWs from acquiring the SARS virus. The structural equation modelling results showed that the most important contributing factor (highest proportion of effectiveness) was installation of a fever screening station outside the emergency department (51%). Other measures included traffic control in the emergency department (19%), availability of an outbreak standard operation protocol (12%), mandatory temperature screening (9%), establishing a hand washing setup at each hospital checkpoint (3%), adding simplified isolation rooms (3%), and a standardized patient transfer protocol (3%). Installation of fever screening stations outside of the hospital and implementing traffic control in the emergency department contributed to 70% of the effectiveness in the prevention of SARS transmission. Our approach can be applied to the evaluation of control measures for other epidemic infectious diseases, including swine flu and avian flu.

  15. Sleep duration, daytime napping, markers of obstructive sleep apnea and stroke in a population of southern China

    PubMed Central

    Wen, Ye; Pi, Fu-Hua; Guo, Pi; Dong, Wen-Ya; Xie, Yu-Qing; Wang, Xiang-Yu; Xia, Fang-Fang; Pang, Shao-Jie; Wu, Yan-Chun; Wang, Yuan-Yuan; Zhang, Qing-Ying

    2016-01-01

    Sleep habits are associated with stroke in western populations, but this relation has been rarely investigated in China. Moreover, the differences among stroke subtypes remain unclear. This study aimed to explore the associations of total stroke, including ischemic and hemorrhagic type, with sleep habits of a population in southern China. We performed a case-control study in patients admitted to the hospital with first stroke and community control subjects. A total of 333 patients (n = 223, 67.0%, with ischemic stroke; n = 110, 23.0%, with hemorrhagic stroke) and 547 controls were enrolled in the study. Participants completed a structured questionnaire to identify sleep habits and other stroke risk factors. Least absolute shrinkage and selection operator (Lasso) and multiple logistic regression were performed to identify risk factors of disease. Incidence of stroke, and its subtypes, was significantly associated with snorting/gasping, snoring, sleep duration, and daytime napping. Snorting/gasping was identified as an important risk factor in the Lasso logistic regression model (Lasso’ β = 0.84), and the result was proven to be robust. This study showed the association between stroke and sleep habits in the southern Chinese population and might help in better detecting important sleep-related factors for stroke risk. PMID:27698374

  16. Association between bullous pemphigoid and neurologic diseases: a case-control study.

    PubMed

    Casas-de-la-Asunción, E; Ruano-Ruiz, J; Rodríguez-Martín, A M; Vélez García-Nieto, A; Moreno-Giménez, J C

    2014-11-01

    In the past 10 years, bullous pemphigoid has been associated with other comorbidities and neurologic and psychiatric conditions in particular. Case series, small case-control studies, and large population-based studies in different Asian populations, mainland Europe, and the United Kingdom have confirmed this association. However, no data are available for the Spanish population. This was an observational, retrospective, case-control study with 1:2 matching. Fifty-four patients with bullous pemphigoid were selected. We compared the percentage of patients in each group with concurrent neurologic conditions, ischemic heart disease, diabetes, chronic obstructive pulmonary disease, and solid tumors using univariate logistic regression. An association model was constructed with conditional multiple logistic regression. The case group had a significantly higher percentage of patients with cerebrovascular accident and/or transient ischemic attack (odds ratio [OR], 3.06; 95% CI, 1.19-7.87], dementia (OR, 5.52; 95% CI, 2.19-13.93), and Parkinson disease (OR, 5; 95% CI, 1.57-15.94). A significantly higher percentage of cases had neurologic conditions (OR, 6.34; 95% CI, 2.89-13.91). Dementia and Parkinson disease were independently associated with bullous pemphigoid in the multivariate analysis. Patients with bullous pemphigoid have a higher frequency of neurologic conditions. Copyright © 2013 Elsevier España, S.L.U. and AEDV. All rights reserved.

  17. The association between dietary lignans, phytoestrogen-rich foods, and fiber intake and postmenopausal breast cancer risk: a German case-control study.

    PubMed

    Zaineddin, Aida Karina; Buck, Katharina; Vrieling, Alina; Heinz, Judith; Flesch-Janys, Dieter; Linseisen, Jakob; Chang-Claude, Jenny

    2012-01-01

    Phytoestrogens are structurally similar to estrogens and may affect breast cancer risk by mimicking estrogenic/antiestrogenic properties. In Western societies, whole grains and possibly soy foods are rich sources of phytoestrogens. A population-based case-control study in German postmenopausal women was used to evaluate the association of phytoestrogen-rich foods and dietary lignans with breast cancer risk. Dietary data were collected from 2,884 cases and 5,509 controls using a validated food-frequency questionnaire, which included additional questions phytoestrogen-rich foods. Associations were assessed using conditional logistic regression. All analyses were adjusted for relevant risk and confounding factors. Polytomous logistic regression analysis was performed to evaluate the associations by estrogen receptor (ER) status. High and low consumption of soybeans as well as of sunflower and pumpkin seeds were associated with significantly reduced breast cancer risk compared to no consumption (OR = 0.83, 95% CI = 0.70-0.97; and OR = 0.66, 95% CI = 0.77-0.97, respectively). The observed associations were not differential by ER status. No statistically significant associations were found for dietary intake of plant lignans, fiber, or the calculated enterolignans. Our results provide evidence for a reduced postmenopausal breast cancer risk associated with increased consumption of sunflower and pumpkin seeds and soybeans.

  18. Development and evaluation of an electromagnetic hypersensitivity questionnaire for Japanese people

    PubMed Central

    Tokiya, Mikiko; Mizuki, Masami; Miyata, Mikio; Kanatani, Kumiko T.; Takagi, Airi; Tsurikisawa, Naomi; Kame, Setsuko; Katoh, Takahiko; Tsujiuchi, Takuya; Kumano, Hiroaki

    2016-01-01

    The purpose of the present study was to evaluate the validity and reliability of a Japanese version of an electromagnetic hypersensitivity (EHS) questionnaire, originally developed by Eltiti et al. in the United Kingdom. Using this Japanese EHS questionnaire, surveys were conducted on 1306 controls and 127 self‐selected EHS subjects in Japan. Principal component analysis of controls revealed eight principal symptom groups, namely, nervous, skin‐related, head‐related, auditory and vestibular, musculoskeletal, allergy‐related, sensory, and heart/chest‐related. The reliability of the Japanese EHS questionnaire was confirmed by high to moderate intraclass correlation coefficients in a test–retest analysis, and high Cronbach's α coefficients (0.853–0.953) from each subscale. A comparison of scores of each subscale between self‐selected EHS subjects and age‐ and sex‐matched controls using bivariate logistic regression analysis, Mann–Whitney U‐ and χ 2 tests, verified the validity of the questionnaire. This study demonstrated that the Japanese EHS questionnaire is reliable and valid, and can be used for surveillance of EHS individuals in Japan. Furthermore, based on multiple logistic regression and receiver operating characteristic analyses, we propose specific preliminary criteria for screening EHS individuals in Japan. Bioelectromagnetics. 37:353–372, 2016. © 2016 The Authors. Bioelectromagnetics Published by Wiley Periodicals, Inc. PMID:27324106

  19. What does theory-driven evaluation add to the analysis of self-reported outcomes of diabetes education? A comparative realist evaluation of a participatory patient education approach.

    PubMed

    Pals, Regitze A S; Olesen, Kasper; Willaing, Ingrid

    2016-06-01

    To explore the effects of the Next Education (NEED) patient education approach in diabetes education. We tested the use of the NEED approach at eight intervention sites (n=193). Six additional sites served as controls (n=58). Data were collected through questionnaires, interviews and observations. We analysed data using descriptive statistics, logistic regression and systematic text condensation. Results from logistic regression demonstrated better overall assessment of education program experiences and enhanced self-reported improvements in maintaining medications correctly among patients from intervention sites, as compared to control sites. Interviews and observations suggested that improvements in health behavior could be explained by mechanisms related to the education setting, including using person-centeredness and dialogue. However, similar mechanisms were observed at control sites. Observations suggested that the quality of group dynamics, patients' motivation and educators' ability to facilitate participation in education, supported by the NEED approach, contributed to better results at intervention sites. The use of participatory approaches and, in particular, the NEED patient education approach in group-based diabetes education improved self-management skills and health behavior outcomes among individuals with diabetes. The use of dialogue tools in diabetes education is advised for educators. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  20. [Risk factors for anorexia in children].

    PubMed

    Liu, Wei-Xiao; Lang, Jun-Feng; Zhang, Qin-Feng

    2016-11-01

    To investigate the risk factors for anorexia in children, and to reduce the prevalence of anorexia in children. A questionnaire survey and a case-control study were used to collect the general information of 150 children with anorexia (case group) and 150 normal children (control group). Univariate analysis and multivariate logistic stepwise regression analysis were performed to identify the risk factors for anorexia in children. The results of the univariate analysis showed significant differences between the case and control groups in the age in months when supplementary food were added, feeding pattern, whether they liked meat, vegetables and salty food, whether they often took snacks and beverages, whether they liked to play while eating, and whether their parents asked them to eat food on time (P<0.05). The results of the multivariate logistic regression analysis showed that late addition of supplementary food (OR=5.408), high frequency of taking snacks and/or drinks (OR=11.813), and eating while playing (OR=6.654) were major risk factors for anorexia in children. Liking of meat (OR=0.093) and vegetables (OR=0.272) and eating on time required by parents (OR=0.079) were protective factors against anorexia in children. Timely addition of supplementary food, a proper diet, and development of children's proper eating and living habits can reduce the incidence of anorexia in children.

  1. [Diabetic Foot Neuropathy and Related Factors in Patients With Type 2 Diabetes Mellitus].

    PubMed

    Chen, Tzu-Yu; Lin, Chia-Huei; Chang, Yue-Cune; Wang, Chih-Hsin; Hung, Yi-Jen; Tzeng, Wen-Chii

    2018-06-01

    Patients with type 2 diabetes mellitus (T2DM) face a higher risk of diabetic foot neuropathy, which increases the risk of death. The early detection of factors that influence diabetic neuropathy reduces the risk of foot lesions, including foot ulcerations, lower extremity amputation, and mortality. To explore the demographic, disease-characteristic, health-literacy, and foot-self-care-behavior factors that affect diabetic foot neuropathy in patients with T2DM. A case-control study design was employed in which cases (Michigan Neuropathy Screening Instrument, MNSI) ≥ 2 were matched to controls based on age and gender in a medical center. A total of 114 patients diagnosed with T2DM in a medical center were recruited as participants. Data were collected using a structured questionnaire. The collected data were analyzed using Fisher's exact test, Mann-Whitney U test, and logistic regression. The results of multiple logistic regression showed that glycated hemoglobin (B = 1.696, p = .041) and communication and critical health literacy (B = -0.082, p = .034) were significant factors of diabetic foot neuropathy. The findings of this study suggest that nurses should assess the health literacy of patients with T2DM before providing health education and should develop a specific foot-care intervention for individuals with poor glycemic control.

  2. Epidemiological characteristics of measles from 2000 to 2014: Results of a measles catch-up vaccination campaign in Xianyang, China.

    PubMed

    Zhang, Rong-Qiang; Li, Hong-Bing; Li, Feng-Ying; Han, Li-Xin; Xiong, Yong-Min

    This study was a cross-sectional case-control study aimed at (1) identifying risk factors contributing to the measles epidemic and (2) evaluating the impacts of measles-containing vaccines (MCVs), with the goal of providing evidence-based recommendations for measles elimination strategies in China. Data on measles cases from 2000 to 2014 were obtained from a passive surveillance system at the Center for Diseases Prevention and Control in Xianyang. The effectiveness of MCVs was evaluated in 357 patients with a vaccination history and 503 healthy randomly selected controls. Patient data were subjected to multivariable logistic regression modeling. From 2005 to 2014, the average incidence of measles in Xianyang was 5.42 cases per 100,000 people. The second MCV dose was highly protective in 8-month-old infants. MCVs in general have been highly protective in 8-month-old infants. Multivariable logistic regression modeling indicated that age (≥2 years vs. <2years), MCV dose 2 vaccination, and MV vaccination were each independently associated with measles case status. In conclusions: A MCV should be administered on time to all age-eligible children, reproductive-age women, and migrant populations, to maximize herd immunity to measles. Copyright © 2017 The Authors. Published by Elsevier Ltd.. All rights reserved.

  3. Clustering performance comparison using K-means and expectation maximization algorithms.

    PubMed

    Jung, Yong Gyu; Kang, Min Soo; Heo, Jun

    2014-11-14

    Clustering is an important means of data mining based on separating data categories by similar features. Unlike the classification algorithm, clustering belongs to the unsupervised type of algorithms. Two representatives of the clustering algorithms are the K -means and the expectation maximization (EM) algorithm. Linear regression analysis was extended to the category-type dependent variable, while logistic regression was achieved using a linear combination of independent variables. To predict the possibility of occurrence of an event, a statistical approach is used. However, the classification of all data by means of logistic regression analysis cannot guarantee the accuracy of the results. In this paper, the logistic regression analysis is applied to EM clusters and the K -means clustering method for quality assessment of red wine, and a method is proposed for ensuring the accuracy of the classification results.

  4. Regression trees for predicting mortality in patients with cardiovascular disease: What improvement is achieved by using ensemble-based methods?

    PubMed Central

    Austin, Peter C; Lee, Douglas S; Steyerberg, Ewout W; Tu, Jack V

    2012-01-01

    In biomedical research, the logistic regression model is the most commonly used method for predicting the probability of a binary outcome. While many clinical researchers have expressed an enthusiasm for regression trees, this method may have limited accuracy for predicting health outcomes. We aimed to evaluate the improvement that is achieved by using ensemble-based methods, including bootstrap aggregation (bagging) of regression trees, random forests, and boosted regression trees. We analyzed 30-day mortality in two large cohorts of patients hospitalized with either acute myocardial infarction (N = 16,230) or congestive heart failure (N = 15,848) in two distinct eras (1999–2001 and 2004–2005). We found that both the in-sample and out-of-sample prediction of ensemble methods offered substantial improvement in predicting cardiovascular mortality compared to conventional regression trees. However, conventional logistic regression models that incorporated restricted cubic smoothing splines had even better performance. We conclude that ensemble methods from the data mining and machine learning literature increase the predictive performance of regression trees, but may not lead to clear advantages over conventional logistic regression models for predicting short-term mortality in population-based samples of subjects with cardiovascular disease. PMID:22777999

  5. Mead acid (20:3n-9) and n-3 polyunsaturated fatty acids are not associated with risk of posterior longitudinal ligament ossification: results of a case-control study.

    PubMed

    Hamazaki, Kei; Kawaguchi, Yoshiharu; Nakano, Masato; Yasuda, Taketoshi; Seki, Shoji; Hori, Takeshi; Hamazaki, Tomohito; Kimura, Tomoatsu

    2015-05-01

    Ossification of the posterior longitudinal ligament (OPLL) involves the replacement of ligamentous tissue with ectopic bone. Although genetics and heritability appear to be involved in the development of OPLL, its pathogenesis remains to be elucidated. Given previous findings that 5,8,11-eicosatrienoic acid [20:3n-9, Mead acid (MA)] has depressive effects on osteoblastic activity and anti-angiogenic effects, and that n-3 polyunsaturated fatty acids (PUFAs) have a preventive effect on heterotopic ossification, we hypothesized that both fatty acids would be involved in OPLL development. To examine the biological significance of these and other fatty acids in OPLL, we conducted this case-control study involving 106 patients with cervical OPLL and 109 age matched controls. Fatty acid composition was determined from plasma samples by gas chromatography. Associations between fatty acid levels and incident OPLL were evaluated by logistic regression. Contrary to our expectations, we found no significant differences between patients and controls in the levels of MA or n-3 PUFAs (e.g., eicosapentaenoic acid and docosahexaenoic acid). Logistic regression analysis did not reveal any associations with OPLL risk for MA or n-3 PUFAs. In conclusion, no potential role was found for MA or n-3 PUFAs in ectopic bone formation in the spinal canal. Copyright © 2015 Elsevier Ltd. All rights reserved.

  6. Genetic risk factors for ovarian cancer and their role for endometriosis risk.

    PubMed

    Burghaus, Stefanie; Fasching, Peter A; Häberle, Lothar; Rübner, Matthias; Büchner, Kathrin; Blum, Simon; Engel, Anne; Ekici, Arif B; Hartmann, Arndt; Hein, Alexander; Beckmann, Matthias W; Renner, Stefan P

    2017-04-01

    Several genetic variants have been validated as risk factors for ovarian cancer. Endometriosis has also been described as a risk factor for ovarian cancer. Identifying genetic risk factors that are common to the two diseases might help improve our understanding of the molecular pathogenesis potentially linking the two conditions. In a hospital-based case-control analysis, 12 single nucleotide polymorphisms (SNPs), validated by the Ovarian Cancer Association Consortium (OCAC) and the Collaborative Oncological Gene-environment Study (COGS) project, were genotyped using TaqMan® OpenArray™ analysis. The cases consisted of patients with endometriosis, and the controls were healthy individuals without endometriosis. A total of 385 cases and 484 controls were analyzed. Odds ratios and P values were obtained using simple logistic regression models, as well as from multiple logistic regression models with adjustment for clinical predictors. rs11651755 in HNF1B was found to be associated with endometriosis in this case-control study. The OR was 0.66 (95% CI, 0.51 to 0.84) and the P value after correction for multiple testing was 0.01. None of the other genotypes was associated with a risk for endometriosis. As rs11651755 in HNF1B modified both the ovarian cancer risk and also the risk for endometriosis, HNF1B may be causally involved in the pathogenetic pathway leading from endometriosis to ovarian cancer. Copyright © 2017 Elsevier Inc. All rights reserved.

  7. Factors associated with difficulty achieving initial control with crotalidae polyvalent immune fab antivenom in snakebite patients.

    PubMed

    Yin, Shan; Kokko, Jamie; Lavonas, Eric; Mlynarchek, Sara; Bogdan, Greg; Schaeffer, Tammi

    2011-01-01

    The prescribing information for Crotalidae Fab antivenom (FabAV) instructs clinicians to administer FabAV until initial control of the envenomation syndrome is achieved. Risk factors for difficulty achieving initial control are not known. The study aim was to identify factors present before administration of antivenom associated with difficulty achieving initial control. The authors conducted a retrospective study of all patients presenting to any one of 17 centers and receiving FabAV from 2002 to 2004. Demographic and historical information, as well as data about nine specific venom effects, were collected prior to the administration of antivenom. An expert panel used standard criteria to determine if initial control was achieved. The patient group that had difficulty achieving initial control was compared to the group that achieved initial control, and adjusted odds ratios were calculated using stepwise logistic regression. A total of 247 patients were included in the final analysis. The majority of patients were envenomated on the upper extremity and were young males. A total of 203 patients (82.2%) achieved initial control. In univariate analysis, thrombocytopenia, bleeding, neurologic effects, and a severe bite were significantly associated with difficulty achieving initial control. After logistic regression, the presence of neurologic effects and thrombocytopenia remained significantly associated with difficulty achieving initial control. When both factors were present, the patient was 13.8 times more likely to have difficulty achieving initial control. A number of factors were present before the administration of FabAV that were independently associated with difficulty achieving initial control of the envenomation syndrome. Predicting which patients will have difficulty achieving initial control has important ramifications for patient disposition and may provide insight into the mechanisms for lack of antivenom efficacy. © 2010 by the Society for Academic Emergency Medicine.

  8. Gender orientation and alcohol-related weight control behavior among male and female college students.

    PubMed

    Peralta, Robert L; Barr, Peter B

    2017-01-01

    We examine weight control behavior used to (a) compensate for caloric content of heavy alcohol use; and (b) enhance the psychoactive effects of alcohol among college students. We evaluate the role of gender orientation and sex. Participants completed an online survey (N = 651; 59.9% women; 40.1% men). Weight control behavior was assessed via the Compensatory-Eating-and-Behaviors-in Response-to-Alcohol-Consumption-Scale. Control variables included sex, race/ethnicity, age, and depressive symptoms. Gender orientation was measured by the Bem Sex Role Inventory. The prevalence and probability of alcohol-related weight control behavior using ordinal logistic regression are reported. Men and women do not significantly differ in compensatory-weight-control-behavior. However, regression models suggest that recent binge drinking, other substance use, and masculine orientation are positively associated with alcohol-related weight control behavior. Sex was not a robust predictor of weight control behavior. Masculine orientation should be considered a possible risk factor for these behaviors and considered when designing prevention and intervention strategies.

  9. Strategies for Testing Statistical and Practical Significance in Detecting DIF with Logistic Regression Models

    ERIC Educational Resources Information Center

    Fidalgo, Angel M.; Alavi, Seyed Mohammad; Amirian, Seyed Mohammad Reza

    2014-01-01

    This study examines three controversial aspects in differential item functioning (DIF) detection by logistic regression (LR) models: first, the relative effectiveness of different analytical strategies for detecting DIF; second, the suitability of the Wald statistic for determining the statistical significance of the parameters of interest; and…

  10. A Note on Three Statistical Tests in the Logistic Regression DIF Procedure

    ERIC Educational Resources Information Center

    Paek, Insu

    2012-01-01

    Although logistic regression became one of the well-known methods in detecting differential item functioning (DIF), its three statistical tests, the Wald, likelihood ratio (LR), and score tests, which are readily available under the maximum likelihood, do not seem to be consistently distinguished in DIF literature. This paper provides a clarifying…

  11. "Let Me Count the Ways:" Fostering Reasons for Living among Low-Income, Suicidal, African American Women

    ERIC Educational Resources Information Center

    West, Lindsey M.; Davis, Telsie A.; Thompson, Martie P.; Kaslow, Nadine J.

    2011-01-01

    Protective factors for fostering reasons for living were examined among low-income, suicidal, African American women. Bivariate logistic regressions revealed that higher levels of optimism, spiritual well-being, and family social support predicted reasons for living. Multivariate logistic regressions indicated that spiritual well-being showed…

  12. Comparison of Two Approaches for Handling Missing Covariates in Logistic Regression

    ERIC Educational Resources Information Center

    Peng, Chao-Ying Joanne; Zhu, Jin

    2008-01-01

    For the past 25 years, methodological advances have been made in missing data treatment. Most published work has focused on missing data in dependent variables under various conditions. The present study seeks to fill the void by comparing two approaches for handling missing data in categorical covariates in logistic regression: the…

  13. Comparison of IRT Likelihood Ratio Test and Logistic Regression DIF Detection Procedures

    ERIC Educational Resources Information Center

    Atar, Burcu; Kamata, Akihito

    2011-01-01

    The Type I error rates and the power of IRT likelihood ratio test and cumulative logit ordinal logistic regression procedures in detecting differential item functioning (DIF) for polytomously scored items were investigated in this Monte Carlo simulation study. For this purpose, 54 simulation conditions (combinations of 3 sample sizes, 2 sample…

  14. Multiple Logistic Regression Analysis of Cigarette Use among High School Students

    ERIC Educational Resources Information Center

    Adwere-Boamah, Joseph

    2011-01-01

    A binary logistic regression analysis was performed to predict high school students' cigarette smoking behavior from selected predictors from 2009 CDC Youth Risk Behavior Surveillance Survey. The specific target student behavior of interest was frequent cigarette use. Five predictor variables included in the model were: a) race, b) frequency of…

  15. Modeling Polytomous Item Responses Using Simultaneously Estimated Multinomial Logistic Regression Models

    ERIC Educational Resources Information Center

    Anderson, Carolyn J.; Verkuilen, Jay; Peyton, Buddy L.

    2010-01-01

    Survey items with multiple response categories and multiple-choice test questions are ubiquitous in psychological and educational research. We illustrate the use of log-multiplicative association (LMA) models that are extensions of the well-known multinomial logistic regression model for multiple dependent outcome variables to reanalyze a set of…

  16. Propensity Score Estimation with Data Mining Techniques: Alternatives to Logistic Regression

    ERIC Educational Resources Information Center

    Keller, Bryan S. B.; Kim, Jee-Seon; Steiner, Peter M.

    2013-01-01

    Propensity score analysis (PSA) is a methodological technique which may correct for selection bias in a quasi-experiment by modeling the selection process using observed covariates. Because logistic regression is well understood by researchers in a variety of fields and easy to implement in a number of popular software packages, it has…

  17. Two-factor logistic regression in pediatric liver transplantation

    NASA Astrophysics Data System (ADS)

    Uzunova, Yordanka; Prodanova, Krasimira; Spasov, Lyubomir

    2017-12-01

    Using a two-factor logistic regression analysis an estimate is derived for the probability of absence of infections in the early postoperative period after pediatric liver transplantation. The influence of both the bilirubin level and the international normalized ratio of prothrombin time of blood coagulation at the 5th postoperative day is studied.

  18. Predictors of Placement Stability at the State Level: The Use of Logistic Regression to Inform Practice

    ERIC Educational Resources Information Center

    Courtney, Jon R.; Prophet, Retta

    2011-01-01

    Placement instability is often associated with a number of negative outcomes for children. To gain state level contextual knowledge of factors associated with placement stability/instability, logistic regression was applied to selected variables from the New Mexico Adoption and Foster Care Administrative Reporting System dataset. Predictors…

  19. Classifying machinery condition using oil samples and binary logistic regression

    NASA Astrophysics Data System (ADS)

    Phillips, J.; Cripps, E.; Lau, John W.; Hodkiewicz, M. R.

    2015-08-01

    The era of big data has resulted in an explosion of condition monitoring information. The result is an increasing motivation to automate the costly and time consuming human elements involved in the classification of machine health. When working with industry it is important to build an understanding and hence some trust in the classification scheme for those who use the analysis to initiate maintenance tasks. Typically "black box" approaches such as artificial neural networks (ANN) and support vector machines (SVM) can be difficult to provide ease of interpretability. In contrast, this paper argues that logistic regression offers easy interpretability to industry experts, providing insight to the drivers of the human classification process and to the ramifications of potential misclassification. Of course, accuracy is of foremost importance in any automated classification scheme, so we also provide a comparative study based on predictive performance of logistic regression, ANN and SVM. A real world oil analysis data set from engines on mining trucks is presented and using cross-validation we demonstrate that logistic regression out-performs the ANN and SVM approaches in terms of prediction for healthy/not healthy engines.

  20. Length bias correction in gene ontology enrichment analysis using logistic regression.

    PubMed

    Mi, Gu; Di, Yanming; Emerson, Sarah; Cumbie, Jason S; Chang, Jeff H

    2012-01-01

    When assessing differential gene expression from RNA sequencing data, commonly used statistical tests tend to have greater power to detect differential expression of genes encoding longer transcripts. This phenomenon, called "length bias", will influence subsequent analyses such as Gene Ontology enrichment analysis. In the presence of length bias, Gene Ontology categories that include longer genes are more likely to be identified as enriched. These categories, however, are not necessarily biologically more relevant. We show that one can effectively adjust for length bias in Gene Ontology analysis by including transcript length as a covariate in a logistic regression model. The logistic regression model makes the statistical issue underlying length bias more transparent: transcript length becomes a confounding factor when it correlates with both the Gene Ontology membership and the significance of the differential expression test. The inclusion of the transcript length as a covariate allows one to investigate the direct correlation between the Gene Ontology membership and the significance of testing differential expression, conditional on the transcript length. We present both real and simulated data examples to show that the logistic regression approach is simple, effective, and flexible.

  1. Label-noise resistant logistic regression for functional data classification with an application to Alzheimer's disease study.

    PubMed

    Lee, Seokho; Shin, Hyejin; Lee, Sang Han

    2016-12-01

    Alzheimer's disease (AD) is usually diagnosed by clinicians through cognitive and functional performance test with a potential risk of misdiagnosis. Since the progression of AD is known to cause structural changes in the corpus callosum (CC), the CC thickness can be used as a functional covariate in AD classification problem for a diagnosis. However, misclassified class labels negatively impact the classification performance. Motivated by AD-CC association studies, we propose a logistic regression for functional data classification that is robust to misdiagnosis or label noise. Specifically, our logistic regression model is constructed by adopting individual intercepts to functional logistic regression model. This approach enables to indicate which observations are possibly mislabeled and also lead to a robust and efficient classifier. An effective algorithm using MM algorithm provides simple closed-form update formulas. We test our method using synthetic datasets to demonstrate its superiority over an existing method, and apply it to differentiating patients with AD from healthy normals based on CC from MRI. © 2016, The International Biometric Society.

  2. Logistic regression for circular data

    NASA Astrophysics Data System (ADS)

    Al-Daffaie, Kadhem; Khan, Shahjahan

    2017-05-01

    This paper considers the relationship between a binary response and a circular predictor. It develops the logistic regression model by employing the linear-circular regression approach. The maximum likelihood method is used to estimate the parameters. The Newton-Raphson numerical method is used to find the estimated values of the parameters. A data set from weather records of Toowoomba city is analysed by the proposed methods. Moreover, a simulation study is considered. The R software is used for all computations and simulations.

  3. Naval Research Logistics Quarterly. Volume 28. Number 3,

    DTIC Science & Technology

    1981-09-01

    denotes component-wise maximum. f has antone (isotone) differences on C x D if for cl < c2 and d, < d2, NAVAL RESEARCH LOGISTICS QUARTERLY VOL. 28...or negative correlations and linear or nonlinear regressions. Given are the mo- ments to order two and, for special cases, (he regression function and...data sets. We designate this bnb distribution as G - B - N(a, 0, v). The distribution admits only of positive correlation and linear regressions

  4. Regression approaches in the test-negative study design for assessment of influenza vaccine effectiveness.

    PubMed

    Bond, H S; Sullivan, S G; Cowling, B J

    2016-06-01

    Influenza vaccination is the most practical means available for preventing influenza virus infection and is widely used in many countries. Because vaccine components and circulating strains frequently change, it is important to continually monitor vaccine effectiveness (VE). The test-negative design is frequently used to estimate VE. In this design, patients meeting the same clinical case definition are recruited and tested for influenza; those who test positive are the cases and those who test negative form the comparison group. When determining VE in these studies, the typical approach has been to use logistic regression, adjusting for potential confounders. Because vaccine coverage and influenza incidence change throughout the season, time is included among these confounders. While most studies use unconditional logistic regression, adjusting for time, an alternative approach is to use conditional logistic regression, matching on time. Here, we used simulation data to examine the potential for both regression approaches to permit accurate and robust estimates of VE. In situations where vaccine coverage changed during the influenza season, the conditional model and unconditional models adjusting for categorical week and using a spline function for week provided more accurate estimates. We illustrated the two approaches on data from a test-negative study of influenza VE against hospitalization in children in Hong Kong which resulted in the conditional logistic regression model providing the best fit to the data.

  5. ADCYAP1R1 and asthma in Puerto Rican children.

    PubMed

    Chen, Wei; Boutaoui, Nadia; Brehm, John M; Han, Yueh-Ying; Schmitz, Cassandra; Cressley, Alex; Acosta-Pérez, Edna; Alvarez, María; Colón-Semidey, Angel; Baccarelli, Andrea A; Weeks, Daniel E; Kolls, Jay K; Canino, Glorisa; Celedón, Juan C

    2013-03-15

    Epigenetic and/or genetic variation in the gene encoding the receptor for adenylate-cyclase activating polypeptide 1 (ADCYAP1R1) has been linked to post-traumatic stress disorder in adults and anxiety in children. Psychosocial stress has been linked to asthma morbidity in Puerto Rican children. To examine whether epigenetic or genetic variation in ADCYAP1R1 is associated with childhood asthma in Puerto Ricans. We conducted a case-control study of 516 children ages 6-14 years living in San Juan, Puerto Rico. We assessed methylation at a CpG site in the promoter of ADCYAP1R1 (cg11218385) using a pyrosequencing assay in DNA from white blood cells. We tested whether cg11218385 methylation (range, 0.4-6.1%) is associated with asthma using logistic regression. We also examined whether exposure to violence (assessed by the Exposure to Violence [ETV] Scale in children 9 yr and older) is associated with cg11218385 methylation (using linear regression) or asthma (using logistic regression). Logistic regression was used to test for association between a single nucleotide polymorphism in ADCYAP1R1 (rs2267735) and asthma under an additive model. All multivariate models were adjusted for age, sex, household income, and principal components. EACH 1% increment in cg11218385 methylation was associated with increased odds of asthma (adjusted odds ratio, 1.3; 95% confidence interval, 1.0-1.6; P = 0.03). Among children 9 years and older, exposure to violence was associated with cg11218385 methylation. The C allele of single nucleotide polymorphism rs2267735 was significantly associated with increased odds of asthma (adjusted odds ratio, 1.3; 95% confidence interval, 1.02-1.67; P = 0.03). Epigenetic and genetic variants in ADCYAP1R1 are associated with asthma in Puerto Rican children.

  6. A novel hybrid method of beta-turn identification in protein using binary logistic regression and neural network

    PubMed Central

    Asghari, Mehdi Poursheikhali; Hayatshahi, Sayyed Hamed Sadat; Abdolmaleki, Parviz

    2012-01-01

    From both the structural and functional points of view, β-turns play important biological roles in proteins. In the present study, a novel two-stage hybrid procedure has been developed to identify β-turns in proteins. Binary logistic regression was initially used for the first time to select significant sequence parameters in identification of β-turns due to a re-substitution test procedure. Sequence parameters were consisted of 80 amino acid positional occurrences and 20 amino acid percentages in sequence. Among these parameters, the most significant ones which were selected by binary logistic regression model, were percentages of Gly, Ser and the occurrence of Asn in position i+2, respectively, in sequence. These significant parameters have the highest effect on the constitution of a β-turn sequence. A neural network model was then constructed and fed by the parameters selected by binary logistic regression to build a hybrid predictor. The networks have been trained and tested on a non-homologous dataset of 565 protein chains. With applying a nine fold cross-validation test on the dataset, the network reached an overall accuracy (Qtotal) of 74, which is comparable with results of the other β-turn prediction methods. In conclusion, this study proves that the parameter selection ability of binary logistic regression together with the prediction capability of neural networks lead to the development of more precise models for identifying β-turns in proteins. PMID:27418910

  7. A novel hybrid method of beta-turn identification in protein using binary logistic regression and neural network.

    PubMed

    Asghari, Mehdi Poursheikhali; Hayatshahi, Sayyed Hamed Sadat; Abdolmaleki, Parviz

    2012-01-01

    From both the structural and functional points of view, β-turns play important biological roles in proteins. In the present study, a novel two-stage hybrid procedure has been developed to identify β-turns in proteins. Binary logistic regression was initially used for the first time to select significant sequence parameters in identification of β-turns due to a re-substitution test procedure. Sequence parameters were consisted of 80 amino acid positional occurrences and 20 amino acid percentages in sequence. Among these parameters, the most significant ones which were selected by binary logistic regression model, were percentages of Gly, Ser and the occurrence of Asn in position i+2, respectively, in sequence. These significant parameters have the highest effect on the constitution of a β-turn sequence. A neural network model was then constructed and fed by the parameters selected by binary logistic regression to build a hybrid predictor. The networks have been trained and tested on a non-homologous dataset of 565 protein chains. With applying a nine fold cross-validation test on the dataset, the network reached an overall accuracy (Qtotal) of 74, which is comparable with results of the other β-turn prediction methods. In conclusion, this study proves that the parameter selection ability of binary logistic regression together with the prediction capability of neural networks lead to the development of more precise models for identifying β-turns in proteins.

  8. Differential item functioning analysis with ordinal logistic regression techniques. DIFdetect and difwithpar.

    PubMed

    Crane, Paul K; Gibbons, Laura E; Jolley, Lance; van Belle, Gerald

    2006-11-01

    We present an ordinal logistic regression model for identification of items with differential item functioning (DIF) and apply this model to a Mini-Mental State Examination (MMSE) dataset. We employ item response theory ability estimation in our models. Three nested ordinal logistic regression models are applied to each item. Model testing begins with examination of the statistical significance of the interaction term between ability and the group indicator, consistent with nonuniform DIF. Then we turn our attention to the coefficient of the ability term in models with and without the group term. If including the group term has a marked effect on that coefficient, we declare that it has uniform DIF. We examined DIF related to language of test administration in addition to self-reported race, Hispanic ethnicity, age, years of education, and sex. We used PARSCALE for IRT analyses and STATA for ordinal logistic regression approaches. We used an iterative technique for adjusting IRT ability estimates on the basis of DIF findings. Five items were found to have DIF related to language. These same items also had DIF related to other covariates. The ordinal logistic regression approach to DIF detection, when combined with IRT ability estimates, provides a reasonable alternative for DIF detection. There appear to be several items with significant DIF related to language of test administration in the MMSE. More attention needs to be paid to the specific criteria used to determine whether an item has DIF, not just the technique used to identify DIF.

  9. Use of generalized ordered logistic regression for the analysis of multidrug resistance data.

    PubMed

    Agga, Getahun E; Scott, H Morgan

    2015-10-01

    Statistical analysis of antimicrobial resistance data largely focuses on individual antimicrobial's binary outcome (susceptible or resistant). However, bacteria are becoming increasingly multidrug resistant (MDR). Statistical analysis of MDR data is mostly descriptive often with tabular or graphical presentations. Here we report the applicability of generalized ordinal logistic regression model for the analysis of MDR data. A total of 1,152 Escherichia coli, isolated from the feces of weaned pigs experimentally supplemented with chlortetracycline (CTC) and copper, were tested for susceptibilities against 15 antimicrobials and were binary classified into resistant or susceptible. The 15 antimicrobial agents tested were grouped into eight different antimicrobial classes. We defined MDR as the number of antimicrobial classes to which E. coli isolates were resistant ranging from 0 to 8. Proportionality of the odds assumption of the ordinal logistic regression model was violated only for the effect of treatment period (pre-treatment, during-treatment and post-treatment); but not for the effect of CTC or copper supplementation. Subsequently, a partially constrained generalized ordinal logistic model was built that allows for the effect of treatment period to vary while constraining the effects of treatment (CTC and copper supplementation) to be constant across the levels of MDR classes. Copper (Proportional Odds Ratio [Prop OR]=1.03; 95% CI=0.73-1.47) and CTC (Prop OR=1.1; 95% CI=0.78-1.56) supplementation were not significantly associated with the level of MDR adjusted for the effect of treatment period. MDR generally declined over the trial period. In conclusion, generalized ordered logistic regression can be used for the analysis of ordinal data such as MDR data when the proportionality assumptions for ordered logistic regression are violated. Published by Elsevier B.V.

  10. Modeling recall memory for emotional objects in Alzheimer's disease.

    PubMed

    Sundstrøm, Martin

    2011-07-01

    To examine whether emotional memory (EM) of objects with self-reference in Alzheimer's disease (AD) can be modeled with binomial logistic regression in a free recall and an object recognition test to predict EM enhancement. Twenty patients with AD and twenty healthy controls were studied. Six objects (three presented as gifts) were shown to each participant. Ten minutes later, a free recall and a recognition test were applied. The recognition test had target-objects mixed with six similar distracter objects. Participants were asked to name any object in the recall test and identify each object in the recognition test as known or unknown. The total of gift objects recalled in AD patients (41.6%) was larger than neutral objects (13.3%) and a significant EM recall effect for gifts was found (Wilcoxon: p < .003). EM was not found for recognition in AD patients due to a ceiling effect. Healthy older adults scored overall higher in recall and recognition but showed no EM enhancement due to a ceiling effect. A logistic regression showed that likelihood of emotional recall memory can be modeled as a function of MMSE score (p < .014) and object status (p < .0001) as gift or non-gift. Recall memory was enhanced in AD patients for emotional objects indicating that EM in mild to moderate AD although impaired can be provoked with strong emotional load. The logistic regression model suggests that EM declines with the progression of AD rather than disrupts and may be a useful tool for evaluating magnitude of emotional load.

  11. Cross-national differences in the gender gap in subjective health in Europe: does country-level gender equality matter?

    PubMed

    Dahlin, Johanna; Härkönen, Juho

    2013-12-01

    Multiple studies have found that women report being in worse health despite living longer. Gender gaps vary cross-nationally, but relatively little is known about the causes of comparative differences. Existing literature is inconclusive as to whether gender gaps in health are smaller in more gender equal societies. We analyze gender gaps in self-rated health (SRH) and limiting longstanding illness (LLI) with five waves of European Social Survey data for 191,104 respondents from 28 countries. We use means, odds ratios, logistic regressions, and multilevel random slopes logistic regressions. Gender gaps in subjective health vary visibly across Europe. In many countries (especially in Eastern and Southern Europe), women report distinctly worse health, while in others (such as Estonia, Finland, and Great Britain) there are small or no differences. Logistic regressions ran separately for each country revealed that individual-level socioeconomic and demographic variables explain a majority of these gaps in some countries, but contribute little to their understanding in most countries. In yet other countries, men had worse health when these variables were controlled for. Cross-national variation in the gender gaps exists after accounting for individual-level factors. Against expectations, the remaining gaps are not systematically related to societal-level gender inequality in the multilevel analyses. Our findings stress persistent cross-national variability in gender gaps in health and call for further analysis. Copyright © 2013 Elsevier Ltd. All rights reserved.

  12. Artificial neural networks predict the incidence of portosplenomesenteric venous thrombosis in patients with acute pancreatitis.

    PubMed

    Fei, Y; Hu, J; Li, W-Q; Wang, W; Zong, G-Q

    2017-03-01

    Essentials Predicting the occurrence of portosplenomesenteric vein thrombosis (PSMVT) is difficult. We studied 72 patients with acute pancreatitis. Artificial neural networks modeling was more accurate than logistic regression in predicting PSMVT. Additional predictive factors may be incorporated into artificial neural networks. Objective To construct and validate artificial neural networks (ANNs) for predicting the occurrence of portosplenomesenteric venous thrombosis (PSMVT) and compare the predictive ability of the ANNs with that of logistic regression. Methods The ANNs and logistic regression modeling were constructed using simple clinical and laboratory data of 72 acute pancreatitis (AP) patients. The ANNs and logistic modeling were first trained on 48 randomly chosen patients and validated on the remaining 24 patients. The accuracy and the performance characteristics were compared between these two approaches by SPSS17.0 software. Results The training set and validation set did not differ on any of the 11 variables. After training, the back propagation network training error converged to 1 × 10 -20 , and it retained excellent pattern recognition ability. When the ANNs model was applied to the validation set, it revealed a sensitivity of 80%, specificity of 85.7%, a positive predictive value of 77.6% and negative predictive value of 90.7%. The accuracy was 83.3%. Differences could be found between ANNs modeling and logistic regression modeling in these parameters (10.0% [95% CI, -14.3 to 34.3%], 14.3% [95% CI, -8.6 to 37.2%], 15.7% [95% CI, -9.9 to 41.3%], 11.8% [95% CI, -8.2 to 31.8%], 22.6% [95% CI, -1.9 to 47.1%], respectively). When ANNs modeling was used to identify PSMVT, the area under receiver operating characteristic curve was 0.849 (95% CI, 0.807-0.901), which demonstrated better overall properties than logistic regression modeling (AUC = 0.716) (95% CI, 0.679-0.761). Conclusions ANNs modeling was a more accurate tool than logistic regression in predicting the occurrence of PSMVT following AP. More clinical factors or biomarkers may be incorporated into ANNs modeling to improve its predictive ability. © 2016 International Society on Thrombosis and Haemostasis.

  13. PREDICTION OF MALIGNANT BREAST LESIONS FROM MRI FEATURES: A COMPARISON OF ARTIFICIAL NEURAL NETWORK AND LOGISTIC REGRESSION TECHNIQUES

    PubMed Central

    McLaren, Christine E.; Chen, Wen-Pin; Nie, Ke; Su, Min-Ying

    2009-01-01

    Rationale and Objectives Dynamic contrast enhanced MRI (DCE-MRI) is a clinical imaging modality for detection and diagnosis of breast lesions. Analytical methods were compared for diagnostic feature selection and performance of lesion classification to differentiate between malignant and benign lesions in patients. Materials and Methods The study included 43 malignant and 28 benign histologically-proven lesions. Eight morphological parameters, ten gray level co-occurrence matrices (GLCM) texture features, and fourteen Laws’ texture features were obtained using automated lesion segmentation and quantitative feature extraction. Artificial neural network (ANN) and logistic regression analysis were compared for selection of the best predictors of malignant lesions among the normalized features. Results Using ANN, the final four selected features were compactness, energy, homogeneity, and Law_LS, with area under the receiver operating characteristic curve (AUC) = 0.82, and accuracy = 0.76. The diagnostic performance of these 4-features computed on the basis of logistic regression yielded AUC = 0.80 (95% CI, 0.688 to 0.905), similar to that of ANN. The analysis also shows that the odds of a malignant lesion decreased by 48% (95% CI, 25% to 92%) for every increase of 1 SD in the Law_LS feature, adjusted for differences in compactness, energy, and homogeneity. Using logistic regression with z-score transformation, a model comprised of compactness, NRL entropy, and gray level sum average was selected, and it had the highest overall accuracy of 0.75 among all models, with AUC = 0.77 (95% CI, 0.660 to 0.880). When logistic modeling of transformations using the Box-Cox method was performed, the most parsimonious model with predictors, compactness and Law_LS, had an AUC of 0.79 (95% CI, 0.672 to 0.898). Conclusion The diagnostic performance of models selected by ANN and logistic regression was similar. The analytic methods were found to be roughly equivalent in terms of predictive ability when a small number of variables were chosen. The robust ANN methodology utilizes a sophisticated non-linear model, while logistic regression analysis provides insightful information to enhance interpretation of the model features. PMID:19409817

  14. Logistic regression analysis of factors associated with avascular necrosis of the femoral head following femoral neck fractures in middle-aged and elderly patients.

    PubMed

    Ai, Zi-Sheng; Gao, You-Shui; Sun, Yuan; Liu, Yue; Zhang, Chang-Qing; Jiang, Cheng-Hua

    2013-03-01

    Risk factors for femoral neck fracture-induced avascular necrosis of the femoral head have not been elucidated clearly in middle-aged and elderly patients. Moreover, the high incidence of screw removal in China and its effect on the fate of the involved femoral head require statistical methods to reflect their intrinsic relationship. Ninety-nine patients older than 45 years with femoral neck fracture were treated by internal fixation between May 1999 and April 2004. Descriptive analysis, interaction analysis between associated factors, single factor logistic regression, multivariate logistic regression, and detailed interaction analysis were employed to explore potential relationships among associated factors. Avascular necrosis of the femoral head was found in 15 cases (15.2 %). Age × the status of implants (removal vs. maintenance) and gender × the timing of reduction were interactive according to two-factor interactive analysis. Age, the displacement of fractures, the quality of reduction, and the status of implants were found to be significant factors in single factor logistic regression analysis. Age, age × the status of implants, and the quality of reduction were found to be significant factors in multivariate logistic regression analysis. In fine interaction analysis after multivariate logistic regression analysis, implant removal was the most important risk factor for avascular necrosis in 56-to-85-year-old patients, with a risk ratio of 26.00 (95 % CI = 3.076-219.747). The middle-aged and elderly have less incidence of avascular necrosis of the femoral head following femoral neck fractures treated by cannulated screws. The removal of cannulated screws can induce a significantly high incidence of avascular necrosis of the femoral head in elderly patients, while a high-quality reduction is helpful to reduce avascular necrosis.

  15. Multivariate logistic regression analysis of postoperative complications and risk model establishment of gastrectomy for gastric cancer: A single-center cohort report.

    PubMed

    Zhou, Jinzhe; Zhou, Yanbing; Cao, Shougen; Li, Shikuan; Wang, Hao; Niu, Zhaojian; Chen, Dong; Wang, Dongsheng; Lv, Liang; Zhang, Jian; Li, Yu; Jiao, Xuelong; Tan, Xiaojie; Zhang, Jianli; Wang, Haibo; Zhang, Bingyuan; Lu, Yun; Sun, Zhenqing

    2016-01-01

    Reporting of surgical complications is common, but few provide information about the severity and estimate risk factors of complications. If have, but lack of specificity. We retrospectively analyzed data on 2795 gastric cancer patients underwent surgical procedure at the Affiliated Hospital of Qingdao University between June 2007 and June 2012, established multivariate logistic regression model to predictive risk factors related to the postoperative complications according to the Clavien-Dindo classification system. Twenty-four out of 86 variables were identified statistically significant in univariate logistic regression analysis, 11 significant variables entered multivariate analysis were employed to produce the risk model. Liver cirrhosis, diabetes mellitus, Child classification, invasion of neighboring organs, combined resection, introperative transfusion, Billroth II anastomosis of reconstruction, malnutrition, surgical volume of surgeons, operating time and age were independent risk factors for postoperative complications after gastrectomy. Based on logistic regression equation, p=Exp∑BiXi / (1+Exp∑BiXi), multivariate logistic regression predictive model that calculated the risk of postoperative morbidity was developed, p = 1/(1 + e((4.810-1.287X1-0.504X2-0.500X3-0.474X4-0.405X5-0.318X6-0.316X7-0.305X8-0.278X9-0.255X10-0.138X11))). The accuracy, sensitivity and specificity of the model to predict the postoperative complications were 86.7%, 76.2% and 88.6%, respectively. This risk model based on Clavien-Dindo grading severity of complications system and logistic regression analysis can predict severe morbidity specific to an individual patient's risk factors, estimate patients' risks and benefits of gastric surgery as an accurate decision-making tool and may serve as a template for the development of risk models for other surgical groups.

  16. Reduction of Racial Disparities in Prostate Cancer

    DTIC Science & Technology

    2008-12-01

    inhibitors, aspirin, anti-TNF medications), and other medications of interest (testosterone, finasteride , alpha receptor blockers). 12 We...0.01. There were 14 (7%) control-patients who had finasteride use, with an average of 398.6 doses per individual. None of the prostate cancer...patients had prior finasteride use. In a multiple logistic regression model (Table 2, see supporting materials), after adjustment for the matching

  17. Carrying a Weapon to School: The Influence of Youth Assets at Home and School

    ERIC Educational Resources Information Center

    Marsh, Shawn C.; Evans, William P.

    2007-01-01

    Eighth and tenth grade students (n= 1,619) reported on exposure to risk and protective assets in their day-to-day lives. The relationship between carrying a weapon to school and risk and protective factors in the home and school ecological domains was explored through logistic regression conducted separately by gender. Environmental control in the…

  18. Organochlorine pesticides accumulation and breast cancer: A hospital-based case-control study.

    PubMed

    He, Ting-Ting; Zuo, An-Jun; Wang, Ji-Gang; Zhao, Peng

    2017-05-01

    The aim of this study is to detect the accumulation status of organochlorine pesticides in breast cancer patients and to explore the relationship between organochlorine pesticides contamination and breast cancer development. We conducted a hospital-based case-control study in 56 patients with breast cancer and 46 patients with benign breast disease. We detected the accumulation level of several organochlorine pesticides products (β-hexachlorocyclohexane, γ-hexachlorocyclohexane, polychlorinated biphenyls-28, polychlorinated biphenyls-52, pentachlorothioanisole, and pp'-dichlorodiphenyldichloroethane) in breast adipose tissues of all 102 patients using gas chromatography. Thereafter, we examined the expression status of estrogen receptor, progesterone receptor, human epidermal growth factor receptor-2 (HER2), and Ki-67 in 56 breast cancer cases by immunohistochemistry. In addition, we analyzed the risk of breast cancer in those patients with organochlorine pesticides contamination using a logistic regression model. Our data showed that breast cancer patients suffered high accumulation levels of pp'-dichlorodiphenyldichloroethane and polychlorinated biphenyls-52. However, the concentrations of pp'-dichlorodiphenyldichloroethane and polychlorinated biphenyls-52 were not related to clinicopathologic parameters of breast cancer. Further logistic regression analysis showed polychlorinated biphenyls-52 and pp'-dichlorodiphenyldichloroethane were risk factors for breast cancer. Our results provide new evidence on etiology of breast cancer.

  19. Differences by Sexual Orientation in Expectations About Future Long-Term Care Needs Among Adults 40 to 65 Years Old.

    PubMed

    Henning-Smith, Carrie; Gonzales, Gilbert; Shippee, Tetyana P

    2015-11-01

    We examined whether and how lesbian, gay, and bisexual (LGB) adults between 40 and 65 years of age differ from heterosexual adults in long-term care (LTC) expectations. Our data were derived from the 2013 National Health Interview Survey. We used ordered logistic regression to compare the odds of expected future use of LTC among LGB (n = 297) and heterosexual (n = 13 120) adults. We also used logistic regression models to assess the odds of expecting to use specific sources of care. All models controlled for key socioeconomic characteristics. Although LGB adults had greater expectations of needing LTC in the future than their heterosexual counterparts, that association was largely explained by sociodemographic and health differences. After control for these differentials, LGB adults were less likely to expect care from family and more likely to expect to use institutional care in old age. LGB adults may rely more heavily than heterosexual adults on formal systems of care. As the older population continues to diversify, nursing homes and assisted living facilities should work to ensure safety and culturally sensitive best practices for older LGB groups.

  20. Differences by Sexual Orientation in Expectations About Future Long-Term Care Needs Among Adults 40 to 65 Years Old

    PubMed Central

    Gonzales, Gilbert; Shippee, Tetyana P.

    2015-01-01

    Objectives. We examined whether and how lesbian, gay, and bisexual (LGB) adults between 40 and 65 years of age differ from heterosexual adults in long-term care (LTC) expectations. Methods. Our data were derived from the 2013 National Health Interview Survey. We used ordered logistic regression to compare the odds of expected future use of LTC among LGB (n = 297) and heterosexual (n = 13 120) adults. We also used logistic regression models to assess the odds of expecting to use specific sources of care. All models controlled for key socioeconomic characteristics. Results. Although LGB adults had greater expectations of needing LTC in the future than their heterosexual counterparts, that association was largely explained by sociodemographic and health differences. After control for these differentials, LGB adults were less likely to expect care from family and more likely to expect to use institutional care in old age. Conclusions. LGB adults may rely more heavily than heterosexual adults on formal systems of care. As the older population continues to diversify, nursing homes and assisted living facilities should work to ensure safety and culturally sensitive best practices for older LGB groups. PMID:26378822

  1. A retrospective population-based study of primigravid women on the potential effect of threatened miscarriage on obstetric outcome.

    PubMed

    Mulik, Varsha; Bethel, Jackie; Bhal, K

    2004-04-01

    The aim of this study was to ascertain any potential link between threatened miscarriage and obstetric outcome. Threatened miscarriage was associated independently with an increased incidence of abruption (OR 2.8, 2.0-3.7), unexplained antepartum haemorrhage (APH) (OR 2.3, 1.1-5.1) and preterm delivery (OR 2.0, 1.3-3.3). The incidence of low and very low birth weight deliveries, although significantly higher compared with the control population, was not affected independently by this early pregnancy complication on logistic regression (OR 1.3, 0.8-1.9). The early neonatal mortality rates were significantly higher in the threatened miscarriage group, which on logistic regression was due independently to preterm delivery, placental abruption and low birth weight deliveries. All forms of APH were significantly higher in term deliveries complicated by threatened miscarriage. Pregnancies presenting with threatened miscarriage should be highlighted as 'high risk' for a suboptimal obstetric outcome and a prospective observational trial followed by a randomised-controlled trial may be needed to establish whether the need exists for increased feto-maternal surveillance in this cohort of women.

  2. Design of Embedded System for Multivariate Classification of Finger and Thumb Movements Using EEG Signals for Control of Upper Limb Prosthesis

    PubMed Central

    Javed, Amna; Tiwana, Mohsin I.; Khan, Umar Shahbaz

    2018-01-01

    Brain Computer Interface (BCI) determines the intent of the user from a variety of electrophysiological signals. These signals, Slow Cortical Potentials, are recorded from scalp, and cortical neuronal activity is recorded by implanted electrodes. This paper is focused on design of an embedded system that is used to control the finger movements of an upper limb prosthesis using Electroencephalogram (EEG) signals. This is a follow-up of our previous research which explored the best method to classify three movements of fingers (thumb movement, index finger movement, and first movement). Two-stage logistic regression classifier exhibited the highest classification accuracy while Power Spectral Density (PSD) was used as a feature of the filtered signal. The EEG signal data set was recorded using a 14-channel electrode headset (a noninvasive BCI system) from right-handed, neurologically intact volunteers. Mu (commonly known as alpha waves) and Beta Rhythms (8–30 Hz) containing most of the movement data were retained through filtering using “Arduino Uno” microcontroller followed by 2-stage logistic regression to obtain a mean classification accuracy of 70%. PMID:29888252

  3. Parental 'affectionless control' as an antecedent to adult depression: a risk factor refined.

    PubMed

    Mackinnon, A; Henderson, A S; Andrews, G

    1993-02-01

    It has been well established that individuals with a history of depression report their parents as being less caring and more overprotective of them than do controls. 'Affectionless control' in childhood has thus been proposed as a risk factor for depression. Evidence is presented from a logistic regression analysis of data from a volunteer community sample that lack of care rather than over-protection is the primary risk factor. No evidence for an interaction effect of low care and over-protection was found.

  4. Family-centered prevention ameliorates the association between adverse childhood experiences and prediabetes status in young black adults.

    PubMed

    Brody, Gene H; Yu, Tianyi; Chen, Edith; Miller, Gregory E

    2017-07-01

    Individuals exposed to adverse childhood experiences (ACEs) are vulnerable to various health problems later in life. This study was designed to determine whether participation in an efficacious program to enhance supportive parenting would ameliorate the association between ACEs and prediabetes status at age 25. Rural African American parents and their 11-year-old children (N=390) participated in the Strong African American Families (SAAF) program or a control condition. Each youth at age 25 provided a total ACEs score and a blood sample from which overnight fasting glucose was assayed. Logistic regression equations were used to test the hypotheses. The logistic regression analyses revealed a significant interaction between total ACEs and random assignment to SAAF or control, OR=0.56, 95% CI [0.36, 0.88]. Follow-up analyses indicated that, for participants in the control condition, a 1-point increase in ACEs was associated with a 37.3% increase in risk of having prediabetes. ACEs were not associated with the likelihood of having prediabetes among participants in the SAAF condition. Control participants with high total ACEs scores were 3.54 times more likely to have prediabetes than were SAAF participants with similar scores. This study indicated that participation at age 11 in a randomized controlled trial designed to enhance supportive parenting ameliorated the association of ACEs with prediabetes at age 25. If substantiated, these findings may provide a strategy for preventing negative health consequences of ACEs. Copyright © 2017 Elsevier Inc. All rights reserved.

  5. Climate controls the distribution of a widespread invasive species: Implications for future range expansion

    USGS Publications Warehouse

    McDowell, W.G.; Benson, A.J.; Byers, J.E.

    2014-01-01

    1. Two dominant drivers of species distributions are climate and habitat, both of which are changing rapidly. Understanding the relative importance of variables that can control distributions is critical, especially for invasive species that may spread rapidly and have strong effects on ecosystems. 2. Here, we examine the relative importance of climate and habitat variables in controlling the distribution of the widespread invasive freshwater clam Corbicula fluminea, and we model its future distribution under a suite of climate scenarios using logistic regression and maximum entropy modelling (MaxEnt). 3. Logistic regression identified climate variables as more important than habitat variables in controlling Corbicula distribution. MaxEnt modelling predicted Corbicula's range expansion westward and northward to occupy half of the contiguous United States. By 2080, Corbicula's potential range will expand 25–32%, with more than half of the continental United States being climatically suitable. 4. Our combination of multiple approaches has revealed the importance of climate over habitat in controlling Corbicula's distribution and validates the climate-only MaxEnt model, which can readily examine the consequences of future climate projections. 5. Given the strong influence of climate variables on Corbicula's distribution, as well as Corbicula's ability to disperse quickly and over long distances, Corbicula is poised to expand into New England and the northern Midwest of the United States. Thus, the direct effects of climate change will probably be compounded by the addition of Corbicula and its own influences on ecosystem function.

  6. Rank-Optimized Logistic Matrix Regression toward Improved Matrix Data Classification.

    PubMed

    Zhang, Jianguang; Jiang, Jianmin

    2018-02-01

    While existing logistic regression suffers from overfitting and often fails in considering structural information, we propose a novel matrix-based logistic regression to overcome the weakness. In the proposed method, 2D matrices are directly used to learn two groups of parameter vectors along each dimension without vectorization, which allows the proposed method to fully exploit the underlying structural information embedded inside the 2D matrices. Further, we add a joint [Formula: see text]-norm on two parameter matrices, which are organized by aligning each group of parameter vectors in columns. This added co-regularization term has two roles-enhancing the effect of regularization and optimizing the rank during the learning process. With our proposed fast iterative solution, we carried out extensive experiments. The results show that in comparison to both the traditional tensor-based methods and the vector-based regression methods, our proposed solution achieves better performance for matrix data classifications.

  7. Potential habitat distribution for the freshwater diatom Didymosphenia geminata in the continental US

    USGS Publications Warehouse

    Kumar, S.; Spaulding, S.A.; Stohlgren, T.J.; Hermann, K.A.; Schmidt, T.S.; Bahls, L.L.

    2009-01-01

    The diatom Didymosphenia geminata is a single-celled alga found in lakes, streams, and rivers. Nuisance blooms of D geminata affect the diversity, abundance, and productivity of other aquatic organisms. Because D geminata can be transported by humans on waders and other gear, accurate spatial prediction of habitat suitability is urgently needed for early detection and rapid response, as well as for evaluation of monitoring and control programs. We compared four modeling methods to predict D geminata's habitat distribution; two methods use presence-absence data (logistic regression and classification and regression tree [CART]), and two involve presence data (maximum entropy model [Maxent] and genetic algorithm for rule-set production [GARP]). Using these methods, we evaluated spatially explicit, bioclimatic and environmental variables as predictors of diatom distribution. The Maxent model provided the most accurate predictions, followed by logistic regression, CART, and GARP. The most suitable habitats were predicted to occur in the western US, in relatively cool sites, and at high elevations with a high base-flow index. The results provide insights into the factors that affect the distribution of D geminata and a spatial basis for the prediction of nuisance blooms. ?? The Ecological Society of America.

  8. Keeping children safe at home: protocol for three matched case–control studies of modifiable risk factors for falls

    PubMed Central

    Kendrick, Denise; Stewart, Jane; Clacy, Rose; Coffey, Frank; Cooper, Nicola; Coupland, Carol; Hayes, Mike; McColl, Elaine; Reading, Richard; Sutton, Alex; M L Towner, Elizabeth; Craig Watson, Michael

    2012-01-01

    Background Childhood falls result in considerable morbidity, mortality and health service use. Despite this, little evidence exists on protective factors or effective falls prevention interventions in young children. Objectives To estimate ORs for three types of medically attended fall injuries in young children in relation to safety equipment, safety behaviours and hazard reduction and explore differential effects by child and family factors and injury severity. Design Three multicentre case–control studies in UK hospitals with validation of parental reported exposures using home observations. Cases are aged 0–4 years with a medically attended fall injury occurring at home, matched on age and sex with community controls. Children attending hospital for other types of injury will serve as unmatched hospital controls. Matched analyses will use conditional logistic regression to adjust for potential confounding variables. Unmatched analyses will use unconditional logistic regression, adjusted for age, sex, deprivation and distance from hospital in addition to other confounders. Each study requires 496 cases and 1984 controls to detect an OR of 0.7, with 80% power, 5% significance level, a correlation between cases and controls of 0.1 and a range of exposure prevalences. Main outcome measures Falls on stairs, on one level and from furniture. Discussion As the largest in the field to date, these case control studies will adjust for potential confounders, validate measures of exposure and investigate modifiable risk factors for specific falls injury mechanisms. Findings should enhance the evidence base for falls prevention for young children. PMID:22628151

  9. Availability of tobacco cessation services in substance use disorder treatment programs: Impact of state tobacco control policy.

    PubMed

    Abraham, Amanda J; Bagwell-Adams, Grace; Jayawardhana, Jayani

    2017-08-01

    Given the high prevalence of smoking among substance use disorder (SUD) patients, the specialty SUD treatment system is an important target for adoption and implementation of tobacco cessation (TC) services. While research has addressed the impact of tobacco control on individual tobacco consumption, largely overlooked in the literature is the potential impact of state tobacco control policies on availability of services for tobacco cessation. This paper examines the association between state tobacco control policy and availability of TC services in SUD treatment programs in the United States. State tobacco control and state demographic data (n=51) were merged with treatment program data from the 2012 National Survey of Substance Abuse Treatment Services (n=10.413) to examine availability of TC screening, counseling and pharmacotherapy services in SUD treatment programs using multivariate logistic regression models clustered at the state-level. Approximately 60% of SUD treatment programs offered TC screening services, 41% offered TC counseling services and 26% offered TC pharmacotherapy services. Results of multivariate logistic regression showed the odds of offering TC services were greater for SUD treatment programs located in states with higher cigarette excise taxes and greater spending on tobacco prevention and control. Findings indicate cigarette excise taxes and recommended funding levels may be effective policy tools for increasing access to TC services in SUD treatment programs. Coupled with changes to insurance coverage for TC under the Affordable Care Act, state tobacco control policy tools may further reduce tobacco use in the United States. Published by Elsevier Ltd.

  10. Detecting DIF in Polytomous Items Using MACS, IRT and Ordinal Logistic Regression

    ERIC Educational Resources Information Center

    Elosua, Paula; Wells, Craig

    2013-01-01

    The purpose of the present study was to compare the Type I error rate and power of two model-based procedures, the mean and covariance structure model (MACS) and the item response theory (IRT), and an observed-score based procedure, ordinal logistic regression, for detecting differential item functioning (DIF) in polytomous items. A simulation…

  11. Accuracy of Bayes and Logistic Regression Subscale Probabilities for Educational and Certification Tests

    ERIC Educational Resources Information Center

    Rudner, Lawrence

    2016-01-01

    In the machine learning literature, it is commonly accepted as fact that as calibration sample sizes increase, Naïve Bayes classifiers initially outperform Logistic Regression classifiers in terms of classification accuracy. Applied to subtests from an on-line final examination and from a highly regarded certification examination, this study shows…

  12. Comparing Linear Discriminant Function with Logistic Regression for the Two-Group Classification Problem.

    ERIC Educational Resources Information Center

    Fan, Xitao; Wang, Lin

    The Monte Carlo study compared the performance of predictive discriminant analysis (PDA) and that of logistic regression (LR) for the two-group classification problem. Prior probabilities were used for classification, but the cost of misclassification was assumed to be equal. The study used a fully crossed three-factor experimental design (with…

  13. Effects of Social Class and School Conditions on Educational Enrollment and Achievement of Boys and Girls in Rural Viet Nam

    ERIC Educational Resources Information Center

    Nguyen, Phuong L.

    2006-01-01

    This study examines the effects of parental SES, school quality, and community factors on children's enrollment and achievement in rural areas in Viet Nam, using logistic regression and ordered logistic regression. Multivariate analysis reveals significant differences in educational enrollment and outcomes by level of household expenditures and…

  14. School Exits in the Milwaukee Parental Choice Program: Evidence of a Marketplace?

    ERIC Educational Resources Information Center

    Ford, Michael

    2011-01-01

    This article examines whether the large number of school exits from the Milwaukee school voucher program is evidence of a marketplace. Two logistic regression and multinomial logistic regression models tested the relation between the inability to draw large numbers of voucher students and the ability for a private school to remain viable. Data on…

  15. An empirical study of statistical properties of variance partition coefficients for multi-level logistic regression models

    USGS Publications Warehouse

    Li, Ji; Gray, B.R.; Bates, D.M.

    2008-01-01

    Partitioning the variance of a response by design levels is challenging for binomial and other discrete outcomes. Goldstein (2003) proposed four definitions for variance partitioning coefficients (VPC) under a two-level logistic regression model. In this study, we explicitly derived formulae for multi-level logistic regression model and subsequently studied the distributional properties of the calculated VPCs. Using simulations and a vegetation dataset, we demonstrated associations between different VPC definitions, the importance of methods for estimating VPCs (by comparing VPC obtained using Laplace and penalized quasilikehood methods), and bivariate dependence between VPCs calculated at different levels. Such an empirical study lends an immediate support to wider applications of VPC in scientific data analysis.

  16. Model building strategy for logistic regression: purposeful selection.

    PubMed

    Zhang, Zhongheng

    2016-03-01

    Logistic regression is one of the most commonly used models to account for confounders in medical literature. The article introduces how to perform purposeful selection model building strategy with R. I stress on the use of likelihood ratio test to see whether deleting a variable will have significant impact on model fit. A deleted variable should also be checked for whether it is an important adjustment of remaining covariates. Interaction should be checked to disentangle complex relationship between covariates and their synergistic effect on response variable. Model should be checked for the goodness-of-fit (GOF). In other words, how the fitted model reflects the real data. Hosmer-Lemeshow GOF test is the most widely used for logistic regression model.

  17. Brick mortar exposure and chronic lymphocytic leukemia.

    PubMed

    Markovic-Denic, L; Jankovic, S; Marinkovic, J; Radovanovic, Z

    1995-01-01

    A case-control study of 130 patients with chronic lymphocytic leukemia (CLL) and 130 controls matched with respect to sex, age (2 years), type of residence (urban-rural) and area of residence (according to the national per capita income) was carried out. Conditional logistic regression analysis showed that, apart of four risk factors already described in the literature (work in a hazardous industry, hair dye use, family history of leukemia and exposure to electromagnetic radiation), brick mortar exposure was also significantly related to CLL.

  18. Weight Fluctuation and Postmenopausal Breast Cancer in the National Health and Nutrition Examination Survey I Epidemiologic Follow-Up Study.

    PubMed

    Komaroff, Marina

    2016-01-01

    The aim of this study is to investigate if weight fluctuation is an independent risk factor for postmenopausal breast cancer (PBC) among women who gained weight in adult years. NHANES I Epidemiologic Follow-Up Study (NHEFS) database was used in the study. Women that were cancers-free at enrollment and diagnosed for the first time with breast cancer at age 50 or greater were considered cases. Controls were chosen from the subset of cancers-free women and matched to cases by years of follow-up and status of body mass index (BMI) at 25 years of age. Weight fluctuation was measured by the root-mean-square-error (RMSE) from a simple linear regression model for each woman with their body mass index (BMI) regressed on age (started at 25 years) while women with the positive slope from this regression were defined as weight gainers. Data were analyzed using conditional logistic regression models. A total of 158 women were included into the study. The conditional logistic regression adjusted for weight gain demonstrated positive association between weight fluctuation in adult years and postmenopausal breast cancers (odds ratio/OR = 1.67; 95% confidence interval/CI: 1.06-2.66). The data suggested that long-term weight fluctuation was significant risk factor for PBC among women who gained weight in adult years. This finding underscores the importance of maintaining lost weight and avoiding weight fluctuation.

  19. Weight Fluctuation and Postmenopausal Breast Cancer in the National Health and Nutrition Examination Survey I Epidemiologic Follow-Up Study

    PubMed Central

    Komaroff, Marina

    2016-01-01

    Objective. The aim of this study is to investigate if weight fluctuation is an independent risk factor for postmenopausal breast cancer (PBC) among women who gained weight in adult years. Methods. NHANES I Epidemiologic Follow-Up Study (NHEFS) database was used in the study. Women that were cancers-free at enrollment and diagnosed for the first time with breast cancer at age 50 or greater were considered cases. Controls were chosen from the subset of cancers-free women and matched to cases by years of follow-up and status of body mass index (BMI) at 25 years of age. Weight fluctuation was measured by the root-mean-square-error (RMSE) from a simple linear regression model for each woman with their body mass index (BMI) regressed on age (started at 25 years) while women with the positive slope from this regression were defined as weight gainers. Data were analyzed using conditional logistic regression models. Results. A total of 158 women were included into the study. The conditional logistic regression adjusted for weight gain demonstrated positive association between weight fluctuation in adult years and postmenopausal breast cancers (odds ratio/OR = 1.67; 95% confidence interval/CI: 1.06–2.66). Conclusions. The data suggested that long-term weight fluctuation was significant risk factor for PBC among women who gained weight in adult years. This finding underscores the importance of maintaining lost weight and avoiding weight fluctuation. PMID:26953120

  20. Risk Factors for Brachial Plexus Birth Injury

    PubMed Central

    Louden, Emily; Marcotte, Michael; Mehlman, Charles; Lippert, William; Huang, Bin; Paulson, Andrea

    2018-01-01

    Over the course of decades, the incidence of brachial plexus birth injury (BPBI) has increased despite advances in healthcare which would seem to assist in decreasing the rate. The aim of this study is to identify previously unknown risk factors for BPBI and the risk factors with potential to guide preventative measures. A case control study of 52 mothers who had delivered a child with a BPBI injury and 132 mothers who had delivered without BPBI injury was conducted. Univariate, multivariable and logistic regressions identified risk factors and their combinations. The odds of BPBI were 2.5 times higher when oxytocin was used and 3.7 times higher when tachysystole occurred. The odds of BPBI injury are increased when tachysystole and oxytocin occur during the mother’s labor. Logistic regression identified a higher risk for BPBI when more than three of the following variables (>30 lbs gained during the pregnancy, stage 2 labor >61.5 min, mother’s age >26.4 years, tachysystole, or fetal malpresentation) were present in any combination. PMID:29596309

  1. Are math readiness and personality predictive of first-year retention in engineering?

    PubMed

    Moses, Laurie; Hall, Cathy; Wuensch, Karl; De Urquidi, Karen; Kauffmann, Paul; Swart, William; Duncan, Steve; Dixon, Gene

    2011-01-01

    On the basis of J. G. Borkowski, L. K. Chan, and N. Muthukrishna's model of academic success (2000), the present authors hypothesized that freshman retention in an engineering program would be related to not only basic aptitude but also affective factors. Participants were 129 college freshmen with engineering as their stated major. Aptitude was measured by SAT verbal and math scores, high school grade-point average (GPA), and an assessment of calculus readiness. Affective factors were assessed by the NEO-Five Factor Inventory (FFI; P. I. Costa & R. R. McCrae, 2007), and the Nowicki-Duke Locus of Control (LOC) scale (S. Nowicki & M. Duke, 1974). A binary logistic regression analysis found that calculus readiness and high school GPA were predictive of retention. Scores on the Neuroticism and Openness subscales from the NEO-FFI and LOC were correlated with retention status, but Openness was the only affective factor with a significant unique effect in the binary logistic regression. Results of the study lend modest support to Borkowski's model.

  2. A population study of the contribution of medical comorbidity to the risk of prematurity in blacks.

    PubMed

    Ehrenthal, Deborah B; Jurkovitz, Claudine; Hoffman, Matthew; Kroelinger, Charlan; Weintraub, William

    2007-10-01

    The purpose of this study was to test the hypothesis that the higher prevalence of medical comorbidities among black women accounts for their increased risk of prematurity. A population-based regional cohort of women receiving obstetric care for singleton pregnancies at a large community hospital between 2003 and 2006 were analyzed using univariate and multivariable logistic regression. Data for 18,624 consecutive births found increased odds of adverse outcomes for black compared to white women: prematurity OR = 1.6 (1.4-1.8), extreme prematurity OR = 2.5 (2.0-3.2). Logistic regression modeling identified black race, age < 20, preconception diabetes and hypertension, smoking, underweight, and gestational hypertension as the greatest risks for adverse outcomes. Controlling for these risks did not attenuate the higher risk for prematurity among blacks. Though there is a greater burden of health risk among black women, this did not account for the higher rates of low birthweight and prematurity.

  3. A Pilot Study of Reasons and Risk Factors for "No-Shows" in a Pediatric Neurology Clinic.

    PubMed

    Guzek, Lindsay M; Fadel, William F; Golomb, Meredith R

    2015-09-01

    Missed clinic appointments lead to decreased patient access, worse patient outcomes, and increased healthcare costs. The goal of this pilot study was to identify reasons for and risk factors associated with missed pediatric neurology outpatient appointments ("no-shows"). This was a prospective cohort study of patients scheduled for 1 week of clinic. Data on patient clinical and demographic information were collected by record review; data on reasons for missed appointments were collected by phone interviews. Univariate and multivariate analyses were conducted using chi-square tests and multiple logistic regression to assess risk factors for missed appointments. Fifty-nine (25%) of 236 scheduled patients were no-shows. Scheduling conflicts (25.9%) and forgetting (20.4%) were the most common reasons for missed appointments. When controlling for confounding factors in the logistic regression, Medicaid (odds ratio 2.36), distance from clinic, and time since appointment was scheduled were associated with missed appointments. Further work in this area is needed. © The Author(s) 2014.

  4. Unevenness in Health at the Intersection of Gender and Sexuality: Sexual Minority Disparities in Alcohol and Drug Use Among Transwomen in the San Francisco Bay Area.

    PubMed

    Arayasirikul, Sean; Pomart, W Andres; Raymond, H Fisher; Wilson, Erin C

    2018-01-01

    Research on the health of transwomen is largely focused on heterosexual HIV risk. Little is known about the health of sexual minority transwomen. We conducted a secondary cross-sectional analysis of data from a HIV risk and resilience study of transwomen aged 16 to 24 years in the San Francisco Bay Area (N = 259). Prevalence and demographic characteristics of sexual minority transwomen was assessed and logistic regression models were used to examine the relationship between sexual minority status and alcohol and drug use. In logistic regression models, sexual minority transwomen had greater fold odds of heavy episodic drinking and illicit prescription drug use compared to their heterosexual counterparts, controlling for race/ethnicity, age, income, nativity, hormone status, and history of feminization procedures. These results suggest that sexual minority status may be an important social determinant of health among gender minorities. Populations of transwomen are heterogeneous; effective interventions must consider sexual minority status.

  5. Assessing landslide susceptibility by statistical data analysis and GIS: the case of Daunia (Apulian Apennines, Italy)

    NASA Astrophysics Data System (ADS)

    Ceppi, C.; Mancini, F.; Ritrovato, G.

    2009-04-01

    This study aim at the landslide susceptibility mapping within an area of the Daunia (Apulian Apennines, Italy) by a multivariate statistical method and data manipulation in a Geographical Information System (GIS) environment. Among the variety of existing statistical data analysis techniques, the logistic regression was chosen to produce a susceptibility map all over an area where small settlements are historically threatened by landslide phenomena. By logistic regression a best fitting between the presence or absence of landslide (dependent variable) and the set of independent variables is performed on the basis of a maximum likelihood criterion, bringing to the estimation of regression coefficients. The reliability of such analysis is therefore due to the ability to quantify the proneness to landslide occurrences by the probability level produced by the analysis. The inventory of dependent and independent variables were managed in a GIS, where geometric properties and attributes have been translated into raster cells in order to proceed with the logistic regression by means of SPSS (Statistical Package for the Social Sciences) package. A landslide inventory was used to produce the bivariate dependent variable whereas the independent set of variable concerned with slope, aspect, elevation, curvature, drained area, lithology and land use after their reductions to dummy variables. The effect of independent parameters on landslide occurrence was assessed by the corresponding coefficient in the logistic regression function, highlighting a major role played by the land use variable in determining occurrence and distribution of phenomena. Once the outcomes of the logistic regression are determined, data are re-introduced in the GIS to produce a map reporting the proneness to landslide as predicted level of probability. As validation of results and regression model a cell-by-cell comparison between the susceptibility map and the initial inventory of landslide events was performed and an agreement at 75% level achieved.

  6. Determination of riverbank erosion probability using Locally Weighted Logistic Regression

    NASA Astrophysics Data System (ADS)

    Ioannidou, Elena; Flori, Aikaterini; Varouchakis, Emmanouil A.; Giannakis, Georgios; Vozinaki, Anthi Eirini K.; Karatzas, George P.; Nikolaidis, Nikolaos

    2015-04-01

    Riverbank erosion is a natural geomorphologic process that affects the fluvial environment. The most important issue concerning riverbank erosion is the identification of the vulnerable locations. An alternative to the usual hydrodynamic models to predict vulnerable locations is to quantify the probability of erosion occurrence. This can be achieved by identifying the underlying relations between riverbank erosion and the geomorphological or hydrological variables that prevent or stimulate erosion. Thus, riverbank erosion can be determined by a regression model using independent variables that are considered to affect the erosion process. The impact of such variables may vary spatially, therefore, a non-stationary regression model is preferred instead of a stationary equivalent. Locally Weighted Regression (LWR) is proposed as a suitable choice. This method can be extended to predict the binary presence or absence of erosion based on a series of independent local variables by using the logistic regression model. It is referred to as Locally Weighted Logistic Regression (LWLR). Logistic regression is a type of regression analysis used for predicting the outcome of a categorical dependent variable (e.g. binary response) based on one or more predictor variables. The method can be combined with LWR to assign weights to local independent variables of the dependent one. LWR allows model parameters to vary over space in order to reflect spatial heterogeneity. The probabilities of the possible outcomes are modelled as a function of the independent variables using a logistic function. Logistic regression measures the relationship between a categorical dependent variable and, usually, one or several continuous independent variables by converting the dependent variable to probability scores. Then, a logistic regression is formed, which predicts success or failure of a given binary variable (e.g. erosion presence or absence) for any value of the independent variables. The erosion occurrence probability can be calculated in conjunction with the model deviance regarding the independent variables tested. The most straightforward measure for goodness of fit is the G statistic. It is a simple and effective way to study and evaluate the Logistic Regression model efficiency and the reliability of each independent variable. The developed statistical model is applied to the Koiliaris River Basin on the island of Crete, Greece. Two datasets of river bank slope, river cross-section width and indications of erosion were available for the analysis (12 and 8 locations). Two different types of spatial dependence functions, exponential and tricubic, were examined to determine the local spatial dependence of the independent variables at the measurement locations. The results show a significant improvement when the tricubic function is applied as the erosion probability is accurately predicted at all eight validation locations. Results for the model deviance show that cross-section width is more important than bank slope in the estimation of erosion probability along the Koiliaris riverbanks. The proposed statistical model is a useful tool that quantifies the erosion probability along the riverbanks and can be used to assist managing erosion and flooding events. Acknowledgements This work is part of an on-going THALES project (CYBERSENSORS - High Frequency Monitoring System for Integrated Water Resources Management of Rivers). The project has been co-financed by the European Union (European Social Fund - ESF) and Greek national funds through the Operational Program "Education and Lifelong Learning" of the National Strategic Reference Framework (NSRF) - Research Funding Program: THALES. Investing in knowledge society through the European Social Fund.

  7. Gender differences in hypertension control among older korean adults: Korean social life, health, and aging project.

    PubMed

    Chu, Sang Hui; Baek, Ji Won; Kim, Eun Sook; Stefani, Katherine M; Lee, Won Joon; Park, Yeong-Ran; Youm, Yoosik; Kim, Hyeon Chang

    2015-01-01

    Controlling blood pressure is a key step in reducing cardiovascular mortality in older adults. Gender differences in patients' attitudes after disease diagnosis and their management of the disease have been identified. However, it is unclear whether gender differences exist in hypertension management among older adults. We hypothesized that gender differences would exist among factors associated with hypertension diagnosis and control among community-dwelling, older adults. This cross-sectional study analyzed data from 653 Koreans aged ≥60 years who participated in the Korean Social Life, Health, and Aging Project. Multiple logistic regression was used to compare several variables between undiagnosed and diagnosed hypertension, and between uncontrolled and controlled hypertension. Diabetes was more prevalent in men and women who had uncontrolled hypertension than those with controlled hypertension or undiagnosed hypertension. High body mass index was significantly associated with uncontrolled hypertension only in men. Multiple logistic regression analysis indicated that in women, awareness of one's blood pressure level (odds ratio [OR], 2.86; p=0.003) and the number of blood pressure checkups over the previous year (OR, 1.06; p=0.011) might influence the likelihood of being diagnosed with hypertension. More highly educated women were more likely to have controlled hypertension than non-educated women (OR, 5.23; p=0.013). This study suggests that gender differences exist among factors associated with hypertension diagnosis and control in the study population of community-dwelling, older adults. Education-based health promotion strategies for hypertension control might be more effective in elderly women than in elderly men. Gender-specific approaches may be required to effectively control hypertension among older adults.

  8. A case-control study of the association between ulcerative colitis and hyperthyroidism in an Asian population.

    PubMed

    Tsai, Ming-Chieh; Lin, Herng-Ching; Lee, Cha-Ze

    2017-06-01

    Ulcerative colitis (UC) is a chronic relapsing inflammatory disease with significant clinical diversity. However, the aetiology, pathogenesis and optimal treatment of UC remain unclear. The purpose of this case-control study was to investigate the association between previously diagnosed hyperthyroidism and UC using a large population-based data set in Taiwan. The data for this population-based case-control study were retrieved from the Taiwan Longitudinal Health Insurance Database 2005. We included 2709 patients with UC as cases and 8127 sex- and age-matched patients without UC as controls. A conditional logistic regression analysis was conducted to compute the odds ratio (OR) and corresponding 95% confidence interval (CI) for the association between UC and prior hyperthyroidism. We found that, in total, 327 of the 10 836 sampled patients (3.02%) had previously been diagnosed with hyperthyroidism. There was a higher proportion of prior hyperthyroidism among cases than controls (4.10% vs 2.66%, P<.001). A conditional logistic regression showed that the OR of prior hyperthyroidism was 1.57 (95% CI=1.24-1.98) compared to controls. Similarly, after adjusting for monthly income, geographic location and urbanization level, cases were still more likely to have previously been diagnosed with hyperthyroidism than controls (OR=1.61, 95% CI=1.27-2.05). Furthermore, we analysed the ORs of prior hyperthyroidism between cases and controls according to age group. We found that of the youngest group of sampled patients (18-39 years), cases had the greatest adjusted OR for having previously been diagnosed with hyperthyroidism than controls (OR=1.98, 95% CI=1.04-3.79). This study demonstrated an association between UC and hyperthyroidism. © 2017 John Wiley & Sons Ltd.

  9. Landslide susceptibility mapping using frequency ratio, logistic regression, artificial neural networks and their comparison: A case study from Kat landslides (Tokat—Turkey)

    NASA Astrophysics Data System (ADS)

    Yilmaz, Işık

    2009-06-01

    The purpose of this study is to compare the landslide susceptibility mapping methods of frequency ratio (FR), logistic regression and artificial neural networks (ANN) applied in the Kat County (Tokat—Turkey). Digital elevation model (DEM) was first constructed using GIS software. Landslide-related factors such as geology, faults, drainage system, topographical elevation, slope angle, slope aspect, topographic wetness index (TWI) and stream power index (SPI) were used in the landslide susceptibility analyses. Landslide susceptibility maps were produced from the frequency ratio, logistic regression and neural networks models, and they were then compared by means of their validations. The higher accuracies of the susceptibility maps for all three models were obtained from the comparison of the landslide susceptibility maps with the known landslide locations. However, respective area under curve (AUC) values of 0.826, 0.842 and 0.852 for frequency ratio, logistic regression and artificial neural networks showed that the map obtained from ANN model is more accurate than the other models, accuracies of all models can be evaluated relatively similar. The results obtained in this study also showed that the frequency ratio model can be used as a simple tool in assessment of landslide susceptibility when a sufficient number of data were obtained. Input process, calculations and output process are very simple and can be readily understood in the frequency ratio model, however logistic regression and neural networks require the conversion of data to ASCII or other formats. Moreover, it is also very hard to process the large amount of data in the statistical package.

  10. Burnout does not help predict depression among French school teachers.

    PubMed

    Bianchi, Renzo; Schonfeld, Irvin Sam; Laurent, Eric

    2015-11-01

    Burnout has been viewed as a phase in the development of depression. However, supportive research is scarce. We examined whether burnout predicted depression among French school teachers. We conducted a 2-wave, 21-month study involving 627 teachers (73% female) working in French primary and secondary schools. Burnout was assessed with the Maslach Burnout Inventory and depression with the 9-item depression module of the Patient Health Questionnaire (PHQ-9). The PHQ-9 grades depressive symptom severity and provides a provisional diagnosis of major depression. Depression was treated both as a continuous and categorical variable using linear and logistic regression analyses. We controlled for gender, age, and length of employment. Controlling for baseline depressive symptoms, linear regression analysis showed that burnout symptoms at time 1 (T1) did not predict depressive symptoms at time 2 (T2). Baseline depressive symptoms accounted for about 88% of the association between T1 burnout and T2 depressive symptoms. Only baseline depressive symptoms predicted depressive symptoms at follow-up. Similarly, logistic regression analysis revealed that burnout symptoms at T1 did not predict incident cases of major depression at T2 when depressive symptoms at T1 were included in the predictive model. Only baseline depressive symptoms predicted cases of major depression at follow-up. This study does not support the view that burnout is a phase in the development of depression. Assessing burnout symptoms in addition to "classical" depressive symptoms may not always improve our ability to predict future depression.

  11. Gene-environment interaction between adiponectin gene polymorphisms and environmental factors on the risk of diabetic retinopathy.

    PubMed

    Li, Yuan; Wu, Qun Hong; Jiao, Ming Li; Fan, Xiao Hong; Hu, Quan; Hao, Yan Hua; Liu, Ruo Hong; Zhang, Wei; Cui, Yu; Han, Li Yuan

    2015-01-01

    To evaluate whether the adiponectin gene is associated with diabetic retinopathy (DR) risk and interaction with environmental factors modifies the DR risk, and to investigate the relationship between serum adiponectin levels and DR. Four adiponectin polymorphisms were evaluated in 372 DR cases and 145 controls. Differences in environmental factors between cases and controls were evaluated by unconditional logistic regression analysis. The model-free multifactor dimensionality reduction method and traditional multiple regression models were applied to explore interactions between the polymorphisms and environmental factors. Using the Bonferroni method, we found no significant associations between four adiponectin polymorphisms and DR susceptibility. Multivariate logistic regression found that physical activity played a protective role in the progress of DR, whereas family history of diabetes (odds ratio 1.75) and insulin therapy (odds ratio 1.78) were associated with an increased risk for DR. The interaction between the C-11377 G (rs266729) polymorphism and insulin therapy might be associated with DR risk. Family history of diabetes combined with insulin therapy also increased the risk of DR. No adiponectin gene polymorphisms influenced the serum adiponectin levels. Serum adiponectin levels did not differ between the DR group and non-DR group. No significant association was identified between four adiponectin polymorphisms and DR susceptibility after stringent Bonferroni correction. The interaction between C-11377G (rs266729) polymorphism and insulin therapy, as well as the interaction between family history of diabetes and insulin therapy, might be associated with DR susceptibility.

  12. Food and Drug Administration tobacco regulation and product judgments.

    PubMed

    Kaufman, Annette R; Finney Rutten, Lila J; Parascandola, Mark; Blake, Kelly D; Augustson, Erik M

    2015-04-01

    The Family Smoking Prevention and Tobacco Control Act granted the Food and Drug Administration (FDA) the authority to regulate tobacco products in the U.S. However, little is known about how regulation may be related to judgments about tobacco product-related risks. To understand how FDA tobacco regulation beliefs are associated with judgments about tobacco product-related risks. The Health Information National Trends Survey is a national survey of the U.S. adult population. Data used in this analysis were collected from October 2012 through January 2013 (N=3,630) by mailed questionnaire and analyzed in 2013. Weighted bivariate chi-square analyses were used to assess associations among FDA regulation belief, tobacco harm judgments, sociodemographics, and smoking status. A weighted multinomial logistic regression was conducted where FDA regulation belief was regressed on tobacco product judgments, controlling for sociodemographic variables and smoking status. About 41% believed that the FDA regulates tobacco products in the U.S., 23.6% reported the FDA does not, and 35.3% did not know. Chi-square analyses showed that smoking status was significantly related to harm judgments about electronic cigarettes (p<0.0001). The multinomial logistic regression revealed that uncertainty about FDA regulation was associated with tobacco product harm judgment uncertainty. Tobacco product harm perceptions are associated with beliefs about tobacco product regulation by the FDA. These findings suggest the need for increased public awareness and understanding of the role of tobacco product regulation in protecting public health. Copyright © 2015. Published by Elsevier Inc.

  13. Effectiveness of electronic stability control on single-vehicle accidents.

    PubMed

    Lyckegaard, Allan; Hels, Tove; Bernhoft, Inger Marie

    2015-01-01

    This study aims at evaluating the effectiveness of electronic stability control (ESC) on single-vehicle injury accidents while controlling for a number of confounders influencing the accident risk. Using police-registered injury accidents from 2004 to 2011 in Denmark with cars manufactured in the period 1998 to 2011 and the principle of induced exposure, 2 measures of the effectiveness of ESC were calculated: The crude odds ratio and the adjusted odds ratio, the latter by means of logistic regression. The logistic regression controlled for a number of confounding factors, of which the following were significant. For the driver: Age, gender, driving experience, valid driving license, and seat belt use. For the vehicle: Year of registration, weight, and ESC. For the accident surroundings: Visibility, light, and location. Finally, for the road: Speed limit, surface, and section characteristics. The present study calculated the crude odds ratio for ESC-equipped cars of getting in a single-vehicle injury accident as 0.40 (95% confidence interval [CI], 0.34-0.47) and the adjusted odds ratio as 0.69 (95% CI, 0.54-0.88). No difference was found in the effectiveness of ESC across the injury severity categories (slight, severe, and fatal). In line with previous results, this study concludes that ESC reduces the risk for single-vehicle injury accidents by 31% when controlling for various confounding factors related to the driver, the car, and the accident surroundings. Furthermore, it is concluded that it is important to control for human factors (at a minimum age and gender) in analyses where evaluations of this type are performed.

  14. A case-control study of the risk factors for obstetric fistula in Tigray, Ethiopia.

    PubMed

    Lewis Wall, L; Belay, Shewaye; Haregot, Tesfahun; Dukes, Jonathan; Berhan, Eyoel; Abreha, Melaku

    2017-12-01

    We tested the null hypothesis that there were no differences between patients with obstetric fistula and parous controls without fistula. A unmatched case-control study was carried out comparing 75 women with a history of obstetric fistula with 150 parous controls with no history of fistula. Height and weight were measured for each participant, along with basic socio-demographic and obstetric information. Descriptive statistics were calculated and differences between the groups were analyzed using Student's t test, Mann-Whitney U test where appropriate, and Chi-squared or Fisher's exact test, along with backward stepwise logistic regression analyses to detect predictors of obstetric fistula. Associations with a p value <0.05 were considered significant. Patients with fistulas married earlier and delivered their first pregnancies earlier than controls. They had significantly less education, a higher prevalence of divorce/separation, and lived in more impoverished circumstances than controls. Fistula patients had worse reproductive histories, with greater numbers of stillbirths/abortions and higher rates of assisted vaginal delivery and cesarean section. The final logistic regression model found four significant risk factors for developing an obstetric fistula: age at marriage (OR 1.23), history of assisted vaginal delivery (OR 3.44), lack of adequate antenatal care (OR 4.43), and a labor lasting longer than 1 day (OR 14.84). Our data indicate that obstetric fistula results from the lack of access to effective obstetrical services when labor is prolonged. Rural poverty and lack of adequate transportation infrastructure are probably important co-factors in inhibiting access to needed care.

  15. Alzheimer's disease is associated with prostate cancer: a population-based study.

    PubMed

    Lin, Herng-Ching; Kao, Li-Ting; Chung, Shiu-Dong; Huang, Chung-Chien; Shia, Ben-Chang; Huang, Chao-Yuan

    2018-01-26

    Alzheimer's disease and cancer are increasingly prevalent with advancing age. However, the association between Alzheimer's disease and prostate cancer remains unclear. The aim of this study was to examine the relationship between prior Alzheimer's disease and subsequent prostate cancer using a population-based dataset in Taiwan. Data for this study were sourced from the Taiwan Longitudinal Health Insurance Database 2005. This case-control study included 2101 prostate cancer patients as cases and 6303 matched controls. We used conditional logistic regression analyses to calculate the odds ratio (OR) and corresponding 95% confidence interval (CI) for Alzheimer's disease between prostate cancer patients and controls. We found that of the 8404 sampled patients, 128 (1.5%) had been diagnosed with Alzheimer's disease prior to the index date. A Chi-squared test showed that there was a significant difference in the prevalences of prior Alzheimer's disease between prostate cancer patients and controls (2.1% vs. 1.3%, p < 0.001). The conditional logistic regression analysis showed that the OR of prior Alzheimer's disease for prostate cancer patients was 1.53 (95% CI: 1.06∼2.21) compared to controls. Furthermore, the OR of prior Alzheimer's disease for prostate cancer patients was 1.52 (95% CI: 1.04∼2.22) compared to controls after adjusting for hypertension, diabetes, coronary heart disease, hyperlipidemia, obesity, prostatitis, gonorrhea or chlamydia infection, testitis or epididymitis, and alcohol abuse/alcohol dependency syndrome. This study revealed an association between prior Alzheimer's disease and prostate cancer. We suggest that clinicians be alert to the increased risk of prostate cancer when caring for elderly individuals with Alzheimer's disease.

  16. A Comparison of Logistic Regression, Neural Networks, and Classification Trees Predicting Success of Actuarial Students

    ERIC Educational Resources Information Center

    Schumacher, Phyllis; Olinsky, Alan; Quinn, John; Smith, Richard

    2010-01-01

    The authors extended previous research by 2 of the authors who conducted a study designed to predict the successful completion of students enrolled in an actuarial program. They used logistic regression to determine the probability of an actuarial student graduating in the major or dropping out. They compared the results of this study with those…

  17. Logistic regression accuracy across different spatial and temporal scales for a wide-ranging species, the marbled murrelet

    Treesearch

    Carolyn B. Meyer; Sherri L. Miller; C. John Ralph

    2004-01-01

    The scale at which habitat variables are measured affects the accuracy of resource selection functions in predicting animal use of sites. We used logistic regression models for a wide-ranging species, the marbled murrelet, (Brachyramphus marmoratus) in a large region in California to address how much changing the spatial or temporal scale of...

  18. Odds Ratio, Delta, ETS Classification, and Standardization Measures of DIF Magnitude for Binary Logistic Regression

    ERIC Educational Resources Information Center

    Monahan, Patrick O.; McHorney, Colleen A.; Stump, Timothy E.; Perkins, Anthony J.

    2007-01-01

    Previous methodological and applied studies that used binary logistic regression (LR) for detection of differential item functioning (DIF) in dichotomously scored items either did not report an effect size or did not employ several useful measures of DIF magnitude derived from the LR model. Equations are provided for these effect size indices.…

  19. A Generalized Logistic Regression Procedure to Detect Differential Item Functioning among Multiple Groups

    ERIC Educational Resources Information Center

    Magis, David; Raiche, Gilles; Beland, Sebastien; Gerard, Paul

    2011-01-01

    We present an extension of the logistic regression procedure to identify dichotomous differential item functioning (DIF) in the presence of more than two groups of respondents. Starting from the usual framework of a single focal group, we propose a general approach to estimate the item response functions in each group and to test for the presence…

  20. Risk Factors of Falls in Community-Dwelling Older Adults: Logistic Regression Tree Analysis

    ERIC Educational Resources Information Center

    Yamashita, Takashi; Noe, Douglas A.; Bailer, A. John

    2012-01-01

    Purpose of the Study: A novel logistic regression tree-based method was applied to identify fall risk factors and possible interaction effects of those risk factors. Design and Methods: A nationally representative sample of American older adults aged 65 years and older (N = 9,592) in the Health and Retirement Study 2004 and 2006 modules was used.…

  1. Estimation of Logistic Regression Models in Small Samples. A Simulation Study Using a Weakly Informative Default Prior Distribution

    ERIC Educational Resources Information Center

    Gordovil-Merino, Amalia; Guardia-Olmos, Joan; Pero-Cebollero, Maribel

    2012-01-01

    In this paper, we used simulations to compare the performance of classical and Bayesian estimations in logistic regression models using small samples. In the performed simulations, conditions were varied, including the type of relationship between independent and dependent variable values (i.e., unrelated and related values), the type of variable…

  2. Acculturation and healthy lifestyle habits among Hispanics in United States-Mexico border communities.

    PubMed

    Ghaddar, Suad; Brown, Cynthia J; Pagán, José A; Díaz, Violeta

    2010-09-01

    To explore the relationship between acculturation and healthy lifestyle habits in the largely Hispanic populations living in underserved communities in the United States of America along the U.S.-Mexico border. A cross-sectional study was conducted from April 2006 to June 2008 using survey data from the Alliance for a Healthy Border, a program designed to reduce health disparities in the U.S.-Mexico border region by funding nutrition and physical activity education programs at 12 federally qualified community health centers in Arizona, California, New Mexico, and Texas. The survey included questions on acculturation, diet, exercise, and demographic factors and was completed by 2,381 Alliance program participants, of whom 95.3% were Hispanic and 45.4% were under the U.S. poverty level for 2007. Chi-square (χ2) and Student's t tests were used for bivariate comparisons between acculturation and dietary and physical activity measures. Linear regression and binary logistic regression were used to control for factors associated with nutrition and exercise. Based on univariate tests and confirmed by regression analysis controlling for sociodemographic and health variables, less acculturated survey respondents reported a significantly higher frequency of fruit and vegetable consumption and healthier dietary habits than those who were more acculturated. Adjusted binary logistic regression confirmed that individuals with low language acculturation were less likely to engage in physical activity than those with moderate to high acculturation (odds ratio 0.75, 95% confidence interval 0.59-0.95). Findings confirmed an association between acculturation and healthy lifestyle habits and supported the hypothesis that acculturation in border community populations tends to decrease the practice of some healthy dietary habits while increasing exposure to and awareness of the importance of other healthy behaviors.

  3. Why do some studies find that CPR fraction is not a predictor of survival?

    PubMed

    Wik, Lars; Olsen, Jan-Aage; Persse, David; Sterz, Fritz; Lozano, Michael; Brouwer, Marc A; Westfall, Mark; Souders, Chris M; Travis, David T; Herken, Ulrich R; Lerner, E Brooke

    2016-07-01

    An 80% chest compression fraction (CCF) during resuscitation is recommended. However, heterogeneous results in CCF studies were found during the 2015 Consensus on Science (CoS), which may be because chest compressions are stopped for a wide variety of reasons including providing lifesaving care, provider distraction, fatigue, confusion, and inability to perform lifesaving skills efficiently. The effect of confounding variables on CCF to predict cardiac arrest survival. A secondary analysis of emergency medical services (EMS) treated out-of-hospital cardiac arrest (OHCA) patients who received manual compressions. CCF (percent of time patients received compressions) was determined from electronic defibrillator files. Two Sample Wilcoxon Rank Sum or regression determined a statistical association between CCF and age, gender, bystander CPR, public location, witnessed arrest, shockable rhythm, resuscitation duration, study site, and number of shocks. Univariate and multivariate logistic regressions were used to determine CCF effect on survival. Of 2132 patients with manual compressions 1997 had complete data. Shockable rhythm (p<0.001), public location (p<0.004), treatment duration (p<0.001), and number of shocks (p<0.001) were associated with lower CCF. Univariate logistic regression found that CCF was inversely associated with survival (OR 0.07; 95% CI 0.01-0.36). Multivariate regression controlling for factors associated with survival and/or CCF found that increasing CCF was associated with survival (OR 6.34; 95% CI 1.02-39.5). CCF cannot be looked at in isolation as a predictor of survival, but in the context of other resuscitation activities. When controlling for the effects of other resuscitation activities, a higher CCF is predictive of survival. This may explain the heterogeneity of findings during the CoS review. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  4. Using multiple logistic regression and GIS technology to predict landslide hazard in northeast Kansas, USA

    USGS Publications Warehouse

    Ohlmacher, G.C.; Davis, J.C.

    2003-01-01

    Landslides in the hilly terrain along the Kansas and Missouri rivers in northeastern Kansas have caused millions of dollars in property damage during the last decade. To address this problem, a statistical method called multiple logistic regression has been used to create a landslide-hazard map for Atchison, Kansas, and surrounding areas. Data included digitized geology, slopes, and landslides, manipulated using ArcView GIS. Logistic regression relates predictor variables to the occurrence or nonoccurrence of landslides within geographic cells and uses the relationship to produce a map showing the probability of future landslides, given local slopes and geologic units. Results indicated that slope is the most important variable for estimating landslide hazard in the study area. Geologic units consisting mostly of shale, siltstone, and sandstone were most susceptible to landslides. Soil type and aspect ratio were considered but excluded from the final analysis because these variables did not significantly add to the predictive power of the logistic regression. Soil types were highly correlated with the geologic units, and no significant relationships existed between landslides and slope aspect. ?? 2003 Elsevier Science B.V. All rights reserved.

  5. A Method for Calculating the Probability of Successfully Completing a Rocket Propulsion Ground Test

    NASA Technical Reports Server (NTRS)

    Messer, Bradley

    2007-01-01

    Propulsion ground test facilities face the daily challenge of scheduling multiple customers into limited facility space and successfully completing their propulsion test projects. Over the last decade NASA s propulsion test facilities have performed hundreds of tests, collected thousands of seconds of test data, and exceeded the capabilities of numerous test facility and test article components. A logistic regression mathematical modeling technique has been developed to predict the probability of successfully completing a rocket propulsion test. A logistic regression model is a mathematical modeling approach that can be used to describe the relationship of several independent predictor variables X(sub 1), X(sub 2),.., X(sub k) to a binary or dichotomous dependent variable Y, where Y can only be one of two possible outcomes, in this case Success or Failure of accomplishing a full duration test. The use of logistic regression modeling is not new; however, modeling propulsion ground test facilities using logistic regression is both a new and unique application of the statistical technique. Results from this type of model provide project managers with insight and confidence into the effectiveness of rocket propulsion ground testing.

  6. Predicting risk for portal vein thrombosis in acute pancreatitis patients: A comparison of radical basis function artificial neural network and logistic regression models.

    PubMed

    Fei, Yang; Hu, Jian; Gao, Kun; Tu, Jianfeng; Li, Wei-Qin; Wang, Wei

    2017-06-01

    To construct a radical basis function (RBF) artificial neural networks (ANNs) model to predict the incidence of acute pancreatitis (AP)-induced portal vein thrombosis. The analysis included 353 patients with AP who had admitted between January 2011 and December 2015. RBF ANNs model and logistic regression model were constructed based on eleven factors relevant to AP respectively. Statistical indexes were used to evaluate the value of the prediction in two models. The predict sensitivity, specificity, positive predictive value, negative predictive value and accuracy by RBF ANNs model for PVT were 73.3%, 91.4%, 68.8%, 93.0% and 87.7%, respectively. There were significant differences between the RBF ANNs and logistic regression models in these parameters (P<0.05). In addition, a comparison of the area under receiver operating characteristic curves of the two models showed a statistically significant difference (P<0.05). The RBF ANNs model is more likely to predict the occurrence of PVT induced by AP than logistic regression model. D-dimer, AMY, Hct and PT were important prediction factors of approval for AP-induced PVT. Copyright © 2017 Elsevier Inc. All rights reserved.

  7. [Associations between dormitory environment/other factors and sleep quality of medical students].

    PubMed

    Zheng, Bang; Wang, Kailu; Pan, Ziqi; Li, Man; Pan, Yuting; Liu, Ting; Xu, Dan; Lyu, Jun

    2016-03-01

    To investigate the sleep quality and related factors among medical students in China, understand the association between dormitory environment and sleep quality, and provide evidence and recommendations for sleep hygiene intervention. A total of 555 undergraduate students were selected from a medical school of an university in Beijing through stratified-cluster random-sampling to conduct a questionnaire survey by using Chinese version of Pittsburgh Sleep Quality Index (PSQI) and self-designed questionnaire. Analyses were performed by using multiple logistic regression model as well as multilevel linear regression model. The prevalence of sleep disorder was 29.1%(149/512), and 39.1%(200/512) of the students reported that the sleep quality was influenced by dormitory environment. PSQI score was negatively correlated with self-reported rating of dormitory environment (γs=-0.310, P<0.001). Logistic regression analysis showed the related factors of sleep disorder included grade, sleep regularity, self-rated health status, pressures of school work and employment, as well as dormitory environment. RESULTS of multilevel regression analysis also indicated that perception on dormitory environment (individual level) was associated with sleep quality with the dormitory level random effects under control (b=-0.619, P<0.001). The prevalence of sleep disorder was high in medical students, which was associated with multiple factors. Dormitory environment should be taken into consideration when the interventions are taken to improve the sleep quality of students.

  8. Risk Factors for Drug Abuse among Nepalese Samples Selected from a Town of Eastern Nepal

    ERIC Educational Resources Information Center

    Niraula, Surya Raj; Chhetry, Devendra Bahadur; Singh, Girish Kumar; Nagesh, S.; Shyangwa, Pramod Mohan

    2009-01-01

    The study focuses on the serious issue related to the adolescents' and adults' behavior and health. It aims to identify the risk factors for drug abuse from samples taken from a town of Eastern Nepal. This is a matched case-control study. The conditional logistic regression method was adopted for data analysis. The diagnosis cut off was determined…

  9. Identification of an Interaction between VWF rs7965413 and Platelet Count as a Novel Risk Marker for Metabolic Syndrome: An Extensive Search of Candidate Polymorphisms in a Case-Control Study

    PubMed Central

    Nakatochi, Masahiro; Ushida, Yasunori; Yasuda, Yoshinari; Yoshida, Yasuko; Kawai, Shun; Kato, Ryuji; Nakashima, Toru; Iwata, Masamitsu; Kuwatsuka, Yachiyo; Ando, Masahiko; Hamajima, Nobuyuki; Kondo, Takaaki; Oda, Hiroaki; Hayashi, Mutsuharu; Kato, Sawako; Yamaguchi, Makoto; Maruyama, Shoichi; Matsuo, Seiichi; Honda, Hiroyuki

    2015-01-01

    Although many single nucleotide polymorphisms (SNPs) have been identified to be associated with metabolic syndrome (MetS), there was only a slight improvement in the ability to predict future MetS by the simply addition of SNPs to clinical risk markers. To improve the ability to predict future MetS, combinational effects, such as SNP—SNP interaction, SNP—environment interaction, and SNP—clinical parameter (SNP × CP) interaction should be also considered. We performed a case-control study to explore novel SNP × CP interactions as risk markers for MetS based on health check-up data of Japanese male employees. We selected 99 SNPs that were previously reported to be associated with MetS and components of MetS; subsequently, we genotyped these SNPs from 360 cases and 1983 control subjects. First, we performed logistic regression analyses to assess the association of each SNP with MetS. Of these SNPs, five SNPs were significantly associated with MetS (P < 0.05): LRP2 rs2544390, rs1800592 between UCP1 and TBC1D9, APOA5 rs662799, VWF rs7965413, and rs1411766 between MYO16 and IRS2. Furthermore, we performed multiple logistic regression analyses, including an SNP term, a CP term, and an SNP × CP interaction term for each CP and SNP that was significantly associated with MetS. We identified a novel SNP × CP interaction between rs7965413 and platelet count that was significantly associated with MetS [SNP term: odds ratio (OR) = 0.78, P = 0.004; SNP × CP interaction term: OR = 1.33, P = 0.001]. This association of the SNP × CP interaction with MetS remained nominally significant in multiple logistic regression analysis after adjustment for either the number of MetS components or MetS components excluding obesity. Our results reveal new insight into platelet count as a risk marker for MetS. PMID:25646961

  10. Identification of an interaction between VWF rs7965413 and platelet count as a novel risk marker for metabolic syndrome: an extensive search of candidate polymorphisms in a case-control study.

    PubMed

    Nakatochi, Masahiro; Ushida, Yasunori; Yasuda, Yoshinari; Yoshida, Yasuko; Kawai, Shun; Kato, Ryuji; Nakashima, Toru; Iwata, Masamitsu; Kuwatsuka, Yachiyo; Ando, Masahiko; Hamajima, Nobuyuki; Kondo, Takaaki; Oda, Hiroaki; Hayashi, Mutsuharu; Kato, Sawako; Yamaguchi, Makoto; Maruyama, Shoichi; Matsuo, Seiichi; Honda, Hiroyuki

    2015-01-01

    Although many single nucleotide polymorphisms (SNPs) have been identified to be associated with metabolic syndrome (MetS), there was only a slight improvement in the ability to predict future MetS by the simply addition of SNPs to clinical risk markers. To improve the ability to predict future MetS, combinational effects, such as SNP-SNP interaction, SNP-environment interaction, and SNP-clinical parameter (SNP × CP) interaction should be also considered. We performed a case-control study to explore novel SNP × CP interactions as risk markers for MetS based on health check-up data of Japanese male employees. We selected 99 SNPs that were previously reported to be associated with MetS and components of MetS; subsequently, we genotyped these SNPs from 360 cases and 1983 control subjects. First, we performed logistic regression analyses to assess the association of each SNP with MetS. Of these SNPs, five SNPs were significantly associated with MetS (P < 0.05): LRP2 rs2544390, rs1800592 between UCP1 and TBC1D9, APOA5 rs662799, VWF rs7965413, and rs1411766 between MYO16 and IRS2. Furthermore, we performed multiple logistic regression analyses, including an SNP term, a CP term, and an SNP × CP interaction term for each CP and SNP that was significantly associated with MetS. We identified a novel SNP × CP interaction between rs7965413 and platelet count that was significantly associated with MetS [SNP term: odds ratio (OR) = 0.78, P = 0.004; SNP × CP interaction term: OR = 1.33, P = 0.001]. This association of the SNP × CP interaction with MetS remained nominally significant in multiple logistic regression analysis after adjustment for either the number of MetS components or MetS components excluding obesity. Our results reveal new insight into platelet count as a risk marker for MetS.

  11. Further investigations of the W-test for pairwise epistasis testing.

    PubMed

    Howey, Richard; Cordell, Heather J

    2017-01-01

    Background: In a recent paper, a novel W-test for pairwise epistasis testing was proposed that appeared, in computer simulations, to have higher power than competing alternatives. Application to genome-wide bipolar data detected significant epistasis between SNPs in genes of relevant biological function. Network analysis indicated that the implicated genes formed two separate interaction networks, each containing genes highly related to autism and neurodegenerative disorders. Methods: Here we investigate further the properties and performance of the W-test via theoretical evaluation, computer simulations and application to real data. Results: We demonstrate that, for common variants, the W-test is closely related to several existing tests of association allowing for interaction, including logistic regression on 8 degrees of freedom, although logistic regression can show inflated type I error for low minor allele frequencies,  whereas the W-test shows good/conservative type I error control. Although in some situations the W-test can show higher power, logistic regression is not limited to tests on 8 degrees of freedom but can instead be tailored to impose greater structure on the assumed alternative hypothesis, offering a power advantage when the imposed structure matches the true structure. Conclusions: The W-test is a potentially useful method for testing for association - without necessarily implying interaction - between genetic variants disease, particularly when one or more of the genetic variants are rare. For common variants, the advantages of the W-test are less clear, and, indeed, there are situations where existing methods perform better. In our investigations, we further uncover a number of problems with the practical implementation and application of the W-test (to bipolar disorder) previously described, apparently due to inadequate use of standard data quality-control procedures. This observation leads us to urge caution in interpretation of the previously-presented results, most of which we consider are highly likely to be artefacts.

  12. Nutrition Risk in Critically Ill Versus the Nutritional Risk Screening 2002: Are They Comparable for Assessing Risk of Malnutrition in Critically Ill Patients?

    PubMed

    Canales, Cecilia; Elsayes, Ali; Yeh, D Dante; Belcher, Donna; Nakayama, Anna; McCarthy, Caitlin M; Chokengarmwong, Nalin; Quraishi, Sadeq A

    2018-05-30

    Malnutrition influences clinical outcomes. Although various screening tools are available to assess nutrition status, their use in the intensive care unit (ICU) has not been rigorously studied. Our goal was to compare the Nutrition Risk in Critically Ill (NUTRIC) to the Nutritional Risk Screening (NRS) 2002 in terms of their associations with macronutrient deficit in ICU patients. We performed a retrospective analysis to investigate the relationship between NUTRIC vs NRS 2002 and macronutrient deficit (protein and calories) in critically ill patients. We performed linear regression analyses, controlling for age, sex, race, body mass index, and ICU length of stay. We then dichotomized our primary exposures and outcomes to perform logistic regression analyses, controlling for the same covariates. The analytic cohort included 312 adults. Mean NUTRIC and NRS 2002 scores were 4 ± 2 and 4 ± 1, respectively. Linear regression demonstrated that each increment in NUTRIC score was associated with a 49 g higher protein deficit (β = 48.70: 95% confidence interval [CI] 29.23-68.17) and a 752 kcal higher caloric deficit (β = 751.95; 95% CI 447.80-1056.09). Logistic regression demonstrated that NUTRIC scores >4 had over twice the odds of protein deficits ≥300 g (odds ratio [OR] 2.35; 95% CI 1.43-3.85) and caloric deficits ≥6000 kcal (OR 2.73; 95% CI 1.66-4.50) compared with NUTRIC scores ≤4. We did not observe an association of NRS 2002 scores with macronutrient deficit. Our data suggest that NUTRIC is superior to NRS 2002 for assessing malnutrition risk in ICU patients. Randomized, controlled studies are needed to determine whether nutrition interventions, stratified by NUTRIC score, can improve patient outcomes. © 2018 American Society for Parenteral and Enteral Nutrition.

  13. Predictors of condom use and refusal among the population of Free State province in South Africa

    PubMed Central

    2012-01-01

    Background This study investigated the extent and predictors of condom use and condom refusal in the Free State province in South Africa. Methods Through a household survey conducted in the Free Sate province of South Africa, 5,837 adults were interviewed. Univariate and multivariate survey logistic regressions and classification trees (CT) were used for analysing two response variables ‘ever used condom’ and ‘ever refused condom’. Results Eighty-three per cent of the respondents had ever used condoms, of which 38% always used them; 61% used them during the last sexual intercourse and 9% had ever refused to use them. The univariate logistic regression models and CT analysis indicated that a strong predictor of condom use was its perceived need. In the CT analysis, this variable was followed in importance by ‘knowledge of correct use of condom’, condom availability, young age, being single and higher education. ‘Perceived need’ for condoms did not remain significant in the multivariate analysis after controlling for other variables. The strongest predictor of condom refusal, as shown by the CT, was shame associated with condoms followed by the presence of sexual risk behaviour, knowing one’s HIV status, older age and lacking knowledge of condoms (i.e., ability to prevent sexually transmitted diseases and pregnancy, availability, correct and consistent use and existence of female condoms). In the multivariate logistic regression, age was not significant for condom refusal while affordability and perceived need were additional significant variables. Conclusions The use of complementary modelling techniques such as CT in addition to logistic regressions adds to a better understanding of condom use and refusal. Further improvement in correct and consistent use of condoms will require targeted interventions. In addition to existing social marketing campaigns, tailored approaches should focus on establishing the perceived need for condom-use and improving skills for correct use. They should also incorporate interventions to reduce the shame associated with condoms and individual counselling of those likely to refuse condoms. PMID:22639964

  14. Risk stratification personalised model for prediction of life-threatening ventricular tachyarrhythmias in patients with chronic heart failure.

    PubMed

    Frolov, Alexander Vladimirovich; Vaikhanskaya, Tatjana Gennadjevna; Melnikova, Olga Petrovna; Vorobiev, Anatoly Pavlovich; Guel, Ludmila Michajlovna

    2017-01-01

    The development of prognostic factors of life-threatening ventricular tachyarrhythmias (VTA) and sudden cardiac death (SCD) continues to maintain its priority and relevance in cardiology. The development of a method of personalised prognosis based on multifactorial analysis of the risk factors associated with life-threatening heart rhythm disturbances is considered a key research and clinical task. To design a prognostic and mathematical model to define personalised risk for life-threatening VTA in patients with chronic heart failure (CHF). The study included 240 patients with CHF (mean-age of 50.5 ± 12.1 years; left ventricular ejection fraction 32.8 ± 10.9%; follow-up period 36.8 ± 5.7 months). The participants received basic therapy for heart failure. The elec-trocardiogram (ECG) markers of myocardial electrical instability were assessed including microvolt T-wave alternans, heart rate turbulence, heart rate deceleration, and QT dispersion. Additionally, echocardiography and Holter monitoring (HM) were performed. The cardiovascular events were considered as primary endpoints, including SCD, paroxysmal ventricular tachycardia/ventricular fibrillation (VT/VF) based on HM-ECG data, and data obtained from implantable device interrogation (CRT-D, ICD) as well as appropriated shocks. During the follow-up period, 66 (27.5%) subjects with CHF showed adverse arrhythmic events, including nine SCD events and 57 VTAs. Data from a stepwise discriminant analysis of cumulative ECG-markers of myocardial electrical instability were used to make a mathematical model of preliminary VTA risk stratification. Uni- and multivariate Cox logistic regression analysis were performed to define an individualised risk stratification model of SCD/VTA. A binary logistic regression model demonstrated a high prognostic significance of discriminant function with a classification sensitivity of 80.8% and specificity of 99.1% (F = 31.2; c2 = 143.2; p < 0.0001). The method of personalised risk stratification using Cox logistic regression allows correct classification of more than 93.9% of CHF cases. A robust body of evidence concerning logistic regression prognostic significance to define VTA risk allows inclusion of this method into the algorithm of subsequent control and selection of the optimal treatment modality to treat patients with CHF.

  15. EXpectation Propagation LOgistic REgRession (EXPLORER): Distributed Privacy-Preserving Online Model Learning

    PubMed Central

    Wang, Shuang; Jiang, Xiaoqian; Wu, Yuan; Cui, Lijuan; Cheng, Samuel; Ohno-Machado, Lucila

    2013-01-01

    We developed an EXpectation Propagation LOgistic REgRession (EXPLORER) model for distributed privacy-preserving online learning. The proposed framework provides a high level guarantee for protecting sensitive information, since the information exchanged between the server and the client is the encrypted posterior distribution of coefficients. Through experimental results, EXPLORER shows the same performance (e.g., discrimination, calibration, feature selection etc.) as the traditional frequentist Logistic Regression model, but provides more flexibility in model updating. That is, EXPLORER can be updated one point at a time rather than having to retrain the entire data set when new observations are recorded. The proposed EXPLORER supports asynchronized communication, which relieves the participants from coordinating with one another, and prevents service breakdown from the absence of participants or interrupted communications. PMID:23562651

  16. Social capital, political trust, and health locus of control: a population-based study.

    PubMed

    Lindström, Martin

    2011-02-01

    To investigate the association between political trust in the Riksdag and lack of belief in the possibility to influence one's own health (external locus of control), taking horizontal trust into account. The 2008 public health survey in Skåne is a cross-sectional postal questionnaire study with a 55% participation rate. A random sample of 28,198 persons aged 18-80 years participated. Logistic regression models were used to investigate the associations between political trust in the Riksdag (an aspect of vertical trust) and lack of belief in the possibility to influence one's own health (external locus of control). The multiple regression analyses included age, country of birth, education, and horizontal trust in other people. A 33.7% of all men and 31.8% of all women lack internal locus of control. Low (external) health locus of control is more common in higher age groups, among people born outside Sweden, with lower education, low horizontal trust, low political trust, and no opinion concerning political trust. Respondents with not particularly strong political trust, no political trust at all and no opinion have significantly higher odds ratios of external locus of control throughout the multiple regression analyses. Low political trust in the Riksdag seems to be independently associated with external health locus of control.

  17. Living near overhead high voltage transmission power lines as a risk factor for childhood acute lymphoblastic leukemia: a case-control study.

    PubMed

    Sohrabi, Mohammad-Reza; Tarjoman, Termeh; Abadi, Alireza; Yavari, Parvin

    2010-01-01

    This study aimed to investigate association of living near high voltage power lines with occurrence of childhood acute lymphoblastic leukemia (ALL). Through a case-control study 300 children aged 1-18 years with confirmed ALL were selected from all referral teaching centers for cancer. They interviewed for history of living near overhead high voltage power lines during at least past two years and compared with 300 controls which were individually matched for sex and approximate age. Logistic regression, chi square and paired t-tests were used for analysis when appropriate. The case group were living significantly closer to power lines (P<0.001). More than half of the cases were exposed to two or three types of power lines (P<0.02). Using logistic regression, odds ratio of 2.61 (95%CI: 1.73 to 3.94) calculated for less than 600 meters far from the nearest lines against more than 600 meters. This ratio estimated as 9.93 (95%CI: 3.47 to 28.5) for 123 KV, 10.78 (95%CI: 3.75 to 31) for 230 KV and 2.98 (95%CI: 0.93 to 9.54) for 400 KV lines. Odds of ALL decreased 0.61 for every 600 meters from the nearest power line. This study emphasizes that living close to high voltage power lines is a risk for ALL.

  18. Association between Prenatal Environmental Factors and Child Autism: A Case Control Study in Tianjin, China.

    PubMed

    Gao, Lei; Xi, Qian Qian; Wu, Jun; Han, Yu; Dai, Wei; Su, Yuan Yuan; Zhang, Xin

    2015-09-01

    To investigate the association between autism and prenatal environmental risk factors. A case-control study was conducted among 193 children with autism from the special educational schools and 733 typical development controls matched by age and gender by using questionnaire in Tianjin from 2007 to 2012. Statistical analysis included quick unbiased efficient statistical tree (QUEST) and logistic regression in SPSS 20.0. There were four predictors by QUEST and the logistic regression analysis, maternal air conditioner use during pregnancy (OR=0.316, 95% CI: 0.215-0.463) was the single first-level node (χ²=50.994, P=0.000); newborn complications (OR=4.277, 95% CI: 2.314-7.908) and paternal consumption of freshwater fish (OR=0.383, 95% CI: 0.256-0.573) were second-layer predictors (χ²=45.248, P=0.000; χ²=24.212, P=0.000); and maternal depression (OR=4.822, 95% CI: 3.047-7.631) was the single third-level predictor (χ²=23.835, P=0.000). The prediction accuracy of the tree was 89.2%. The air conditioner use during pregnancy and paternal freshwater fish diet might be beneficial for the prevention of autism, while newborn complications and maternal depression might be the risk factors. Copyright © 2015 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.

  19. Dietary factors and the risk of testicular cancer.

    PubMed

    Bonner, Matthew R; McCann, Susan E; Moysich, Kirsten B

    2002-01-01

    The etiology of testicular cancer (TC) remains largely unknown. Few studies have investigated the role diet may play in the etiology of TC. We report on a hospital-based case-control study of TC and selected nutrients and food groups. Cases included 117 patients with primary, incident TC treated at Roswell Park Cancer Institute between 1982 and 1998. A total of 334 hospital controls were frequency matched on age to cases. Cases were categorized by histology (seminoma, nonseminoma, and mixed germ cell TC), and multinomial logistic regression and unconditional logistic regression were used to compute odds ratios (ORs) and 95% confidence intervals (CIs) comparing each histological type with the controls. For nonseminoma and mixed germ cell TC, vitamin E intake was suggestive of reduced risk (OR = 0.51, 95% CI = 0.15-1.76 and OR = 0.36, 95% CI = 0.01-1.31, respectively); for seminoma, it was suggestive of an increased risk (OR = 2.94, 95% CI = 0.99-8.78). Fat intakes were not associated with nonseminoma or mixed germ cell risk; high saturated, animal, and total fat intakes were suggestive of an increase in risk of seminoma. Overall, diet was not associated with TC. However, risk seemed to differ by histology, suggesting that seminoma, nonseminoma, and mixed germ cell TC may have different etiologies. We suggest that future investigations should continue to stratify cases by histology.

  20. Chronic periodontitis is associated with erectile dysfunction. A case-control study in european population.

    PubMed

    Martín, Amada; Bravo, Manuel; Arrabal, Miguel; Magán-Fernández, Antonio; Mesa, Francisco

    2018-07-01

    To determine the association between chronic periodontitis and erectile dysfunction adjusting for biochemical markers and other comorbidities. A case-control study was conducted on 158 male patients; 80 cases with erectile dysfunction according to the International Index of Erectile Function and 78 controls. Sociodemographic data were gathered, and a periodontal examination was performed. Testosterone, lipid profile, C-reactive protein and glycaemic parameters were assessed. All variables were compared between groups, and multivariate logistic regression analyses were performed. 74% of the cases were diagnosed with chronic periodontitis. Number of sites with pocket probing depth 4-6 mm (p = 0.05) and number of sites with clinical attachment loss >3 mm (p < 0.01) were higher in the cases. Triglycerides (p < 0.01), C-reactive protein (p = 0.02) and glycosylated haemoglobin (p = 0.04) were also higher in the cases. Logistic regression showed that patients with chronic periodontitis were more likely to have erectile dysfunction (OR=2.17; 95% CI (1.06-4.43); p = 0.03) independently of other confounders. Patients with erectile dysfunction showed worse periodontal condition. Chronic periodontitis seems to play a key role as a risk factor in the pathogenesis of erectile dysfunction independently of other morbidities. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  1. Association between chronic viral hepatitis infection and breast cancer risk: a nationwide population-based case-control study

    PubMed Central

    2011-01-01

    Background In Taiwan, there is a high incidence of breast cancer and a high prevalence of viral hepatitis. In this case-control study, we used a population-based insurance dataset to evaluate whether breast cancer in women is associated with chronic viral hepatitis infection. Methods From the claims data, we identified 1,958 patients with newly diagnosed breast cancer during the period 2000-2008. A randomly selected, age-matched cohort of 7,832 subjects without cancer was selected for comparison. Multivariable logistic regression models were constructed to calculate odds ratios of breast cancer associated with viral hepatitis after adjustment for age, residential area, occupation, urbanization, and income. The age-specific (<50 years and ≥50 years) risk of breast cancer was also evaluated. Results There were no significant differences in the prevalence of hepatitis C virus (HCV) infection, hepatitis B virus (HBV), or the prevalence of combined HBC/HBV infection between breast cancer patients and control subjects (p = 0.48). Multivariable logistic regression analysis, however, revealed that age <50 years was associated with a 2-fold greater risk of developing breast cancer (OR = 2.03, 95% CI = 1.23-3.34). Conclusions HCV infection, but not HBV infection, appears to be associated with early onset risk of breast cancer in areas endemic for HCV and HBV. This finding needs to be replicated in further studies. PMID:22115285

  2. Power and type I error results for a bias-correction approach recently shown to provide accurate odds ratios of genetic variants for the secondary phenotypes associated with primary diseases.

    PubMed

    Wang, Jian; Shete, Sanjay

    2011-11-01

    We recently proposed a bias correction approach to evaluate accurate estimation of the odds ratio (OR) of genetic variants associated with a secondary phenotype, in which the secondary phenotype is associated with the primary disease, based on the original case-control data collected for the purpose of studying the primary disease. As reported in this communication, we further investigated the type I error probabilities and powers of the proposed approach, and compared the results to those obtained from logistic regression analysis (with or without adjustment for the primary disease status). We performed a simulation study based on a frequency-matching case-control study with respect to the secondary phenotype of interest. We examined the empirical distribution of the natural logarithm of the corrected OR obtained from the bias correction approach and found it to be normally distributed under the null hypothesis. On the basis of the simulation study results, we found that the logistic regression approaches that adjust or do not adjust for the primary disease status had low power for detecting secondary phenotype associated variants and highly inflated type I error probabilities, whereas our approach was more powerful for identifying the SNP-secondary phenotype associations and had better-controlled type I error probabilities. © 2011 Wiley Periodicals, Inc.

  3. Statistical Methods for Quality Control of Steel Coils Manufacturing Process using Generalized Linear Models

    NASA Astrophysics Data System (ADS)

    García-Díaz, J. Carlos

    2009-11-01

    Fault detection and diagnosis is an important problem in process engineering. Process equipments are subject to malfunctions during operation. Galvanized steel is a value added product, furnishing effective performance by combining the corrosion resistance of zinc with the strength and formability of steel. Fault detection and diagnosis is an important problem in continuous hot dip galvanizing and the increasingly stringent quality requirements in automotive industry has also demanded ongoing efforts in process control to make the process more robust. When faults occur, they change the relationship among these observed variables. This work compares different statistical regression models proposed in the literature for estimating the quality of galvanized steel coils on the basis of short time histories. Data for 26 batches were available. Five variables were selected for monitoring the process: the steel strip velocity, four bath temperatures and bath level. The entire data consisting of 48 galvanized steel coils was divided into sets. The first training data set was 25 conforming coils and the second data set was 23 nonconforming coils. Logistic regression is a modeling tool in which the dependent variable is categorical. In most applications, the dependent variable is binary. The results show that the logistic generalized linear models do provide good estimates of quality coils and can be useful for quality control in manufacturing process.

  4. Maternal exposures and risk of spontaneous abortion before and after a community oriented health education campaign.

    PubMed

    Agnesi, Roberto; Valentini, Flavio; Fedeli, Ugo; Rylander, Ragnar; Meneghetti, Maurizia; Fadda, Emanuela; Buja, Alessandra; Mastrangelo, Giuseppe

    2011-06-01

    In a district of Veneto (North-east Italy) where numerous females of childbearing age were occupationally exposed to organic solvents in nearly 400 shoe factories, a case-control study found significant associations between maternal exposures (from occupation and risky behavior) and spontaneous abortion (SAB). Thereafter, a health education campaign was undertaken to increase awareness of risk factors for pregnancy in the population. To evaluate the effects of this campaign maternal exposures and SAB risks were compared before and after the campaign. Hospital records were collected from a local hospital for SAB cases and age- residence-matched controls with normal deliveries. Information on solvent exposure, coffee and alcohol consumption, smoking and the use of medication was collected using a questionnaire. Before and after differences were tested through a modified Chi-square test and linear and logistic regressions for survey data. Odds ratios (ORs) with 95% confidence interval (CI) were estimated using logistic regression models. The consumption of coffee (P = 0.003) and alcohol (P < 0.001) was lower after than before the campaign, controlling for age at pregnancy and level of education. There were no differences in reported solvent exposure or smoking (smokers were few). The previously detected increased risks of SAB in relation to solvent exposure and coffee consumption were no longer present. The results suggest that health education campaigns might reduce harmful maternal exposures and the risk of SAB.

  5. Minimal intervention dentistry for early childhood caries and child dental anxiety: a randomized controlled trial.

    PubMed

    Arrow, P; Klobas, E

    2017-06-01

    To compare changes in child dental anxiety after treatment for early childhood caries (ECC) using two treatment approaches. Children with ECC were randomized to test (atraumatic restorative treatment (ART)-based approach) or control (standard care approach) groups. Children aged 3 years or older completed a dental anxiety scale at baseline and follow up. Changes in child dental anxiety from baseline to follow up were tested using the chi-squared statistic, Wilcoxon rank sum test, McNemar's test and multinomial logistic regression. Two hundred and fifty-four children were randomized (N = 127 test, N = 127 control). At baseline, 193 children completed the dental anxiety scale, 211 at follow up and 170 completed the scale on both occasions. Children who were anxious at baseline (11%) were no longer anxious at follow up, and 11% non-anxious children became anxious. Multinomial logistic regression found each increment in the number of visits increased the odds of worsening dental anxiety (odds ratio (OR), 2.2; P < 0.05), whereas each increment in the number of treatments lowered the odds of worsening anxiety (OR, 0.50; P = 0.05). The ART-based approach to managing ECC resulted in similar levels of dental anxiety to the standard treatment approach and provides a valuable alternative approach to the management of ECC in a primary dental care setting. © 2016 Australian Dental Association.

  6. [Risk factors for patent ductus arteriosus in early preterm infants: a case-control study].

    PubMed

    Du, Jin-Feng; Liu, Tian-Tian; Wu, Hui

    2016-01-01

    To investigate the risk factors for the occurrence of patent ductus arteriosus (PDA) and to provide a clinical basis for reducing the occurrence of PDA in early preterm infants. A total of 136 early preterm infants (gestational age≤32 weeks) who were hospitalized between January 2013 and December 2014 and diagnosed with hemodynamicalhy significant PDA (hs-PDA) were enrolled as the case group. Based on the matched case-control principle, 136 early preterm infants without hs-PDA were selected among those who were hospitalized within the same period at a ratio of 1:1 and enrolled as the control group. The two groups were matched for sex and gestational age. The basic information of neonates and maternal conditions during the pregnancy and perinatal periods were collected. Logistic regression analysis was performed to identify the risk factors for the development of PDA. Univariate analysis showed that neonatal infectious diseases, neonatal respiratory distress syndrome, decreased platelet count within 24 hours after birth, and low birth weight were associated with the development of hs-PDA (P<0.05). Multivariate conditional logistic regression analysis revealed that neonatal infectious diseases (OR=2.368) and decreased platelet count within 24 hours after birth (OR=0.996) were independent risk factors for hs-PDA. Neonatal infectious diseases and decreased platelet count within 24 hours after birth increase the risk of hs-PDA in early preterm infants.

  7. Sub-Saharan African university students' beliefs about condoms, condom-use intention, and subsequent condom use: a prospective study.

    PubMed

    Heeren, G Anita; Jemmott, John B; Mandeya, Andrew; Tyler, Joanne C

    2009-04-01

    Whether certain behavioral beliefs, normative beliefs, and control beliefs predict the intention to use condoms and subsequent condom use was examined among 320 undergraduates at a university in South Africa who completed confidential questionnaires on two occasions separated by 3 months. Participants' mean age was 23.4 years, 47.8% were women, 48.9% were South Africans, and 51.1% were from other sub-Saharan African countries. Multiple regression revealed that condom-use intention was predicted by hedonistic behavioral beliefs, normative beliefs regarding sexual partners and peers, and control beliefs regarding condom-use technical skill and impulse control. Logistic regression revealed that baseline condom-use intention predicted consistent condom use and condom use during most recent intercourse at 3-month follow-up. HIV/STI risk-reduction interventions for undergraduates in South Africa should target their condom-use hedonistic beliefs, normative beliefs regarding partners and peers, and control beliefs regarding technical skill and impulse control.

  8. RED MEAT, MICRONUTRIENTS AND ORAL SQUAMOUS CELL CARCINOMA OF ARGENTINE ADULT PATIENTS.

    PubMed

    Secchi, Dante Gustavo; Aballay, Laura Rosana; Galíndez, María Fernanda; Piccini, Daniel; Lanfranchi, Héctor; Brunotto, Mabel

    2015-09-01

    the identification of risk group of oral cancer allows reducing the typical morbidity and mortality rates of this pathology. it was analyzed the role of red meat, macronutrients and micronutrients on Oral Squamous Cell carcinoma (OSCC) in a case-control study carried out in Cordoba, Argentina. case-control study 3:1, both genders, aged 24-80 years. Dietary information was collected using a quali-quantitative food frequency questionnaire. The logistic regression was applied for assessing the association among case/control status and daily red meat/macronutrient/ micronutrients/energy intake. micronutrients and minerals in the diet that showed high significant median values of common consumption in cases relative to controls were iron, phosphorus, vitamins B1, B5, B6, E and K and selenium. The association measurement estimated by logistic regression was showed that a significant association between red meat, fat, daily energy, phosphorous, vitamin B5, vitamin E, and selenium intake and OSCC presence. a high intake of fats, phosphorus, vitamin B5, vitamin E, and selenium intake and red meat appears to be related to the presence OSCC in Cordoba, Argentina. In relation to red meat consumption and risk of OSCC, the future research should center of attention on reducing the complexity of diet and disease relationships and reducing variability in intake data by standardizing of criteria in order to implement simple strategies in public health for recognizing risk groups of OSCC. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.

  9. Impact of maternal education, employment and family size on nutritional status of children.

    PubMed

    Iftikhar, Aisha; Bari, Attia; Bano, Iqbal; Masood, Qaisar

    2017-01-01

    To determine the impact of maternal education, employment, and family size on nutritional status of children. It was case control study conducted at OPD of children Hospital Lahore, from September 2015 to April 2017. Total 340 children (170 cases and 170 controls) with age range of six months to five years along with their mothers were included. Anthropometric measurements were plotted against WHO growth Charts. 170 wasted (<-2 SD) were matched with 170 controls (≥ -2 SD). Maternal education, employment and family size were compared between the cases and control. Confounding variables noted and dichotomized. Univariate analysis was carried out for factors under consideration i.e.; Maternal Education, employment and family size to study the association of each factor. Logistic regression analysis was applied to study the independent association. Maternal education had significant association with growth parameters; OR of 1.32 with confidence interval of (CI= 1.1 to 1.623). Employment status of mothers had OR of 1.132 with insignificant confidence interval of (CI=0.725 to 1.768). Family size had OR of one with insignificant confidence interval (CI=0.8 -1.21). Association remained same after applying bivariate logistic regression analysis. Maternal education has definite and significant effect on nutritional status of children. This is the key factor to be addressed for prevention or improvement of childhood malnutrition. For this it is imperative to launch sustainable programs at national and regional level to uplift women educational status to combat this ever increasing burden of malnutrition.

  10. Impulsivity, attention, memory, and decision-making among adolescent marijuana users.

    PubMed

    Dougherty, Donald M; Mathias, Charles W; Dawes, Michael A; Furr, R Michael; Charles, Nora E; Liguori, Anthony; Shannon, Erin E; Acheson, Ashley

    2013-03-01

    Marijuana is a popular drug of abuse among adolescents, and they may be uniquely vulnerable to resulting cognitive and behavioral impairments. Previous studies have found impairments among adolescent marijuana users. However, the majority of this research has examined measures individually rather than multiple domains in a single cohesive analysis. This study used a logistic regression model that combines performance on a range of tasks to identify which measures were most altered among adolescent marijuana users. The purpose of this research was to determine unique associations between adolescent marijuana use and performances on multiple cognitive and behavioral domains (attention, memory, decision-making, and impulsivity) in 14- to 17-year-olds while simultaneously controlling for performances across the measures to determine which measures most strongly distinguish marijuana users from nonusers. Marijuana-using adolescents (n = 45) and controls (n = 48) were tested. Logistic regression analyses were conducted to test for: (1) differences between marijuana users and nonusers on each measure, (2) associations between marijuana use and each measure after controlling for the other measures, and (3) the degree to which (1) and (2) together elucidated differences among marijuana users and nonusers. Of all the cognitive and behavioral domains tested, impaired short-term recall memory and consequence sensitivity impulsivity were associated with marijuana use after controlling for performances across all measures. This study extends previous findings by identifying cognitive and behavioral impairments most strongly associated with adolescent marijuana users. These specific deficits are potential targets of intervention for this at-risk population.

  11. Risk of Peripheral Artery Occlusive Disease in Patients with Vertigo, Tinnitus, or Sudden Deafness: A Secondary Case-Control Analysis of a Nationwide, Population-Based Health Claims Database.

    PubMed

    Koo, Malcolm; Chen, Jin-Cherng; Hwang, Juen-Haur

    2016-01-01

    Cochleovestibular symptoms, such as vertigo, tinnitus, and sudden deafness, are common manifestations of microvascular diseases. However, it is unclear whether these symptoms occurred preceding the diagnosis of peripheral artery occlusive disease (PAOD). Therefore, the aim of this case-control study was to investigate the risk of PAOD among patients with vertigo, tinnitus, and sudden deafness using a nationwide, population-based health claim database in Taiwan. We identified 5,340 adult patients with PAOD diagnosed between January 1, 2006 and December 31, 2010 and 16,020 controls, frequency matched on age interval, sex, and year of index date, from the Taiwan National Health Insurance Research Database. Risks of PAOD in patients with vertigo, tinnitus, or sudden deafness were separately evaluated with multivariate logistic regression analyses. Of the 5,340 patients with PAOD, 12.7%, 6.7%, and 0.3% were diagnosed with vertigo, tinnitus, and sudden deafness, respectively. In the controls, 10.6%, 6.1%, and 0.3% were diagnosed with vertigo (P < 0.001), tinnitus (P = 0.161), and sudden deafness (P = 0.774), respectively. Results from the multivariate logistic regression analyses showed that the risk of PAOD was significantly increased in patients with vertigo (adjusted odds ratio = 1.12, P = 0.027) but not in those with tinnitus or sudden deafness. A modest increase in the risk of PAOD was observed among Taiwanese patients with vertigo, after adjustment for comorbidities.

  12. Impact of parental-rearing styles on irritable bowel syndrome in adolescents: a school-based study.

    PubMed

    Xing, Zhouxiong; Hou, Xiaohua; Zhou, Kan; Qin, Diyuan; Pan, Wen

    2014-03-01

    A strong association between family function and irritable bowel syndrome (IBS) has been observed. Parental rearing styles, as a comprehensive mark for family function, may provide new clues to the etiology of IBS. This study aimed to explore which dimensions of parental rearing styles were risk factors or protective factors for IBS in adolescents. Two thousand three hundred twenty adolescents were recruited from one middle school and one high school randomly selected from Jiangan District (an urban district in Wuhan City). Data were collected using two Chinese versions of validated self-report questionnaires including the Rome III diagnostic criteria for pediatric IBS and the Egna Minnen Beträffande Uppfostran: One's Memories of Upbringing for perceived parental rearing styles. Ninety-six subjects diagnosed as pediatric IBS were compared with 1618 controls. The IBS patients reported less both paternal and maternal emotional warmth (all P < 0.01) and more both paternal and maternal punishment, overinterference, rejection, and overprotection (only for father) (all P < 0.01) than the controls. Furthermore, the IBS patients had higher total scores of parental rearing styles (all P < 0.001) than the controls. With univariate logistic regression, standardized regression coefficients and odds ratios of parental rearing variables were calculated. Multivariate logistic regression revealed that paternal rejection (P = 0.001) and maternal overinterference (P = 0.002) were independent risk factors for IBS in adolescents. Parental emotional warmth is a protective factor for IBS in adolescents and parental punishment, overinterference, rejection, and overprotection are risk factors for IBS in adolescents. © 2013 Journal of Gastroenterology and Hepatology Foundation and Wiley Publishing Asia Pty Ltd.

  13. [Optimization of diagnosis indicator selection and inspection plan by 3.0T MRI in breast cancer].

    PubMed

    Jiang, Zhongbiao; Wang, Yunhua; He, Zhong; Zhang, Lejun; Zheng, Kai

    2013-08-01

    To optimize 3.0T MRI diagnosis indicator in breast cancer and to select the best MRI scan program. Totally 45 patients with breast cancers were collected, and another 35 patients with benign breast tumor served as the control group. All patients underwent 3.0T MRI, including T1- weighted imaging (T1WI), fat suppression of the T2-weighted imaging (T2WI), diffusion weighted imaging (DWI), 1H magnetic resonance spectroscopy (1H-MRS) and dynamic contrast enhanced (DCE) sequence. With operation pathology results as the gold standard in the diagnosis of breast diseases, the pathological results of benign and malignant served as dependent variables, and the diagnostic indicators of MRI were taken as independent variables. We put all the indicators of MRI examination under Logistic regression analysis, established the Logistic model, and optimized the diagnosis indicators of MRI examination to further improve MRI scan of breast cancer. By Logistic regression analysis, some indicators were selected in the equation, including the edge feature of the tumor, the time-signal intensity curve (TIC) type and the apparent diffusion coefficient (ADC) value when b=500 s/mm2. The regression equation was Logit (P)=-21.936+20.478X6+3.267X7+ 21.488X3. Valuable indicators in the diagnosis of breast cancer are the edge feature of the tumor, the TIC type and the ADC value when b=500 s/mm2. Combining conventional MRI scan, DWI and dynamic enhanced MRI is a better examination program, while MRS is the complementary program when diagnosis is difficult.

  14. Resection of complex pancreatic injuries: Benchmarking postoperative complications using the Accordion classification

    PubMed Central

    Krige, Jake E; Jonas, Eduard; Thomson, Sandie R; Kotze, Urda K; Setshedi, Mashiko; Navsaria, Pradeep H; Nicol, Andrew J

    2017-01-01

    AIM To benchmark severity of complications using the Accordion Severity Grading System (ASGS) in patients undergoing operation for severe pancreatic injuries. METHODS A prospective institutional database of 461 patients with pancreatic injuries treated from 1990 to 2015 was reviewed. One hundred and thirty patients with AAST grade 3, 4 or 5 pancreatic injuries underwent resection (pancreatoduodenectomy, n = 20, distal pancreatectomy, n = 110), including 30 who had an initial damage control laparotomy (DCL) and later definitive surgery. AAST injury grades, type of pancreatic resection, need for DCL and incidence and ASGS severity of complications were assessed. Uni- and multivariate logistic regression analysis was applied. RESULTS Overall 238 complications occurred in 95 (73%) patients of which 73% were ASGS grades 3-6. Nineteen patients (14.6%) died. Patients more likely to have complications after pancreatic resection were older, had a revised trauma score (RTS) < 7.8, were shocked on admission, had grade 5 injuries of the head and neck of the pancreas with associated vascular and duodenal injuries, required a DCL, received a larger blood transfusion, had a pancreatoduodenectomy (PD) and repeat laparotomies. Applying univariate logistic regression analysis, mechanism of injury, RTS < 7.8, shock on admission, DCL, increasing AAST grade and type of pancreatic resection were significant variables for complications. Multivariate logistic regression analysis however showed that only age and type of pancreatic resection (PD) were significant. CONCLUSION This ASGS-based study benchmarked postoperative morbidity after pancreatic resection for trauma. The detailed outcome analysis provided may serve as a reference for future institutional comparisons. PMID:28396721

  15. The base rates and factors associated with reported access to firearms in psychiatric inpatients.

    PubMed

    Kolla, Bhanu Prakash; O'Connor, Stephen S; Lineberry, Timothy W

    2011-01-01

    The aim of this study was to define whether specific patient demographic groups, diagnoses or other factors are associated with psychiatric inpatients reporting firearms access. A retrospective medical records review study was conducted using information on access to firearms from electronic medical records for all patients 16 years and older admitted between July 2007 and May 2008 at the Mayo Clinic Psychiatric Hospital in Rochester, MN. Data were obtained only on patients providing authorization for record review. Data were analyzed using univariate and multivariate logistic regression analyses accounting for gender, diagnostic groups, comorbid substance use, history of suicide attempts and family history of suicide/suicide attempts. Seventy-four percent (1169/1580) of patients provided research authorization. The ratio of men to women was identical in both research and nonresearch authorization groups. There were 14.6% of inpatients who reported firearms access. In univariate analysis, men were more likely (P<.0001) to report access than women, and a history of previous suicide attempt(s) was associated with decreased access (P=.02). Multiple logistic regression analyses controlling for other factors found females and patients with history of previous suicide attempt(s) less likely to report access, while patients with a family history of suicide or suicide attempts reported increased firearms access. Diagnostic groups were not associated with access on univariate or multiple logistic regression analyses. Men and inpatients with a family history of suicide/suicide attempts were more likely to report firearms access. Clinicians should develop standardized systems of identification of firearms access and provide guidance on removal. Copyright © 2011 Elsevier Inc. All rights reserved.

  16. Prevalence of abortion and stillbirth in a beef cattle system in Southeastern Mexico.

    PubMed

    Segura-Correa, José C; Segura-Correa, Victor M

    2009-12-01

    Prenatal mortality is an important cause of production losses in the livestock industry. This study estimates the prevalences of abortion and stillbirth in a beef cattle system and determines the significance of some risk factors, in the tropics of Mexico. Data were obtained from a Zebu cattle herd and their crosses with Bos taurus breeds, in Yucatan, Mexico. The logit of the probability of an abortion or stillbirth was modeled using binary logistic regression. The risk factors tested were: year of abortion (or calving), season of abortion (or calving), parity number and dam breed group. The effect of twins on stillbirth was tested using Fisher exact test. Of the 4175 calvings studied 49 were abortions (1.17%). Significant factors in the logistic regression analysis for abortions were season of abortion and parity number. The risk of abortion was lower in the dry seasons compared to the rainy and windy seasons (P = 0.009). The risk of abortion was higher in second parity cows followed by the third and first parity cows, as compared to older cows (P = 0.015). Of the 4126 births, 87 were stillbirths (2.11%). Significant factors in the logistic regression analysis for stillbirth were year of calving (P = 0.0001) and parity number (P < 0.001). The risk of stillbirth in first parity cows was 2.6 times that of old cows. Of the total births, 15 were twins (0.36%) of which 7 were born dead calves. Herd owners must focus on the significant risk factors under their control to reduce the prevalence of prenatal mortality.

  17. Sparse modeling of spatial environmental variables associated with asthma

    PubMed Central

    Chang, Timothy S.; Gangnon, Ronald E.; Page, C. David; Buckingham, William R.; Tandias, Aman; Cowan, Kelly J.; Tomasallo, Carrie D.; Arndt, Brian G.; Hanrahan, Lawrence P.; Guilbert, Theresa W.

    2014-01-01

    Geographically distributed environmental factors influence the burden of diseases such as asthma. Our objective was to identify sparse environmental variables associated with asthma diagnosis gathered from a large electronic health record (EHR) dataset while controlling for spatial variation. An EHR dataset from the University of Wisconsin’s Family Medicine, Internal Medicine and Pediatrics Departments was obtained for 199,220 patients aged 5–50 years over a three-year period. Each patient’s home address was geocoded to one of 3,456 geographic census block groups. Over one thousand block group variables were obtained from a commercial database. We developed a Sparse Spatial Environmental Analysis (SASEA). Using this method, the environmental variables were first dimensionally reduced with sparse principal component analysis. Logistic thin plate regression spline modeling was then used to identify block group variables associated with asthma from sparse principal components. The addresses of patients from the EHR dataset were distributed throughout the majority of Wisconsin’s geography. Logistic thin plate regression spline modeling captured spatial variation of asthma. Four sparse principal components identified via model selection consisted of food at home, dog ownership, household size, and disposable income variables. In rural areas, dog ownership and renter occupied housing units from significant sparse principal components were associated with asthma. Our main contribution is the incorporation of sparsity in spatial modeling. SASEA sequentially added sparse principal components to Logistic thin plate regression spline modeling. This method allowed association of geographically distributed environmental factors with asthma using EHR and environmental datasets. SASEA can be applied to other diseases with environmental risk factors. PMID:25533437

  18. Sparse modeling of spatial environmental variables associated with asthma.

    PubMed

    Chang, Timothy S; Gangnon, Ronald E; David Page, C; Buckingham, William R; Tandias, Aman; Cowan, Kelly J; Tomasallo, Carrie D; Arndt, Brian G; Hanrahan, Lawrence P; Guilbert, Theresa W

    2015-02-01

    Geographically distributed environmental factors influence the burden of diseases such as asthma. Our objective was to identify sparse environmental variables associated with asthma diagnosis gathered from a large electronic health record (EHR) dataset while controlling for spatial variation. An EHR dataset from the University of Wisconsin's Family Medicine, Internal Medicine and Pediatrics Departments was obtained for 199,220 patients aged 5-50years over a three-year period. Each patient's home address was geocoded to one of 3456 geographic census block groups. Over one thousand block group variables were obtained from a commercial database. We developed a Sparse Spatial Environmental Analysis (SASEA). Using this method, the environmental variables were first dimensionally reduced with sparse principal component analysis. Logistic thin plate regression spline modeling was then used to identify block group variables associated with asthma from sparse principal components. The addresses of patients from the EHR dataset were distributed throughout the majority of Wisconsin's geography. Logistic thin plate regression spline modeling captured spatial variation of asthma. Four sparse principal components identified via model selection consisted of food at home, dog ownership, household size, and disposable income variables. In rural areas, dog ownership and renter occupied housing units from significant sparse principal components were associated with asthma. Our main contribution is the incorporation of sparsity in spatial modeling. SASEA sequentially added sparse principal components to Logistic thin plate regression spline modeling. This method allowed association of geographically distributed environmental factors with asthma using EHR and environmental datasets. SASEA can be applied to other diseases with environmental risk factors. Copyright © 2014 Elsevier Inc. All rights reserved.

  19. Logistic regression analysis of the risk factors of anastomotic fistula after radical resection of esophageal‐cardiac cancer

    PubMed Central

    Huang, Jinxi; Wang, Chenghu; Yuan, Weiwei; Zhang, Zhandong; Chen, Beibei; Zhang, Xiefu

    2017-01-01

    Background This study was conducted to investigate the risk factors of anastomotic fistula after the radical resection of esophageal‐cardiac cancer. Methods Five hundred and forty‐four esophageal‐cardiac cancer patients who underwent surgery and had complete clinical data were included in the study. Fifty patients diagnosed with postoperative anastomotic fistula were considered the case group and the remaining 494 subjects who did not develop postoperative anastomotic fistula were considered the control. The potential risk factors for anastomotic fistula, such as age, gender, diabetes history, smoking history, were collected and compared between the groups. Statistically significant variables were substituted into logistic regression to further evaluate the independent risk factors for postoperative anastomotic fistulas in esophageal‐cardiac cancer. Results The incidence of anastomotic fistulas was 9.2% (50/544). Logistic regression analysis revealed that female gender (P < 0.05), laparoscopic surgery (P < 0.05), decreased postoperative albumin (P < 0.05), and postoperative renal dysfunction (P < 0.05) were independent risk factors for anastomotic fistulas in patients who received surgery for esophageal‐cardiac cancer. Of the 50 anastomotic fistulas, 16 cases were small fistulas, which were only discovered by conventional imaging examination and not presenting clinical symptoms. All of the anastomotic fistulas occurred within seven days after surgery. Five of the patients with anastomotic fistulas underwent a second surgery and three died. Conclusion Female patients with esophageal‐cardiac cancer treated with endoscopic surgery and suffering from postoperative hypoproteinemia and renal dysfunction were susceptible to postoperative anastomotic fistula. PMID:28940985

  20. Differentiating major depressive disorder in youths with attention deficit hyperactivity disorder.

    PubMed

    Diler, Rasim Somer; Daviss, W Burleson; Lopez, Adriana; Axelson, David; Iyengar, Satish; Birmaher, Boris

    2007-09-01

    Youths with attention deficit hyperactivity disorders (ADHD) frequently have comorbid major depressive disorders (MDD) sharing overlapping symptoms. Our objective was to examine which depressive symptoms best discriminate MDD among youths with ADHD. One-hundred-eleven youths with ADHD (5.2-17.8 years old) and their parents completed interviews with the K-SADS-PL and respective versions of the child or the parent Mood and Feelings Questionnaire (MFQ-C, MFQ-P). Controlling for group differences, logistic regression was used to calculate odds ratios reflecting the accuracy with which various depressive symptoms on the MFQ-C or MFQ-P discriminated MDD. Stepwise logistic regression then identified depressive symptoms that best discriminated the groups with and without MDD, using cross-validated misclassification rate as the criterion. Symptoms that discriminated youths with MDD (n=18) from those without MDD (n=93) were 4 of 6 mood/anhedonia symptoms, all 14 depressed cognition symptoms, and only 3 of 11 physical/vegetative symptoms. Mild irritability, miserable/unhappy moods, and symptoms related to sleep, appetite, energy levels and concentration did not discriminate MDD. A stepwise logistic regression correctly classified 89% of the comorbid MDD subjects, with only age, anhedonia at school, thoughts about killing self, thoughts that bad things would happen, and talking more slowly remaining in the final model. Results of this study may not generalize to community samples because subjects were drawn largely from a university-based outpatient psychiatric clinic. These findings stress the importance of social withdrawal, anhedonia, depressive cognitions, suicidal thoughts, and psychomotor retardation when trying to identify MDD among ADHD youths.

  1. Relaxing the rule of ten events per variable in logistic and Cox regression.

    PubMed

    Vittinghoff, Eric; McCulloch, Charles E

    2007-03-15

    The rule of thumb that logistic and Cox models should be used with a minimum of 10 outcome events per predictor variable (EPV), based on two simulation studies, may be too conservative. The authors conducted a large simulation study of other influences on confidence interval coverage, type I error, relative bias, and other model performance measures. They found a range of circumstances in which coverage and bias were within acceptable levels despite less than 10 EPV, as well as other factors that were as influential as or more influential than EPV. They conclude that this rule can be relaxed, in particular for sensitivity analyses undertaken to demonstrate adequate control of confounding.

  2. A Comparison of the Logistic Regression and Contingency Table Methods for Simultaneous Detection of Uniform and Nonuniform DIF

    ERIC Educational Resources Information Center

    Guler, Nese; Penfield, Randall D.

    2009-01-01

    In this study, we investigate the logistic regression (LR), Mantel-Haenszel (MH), and Breslow-Day (BD) procedures for the simultaneous detection of both uniform and nonuniform differential item functioning (DIF). A simulation study was used to assess and compare the Type I error rate and power of a combined decision rule (CDR), which assesses DIF…

  3. The Overall Odds Ratio as an Intuitive Effect Size Index for Multiple Logistic Regression: Examination of Further Refinements

    ERIC Educational Resources Information Center

    Le, Huy; Marcus, Justin

    2012-01-01

    This study used Monte Carlo simulation to examine the properties of the overall odds ratio (OOR), which was recently introduced as an index for overall effect size in multiple logistic regression. It was found that the OOR was relatively independent of study base rate and performed better than most commonly used R-square analogs in indexing model…

  4. Predicting Student Success on the Texas Chemistry STAAR Test: A Logistic Regression Analysis

    ERIC Educational Resources Information Center

    Johnson, William L.; Johnson, Annabel M.; Johnson, Jared

    2012-01-01

    Background: The context is the new Texas STAAR end-of-course testing program. Purpose: The authors developed a logistic regression model to predict who would pass-or-fail the new Texas chemistry STAAR end-of-course exam. Setting: Robert E. Lee High School (5A) with an enrollment of 2700 students, Tyler, Texas. Date of the study was the 2011-2012…

  5. Using ROC curves to compare neural networks and logistic regression for modeling individual noncatastrophic tree mortality

    Treesearch

    Susan L. King

    2003-01-01

    The performance of two classifiers, logistic regression and neural networks, are compared for modeling noncatastrophic individual tree mortality for 21 species of trees in West Virginia. The output of the classifier is usually a continuous number between 0 and 1. A threshold is selected between 0 and 1 and all of the trees below the threshold are classified as...

  6. Using Logistic Regression to Predict the Probability of Debris Flows in Areas Burned by Wildfires, Southern California, 2003-2006

    USGS Publications Warehouse

    Rupert, Michael G.; Cannon, Susan H.; Gartner, Joseph E.; Michael, John A.; Helsel, Dennis R.

    2008-01-01

    Logistic regression was used to develop statistical models that can be used to predict the probability of debris flows in areas recently burned by wildfires by using data from 14 wildfires that burned in southern California during 2003-2006. Twenty-eight independent variables describing the basin morphology, burn severity, rainfall, and soil properties of 306 drainage basins located within those burned areas were evaluated. The models were developed as follows: (1) Basins that did and did not produce debris flows soon after the 2003 to 2006 fires were delineated from data in the National Elevation Dataset using a geographic information system; (2) Data describing the basin morphology, burn severity, rainfall, and soil properties were compiled for each basin. These data were then input to a statistics software package for analysis using logistic regression; and (3) Relations between the occurrence or absence of debris flows and the basin morphology, burn severity, rainfall, and soil properties were evaluated, and five multivariate logistic regression models were constructed. All possible combinations of independent variables were evaluated to determine which combinations produced the most effective models, and the multivariate models that best predicted the occurrence of debris flows were identified. Percentage of high burn severity and 3-hour peak rainfall intensity were significant variables in all models. Soil organic matter content and soil clay content were significant variables in all models except Model 5. Soil slope was a significant variable in all models except Model 4. The most suitable model can be selected from these five models on the basis of the availability of independent variables in the particular area of interest and field checking of probability maps. The multivariate logistic regression models can be entered into a geographic information system, and maps showing the probability of debris flows can be constructed in recently burned areas of southern California. This study demonstrates that logistic regression is a valuable tool for developing models that predict the probability of debris flows occurring in recently burned landscapes.

  7. A population-based study on the association between chronic periodontitis and sialolithiasis.

    PubMed

    Hung, Shih-Han; Huang, Hung-Meng; Lee, Hsin-Chien; Ching Lin, Herng; Kao, Li-Ting; Wu, Chuan-Song

    2016-04-01

    Whereas the impression that poor oral hygiene is linked to the development of sialolithiasis may be widely accepted, very few studies provide evidence to support this. This study therefore aimed to evaluate the association between chronic periodontitis (CP) and the subsequent development of salivary gland stone based on a nationwide coverage database. A case-control study. A total of 987 subjects with sialolithiasis were included as cases. In a ratio of five controls per case, 4,935 controls matched in terms of sex and age group were selected. Conditional logistic regression analysis was performed to determine the possible association of sialolithiasis with previously diagnosed CP. The prevalence of prior CP between cases and controls demonstrated that 1,831 (30.9%) out of the 5,922 sampled subjects had prior CP. By Chi-square test, there was a significant difference in the prevalence of prior CP between the cases and controls (36.8% vs. 29.7%, P < 0.001). By conditional logistic regression analysis, the odds ratio (OR) of prior CP for cases was 1.37 (95% confidence interval [CI], 1.19-1.56) compared to the controls after adjusting for geographic location and tobacco use. Further analyzing the relationship between sialolithiasis and prior CP according to sex, sialolithiasis was associated with prior CP regardless of sex. The adjusted OR of prior CP for the cases was 1.34 (95% CI, 1.10-1.64) and 1.41 (95% CI, 1.15-1.73) for males and females, respectively, when compared to controls. This study demonstrates an association between CP and sialolithiasis. 3b. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.

  8. A case-control study on the association between bladder cancer and prior bladder calculus.

    PubMed

    Chung, Shiu-Dong; Tsai, Ming-Chieh; Lin, Ching-Chun; Lin, Herng-Ching

    2013-03-15

    Bladder calculus is associated with chronic irritation and inflammation. As there is substantial documentation that inflammation can play a direct role in carcinogenesis, to date the relationship between stone formation and bladder cancer (BC) remains unclear. This study aimed to examine the association between BC and prior bladder calculus using a population-based dataset. This case-control study included 2,086 cases who had received their first-time diagnosis of BC between 2001 and 2009 and 10,430 randomly selected controls without BC. Conditional logistic regressions were employed to explore the association between BC and having been previously diagnosed with bladder calculus. Of the sampled subjects, bladder calculus was found in 71 (3.4%) cases and 105 (1.1%) controls. Conditional logistic regression analysis revealed that the odds ratio (OR) of having been diagnosed with bladder calculus before the index date for cases was 3.42 (95% CI = 2.48-4.72) when compared with controls after adjusting for monthly income, geographic region, hypertension, diabetes, coronary heart disease, and renal disease, tobacco use disorder, obesity, alcohol abuse, and schistosomiasis, bladder outlet obstruction, and urinary tract infection. We further analyzed according to sex and found that among males, the OR of having been previously diagnosed with bladder calculus for cases was 3.45 (95% CI = 2.39-4.99) that of controls. Among females, the OR was 3.05 (95% CI = 1.53-6.08) that of controls. These results add to the evidence surrounding the conflicting reports regarding the association between BC and prior bladder calculus and highlight a potential target population for bladder cancer screening.

  9. Applications of statistics to medical science, III. Correlation and regression.

    PubMed

    Watanabe, Hiroshi

    2012-01-01

    In this third part of a series surveying medical statistics, the concepts of correlation and regression are reviewed. In particular, methods of linear regression and logistic regression are discussed. Arguments related to survival analysis will be made in a subsequent paper.

  10. Clinical and sonographic risk factors and complications of shoulder dystocia - a case-control study with parity and gestational age matched controls.

    PubMed

    Parantainen, Jukka; Palomäki, Outi; Talola, Nina; Uotila, Jukka

    2014-06-01

    To examine the clinical risk factors and complications of shoulder dystocia today and to evaluate ultrasound methods predicting it. Retrospective, matched case-control study at a University Hospital with 5000 annual deliveries. The study population consisted of 152 deliveries complicated by shoulder dystocia over a period of 8.5 years (January 2004-June 2012) and 152 controls matched for gestational age and parity. The data was collected from the medical records of mothers and children and analyzed by conditional logistic regression. Incidences and odds ratios were calculated for risk factors and complications. Antenatal ultrasound data was analyzed when available by conditional logistic regression to test for significant differences between study groups. Birthweight (OR 12.1 for ≥4000 g; 95% CI 4.18-35.0) and vacuum extraction (OR 3.98; 95% CI 1.25-12.7) remained the most significant clinical risk factors. Only a trend of an association of pregestational or gestational diabetes was noticed (OR 1.87; 95% CI 0.997-3.495, probability of type II error 51%). Of the complications of shoulder dystocia the incidence of brachial plexus palsies was high (40%). Antenatal ultrasound method based on the difference between abdominal and biparietal diameters had a significant difference between cases and controls. The impact of diabetes as a risk factor has diminished, which may reflect improved screening and treatment. Antenatal ultrasound methods are showing some promise, but the predictive value of ultrasound alone is probably low. Copyright © 2014. Published by Elsevier Ireland Ltd.

  11. Bladder Pain Syndrome/Interstitial Cystitis Is Associated with Hyperthyroidism

    PubMed Central

    Liu, Shih-Ping; Lin, Ching-Chun; Lin, Herng-Ching

    2013-01-01

    Background Although the etiology of bladder pain syndrome/interstitial cystitis (BPS/IC) is still unclear, a common theme with BPS/IC patients is comorbid disorders which are related to the autonomic nervous system that connects the nervous system to end-organs. Nevertheless, no study to date has reported the association between hyperthyroidism and BPS/IC. In this study, we examined the association of IC/BPS with having previously been diagnosed with hyperthyroidism in Taiwan. Design Data in this study were retrieved from the Longitudinal Health Insurance Database. Our study consisted of 736 female cases with BPS/IC and 2208 randomly selected female controls. We performed a conditional logistic regression to calculate the odds ratio (OR) for having previously been diagnosed with hyperthyroidism between cases and controls. Results Of the 2944 sampled subjects, there was a significant difference in the prevalence of prior hyperthyroidism between cases and controls (3.3% vs. 1.5%, p<0.001). The conditional logistic regression analysis revealed that compared to controls, the OR for prior hyperthyroidism among cases was 2.16 (95% confidence interval (CI): 1.27∼3.66). Furthermore, the OR for prior hyperthyroidism among cases was 2.01 (95% CI: 1.15∼3.53) compared to controls after adjusting for diabetes, coronary heart disease, obesity, hyperlipidemia, chronic pelvic pain, irritable bowel syndrome, fibromyalgia, chronic fatigue syndrome, depression, panic disorder, migraines, sicca syndrome, allergies, endometriosis, and asthma. Conclusions Our study results indicated an association between hyperthyroidism and BPS/IC. We suggest that clinicians treating female subjects with hyperthyroidism be alert to urinary complaints in this population. PMID:23991081

  12. Anisodamine accelerates spontaneous passage of single symptomatic bile duct stones ≤ 10 mm

    PubMed Central

    Gao, Jun; Ding, Xue-Mei; Ke, Shan; Zhou, Yi-Ming; Qian, Xiao-Jun; Ma, Rui-Liang; Ning, Chun-Min; Xin, Zong-Hai; Sun, Wen-Bing

    2013-01-01

    AIM: To investigate the rate of spontaneous passage of single and symptomatic common bile duct (CBD) stones ≤ 10 mm in diameter in 4 wk with or without a 2-wk course of anisodamine. METHODS: A multicenter, randomized, placebo-controlled trial was undertaken. A total of 197 patients who met the inclusion criteria were enrolled. Ninety-seven patients were assigned randomly to the control group and the other 100 to the anisodamine group. The anisodamine group received intravenous infusions of anisodamine (10 mg every 8 h) for 2 wk. The control group received the same volume of 0.9% isotonic saline for 2 wk. Patients underwent imaging studies and liver-function tests every week for 4 wk. The rate of spontaneous passage of CBD stones was analyzed. RESULTS: The rate of spontaneous passage of CBD stones was significantly higher in the anisodamine group than that in the control group (47.0% vs 22.7%). Most (87.2%, 41/47) stone passages in the anisodamine group occurred in the first 2 wk, and passages in the control group occurred at a comparable rate each week. Factors significantly increasing the possibility of spontaneous passage by univariate logistic regression analyses were stone diameter (< 5 mm vs ≥ 5 mm and ≤ 10 mm) and anisodamine therapy. Multivariate logistic regression analyses revealed that these two factors were significantly associated with spontaneous passage. CONCLUSION: Two weeks of anisodamine administration can safely accelerate spontaneous passage of single and symptomatic CBD stones ≤ 10 mm in diameter, especially for stones < 5 mm. PMID:24151390

  13. Bladder pain syndrome/interstitial cystitis is associated with hyperthyroidism.

    PubMed

    Chung, Shiu-Dong; Liu, Shih-Ping; Lin, Ching-Chun; Li, Hsien-Chang; Lin, Herng-Ching

    2013-01-01

    Although the etiology of bladder pain syndrome/interstitial cystitis (BPS/IC) is still unclear, a common theme with BPS/IC patients is comorbid disorders which are related to the autonomic nervous system that connects the nervous system to end-organs. Nevertheless, no study to date has reported the association between hyperthyroidism and BPS/IC. In this study, we examined the association of IC/BPS with having previously been diagnosed with hyperthyroidism in Taiwan. Data in this study were retrieved from the Longitudinal Health Insurance Database. Our study consisted of 736 female cases with BPS/IC and 2208 randomly selected female controls. We performed a conditional logistic regression to calculate the odds ratio (OR) for having previously been diagnosed with hyperthyroidism between cases and controls. Of the 2944 sampled subjects, there was a significant difference in the prevalence of prior hyperthyroidism between cases and controls (3.3% vs. 1.5%, p<0.001). The conditional logistic regression analysis revealed that compared to controls, the OR for prior hyperthyroidism among cases was 2.16 (95% confidence interval (CI): 1.27∼3.66). Furthermore, the OR for prior hyperthyroidism among cases was 2.01 (95% CI: 1.15∼3.53) compared to controls after adjusting for diabetes, coronary heart disease, obesity, hyperlipidemia, chronic pelvic pain, irritable bowel syndrome, fibromyalgia, chronic fatigue syndrome, depression, panic disorder, migraines, sicca syndrome, allergies, endometriosis, and asthma. Our study results indicated an association between hyperthyroidism and BPS/IC. We suggest that clinicians treating female subjects with hyperthyroidism be alert to urinary complaints in this population.

  14. Is Pseudoexfoliation Syndrome a Risk Factor for Cerebro Vascular Disease?

    PubMed

    Kan, Emrah; Yılmaz, Ahmet; Demirağ, Mehmet Derya; Çalık, Murat

    2017-01-01

    To determine the relationship between cerebro vascular disease and pseudoexfoliation syndrome. This cross-sectional case control study consisted of 50 patients with ischemic-type cerebro vascular disease and 50 control subjects. All subjects were investigated for diabetes mellitus and hypertension status and underwent a detailed ophthalmic examination. A diagnosis of pseudoexfoliation syndrome was made if characteristic greyish particulate matter was found on the anterior lens capsule after pupillary dilatation by slit-lamp examination. All subjects were compared in terms of pseudoexfoliation syndrome, diabetes mellitus, and hypertension. Pearson Chi Square and Student's t test were used for statistical analysis. Logistic regression analyses of the risk factors between groups were also made. The presence of pseudoexfoliation syndrome was significantly higher in patients with cerebro vascular disease when compared to the control subjects (p = 0.02). The frequency of diabetes mellitus was similar between the two groups. Arterial hypertension was significantly more frequent in the patient group when compared to the control subjects (p < 0.01). The logistic regression analysis showed that both pseudoexfoliation syndrome and hypertension were significantly associated with cerebro vascular disease. In the present study, we found that pseudoexfoliation syndrome frequency was found to be higher in patients with cerebro vascular disease than in control subjects. A slit-lamp examination of the eye could be an important marker that indicates the risk of cerebro vascular disease. We recommend an evaluation of all subjects with pseudoexfoliation syndrome for the presence of cerebro vascular disease. Longitudinal studies with larger populations are needed to confirm this relationship.

  15. Filtering data from the collaborative initial glaucoma treatment study for improved identification of glaucoma progression.

    PubMed

    Schell, Greggory J; Lavieri, Mariel S; Stein, Joshua D; Musch, David C

    2013-12-21

    Open-angle glaucoma (OAG) is a prevalent, degenerate ocular disease which can lead to blindness without proper clinical management. The tests used to assess disease progression are susceptible to process and measurement noise. The aim of this study was to develop a methodology which accounts for the inherent noise in the data and improve significant disease progression identification. Longitudinal observations from the Collaborative Initial Glaucoma Treatment Study (CIGTS) were used to parameterize and validate a Kalman filter model and logistic regression function. The Kalman filter estimates the true value of biomarkers associated with OAG and forecasts future values of these variables. We develop two logistic regression models via generalized estimating equations (GEE) for calculating the probability of experiencing significant OAG progression: one model based on the raw measurements from CIGTS and another model based on the Kalman filter estimates of the CIGTS data. Receiver operating characteristic (ROC) curves and associated area under the ROC curve (AUC) estimates are calculated using cross-fold validation. The logistic regression model developed using Kalman filter estimates as data input achieves higher sensitivity and specificity than the model developed using raw measurements. The mean AUC for the Kalman filter-based model is 0.961 while the mean AUC for the raw measurements model is 0.889. Hence, using the probability function generated via Kalman filter estimates and GEE for logistic regression, we are able to more accurately classify patients and instances as experiencing significant OAG progression. A Kalman filter approach for estimating the true value of OAG biomarkers resulted in data input which improved the accuracy of a logistic regression classification model compared to a model using raw measurements as input. This methodology accounts for process and measurement noise to enable improved discrimination between progression and nonprogression in chronic diseases.

  16. Influential factors of red-light running at signalized intersection and prediction using a rare events logistic regression model.

    PubMed

    Ren, Yilong; Wang, Yunpeng; Wu, Xinkai; Yu, Guizhen; Ding, Chuan

    2016-10-01

    Red light running (RLR) has become a major safety concern at signalized intersection. To prevent RLR related crashes, it is critical to identify the factors that significantly impact the drivers' behaviors of RLR, and to predict potential RLR in real time. In this research, 9-month's RLR events extracted from high-resolution traffic data collected by loop detectors from three signalized intersections were applied to identify the factors that significantly affect RLR behaviors. The data analysis indicated that occupancy time, time gap, used yellow time, time left to yellow start, whether the preceding vehicle runs through the intersection during yellow, and whether there is a vehicle passing through the intersection on the adjacent lane were significantly factors for RLR behaviors. Furthermore, due to the rare events nature of RLR, a modified rare events logistic regression model was developed for RLR prediction. The rare events logistic regression method has been applied in many fields for rare events studies and shows impressive performance, but so far none of previous research has applied this method to study RLR. The results showed that the rare events logistic regression model performed significantly better than the standard logistic regression model. More importantly, the proposed RLR prediction method is purely based on loop detector data collected from a single advance loop detector located 400 feet away from stop-bar. This brings great potential for future field applications of the proposed method since loops have been widely implemented in many intersections and can collect data in real time. This research is expected to contribute to the improvement of intersection safety significantly. Copyright © 2016 Elsevier Ltd. All rights reserved.

  17. Use of genetic programming, logistic regression, and artificial neural nets to predict readmission after coronary artery bypass surgery.

    PubMed

    Engoren, Milo; Habib, Robert H; Dooner, John J; Schwann, Thomas A

    2013-08-01

    As many as 14 % of patients undergoing coronary artery bypass surgery are readmitted within 30 days. Readmission is usually the result of morbidity and may lead to death. The purpose of this study is to develop and compare statistical and genetic programming models to predict readmission. Patients were divided into separate Construction and Validation populations. Using 88 variables, logistic regression, genetic programs, and artificial neural nets were used to develop predictive models. Models were first constructed and tested on the Construction populations, then validated on the Validation population. Areas under the receiver operator characteristic curves (AU ROC) were used to compare the models. Two hundred and two patients (7.6 %) in the 2,644 patient Construction group and 216 (8.0 %) of the 2,711 patient Validation group were re-admitted within 30 days of CABG surgery. Logistic regression predicted readmission with AU ROC = .675 ± .021 in the Construction group. Genetic programs significantly improved the accuracy, AU ROC = .767 ± .001, p < .001). Artificial neural nets were less accurate with AU ROC = 0.597 ± .001 in the Construction group. Predictive accuracy of all three techniques fell in the Validation group. However, the accuracy of genetic programming (AU ROC = .654 ± .001) was still trivially but statistically non-significantly better than that of the logistic regression (AU ROC = .644 ± .020, p = .61). Genetic programming and logistic regression provide alternative methods to predict readmission that are similarly accurate.

  18. Artificial neural network, genetic algorithm, and logistic regression applications for predicting renal colic in emergency settings.

    PubMed

    Eken, Cenker; Bilge, Ugur; Kartal, Mutlu; Eray, Oktay

    2009-06-03

    Logistic regression is the most common statistical model for processing multivariate data in the medical literature. Artificial intelligence models like an artificial neural network (ANN) and genetic algorithm (GA) may also be useful to interpret medical data. The purpose of this study was to perform artificial intelligence models on a medical data sheet and compare to logistic regression. ANN, GA, and logistic regression analysis were carried out on a data sheet of a previously published article regarding patients presenting to an emergency department with flank pain suspicious for renal colic. The study population was composed of 227 patients: 176 patients had a diagnosis of urinary stone, while 51 ultimately had no calculus. The GA found two decision rules in predicting urinary stones. Rule 1 consisted of being male, pain not spreading to back, and no fever. In rule 2, pelvicaliceal dilatation on bedside ultrasonography replaced no fever. ANN, GA rule 1, GA rule 2, and logistic regression had a sensitivity of 94.9, 67.6, 56.8, and 95.5%, a specificity of 78.4, 76.47, 86.3, and 47.1%, a positive likelihood ratio of 4.4, 2.9, 4.1, and 1.8, and a negative likelihood ratio of 0.06, 0.42, 0.5, and 0.09, respectively. The area under the curve was found to be 0.867, 0.720, 0.715, and 0.713 for all applications, respectively. Data mining techniques such as ANN and GA can be used for predicting renal colic in emergency settings and to constitute clinical decision rules. They may be an alternative to conventional multivariate analysis applications used in biostatistics.

  19. Application of logistic regression for landslide susceptibility zoning of Cekmece Area, Istanbul, Turkey

    NASA Astrophysics Data System (ADS)

    Duman, T. Y.; Can, T.; Gokceoglu, C.; Nefeslioglu, H. A.; Sonmez, H.

    2006-11-01

    As a result of industrialization, throughout the world, cities have been growing rapidly for the last century. One typical example of these growing cities is Istanbul, the population of which is over 10 million. Due to rapid urbanization, new areas suitable for settlement and engineering structures are necessary. The Cekmece area located west of the Istanbul metropolitan area is studied, because the landslide activity is extensive in this area. The purpose of this study is to develop a model that can be used to characterize landslide susceptibility in map form using logistic regression analysis of an extensive landslide database. A database of landslide activity was constructed using both aerial-photography and field studies. About 19.2% of the selected study area is covered by deep-seated landslides. The landslides that occur in the area are primarily located in sandstones with interbedded permeable and impermeable layers such as claystone, siltstone and mudstone. About 31.95% of the total landslide area is located at this unit. To apply logistic regression analyses, a data matrix including 37 variables was constructed. The variables used in the forwards stepwise analyses are different measures of slope, aspect, elevation, stream power index (SPI), plan curvature, profile curvature, geology, geomorphology and relative permeability of lithological units. A total of 25 variables were identified as exerting strong influence on landslide occurrence, and included by the logistic regression equation. Wald statistics values indicate that lithology, SPI and slope are more important than the other parameters in the equation. Beta coefficients of the 25 variables included the logistic regression equation provide a model for landslide susceptibility in the Cekmece area. This model is used to generate a landslide susceptibility map that correctly classified 83.8% of the landslide-prone areas.

  20. New robust statistical procedures for the polytomous logistic regression models.

    PubMed

    Castilla, Elena; Ghosh, Abhik; Martin, Nirian; Pardo, Leandro

    2018-05-17

    This article derives a new family of estimators, namely the minimum density power divergence estimators, as a robust generalization of the maximum likelihood estimator for the polytomous logistic regression model. Based on these estimators, a family of Wald-type test statistics for linear hypotheses is introduced. Robustness properties of both the proposed estimators and the test statistics are theoretically studied through the classical influence function analysis. Appropriate real life examples are presented to justify the requirement of suitable robust statistical procedures in place of the likelihood based inference for the polytomous logistic regression model. The validity of the theoretical results established in the article are further confirmed empirically through suitable simulation studies. Finally, an approach for the data-driven selection of the robustness tuning parameter is proposed with empirical justifications. © 2018, The International Biometric Society.

  1. Updated logistic regression equations for the calculation of post-fire debris-flow likelihood in the western United States

    USGS Publications Warehouse

    Staley, Dennis M.; Negri, Jacquelyn A.; Kean, Jason W.; Laber, Jayme L.; Tillery, Anne C.; Youberg, Ann M.

    2016-06-30

    Wildfire can significantly alter the hydrologic response of a watershed to the extent that even modest rainstorms can generate dangerous flash floods and debris flows. To reduce public exposure to hazard, the U.S. Geological Survey produces post-fire debris-flow hazard assessments for select fires in the western United States. We use publicly available geospatial data describing basin morphology, burn severity, soil properties, and rainfall characteristics to estimate the statistical likelihood that debris flows will occur in response to a storm of a given rainfall intensity. Using an empirical database and refined geospatial analysis methods, we defined new equations for the prediction of debris-flow likelihood using logistic regression methods. We showed that the new logistic regression model outperformed previous models used to predict debris-flow likelihood.

  2. Nowcasting of Low-Visibility Procedure States with Ordered Logistic Regression at Vienna International Airport

    NASA Astrophysics Data System (ADS)

    Kneringer, Philipp; Dietz, Sebastian; Mayr, Georg J.; Zeileis, Achim

    2017-04-01

    Low-visibility conditions have a large impact on aviation safety and economic efficiency of airports and airlines. To support decision makers, we develop a statistical probabilistic nowcasting tool for the occurrence of capacity-reducing operations related to low visibility. The probabilities of four different low visibility classes are predicted with an ordered logistic regression model based on time series of meteorological point measurements. Potential predictor variables for the statistical models are visibility, humidity, temperature and wind measurements at several measurement sites. A stepwise variable selection method indicates that visibility and humidity measurements are the most important model inputs. The forecasts are tested with a 30 minute forecast interval up to two hours, which is a sufficient time span for tactical planning at Vienna Airport. The ordered logistic regression models outperform persistence and are competitive with human forecasters.

  3. EXpectation Propagation LOgistic REgRession (EXPLORER): distributed privacy-preserving online model learning.

    PubMed

    Wang, Shuang; Jiang, Xiaoqian; Wu, Yuan; Cui, Lijuan; Cheng, Samuel; Ohno-Machado, Lucila

    2013-06-01

    We developed an EXpectation Propagation LOgistic REgRession (EXPLORER) model for distributed privacy-preserving online learning. The proposed framework provides a high level guarantee for protecting sensitive information, since the information exchanged between the server and the client is the encrypted posterior distribution of coefficients. Through experimental results, EXPLORER shows the same performance (e.g., discrimination, calibration, feature selection, etc.) as the traditional frequentist logistic regression model, but provides more flexibility in model updating. That is, EXPLORER can be updated one point at a time rather than having to retrain the entire data set when new observations are recorded. The proposed EXPLORER supports asynchronized communication, which relieves the participants from coordinating with one another, and prevents service breakdown from the absence of participants or interrupted communications. Copyright © 2013 Elsevier Inc. All rights reserved.

  4. A computational approach to compare regression modelling strategies in prediction research.

    PubMed

    Pajouheshnia, Romin; Pestman, Wiebe R; Teerenstra, Steven; Groenwold, Rolf H H

    2016-08-25

    It is often unclear which approach to fit, assess and adjust a model will yield the most accurate prediction model. We present an extension of an approach for comparing modelling strategies in linear regression to the setting of logistic regression and demonstrate its application in clinical prediction research. A framework for comparing logistic regression modelling strategies by their likelihoods was formulated using a wrapper approach. Five different strategies for modelling, including simple shrinkage methods, were compared in four empirical data sets to illustrate the concept of a priori strategy comparison. Simulations were performed in both randomly generated data and empirical data to investigate the influence of data characteristics on strategy performance. We applied the comparison framework in a case study setting. Optimal strategies were selected based on the results of a priori comparisons in a clinical data set and the performance of models built according to each strategy was assessed using the Brier score and calibration plots. The performance of modelling strategies was highly dependent on the characteristics of the development data in both linear and logistic regression settings. A priori comparisons in four empirical data sets found that no strategy consistently outperformed the others. The percentage of times that a model adjustment strategy outperformed a logistic model ranged from 3.9 to 94.9 %, depending on the strategy and data set. However, in our case study setting the a priori selection of optimal methods did not result in detectable improvement in model performance when assessed in an external data set. The performance of prediction modelling strategies is a data-dependent process and can be highly variable between data sets within the same clinical domain. A priori strategy comparison can be used to determine an optimal logistic regression modelling strategy for a given data set before selecting a final modelling approach.

  5. A pre-admission program for underrepresented minority and disadvantaged students: application, acceptance, graduation rates and timeliness of graduating from medical school.

    PubMed

    Strayhorn, G

    2000-04-01

    To determine whether students' performances in a pre-admission program predicted whether participants would (1) apply to medical school, (2) get accepted, and (3) graduate. Using prospectively collected data from participants in the University of North Carolina at Chapel Hill's Medical Education Development Program (MEDP) and data from the Association of American Colleges Student and Applicant Information Management System, the author identified 371 underrepresented minority (URM) students who were full-time participants and completed the program between 1984 and 1989, prior to their acceptance into medical school. Logistic regression analysis was used to determine whether MEDP performance significantly predicted (after statistically controlling for traditional predictors of these outcomes) the proportions of URM participants who applied to medical school and were accepted, the timeliness of graduating, and the proportion graduating. Odds ratios with 95% confidence intervals were calculated to determine the associations between the independent and outcome variables. In separate logistic regression models, MEDP performance predicted the study's outcomes after statistically controlling for traditional predictors with 95% confidence intervals. Pre-admission programs with similar outcomes can improve the diversity of the physician workforce and the access to health care for underrepresented minority and economically disadvantaged populations.

  6. The association between self-reported racial discrimination and 12-month DSM-IV mental disorders among Asian Americans nationwide

    PubMed Central

    Spencer, Michael; Chen, Juan; Yip, Tiffany; Takeuchi, David T.

    2007-01-01

    Growing research finds that reports of discrimination are associated with mental health. However, many US studies are focused on regional samples and do not control for important confounders such as other stressors and health conditions. The present study examines the association between self-reported racial discrimination and DSM-IV defined mental disorders among Asian respondents to the 2002–2003 US National Latino and Asian American Study (n=2,047). Logistic regression analyses indicated that self-reported racial discrimination was associated with greater odds of having any DSM-IV disorder, depressive disorder, or anxiety disorder within the past 12 months -- controlling for sociodemographic characteristics, acculturative stress, family cohesion, poverty, self-rated health, chronic physical conditions, and social desirability. Further, multinomial logistic regression found that individuals who reported discrimination were at a twofold greater risk of having one disorder within the past 12 months, and a threefold greater risk of having two or more disorders. Thus, self-reported discrimination was associated with increased risk of mental disorders among Asian Americans across the United States and this relationship was not explained by social desirability, physical health, other stressors, and sociodemographic factors. Should these associations ultimately be shown enduring and causal, they suggest that policies designed to reduce discrimination may help improve mental health. PMID:17374553

  7. Dietary patterns and breast cancer risk among women in northern Tanzania: a case-control study.

    PubMed

    Jordan, Irmgard; Hebestreit, Antje; Swai, Britta; Krawinkel, Michael B

    2013-04-01

    Breast cancer is the second most common cancer among women in the Kilimanjaro Region of Tanzania. It was tested within a case-control study in this region whether a specific dietary pattern impacts on the breast cancer risk. A validated semi-quantitative Food Frequency Questionnaire was used to assess the dietary intake of 115 female breast cancer patients and 230 healthy age-matched women living in the same districts. A logistic regression was performed to estimate breast cancer risk. Dietary patterns were obtained using principal component analysis with Varimax rotation. The adjusted logistic regression estimated an increased risk for a "Fatty Diet", characterized by a higher consumption of milk, vegetable oils and fats, butter, lard and red meat (OR = 1.42, 95 % CI 1.08-1.87; P = 0.01), and for a "Fruity Diet", characterized by a higher consumption of fish, mango, papaya, avocado and watery fruits (OR = 1.61, 95 % CI 1.14-2.28; P = 0.01). Both diets showed an inverse association with the ratio between polyunsaturated and saturated fatty acids (P/S ratio). A diet characterized by a low P/S ratio seems to be more important for the development of breast cancer than total fat intake.

  8. Risk factors for the breakdown of perineal laceration repair after vaginal delivery.

    PubMed

    Williams, Meredith K; Chames, Mark C

    2006-09-01

    The purpose of this study was to identify risk factors that are associated with the breakdown of perineal laceration repair in the postpartum period. We conducted a retrospective, case-control study to review perineal laceration repair breakdown in patients who were delivered between September 1995 and February 2005 at the University of Michigan. Bivariate analysis with chi-square test and t-test and stepwise logistic regression analysis were performed. Fifty-nine cases and 118 control deliveries were identified from a total of 14,124 vaginal deliveries. Risk factors were longer second stage of labor (142 vs 87 minutes; P = .001), operative vaginal delivery (odds ratio, 3.6; 95% CI, 1.8-7.3), mediolateral episiotomy (odds ratio, 6.9; 95% CI, 2.6-18.7), third- or fourth-degree laceration (odds ratio, 3.1; 95% CI, 1.5-6.4), and meconium-stained amniotic fluid (odds ratio, 3.0; 95% CI, 1.1-7.9). Previous vaginal delivery was protective (odds ratio, 0.38; 95% CI, 0.18-0.84). Logistic regression showed the most significant factor to be an interaction between operative vaginal delivery and mediolateral episiotomy (odd ratio, 6.36; 95% CI, 2.18-18.57). The most significant events were mediolateral episiotomy, especially in conjunction with operative vaginal delivery, third- and fourth-degree lacerations, and meconium.

  9. Dietary n-3 Fatty Acid, α-Tocopherol, Zinc, vitamin D, vitamin C, and β-carotene are Associated with Age-Related Macular Degeneration in Japan.

    PubMed

    Aoki, Aya; Inoue, Maiko; Nguyen, Elizabeth; Obata, Ryo; Kadonosono, Kazuaki; Shinkai, Shoji; Hashimoto, Hideki; Sasaki, Satoshi; Yanagi, Yasuo

    2016-02-05

    This case-control study reports the association between nutrient intake and neovascular age-related macular degeneration (AMD) in Japan. The nutrient intake of 161 neovascular AMD cases from two university hospitals and 369 population-based control subjects from a cohort study was assessed using a brief-type self-administered questionnaire on diet history, which required respondent recall of the usual intake of 58 foods during the preceding month. Energy-adjusted nutrient intake values were compared between the groups. Logistic regression analysis was used to estimate odds ratios (ORs) and 95% CIs adjusted for smoking history, age, sex, chronic disease history, supplement use, and alcohol consumption. Logistic regression analysis demonstrated that low intakes of n-3 fatty acid, α-tocopherol, zinc, vitamin D, vitamin C, and β-carotene were associated with neovascular AMD (Trend P < 0.0001 for n-3 fatty acid, Trend P < 0.0001 for α-tocopherol, Trend P < 0.0001 for zinc, Trend P = 0.002 for vitamin D, Trend P = 0.04 for vitamin C, Trend P = 0.0004 for β-carotene). There was no association with retinol or cryptoxanthin intake and neovascular AMD (P = 0.67, 0.06).

  10. Robust experimental design for optimizing the microbial inhibitor test for penicillin detection in milk.

    PubMed

    Nagel, O G; Molina, M P; Basílico, J C; Zapata, M L; Althaus, R L

    2009-06-01

    To use experimental design techniques and a multiple logistic regression model to optimize a microbiological inhibition test with dichotomous response for the detection of Penicillin G in milk. A 2(3) x 2(2) robust experimental design with two replications was used. The effects of three control factors (V: culture medium volume, S: spore concentration of Geobacillus stearothermophilus, I: indicator concentration), two noise factors (Dt: diffusion time, Ip: incubation period) and their interactions were studied. The V, S, Dt, Ip factors and V x S, V x Ip, S x Ip interactions showed significant effects. The use of 100 microl culture medium volume, 2 x 10(5) spores ml(-1), 60 min diffusion time and 3 h incubation period is recommended. In these elaboration conditions, the penicillin detection limit was of 3.9 microg l(-1), similar to the maximum residue limit (MRL). Of the two noise factors studied, the incubation period can be controlled by means of the culture medium volume and spore concentration. We were able to optimize bioassays of dichotomous response using an experimental design and logistic regression model for the detection of residues at the level of MRL, aiding in the avoidance of health problems in the consumer.

  11. Serum antioxidant vitamins and the risk of oral cancer in patients seen at a tertiary institution in Nigeria.

    PubMed

    Lawal, A O; Kolude, B; Adeyemi, B F; Lawoyin, J O; Akang, E E

    2012-01-01

    Tobacco and alcohol are major risk factors of oral cancer, but nutritional deficiency may also contribute to development of oral cancer. This study compared serum antioxidant vitamin levels in oral cancer patients and controls in order to validate the role of vitamin deficiencies in the etiology of oral cancer. Serum vitamin A, C, and E levels of 33 oral cancer patients and 30 controls at University College Hospital, Ibadan, Nigeria, were determined using standard methods. The data obtained were analyzed using the Student t-test, odds ratio, and logistic regression. Mean vitamin A, C, and E levels were significantly lower in oral cancer patients (P=0.022, P=0.000, and P=0.013 respectively). Risk of oral cancer was 10.89, 11.35, and 5.6 times more in patients with low serum vitamins A, C, and E, respectively. However, on logistic regression analysis, only low serum vitamin E independently predicted occurrence of oral cancer. The lower serum vitamin A, C, and E levels in oral cancer patients could be either a cause or an effect of the oral cancer. Further studies using a larger sample size and cohort studies with long-term follow-up of subjects are desirable.

  12. To Control False Positives in Gene-Gene Interaction Analysis: Two Novel Conditional Entropy-Based Approaches

    PubMed Central

    Lin, Meihua; Li, Haoli; Zhao, Xiaolei; Qin, Jiheng

    2013-01-01

    Genome-wide analysis of gene-gene interactions has been recognized as a powerful avenue to identify the missing genetic components that can not be detected by using current single-point association analysis. Recently, several model-free methods (e.g. the commonly used information based metrics and several logistic regression-based metrics) were developed for detecting non-linear dependence between genetic loci, but they are potentially at the risk of inflated false positive error, in particular when the main effects at one or both loci are salient. In this study, we proposed two conditional entropy-based metrics to challenge this limitation. Extensive simulations demonstrated that the two proposed metrics, provided the disease is rare, could maintain consistently correct false positive rate. In the scenarios for a common disease, our proposed metrics achieved better or comparable control of false positive error, compared to four previously proposed model-free metrics. In terms of power, our methods outperformed several competing metrics in a range of common disease models. Furthermore, in real data analyses, both metrics succeeded in detecting interactions and were competitive with the originally reported results or the logistic regression approaches. In conclusion, the proposed conditional entropy-based metrics are promising as alternatives to current model-based approaches for detecting genuine epistatic effects. PMID:24339984

  13. Socioeconomic disparities and chronic respiratory diseases in Thailand: The National Socioeconomics Survey.

    PubMed

    Luenam, Amornrat; Laohasiriwong, Wongsa; Puttanapong, Nattapong; Saengsuwan, Jiamjit; Phajan, Teerasak

    2018-05-10

    This study aimed to determine the association between socioeconomic determinants and Chronic Respiratory Diseases (CRDs) in Thailand. The data were used from the National Socioeconomics Survey (NSS), a cross-sectional study conducted by the National Statistical Office (NSO), in 2010 and 2012. The survey used stratified two-stage sampling to select a nationally representative sample to respond to a structured questionnaire. A total of 17,040 and 16,905 individuals in 2010 and 2012, respectively, were included in this analysis. Multiple logistic regressions were used to identify the association between socioeconomic factors while controlling for other covariates. The prevalence of CRDs was 3.81% and 2.79% in 2010 and 2012, respectively. The bivariate analysis indicated that gender, family size, geographic location, fuels used for cooking and smoking were significantly associated with CRDs in 2010, whereas education, family size, occupation, region, geographic location, and smoking were significantly associated with CRDs in 2012. Both in 2010 and 2012, the multiple logistic regression indicated that the odds of having CRDs were significantly higher among those who lived in urban areas, females, those aged ≥41-50 or ≥61 yr old, and smokers when controlling for other covariates. However, fuels used for cooking, wood and gas, are associated with CRDs in 2010.

  14. Personality traits, level of anxiety and styles of coping with stressin people with asthma and chronic obstructive pulmonary disease - a comparative analysis.

    PubMed

    Tabała, Klaudia; Wrzesińska, Magdalena; Stecz, Patryk; Kocur, Józef

    2016-12-23

    Chronic obstructive pulmonary disease (COPD) and asthma are a challenge to public health, with the sufferers experiencing a range of psychological factors affecting their health and behavior. The aim of the present study was to determine the level of anxiety, personality traits and stress-coping ability of patients with obstructive lung disease and comparison with a group of healthy controls. The research was conducted on a group of 150 people with obstructive lung diseases (asthma and COPD) and healthy controls (mean age = 56.0 ± 16.00). Four surveys were used: a sociodemographic survey, NEO-FFI Personality Inventory, State-Trait Anxiety Inventory (STAI), and Brief Cope Inventory. Logistic regression was used to identify the investigated variables which best differentiated the healthy and sick individuals. Patients with asthma or COPD demonstrated a significantly lower level of conscientiousness, openness to experience, active coping and planning, as well as higher levels of neuroticism and a greater tendency to behavioral disengagement. Logistic regression found trait-anxiety, openness to experience, positive reframing, acceptance, humor and behavioral disengagement to be best at distinguishing people with lung diseases from healthy individuals. The results indicate the need for intervention in the psychological functioning of people with obstructive diseases.

  15. Association between Diet and Lifestyle Habits and Irritable Bowel Syndrome: A Case-Control Study

    PubMed Central

    Guo, Yu-Bin; Zhuang, Kang-Min; Kuang, Lei; Zhan, Qiang; Wang, Xian-Fei; Liu, Si-De

    2015-01-01

    Background/Aims Recent papers have highlighted the role of diet and lifestyle habits in irritable bowel syndrome (IBS), but very few population-based studies have evaluated this association in developing countries. The aim of this study was to evaluate the association between diet and lifestyle habits and IBS. Methods A food frequency and lifestyle habits questionnaire was used to record the diet and lifestyle habits of 78 IBS subjects and 79 healthy subjects. Cross-tabulation analysis and logistic regression were used to reveal any association among lifestyle habits, eating habits, food consumption frequency, and other associated conditions. Results The results from logistic regression analysis indicated that IBS was associated with irregular eating (odds ratio [OR], 3.257), physical inactivity (OR, 3.588), and good quality sleep (OR, 0.132). IBS subjects ate fruit (OR, 3.082) vegetables (OR, 3.778), and legumes (OR, 2.111) and drank tea (OR, 2.221) significantly more frequently than the control subjects. After adjusting for age and sex, irregular eating (OR, 3.963), physical inactivity (OR, 6.297), eating vegetables (OR, 7.904), legumes (OR, 2.674), drinking tea (OR, 3.421) and good quality sleep (OR, 0.054) were independent predictors of IBS. Conclusions This study reveals a possible association between diet and lifestyle habits and IBS. PMID:25266811

  16. Impact of major depressive disorder, distinct subtypes, and symptom severity on lifestyle in the BiDirect Study.

    PubMed

    Rahe, Corinna; Khil, Laura; Wellmann, Jürgen; Baune, Bernhard T; Arolt, Volker; Berger, Klaus

    2016-11-30

    The aim of this study was to examine associations of major depressive disorder (MDD), its distinct subtypes, and symptom severity with the individual lifestyle factors smoking, diet quality, physical activity, and body mass index as well as with a combined lifestyle index measuring the co-occurrence of these lifestyle factors. A sample of 823 patients with MDD and 597 non-depressed controls was examined. The psychiatric assessment was based on a clinical interview including the Mini International Neuropsychiatric Interview and the Hamilton Depression Rating Scale. Each lifestyle factor was scored as either healthy or unhealthy, and the number of unhealthy lifestyle factors was added up in a combined lifestyle index. Cross-sectional analyses were performed using alternating logistic regression and ordinal logistic regression, adjusted for socio-demographic characteristics. After adjustment, MDD was significantly associated with smoking, low physical activity, and overweight. Likewise, MDD was significantly related to the overall lifestyle index. When stratifying for subtypes, all subtypes showed higher odds for an overall unhealthier lifestyle than controls, but the associations with the individual lifestyle factors were partly different. Symptom severity was associated with the lifestyle index in a dose-response manner. In conclusion, patients with MDD represent an important target group for lifestyle interventions. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  17. Correlations between quality ratings of skilled nursing facilities and multidrug-resistant urinary tract infections.

    PubMed

    Gucwa, Azad L; Dolar, Veronika; Ye, Chao; Epstein, Stephanie

    2016-11-01

    The purpose of this study was to determine risk factors for the acquisition of urinary tract infections (UTIs) and multidrug-resistant organisms (MDROs) in residents of skilled nursing facilities (SNFs). Using the informational database provided by the Centers for Medicare and Medicaid Services (CMS), a retrospective logistic regression was performed on 1,523 urine cultures from 12 SNFs located in Long Island, New York. Of the 1,142 positive urine cultures, Escherichia coli was most prevalent. Additionally, 164 (14.4%) of the UTIs were attributed to an MDRO. In multivariate logistic regression, sex and overall quality rating predicted the occurrence of UTIs, whereas identification of MDROs was dependent on the level of nursing care received. The mean predicted probability of UTIs and receipt of contaminated samples was inversely dependent on the facility's rating, where the likelihood increased as overall quality ratings decreased. The CMS's quality rating system may provide some insight into the status of infection control practices in SNFs. The results of this study suggest that potential consumers should focus on the overall star ratings and the competency of the nursing staff in these facilities rather than on individual quality measures. Copyright © 2016 Association for Professionals in Infection Control and Epidemiology, Inc. Published by Elsevier Inc. All rights reserved.

  18. The CD4/CD8 ratio is associated with coronary artery disease (CAD) in elderly Chinese patients.

    PubMed

    Gao, Pan; Rong, Hong-Hui; Lu, Ting; Tang, Gang; Si, Liang-Yi; Lederer, James A; Xiong, Wei

    2017-01-01

    The aim of this study was to investigate the relationship between number of circulating T cells and coronary artery disease (CAD) in an elderly Chinese population. A total of 295 elderly inpatients (age≥60) were included in this cross-sectional study. Their clinical and biochemical characteristics were recorded. Patients were divided to two groups: control patients and CAD patients. The risk factors of CAD were explored by binary logistic regression analysis. Compared with control patients, the ratio of CD4 to CD8 T cells was significantly increased in CAD patients. There was no difference in the number of CD3, CD4, and CD8 T cells between the two groups. Multiple logistic regression analysis showed that CAD was independently associated with age, gender, body mass index (BMI), systolic blood pressure (SBP), chronic heart failure (CHF) and the CD4/CD8 ratio. In addition, after adjusting for different clinical parameters (including gender, age, CHF, hypertension, arrhythmia, SBP, and BMI), the risk of CAD was significantly increased in patients with a CD4/CD8 ratio>1.5. There was a strong and independent association between the ratio of CD4/CD8 and CAD in elderly Chinese population. Copyright © 2016. Published by Elsevier B.V.

  19. The prevalence of postpartum depression: the relative significance of three social status indices.

    PubMed

    Segre, Lisa S; O'Hara, Michael W; Arndt, Stephan; Stuart, Scott

    2007-04-01

    Little is known about the prevalence of clinically significant postpartum depression in women of varying social status. The purpose of the present study was to examine the prevalence of postpartum depression as a function of three indices of social status: income, education and occupational prestige. A sample of 4,332 postpartum women completed a demographic interview and the Inventory to Diagnose Depression, a self-report scale developed to identify a major depressive episode in accordance with DSM diagnostic criteria. Logistic regression was used to assess the relative significance of the three social status variables as risk factors for postpartum depression controlling for the effects of correlated demographic variables. In the logistic regression, income, occupational prestige, marital status, and number of children were significant predictors of postpartum depression controlling for the effects of other related demographic characteristics. The Wald Chi Square value for each of these significant predictors indicates that income was the strongest predictor. The prevalence of postpartum depression was significantly higher in financially poor relative to financially affluent women. Maternal depression screening programs targeting women who are financially poor are well placed. Future research is needed to replicate the present findings in a more ethnically diverse sample that includes the full age range of teenage mothers.

  20. Job characteristics in relation to the prevalence of myocardial infarction in the US Health Examination Survey (HES) and the Health and Nutrition Examination Survey (HANES).

    PubMed

    Karasek, R A; Theorell, T; Schwartz, J E; Schnall, P L; Pieper, C F; Michela, J L

    1988-08-01

    Associations between psychosocial job characteristics and past myocardial infarction (MI) prevalence for employed males were tested with the Health Examination Survey (HES) 1960-61, N = 2,409, and the Health and Nutrition Examination Survey (HANES) 1971-75, N = 2,424. A new estimation method is used which imputes to census occupation codes, job characteristic information from national surveys of job characteristics (US Department of Labor, Quality of Employment Surveys). Controlling for age, we find that employed males with jobs which are simultaneously low in decision latitude and high in psychological work load (a multiplicative product term isolating 20 per cent of the population) have a higher prevalence of myocardial infarction in both data bases. In a logistic regression analysis, using job measures adjusted for demographic factors and controlling for age, race, education, systolic blood pressure, serum cholesterol, smoking (HANES only), and physical exertion, we find a low decision latitude/high psychological demand multiplicative product term associated with MI in both data bases. Additional multiple logistic regressions show that low decision latitude is associated with increased prevalence of MI in both the HES and the HANES. Psychological workload and physical exertion are significant only in the HANES.

  1. Job characteristics in relation to the prevalence of myocardial infarction in the US Health Examination Survey (HES) and the Health and Nutrition Examination Survey (HANES).

    PubMed Central

    Karasek, R A; Theorell, T; Schwartz, J E; Schnall, P L; Pieper, C F; Michela, J L

    1988-01-01

    Associations between psychosocial job characteristics and past myocardial infarction (MI) prevalence for employed males were tested with the Health Examination Survey (HES) 1960-61, N = 2,409, and the Health and Nutrition Examination Survey (HANES) 1971-75, N = 2,424. A new estimation method is used which imputes to census occupation codes, job characteristic information from national surveys of job characteristics (US Department of Labor, Quality of Employment Surveys). Controlling for age, we find that employed males with jobs which are simultaneously low in decision latitude and high in psychological work load (a multiplicative product term isolating 20 per cent of the population) have a higher prevalence of myocardial infarction in both data bases. In a logistic regression analysis, using job measures adjusted for demographic factors and controlling for age, race, education, systolic blood pressure, serum cholesterol, smoking (HANES only), and physical exertion, we find a low decision latitude/high psychological demand multiplicative product term associated with MI in both data bases. Additional multiple logistic regressions show that low decision latitude is associated with increased prevalence of MI in both the HES and the HANES. Psychological workload and physical exertion are significant only in the HANES. PMID:3389427

  2. Cytopathologic differential diagnosis of low-grade urothelial carcinoma and reactive urothelial proliferation in bladder washings: a logistic regression analysis.

    PubMed

    Cakir, Ebru; Kucuk, Ulku; Pala, Emel Ebru; Sezer, Ozlem; Ekin, Rahmi Gokhan; Cakmak, Ozgur

    2017-05-01

    Conventional cytomorphologic assessment is the first step to establish an accurate diagnosis in urinary cytology. In cytologic preparations, the separation of low-grade urothelial carcinoma (LGUC) from reactive urothelial proliferation (RUP) can be exceedingly difficult. The bladder washing cytologies of 32 LGUC and 29 RUP were reviewed. The cytologic slides were examined for the presence or absence of the 28 cytologic features. The cytologic criteria showing statistical significance in LGUC were increased numbers of monotonous single (non-umbrella) cells, three-dimensional cellular papillary clusters without fibrovascular cores, irregular bordered clusters, atypical single cells, irregular nuclear overlap, cytoplasmic homogeneity, increased N/C ratio, pleomorphism, nuclear border irregularity, nuclear eccentricity, elongated nuclei, and hyperchromasia (p ˂ 0.05), and the cytologic criteria showing statistical significance in RUP were inflammatory background, mixture of small and large urothelial cells, loose monolayer aggregates, and vacuolated cytoplasm (p ˂ 0.05). When these variables were subjected to a stepwise logistic regression analysis, four features were selected to distinguish LGUC from RUP: increased numbers of monotonous single (non-umbrella) cells, increased nuclear cytoplasmic ratio, hyperchromasia, and presence of small and large urothelial cells (p = 0.0001). By this logistic model of the 32 cases with proven LGUC, the stepwise logistic regression analysis correctly predicted 31 (96.9%) patients with this diagnosis, and of the 29 patients with RUP, the logistic model correctly predicted 26 (89.7%) patients as having this disease. There are several cytologic features to separate LGUC from RUP. Stepwise logistic regression analysis is a valuable tool for determining the most useful cytologic criteria to distinguish these entities. © 2017 APMIS. Published by John Wiley & Sons Ltd.

  3. Tobacco advertising, environmental smoking bans, and smoking in Chinese urban areas.

    PubMed

    Yang, Tingzhong; Rockett, Ian R H; Li, Mu; Xu, Xiaochao; Gu, Yaming

    2012-07-01

    To evaluate whether cigarette smoking in Chinese urban areas was respectively associated with exposure to tobacco advertising and smoking bans in households, workplaces, and public places. Participants were 4735 urban residents aged 15 years and older, who were identified through multi-stage quota-sampling conducted in six Chinese cities. Data were collected on individual sociodemographics and smoking status, and regional tobacco control measures. The sample was characterized in terms of smoking prevalence, and multilevel logistic models were employed to analyze the association between smoking and tobacco advertising and environmental smoking restrictions, respectively. Smoking prevalence was 30%. Multilevel logistic regression analysis showed that smoking was positively associated with exposure to tobacco advertising, and negatively associated with workplace and household smoking bans. The association of smoking with both tobacco advertising and environmental smoking bans further justifies implementation of comprehensive smoking interventions and tobacco control programs in China. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.

  4. Internet gaming disorder in early adolescence: Associations with parental and adolescent mental health.

    PubMed

    Wartberg, L; Kriston, L; Kramer, M; Schwedler, A; Lincoln, T M; Kammerl, R

    2017-06-01

    Internet gaming disorder (IGD) has been included in the Diagnostic and Statistical Manual of Mental Disorders (DSM-5). Currently, associations between IGD in early adolescence and mental health are largely unexplained. In the present study, the relation of IGD with adolescent and parental mental health was investigated for the first time. We surveyed 1095 family dyads (an adolescent aged 12-14 years and a related parent) with a standardized questionnaire for IGD as well as for adolescent and parental mental health. We conducted linear (dimensional approach) and logistic (categorical approach) regression analyses. Both with dimensional and categorical approaches, we observed statistically significant associations between IGD and male gender, a higher degree of adolescent antisocial behavior, anger control problems, emotional distress, self-esteem problems, hyperactivity/inattention and parental anxiety (linear regression model: corrected R 2 =0.41, logistic regression model: Nagelkerke's R 2 =0.41). IGD appears to be associated with internalizing and externalizing problems in adolescents. Moreover, the findings of the present study provide first evidence that not only adolescent but also parental mental health is relevant to IGD in early adolescence. Adolescent and parental mental health should be considered in prevention and intervention programs for IGD in adolescence. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  5. Science of Test Research Consortium: Year Two Final Report

    DTIC Science & Technology

    2012-10-02

    July 2012. Analysis of an Intervention for Small Unmanned Aerial System ( SUAS ) Accidents, submitted to Quality Engineering, LQEN-2012-0056. Stone... Systems Engineering. Wolf, S. E., R. R. Hill, and J. J. Pignatiello. June 2012. Using Neural Networks and Logistic Regression to Model Small Unmanned ...Human Retina. 6. Wolf, S. E. March 2012. Modeling Small Unmanned Aerial System Mishaps using Logistic Regression and Artificial Neural Networks. 7

  6. Binary Logistic Regression Analysis for Detecting Differential Item Functioning: Effectiveness of R[superscript 2] and Delta Log Odds Ratio Effect Size Measures

    ERIC Educational Resources Information Center

    Hidalgo, Mª Dolores; Gómez-Benito, Juana; Zumbo, Bruno D.

    2014-01-01

    The authors analyze the effectiveness of the R[superscript 2] and delta log odds ratio effect size measures when using logistic regression analysis to detect differential item functioning (DIF) in dichotomous items. A simulation study was carried out, and the Type I error rate and power estimates under conditions in which only statistical testing…

  7. Logistic quantile regression provides improved estimates for bounded avian counts: a case study of California Spotted Owl fledgling production

    Treesearch

    Brian S. Cade; Barry R. Noon; Rick D. Scherer; John J. Keane

    2017-01-01

    Counts of avian fledglings, nestlings, or clutch size that are bounded below by zero and above by some small integer form a discrete random variable distribution that is not approximated well by conventional parametric count distributions such as the Poisson or negative binomial. We developed a logistic quantile regression model to provide estimates of the empirical...

  8. Comparison of four methods for deriving hospital standardised mortality ratios from a single hierarchical logistic regression model.

    PubMed

    Mohammed, Mohammed A; Manktelow, Bradley N; Hofer, Timothy P

    2016-04-01

    There is interest in deriving case-mix adjusted standardised mortality ratios so that comparisons between healthcare providers, such as hospitals, can be undertaken in the controversial belief that variability in standardised mortality ratios reflects quality of care. Typically standardised mortality ratios are derived using a fixed effects logistic regression model, without a hospital term in the model. This fails to account for the hierarchical structure of the data - patients nested within hospitals - and so a hierarchical logistic regression model is more appropriate. However, four methods have been advocated for deriving standardised mortality ratios from a hierarchical logistic regression model, but their agreement is not known and neither do we know which is to be preferred. We found significant differences between the four types of standardised mortality ratios because they reflect a range of underlying conceptual issues. The most subtle issue is the distinction between asking how an average patient fares in different hospitals versus how patients at a given hospital fare at an average hospital. Since the answers to these questions are not the same and since the choice between these two approaches is not obvious, the extent to which profiling hospitals on mortality can be undertaken safely and reliably, without resolving these methodological issues, remains questionable. © The Author(s) 2012.

  9. Three methods to construct predictive models using logistic regression and likelihood ratios to facilitate adjustment for pretest probability give similar results.

    PubMed

    Chan, Siew Foong; Deeks, Jonathan J; Macaskill, Petra; Irwig, Les

    2008-01-01

    To compare three predictive models based on logistic regression to estimate adjusted likelihood ratios allowing for interdependency between diagnostic variables (tests). This study was a review of the theoretical basis, assumptions, and limitations of published models; and a statistical extension of methods and application to a case study of the diagnosis of obstructive airways disease based on history and clinical examination. Albert's method includes an offset term to estimate an adjusted likelihood ratio for combinations of tests. Spiegelhalter and Knill-Jones method uses the unadjusted likelihood ratio for each test as a predictor and computes shrinkage factors to allow for interdependence. Knottnerus' method differs from the other methods because it requires sequencing of tests, which limits its application to situations where there are few tests and substantial data. Although parameter estimates differed between the models, predicted "posttest" probabilities were generally similar. Construction of predictive models using logistic regression is preferred to the independence Bayes' approach when it is important to adjust for dependency of tests errors. Methods to estimate adjusted likelihood ratios from predictive models should be considered in preference to a standard logistic regression model to facilitate ease of interpretation and application. Albert's method provides the most straightforward approach.

  10. A comparison of three methods of assessing differential item functioning (DIF) in the Hospital Anxiety Depression Scale: ordinal logistic regression, Rasch analysis and the Mantel chi-square procedure.

    PubMed

    Cameron, Isobel M; Scott, Neil W; Adler, Mats; Reid, Ian C

    2014-12-01

    It is important for clinical practice and research that measurement scales of well-being and quality of life exhibit only minimal differential item functioning (DIF). DIF occurs where different groups of people endorse items in a scale to different extents after being matched by the intended scale attribute. We investigate the equivalence or otherwise of common methods of assessing DIF. Three methods of measuring age- and sex-related DIF (ordinal logistic regression, Rasch analysis and Mantel χ(2) procedure) were applied to Hospital Anxiety Depression Scale (HADS) data pertaining to a sample of 1,068 patients consulting primary care practitioners. Three items were flagged by all three approaches as having either age- or sex-related DIF with a consistent direction of effect; a further three items identified did not meet stricter criteria for important DIF using at least one method. When applying strict criteria for significant DIF, ordinal logistic regression was slightly less sensitive. Ordinal logistic regression, Rasch analysis and contingency table methods yielded consistent results when identifying DIF in the HADS depression and HADS anxiety scales. Regardless of methods applied, investigators should use a combination of statistical significance, magnitude of the DIF effect and investigator judgement when interpreting the results.

  11. Extreme Sparse Multinomial Logistic Regression: A Fast and Robust Framework for Hyperspectral Image Classification

    NASA Astrophysics Data System (ADS)

    Cao, Faxian; Yang, Zhijing; Ren, Jinchang; Ling, Wing-Kuen; Zhao, Huimin; Marshall, Stephen

    2017-12-01

    Although the sparse multinomial logistic regression (SMLR) has provided a useful tool for sparse classification, it suffers from inefficacy in dealing with high dimensional features and manually set initial regressor values. This has significantly constrained its applications for hyperspectral image (HSI) classification. In order to tackle these two drawbacks, an extreme sparse multinomial logistic regression (ESMLR) is proposed for effective classification of HSI. First, the HSI dataset is projected to a new feature space with randomly generated weight and bias. Second, an optimization model is established by the Lagrange multiplier method and the dual principle to automatically determine a good initial regressor for SMLR via minimizing the training error and the regressor value. Furthermore, the extended multi-attribute profiles (EMAPs) are utilized for extracting both the spectral and spatial features. A combinational linear multiple features learning (MFL) method is proposed to further enhance the features extracted by ESMLR and EMAPs. Finally, the logistic regression via the variable splitting and the augmented Lagrangian (LORSAL) is adopted in the proposed framework for reducing the computational time. Experiments are conducted on two well-known HSI datasets, namely the Indian Pines dataset and the Pavia University dataset, which have shown the fast and robust performance of the proposed ESMLR framework.

  12. Latin hypercube approach to estimate uncertainty in ground water vulnerability

    USGS Publications Warehouse

    Gurdak, J.J.; McCray, J.E.; Thyne, G.; Qi, S.L.

    2007-01-01

    A methodology is proposed to quantify prediction uncertainty associated with ground water vulnerability models that were developed through an approach that coupled multivariate logistic regression with a geographic information system (GIS). This method uses Latin hypercube sampling (LHS) to illustrate the propagation of input error and estimate uncertainty associated with the logistic regression predictions of ground water vulnerability. Central to the proposed method is the assumption that prediction uncertainty in ground water vulnerability models is a function of input error propagation from uncertainty in the estimated logistic regression model coefficients (model error) and the values of explanatory variables represented in the GIS (data error). Input probability distributions that represent both model and data error sources of uncertainty were simultaneously sampled using a Latin hypercube approach with logistic regression calculations of probability of elevated nonpoint source contaminants in ground water. The resulting probability distribution represents the prediction intervals and associated uncertainty of the ground water vulnerability predictions. The method is illustrated through a ground water vulnerability assessment of the High Plains regional aquifer. Results of the LHS simulations reveal significant prediction uncertainties that vary spatially across the regional aquifer. Additionally, the proposed method enables a spatial deconstruction of the prediction uncertainty that can lead to improved prediction of ground water vulnerability. ?? 2007 National Ground Water Association.

  13. Poor sleep quality has an adverse effect on childhood asthma control and lung function measures.

    PubMed

    Sheen, Youn Ho; Choi, Sun Hee; Jang, Sun Jung; Baek, Ji Hyeon; Jee, Hye Mi; Kim, Mi Ae; Chae, Kyu Young; Han, Man Yong

    2017-08-01

    It is unclear as to whether sleep respiratory breathing disorder (SRBD) is a risk factor for uncontrolled asthma in children. The aim of this study was therefore to investigate whether SRBD may have an adverse effect on childhood asthma control and lung function measures. This was a cross-sectional study of 220 children with well-controlled (n = 108), partly controlled (n = 92), and uncontrolled asthma (n = 20) according to the Global Initiative for Asthma guideline. SRBD was assessed using the Pediatric Sleep Questionnaire (PSQ). The association of SRBD with partly controlled/uncontrolled asthma was investigated on multivariate logistic regression analysis. Of 220 children with asthma, 43 (19.6%) had SRBD: well-controlled, 16.7% (18/108); partly controlled, 21.7% (20/92); and uncontrolled, 25.0% (5/20; P = 0.54). There was a significant difference in forced expiratory volume in 1 s/forced vital capacity (FEV 1 /FVC; P = 0.007) and childhood asthma control test (C-ACT) score (P < 0.001) according to asthma control status, but not in PSQ score (P = 0.18). Children with obstructive sleep apnea (PSQ >0.33) had a lower C-ACT score compared with controls (PSQ ≤0.33; 19.6 ± 5.1 vs 22.0 ± 4.2, P = 0.002). PSQ score was negatively correlated with FEV 1 /FVC (r = -0.16, P = 0.02). On multivariate logistic regression analysis, high PSQ score increased the odds of having partly controlled/uncontrolled asthma by 9.12 (95% CI: 1.04-79.72, P = 0.046) after adjusting for confounding factors. SRBD is an independent risk factor for partly controlled/uncontrolled asthma and has an adverse effect on lung function measures in children. Further research is warranted to determine whether the improvement of sleep quality may also enhance level of asthma control and lung function in children. © 2017 Japan Pediatric Society.

  14. Beyond logistic regression: structural equations modelling for binary variables and its application to investigating unobserved confounders.

    PubMed

    Kupek, Emil

    2006-03-15

    Structural equation modelling (SEM) has been increasingly used in medical statistics for solving a system of related regression equations. However, a great obstacle for its wider use has been its difficulty in handling categorical variables within the framework of generalised linear models. A large data set with a known structure among two related outcomes and three independent variables was generated to investigate the use of Yule's transformation of odds ratio (OR) into Q-metric by (OR-1)/(OR+1) to approximate Pearson's correlation coefficients between binary variables whose covariance structure can be further analysed by SEM. Percent of correctly classified events and non-events was compared with the classification obtained by logistic regression. The performance of SEM based on Q-metric was also checked on a small (N = 100) random sample of the data generated and on a real data set. SEM successfully recovered the generated model structure. SEM of real data suggested a significant influence of a latent confounding variable which would have not been detectable by standard logistic regression. SEM classification performance was broadly similar to that of the logistic regression. The analysis of binary data can be greatly enhanced by Yule's transformation of odds ratios into estimated correlation matrix that can be further analysed by SEM. The interpretation of results is aided by expressing them as odds ratios which are the most frequently used measure of effect in medical statistics.

  15. Predictors of postoperative outcomes of cubital tunnel syndrome treatments using multiple logistic regression analysis.

    PubMed

    Suzuki, Taku; Iwamoto, Takuji; Shizu, Kanae; Suzuki, Katsuji; Yamada, Harumoto; Sato, Kazuki

    2017-05-01

    This retrospective study was designed to investigate prognostic factors for postoperative outcomes for cubital tunnel syndrome (CubTS) using multiple logistic regression analysis with a large number of patients. Eighty-three patients with CubTS who underwent surgeries were enrolled. The following potential prognostic factors for disease severity were selected according to previous reports: sex, age, type of surgery, disease duration, body mass index, cervical lesion, presence of diabetes mellitus, Workers' Compensation status, preoperative severity, and preoperative electrodiagnostic testing. Postoperative severity of disease was assessed 2 years after surgery by Messina's criteria which is an outcome measure specifically for CubTS. Bivariate analysis was performed to select candidate prognostic factors for multiple linear regression analyses. Multiple logistic regression analysis was conducted to identify the association between postoperative severity and selected prognostic factors. Both bivariate and multiple linear regression analysis revealed only preoperative severity as an independent risk factor for poor prognosis, while other factors did not show any significant association. Although conflicting results exist regarding prognosis of CubTS, this study supports evidence from previous studies and concludes early surgical intervention portends the most favorable prognosis. Copyright © 2017 The Japanese Orthopaedic Association. Published by Elsevier B.V. All rights reserved.

  16. Advertising for Demand Creation for Voluntary Medical Male Circumcision.

    PubMed

    Wilson, Nicholas; Frade, Sasha; Rech, Dino; Friedman, Willa

    2016-08-15

    To measure the effects of information, a challenge, and a conditional cash transfer on take-up of voluntary medical male circumcision (VMMC). A randomized, controlled experiment with 4000 postcard recipients in Soweto (Johannesburg), South Africa. We examined differences in take-up of several decisions in the VMMC cascade between the control arm and each of several intervention arms using logistic regression. Logistic regression analysis indicated that the group offered US $10 as compensation and the group challenged with "Are you tough enough?" had significantly higher take-up of the VMMC procedure than did the control group [odds ratios, respectively, 5.30 (CI: 2.20 to 12.76) and 2.70 (CI: 1.05 to 6.91)]. Similarly, the compensation group had significantly higher take-up of the VMMC counseling session than did the control group [odds ratio 3.76 (CI: 1.79 to 7.89)]. The analysis did not reveal significantly different take-up of either the VMMC counseling session or the procedure in the partner preference information group compared with the control group [odds ratios, respectively, 1.23 (CI: 0.51 to 2.97) and 1.67 (CI: 0.61 to 4.62)]. The analysis did not reveal significantly higher take-up of the VMMC nurse hotline in any intervention group compared with the control group [odds ratios for US $10, information, and challenge, respectively, 1.17 (CI: 0.67 to 2.07), 0.69 (CI: 0.36 to 1.32), and 0.60 (0.31 to 1.18)]. Among adult males in Soweto, South Africa, compensation of US $10 provided conditional on completing the VMMC counseling session compared with no compensation offer and a postcard with a challenge, "Are you tough enough?" compared with no challenge, resulted in moderate increases in take-up of circumcision.

  17. [A prospective study of risk factors in pregnant women with abnormal glucose metabolism].

    PubMed

    Yang, Hui-xia; Zhang, Mei-hua; Sun, Wei-jie; Zhao, Yi

    2005-11-01

    To evaluate the risk factors for gestational diabetes mellitus (GDM) and gestational impaired glucose tolerance (GIGT). A prospective case-control study was performed in 85 women with GDM, 63 cases with GIGT and 125 cases as control recruited from Feb 2004 to Aug 2004 in Peking University First Hospital. Univariate analysis and multivariate logistic regression were used to identify risk factors of GDM and GIGT. (1) The mean age, and body mass index (BMI) before pregnancy and larger maternal weight gains during pregnancy were significantly different between GDM/GIGT and control group (P < 0.05). (2) More intakes of fruits and carbohydrate per day increased the incidence of GDM and GIGT (P < 0.05). (3) There was a higher proportion of women with family history of diabetes among the GDM (42.2%) and GIGT (36.5%) compared with control group (19.2%). Irregular menses (16.5%, 23.8%), and polycystic ovary syndrome (PCOS) (5.9%, 3.2%) were more prevalent in the GDM, GIGT groups versus control subjects (6.4%, 0). The incidence of vulvovaginal candidiasis (VVC) was significantly higher in pregnant women with GDM and GIGT (15.3% and 17.4%) than in control group (7.2%). (4) Multivariate logistic regression showed that age, irregular menses, BMI before pregnancy, history of spontaneous abortion, educational level and VVC all were independent factors for GDM or GIGT. Maternal age, irregular menses, obesity before gestation, rapid weight gains during pregnancy, history of spontaneous abortion as well as VVC are independent risk factors for GDM or GIGT. PCOS and family history of diabetes increase the incidence of GDM and GIGT but these are not independent risk factors for GDM and GIGT.

  18. Polygenic risk score in postmortem diagnosed sporadic early-onset Alzheimer's disease.

    PubMed

    Chaudhury, Sultan; Patel, Tulsi; Barber, Imelda S; Guetta-Baranes, Tamar; Brookes, Keeley J; Chappell, Sally; Turton, James; Guerreiro, Rita; Bras, Jose; Hernandez, Dena; Singleton, Andrew; Hardy, John; Mann, David; Morgan, Kevin

    2018-02-01

    Sporadic early-onset Alzheimer's disease (sEOAD) exhibits the symptoms of late-onset Alzheimer's disease but lacks the familial aspect of the early-onset familial form. The genetics of Alzheimer's disease (AD) identifies APOEε4 to be the greatest risk factor; however, it is a complex disease involving both environmental risk factors and multiple genetic loci. Polygenic risk scores (PRSs) accumulate the total risk of a phenotype in an individual based on variants present in their genome. We determined whether sEOAD cases had a higher PRS compared to controls. A cohort of sEOAD cases was genotyped on the NeuroX array, and PRSs were generated using PRSice. The target data set consisted of 408 sEOAD cases and 436 controls. The base data set was collated by the International Genomics of Alzheimer's Project consortium, with association data from 17,008 late-onset Alzheimer's disease cases and 37,154 controls, which can be used for identifying sEOAD cases due to having shared phenotype. PRSs were generated using all common single nucleotide polymorphisms between the base and target data set, PRS were also generated using only single nucleotide polymorphisms within a 500 kb region surrounding the APOE gene. Sex and number of APOE ε2 or ε4 alleles were used as variables for logistic regression and combined with PRS. The results show that PRS is higher on average in sEOAD cases than controls, although there is still overlap among the whole cohort. Predictive ability of identifying cases and controls using PRSice was calculated with 72.9% accuracy, greater than the APOE locus alone (65.2%). Predictive ability was further improved with logistic regression, identifying cases and controls with 75.5% accuracy. Copyright © 2017 Elsevier Inc. All rights reserved.

  19. Blood harmane (1-methyl-9H-pyrido[3,4-b]indole) concentration in essential tremor cases in Spain.

    PubMed

    Louis, Elan D; Benito-León, Julian; Moreno-García, Sara; Vega, Saturio; Romero, Juan Pablo; Bermejo-Pareja, Felix; Gerbin, Marina; Viner, Amanda S; Factor-Litvak, Pam; Jiang, Wendy; Zheng, Wei

    2013-01-01

    Environmental correlates for essential tremor (ET) are largely unexplored. The search for such environmental factors has involved the study of a number of neurotoxins. Harmane (1-methyl-9H-pyrido[3,4-b]indole) is a potent tremor-producing toxin. In two prior case-control studies in New York, we demonstrated that blood harmane concentration was elevated in ET patients vs. controls, and especially in familial ET cases. These findings, however, have been derived from a study of cases ascertained through a single tertiary referral center in New York. Our objective was to determine whether blood harmane concentrations are elevated in familial and sporadic ET cases, ascertained from central Spain, compared to controls without ET. Blood harmane concentrations were quantified by a well-established high performance liquid chromatography method. The median harmane concentrations were: 2.09 g(-10)/ml (138 controls), 2.41 g(-10)/ml (68 sporadic ET), and 2.90 g(-10)/ml (62 familial ET). In an unadjusted logistic regression analysis, log blood harmane concentration was not significantly associated with diagnosis (familial ET vs. control): odds ratio=1.56, p=0.26. In a logistic regression analysis that adjusted for evaluation start time, which was an important confounding variable, the odds ratio increased to 2.35, p=0.049. Blood harmane levels were slightly elevated in a group of familial ET cases compared to a group of controls in Spain. These data seem to further extend our observations from New York to a second cohort of ET cases in Spain. This neurotoxin continues to be a source of interest for future confirmatory research. Copyright © 2012 Elsevier Inc. All rights reserved.

  20. BLOOD HARMANE (1-METHYL-9H-PYRIDO[3,4-B]INDOLE) CONCENTRATION IN ESSENTIAL TREMOR CASES IN SPAIN

    PubMed Central

    Louis, Elan D.; Benito-León, Julian; Moreno-García, Sara; Vega, Saturio; Romero, Juan Pablo; Bermejo-Pareja, Felix; Gerbin, Marina; Viner, Amanda S.; Factor-Litvak, Pam; Jiang, Wendy; Zheng, Wei

    2012-01-01

    Background Environmental correlates for essential tremor (ET) are largely unexplored. The search for such environmental factors has involved the study of a number of neurotoxins. Harmane (1-methyl-9H-pyrido[3,4-b]indole) is a potent tremor-producing toxin. In two prior case-control studies in New York, we demonstrated that blood harmane concentration was elevated in ET patients vs. controls, and especially in familial ET cases. These findings, however, have been derived from a study of cases ascertained through a single tertiary referral center in New York. Objective Our objective was to determine whether blood harmane concentrations are elevated in familial and sporadic ET cases, ascertained from central Spain, compared to controls without ET. Methods Blood harmane concentrations were quantified by a well-established high performance liquid chromatography method. Results The median harmane concentrations were: 2.09 g−10/ml (138 controls), 2.41 g−10/ml (68 sporadic ET), and 2.90 g−10/ml (62 familial ET). In an unadjusted logistic regression analysis, log blood harmane concentration was not significantly associated with diagnosis (familial ET vs. control): odds ratio = 1.56, p = 0.26. In a logistic regression analysis that adjusted for evaluation start time, which was an important confounding variable, the odds ratio increased to 2.35, p = 0.049. Conclusions Blood harmane levels were slightly elevated in a group of familial ET cases compared to a group of controls in Spain. These data seem to further extend our observations from New York to a second cohort of ET cases in Spain. This neurotoxin continues to be a source of interest for future confirmatory research. PMID:22981972

  1. Impact of poor glycemic control of type 2 diabetes mellitus on serum prostate-specific antigen concentrations in men.

    PubMed

    Atalay, Hasan Anıl; Akarsu, Murat; Canat, Lutfi; Ülker, Volkan; Alkan, İlter; Ozkuvancı, Unsal

    2017-09-01

    To evaluate the impact of poor glycemic control of type 2 diabetes mellitus (T2DM) on serum prostate-specific antigen (PSA) concentrations in men. We performed a prospective analysis of 215 consecutive patients affected by erectile dysfunction (ED). ED was evaluated using the IIEF-5 questionnaire and the poor glycemic control (PGC) of T2DM was assessed according to the HbA1c criteria (International Diabetes Federation). Patients were divided into PGC group (HbA1c ≥ 7%) and control group (CG) (HbA1c < 6%). Correlations between serum HbA1c levels and various variables were evaluated and multivariate logistic regression analyses were carried out to identify variables for PGC. We compared 110 cases to 105 controls men ranging from 44 to 81 years of age, lower PSA concentrations were observed in men with PGC (PGC mean PSA: 0.9 ng/dl, CG mean PSA: 2.1 ng/dl, p < 0.001). Also mean prostate volume was 60% was smaller among men with PGC compared with men with CG (PGC mean prostate volume: 26 ml, CG prostate volume: 43 ml, p < 0.001). A strong negative correlation was found between serum HbA1c levels and serum PSA (p < 0.001 and r = -0.665) concentrations in men with PGC. We also found at the multivariate logistic regression model that PSA, prostate volume and peak systolic velocity were independent predictors of PGC. Our results suggest that there is significant impact of PGC on serum PSA levels in T2DM. Poor glycemic control of type 2 diabetes was associated with lower serum PSA levels and smaller prostate volumes.

  2. Predictors of asthma control in children from different ethnic origins living in Amsterdam.

    PubMed

    van Dellen, Q M; Stronks, K; Bindels, P J E; Ory, F G; Bruil, J; van Aalderen, W M C

    2007-04-01

    To identify factors associated with asthma control in a multi-ethnic paediatric population. We interviewed 278 children with paediatrician diagnosed asthma (aged 7-17 years) and one of their parents. Asthma control was assessed with the Asthma Control Questionnaire (ACQ). Detailed information about sociodemographic variables, asthma medication, knowledge of asthma, inhalation technique and environmental factors were collected. Turkish and Moroccan parents were interviewed in their language of choice. Logistic regression analyses were used to identify correlates of asthma control. Of the 278 children, 85 (30.6%) were Dutch, 84 (30.2%) were Moroccan, 58 (20.9%) were Turkish and 51 (18.3%) were Surinamese. Overall, almost 60% had a status of well-controlled asthma, as indicated by the ACQ. Only 51 of the 142 (35.9%) Moroccan and Turkish parents had a good comprehension of the Dutch language. In logistic regression analyses the risk of having uncontrolled asthma was significantly higher among Surinamese children (OR 2.3; 95% CI 1.06-4.83), respondents with insufficient comprehension of the Dutch language (OR 2.3; 95% CI 1.08-4.78), children using woollen blankets (OR 9.8; 95% CI 1.52-63.42), and significantly lower among male (OR 0.5; 95% CI 0.31-0.91) and non-daily users of inhaled corticosteroids (OR 0.6; 95% CI 0.38-1.07). In conclusion, ethnicity as well as insufficient comprehension of the Dutch language appeared to be independent risk factors for uncontrolled asthma. Special attention should be given to children from immigrants groups for example by calling in an interpreter by physicians when comprehension is insufficient.

  3. Long working hours and skipping breakfast concomitant with late evening meals are associated with suboptimal glycemic control among young male Japanese patients with type 2 diabetes.

    PubMed

    Azami, Yasushi; Funakoshi, Mitsuhiko; Matsumoto, Hisashi; Ikota, Akemi; Ito, Koichi; Okimoto, Hisashi; Shimizu, Nobuaki; Tsujimura, Fumihiro; Fukuda, Hiroshi; Miyagi, Chozi; Osawa, Sayaka; Osawa, Ryo; Miura, Jiro

    2018-04-17

    To assess the associations of working conditions, eating habits and glycemic control among young Japanese workers with type 2 diabetes. This hospital- and clinic-based prospective study included 352 male and 126 female working patients with diabetes aged 20-40 years. Data were obtained from June to July 2012 and June to July 2013. Logistic regression analysis was used to estimate multivariable-adjusted odds ratios (ORs) and 95% confidence intervals (CIs) for suboptimal glycemic control (glycosylated hemoglobin level of ≥7%) obtained from June to July 2013. Multivariable logistic regression analysis showed that disease duration of ≥10 years (OR 2.43, 95% CI 1.02-5.80), glycosylated hemoglobin level of ≥7% in 2012 (OR 8.50, 95% CI 4.90-14.80), skipping breakfast and late evening meals (OR 2.50, 95% CI 1.25-5.00) and working ≥60 h/week (OR 2.92, 95% CI 1.16-7.40) were predictive of suboptimal glycemic control in male workers, whereas a glycosylated hemoglobin level of ≥7% in 2012 (OR 17.96, 95% CI 5.93-54.4), oral hyperglycemic agent therapy (OR 12.49, 95% CI 2.75-56.86) and insulin therapy (OR 11.60, 95% CI 2.35-57.63) were predictive of suboptimal glycemic control in female workers. Working ≥60 h/week and habitual skipping breakfast concomitant with late evening meals might affect the ability of young male workers with type 2 diabetes to achieve and maintain glycemic control. © 2018 The Authors. Journal of Diabetes Investigation published by Asian Association for the Study of Diabetes (AASD) and John Wiley & Sons Australia, Ltd.

  4. Biomass Stoves and Lens Opacity and Cataract in Nepalese Women

    PubMed Central

    Pokhrel, Amod K.; Bates, Michael N.; Shrestha, Sachet P.; Bailey, Ian L.; DiMartino, Robert B.; Smith, Kirk R.; Joshi, N. D.

    2014-01-01

    Purpose Cataract is the most prevalent cause of blindness in Nepal. Several epidemiologic studies have associated cataracts with use of biomass cookstoves. These studies, however, have had limitations, including potential control selection bias and limited adjustment for possible confounding. This study, in Pokhara city, in an area of Nepal where biomass cookstoves are widely used without direct venting of the smoke to the outdoors, focuses on pre-clinical measures of opacity, while avoiding selection bias and taking into account comprehensive data on potential confounding factors Methods Using a cross-sectional study design, severity of lenticular damage, judged on the LOCS III scales, was investigated in females (n=143), aged 20-65 years, without previously diagnosed cataract. Linear and logistic regression analyses were used to examine the relationships with stove type and length of use. Clinically significant cataract, used in the logistic regression models, was defined as a LOCS III score > 2. Results Using gas cookstoves as the reference group, logistic regression analysis for nuclear cataract showed the evidence of relationships with stove type: for biomass stoves, the odds ratio (OR) was 2.58 (95% confidence interval [CI]: 1.22-5.46) and, for kerosene stoves, the OR was 5.18 (95% CI: 0.88-30.38). Similar results were found for nuclear color (LOCS III score > 2), but no association was found with cortical cataracts. Supporting a relationship between biomass stoves and nuclear cataract was a trend with years of exposure to biomass cookstoves (p=0.01). Linear regression analyses did not show clear evidence of an association between lenticular damage and stove types. Biomass fuel used for heating was not associated with any form of opacity. Conclusions This study provides support for associations of biomass and kerosene cookstoves with nuclear opacity and change in nuclear color. The novel associations with kerosene cookstove use deserve further investigation. PMID:23400024

  5. Association of LMX1A genetic polymorphisms with susceptibility to congenital scoliosis in Chinese Han population.

    PubMed

    Wu, Nan; Yuan, Suomao; Liu, Jiaqi; Chen, Jun; Fei, Qi; Liu, Sen; Su, Xinlin; Wang, Shengru; Zhang, Jianguo; Li, Shugang; Wang, Yipeng; Qiu, Guixing; Wu, Zhihong

    2014-10-01

    A genetic association study of single nucleotide polymorphisms (SNPs) for the LMX1A gene with congenital scoliosis (CS) in the Chinese Han population. To determine whether LMX1A genetic polymorphisms are associated with susceptibility to CS. CS is a lateral curvature of the spine due to congenital vertebral defects, whose exact genetic cause has not been well established. The LMX1A gene was suggested as a potential human candidate gene for CS. However, no genetic study of LMX1A in CS has ever been reported. We genotyped 13 SNPs of the LMX1A gene in 154 patients with CS and 144 controls with matched sex and age. After conducting the Hardy-Weinberg equilibrium test, the data of 13 SNPs were analyzed by the allelic and genotypic association with logistic regression analysis. Furthermore, the genotype-phenotype association and haplotype association analysis were also performed. The 13 SNPs of the LMX1A gene met Hardy-Weinberg equilibrium in the controls, which was not in the cases. None of the allelic and genotypic frequencies of these SNPs showed significant difference between case and control groups (P > 0.05). However, the genotypic frequencies of rs1354510 and rs16841013 in the LMX1A gene were associated with CS predisposition in the unconditional logistic regression analysis (P = 0.02 and 0.018, respectively). Genotypic frequencies of 3 SNPs at rs6671290, rs1354510, and rs16841013 were found to exhibit significant differences between patients with CS with failure of formation and the healthy controls (P = 0.019, 0.007, and 0.006, respectively). Besides, in the model analysis by using unconditional logistic regression analysis, the optimized model for the 3 genotypic positive SNPs with failure of formation were rs6671290 (codominant; P = 0.025, Akaike information value = 316.6, Bayesian information criterion = 333.9), rs1354510 (overdominant; P = 0.0017, Akaike information value = 312.1, Bayesian information criterion = 325.9), and rsl6841013 (overdominant; P = 0.0016, Akaike information value = 311.1, Bayesian information criterion = 325), respectively. However, the haplotype distributions in the case group were not significantly different from those of the control group in the 3 haplotype blocks. To our knowledge, this is the first study to identify that the SNPs of the LMX1A gene might be associated with the susceptibility to CS and different clinical phenotypes of CS in the Chinese Han population. 4.

  6. A Logistic Regression Analysis of Turkey's 15-Year-Olds' Scoring above the OECD Average on the PISA'09 Reading Assessment

    ERIC Educational Resources Information Center

    Kasapoglu, Koray

    2014-01-01

    This study aims to investigate which factors are associated with Turkey's 15-year-olds' scoring above the OECD average (493) on the PISA'09 reading assessment. Collected from a total of 4,996 15-year-old students from Turkey, data were analyzed by logistic regression analysis in order to model the data of students who were split into two: (1)…

  7. Analyzing thresholds and efficiency with hierarchical Bayesian logistic regression.

    PubMed

    Houpt, Joseph W; Bittner, Jennifer L

    2018-07-01

    Ideal observer analysis is a fundamental tool used widely in vision science for analyzing the efficiency with which a cognitive or perceptual system uses available information. The performance of an ideal observer provides a formal measure of the amount of information in a given experiment. The ratio of human to ideal performance is then used to compute efficiency, a construct that can be directly compared across experimental conditions while controlling for the differences due to the stimuli and/or task specific demands. In previous research using ideal observer analysis, the effects of varying experimental conditions on efficiency have been tested using ANOVAs and pairwise comparisons. In this work, we present a model that combines Bayesian estimates of psychometric functions with hierarchical logistic regression for inference about both unadjusted human performance metrics and efficiencies. Our approach improves upon the existing methods by constraining the statistical analysis using a standard model connecting stimulus intensity to human observer accuracy and by accounting for variability in the estimates of human and ideal observer performance scores. This allows for both individual and group level inferences. Copyright © 2018 Elsevier Ltd. All rights reserved.

  8. Determination of Landslide and Driftwood Potentials by Fixed-wing UAV-Borne RGB and NIR images: A Case Study of Shenmu Area in Taiwan

    NASA Astrophysics Data System (ADS)

    Chen, Su-Chin; Hsiao, Yu-Shen; Chung, Ta-Hsien

    2015-04-01

    This study is aimed at determining the landslide and driftwood potentials at Shenmu area in Taiwan by Unmanned Aerial Vehicle (UAV). High-resolution orthomosaics and digital surface models (DSMs) are both obtained from several UAV practical surveys by using a red-green-blue(RGB) camera and a near-infrared(NIR) one, respectively. Couples of artificial aerial survey targets are used for ground control in photogrammtry. The algorithm for this study is based on Logistic regression. 8 main factors, which are elevations, terrain slopes, terrain aspects, terrain reliefs, terrain roughness, distances to roads, distances to rivers, land utilizations, are taken into consideration in our Logistic regression model. The related results from UAV are compared with those from traditional photogrammetry. Overall, the study is focusing on monitoring the distribution of the areas with high-risk landslide and driftwood potentials in Shenmu area by Fixed-wing UAV-Borne RGB and NIR images. We also further analyze the relationship between forests, landslides, disaster potentials and upper river areas.

  9. Macular microcirculation in hypertensive patients with and without branch retinal vein occlusion.

    PubMed

    Noma, Hidetaka; Funatsu, Hideharu; Sakata, Kumi; Harino, Seiyo; Mimura, Tatsuya; Hori, Sadao

    2009-09-01

    Our purpose was to determine whether a reduction in blood flow velocity (BFV) in the perifoveal capillaries is involved in the pathogenesis of branch retinal vein occlusion (BRVO) in patients with hypertension. Subjects included hypertensive patients with (n = 12) and without (n = 16) BRVO and healthy volunteers (n = 16). Perifoveal BFV was measured by the tracing method using fluorescein angiography and a scanning laser ophthalmoscope. Logistic regression analysis was performed to assess factors that influenced the presence or absence of BRVO. Mean BFV showed a significant decrease across the three groups (healthy controls: 1.49 +/- 0.11 mm/second; hypertensive patients without BRVO: 1.36 +/- 0.12 mm/second; hypertensive patients with BRVO: 1.16 +/- 0.24 mm/second; p(trend) < 0.001). Multivariate logistic regression analysis showed that BFV was a significant risk factor for the presence of BRVO. Perifoveal capillary BFV is reduced in hypertensive patients with and without BRVO. It is possible that a decrease in BFV may be involved in the occurrence of BRVO. Measurement of perifoveal capillary BFV may be useful for investigating the pathogenesis and progression of BRVO.

  10. The effect of the Family Case Management Program on 1996 birth outcomes in Illinois.

    PubMed

    Keeton, Kristie; Saunders, Stephen E; Koltun, David

    2004-03-01

    The purpose of this study was to determine if birth outcomes for Medicaid recipients were improved with participation in the Illinois Family Case Management Program. Health program data files were linked with the 1996 Illinois Vital Records linked birth-death certificate file. Logistic regression was used to characterize the variation in birth outcomes as a function of Family Case Management participation while statistically controlling for measurable factors found to be confounders. Results of the logistic regression analysis show that women who participated in the Family Care Management Program were significantly less likely to give birth to very low birth weight infants (odds ratio [OR] = 0.86, 95% confidence interval [CI] = 0.75, 0.99) and low birth weight infants (OR = 0.83, CI = 0.79, 0.89). For infant mortality, however, the adjusted OR (OR = 0.98, CI = 0.82, 1.17), although under 1, was not statistically significant. These results suggest that the Family Case Management Program may be effective in reducing very low birth weight and low birth weight rates among infants born to low-income women.

  11. Identifying and quantifying secondhand smoke in multiunit homes with tobacco smoke odor complaints

    NASA Astrophysics Data System (ADS)

    Dacunto, Philip J.; Cheng, Kai-Chung; Acevedo-Bolton, Viviana; Klepeis, Neil E.; Repace, James L.; Ott, Wayne R.; Hildemann, Lynn M.

    2013-06-01

    Accurate identification and quantification of the secondhand tobacco smoke (SHS) that drifts between multiunit homes (MUHs) is essential for assessing resident exposure and health risk. We collected 24 gaseous and particle measurements over 6-9 day monitoring periods in five nonsmoking MUHs with reported SHS intrusion problems. Nicotine tracer sampling showed evidence of SHS intrusion in all five homes during the monitoring period; logistic regression and chemical mass balance (CMB) analysis enabled identification and quantification of some of the precise periods of SHS entry. Logistic regression models identified SHS in eight periods when residents complained of SHS odor, and CMB provided estimates of SHS magnitude in six of these eight periods. Both approaches properly identified or apportioned all six cooking periods used as no-SHS controls. Finally, both approaches enabled identification and/or apportionment of suspected SHS in five additional periods when residents did not report smelling smoke. The time resolution of this methodology goes beyond sampling methods involving single tracers (such as nicotine), enabling the precise identification of the magnitude and duration of SHS intrusion, which is essential for accurate assessment of human exposure.

  12. Seroprevalence and Risk Factors of Chlamydia abortus Infection in Tibetan Sheep in Gansu Province, Northwest China

    PubMed Central

    Qin, Si-Yuan; Yin, Ming-Yang; Cong, Wei; Zhou, Dong-Hui; Zhang, Xiao-Xuan; Zhao, Quan; Zhu, Xing-Quan; Zhou, Ji-Zhang; Qian, Ai-Dong

    2014-01-01

    Chlamydia abortus, an important pathogen in a variety of animals, is associated with abortion in sheep. In the present study, 1732 blood samples, collected from Tibetan sheep between June 2013 and April 2014, were examined by the indirect hemagglutination (IHA) test, aiming to evaluate the seroprevalence and risk factors of C. abortus infection in Tibetan sheep. 323 of 1732 (18.65%) samples were seropositive for C. abortus antibodies at the cut-off of 1 : 16. A multivariate logistic regression analysis was used to evaluate the risk factors associated with seroprevalence, which could provide foundation to prevent and control C. abortus infection in Tibetan sheep. Gender of Tibetan sheep was left out of the final model because it is not significant in the logistic regression analysis (P > 0.05). Region, season, and age were considered as major risk factors associated with C. abortus infection in Tibetan sheep. Our study revealed a widespread and high prevalence of C. abortus infection in Tibetan sheep in Gansu province, northwest China, with higher exposure risk in different seasons and ages and distinct geographical distribution. PMID:25401129

  13. Analysis of nonlinear relationships in dual epidemics, and its application to the management of grapevine downy and powdery mildews.

    PubMed

    Savary, Serge; Delbac, Lionel; Rochas, Amélie; Taisant, Guillaume; Willocquet, Laetitia

    2009-08-01

    Dual epidemics are defined as epidemics developing on two or several plant organs in the course of a cropping season. Agricultural pathosystems where such epidemics develop are often very important, because the harvestable part is one of the organs affected. These epidemics also are often difficult to manage, because the linkage between epidemiological components occurring on different organs is poorly understood, and because prediction of the risk toward the harvestable organs is difficult. In the case of downy mildew (DM) and powdery mildew (PM) of grapevine, nonlinear modeling and logistic regression indicated nonlinearity in the foliage-cluster relationships. Nonlinear modeling enabled the parameterization of a transmission coefficient that numerically links the two components, leaves and clusters, in DM and PM epidemics. Logistic regression analysis yielded a series of probabilistic models that enabled predicting preset levels of cluster infection risks based on DM and PM severities on the foliage at successive crop stages. The usefulness of this framework for tactical decision-making for disease control is discussed.

  14. Upgrade Summer Severe Weather Tool

    NASA Technical Reports Server (NTRS)

    Watson, Leela

    2011-01-01

    The goal of this task was to upgrade to the existing severe weather database by adding observations from the 2010 warm season, update the verification dataset with results from the 2010 warm season, use statistical logistic regression analysis on the database and develop a new forecast tool. The AMU analyzed 7 stability parameters that showed the possibility of providing guidance in forecasting severe weather, calculated verification statistics for the Total Threat Score (TTS), and calculated warm season verification statistics for the 2010 season. The AMU also performed statistical logistic regression analysis on the 22-year severe weather database. The results indicated that the logistic regression equation did not show an increase in skill over the previously developed TTS. The equation showed less accuracy than TTS at predicting severe weather, little ability to distinguish between severe and non-severe weather days, and worse standard categorical accuracy measures and skill scores over TTS.

  15. Estimating the Probability of Rare Events Occurring Using a Local Model Averaging.

    PubMed

    Chen, Jin-Hua; Chen, Chun-Shu; Huang, Meng-Fan; Lin, Hung-Chih

    2016-10-01

    In statistical applications, logistic regression is a popular method for analyzing binary data accompanied by explanatory variables. But when one of the two outcomes is rare, the estimation of model parameters has been shown to be severely biased and hence estimating the probability of rare events occurring based on a logistic regression model would be inaccurate. In this article, we focus on estimating the probability of rare events occurring based on logistic regression models. Instead of selecting a best model, we propose a local model averaging procedure based on a data perturbation technique applied to different information criteria to obtain different probability estimates of rare events occurring. Then an approximately unbiased estimator of Kullback-Leibler loss is used to choose the best one among them. We design complete simulations to show the effectiveness of our approach. For illustration, a necrotizing enterocolitis (NEC) data set is analyzed. © 2016 Society for Risk Analysis.

  16. Evaluating the perennial stream using logistic regression in central Taiwan

    NASA Astrophysics Data System (ADS)

    Ruljigaljig, T.; Cheng, Y. S.; Lin, H. I.; Lee, C. H.; Yu, T. T.

    2014-12-01

    This study produces a perennial stream head potential map, based on a logistic regression method with a Geographic Information System (GIS). Perennial stream initiation locations, indicates the location of the groundwater and surface contact, were identified in the study area from field survey. The perennial stream potential map in central Taiwan was constructed using the relationship between perennial stream and their causative factors, such as Catchment area, slope gradient, aspect, elevation, groundwater recharge and precipitation. Here, the field surveys of 272 streams were determined in the study area. The areas under the curve for logistic regression methods were calculated as 0.87. The results illustrate the importance of catchment area and groundwater recharge as key factors within the model. The results obtained from the model within the GIS were then used to produce a map of perennial stream and estimate the location of perennial stream head.

  17. The use of logistic regression to enhance risk assessment and decision making by mental health administrators.

    PubMed

    Menditto, Anthony A; Linhorst, Donald M; Coleman, James C; Beck, Niels C

    2006-04-01

    Development of policies and procedures to contend with the risks presented by elopement, aggression, and suicidal behaviors are long-standing challenges for mental health administrators. Guidance in making such judgments can be obtained through the use of a multivariate statistical technique known as logistic regression. This procedure can be used to develop a predictive equation that is mathematically formulated to use the best combination of predictors, rather than considering just one factor at a time. This paper presents an overview of logistic regression and its utility in mental health administrative decision making. A case example of its application is presented using data on elopements from Missouri's long-term state psychiatric hospitals. Ultimately, the use of statistical prediction analyses tempered with differential qualitative weighting of classification errors can augment decision-making processes in a manner that provides guidance and flexibility while wrestling with the complex problem of risk assessment and decision making.

  18. Classification and regression tree analysis of acute-on-chronic hepatitis B liver failure: Seeing the forest for the trees.

    PubMed

    Shi, K-Q; Zhou, Y-Y; Yan, H-D; Li, H; Wu, F-L; Xie, Y-Y; Braddock, M; Lin, X-Y; Zheng, M-H

    2017-02-01

    At present, there is no ideal model for predicting the short-term outcome of patients with acute-on-chronic hepatitis B liver failure (ACHBLF). This study aimed to establish and validate a prognostic model by using the classification and regression tree (CART) analysis. A total of 1047 patients from two separate medical centres with suspected ACHBLF were screened in the study, which were recognized as derivation cohort and validation cohort, respectively. CART analysis was applied to predict the 3-month mortality of patients with ACHBLF. The accuracy of the CART model was tested using the area under the receiver operating characteristic curve, which was compared with the model for end-stage liver disease (MELD) score and a new logistic regression model. CART analysis identified four variables as prognostic factors of ACHBLF: total bilirubin, age, serum sodium and INR, and three distinct risk groups: low risk (4.2%), intermediate risk (30.2%-53.2%) and high risk (81.4%-96.9%). The new logistic regression model was constructed with four independent factors, including age, total bilirubin, serum sodium and prothrombin activity by multivariate logistic regression analysis. The performances of the CART model (0.896), similar to the logistic regression model (0.914, P=.382), exceeded that of MELD score (0.667, P<.001). The results were confirmed in the validation cohort. We have developed and validated a novel CART model superior to MELD for predicting three-month mortality of patients with ACHBLF. Thus, the CART model could facilitate medical decision-making and provide clinicians with a validated practical bedside tool for ACHBLF risk stratification. © 2016 John Wiley & Sons Ltd.

  19. Identification of immune correlates of protection in Shigella infection by application of machine learning.

    PubMed

    Arevalillo, Jorge M; Sztein, Marcelo B; Kotloff, Karen L; Levine, Myron M; Simon, Jakub K

    2017-10-01

    Immunologic correlates of protection are important in vaccine development because they give insight into mechanisms of protection, assist in the identification of promising vaccine candidates, and serve as endpoints in bridging clinical vaccine studies. Our goal is the development of a methodology to identify immunologic correlates of protection using the Shigella challenge as a model. The proposed methodology utilizes the Random Forests (RF) machine learning algorithm as well as Classification and Regression Trees (CART) to detect immune markers that predict protection, identify interactions between variables, and define optimal cutoffs. Logistic regression modeling is applied to estimate the probability of protection and the confidence interval (CI) for such a probability is computed by bootstrapping the logistic regression models. The results demonstrate that the combination of Classification and Regression Trees and Random Forests complements the standard logistic regression and uncovers subtle immune interactions. Specific levels of immunoglobulin IgG antibody in blood on the day of challenge predicted protection in 75% (95% CI 67-86). Of those subjects that did not have blood IgG at or above a defined threshold, 100% were protected if they had IgA antibody secreting cells above a defined threshold. Comparison with the results obtained by applying only logistic regression modeling with standard Akaike Information Criterion for model selection shows the usefulness of the proposed method. Given the complexity of the immune system, the use of machine learning methods may enhance traditional statistical approaches. When applied together, they offer a novel way to quantify important immune correlates of protection that may help the development of vaccines. Copyright © 2017 Elsevier Inc. All rights reserved.

  20. The quest for conditional independence in prospectivity modeling: weights-of-evidence, boost weights-of-evidence, and logistic regression

    NASA Astrophysics Data System (ADS)

    Schaeben, Helmut; Semmler, Georg

    2016-09-01

    The objective of prospectivity modeling is prediction of the conditional probability of the presence T = 1 or absence T = 0 of a target T given favorable or prohibitive predictors B, or construction of a two classes 0,1 classification of T. A special case of logistic regression called weights-of-evidence (WofE) is geologists' favorite method of prospectivity modeling due to its apparent simplicity. However, the numerical simplicity is deceiving as it is implied by the severe mathematical modeling assumption of joint conditional independence of all predictors given the target. General weights of evidence are explicitly introduced which are as simple to estimate as conventional weights, i.e., by counting, but do not require conditional independence. Complementary to the regression view is the classification view on prospectivity modeling. Boosting is the construction of a strong classifier from a set of weak classifiers. From the regression point of view it is closely related to logistic regression. Boost weights-of-evidence (BoostWofE) was introduced into prospectivity modeling to counterbalance violations of the assumption of conditional independence even though relaxation of modeling assumptions with respect to weak classifiers was not the (initial) purpose of boosting. In the original publication of BoostWofE a fabricated dataset was used to "validate" this approach. Using the same fabricated dataset it is shown that BoostWofE cannot generally compensate lacking conditional independence whatever the consecutively processing order of predictors. Thus the alleged features of BoostWofE are disproved by way of counterexamples, while theoretical findings are confirmed that logistic regression including interaction terms can exactly compensate violations of joint conditional independence if the predictors are indicators.

  1. Interpreting Multiple Logistic Regression Coefficients in Prospective Observational Studies

    DTIC Science & Technology

    1982-11-01

    TG HDL -C Males T-C 50-80 MRW pɘ.05 pɘ.10 1HDL-C = high density lipoprotein cholesterol MRW...consider a more complete analy- sis, attempting to uncover the relationship between CHD and TG controlling for covariables such a high density ...for T-C can be re- duced, when among older individuals, elevated T-C may increase the capacity to carry cholesterol in the high density lipoprotein

  2. Blood Based Biomarkers of Early Onset Breast Cancer

    DTIC Science & Technology

    2016-12-01

    discretizes the data, and also using logistic elastic net – a form of linear regression - we were unable to build a classifier that could accurately...classifier for differentiating cases from controls off discretized data. The first pass analysis demonstrated a 35 gene signature that differentiated...to the discretized data for mRNA gene signature, the samples used to “train” were also included in the final samples used to “test” the algorithm

  3. Optimization of Game Formats in U-10 Soccer Using Logistic Regression Analysis

    PubMed Central

    Amatria, Mario; Arana, Javier; Anguera, M. Teresa; Garzón, Belén

    2016-01-01

    Abstract Small-sided games provide young soccer players with better opportunities to develop their skills and progress as individual and team players. There is, however, little evidence on the effectiveness of different game formats in different age groups, and furthermore, these formats can vary between and even within countries. The Royal Spanish Soccer Association replaced the traditional grassroots 7-a-side format (F-7) with the 8-a-side format (F-8) in the 2011-12 season and the country’s regional federations gradually followed suit. The aim of this observational methodology study was to investigate which of these formats best suited the learning needs of U-10 players transitioning from 5-aside futsal. We built a multiple logistic regression model to predict the success of offensive moves depending on the game format and the area of the pitch in which the move was initiated. Success was defined as a shot at the goal. We also built two simple logistic regression models to evaluate how the game format influenced the acquisition of technicaltactical skills. It was found that the probability of a shot at the goal was higher in F-7 than in F-8 for moves initiated in the Creation Sector-Own Half (0.08 vs 0.07) and the Creation Sector-Opponent's Half (0.18 vs 0.16). The probability was the same (0.04) in the Safety Sector. Children also had more opportunities to control the ball and pass or take a shot in the F-7 format (0.24 vs 0.20), and these were also more likely to be successful in this format (0.28 vs 0.19). PMID:28031768

  4. Resident Self-Assessment and Learning Goal Development: Evaluation of Resident-Reported Competence and Future Goals.

    PubMed

    Li, Su-Ting T; Paterniti, Debora A; Tancredi, Daniel J; Burke, Ann E; Trimm, R Franklin; Guillot, Ann; Guralnick, Susan; Mahan, John D

    2015-01-01

    To determine incidence of learning goals by competency area and to assess which goals fall into competency areas with lower self-assessment scores. Cross-sectional analysis of existing deidentified American Academy of Pediatrics' PediaLink individualized learning plan data for the academic year 2009-2010. Residents self-assessed competencies in the 6 Accreditation Council for Graduate Medical Education (ACGME) competency areas and wrote learning goals. Textual responses for goals were mapped to 6 ACGME competency areas, future practice, or personal attributes. Adjusted mean differences and associations were estimated using multiple linear and logistic regression. A total of 2254 residents reported 6078 goals. Residents self-assessed their systems-based practice (51.8) and medical knowledge (53.0) competencies lowest and professionalism (68.9) and interpersonal and communication skills (62.2) highest. Residents were most likely to identify goals involving medical knowledge (70.5%) and patient care (50.5%) and least likely to write goals on systems-based practice (11.0%) and professionalism (6.9%). In logistic regression analysis adjusting for postgraduate year (PGY), gender, and degree type (MD/DO), resident-reported goal area showed no association with the learner's relative self-assessment score for that competency area. In the conditional logistic regression analysis, with each learner serving as his or her own control, senior residents (PGY2/3+s) who rated themselves relatively lower in a competency area were more likely to write a learning goal in that area than were PGY1s. Senior residents appear to develop better skills and/or motivation to explicitly turn self-assessed learning gaps into learning goals, suggesting that individualized learning plans may help improve self-regulated learning during residency. Copyright © 2015 Academic Pediatric Association. Published by Elsevier Inc. All rights reserved.

  5. Association between family structure, maternal education level, and maternal employment with sedentary lifestyle in primary school-age children.

    PubMed

    Vázquez-Nava, Francisco; Treviño-Garcia-Manzo, Norberto; Vázquez-Rodríguez, Carlos F; Vázquez-Rodríguez, Eliza M

    2013-01-01

    To determine the association between family structure, maternal education level, and maternal employment with sedentary lifestyle in primary school-age children. Data were obtained from 897 children aged 6 to 12 years. A questionnaire was used to collect information. Body mass index (BMI) was determined using the age- and gender-specific Centers for Disease Control and Prevention definition. Children were categorized as: normal weight (5(th) percentile≤BMI<85(th) percentile), at risk for overweight (85(th)≤BMI<95(th) percentile), overweight (≥ 95(th) percentile). For the analysis, overweight was defined as BMI at or above the 85(th) percentile for each gender. Adjusted odds ratios (adjusted ORs) for physical inactivity were determined using a logistic regression model. The prevalence of overweight was 40.7%, and of sedentary lifestyle, 57.2%. The percentage of non-intact families was 23.5%. Approximately 48.7% of the mothers had a non-acceptable educational level, and 38.8% of the mothers worked outside of the home. The logistic regression model showed that living in a non-intact family household (adjusted OR=1.67; 95% CI=1.04-2.66) is associated with sedentary lifestyle in overweight children. In the group of normal weight children, logistic regression analysis show that living in a non-intact family, having a mother with a non-acceptable education level, and having a mother who works outside of the home were not associated with sedentary lifestyle. Living in a non-intact family, more than low maternal educational level and having a working mother, appears to be associated with sedentary lifestyle in overweight primary school-age children. Copyright © 2013 Sociedade Brasileira de Pediatria. Published by Elsevier Editora Ltda. All rights reserved.

  6. Logistic regression analysis of the risk factors of anastomotic fistula after radical resection of esophageal-cardiac cancer.

    PubMed

    Huang, Jinxi; Zhou, Yi; Wang, Chenghu; Yuan, Weiwei; Zhang, Zhandong; Chen, Beibei; Zhang, Xiefu

    2017-11-01

    This study was conducted to investigate the risk factors of anastomotic fistula after the radical resection of esophageal-cardiac cancer. Five hundred and forty-four esophageal-cardiac cancer patients who underwent surgery and had complete clinical data were included in the study. Fifty patients diagnosed with postoperative anastomotic fistula were considered the case group and the remaining 494 subjects who did not develop postoperative anastomotic fistula were considered the control. The potential risk factors for anastomotic fistula, such as age, gender, diabetes history, smoking history, were collected and compared between the groups. Statistically significant variables were substituted into logistic regression to further evaluate the independent risk factors for postoperative anastomotic fistulas in esophageal-cardiac cancer. The incidence of anastomotic fistulas was 9.2% (50/544). Logistic regression analysis revealed that female gender (P < 0.05), laparoscopic surgery (P < 0.05), decreased postoperative albumin (P < 0.05), and postoperative renal dysfunction (P < 0.05) were independent risk factors for anastomotic fistulas in patients who received surgery for esophageal-cardiac cancer. Of the 50 anastomotic fistulas, 16 cases were small fistulas, which were only discovered by conventional imaging examination and not presenting clinical symptoms. All of the anastomotic fistulas occurred within seven days after surgery. Five of the patients with anastomotic fistulas underwent a second surgery and three died. Female patients with esophageal-cardiac cancer treated with endoscopic surgery and suffering from postoperative hypoproteinemia and renal dysfunction were susceptible to postoperative anastomotic fistula. © 2017 The Authors. Thoracic Cancer published by China Lung Oncology Group and John Wiley & Sons Australia, Ltd.

  7. Hip fracture in the elderly: a re-analysis of the EPIDOS study with causal Bayesian networks.

    PubMed

    Caillet, Pascal; Klemm, Sarah; Ducher, Michel; Aussem, Alexandre; Schott, Anne-Marie

    2015-01-01

    Hip fractures commonly result in permanent disability, institutionalization or death in elderly. Existing hip-fracture predicting tools are underused in clinical practice, partly due to their lack of intuitive interpretation. By use of a graphical layer, Bayesian network models could increase the attractiveness of fracture prediction tools. Our aim was to study the potential contribution of a causal Bayesian network in this clinical setting. A logistic regression was performed as a standard control approach to check the robustness of the causal Bayesian network approach. EPIDOS is a multicenter study, conducted in an ambulatory care setting in five French cities between 1992 and 1996 and updated in 2010. The study included 7598 women aged 75 years or older, in which fractures were assessed quarterly during 4 years. A causal Bayesian network and a logistic regression were performed on EPIDOS data to describe major variables involved in hip fractures occurrences. Both models had similar association estimations and predictive performances. They detected gait speed and mineral bone density as variables the most involved in the fracture process. The causal Bayesian network showed that gait speed and bone mineral density were directly connected to fracture and seem to mediate the influence of all the other variables included in our model. The logistic regression approach detected multiple interactions involving psychotropic drug use, age and bone mineral density. Both approaches retrieved similar variables as predictors of hip fractures. However, Bayesian network highlighted the whole web of relation between the variables involved in the analysis, suggesting a possible mechanism leading to hip fracture. According to the latter results, intervention focusing concomitantly on gait speed and bone mineral density may be necessary for an optimal prevention of hip fracture occurrence in elderly people.

  8. Transformation of Summary Statistics from Linear Mixed Model Association on All-or-None Traits to Odds Ratio.

    PubMed

    Lloyd-Jones, Luke R; Robinson, Matthew R; Yang, Jian; Visscher, Peter M

    2018-04-01

    Genome-wide association studies (GWAS) have identified thousands of loci that are robustly associated with complex diseases. The use of linear mixed model (LMM) methodology for GWAS is becoming more prevalent due to its ability to control for population structure and cryptic relatedness and to increase power. The odds ratio (OR) is a common measure of the association of a disease with an exposure ( e.g. , a genetic variant) and is readably available from logistic regression. However, when the LMM is applied to all-or-none traits it provides estimates of genetic effects on the observed 0-1 scale, a different scale to that in logistic regression. This limits the comparability of results across studies, for example in a meta-analysis, and makes the interpretation of the magnitude of an effect from an LMM GWAS difficult. In this study, we derived transformations from the genetic effects estimated under the LMM to the OR that only rely on summary statistics. To test the proposed transformations, we used real genotypes from two large, publicly available data sets to simulate all-or-none phenotypes for a set of scenarios that differ in underlying model, disease prevalence, and heritability. Furthermore, we applied these transformations to GWAS summary statistics for type 2 diabetes generated from 108,042 individuals in the UK Biobank. In both simulation and real-data application, we observed very high concordance between the transformed OR from the LMM and either the simulated truth or estimates from logistic regression. The transformations derived and validated in this study improve the comparability of results from prospective and already performed LMM GWAS on complex diseases by providing a reliable transformation to a common comparative scale for the genetic effects. Copyright © 2018 by the Genetics Society of America.

  9. Impulsivity, Attention, Memory, and Decision-Making among Adolescent Marijuana Users

    PubMed Central

    Dougherty, Donald M.; Mathias, Charles W.; Dawes, Michael A.; Furr, R. Michael; Charles, Nora E.; Liguori, Anthony; Shannon, Erin E.; Acheson, Ashley

    2012-01-01

    Rationale Marijuana is a popular drug of abuse among adolescents, and they may be uniquely vulnerable to resulting cognitive and behavioral impairments. Previous studies have found impairments among adolescent marijuana users. However, the majority of this research has examined measures individually rather than multiple domains in a single cohesive analysis. This study used a logistic regression model that combines performance on a range of tasks to identify which measures were most altered among adolescent marijuana users. Objectives The purpose of this research was to determine unique associations between adolescent marijuana user and performances on multiple cognitive and behavioral domains (attention, memory, decision-making, and impulsivity) in 14- to 17-year-olds while simultaneously controlling for performances across the measures to determine which measures most strongly distinguish marijuana users from non-users. Methods Marijuana-using adolescents (n=45) and controls (n=48) were tested. Logistic regression analyses were conducted to test for: (a) differences between marijuana users and non-users on each measure, (b) associations between marijuana use and each measure after controlling for the other measures, and (c) the degree to which (a) and (b) together elucidated differences among marijuana users and non-users. Results Of all the cognitive and behavioral domains tested, impaired short-term recall memory and consequence sensitivity impulsivity were associated with marijuana use after controlling for performances across all measures. Conclusions This study extends previous findings by identifying cognitive and behavioral impairments most strongly associated with adolescent marijuana users. These specific deficits are potential targets of intervention for this at-risk population. PMID:23138434

  10. [Risk factors for the kidney stones: a hospital-based case-control study in a distric hospital in Beijing].

    PubMed

    Wang, Jiao; Luo, Gong-tang; Niu, Wei-jing; Gong, Man-man; Liu, Lu; Zhou, Jie; Zhou, Xue-wei; He, Li-hua

    2013-12-18

    To explore the risk and protective factors of kidney calculi in order to put forward theoretical basis for preventive and control measures. A 1:1 matched case-control study was performed using data from a hospital in Beijing. The case group included 100 inpatients who were diagnosed kidney calculi using B ultrasonic, X-ray and intravenous pyelography during the survey while other 100 urolithiasis and endocrine disease excluded inpatients who shared the same sex, within five years gap to the case group inpatients were for the control group. A face-to-face survey was conducted with self-made questionnaires which covered demographic characteristics, water issues, dietary habits, genetic and medical history. Epidata 3.0 was used to build the database and SPSS 19.0 for the statistical analysis. In the univariate Logistic regression analysis, ten variables were found showing statistical significance. For the multivariate Logistic regression analysis, variables left in the model were labor intensity (OR=0.622, 95%CI: 0.435-0.889), preferring to drink after dinner (OR=0.316, 95%CI: 0.122-0.815), loving drinking (OR=0.232, 95%CI: 0.084-0.642), drinking tea regularly (OR=1.463, 95%CI: 1.033-2.071), eating more vegetables (OR=0.571, 95%CI: 0.328-0.993), the history of the urolithiasis (OR=2.127, 95%CI: 1.065-90.145). Drinking tea regularly, urolithiasis history and brain work are the risk factors of kidney calculi while loving drinking and eating more vegetables for the protection.

  11. PubMed Central

    GUZZO, A.S.; MEGGIOLARO, A.; MANNOCCI, A.; TECCA, M.; SALOMONE, I.

    2015-01-01

    Summary Introduction. "Umberto I" Teaching Hospital adopted 'Conley scale' as internal procedure for fall risk assessment, with the aim of strengthening surveillance and improving prevention and management of impatient falls. Materials and methods. Case-control study was performed. Fall events from 1st March 2012 to 30th September 2013 were considered. Cases have been matched for gender, department and period of hospitalization with two or three controls when it is possible. A table including intrinsic and extrinsic 'fall risk' factors, not foreseen by Conley Scale, and setted up after a literature overview was built. Univariate analysis and conditional logistic regression model have been performed. Results. 50 cases and 102 controls were included. Adverse event 'fall' were associated with filled Conley scale at the admission to care unit (OR = 4.92, 95%CI = 2.34-10.37). Univariate analysis identified intrinsic factors increasing risk of falls: dizziness (OR = 3.22; 95%CI = 1.34-7.75), psychomotor agitation (OR = 2.61; 95%CI = 1.06-6.43); and use of means of restraint (OR = 5.05 95%CI = 1.77-14.43). Conditional logistic regression model revealed a significant association with the following variables: use of instruments of restraint (HR = 5.54, 95%CI = 1.2- 23.80), dizziness (OR = 3.97, 95%CI = 1.22-12.89). Discussion. Conley Scale must be filled at the access of patient to care unit. There were no significant differences between cases and controls with regard to risk factors provided by Conley, except for the use of means of restraint. Empowerment strategies for Conley compilation are needed. PMID:26789993

  12. Thoracic Inlet Parameters for Degenerative Cervical Spondylolisthesis Imaging Measurement.

    PubMed

    Wang, Quanbing; Wang, Xiao-Tao; Zhu, Lei; Wei, Yu-Xi

    2018-04-05

    BACKGROUND The aim of this study was to explore the diagnostic value of sagittal measurement of thoracic inlet parameters for degenerative cervical spondylolisthesis (DCS). MATERIAL AND METHODS We initially included 65 patients with DCS and the same number of health people as the control group by using cervical radiograph evaluations. We analyzed the x-ray and computer tomographic (CT) data in prone and standing position at the same time. Measurement of cervical sagittal parameters was carried out in a standardized supine position. Multivariate logistic regression analysis was performed to evaluate these parameters as a diagnostic index for DCS. RESULTS There were 60 cases enrolled in the DCS group, and 62 cases included in the control group. The T1 slope and thoracic inlet angle (TIA) were significantly greater for the DCS group compared to the control group (24.33±2.85º versus 19.59±2.04º, p=0.00; 76.11±9.82º versus 72.86±7.31º, p=0.03, respectively). We observed no significant difference for the results of the neck tilt (NT), C2-C7 angle in the control and the DSC group (p>0.05). Logistic regression analysis and receiver operating characteristic (ROC) curve revealed that preoperative T1 slope of more than 22.0º showed significantly diagnostic value for the DCS group (p<0.05). CONCLUSIONS Patients with preoperative sagittal imbalance of thoracic inlet have a statistically significant increased risk of DCS. T1 slope of more than 22.0º showed significantly diagnostic value for the incidence of DCS.

  13. Increased risk of pulmonary tuberculosis among patients with appendectomy in Taiwan.

    PubMed

    Lai, S-W; Lin, C-L; Liao, K-F; Tsai, S-M

    2014-09-01

    The aim of this study was to determine whether there is a relationship between appendectomy and pulmonary tuberculosis in Taiwan. We designed a case-control study by analyzing the database from the Taiwan National Health Insurance Program. In total, we found 11,366 individuals (aged 20 years and older) with newly diagnosed pulmonary tuberculosis as the case group and 45,464 individuals without pulmonary tuberculosis as the control group from 1998 to 2011. The case group and the control group were matched on sex, age, and index year of diagnosing pulmonary tuberculosis. Using the multivariable unconditional logistic regression model, we measured the odds ratio (OR) and 95 % confidence interval (CI) for the risk of pulmonary tuberculosis associated with appendectomy and other comorbidities. After controlling for covariables, the multivariable unconditional logistic regression model disclosed that the OR of pulmonary tuberculosis was 1.4 in appendectomized patients (95 % CI = 1.13, 1.75) when compared to individuals without appendectomy. In further analysis, comorbidity with chronic obstructive pulmonary diseases (OR = 4.63, 95 % CI = 3.21, 6.68), pneumoconiosis (OR = 7.80, 95 % CI = 1.43, 42.5), chronic kidney diseases (OR = 5.65, 95 % CI = 1.79, 17.8), or diabetes mellitus (OR = 2.11, 95 % CI = 1.30, 3.44) increased the risk of pulmonary tuberculosis in appendectomized patients. Individuals with appendectomy are at a 1.4-fold increased risk of pulmonary tuberculosis. Comorbidities, including chronic obstructive pulmonary disease, pneumoconiosis, chronic kidney diseases, and diabetes mellitus, enhance the risk of pulmonary tuberculosis.

  14. [Bone mineral density in overweight and obese adolescents].

    PubMed

    Cobayashi, Fernanda; Lopes, Luiz A; Taddei, José Augusto de A C

    2005-01-01

    To study bone density as a concomitant factor for obesity in post-pubertal adolescents, controlling for other variables that may interfere in such a relation. Study comprising 83 overweight and obese adolescents (BMI > or = P85) and 89 non obese ones (P5 < or = BMI < or = P85). Cases and controls were selected out of 1,420 students (aged 14-19) from a public school in the city of São Paulo. The bone mineral density of the lumbar spine (L2-L4 in g/cm2) was assessed by dual-energy x-ray absorptiometry (LUNARtrade mark DPX-L). The variable bone density was dichotomized using 1.194 g/cm2 as cutoff point. Bivariate analyses were conducted considering the prevalence of overweight and obesity followed by multivariate analysis (logistic regression) according to a hierarchical conceptual model. The prevalence of bone density above the median was twice more frequent among cases (69.3%) than among controls (32.1%). In the bivariate analysis such prevalence resulted in an odds ratio (OR) of 4.78. The logistic regression model showed that the association between obesity and mineral density is yet more intense with an OR of 6.65 after the control of variables related to sedentary lifestyle and intake of milk and dairy products. Obese and overweight adolescents in the final stages of sexual maturity presented higher bone mineral density in relation to their normal-weight counterparts; however, cohort studies will be necessary to evaluate the influence of such characteristic on bone resistance in adulthood and, consequently, on the incidence of osteopenia and osteoporosis at older ages.

  15. Risk of Peripheral Artery Occlusive Disease in Patients with Vertigo, Tinnitus, or Sudden Deafness: A Secondary Case-Control Analysis of a Nationwide, Population-Based Health Claims Database

    PubMed Central

    Hwang, Juen-Haur

    2016-01-01

    Background Cochleovestibular symptoms, such as vertigo, tinnitus, and sudden deafness, are common manifestations of microvascular diseases. However, it is unclear whether these symptoms occurred preceding the diagnosis of peripheral artery occlusive disease (PAOD). Therefore, the aim of this case-control study was to investigate the risk of PAOD among patients with vertigo, tinnitus, and sudden deafness using a nationwide, population-based health claim database in Taiwan. Methods We identified 5,340 adult patients with PAOD diagnosed between January 1, 2006 and December 31, 2010 and 16,020 controls, frequency matched on age interval, sex, and year of index date, from the Taiwan National Health Insurance Research Database. Risks of PAOD in patients with vertigo, tinnitus, or sudden deafness were separately evaluated with multivariate logistic regression analyses. Results Of the 5,340 patients with PAOD, 12.7%, 6.7%, and 0.3% were diagnosed with vertigo, tinnitus, and sudden deafness, respectively. In the controls, 10.6%, 6.1%, and 0.3% were diagnosed with vertigo (P < 0.001), tinnitus (P = 0.161), and sudden deafness (P = 0.774), respectively. Results from the multivariate logistic regression analyses showed that the risk of PAOD was significantly increased in patients with vertigo (adjusted odds ratio = 1.12, P = 0.027) but not in those with tinnitus or sudden deafness. Conclusions A modest increase in the risk of PAOD was observed among Taiwanese patients with vertigo, after adjustment for comorbidities. PMID:27631630

  16. Alteration in plasma free amino acid levels and its association with gout.

    PubMed

    Mahbub, M H; Yamaguchi, Natsu; Takahashi, Hidekazu; Hase, Ryosuke; Amano, Hiroki; Kobayashi-Miura, Mikiko; Kanda, Hideyuki; Fujita, Yasuyuki; Yamamoto, Hiroshi; Yamamoto, Mai; Kikuchi, Shinya; Ikeda, Atsuko; Kageyama, Naoko; Nakamura, Mina; Ishimaru, Yasutaka; Sunagawa, Hiroshi; Tanabe, Tsuyoshi

    2017-03-16

    Studies on the association of plasma-free amino acids with gout are very limited and produced conflicting results. Therefore, we sought to explore and characterize the plasma-free amino acid (PFAA) profile in patients with gout and evaluate its association with the latter. Data from a total of 819 subjects (including 34 patients with gout) undergoing an annual health examination program in Shimane, Japan were considered for this study. Venous blood samples were collected from the subjects and concentrations of 19 plasma amino acids were determined by high-performance liquid chromatography-electrospray ionization-mass spectrometry. Student's t-test was applied for comparison of variables between patient and control groups. The relationships between the presence or absence of gout and individual amino acids were investigated by logistic regression analysis controlling for the effects of potential demographic confounders. Among 19 amino acids, the levels of 10 amino acids (alanine, glycine, isoleucine, leucine, methionine, phenylalanine, proline, serine, tryptophan, valine) differed significantly (P < .001 to .05) between the patient and control groups. Univariate logistic regression analysis revealed that plasma levels of alanine, isoleucine, leucine, phenylalanine, tryptophan and valine had significant positive associations (P < .005 to .05) whereas glycine and serine had significant inverse association (P < .05) with gout. The observed significant changes in PFAA profiles may have important implications for improving our understanding of pathophysiology, diagnosis and prevention of gout. The findings of this study need further confirmation in future large-scale studies involving a larger number of patients with gout.

  17. Effects of edaravone on early outcomes in acute ischemic stroke patients treated with recombinant tissue plasminogen activator.

    PubMed

    Wada, Tomoki; Yasunaga, Hideo; Inokuchi, Ryota; Horiguchi, Hiromasa; Fushimi, Kiyohide; Matsubara, Takehiro; Nakajima, Susumu; Yahagi, Naoki

    2014-10-15

    We investigated whether edaravone could improve early outcomes in acute ischemic stroke patients treated with recombinant tissue plasminogen activator (rtPA). We conducted a retrospective cohort study using the Japanese Diagnosis Procedure Combination database. We identified patients admitted with a primary diagnosis of ischemic stroke from 1 July 2010 to 31 March 2012 and treated with rtPA on the same day of stroke onset or the following day. Thereafter, we selected those who received edaravone on the same day of rtPA administration (edaravone group), and those who received rtPA without edaravone (control group). The primary outcomes were modified Rankin Scale (mRS) scores at discharge. One-to-one propensity-score matching was performed between the edaravone and control groups. An ordinal logistic regression analysis for mRS scores at discharge was performed with adjustment for possible variables as well as clustering of patients within hospitals using a generalized estimating equation. We identified 6336 eligible patients for inclusion in the edaravone group (n=5979; 94%) and the control group (n=357; 6%) as the total population. In 356 pairs of the propensity-matched population, the ordinal logistic regression analysis showed that edaravone was significantly associated with lower mRS scores of patients at discharge (adjusted odds ratio: 0.74; 95% confidence interval: 0.57-0.96). Edaravone may improve early outcomes in acute ischemic stroke patients treated with rtPA. Copyright © 2014 Elsevier B.V. All rights reserved.

  18. The status of diabetes control in Kurdistan province, west of Iran.

    PubMed

    Esmailnasab, Nader; Afkhamzadeh, Abdorrahim; Roshani, Daem; Moradi, Ghobad

    2013-09-17

    Based on some estimation more than two million peoples in Iran are affected by Type 2 diabetes. The present study was designed to evaluate the status of diabetes control among Type 2 diabetes patients in Kurdistan, west of Iran and its associated factors. In our cross sectional study conducted in 2010, 411 Type 2 diabetes patients were randomly recruited from Sanandaj, Capital of Kurdistan. Chi square test was used in univariate analysis to address the association between HgAlc and FBS status and other variables. The significant results from Univariate analysis were entered in multivariate analysis and multinomial logistic regression model. In 38% of patients, FBS was in normal range (70-130) and in 47% HgA1c was <7% which is normal range for HgA1c. In univariate analysis, FBS level was associated with educational levels (P=0.001), referral style (P=0.001), referral time (P=0.009), and insulin injection (P=0.016). In addition, HgA1c had a relationship with sex (P=0.023), age (P=0.035), education (P=0.001), referral style (P=0.001), and insulin injection (P=0.008). After using multinomial logistic regression for significant results of univariate analysis, it was found that FBS was significantly associated with referral style. In addition HgA1c was significantly associated with referral style and Insulin injection. Although some of patients were under the coverage of specialized cares, but their diabetes were not properly controlled.

  19. Education, Employment, Income, and Marital Status Among Adults Diagnosed With Inflammatory Bowel Diseases During Childhood or Adolescence.

    PubMed

    El-Matary, Wael; Dufault, Brenden; Moroz, Stan P; Schellenberg, Jeannine; Bernstein, Charles N

    2017-04-01

    We aimed to assess levels of education attained, employment, and marital status of adults diagnosed with inflammatory bowel diseases (IBD) during childhood or adolescence, compared with healthy individuals in Canada. We performed a cross-sectional study of adults diagnosed with IBD in childhood or adolescence at Children's Hospital in Winnipeg, Manitoba from January 1978 through December 2007. Participants (n = 112) answered a semi-structured questionnaire on educational achievements, employment, and marital status. Patients were matched for age and sex with random healthy individuals from the 2012 Canadian Community Health Survey (controls, 5 per patient). Conditional binary logistic regression and random-effects ordinal logistic regression models were used for analysis. Patients were followed for a mean duration of 14.3 years (range, 3.1-34.5 years). Persons with IBD were more likely to earn more money per annum and attain a post-secondary school degree or receive a diploma than controls (odds ratio, 1.72; 95% confidence interval, 1.13-2.60; P < .01 and odds ratio, 2.73; 95% confidence interval, 1.48-5.04; P < .01, respectively). There was no significant difference between patients and controls in employment or marital status. Adults diagnosed with IBD during childhood seem to achieve higher education levels than individuals without IBD. This observation should provide reassurance to children with IBD and their parents. ClinicalTrials.gov number: NCT02152241. Copyright © 2017 AGA Institute. Published by Elsevier Inc. All rights reserved.

  20. Perioperative factors associated with pressure ulcer development after major surgery.

    PubMed

    Kim, Jeong Min; Lee, Hyunjeong; Ha, Taehoon; Na, Sungwon

    2018-02-01

    Postoperative pressure ulcers are important indicators of perioperative care quality, and are serious and expensive complications during critical care. This study aimed to identify perioperative risk factors for postoperative pressure ulcers. This retrospective case-control study evaluated 2,498 patients who underwent major surgery. Forty-three patients developed postoperative pressure ulcers and were matched to 86 control patients based on age, sex, surgery, and comorbidities. The pressure ulcer group had lower baseline hemoglobin and albumin levels, compared to the control group. The pressure ulcer group also had higher values for lactate levels, blood loss, and number of packed red blood cell ( p RBC) units. Univariate analysis revealed that pressure ulcer development was associated with preoperative hemoglobin levels, albumin levels, lactate levels, intraoperative blood loss, number of p RBC units, Acute Physiologic and Chronic Health Evaluation II score, Braden scale score, postoperative ventilator care, and patient restraint. In the multiple logistic regression analysis, only preoperative low albumin levels (odds ratio [OR]: 0.21, 95% CI: 0.05-0.82; P < 0.05) and high lactate levels (OR: 1.70, 95% CI: 1.07-2.71; P < 0.05) were independently associated with pressure ulcer development. A receiver operating characteristic curve was used to assess the predictive power of the logistic regression model, and the area under the curve was 0.88 (95% CI: 0.79-0.97; P < 0.001). The present study revealed that preoperative low albumin levels and high lactate levels were significantly associated with pressure ulcer development after surgery.

  1. Modeling the dynamics of urban growth using multinomial logistic regression: a case study of Jiayu County, Hubei Province, China

    NASA Astrophysics Data System (ADS)

    Nong, Yu; Du, Qingyun; Wang, Kun; Miao, Lei; Zhang, Weiwei

    2008-10-01

    Urban growth modeling, one of the most important aspects of land use and land cover change study, has attracted substantial attention because it helps to comprehend the mechanisms of land use change thus helps relevant policies made. This study applied multinomial logistic regression to model urban growth in the Jiayu county of Hubei province, China to discover the relationship between urban growth and the driving forces of which biophysical and social-economic factors are selected as independent variables. This type of regression is similar to binary logistic regression, but it is more general because the dependent variable is not restricted to two categories, as those previous studies did. The multinomial one can simulate the process of multiple land use competition between urban land, bare land, cultivated land and orchard land. Taking the land use type of Urban as reference category, parameters could be estimated with odds ratio. A probability map is generated from the model to predict where urban growth will occur as a result of the computation.

  2. A large-scale assessment of two-way SNP interactions in breast cancer susceptibility using 46 450 cases and 42 461 controls from the breast cancer association consortium

    PubMed Central

    Milne, Roger L.; Herranz, Jesús; Michailidou, Kyriaki; Dennis, Joe; Tyrer, Jonathan P.; Zamora, M. Pilar; Arias-Perez, José Ignacio; González-Neira, Anna; Pita, Guillermo; Alonso, M. Rosario; Wang, Qin; Bolla, Manjeet K.; Czene, Kamila; Eriksson, Mikael; Humphreys, Keith; Darabi, Hatef; Li, Jingmei; Anton-Culver, Hoda; Neuhausen, Susan L.; Ziogas, Argyrios; Clarke, Christina A.; Hopper, John L.; Dite, Gillian S.; Apicella, Carmel; Southey, Melissa C.; Chenevix-Trench, Georgia; Swerdlow, Anthony; Ashworth, Alan; Orr, Nicholas; Schoemaker, Minouk; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Andrulis, Irene L.; Knight, Julia A.; Glendon, Gord; Mulligan, Anna Marie; Bojesen, Stig E.; Nordestgaard, Børge G.; Flyger, Henrik; Nevanlinna, Heli; Muranen, Taru A.; Aittomäki, Kristiina; Blomqvist, Carl; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Wang, Xianshu; Olson, Janet E.; Vachon, Celine; Purrington, Kristen; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Dunning, Alison M.; Shah, Mitul; Guénel, Pascal; Truong, Thérèse; Sanchez, Marie; Mulot, Claire; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Lindblom, Annika; Margolin, Sara; Hooning, Maartje J.; Hollestelle, Antoinette; Collée, J. Margriet; Jager, Agnes; Cox, Angela; Brock, Ian W.; Reed, Malcolm W.R.; Devilee, Peter; Tollenaar, Robert A.E.M.; Seynaeve, Caroline; Haiman, Christopher A.; Henderson, Brian E.; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Dumont, Martine; Soucy, Penny; Dörk, Thilo; Bogdanova, Natalia V.; Hamann, Ute; Försti, Asta; Rüdiger, Thomas; Ulmer, Hans-Ulrich; Fasching, Peter A.; Häberle, Lothar; Ekici, Arif B.; Beckmann, Matthias W.; Fletcher, Olivia; Johnson, Nichola; dos Santos Silva, Isabel; Peto, Julian; Radice, Paolo; Peterlongo, Paolo; Peissel, Bernard; Mariani, Paolo; Giles, Graham G.; Severi, Gianluca; Baglietto, Laura; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael; Miller, Nicola; Marme, Federik; Burwinkel, Barbara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M.; Lambrechts, Diether; Yesilyurt, Betul T.; Floris, Giuseppe; Leunen, Karin; Alnæs, Grethe Grenaker; Kristensen, Vessela; Børresen-Dale, Anne-Lise; García-Closas, Montserrat; Chanock, Stephen J.; Lissowska, Jolanta; Figueroa, Jonine D.; Schmidt, Marjanka K.; Broeks, Annegien; Verhoef, Senno; Rutgers, Emiel J.; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Couch, Fergus J.; Toland, Amanda E.; Yannoukakos, Drakoulis; Pharoah, Paul D.P.; Hall, Per; Benítez, Javier; Malats, Núria; Easton, Douglas F.

    2014-01-01

    Part of the substantial unexplained familial aggregation of breast cancer may be due to interactions between common variants, but few studies have had adequate statistical power to detect interactions of realistic magnitude. We aimed to assess all two-way interactions in breast cancer susceptibility between 70 917 single nucleotide polymorphisms (SNPs) selected primarily based on prior evidence of a marginal effect. Thirty-eight international studies contributed data for 46 450 breast cancer cases and 42 461 controls of European origin as part of a multi-consortium project (COGS). First, SNPs were preselected based on evidence (P < 0.01) of a per-allele main effect, and all two-way combinations of those were evaluated by a per-allele (1 d.f.) test for interaction using logistic regression. Second, all 2.5 billion possible two-SNP combinations were evaluated using Boolean operation-based screening and testing, and SNP pairs with the strongest evidence of interaction (P < 10−4) were selected for more careful assessment by logistic regression. Under the first approach, 3277 SNPs were preselected, but an evaluation of all possible two-SNP combinations (1 d.f.) identified no interactions at P < 10−8. Results from the second analytic approach were consistent with those from the first (P > 10−10). In summary, we observed little evidence of two-way SNP interactions in breast cancer susceptibility, despite the large number of SNPs with potential marginal effects considered and the very large sample size. This finding may have important implications for risk prediction, simplifying the modelling required. Further comprehensive, large-scale genome-wide interaction studies may identify novel interacting loci if the inherent logistic and computational challenges can be overcome. PMID:24242184

  3. A large-scale assessment of two-way SNP interactions in breast cancer susceptibility using 46,450 cases and 42,461 controls from the breast cancer association consortium.

    PubMed

    Milne, Roger L; Herranz, Jesús; Michailidou, Kyriaki; Dennis, Joe; Tyrer, Jonathan P; Zamora, M Pilar; Arias-Perez, José Ignacio; González-Neira, Anna; Pita, Guillermo; Alonso, M Rosario; Wang, Qin; Bolla, Manjeet K; Czene, Kamila; Eriksson, Mikael; Humphreys, Keith; Darabi, Hatef; Li, Jingmei; Anton-Culver, Hoda; Neuhausen, Susan L; Ziogas, Argyrios; Clarke, Christina A; Hopper, John L; Dite, Gillian S; Apicella, Carmel; Southey, Melissa C; Chenevix-Trench, Georgia; Swerdlow, Anthony; Ashworth, Alan; Orr, Nicholas; Schoemaker, Minouk; Jakubowska, Anna; Lubinski, Jan; Jaworska-Bieniek, Katarzyna; Durda, Katarzyna; Andrulis, Irene L; Knight, Julia A; Glendon, Gord; Mulligan, Anna Marie; Bojesen, Stig E; Nordestgaard, Børge G; Flyger, Henrik; Nevanlinna, Heli; Muranen, Taru A; Aittomäki, Kristiina; Blomqvist, Carl; Chang-Claude, Jenny; Rudolph, Anja; Seibold, Petra; Flesch-Janys, Dieter; Wang, Xianshu; Olson, Janet E; Vachon, Celine; Purrington, Kristen; Winqvist, Robert; Pylkäs, Katri; Jukkola-Vuorinen, Arja; Grip, Mervi; Dunning, Alison M; Shah, Mitul; Guénel, Pascal; Truong, Thérèse; Sanchez, Marie; Mulot, Claire; Brenner, Hermann; Dieffenbach, Aida Karina; Arndt, Volker; Stegmaier, Christa; Lindblom, Annika; Margolin, Sara; Hooning, Maartje J; Hollestelle, Antoinette; Collée, J Margriet; Jager, Agnes; Cox, Angela; Brock, Ian W; Reed, Malcolm W R; Devilee, Peter; Tollenaar, Robert A E M; Seynaeve, Caroline; Haiman, Christopher A; Henderson, Brian E; Schumacher, Fredrick; Le Marchand, Loic; Simard, Jacques; Dumont, Martine; Soucy, Penny; Dörk, Thilo; Bogdanova, Natalia V; Hamann, Ute; Försti, Asta; Rüdiger, Thomas; Ulmer, Hans-Ulrich; Fasching, Peter A; Häberle, Lothar; Ekici, Arif B; Beckmann, Matthias W; Fletcher, Olivia; Johnson, Nichola; dos Santos Silva, Isabel; Peto, Julian; Radice, Paolo; Peterlongo, Paolo; Peissel, Bernard; Mariani, Paolo; Giles, Graham G; Severi, Gianluca; Baglietto, Laura; Sawyer, Elinor; Tomlinson, Ian; Kerin, Michael; Miller, Nicola; Marme, Federik; Burwinkel, Barbara; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli-Matti; Hartikainen, Jaana M; Lambrechts, Diether; Yesilyurt, Betul T; Floris, Giuseppe; Leunen, Karin; Alnæs, Grethe Grenaker; Kristensen, Vessela; Børresen-Dale, Anne-Lise; García-Closas, Montserrat; Chanock, Stephen J; Lissowska, Jolanta; Figueroa, Jonine D; Schmidt, Marjanka K; Broeks, Annegien; Verhoef, Senno; Rutgers, Emiel J; Brauch, Hiltrud; Brüning, Thomas; Ko, Yon-Dschun; Couch, Fergus J; Toland, Amanda E; Yannoukakos, Drakoulis; Pharoah, Paul D P; Hall, Per; Benítez, Javier; Malats, Núria; Easton, Douglas F

    2014-04-01

    Part of the substantial unexplained familial aggregation of breast cancer may be due to interactions between common variants, but few studies have had adequate statistical power to detect interactions of realistic magnitude. We aimed to assess all two-way interactions in breast cancer susceptibility between 70,917 single nucleotide polymorphisms (SNPs) selected primarily based on prior evidence of a marginal effect. Thirty-eight international studies contributed data for 46,450 breast cancer cases and 42,461 controls of European origin as part of a multi-consortium project (COGS). First, SNPs were preselected based on evidence (P < 0.01) of a per-allele main effect, and all two-way combinations of those were evaluated by a per-allele (1 d.f.) test for interaction using logistic regression. Second, all 2.5 billion possible two-SNP combinations were evaluated using Boolean operation-based screening and testing, and SNP pairs with the strongest evidence of interaction (P < 10(-4)) were selected for more careful assessment by logistic regression. Under the first approach, 3277 SNPs were preselected, but an evaluation of all possible two-SNP combinations (1 d.f.) identified no interactions at P < 10(-8). Results from the second analytic approach were consistent with those from the first (P > 10(-10)). In summary, we observed little evidence of two-way SNP interactions in breast cancer susceptibility, despite the large number of SNPs with potential marginal effects considered and the very large sample size. This finding may have important implications for risk prediction, simplifying the modelling required. Further comprehensive, large-scale genome-wide interaction studies may identify novel interacting loci if the inherent logistic and computational challenges can be overcome.

  4. [Associated factors in newborns with intrauterine growth retardation].

    PubMed

    Thompson-Chagoyán, Oscar C; Vega-Franco, Leopoldo

    2008-01-01

    To identify the risk factors implicated in the intrauterine growth retardation (IUGR) of neonates born in a social security institution. Case controls design study in 376 neonates: 188 with IUGR (weight < 10 percentile) and 188 without IUGR. When they born, information about 30 variables of risk for IUGR were obtained from mothers. Risk analysis and logistical regression (stepwise) were used. Odds ratios were significant for 12 of the variables. The model obtains by stepwise regression included: weight gain at pregnancy, prenatal care attendance, toxemia, chocolate ingestion, father's weight, and the environmental house. Must of the variables included in the model are related to socioeconomic disadvantages related to the risk of RCIU in the population.

  5. The Persistence of the Gender Gap in Introductory Physics

    NASA Astrophysics Data System (ADS)

    Kost, Lauren E.; Pollock, Steven J.; Finkelstein, Noah D.

    2008-10-01

    We previously showed[l] that despite teaching with interactive engagement techniques, the gap in performance between males and females on conceptual learning surveys persisted from pre- to posttest, at our institution. Such findings were counter to previously published work[2]. Our current work analyzes factors that may influence the observed gender gap in our courses. Posttest conceptual assessment data are modeled using both multiple regression and logistic regression analyses to estimate the gender gap in posttest scores after controlling for background factors that vary by gender. We find that at our institution the gender gap persists in interactive physics classes, but is largely due to differences in physics and math preparation and incoming attitudes and beliefs.

  6. Logistic Mixed Models to Investigate Implicit and Explicit Belief Tracking.

    PubMed

    Lages, Martin; Scheel, Anne

    2016-01-01

    We investigated the proposition of a two-systems Theory of Mind in adults' belief tracking. A sample of N = 45 participants predicted the choice of one of two opponent players after observing several rounds in an animated card game. Three matches of this card game were played and initial gaze direction on target and subsequent choice predictions were recorded for each belief task and participant. We conducted logistic regressions with mixed effects on the binary data and developed Bayesian logistic mixed models to infer implicit and explicit mentalizing in true belief and false belief tasks. Although logistic regressions with mixed effects predicted the data well a Bayesian logistic mixed model with latent task- and subject-specific parameters gave a better account of the data. As expected explicit choice predictions suggested a clear understanding of true and false beliefs (TB/FB). Surprisingly, however, model parameters for initial gaze direction also indicated belief tracking. We discuss why task-specific parameters for initial gaze directions are different from choice predictions yet reflect second-order perspective taking.

  7. Model selection for logistic regression models

    NASA Astrophysics Data System (ADS)

    Duller, Christine

    2012-09-01

    Model selection for logistic regression models decides which of some given potential regressors have an effect and hence should be included in the final model. The second interesting question is whether a certain factor is heterogeneous among some subsets, i.e. whether the model should include a random intercept or not. In this paper these questions will be answered with classical as well as with Bayesian methods. The application show some results of recent research projects in medicine and business administration.

  8. Radiomorphometric analysis of frontal sinus for sex determination.

    PubMed

    Verma, Saumya; Mahima, V G; Patil, Karthikeya

    2014-09-01

    Sex determination of unknown individuals carries crucial significance in forensic research, in cases where fragments of skull persist with no likelihood of identification based on dental arch. In these instances sex determination becomes important to rule out certain number of possibilities instantly and helps in establishing a biological profile of human remains. The aim of the study is to evaluate a mathematical method based on logistic regression analysis capable of ascertaining the sex of individuals in the South Indian population. The study was conducted in the department of Oral Medicine and Radiology. The right and left areas, maximum height, width of frontal sinus were determined in 100 Caldwell views of 50 women and 50 men aged 20 years and above, with the help of Vernier callipers and a square grid with 1 square measuring 1mm(2) in area. Student's t-test, logistic regression analysis. The mean values of variables were greater in men, based on Student's t-test at 5% level of significance. The mathematical model based on logistic regression analysis gave percentage agreement of total area to correctly predict the female gender as 55.2%, of right area as 60.9% and of left area as 55.2%. The areas of the frontal sinus and the logistic regression proved to be unreliable in sex determination. (Logit = 0.924 - 0.00217 × right area).

  9. Estimating multilevel logistic regression models when the number of clusters is low: a comparison of different statistical software procedures.

    PubMed

    Austin, Peter C

    2010-04-22

    Multilevel logistic regression models are increasingly being used to analyze clustered data in medical, public health, epidemiological, and educational research. Procedures for estimating the parameters of such models are available in many statistical software packages. There is currently little evidence on the minimum number of clusters necessary to reliably fit multilevel regression models. We conducted a Monte Carlo study to compare the performance of different statistical software procedures for estimating multilevel logistic regression models when the number of clusters was low. We examined procedures available in BUGS, HLM, R, SAS, and Stata. We found that there were qualitative differences in the performance of different software procedures for estimating multilevel logistic models when the number of clusters was low. Among the likelihood-based procedures, estimation methods based on adaptive Gauss-Hermite approximations to the likelihood (glmer in R and xtlogit in Stata) or adaptive Gaussian quadrature (Proc NLMIXED in SAS) tended to have superior performance for estimating variance components when the number of clusters was small, compared to software procedures based on penalized quasi-likelihood. However, only Bayesian estimation with BUGS allowed for accurate estimation of variance components when there were fewer than 10 clusters. For all statistical software procedures, estimation of variance components tended to be poor when there were only five subjects per cluster, regardless of the number of clusters.

  10. Building a Decision Support System for Inpatient Admission Prediction With the Manchester Triage System and Administrative Check-in Variables.

    PubMed

    Zlotnik, Alexander; Alfaro, Miguel Cuchí; Pérez, María Carmen Pérez; Gallardo-Antolín, Ascensión; Martínez, Juan Manuel Montero

    2016-05-01

    The usage of decision support tools in emergency departments, based on predictive models, capable of estimating the probability of admission for patients in the emergency department may give nursing staff the possibility of allocating resources in advance. We present a methodology for developing and building one such system for a large specialized care hospital using a logistic regression and an artificial neural network model using nine routinely collected variables available right at the end of the triage process.A database of 255.668 triaged nonobstetric emergency department presentations from the Ramon y Cajal University Hospital of Madrid, from January 2011 to December 2012, was used to develop and test the models, with 66% of the data used for derivation and 34% for validation, with an ordered nonrandom partition. On the validation dataset areas under the receiver operating characteristic curve were 0.8568 (95% confidence interval, 0.8508-0.8583) for the logistic regression model and 0.8575 (95% confidence interval, 0.8540-0. 8610) for the artificial neural network model. χ Values for Hosmer-Lemeshow fixed "deciles of risk" were 65.32 for the logistic regression model and 17.28 for the artificial neural network model. A nomogram was generated upon the logistic regression model and an automated software decision support system with a Web interface was built based on the artificial neural network model.

  11. Product unit neural network models for predicting the growth limits of Listeria monocytogenes.

    PubMed

    Valero, A; Hervás, C; García-Gimeno, R M; Zurera, G

    2007-08-01

    A new approach to predict the growth/no growth interface of Listeria monocytogenes as a function of storage temperature, pH, citric acid (CA) and ascorbic acid (AA) is presented. A linear logistic regression procedure was performed and a non-linear model was obtained by adding new variables by means of a Neural Network model based on Product Units (PUNN). The classification efficiency of the training data set and the generalization data of the new Logistic Regression PUNN model (LRPU) were compared with Linear Logistic Regression (LLR) and Polynomial Logistic Regression (PLR) models. 92% of the total cases from the LRPU model were correctly classified, an improvement on the percentage obtained using the PLR model (90%) and significantly higher than the results obtained with the LLR model, 80%. On the other hand predictions of LRPU were closer to data observed which permits to design proper formulations in minimally processed foods. This novel methodology can be applied to predictive microbiology for describing growth/no growth interface of food-borne microorganisms such as L. monocytogenes. The optimal balance is trying to find models with an acceptable interpretation capacity and with good ability to fit the data on the boundaries of variable range. The results obtained conclude that these kinds of models might well be very a valuable tool for mathematical modeling.

  12. Analysis of a database to predict the result of allergy testing in vivo in patients with chronic nasal symptoms.

    PubMed

    Lacagnina, Valerio; Leto-Barone, Maria S; La Piana, Simona; Seidita, Aurelio; Pingitore, Giuseppe; Di Lorenzo, Gabriele

    2014-01-01

    This article uses the logistic regression model for diagnostic decision making in patients with chronic nasal symptoms. We studied the ability of the logistic regression model, obtained by the evaluation of a database, to detect patients with positive allergy skin-prick test (SPT) and patients with negative SPT. The model developed was validated using the data set obtained from another medical institution. The analysis was performed using a database obtained from a questionnaire administered to the patients with nasal symptoms containing personal data, clinical data, and results of allergy testing (SPT). All variables found to be significantly different between patients with positive and negative SPT (p < 0.05) were selected for the logistic regression models and were analyzed with backward stepwise logistic regression, evaluated with area under the curve of the receiver operating characteristic curve. A second set of patients from another institution was used to prove the model. The accuracy of the model in identifying, over the second set, both patients whose SPT will be positive and negative was high. The model detected 96% of patients with nasal symptoms and positive SPT and classified 94% of those with negative SPT. This study is preliminary to the creation of a software that could help the primary care doctors in a diagnostic decision making process (need of allergy testing) in patients complaining of chronic nasal symptoms.

  13. Comparing machine learning and logistic regression methods for predicting hypertension using a combination of gene expression and next-generation sequencing data.

    PubMed

    Held, Elizabeth; Cape, Joshua; Tintle, Nathan

    2016-01-01

    Machine learning methods continue to show promise in the analysis of data from genetic association studies because of the high number of variables relative to the number of observations. However, few best practices exist for the application of these methods. We extend a recently proposed supervised machine learning approach for predicting disease risk by genotypes to be able to incorporate gene expression data and rare variants. We then apply 2 different versions of the approach (radial and linear support vector machines) to simulated data from Genetic Analysis Workshop 19 and compare performance to logistic regression. Method performance was not radically different across the 3 methods, although the linear support vector machine tended to show small gains in predictive ability relative to a radial support vector machine and logistic regression. Importantly, as the number of genes in the models was increased, even when those genes contained causal rare variants, model predictive ability showed a statistically significant decrease in performance for both the radial support vector machine and logistic regression. The linear support vector machine showed more robust performance to the inclusion of additional genes. Further work is needed to evaluate machine learning approaches on larger samples and to evaluate the relative improvement in model prediction from the incorporation of gene expression data.

  14. Risk factors for syphilis in women: case-control study

    PubMed Central

    de Macêdo, Vilma Costa; de Lira, Pedro Israel Cabral; de Frias, Paulo Germano; Romaguera, Luciana Maria Delgado; Caires, Silvana de Fátima Ferreira; Ximenes, Ricardo Arraes de Alencar

    2017-01-01

    ABSTRACT OBJECTIVE To determine the sociodemographic, behavioral, and health care factors related to the occurrence of syphilis in women treated at public maternity hospitals. METHODS This is a case-control study (239 cases and 322 controls) with women admitted to seven maternity hospitals in the municipality of Recife, Brazil, from July 2013 to July 2014. Eligible women were recruited after the result of the VDRL (Venereal Disease Research Laboratory) under any titration. The selection of cases and controls was based on the result of the serology for syphilis using ELISA (enzyme-linked immunosorbent assay). The independent variables were grouped into: sociodemographic, behavioral, clinical and obstetric history, and health care in prenatal care and maternity hospital. Information was obtained by interview, during hospitalization, with the application of a questionnaire. Odds ratios and 95% confidence intervals were estimated using logistic regression to identify the predicting factors of the variable to be explained. RESULTS The logistic regression analysis identified as determinant factors for gestational syphilis: education level of incomplete basic education or illiterate (OR = 2.02), lack of access to telephone (OR = 2.4), catholic religion (OR = 1.70 ), four or more pregnancies (OR = 2.2), three or more sexual partners in the last year (OR = 3.1), use of illicit drugs before the age of 18 (OR = 3.0), and use of illicit drugs by the current partner (OR = 1.7). Only one to three prenatal appointments (OR = 3.5) and a previous history of sexually transmitted infection (OR = 9.7) were also identified as determinant factors. CONCLUSIONS Sociodemographic, behavioral, and health care factors are associated with the occurrence of syphilis in women and should be taken into account in the elaboration of universal strategies aimed at the prevention and control of syphilis, but with a focus on situations of greater vulnerability. PMID:28832758

  15. Clinical Outcome And Arginine Serum of Acute Ischemic Stroke Patients Supplemented by Snakehead Fish Extract

    NASA Astrophysics Data System (ADS)

    Pudjonarko, Dwi; Retnaningsih; Abidin, Zainal

    2018-02-01

    Background: Levels of arginine associated with clinical outcome in acute ischemic stroke (AIS). Arginine is a protein needed to synthesis nitric oxide (NO), a potential vasodilator and antioxidant. Snakehead fish is a source of protein which has antioxidant activity. Snakehead fish contains mineral, vitamin, and amino acids. One of the amino acids that were found quite high in snakehead fish extract is arginine. The aim of this study was done to determine the effect of snakehead fish extracts (SFE) on serum arginin levels and clinical outcome of AIS patients. Methods: It was double-blind randomized pretest-posttest control group design, with. AIS patients were divided into two groups i.e. snakehead fish extracts (SFE) and control. SFE group were administered 15 grams SFE for 7 days . Arginine serum levels and clinical outcome (measured by National Institute of Health Stroke Scale = NIHSS) were measured before and after treatment, other related factors were also analyzed in Logistic regression. Results: A total of 42 subjects who were performed random allocation as SFE or control group. There was no differences in subject characteristics between the two groups. There was a differences Δ arginine serum levels between SFE and control (33.6±19.95 μmol/L 0.3±2.51 μmol/L p<0.001). Change in NIHSS score in SFE improved significantly compared to the control group (4.14 ± 2.03; 2.52 ± 1.81;p=0.009 ). Logistic regression analysis showed only female gender factor that affected on improvement of NIHSS (OR=7; p=0,01). Conclusion: There is Clinical outcome improvement and enhancement of arginine serum levels in AIS patient with snakehead fish extract supplementation.

  16. Association between prostate cancer and urinary calculi: a population-based study.

    PubMed

    Chung, Shiu-Dong; Liu, Shih-Ping; Lin, Herng-Ching

    2013-01-01

    Understanding the reasons underlying the emerging trend and the changing demographics of Asian prostate cancer (PC) has become an important field of study. This study set out to explore the possibility that urinary calculi (UC) and PC may share an association by conducting a case-control study on a population-based database in Taiwan. The cases of this study included 2,900 subjects ≥ 40 years-old who had received their first-time diagnosis of PC and 14,500 randomly selected controls without PC. Conditional logistic regressions were employed to explore the association between PC and having been previously diagnosed with UC. We found that prior UC was found among 608 (21.0%) cases and 2,037 (14.1%) controls (p<0.001). Conditional logistic regression analysis revealed that compared to controls, the odds ratio (OR) of prior UC for cases was 1.63 (95% CI = 1.47-1.80). Furthermore, we found that cases were more likely to have been previously diagnosed with kidney calculus (OR = 1.71; 95% CI = 1.42-2.05), bladder calculus (OR = 2.06; 95% CI = 1.32-3.23), unspecified calculus (OR = 1.66; 95% CI = 1.37-2.00), and ≥2 locations of UC (OR = 1.73; 1.47-2.02) than controls. However, there was no significant relationship between PC and prior ureter calculus. We also found that of the patients with UC, there was no significant difference between PC and treatment method. This investigation detected an association between PC and prior UC. These results highlight a potential target population for PC screening.

  17. Association of bladder pain syndrome/interstitial cystitis with urinary calculus: a nationwide population-based study.

    PubMed

    Keller, Joseph; Chen, Yi-Kuang; Lin, Herng-Ching

    2013-04-01

    Although one prior study reported an association between bladder pain syndrome/interstitial cystitis (BPS/IC) and urinary calculi (UC), no population-based study to date has been conducted to explore this relationship. Therefore, using a population-based data set in Taiwan, this study set out to investigate the association between BPS/IC and a prior diagnosis of UC. This study included 9,269 cases who had received their first-time diagnosis of BPS/IC between 2006 and 2007 and 46,345 randomly selected controls. We used conditional logistic regression analysis to compute the odds ratio (OR) and its corresponding 95 % confidence interval (CI) for having been previously diagnosed with UC between cases and controls. There was a significant difference in the prevalence of prior UC between cases and controls (8.1 vs 4.3 %, p < 0.001). Conditional logistic regression analysis revealed that cases were more likely to have been previously diagnosed with UC than controls (OR = 1.70; 95 % CI = 1.56-1.84) after adjusting for chronic pelvic pain, irritable bowel syndrome, fibromyalgia, chronic fatigue syndrome, depression, panic disorder, migraine, sicca syndrome, allergy, endometriosis, and asthma. BPS/IC was found to be significantly associated with prior UC regardless of stone location; the adjusted ORs of kidney calculus, ureter calculus, bladder calculus, and unspecified calculus when compared to controls were 1.58 (95 % CI = 1.38-1.81), 1.73 (95 % CI = 1.45-2.05), 3.80 (95 % CI = 2.18-6.62), and 1.83 (95 % CI = 1.59-2.11), respectively. This work generates the hypothesis that UC may be associated with BPS/IC.

  18. Risk factors associated with default among tuberculosis patients in Darjeeling district of West Bengal, India.

    PubMed

    Roy, Nirmalya; Basu, Mausumi; Das, Sibasis; Mandal, Amitava; Dutt, Debashis; Dasgupta, Samir

    2015-01-01

    The treatment outcome "default" under Revised National Tuberculosis Control Program (RNTCP) is a patient who after treatment initiation has interrupted treatment consecutively for more than 2 months. To assess the timing, characteristics and distribution of the reasons for default with relation to some sociodemographic variables among new sputum-positive (NSP) tuberculosis (TB) patients in Darjeeling District, West Bengal. A case-control study was conducted in three tuberculosis units (TUs) of Darjeeling from August'2011 to December'2011 among NSP TB patients enrolled for treatment in the TB register from 1(st) Qtr'09 to 2(nd) Qtr'10. Patients defaulted from treatment were considered as "cases" and those completed treatment as "controls" (79 cases and 79 controls). The enrolled cases and controls were interviewed by the health workers using a predesigned structured pro-forma. Logistic regression analysis, odds ratios (OR), adjusted odds ratios (AOR). 75% of the default occurred in the intensive phase (IP); 54.24% retrieval action was done within 1 day during IP and 75% within 1 week during continuation phase (CP); cent percent of the documented retrieval actions were undertaken by the contractual TB program staffs. Most commonly cited reasons for default were alcohol consumption (29.11%), adverse effects of drugs (25.32%), and long distance of DOT center (21.52%). In the logistic regression analysis, the factors independently associated were consumption of alcohol, inadequate knowledge about TB, inadequate patient provider interaction, instances of missed doses, adverse reactions of anti-TB drugs, Government Directly Observed Treatment (DOT) provider and smoking. Most defaults occurred in the intensive phase; pre-treatment counseling and initial home visit play very important role in this regard. Proper counseling by health care workers in patient provider meeting is needed.

  19. Helicobacter pylori coinfection is a confounder, modulating mucosal inflammation in oral submucous fibrosis.

    PubMed

    Rajendran, R; Rajeev, R; Anil, S; Alasqah, Mohammed; Rabi, Abdul Gafoor

    2009-01-01

    The oral cavity has been considered a potential reservoir for Helicobacter pylori (H pylori) , from where the organism causes recurrent gastric infections. With this case-control study we tried to evaluate the role of H pylori in the etiology of mucosal inflammation, a condition that compounds the morbid state associated with oral submucous fibrosis (OSF). Subjects ( n = 150) were selected following institutional regulations on sample collection and grouped into test cases and positive and negative controls based on the presence of mucosal fibrosis and inflammation. The negative controls had none of the clinical signs. All patients underwent an oral examination as well as tests to assess oral hygiene/periodontal disease status; a rapid urease test (RUT) of plaque samples was also done to estimate the H pylori bacterial load. We used univariate and mutivariate logistic regression for statistical analysis of the data and calculated the odds ratios to assess the risk posed by the different variables. The RUT results differed significantly between the groups, reflecting the variations in the bacterial loads in each category. The test was positive in 52% in the positive controls (where nonspecific inflammation of oral mucosa was seen unassociated with fibrosis), in 46% of the test cases, and in 18% of the negative controls (healthy volunteers) (chi2 = 13.887; P < 0.01). A positive correlation was seen between the oral hygiene/periodontal disease indices and RUT reactivity in all the three groups. The contribution of the H pylori in dental plaque to mucosal inflammation and periodontal disease was significant. Logistic regression analysis showed gastrointestinal disease and poor oral hygiene as being the greatest risk factors for bacterial colonization, irrespective of the subject groups. A positive correlation exists between RUT reactivity and the frequency of mucosal inflammation.

  20. Familial trends in a population with macular holes.

    PubMed

    Kay, Christine Nichols; Pavan, Peter Reed; Small, Laurie Buccina; Zhang, Tao; Zamba, Gideon K D; Cohen, Steven Myles

    2012-04-01

    To determine if patients with macular hole report an increased family history of macular hole compared with control patients and compare the report of family history between patients with unilateral and bilateral macular holes. This was a multicenter case-control study. Charts of patients coded with diagnosis of macular hole were reviewed, and the diagnosis of idiopathic full-thickness macular hole was ascertained in 166 patients. The control group comprised 136 patients without macular hole or trauma who presented with senile cataract. Family history was obtained from all patients through a telephone interview. Six of 166 (3.6%) macular hole patients surveyed reported a history of macular hole in a primary relative compared with none of 136 (0.0%) control patients (odds ratio is infinity, with 95% confidence interval 1.295 to infinity); however, this finding may be explained by confounders such as age and number of family members. Two of the 142 (1.4%) patients with unilateral holes versus 4 of the 24 (16.7%) patients with bilateral holes reported a family history (odds ratio is 0.0714, with 95% confidence interval 0.0063 to 0.5537), and this finding remains significant when logistic regression is performed to evaluate variables of age and number of family members as potential confounders. There is an increased report of familial occurrence of macular hole in patients with macular holes compared with control patients; however, logistic regression relates this finding to variables of age and number of family members. Patients with bilateral macular holes are more likely to report a family history of macular hole than patients with unilateral macular holes, and this finding remains significant in the presence of age and number of family members. These findings may suggest a familial component to macular hole.

Top