Sample records for disease statistical models

  1. ICD-11 and DSM-5 personality trait domains capture categorical personality disorders: Finding a common ground.

    PubMed

    Bach, Bo; Sellbom, Martin; Skjernov, Mathias; Simonsen, Erik

    2018-05-01

    The five personality disorder trait domains in the proposed International Classification of Diseases, 11th edition and the Diagnostic and Statistical Manual of Mental Disorders, 5th edition are comparable in terms of Negative Affectivity, Detachment, Antagonism/Dissociality and Disinhibition. However, the International Classification of Diseases, 11th edition model includes a separate domain of Anankastia, whereas the Diagnostic and Statistical Manual of Mental Disorders, 5th edition model includes an additional domain of Psychoticism. This study examined associations of International Classification of Diseases, 11th edition and Diagnostic and Statistical Manual of Mental Disorders, 5th edition trait domains, simultaneously, with categorical personality disorders. Psychiatric outpatients ( N = 226) were administered the Structured Clinical Interview for DSM-IV Axis II Personality Disorders Interview and the Personality Inventory for DSM-5. International Classification of Diseases, 11th edition and Diagnostic and Statistical Manual of Mental Disorders, 5th edition trait domain scores were obtained using pertinent scoring algorithms for the Personality Inventory for DSM-5. Associations between categorical personality disorders and trait domains were examined using correlation and multiple regression analyses. Both the International Classification of Diseases, 11th edition and the Diagnostic and Statistical Manual of Mental Disorders, 5th edition domain models showed relevant continuity with categorical personality disorders and captured a substantial amount of their information. As expected, the International Classification of Diseases, 11th edition model was superior in capturing obsessive-compulsive personality disorder, whereas the Diagnostic and Statistical Manual of Mental Disorders, 5th edition model was superior in capturing schizotypal personality disorder. These preliminary findings suggest that little information is 'lost' in a transition to trait domain models and potentially adds to narrowing the gap between Diagnostic and Statistical Manual of Mental Disorders, 5th edition and the proposed International Classification of Diseases, 11th edition model. Accordingly, the International Classification of Diseases, 11th edition and Diagnostic and Statistical Manual of Mental Disorders, 5th edition domain models may be used to delineate one another as well as features of familiar categorical personality disorder types. A preliminary category-to-domain 'cross walk' is provided in the article.

  2. Statistical Mechanics of Prion Diseases

    NASA Astrophysics Data System (ADS)

    Slepoy, A.; Singh, R. R.; Pázmándi, F.; Kulkarni, R. V.; Cox, D. L.

    2001-07-01

    We present a two-dimensional, lattice based, protein-level statistical mechanical model for prion diseases (e.g., mad cow disease) with concomitant prion protein misfolding and aggregation. Our studies lead us to the hypothesis that the observed broad incubation time distribution in epidemiological data reflect fluctuation dominated growth seeded by a few nanometer scale aggregates, while much narrower incubation time distributions for innoculated lab animals arise from statistical self-averaging. We model ``species barriers'' to prion infection and assess a related treatment protocol.

  3. Statistical Model of Dynamic Markers of the Alzheimer's Pathological Cascade.

    PubMed

    Balsis, Steve; Geraci, Lisa; Benge, Jared; Lowe, Deborah A; Choudhury, Tabina K; Tirso, Robert; Doody, Rachelle S

    2018-05-05

    Alzheimer's disease (AD) is a progressive disease reflected in markers across assessment modalities, including neuroimaging, cognitive testing, and evaluation of adaptive function. Identifying a single continuum of decline across assessment modalities in a single sample is statistically challenging because of the multivariate nature of the data. To address this challenge, we implemented advanced statistical analyses designed specifically to model complex data across a single continuum. We analyzed data from the Alzheimer's Disease Neuroimaging Initiative (ADNI; N = 1,056), focusing on indicators from the assessments of magnetic resonance imaging (MRI) volume, fluorodeoxyglucose positron emission tomography (FDG-PET) metabolic activity, cognitive performance, and adaptive function. Item response theory was used to identify the continuum of decline. Then, through a process of statistical scaling, indicators across all modalities were linked to that continuum and analyzed. Findings revealed that measures of MRI volume, FDG-PET metabolic activity, and adaptive function added measurement precision beyond that provided by cognitive measures, particularly in the relatively mild range of disease severity. More specifically, MRI volume, and FDG-PET metabolic activity become compromised in the very mild range of severity, followed by cognitive performance and finally adaptive function. Our statistically derived models of the AD pathological cascade are consistent with existing theoretical models.

  4. Risk Estimation for Lung Cancer in Libya: Analysis Based on Standardized Morbidity Ratio, Poisson-Gamma Model, BYM Model and Mixture Model

    PubMed

    Alhdiri, Maryam Ahmed; Samat, Nor Azah; Mohamed, Zulkifley

    2017-03-01

    Cancer is the most rapidly spreading disease in the world, especially in developing countries, including Libya. Cancer represents a significant burden on patients, families, and their societies. This disease can be controlled if detected early. Therefore, disease mapping has recently become an important method in the fields of public health research and disease epidemiology. The correct choice of statistical model is a very important step to producing a good map of a disease. Libya was selected to perform this work and to examine its geographical variation in the incidence of lung cancer. The objective of this paper is to estimate the relative risk for lung cancer. Four statistical models to estimate the relative risk for lung cancer and population censuses of the study area for the time period 2006 to 2011 were used in this work. They are initially known as Standardized Morbidity Ratio, which is the most popular statistic, which used in the field of disease mapping, Poisson-gamma model, which is one of the earliest applications of Bayesian methodology, Besag, York and Mollie (BYM) model and Mixture model. As an initial step, this study begins by providing a review of all proposed models, which we then apply to lung cancer data in Libya. Maps, tables and graph, goodness-of-fit (GOF) were used to compare and present the preliminary results. This GOF is common in statistical modelling to compare fitted models. The main general results presented in this study show that the Poisson-gamma model, BYM model, and Mixture model can overcome the problem of the first model (SMR) when there is no observed lung cancer case in certain districts. Results show that the Mixture model is most robust and provides better relative risk estimates across a range of models. Creative Commons Attribution License

  5. Risk Estimation for Lung Cancer in Libya: Analysis Based on Standardized Morbidity Ratio, Poisson-Gamma Model, BYM Model and Mixture Model

    PubMed Central

    Alhdiri, Maryam Ahmed; Samat, Nor Azah; Mohamed, Zulkifley

    2017-01-01

    Cancer is the most rapidly spreading disease in the world, especially in developing countries, including Libya. Cancer represents a significant burden on patients, families, and their societies. This disease can be controlled if detected early. Therefore, disease mapping has recently become an important method in the fields of public health research and disease epidemiology. The correct choice of statistical model is a very important step to producing a good map of a disease. Libya was selected to perform this work and to examine its geographical variation in the incidence of lung cancer. The objective of this paper is to estimate the relative risk for lung cancer. Four statistical models to estimate the relative risk for lung cancer and population censuses of the study area for the time period 2006 to 2011 were used in this work. They are initially known as Standardized Morbidity Ratio, which is the most popular statistic, which used in the field of disease mapping, Poisson-gamma model, which is one of the earliest applications of Bayesian methodology, Besag, York and Mollie (BYM) model and Mixture model. As an initial step, this study begins by providing a review of all proposed models, which we then apply to lung cancer data in Libya. Maps, tables and graph, goodness-of-fit (GOF) were used to compare and present the preliminary results. This GOF is common in statistical modelling to compare fitted models. The main general results presented in this study show that the Poisson-gamma model, BYM model, and Mixture model can overcome the problem of the first model (SMR) when there is no observed lung cancer case in certain districts. Results show that the Mixture model is most robust and provides better relative risk estimates across a range of models. PMID:28440974

  6. The potential of composite cognitive scores for tracking progression in Huntington's disease.

    PubMed

    Jones, Rebecca; Stout, Julie C; Labuschagne, Izelle; Say, Miranda; Justo, Damian; Coleman, Allison; Dumas, Eve M; Hart, Ellen; Owen, Gail; Durr, Alexandra; Leavitt, Blair R; Roos, Raymund; O'Regan, Alison; Langbehn, Doug; Tabrizi, Sarah J; Frost, Chris

    2014-01-01

    Composite scores derived from joint statistical modelling of individual risk factors are widely used to identify individuals who are at increased risk of developing disease or of faster disease progression. We investigated the ability of composite measures developed using statistical models to differentiate progressive cognitive deterioration in Huntington's disease (HD) from natural decline in healthy controls. Using longitudinal data from TRACK-HD, the optimal combinations of quantitative cognitive measures to differentiate premanifest and early stage HD individuals respectively from controls was determined using logistic regression. Composite scores were calculated from the parameters of each statistical model. Linear regression models were used to calculate effect sizes (ES) quantifying the difference in longitudinal change over 24 months between premanifest and early stage HD groups respectively and controls. ES for the composites were compared with ES for individual cognitive outcomes and other measures used in HD research. The 0.632 bootstrap was used to eliminate biases which result from developing and testing models in the same sample. In early HD, the composite score from the HD change prediction model produced an ES for difference in rate of 24-month change relative to controls of 1.14 (95% CI: 0.90 to 1.39), larger than the ES for any individual cognitive outcome and UHDRS Total Motor Score and Total Functional Capacity. In addition, this composite gave a statistically significant difference in rate of change in premanifest HD compared to controls over 24-months (ES: 0.24; 95% CI: 0.04 to 0.44), even though none of the individual cognitive outcomes produced statistically significant ES over this period. Composite scores developed using appropriate statistical modelling techniques have the potential to materially reduce required sample sizes for randomised controlled trials.

  7. Forecasting infectious disease emergence subject to seasonal forcing.

    PubMed

    Miller, Paige B; O'Dea, Eamon B; Rohani, Pejman; Drake, John M

    2017-09-06

    Despite high vaccination coverage, many childhood infections pose a growing threat to human populations. Accurate disease forecasting would be of tremendous value to public health. Forecasting disease emergence using early warning signals (EWS) is possible in non-seasonal models of infectious diseases. Here, we assessed whether EWS also anticipate disease emergence in seasonal models. We simulated the dynamics of an immunizing infectious pathogen approaching the tipping point to disease endemicity. To explore the effect of seasonality on the reliability of early warning statistics, we varied the amplitude of fluctuations around the average transmission. We proposed and analyzed two new early warning signals based on the wavelet spectrum. We measured the reliability of the early warning signals depending on the strength of their trend preceding the tipping point and then calculated the Area Under the Curve (AUC) statistic. Early warning signals were reliable when disease transmission was subject to seasonal forcing. Wavelet-based early warning signals were as reliable as other conventional early warning signals. We found that removing seasonal trends, prior to analysis, did not improve early warning statistics uniformly. Early warning signals anticipate the onset of critical transitions for infectious diseases which are subject to seasonal forcing. Wavelet-based early warning statistics can also be used to forecast infectious disease.

  8. Effects of alcohol tax increases on alcohol-related disease mortality in Alaska: time-series analyses from 1976 to 2004.

    PubMed

    Wagenaar, Alexander C; Maldonado-Molina, Mildred M; Wagenaar, Bradley H

    2009-08-01

    We evaluated the effects of tax increases on alcoholic beverages in 1983 and 2002 on alcohol-related disease mortality in Alaska. We used a quasi-experimental design with quarterly measures of mortality from 1976 though 2004, and we included other states for comparison. Our statistical approach combined an autoregressive integrated moving average model with structural parameters in interrupted time-series models. We observed statistically significant reductions in the numbers and rates of deaths caused by alcohol-related disease beginning immediately after the 1983 and 2002 alcohol tax increases in Alaska. In terms of effect size, the reductions were -29% (Cohen's d = -0.57) and -11% (Cohen's d = -0.52) for the 2 tax increases. Statistical tests of temporary-effect models versus long-term-effect models showed little dissipation of the effect over time. Increases in alcohol excise tax rates were associated with immediate and sustained reductions in alcohol-related disease mortality in Alaska. Reductions in mortality occurred after 2 tax increases almost 20 years apart. Taxing alcoholic beverages is an effective public health strategy for reducing the burden of alcohol-related disease.

  9. A statistical model for monitoring shell disease in inshore lobster fisheries: A case study in Long Island Sound

    PubMed Central

    Chen, Yong

    2017-01-01

    The expansion of shell disease is an emerging threat to the inshore lobster fisheries in the northeastern United States. The development of models to improve the efficiency and precision of existing monitoring programs is advocated as an important step in mitigating its harmful effects. The objective of this study is to construct a statistical model that could enhance the existing monitoring effort through (1) identification of potential disease-associated abiotic and biotic factors, and (2) estimation of spatial variation in disease prevalence in the lobster fishery. A delta-generalized additive modeling (GAM) approach was applied using bottom trawl survey data collected from 2001–2013 in Long Island Sound, a tidal estuary between New York and Connecticut states. Spatial distribution of shell disease prevalence was found to be strongly influenced by the interactive effects of latitude and longitude, possibly indicative of a geographic origin of shell disease. Bottom temperature, bottom salinity, and depth were also important factors affecting the spatial variability in shell disease prevalence. The delta-GAM projected high disease prevalence in non-surveyed locations. Additionally, a potential spatial discrepancy was found between modeled disease hotspots and survey-based gravity centers of disease prevalence. This study provides a modeling framework to enhance research, monitoring and management of emerging and continuing marine disease threats. PMID:28196150

  10. Statistical Modeling of Disease Progression for Chronic Obstructive Pulmonary Disease Using Data from the ECLIPSE Study.

    PubMed

    Exuzides, Alex; Colby, Chris; Briggs, Andrew H; Lomas, David A; Rutten-van Mölken, Maureen P M H; Tabberer, Maggie; Chambers, Mike; Muellerova, Hana; Locantore, Nicholas; Risebrough, Nancy A; Ismaila, Afisi S; Gonzalez-McQuire, Sebastian

    2017-05-01

    To develop statistical models predicting disease progression and outcomes in chronic obstructive pulmonary disease (COPD), using data from ECLIPSE, a large, observational study of current and former smokers with COPD. Based on a conceptual model of COPD disease progression and data from 2164 patients, associations were made between baseline characteristics, COPD disease progression attributes (exacerbations, lung function, exercise capacity, and symptoms), health-related quality of life (HRQoL), and survival. Linear and nonlinear functional forms of random intercept models were used to characterize these relationships. Endogeneity was addressed by time-lagging variables in the regression models. At the 5% significance level, an exacerbation history in the year before baseline was associated with increased risk of future exacerbations (moderate: +125.8%; severe: +89.2%) and decline in lung function (forced expiratory volume in 1 second [FEV 1 ]) (-94.20 mL per year). Each 1% increase in FEV 1 % predicted was associated with decreased risk of exacerbations (moderate: -1.1%; severe: -3.0%) and increased 6-minute walk test distance (6MWD) (+1.5 m). Increases in baseline exercise capacity (6MWD, per meter) were associated with slightly increased risk of moderate exacerbations (+0.04%) and increased FEV 1 (+0.62 mL). Symptoms (dyspnea, cough, and/or sputum) were associated with an increased risk of moderate exacerbations (+13.4% to +31.1%), and baseline dyspnea (modified Medical Research Council score ≥2 v. <2) was associated with lower FEV 1 (-112.3 mL). A series of linked statistical regression equations have been developed to express associations between indicators of COPD disease severity and HRQoL and survival. These can be used to represent disease progression, for example, in new economic models of COPD.

  11. When mechanism matters: Bayesian forecasting using models of ecological diffusion

    USGS Publications Warehouse

    Hefley, Trevor J.; Hooten, Mevin B.; Russell, Robin E.; Walsh, Daniel P.; Powell, James A.

    2017-01-01

    Ecological diffusion is a theory that can be used to understand and forecast spatio-temporal processes such as dispersal, invasion, and the spread of disease. Hierarchical Bayesian modelling provides a framework to make statistical inference and probabilistic forecasts, using mechanistic ecological models. To illustrate, we show how hierarchical Bayesian models of ecological diffusion can be implemented for large data sets that are distributed densely across space and time. The hierarchical Bayesian approach is used to understand and forecast the growth and geographic spread in the prevalence of chronic wasting disease in white-tailed deer (Odocoileus virginianus). We compare statistical inference and forecasts from our hierarchical Bayesian model to phenomenological regression-based methods that are commonly used to analyse spatial occurrence data. The mechanistic statistical model based on ecological diffusion led to important ecological insights, obviated a commonly ignored type of collinearity, and was the most accurate method for forecasting.

  12. Estimation of diagnostic test accuracy without full verification: a review of latent class methods

    PubMed Central

    Collins, John; Huynh, Minh

    2014-01-01

    The performance of a diagnostic test is best evaluated against a reference test that is without error. For many diseases, this is not possible, and an imperfect reference test must be used. However, diagnostic accuracy estimates may be biased if inaccurately verified status is used as the truth. Statistical models have been developed to handle this situation by treating disease as a latent variable. In this paper, we conduct a systematized review of statistical methods using latent class models for estimating test accuracy and disease prevalence in the absence of complete verification. PMID:24910172

  13. Methodological considerations, such as directed acyclic graphs, for studying "acute on chronic" disease epidemiology: chronic obstructive pulmonary disease example.

    PubMed

    Tsai, Chu-Lin; Camargo, Carlos A

    2009-09-01

    Acute exacerbations of chronic disease are ubiquitous in clinical medicine, and thus far, there has been a paucity of integrated methodological discussion on this phenomenon. We use acute exacerbations of chronic obstructive pulmonary disease as an example to emphasize key epidemiological and statistical issues for this understudied field in clinical epidemiology. Directed acyclic graphs are a useful epidemiological tool to explain the differential effects of risk factor on health outcomes in studies of acute and chronic phases of disease. To study the pathogenesis of acute exacerbations of chronic disease, case-crossover design and time-series analysis are well-suited study designs to differentiate acute and chronic effect. Modeling changes over time and setting appropriate thresholds are important steps to separate acute from chronic phases of disease in serial measurements. In statistical analysis, acute exacerbations are recurrent events, and some individuals are more prone to recurrences than others. Therefore, appropriate statistical modeling should take into account intraindividual dependence. Finally, we recommend the use of "event-based" number needed to treat (NNT) to prevent a single exacerbation instead of traditional patient-based NNT. Addressing these methodological challenges will advance research quality in acute on chronic disease epidemiology.

  14. Statistical genetics concepts and approaches in schizophrenia and related neuropsychiatric research.

    PubMed

    Schork, Nicholas J; Greenwood, Tiffany A; Braff, David L

    2007-01-01

    Statistical genetics is a research field that focuses on mathematical models and statistical inference methodologies that relate genetic variations (ie, naturally occurring human DNA sequence variations or "polymorphisms") to particular traits or diseases (phenotypes) usually from data collected on large samples of families or individuals. The ultimate goal of such analysis is the identification of genes and genetic variations that influence disease susceptibility. Although of extreme interest and importance, the fact that many genes and environmental factors contribute to neuropsychiatric diseases of public health importance (eg, schizophrenia, bipolar disorder, and depression) complicates relevant studies and suggests that very sophisticated mathematical and statistical modeling may be required. In addition, large-scale contemporary human DNA sequencing and related projects, such as the Human Genome Project and the International HapMap Project, as well as the development of high-throughput DNA sequencing and genotyping technologies have provided statistical geneticists with a great deal of very relevant and appropriate information and resources. Unfortunately, the use of these resources and their interpretation are not straightforward when applied to complex, multifactorial diseases such as schizophrenia. In this brief and largely nonmathematical review of the field of statistical genetics, we describe many of the main concepts, definitions, and issues that motivate contemporary research. We also provide a discussion of the most pressing contemporary problems that demand further research if progress is to be made in the identification of genes and genetic variations that predispose to complex neuropsychiatric diseases.

  15. Statistical physics of vaccination

    NASA Astrophysics Data System (ADS)

    Wang, Zhen; Bauch, Chris T.; Bhattacharyya, Samit; d'Onofrio, Alberto; Manfredi, Piero; Perc, Matjaž; Perra, Nicola; Salathé, Marcel; Zhao, Dawei

    2016-12-01

    Historically, infectious diseases caused considerable damage to human societies, and they continue to do so today. To help reduce their impact, mathematical models of disease transmission have been studied to help understand disease dynamics and inform prevention strategies. Vaccination-one of the most important preventive measures of modern times-is of great interest both theoretically and empirically. And in contrast to traditional approaches, recent research increasingly explores the pivotal implications of individual behavior and heterogeneous contact patterns in populations. Our report reviews the developmental arc of theoretical epidemiology with emphasis on vaccination, as it led from classical models assuming homogeneously mixing (mean-field) populations and ignoring human behavior, to recent models that account for behavioral feedback and/or population spatial/social structure. Many of the methods used originated in statistical physics, such as lattice and network models, and their associated analytical frameworks. Similarly, the feedback loop between vaccinating behavior and disease propagation forms a coupled nonlinear system with analogs in physics. We also review the new paradigm of digital epidemiology, wherein sources of digital data such as online social media are mined for high-resolution information on epidemiologically relevant individual behavior. Armed with the tools and concepts of statistical physics, and further assisted by new sources of digital data, models that capture nonlinear interactions between behavior and disease dynamics offer a novel way of modeling real-world phenomena, and can help improve health outcomes. We conclude the review by discussing open problems in the field and promising directions for future research.

  16. Stochastical modeling for Viral Disease: Statistical Mechanics and Network Theory

    NASA Astrophysics Data System (ADS)

    Zhou, Hao; Deem, Michael

    2007-04-01

    Theoretical methods of statistical mechanics are developed and applied to study the immunological response against viral disease, such as dengue. We use this theory to show how the immune response to four different dengue serotypes may be sculpted. It is the ability of avian influenza, to change and to mix, that has given rise to the fear of a new human flu pandemic. Here we propose to utilize a scale free network based stochastic model to investigate the mitigation strategies and analyze the risk.

  17. Traversing the conceptual divide between biological and statistical epistasis: systems biology and a more modern synthesis.

    PubMed

    Moore, Jason H; Williams, Scott M

    2005-06-01

    Epistasis plays an important role in the genetic architecture of common human diseases and can be viewed from two perspectives, biological and statistical, each derived from and leading to different assumptions and research strategies. Biological epistasis is the result of physical interactions among biomolecules within gene regulatory networks and biochemical pathways in an individual such that the effect of a gene on a phenotype is dependent on one or more other genes. In contrast, statistical epistasis is defined as deviation from additivity in a mathematical model summarizing the relationship between multilocus genotypes and phenotypic variation in a population. The goal of this essay is to review definitions and examples of biological and statistical epistasis and to explore the relationship between the two. Specifically, we present and discuss the following two questions in the context of human health and disease. First, when does statistical evidence of epistasis in human populations imply underlying biomolecular interactions in the etiology of disease? Second, when do biomolecular interactions produce patterns of statistical epistasis in human populations? Answers to these two reciprocal questions will provide an important framework for using genetic information to improve our ability to diagnose, prevent and treat common human diseases. We propose that systems biology will provide the necessary information for addressing these questions and that model systems such as bacteria, yeast and digital organisms will be a useful place to start.

  18. Simulation, identification and statistical variation in cardiovascular analysis (SISCA) - A software framework for multi-compartment lumped modeling.

    PubMed

    Huttary, Rudolf; Goubergrits, Leonid; Schütte, Christof; Bernhard, Stefan

    2017-08-01

    It has not yet been possible to obtain modeling approaches suitable for covering a wide range of real world scenarios in cardiovascular physiology because many of the system parameters are uncertain or even unknown. Natural variability and statistical variation of cardiovascular system parameters in healthy and diseased conditions are characteristic features for understanding cardiovascular diseases in more detail. This paper presents SISCA, a novel software framework for cardiovascular system modeling and its MATLAB implementation. The framework defines a multi-model statistical ensemble approach for dimension reduced, multi-compartment models and focuses on statistical variation, system identification and patient-specific simulation based on clinical data. We also discuss a data-driven modeling scenario as a use case example. The regarded dataset originated from routine clinical examinations and comprised typical pre and post surgery clinical data from a patient diagnosed with coarctation of aorta. We conducted patient and disease specific pre/post surgery modeling by adapting a validated nominal multi-compartment model with respect to structure and parametrization using metadata and MRI geometry. In both models, the simulation reproduced measured pressures and flows fairly well with respect to stenosis and stent treatment and by pre-treatment cross stenosis phase shift of the pulse wave. However, with post-treatment data showing unrealistic phase shifts and other more obvious inconsistencies within the dataset, the methods and results we present suggest that conditioning and uncertainty management of routine clinical data sets needs significantly more attention to obtain reasonable results in patient-specific cardiovascular modeling. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. Multivariate mixed linear model analysis of longitudinal data: an information-rich statistical technique for analyzing disease resistance data

    USDA-ARS?s Scientific Manuscript database

    The mixed linear model (MLM) is currently among the most advanced and flexible statistical modeling techniques and its use in tackling problems in plant pathology has begun surfacing in the literature. The longitudinal MLM is a multivariate extension that handles repeatedly measured data, such as r...

  20. Forecast and virtual weather driven plant disease risk modeling system

    USDA-ARS?s Scientific Manuscript database

    We describe a system in use and development that leverages public weather station data, several spatialized weather forecast types, leaf wetness estimation, generic plant disease models, and online statistical evaluation. Convergent technological developments in all these areas allow, with funding f...

  1. Model-based economic evaluation in Alzheimer's disease: a review of the methods available to model Alzheimer's disease progression.

    PubMed

    Green, Colin; Shearer, James; Ritchie, Craig W; Zajicek, John P

    2011-01-01

    To consider the methods available to model Alzheimer's disease (AD) progression over time to inform on the structure and development of model-based evaluations, and the future direction of modelling methods in AD. A systematic search of the health care literature was undertaken to identify methods to model disease progression in AD. Modelling methods are presented in a descriptive review. The literature search identified 42 studies presenting methods or applications of methods to model AD progression over time. The review identified 10 general modelling frameworks available to empirically model the progression of AD as part of a model-based evaluation. Seven of these general models are statistical models predicting progression of AD using a measure of cognitive function. The main concerns with models are on model structure, around the limited characterization of disease progression, and on the use of a limited number of health states to capture events related to disease progression over time. None of the available models have been able to present a comprehensive model of the natural history of AD. Although helpful, there are serious limitations in the methods available to model progression of AD over time. Advances are needed to better model the progression of AD and the effects of the disease on peoples' lives. Recent evidence supports the need for a multivariable approach to the modelling of AD progression, and indicates that a latent variable analytic approach to characterising AD progression is a promising avenue for advances in the statistical development of modelling methods. Copyright © 2011 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.

  2. Cure Models as a Useful Statistical Tool for Analyzing Survival

    PubMed Central

    Othus, Megan; Barlogie, Bart; LeBlanc, Michael L.; Crowley, John J.

    2013-01-01

    Cure models are a popular topic within statistical literature but are not as widely known in the clinical literature. Many patients with cancer can be long-term survivors of their disease, and cure models can be a useful tool to analyze and describe cancer survival data. The goal of this article is to review what a cure model is, explain when cure models can be used, and use cure models to describe multiple myeloma survival trends. Multiple myeloma is generally considered an incurable disease, and this article shows that by using cure models, rather than the standard Cox proportional hazards model, we can evaluate whether there is evidence that therapies at the University of Arkansas for Medical Sciences induce a proportion of patients to be long-term survivors. PMID:22675175

  3. Evaluating the utility of companion animal tick surveillance practices for monitoring spread and occurrence of human Lyme disease in West Virginia, 2014-2016.

    PubMed

    Hendricks, Brian; Mark-Carew, Miguella; Conley, Jamison

    2017-11-13

    Domestic dogs and cats are potentially effective sentinel populations for monitoring occurrence and spread of Lyme disease. Few studies have evaluated the public health utility of sentinel programmes using geo-analytic approaches. Confirmed Lyme disease cases diagnosed by physicians and ticks submitted by veterinarians to the West Virginia State Health Department were obtained for 2014-2016. Ticks were identified to species, and only Ixodes scapularis were incorporated in the analysis. Separate ordinary least squares (OLS) and spatial lag regression models were conducted to estimate the association between average numbers of Ix. scapularis collected on pets and human Lyme disease incidence. Regression residuals were visualised using Local Moran's I as a diagnostic tool to identify spatial dependence. Statistically significant associations were identified between average numbers of Ix. scapularis collected from dogs and human Lyme disease in the OLS (β=20.7, P<0.001) and spatial lag (β=12.0, P=0.002) regression. No significant associations were identified for cats in either regression model. Statistically significant (P≤0.05) spatial dependence was identified in all regression models. Local Moran's I maps produced for spatial lag regression residuals indicated a decrease in model over- and under-estimation, but identified a higher number of statistically significant outliers than OLS regression. Results support previous conclusions that dogs are effective sentinel populations for monitoring risk of human exposure to Lyme disease. Findings reinforce the utility of spatial analysis of surveillance data, and highlight West Virginia's unique position within the eastern United States in regards to Lyme disease occurrence.

  4. Weather variability, tides, and Barmah Forest virus disease in the Gladstone region, Australia.

    PubMed

    Naish, Suchithra; Hu, Wenbiao; Nicholls, Neville; Mackenzie, John S; McMichael, Anthony J; Dale, Pat; Tong, Shilu

    2006-05-01

    In this study we examined the impact of weather variability and tides on the transmission of Barmah Forest virus (BFV) disease and developed a weather-based forecasting model for BFV disease in the Gladstone region, Australia. We used seasonal autoregressive integrated moving-average (SARIMA) models to determine the contribution of weather variables to BFV transmission after the time-series data of response and explanatory variables were made stationary through seasonal differencing. We obtained data on the monthly counts of BFV cases, weather variables (e.g., mean minimum and maximum temperature, total rainfall, and mean relative humidity), high and low tides, and the population size in the Gladstone region between January 1992 and December 2001 from the Queensland Department of Health, Australian Bureau of Meteorology, Queensland Department of Transport, and Australian Bureau of Statistics, respectively. The SARIMA model shows that the 5-month moving average of minimum temperature (b=0.15, p-value<0.001) was statistically significantly and positively associated with BFV disease, whereas high tide in the current month (b=-1.03, p-value=0.04) was statistically significantly and inversely associated with it. However, no significant association was found for other variables. These results may be applied to forecast the occurrence of BFV disease and to use public health resources in BFV control and prevention.

  5. Monitoring Method of Cow Anthrax Based on Gis and Spatial Statistical Analysis

    NASA Astrophysics Data System (ADS)

    Li, Lin; Yang, Yong; Wang, Hongbin; Dong, Jing; Zhao, Yujun; He, Jianbin; Fan, Honggang

    Geographic information system (GIS) is a computer application system, which possesses the ability of manipulating spatial information and has been used in many fields related with the spatial information management. Many methods and models have been established for analyzing animal diseases distribution models and temporal-spatial transmission models. Great benefits have been gained from the application of GIS in animal disease epidemiology. GIS is now a very important tool in animal disease epidemiological research. Spatial analysis function of GIS can be widened and strengthened by using spatial statistical analysis, allowing for the deeper exploration, analysis, manipulation and interpretation of spatial pattern and spatial correlation of the animal disease. In this paper, we analyzed the cow anthrax spatial distribution characteristics in the target district A (due to the secret of epidemic data we call it district A) based on the established GIS of the cow anthrax in this district in combination of spatial statistical analysis and GIS. The Cow anthrax is biogeochemical disease, and its geographical distribution is related closely to the environmental factors of habitats and has some spatial characteristics, and therefore the correct analysis of the spatial distribution of anthrax cow for monitoring and the prevention and control of anthrax has a very important role. However, the application of classic statistical methods in some areas is very difficult because of the pastoral nomadic context. The high mobility of livestock and the lack of enough suitable sampling for the some of the difficulties in monitoring currently make it nearly impossible to apply rigorous random sampling methods. It is thus necessary to develop an alternative sampling method, which could overcome the lack of sampling and meet the requirements for randomness. The GIS computer application software ArcGIS9.1 was used to overcome the lack of data of sampling sites.Using ArcGIS 9.1 and GEODA to analyze the cow anthrax spatial distribution of district A. we gained some conclusions about cow anthrax' density: (1) there is a spatial clustering model. (2) there is an intensely spatial autocorrelation. We established a prediction model to estimate the anthrax distribution based on the spatial characteristic of the density of cow anthrax. Comparing with the true distribution, the prediction model has a well coincidence and is feasible to the application. The method using a GIS tool facilitates can be implemented significantly in the cow anthrax monitoring and investigation, and the space statistics - related prediction model provides a fundamental use for other study on space-related animal diseases.

  6. Meta-markers for the differential diagnosis of lung cancer and lung disease.

    PubMed

    Kim, Yong-In; Ahn, Jung-Mo; Sung, Hye-Jin; Na, Sang-Su; Hwang, Jaesung; Kim, Yongdai; Cho, Je-Yoel

    2016-10-04

    Misdiagnosis of lung cancer remains a serious problem due to the difficulty of distinguishing lung cancer from other respiratory lung diseases. As a result, the development of serum-based differential diagnostic biomarkers is in high demand. In this study, 198 clinical serum samples from non-cancer lung disease and lung cancer patients were analyzed using nLC-MRM-MS for the levels of seven lung cancer biomarker candidates. When the candidates were assessed individually, only SERPINEA4 showed statistically significant changes in the serum levels. The MRM results and clinical information were analyzed using a logistic regression analysis to select model for the best 'meta-marker', or combination of biomarkers for differential diagnosis. Also, under consideration of statistical interaction, variables having low significance as a single factor but statistically influencing on meta-marker model were selected. Using this probabilistic classification, the best meta-marker was determined to be made up of two proteins SERPINA4 and PON1 with age factor. This meta-marker showed an enhanced differential diagnostic capability (AUC=0.915) for distinguishing the two patient groups. Our results suggest that a statistical model can determine optimal meta-markers, which may have better specificity and sensitivity than a single biomarker and thus improve the differential diagnosis of lung cancer and lung disease patients. Diagnosing lung cancer commonly involves the use of radiographic methods. However, an imaging-based diagnosis may fail to differentiate lung cancer from non-cancerous lung disease. In this study, we examined several serum proteins in the sera of 198 lung cancer and non-cancerous lung disease patients by multiple-reaction monitoring. We then used a combination of variables to generate a meta-marker model that is useful as a differential diagnostic biomarker. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

  7. Prediction of Ischemic Heart Disease and Stroke in Survivors of Childhood Cancer.

    PubMed

    Chow, Eric J; Chen, Yan; Hudson, Melissa M; Feijen, Elizabeth A M; Kremer, Leontien C; Border, William L; Green, Daniel M; Meacham, Lillian R; Mulrooney, Daniel A; Ness, Kirsten K; Oeffinger, Kevin C; Ronckers, Cécile M; Sklar, Charles A; Stovall, Marilyn; van der Pal, Helena J; van Dijk, Irma W E M; van Leeuwen, Flora E; Weathers, Rita E; Robison, Leslie L; Armstrong, Gregory T; Yasui, Yutaka

    2018-01-01

    Purpose We aimed to predict individual risk of ischemic heart disease and stroke in 5-year survivors of childhood cancer. Patients and Methods Participants in the Childhood Cancer Survivor Study (CCSS; n = 13,060) were observed through age 50 years for the development of ischemic heart disease and stroke. Siblings (n = 4,023) established the baseline population risk. Piecewise exponential models with backward selection estimated the relationships between potential predictors and each outcome. The St Jude Lifetime Cohort Study (n = 1,842) and the Emma Children's Hospital cohort (n = 1,362) were used to validate the CCSS models. Results Ischemic heart disease and stroke occurred in 265 and 295 CCSS participants, respectively. Risk scores based on a standard prediction model that included sex, chemotherapy, and radiotherapy (cranial, neck, and chest) exposures achieved an area under the curve and concordance statistic of 0.70 and 0.70 for ischemic heart disease and 0.63 and 0.66 for stroke, respectively. Validation cohort area under the curve and concordance statistics ranged from 0.66 to 0.67 for ischemic heart disease and 0.68 to 0.72 for stroke. Risk scores were collapsed to form statistically distinct low-, moderate-, and high-risk groups. The cumulative incidences at age 50 years among CCSS low-risk groups were < 5%, compared with approximately 20% for high-risk groups ( P < .001); cumulative incidence was only 1% for siblings ( P < .001 v low-risk survivors). Conclusion Information available to clinicians soon after completion of childhood cancer therapy can predict individual risk for subsequent ischemic heart disease and stroke with reasonable accuracy and discrimination through age 50 years. These models provide a framework on which to base future screening strategies and interventions.

  8. When More Transmission Equals Less Disease: Reconciling the Disconnect between Disease Hotspots and Parasite Transmission

    PubMed Central

    Park, Andrew W.; Magori, Krisztian; White, Brad A.; Stallknecht, David E.

    2013-01-01

    The assumed straightforward connection between transmission intensity and disease occurrence impacts surveillance and control efforts along with statistical methodology, including parameter inference and niche modeling. Many infectious disease systems have the potential for this connection to be more complicated–although demonstrating this in any given disease system has remained elusive. Hemorrhagic disease (HD) is one of the most important diseases of white-tailed deer and is caused by viruses in the Orbivirus genus. Like many infectious diseases, the probability or severity of disease increases with age (after loss of maternal antibodies) and the probability of disease is lower upon re-infection compared to first infection (based on cross-immunity between virus strains). These broad criteria generate a prediction that disease occurrence is maximized at intermediate levels of transmission intensity. Using published US field data, we first fit a statistical model to predict disease occurrence as a function of seroprevalence (a proxy for transmission intensity), demonstrating that states with intermediate seroprevalence have the highest level of case reporting. We subsequently introduce an independently parameterized mechanistic model supporting the theory that high case reporting should come from areas with intermediate levels of transmission. This is the first rigorous demonstration of this phenomenon and illustrates that variation in transmission rate (e.g. along an ecologically-controlled transmission gradient) can create cryptic refuges for infectious diseases. PMID:23579922

  9. Sequential Markov chain Monte Carlo filter with simultaneous model selection for electrocardiogram signal modeling.

    PubMed

    Edla, Shwetha; Kovvali, Narayan; Papandreou-Suppappola, Antonia

    2012-01-01

    Constructing statistical models of electrocardiogram (ECG) signals, whose parameters can be used for automated disease classification, is of great importance in precluding manual annotation and providing prompt diagnosis of cardiac diseases. ECG signals consist of several segments with different morphologies (namely the P wave, QRS complex and the T wave) in a single heart beat, which can vary across individuals and diseases. Also, existing statistical ECG models exhibit a reliance upon obtaining a priori information from the ECG data by using preprocessing algorithms to initialize the filter parameters, or to define the user-specified model parameters. In this paper, we propose an ECG modeling technique using the sequential Markov chain Monte Carlo (SMCMC) filter that can perform simultaneous model selection, by adaptively choosing from different representations depending upon the nature of the data. Our results demonstrate the ability of the algorithm to track various types of ECG morphologies, including intermittently occurring ECG beats. In addition, we use the estimated model parameters as the feature set to classify between ECG signals with normal sinus rhythm and four different types of arrhythmia.

  10. Development and validation of a risk calculator predicting exercise-induced ventricular arrhythmia in patients with cardiovascular disease.

    PubMed

    Hermes, Ilarraza-Lomelí; Marianna, García-Saldivia; Jessica, Rojano-Castillo; Carlos, Barrera-Ramírez; Rafael, Chávez-Domínguez; María Dolores, Rius-Suárez; Pedro, Iturralde

    2016-10-01

    Mortality due to cardiovascular disease is often associated with ventricular arrhythmias. Nowadays, patients with cardiovascular disease are more encouraged to take part in physical training programs. Nevertheless, high-intensity exercise is associated to a higher risk for sudden death, even in apparently healthy people. During an exercise testing (ET), health care professionals provide patients, in a controlled scenario, an intense physiological stimulus that could precipitate cardiac arrhythmia in high risk individuals. There is still no clinical or statistical tool to predict this incidence. The aim of this study was to develop a statistical model to predict the incidence of exercise-induced potentially life-threatening ventricular arrhythmia (PLVA) during high intensity exercise. 6415 patients underwent a symptom-limited ET with a Balke ramp protocol. A multivariate logistic regression model where the primary outcome was PLVA was performed. Incidence of PLVA was 548 cases (8.5%). After a bivariate model, thirty one clinical or ergometric variables were statistically associated with PLVA and were included in the regression model. In the multivariate model, 13 of these variables were found to be statistically significant. A regression model (G) with a X(2) of 283.987 and a p<0.001, was constructed. Significant variables included: heart failure, antiarrhythmic drugs, myocardial lower-VD, age and use of digoxin, nitrates, among others. This study allows clinicians to identify patients at risk of ventricular tachycardia or couplets during exercise, and to take preventive measures or appropriate supervision. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  11. Weather Variability, Tides, and Barmah Forest Virus Disease in the Gladstone Region, Australia

    PubMed Central

    Naish, Suchithra; Hu, Wenbiao; Nicholls, Neville; Mackenzie, John S.; McMichael, Anthony J.; Dale, Pat; Tong, Shilu

    2006-01-01

    In this study we examined the impact of weather variability and tides on the transmission of Barmah Forest virus (BFV) disease and developed a weather-based forecasting model for BFV disease in the Gladstone region, Australia. We used seasonal autoregressive integrated moving-average (SARIMA) models to determine the contribution of weather variables to BFV transmission after the time-series data of response and explanatory variables were made stationary through seasonal differencing. We obtained data on the monthly counts of BFV cases, weather variables (e.g., mean minimum and maximum temperature, total rainfall, and mean relative humidity), high and low tides, and the population size in the Gladstone region between January 1992 and December 2001 from the Queensland Department of Health, Australian Bureau of Meteorology, Queensland Department of Transport, and Australian Bureau of Statistics, respectively. The SARIMA model shows that the 5-month moving average of minimum temperature (β = 0.15, p-value < 0.001) was statistically significantly and positively associated with BFV disease, whereas high tide in the current month (β = −1.03, p-value = 0.04) was statistically significantly and inversely associated with it. However, no significant association was found for other variables. These results may be applied to forecast the occurrence of BFV disease and to use public health resources in BFV control and prevention. PMID:16675420

  12. A proposed metabolic strategy for monitoring disease progression in Alzheimer's disease.

    PubMed

    Greenberg, Nicola; Grassano, Antonio; Thambisetty, Madhav; Lovestone, Simon; Legido-Quigley, Cristina

    2009-04-01

    A specific, sensitive and essentially non-invasive assay to diagnose and monitor Alzheimer's disease (AD) would be valuable to both clinicians and medical researchers. The aim of this study was to perform a metabonomic statistical analysis on plasma fingerprints. Objectives were to investigate novel biomarkers indicative of AD, to consider the role of bile acids as AD biomarkers and to consider whether mild cognitive impairment (MCI) is a separate disease from AD. Samples were analysed by ultraperformance liquid chromatography-MS and resulting data sets were interpreted using soft-independent modelling of class analogy statistical analysis methods. PCA models did not show any grouping of subjects by disease state. Partial least-squares discriminant analysis (PLS-DS) models yielded class separation for AD. However, as with earlier studies, model validation revealed a predictive power of Q(2)<0.5 and indicating their unsuitability for predicting disease state. Three bile acids were extracted from the data and quantified, up-regulation was observed for MCI and AD patients. PLS-DA did not support MCI being considered as a separate disease from AD with MCI patient metabolic profiles being significantly closer to AD patients than controls. This study suggested that further investigation into the lipid fraction of the metabolome may yield useful biomarkers for AD and metabolomic profiles could be used to predict disease state in a clinical setting.

  13. Generalized functional linear models for gene-based case-control association studies.

    PubMed

    Fan, Ruzong; Wang, Yifan; Mills, James L; Carter, Tonia C; Lobach, Iryna; Wilson, Alexander F; Bailey-Wilson, Joan E; Weeks, Daniel E; Xiong, Momiao

    2014-11-01

    By using functional data analysis techniques, we developed generalized functional linear models for testing association between a dichotomous trait and multiple genetic variants in a genetic region while adjusting for covariates. Both fixed and mixed effect models are developed and compared. Extensive simulations show that Rao's efficient score tests of the fixed effect models are very conservative since they generate lower type I errors than nominal levels, and global tests of the mixed effect models generate accurate type I errors. Furthermore, we found that the Rao's efficient score test statistics of the fixed effect models have higher power than the sequence kernel association test (SKAT) and its optimal unified version (SKAT-O) in most cases when the causal variants are both rare and common. When the causal variants are all rare (i.e., minor allele frequencies less than 0.03), the Rao's efficient score test statistics and the global tests have similar or slightly lower power than SKAT and SKAT-O. In practice, it is not known whether rare variants or common variants in a gene region are disease related. All we can assume is that a combination of rare and common variants influences disease susceptibility. Thus, the improved performance of our models when the causal variants are both rare and common shows that the proposed models can be very useful in dissecting complex traits. We compare the performance of our methods with SKAT and SKAT-O on real neural tube defects and Hirschsprung's disease datasets. The Rao's efficient score test statistics and the global tests are more sensitive than SKAT and SKAT-O in the real data analysis. Our methods can be used in either gene-disease genome-wide/exome-wide association studies or candidate gene analyses. © 2014 WILEY PERIODICALS, INC.

  14. Generalized Functional Linear Models for Gene-based Case-Control Association Studies

    PubMed Central

    Mills, James L.; Carter, Tonia C.; Lobach, Iryna; Wilson, Alexander F.; Bailey-Wilson, Joan E.; Weeks, Daniel E.; Xiong, Momiao

    2014-01-01

    By using functional data analysis techniques, we developed generalized functional linear models for testing association between a dichotomous trait and multiple genetic variants in a genetic region while adjusting for covariates. Both fixed and mixed effect models are developed and compared. Extensive simulations show that Rao's efficient score tests of the fixed effect models are very conservative since they generate lower type I errors than nominal levels, and global tests of the mixed effect models generate accurate type I errors. Furthermore, we found that the Rao's efficient score test statistics of the fixed effect models have higher power than the sequence kernel association test (SKAT) and its optimal unified version (SKAT-O) in most cases when the causal variants are both rare and common. When the causal variants are all rare (i.e., minor allele frequencies less than 0.03), the Rao's efficient score test statistics and the global tests have similar or slightly lower power than SKAT and SKAT-O. In practice, it is not known whether rare variants or common variants in a gene are disease-related. All we can assume is that a combination of rare and common variants influences disease susceptibility. Thus, the improved performance of our models when the causal variants are both rare and common shows that the proposed models can be very useful in dissecting complex traits. We compare the performance of our methods with SKAT and SKAT-O on real neural tube defects and Hirschsprung's disease data sets. The Rao's efficient score test statistics and the global tests are more sensitive than SKAT and SKAT-O in the real data analysis. Our methods can be used in either gene-disease genome-wide/exome-wide association studies or candidate gene analyses. PMID:25203683

  15. A novel approach to simulate gene-environment interactions in complex diseases.

    PubMed

    Amato, Roberto; Pinelli, Michele; D'Andrea, Daniel; Miele, Gennaro; Nicodemi, Mario; Raiconi, Giancarlo; Cocozza, Sergio

    2010-01-05

    Complex diseases are multifactorial traits caused by both genetic and environmental factors. They represent the major part of human diseases and include those with largest prevalence and mortality (cancer, heart disease, obesity, etc.). Despite a large amount of information that has been collected about both genetic and environmental risk factors, there are few examples of studies on their interactions in epidemiological literature. One reason can be the incomplete knowledge of the power of statistical methods designed to search for risk factors and their interactions in these data sets. An improvement in this direction would lead to a better understanding and description of gene-environment interactions. To this aim, a possible strategy is to challenge the different statistical methods against data sets where the underlying phenomenon is completely known and fully controllable, for example simulated ones. We present a mathematical approach that models gene-environment interactions. By this method it is possible to generate simulated populations having gene-environment interactions of any form, involving any number of genetic and environmental factors and also allowing non-linear interactions as epistasis. In particular, we implemented a simple version of this model in a Gene-Environment iNteraction Simulator (GENS), a tool designed to simulate case-control data sets where a one gene-one environment interaction influences the disease risk. The main aim has been to allow the input of population characteristics by using standard epidemiological measures and to implement constraints to make the simulator behaviour biologically meaningful. By the multi-logistic model implemented in GENS it is possible to simulate case-control samples of complex disease where gene-environment interactions influence the disease risk. The user has full control of the main characteristics of the simulated population and a Monte Carlo process allows random variability. A knowledge-based approach reduces the complexity of the mathematical model by using reasonable biological constraints and makes the simulation more understandable in biological terms. Simulated data sets can be used for the assessment of novel statistical methods or for the evaluation of the statistical power when designing a study.

  16. A spatial scan statistic for nonisotropic two-level risk cluster.

    PubMed

    Li, Xiao-Zhou; Wang, Jin-Feng; Yang, Wei-Zhong; Li, Zhong-Jie; Lai, Sheng-Jie

    2012-01-30

    Spatial scan statistic methods are commonly used for geographical disease surveillance and cluster detection. The standard spatial scan statistic does not model any variability in the underlying risks of subregions belonging to a detected cluster. For a multilevel risk cluster, the isotonic spatial scan statistic could model a centralized high-risk kernel in the cluster. Because variations in disease risks are anisotropic owing to different social, economical, or transport factors, the real high-risk kernel will not necessarily take the central place in a whole cluster area. We propose a spatial scan statistic for a nonisotropic two-level risk cluster, which could be used to detect a whole cluster and a noncentralized high-risk kernel within the cluster simultaneously. The performance of the three methods was evaluated through an intensive simulation study. Our proposed nonisotropic two-level method showed better power and geographical precision with two-level risk cluster scenarios, especially for a noncentralized high-risk kernel. Our proposed method is illustrated using the hand-foot-mouth disease data in Pingdu City, Shandong, China in May 2009, compared with two other methods. In this practical study, the nonisotropic two-level method is the only way to precisely detect a high-risk area in a detected whole cluster. Copyright © 2011 John Wiley & Sons, Ltd.

  17. The Hanford Thyroid Disease Study: an alternative view of the findings.

    PubMed

    Hoffman, F Owen; Ruttenber, A James; Apostoaei, A Iulian; Carroll, Raymond J; Greenland, Sander

    2007-02-01

    The Hanford Thyroid Disease Study (HTDS) is one of the largest and most complex epidemiologic studies of the relation between environmental exposures to I and thyroid disease. The study detected no dose-response relation using a 0.05 level for statistical significance. The results for thyroid cancer appear inconsistent with those from other studies of populations with similar exposures, and either reflect inadequate statistical power, bias, or unique relations between exposure and disease risk. In this paper, we explore these possibilities, and present evidence that the HTDS statistical power was inadequate due to complex uncertainties associated with the mathematical models and assumptions used to reconstruct individual doses. We conclude that, at the very least, the confidence intervals reported by the HTDS for thyroid cancer and other thyroid diseases are too narrow because they fail to reflect key uncertainties in the measurement-error structure. We recommend that the HTDS results be interpreted as inconclusive rather than as evidence for little or no disease risk from Hanford exposures.

  18. Cognitive predictors of balance in Parkinson's disease.

    PubMed

    Fernandes, Ângela; Mendes, Andreia; Rocha, Nuno; Tavares, João Manuel R S

    2016-06-01

    Postural instability is one of the most incapacitating symptoms of Parkinson's disease (PD) and appears to be related to cognitive deficits. This study aims to determine the cognitive factors that can predict deficits in static and dynamic balance in individuals with PD. A sociodemographic questionnaire characterized 52 individuals with PD for this work. The Trail Making Test, Rule Shift Cards Test, and Digit Span Test assessed the executive functions. The static balance was assessed using a plantar pressure platform, and dynamic balance was based on the Timed Up and Go Test. The results were statistically analysed using SPSS Statistics software through linear regression analysis. The results show that a statistically significant model based on cognitive outcomes was able to explain the variance of motor variables. Also, the explanatory value of the model tended to increase with the addition of individual and clinical variables, although the resulting model was not statistically significant The model explained 25-29% of the variability of the Timed Up and Go Test, while for the anteroposterior displacement it was 23-34%, and for the mediolateral displacement it was 24-39%. From the findings, we conclude that the cognitive performance, especially the executive functions, is a predictor of balance deficit in individuals with PD.

  19. Association of a Bacteriophage with Meningococcal Disease in Young Adults

    PubMed Central

    Gray, Stephen J.; Kaczmarski, Edward B.; McCarthy, Noel D.; Nassif, Xavier; Maiden, Martin C. J.; Tinsley, Colin R.

    2008-01-01

    Despite being the agent of life-threatening meningitis, Neisseria meningitidis is usually carried asymptomatically in the nasopharynx of humans and only occasionally causes disease. The genetic bases for virulence have not been entirely elucidated and the search for new virulence factors in this species is hampered by the lack of an animal model representative of the human disease. As an alternative strategy we employ a molecular epidemiological approach to establish a statistical association of a candidate virulence gene with disease in the human population. We examine the distribution of a previously-identified genetic element, a temperate bacteriophage, in 1288 meningococci isolated from cases of disease and asymptomatic carriage. The phage was over-represented in disease isolates from young adults indicating that it may contribute to invasive disease in this age group. Further statistical analysis indicated that between 20% and 45% of the pathogenic potential of the five most common disease-causing meningococcal groups was linked to the presence of the phage. In the absence of an animal model of human disease, this molecular epidemiological approach permitted the estimation of the influence of the candidate virulence factor. Such an approach is particularly valuable in the investigation of exclusively human diseases. PMID:19065260

  20. Routine Discovery of Complex Genetic Models using Genetic Algorithms

    PubMed Central

    Moore, Jason H.; Hahn, Lance W.; Ritchie, Marylyn D.; Thornton, Tricia A.; White, Bill C.

    2010-01-01

    Simulation studies are useful in various disciplines for a number of reasons including the development and evaluation of new computational and statistical methods. This is particularly true in human genetics and genetic epidemiology where new analytical methods are needed for the detection and characterization of disease susceptibility genes whose effects are complex, nonlinear, and partially or solely dependent on the effects of other genes (i.e. epistasis or gene-gene interaction). Despite this need, the development of complex genetic models that can be used to simulate data is not always intuitive. In fact, only a few such models have been published. We have previously developed a genetic algorithm approach to discovering complex genetic models in which two single nucleotide polymorphisms (SNPs) influence disease risk solely through nonlinear interactions. In this paper, we extend this approach for the discovery of high-order epistasis models involving three to five SNPs. We demonstrate that the genetic algorithm is capable of routinely discovering interesting high-order epistasis models in which each SNP influences risk of disease only through interactions with the other SNPs in the model. This study opens the door for routine simulation of complex gene-gene interactions among SNPs for the development and evaluation of new statistical and computational approaches for identifying common, complex multifactorial disease susceptibility genes. PMID:20948983

  1. Enrichment of statistical power for genome-wide association studies

    USDA-ARS?s Scientific Manuscript database

    The inheritance of most human diseases and agriculturally important traits is controlled by many genes with small effects. Identifying these genes, while simultaneously controlling false positives, is challenging. Among available statistical methods, the mixed linear model (MLM) has been the most fl...

  2. Bladder cancer mapping in Libya based on standardized morbidity ratio and log-normal model

    NASA Astrophysics Data System (ADS)

    Alhdiri, Maryam Ahmed; Samat, Nor Azah; Mohamed, Zulkifley

    2017-05-01

    Disease mapping contains a set of statistical techniques that detail maps of rates based on estimated mortality, morbidity, and prevalence. A traditional approach to measure the relative risk of the disease is called Standardized Morbidity Ratio (SMR). It is the ratio of an observed and expected number of accounts in an area, which has the greatest uncertainty if the disease is rare or if geographical area is small. Therefore, Bayesian models or statistical smoothing based on Log-normal model are introduced which might solve SMR problem. This study estimates the relative risk for bladder cancer incidence in Libya from 2006 to 2007 based on the SMR and log-normal model, which were fitted to data using WinBUGS software. This study starts with a brief review of these models, starting with the SMR method and followed by the log-normal model, which is then applied to bladder cancer incidence in Libya. All results are compared using maps and tables. The study concludes that the log-normal model gives better relative risk estimates compared to the classical method. The log-normal model has can overcome the SMR problem when there is no observed bladder cancer in an area.

  3. Prediction model to estimate presence of coronary artery disease: retrospective pooled analysis of existing cohorts

    PubMed Central

    Genders, Tessa S S; Steyerberg, Ewout W; Nieman, Koen; Galema, Tjebbe W; Mollet, Nico R; de Feyter, Pim J; Krestin, Gabriel P; Alkadhi, Hatem; Leschka, Sebastian; Desbiolles, Lotus; Meijs, Matthijs F L; Cramer, Maarten J; Knuuti, Juhani; Kajander, Sami; Bogaert, Jan; Goetschalckx, Kaatje; Cademartiri, Filippo; Maffei, Erica; Martini, Chiara; Seitun, Sara; Aldrovandi, Annachiara; Wildermuth, Simon; Stinn, Björn; Fornaro, Jürgen; Feuchtner, Gudrun; De Zordo, Tobias; Auer, Thomas; Plank, Fabian; Friedrich, Guy; Pugliese, Francesca; Petersen, Steffen E; Davies, L Ceri; Schoepf, U Joseph; Rowe, Garrett W; van Mieghem, Carlos A G; van Driessche, Luc; Sinitsyn, Valentin; Gopalan, Deepa; Nikolaou, Konstantin; Bamberg, Fabian; Cury, Ricardo C; Battle, Juan; Maurovich-Horvat, Pál; Bartykowszki, Andrea; Merkely, Bela; Becker, Dávid; Hadamitzky, Martin; Hausleiter, Jörg; Dewey, Marc; Zimmermann, Elke; Laule, Michael

    2012-01-01

    Objectives To develop prediction models that better estimate the pretest probability of coronary artery disease in low prevalence populations. Design Retrospective pooled analysis of individual patient data. Setting 18 hospitals in Europe and the United States. Participants Patients with stable chest pain without evidence for previous coronary artery disease, if they were referred for computed tomography (CT) based coronary angiography or catheter based coronary angiography (indicated as low and high prevalence settings, respectively). Main outcome measures Obstructive coronary artery disease (≥50% diameter stenosis in at least one vessel found on catheter based coronary angiography). Multiple imputation accounted for missing predictors and outcomes, exploiting strong correlation between the two angiography procedures. Predictive models included a basic model (age, sex, symptoms, and setting), clinical model (basic model factors and diabetes, hypertension, dyslipidaemia, and smoking), and extended model (clinical model factors and use of the CT based coronary calcium score). We assessed discrimination (c statistic), calibration, and continuous net reclassification improvement by cross validation for the four largest low prevalence datasets separately and the smaller remaining low prevalence datasets combined. Results We included 5677 patients (3283 men, 2394 women), of whom 1634 had obstructive coronary artery disease found on catheter based coronary angiography. All potential predictors were significantly associated with the presence of disease in univariable and multivariable analyses. The clinical model improved the prediction, compared with the basic model (cross validated c statistic improvement from 0.77 to 0.79, net reclassification improvement 35%); the coronary calcium score in the extended model was a major predictor (0.79 to 0.88, 102%). Calibration for low prevalence datasets was satisfactory. Conclusions Updated prediction models including age, sex, symptoms, and cardiovascular risk factors allow for accurate estimation of the pretest probability of coronary artery disease in low prevalence populations. Addition of coronary calcium scores to the prediction models improves the estimates. PMID:22692650

  4. A space-time scan statistic for detecting emerging outbreaks.

    PubMed

    Tango, Toshiro; Takahashi, Kunihiko; Kohriyama, Kazuaki

    2011-03-01

    As a major analytical method for outbreak detection, Kulldorff's space-time scan statistic (2001, Journal of the Royal Statistical Society, Series A 164, 61-72) has been implemented in many syndromic surveillance systems. Since, however, it is based on circular windows in space, it has difficulty correctly detecting actual noncircular clusters. Takahashi et al. (2008, International Journal of Health Geographics 7, 14) proposed a flexible space-time scan statistic with the capability of detecting noncircular areas. It seems to us, however, that the detection of the most likely cluster defined in these space-time scan statistics is not the same as the detection of localized emerging disease outbreaks because the former compares the observed number of cases with the conditional expected number of cases. In this article, we propose a new space-time scan statistic which compares the observed number of cases with the unconditional expected number of cases, takes a time-to-time variation of Poisson mean into account, and implements an outbreak model to capture localized emerging disease outbreaks more timely and correctly. The proposed models are illustrated with data from weekly surveillance of the number of absentees in primary schools in Kitakyushu-shi, Japan, 2006. © 2010, The International Biometric Society.

  5. Application of statistical mining in healthcare data management for allergic diseases

    NASA Astrophysics Data System (ADS)

    Wawrzyniak, Zbigniew M.; Martínez Santolaya, Sara

    2014-11-01

    The paper aims to discuss data mining techniques based on statistical tools in medical data management in case of long-term diseases. The data collected from a population survey is the source for reasoning and identifying disease processes responsible for patient's illness and its symptoms, and prescribing a knowledge and decisions in course of action to correct patient's condition. The case considered as a sample of constructive approach to data management is a dependence of allergic diseases of chronic nature on some symptoms and environmental conditions. The knowledge summarized in a systematic way as accumulated experience constitutes to an experiential simplified model of the diseases with feature space constructed of small set of indicators. We have presented the model of disease-symptom-opinion with knowledge discovery for data management in healthcare. The feature is evident that the model is purely data-driven to evaluate the knowledge of the diseases` processes and probability dependence of future disease events on symptoms and other attributes. The example done from the outcomes of the survey of long-term (chronic) disease shows that a small set of core indicators as 4 or more symptoms and opinions could be very helpful in reflecting health status change over disease causes. Furthermore, the data driven understanding of the mechanisms of diseases gives physicians the basis for choices of treatment what outlines the need of data governance in this research domain of discovered knowledge from surveys.

  6. Cost effectiveness of a targeted age-based West Nile virus vaccination program.

    PubMed

    Shankar, Manjunath B; Staples, J Erin; Meltzer, Martin I; Fischer, Marc

    2017-05-25

    West Nile virus (WNV) is the leading cause of domestically-acquired arboviral disease in the United States. Several WNV vaccines are in various stages of development. We estimate the cost-effectiveness of WNV vaccination programs targeting groups at increased risk for severe WNV disease. We used a mathematical model to estimate costs and health outcomes of vaccination with WNV vaccine compared to no vaccination among seven cohorts, spaced at 10year intervals from ages 10 to 70years, each followed until 90-years-old. U.S. surveillance data were used to estimate WNV neuroinvasive disease incidence. Data for WNV seroprevalence, acute and long-term care costs of WNV disease patients, quality-adjusted life-years (QALYs), and vaccine characteristics were obtained from published reports. We assumed vaccine efficacy to either last lifelong or for 10years with booster doses given every 10years. There was a statistically significant difference in cost-effectiveness ratios across cohorts in both models and all outcomes assessed (Kruskal-Wallis test p<0.0001). The 60-year-cohort had a mean cost per neuroinvasive disease case prevented of $664,000 and disability averted of $1,421,000 in lifelong model and $882,000 and $1,887,000, respectively in 10-year immunity model; these costs were statistically significantly lower than costs for other cohorts (p<0.0001). Vaccinating 70-year-olds had the lowest cost per death averted in both models at around $4.7 million (95%CI $2-$8 million). Cost per disease case averted was lowest among 40- and 50-year-old cohorts and cost per QALY saved lowest among 60-year cohorts in lifelong immunity model. The models were most sensitive to disease incidence, vaccine cost, and proportion of persons developing disease among infected. Age-based WNV vaccination program targeting those at higher risk for severe disease is more cost-effective than universal vaccination. Annual variation in WNV disease incidence, QALY weights, and vaccine costs impact the cost effectiveness ratios. Published by Elsevier Ltd.

  7. Statistical inference to advance network models in epidemiology.

    PubMed

    Welch, David; Bansal, Shweta; Hunter, David R

    2011-03-01

    Contact networks are playing an increasingly important role in the study of epidemiology. Most of the existing work in this area has focused on considering the effect of underlying network structure on epidemic dynamics by using tools from probability theory and computer simulation. This work has provided much insight on the role that heterogeneity in host contact patterns plays on infectious disease dynamics. Despite the important understanding afforded by the probability and simulation paradigm, this approach does not directly address important questions about the structure of contact networks such as what is the best network model for a particular mode of disease transmission, how parameter values of a given model should be estimated, or how precisely the data allow us to estimate these parameter values. We argue that these questions are best answered within a statistical framework and discuss the role of statistical inference in estimating contact networks from epidemiological data. Copyright © 2011 Elsevier B.V. All rights reserved.

  8. Multimodal Image Analysis in Alzheimer’s Disease via Statistical Modelling of Non-local Intensity Correlations

    NASA Astrophysics Data System (ADS)

    Lorenzi, Marco; Simpson, Ivor J.; Mendelson, Alex F.; Vos, Sjoerd B.; Cardoso, M. Jorge; Modat, Marc; Schott, Jonathan M.; Ourselin, Sebastien

    2016-04-01

    The joint analysis of brain atrophy measured with magnetic resonance imaging (MRI) and hypometabolism measured with positron emission tomography with fluorodeoxyglucose (FDG-PET) is of primary importance in developing models of pathological changes in Alzheimer’s disease (AD). Most of the current multimodal analyses in AD assume a local (spatially overlapping) relationship between MR and FDG-PET intensities. However, it is well known that atrophy and hypometabolism are prominent in different anatomical areas. The aim of this work is to describe the relationship between atrophy and hypometabolism by means of a data-driven statistical model of non-overlapping intensity correlations. For this purpose, FDG-PET and MRI signals are jointly analyzed through a computationally tractable formulation of partial least squares regression (PLSR). The PLSR model is estimated and validated on a large clinical cohort of 1049 individuals from the ADNI dataset. Results show that the proposed non-local analysis outperforms classical local approaches in terms of predictive accuracy while providing a plausible description of disease dynamics: early AD is characterised by non-overlapping temporal atrophy and temporo-parietal hypometabolism, while the later disease stages show overlapping brain atrophy and hypometabolism spread in temporal, parietal and cortical areas.

  9. Respiratory Disease Related Mortality and Morbidity on an Island of Greece Exposed to Perlite and Bentonite Mining Dust

    PubMed Central

    Sampatakakis, Stefanos; Linos, Athena; Papadimitriou, Eleni; Petralias, Athanasios; Dalma, Archontoula; Papasaranti, Eirini Saranti; Christoforidou, Eleni; Stoltidis, Melina

    2013-01-01

    A morbidity and mortality study took place, focused on Milos Island, where perlite and bentonite mining sites are located. Official data concerning number and cause of deaths, regarding specific respiratory diseases and the total of respiratory diseases, for both Milos Island and the Cyclades Prefecture were used. Standardized Mortality Ratios (SMRs) were computed, adjusted specifically for age, gender and calendar year. Tests of linear trend were performed. By means of a predefined questionnaire, the morbidity rates of specific respiratory diseases in Milos, were compared to those of the municipality of Oinofita, an industrial region. Chi-square analysis was used and the confounding factors of age, gender and smoking were taken into account, by estimating binary logistic regression models. The SMRs for Pneumonia and Chronic Obstructive Pulmonary Disease (COPD) were found elevated for both genders, although they did not reach statistical significance. For the total of respiratory diseases, a statistically significant SMR was identified regarding the decade 1989–1998. The morbidity study revealed elevated and statistically significant Odds Ratios (ORs), associated with allergic rhinitis, pneumonia, COPD and bronchiectasis. An elevated OR was also identified for asthma. After controlling for age, gender and smoking, the ORs were statistically significant and towards the same direction. PMID:24129114

  10. Respiratory disease related mortality and morbidity on an island of Greece exposed to perlite and bentonite mining dust.

    PubMed

    Sampatakakis, Stefanos; Linos, Athena; Papadimitriou, Eleni; Petralias, Athanasios; Dalma, Archontoula; Papasaranti, Eirini Saranti; Christoforidou, Eleni; Stoltidis, Melina

    2013-10-14

    A morbidity and mortality study took place, focused on Milos Island, where perlite and bentonite mining sites are located. Official data concerning number and cause of deaths, regarding specific respiratory diseases and the total of respiratory diseases, for both Milos Island and the Cyclades Prefecture were used. Standardized Mortality Ratios (SMRs) were computed, adjusted specifically for age, gender and calendar year. Tests of linear trend were performed. By means of a predefined questionnaire, the morbidity rates of specific respiratory diseases in Milos, were compared to those of the municipality of Oinofita, an industrial region. Chi-square analysis was used and the confounding factors of age, gender and smoking were taken into account, by estimating binary logistic regression models. The SMRs for Pneumonia and Chronic Obstructive Pulmonary Disease (COPD) were found elevated for both genders, although they did not reach statistical significance. For the total of respiratory diseases, a statistically significant SMR was identified regarding the decade 1989-1998. The morbidity study revealed elevated and statistically significant Odds Ratios (ORs), associated with allergic rhinitis, pneumonia, COPD and bronchiectasis. An elevated OR was also identified for asthma. After controlling for age, gender and smoking, the ORs were statistically significant and towards the same direction.

  11. Comparisons of non-Gaussian statistical models in DNA methylation analysis.

    PubMed

    Ma, Zhanyu; Teschendorff, Andrew E; Yu, Hong; Taghia, Jalil; Guo, Jun

    2014-06-16

    As a key regulatory mechanism of gene expression, DNA methylation patterns are widely altered in many complex genetic diseases, including cancer. DNA methylation is naturally quantified by bounded support data; therefore, it is non-Gaussian distributed. In order to capture such properties, we introduce some non-Gaussian statistical models to perform dimension reduction on DNA methylation data. Afterwards, non-Gaussian statistical model-based unsupervised clustering strategies are applied to cluster the data. Comparisons and analysis of different dimension reduction strategies and unsupervised clustering methods are presented. Experimental results show that the non-Gaussian statistical model-based methods are superior to the conventional Gaussian distribution-based method. They are meaningful tools for DNA methylation analysis. Moreover, among several non-Gaussian methods, the one that captures the bounded nature of DNA methylation data reveals the best clustering performance.

  12. Comparisons of Non-Gaussian Statistical Models in DNA Methylation Analysis

    PubMed Central

    Ma, Zhanyu; Teschendorff, Andrew E.; Yu, Hong; Taghia, Jalil; Guo, Jun

    2014-01-01

    As a key regulatory mechanism of gene expression, DNA methylation patterns are widely altered in many complex genetic diseases, including cancer. DNA methylation is naturally quantified by bounded support data; therefore, it is non-Gaussian distributed. In order to capture such properties, we introduce some non-Gaussian statistical models to perform dimension reduction on DNA methylation data. Afterwards, non-Gaussian statistical model-based unsupervised clustering strategies are applied to cluster the data. Comparisons and analysis of different dimension reduction strategies and unsupervised clustering methods are presented. Experimental results show that the non-Gaussian statistical model-based methods are superior to the conventional Gaussian distribution-based method. They are meaningful tools for DNA methylation analysis. Moreover, among several non-Gaussian methods, the one that captures the bounded nature of DNA methylation data reveals the best clustering performance. PMID:24937687

  13. A multi-level approach for investigating socio-economic and agricultural risk factors associated with rates of reported cases of Escherichia coli O157 in humans in Alberta, Canada.

    PubMed

    Pearl, D L; Louie, M; Chui, L; Doré, K; Grimsrud, K M; Martin, S W; Michel, P; Svenson, L W; McEwen, S A

    2009-10-01

    Using negative binomial and multi-level Poisson models, the authors determined the statistical significance of agricultural and socio-economic risk factors for rates of reported disease associated with Escherichia coli O157 in census subdivisions (CSDs) in Alberta, Canada, 2000-2002. Variables relating to population stability, aboriginal composition of the CSDs, and the economic relationship between CSDs and urban centres were significant risk factors. The percentage of individuals living in low-income households was not a statistically significant risk factor for rates of disease. The statistical significance of cattle density, recorded at a higher geographical level, depended on the method used to correct for overdispersion, the number of levels included in the multi-level models, and the choice of using all reported cases or only sporadic cases. Our results highlight the importance of local socio-economic risk factors in determining rates of disease associated with E. coli O157, but their relationship with individual risk factors requires further evaluation.

  14. Prognostic factors in patients with advanced cancer: use of the patient-generated subjective global assessment in survival prediction.

    PubMed

    Martin, Lisa; Watanabe, Sharon; Fainsinger, Robin; Lau, Francis; Ghosh, Sunita; Quan, Hue; Atkins, Marlis; Fassbender, Konrad; Downing, G Michael; Baracos, Vickie

    2010-10-01

    To determine whether elements of a standard nutritional screening assessment are independently prognostic of survival in patients with advanced cancer. A prospective nested cohort of patients with metastatic cancer were accrued from different units of a Regional Palliative Care Program. Patients completed a nutritional screen on admission. Data included age, sex, cancer site, height, weight history, dietary intake, 13 nutrition impact symptoms, and patient- and physician-reported performance status (PS). Univariate and multivariate survival analyses were conducted. Concordance statistics (c-statistics) were used to test the predictive accuracy of models based on training and validation sets; a c-statistic of 0.5 indicates the model predicts the outcome as well as chance; perfect prediction has a c-statistic of 1.0. A training set of patients in palliative home care (n = 1,164) was used to identify prognostic variables. Primary disease site, PS, short-term weight change (either gain or loss), dietary intake, and dysphagia predicted survival in multivariate analysis (P < .05). A model including only patients separated by disease site and PS with high c-statistics between predicted and observed responses for survival in the training set (0.90) and validation set (0.88; n = 603). The addition of weight change, dietary intake, and dysphagia did not further improve the c-statistic of the model. The c-statistic was also not altered by substituting physician-rated palliative PS for patient-reported PS. We demonstrate a high probability of concordance between predicted and observed survival for patients in distinct palliative care settings (home care, tertiary inpatient, ambulatory outpatient) based on patient-reported information.

  15. Heuristic Identification of Biological Architectures for Simulating Complex Hierarchical Genetic Interactions

    PubMed Central

    Moore, Jason H; Amos, Ryan; Kiralis, Jeff; Andrews, Peter C

    2015-01-01

    Simulation plays an essential role in the development of new computational and statistical methods for the genetic analysis of complex traits. Most simulations start with a statistical model using methods such as linear or logistic regression that specify the relationship between genotype and phenotype. This is appealing due to its simplicity and because these statistical methods are commonly used in genetic analysis. It is our working hypothesis that simulations need to move beyond simple statistical models to more realistically represent the biological complexity of genetic architecture. The goal of the present study was to develop a prototype genotype–phenotype simulation method and software that are capable of simulating complex genetic effects within the context of a hierarchical biology-based framework. Specifically, our goal is to simulate multilocus epistasis or gene–gene interaction where the genetic variants are organized within the framework of one or more genes, their regulatory regions and other regulatory loci. We introduce here the Heuristic Identification of Biological Architectures for simulating Complex Hierarchical Interactions (HIBACHI) method and prototype software for simulating data in this manner. This approach combines a biological hierarchy, a flexible mathematical framework, a liability threshold model for defining disease endpoints, and a heuristic search strategy for identifying high-order epistatic models of disease susceptibility. We provide several simulation examples using genetic models exhibiting independent main effects and three-way epistatic effects. PMID:25395175

  16. Disconcordance in Statistical Models of Bisphenol A and Chronic Disease Outcomes in NHANES 2003-08

    PubMed Central

    Casey, Martin F.; Neidell, Matthew

    2013-01-01

    Background Bisphenol A (BPA), a high production chemical commonly found in plastics, has drawn great attention from researchers due to the substance’s potential toxicity. Using data from three National Health and Nutrition Examination Survey (NHANES) cycles, we explored the consistency and robustness of BPA’s reported effects on coronary heart disease and diabetes. Methods And Findings We report the use of three different statistical models in the analysis of BPA: (1) logistic regression, (2) log-linear regression, and (3) dose-response logistic regression. In each variation, confounders were added in six blocks to account for demographics, urinary creatinine, source of BPA exposure, healthy behaviours, and phthalate exposure. Results were sensitive to the variations in functional form of our statistical models, but no single model yielded consistent results across NHANES cycles. Reported ORs were also found to be sensitive to inclusion/exclusion criteria. Further, observed effects, which were most pronounced in NHANES 2003-04, could not be explained away by confounding. Conclusions Limitations in the NHANES data and a poor understanding of the mode of action of BPA have made it difficult to develop informative statistical models. Given the sensitivity of effect estimates to functional form, researchers should report results using multiple specifications with different assumptions about BPA measurement, thus allowing for the identification of potential discrepancies in the data. PMID:24223205

  17. No sex differences in use of dopaminergic medication in early Parkinson disease in the US and Canada - baseline findings of a multicenter trial.

    PubMed

    Umeh, Chizoba C; Pérez, Adriana; Augustine, Erika F; Dhall, Rohit; Dewey, Richard B; Mari, Zoltan; Simon, David K; Wills, Anne-Marie A; Christine, Chadwick W; Schneider, Jay S; Suchowersky, Oksana

    2014-01-01

    Sex differences in Parkinson disease clinical features have been reported, but few studies have examined sex influences on use of dopaminergic medication in early Parkinson disease. The objective of this study was to test if there are differences in the type of dopaminergic medication used and levodopa equivalent daily dose between men and women with early Parkinson disease enrolled in a large multicenter study of Creatine as a potential disease modifying therapy - the National Institute of Neurological Disorders and Stroke Exploratory Trials in Parkinson Disease Long-Term Study-1. Baseline data of 1,741 participants from 45 participating sites were analyzed. Participants from the United States and Canada were enrolled within five years of Parkinson Disease diagnosis. Two outcome variables were studied: type of dopaminergic medication used and levodopa equivalent daily dose at baseline in the Long-Term Study-1. Chi-square statistic and linear regression models were used for statistical analysis. There were no statistically significant differences in the frequency of use of different types of dopaminergic medications at baseline between men and women with Parkinson Disease. A small but statistically significant difference was observed in the median unadjusted levodopa equivalent daily dose at baseline between women (300 mg) and men (325 mg), but this was not observed after controlling for disease duration (years since Parkinson disease diagnosis), disease severity (Unified Parkinson's Disease Rating Scale Motor and Activities of Daily Living Scores), and body weight. In this large multicenter study, we did not observe sex differences in the type and dose of dopaminergic medications used in early Parkinson Disease. Further research is needed to evaluate the influence of male or female sex on use of dopaminergic medication in mid- and late-stage Parkinson Disease.

  18. Leptospirosis disease mapping with standardized morbidity ratio and Poisson-Gamma model: An analysis of Leptospirosis disease in Kelantan, Malaysia

    NASA Astrophysics Data System (ADS)

    Che Awang, Aznida; Azah Samat, Nor

    2017-09-01

    Leptospirosis is a disease caused by the infection of pathogenic species from the genus of Leptospira. Human can be infected by the leptospirosis from direct or indirect exposure to the urine of infected animals. The excretion of urine from the animal host that carries pathogenic Leptospira causes the soil or water to be contaminated. Therefore, people can become infected when they are exposed to contaminated soil and water by cut on the skin as well as open wound. It also can enter the human body by mucous membrane such nose, eyes and mouth, for example by splashing contaminated water or urine into the eyes or swallowing contaminated water or food. Currently, there is no vaccine available for the prevention or treatment of leptospirosis disease but this disease can be treated if it is diagnosed early to avoid any complication. The disease risk mapping is important in a way to control and prevention of disease. Using a good choice of statistical model will produce a good disease risk map. Therefore, the aim of this study is to estimate the relative risk for leptospirosis disease based initially on the most common statistic used in disease mapping called Standardized Morbidity Ratio (SMR) and Poisson-gamma model. This paper begins by providing a review of the SMR method and Poisson-gamma model, which we then applied to leptospirosis data of Kelantan, Malaysia. Both results are displayed and compared using graph, tables and maps. The result shows that the second method Poisson-gamma model produces better relative risk estimates compared to the SMR method. This is because the Poisson-gamma model can overcome the drawback of SMR where the relative risk will become zero when there is no observed leptospirosis case in certain regions. However, the Poisson-gamma model also faced problems where the covariate adjustment for this model is difficult and no possibility for allowing spatial correlation between risks in neighbouring areas. The problems of this model have motivated many researchers to introduce other alternative methods for estimating the risk.

  19. A Supervised Statistical Learning Approach for Accurate Legionella pneumophila Source Attribution during Outbreaks

    PubMed Central

    Buultjens, Andrew H.; Chua, Kyra Y. L.; Baines, Sarah L.; Kwong, Jason; Gao, Wei; Cutcher, Zoe; Adcock, Stuart; Ballard, Susan; Schultz, Mark B.; Tomita, Takehiro; Subasinghe, Nela; Carter, Glen P.; Pidot, Sacha J.; Franklin, Lucinda; Seemann, Torsten; Gonçalves Da Silva, Anders

    2017-01-01

    ABSTRACT Public health agencies are increasingly relying on genomics during Legionnaires' disease investigations. However, the causative bacterium (Legionella pneumophila) has an unusual population structure, with extreme temporal and spatial genome sequence conservation. Furthermore, Legionnaires' disease outbreaks can be caused by multiple L. pneumophila genotypes in a single source. These factors can confound cluster identification using standard phylogenomic methods. Here, we show that a statistical learning approach based on L. pneumophila core genome single nucleotide polymorphism (SNP) comparisons eliminates ambiguity for defining outbreak clusters and accurately predicts exposure sources for clinical cases. We illustrate the performance of our method by genome comparisons of 234 L. pneumophila isolates obtained from patients and cooling towers in Melbourne, Australia, between 1994 and 2014. This collection included one of the largest reported Legionnaires' disease outbreaks, which involved 125 cases at an aquarium. Using only sequence data from L. pneumophila cooling tower isolates and including all core genome variation, we built a multivariate model using discriminant analysis of principal components (DAPC) to find cooling tower-specific genomic signatures and then used it to predict the origin of clinical isolates. Model assignments were 93% congruent with epidemiological data, including the aquarium Legionnaires' disease outbreak and three other unrelated outbreak investigations. We applied the same approach to a recently described investigation of Legionnaires' disease within a UK hospital and observed a model predictive ability of 86%. We have developed a promising means to breach L. pneumophila genetic diversity extremes and provide objective source attribution data for outbreak investigations. IMPORTANCE Microbial outbreak investigations are moving to a paradigm where whole-genome sequencing and phylogenetic trees are used to support epidemiological investigations. It is critical that outbreak source predictions are accurate, particularly for pathogens, like Legionella pneumophila, which can spread widely and rapidly via cooling system aerosols, causing Legionnaires' disease. Here, by studying hundreds of Legionella pneumophila genomes collected over 21 years around a major Australian city, we uncovered limitations with the phylogenetic approach that could lead to a misidentification of outbreak sources. We implement instead a statistical learning technique that eliminates the ambiguity of inferring disease transmission from phylogenies. Our approach takes geolocation information and core genome variation from environmental L. pneumophila isolates to build statistical models that predict with high confidence the environmental source of clinical L. pneumophila during disease outbreaks. We show the versatility of the technique by applying it to unrelated Legionnaires' disease outbreaks in Australia and the UK. PMID:28821546

  20. Consideration of species community composition in statistical ...

    EPA Pesticide Factsheets

    Diseases are increasing in marine ecosystems, and these increases have been attributed to a number of environmental factors including climate change, pollution, and overfishing. However, many studies pool disease prevalence into taxonomic groups, disregarding host species composition when comparing sites or assessing environmental impacts on patterns of disease presence. We used simulated data under a known environmental effect to assess the ability of standard statistical methods (binomial and linear regression, ANOVA) to detect a significant environmental effect on pooled disease prevalence with varying species abundance distributions and relative susceptibilities to disease. When one species was more susceptible to a disease and both species only partially overlapped in their distributions, models tended to produce a greater number of false positives (Type I error). Differences in disease risk between regions or along an environmental gradient tended to be underestimated, or even in the wrong direction, when highly susceptible taxa had reduced abundances in impacted sites, a situation likely to be common in nature. Including relative abundance as an additional variable in regressions improved model accuracy, but tended to be conservative, producing more false negatives (Type II error) when species abundance was strongly correlated with the environmental effect. Investigators should be cautious of underlying assumptions of species similarity in susceptib

  1. Multifactor-Dimensionality Reduction Reveals High-Order Interactions among Estrogen-Metabolism Genes in Sporadic Breast Cancer

    PubMed Central

    Ritchie, Marylyn D.; Hahn, Lance W.; Roodi, Nady; Bailey, L. Renee; Dupont, William D.; Parl, Fritz F.; Moore, Jason H.

    2001-01-01

    One of the greatest challenges facing human geneticists is the identification and characterization of susceptibility genes for common complex multifactorial human diseases. This challenge is partly due to the limitations of parametric-statistical methods for detection of gene effects that are dependent solely or partially on interactions with other genes and with environmental exposures. We introduce multifactor-dimensionality reduction (MDR) as a method for reducing the dimensionality of multilocus information, to improve the identification of polymorphism combinations associated with disease risk. The MDR method is nonparametric (i.e., no hypothesis about the value of a statistical parameter is made), is model-free (i.e., it assumes no particular inheritance model), and is directly applicable to case-control and discordant-sib-pair studies. Using simulated case-control data, we demonstrate that MDR has reasonable power to identify interactions among two or more loci in relatively small samples. When it was applied to a sporadic breast cancer case-control data set, in the absence of any statistically significant independent main effects, MDR identified a statistically significant high-order interaction among four polymorphisms from three different estrogen-metabolism genes. To our knowledge, this is the first report of a four-locus interaction associated with a common complex multifactorial disease. PMID:11404819

  2. A diagnostic model for chronic hypersensitivity pneumonitis

    PubMed Central

    Johannson, Kerri A; Elicker, Brett M; Vittinghoff, Eric; Assayag, Deborah; de Boer, Kaïssa; Golden, Jeffrey A; Jones, Kirk D; King, Talmadge E; Koth, Laura L; Lee, Joyce S; Ley, Brett; Wolters, Paul J; Collard, Harold R

    2017-01-01

    The objective of this study was to develop a diagnostic model that allows for a highly specific diagnosis of chronic hypersensitivity pneumonitis using clinical and radiological variables alone. Chronic hypersensitivity pneumonitis and other interstitial lung disease cases were retrospectively identified from a longitudinal database. High-resolution CT scans were blindly scored for radiographic features (eg, ground-glass opacity, mosaic perfusion) as well as the radiologist’s diagnostic impression. Candidate models were developed then evaluated using clinical and radiographic variables and assessed by the cross-validated C-statistic. Forty-four chronic hypersensitivity pneumonitis and eighty other interstitial lung disease cases were identified. Two models were selected based on their statistical performance, clinical applicability and face validity. Key model variables included age, down feather and/or bird exposure, radiographic presence of ground-glass opacity and mosaic perfusion and moderate or high confidence in the radiographic impression of chronic hypersensitivity pneumonitis. Models were internally validated with good performance, and cut-off values were established that resulted in high specificity for a diagnosis of chronic hypersensitivity pneumonitis. PMID:27245779

  3. A Model of Compound Heterozygous, Loss-of-Function Alleles Is Broadly Consistent with Observations from Complex-Disease GWAS Datasets

    PubMed Central

    Sanjak, Jaleal S.; Long, Anthony D.; Thornton, Kevin R.

    2017-01-01

    The genetic component of complex disease risk in humans remains largely unexplained. A corollary is that the allelic spectrum of genetic variants contributing to complex disease risk is unknown. Theoretical models that relate population genetic processes to the maintenance of genetic variation for quantitative traits may suggest profitable avenues for future experimental design. Here we use forward simulation to model a genomic region evolving under a balance between recurrent deleterious mutation and Gaussian stabilizing selection. We consider multiple genetic and demographic models, and several different methods for identifying genomic regions harboring variants associated with complex disease risk. We demonstrate that the model of gene action, relating genotype to phenotype, has a qualitative effect on several relevant aspects of the population genetic architecture of a complex trait. In particular, the genetic model impacts genetic variance component partitioning across the allele frequency spectrum and the power of statistical tests. Models with partial recessivity closely match the minor allele frequency distribution of significant hits from empirical genome-wide association studies without requiring homozygous effect sizes to be small. We highlight a particular gene-based model of incomplete recessivity that is appealing from first principles. Under that model, deleterious mutations in a genomic region partially fail to complement one another. This model of gene-based recessivity predicts the empirically observed inconsistency between twin and SNP based estimated of dominance heritability. Furthermore, this model predicts considerable levels of unexplained variance associated with intralocus epistasis. Our results suggest a need for improved statistical tools for region based genetic association and heritability estimation. PMID:28103232

  4. Modeling longitudinal data, I: principles of multivariate analysis.

    PubMed

    Ravani, Pietro; Barrett, Brendan; Parfrey, Patrick

    2009-01-01

    Statistical models are used to study the relationship between exposure and disease while accounting for the potential role of other factors' impact on outcomes. This adjustment is useful to obtain unbiased estimates of true effects or to predict future outcomes. Statistical models include a systematic component and an error component. The systematic component explains the variability of the response variable as a function of the predictors and is summarized in the effect estimates (model coefficients). The error element of the model represents the variability in the data unexplained by the model and is used to build measures of precision around the point estimates (confidence intervals).

  5. Mapping the Risks of Malaria, Dengue and Influenza Using Satellite Data

    NASA Astrophysics Data System (ADS)

    Kiang, R. K.; Soebiyanto, R. P.

    2012-07-01

    It has long been recognized that environment and climate may affect the transmission of infectious diseases. The effects are most obvious for vector-borne infectious diseases, such as malaria and dengue, but less so for airborne and contact diseases, such as seasonal influenza. In this paper, we examined the meteorological and environmental parameters that influence the transmission of malaria, dengue and seasonal influenza. Remotely sensed parameters that provide such parameters were discussed. Both statistical and biologically inspired, processed based models can be used to model the transmission of these diseases utilizing the remotely sensed parameters as input. Examples were given for modelling malaria in Thailand, dengue in Indonesia, and seasonal influenza in Hong Kong.

  6. Towards first principle medical diagnostics: on the importance of disease-disease and sign-sign interactions

    NASA Astrophysics Data System (ADS)

    Ramezanpour, Abolfazl; Mashaghi, Alireza

    2017-07-01

    A fundamental problem in medicine and biology is to assign states, e.g. healthy or diseased, to cells, organs or individuals. State assignment or making a diagnosis is often a nontrivial and challenging process and, with the advent of omics technologies, the diagnostic challenge is becoming more and more serious. The challenge lies not only in the increasing number of measured properties and dynamics of the system (e.g. cell or human body) but also in the co-evolution of multiple states and overlapping properties, and degeneracy of states. We develop, from first principles, a generic rational framework for state assignment in cell biology and medicine, and demonstrate its applicability with a few simple theoretical case studies from medical diagnostics. We show how disease-related statistical information can be used to build a comprehensive model that includes the relevant dependencies between clinical and laboratory findings (signs) and diseases. In particular, we include disease-disease and sign-sign interactions and study how one can infer the probability of a disease in a patient with given signs. We perform comparative analysis with simple benchmark models to check the performances of our models. We find that including interactions can significantly change the statistical importance of the signs and diseases. This first principles approach, as we show, facilitates the early diagnosis of disease by taking interactions into accounts, and enables the construction of consensus diagnostic flow charts. Additionally, we envision that our approach will find applications in systems biology, and in particular, in characterizing the phenome via the metabolome, the proteome, the transcriptome, and the genome.

  7. Predicting clinical diagnosis in Huntington's disease: An imaging polymarker

    PubMed Central

    Daws, Richard E.; Soreq, Eyal; Johnson, Eileanoir B.; Scahill, Rachael I.; Tabrizi, Sarah J.; Barker, Roger A.; Hampshire, Adam

    2018-01-01

    Objective Huntington's disease (HD) gene carriers can be identified before clinical diagnosis; however, statistical models for predicting when overt motor symptoms will manifest are too imprecise to be useful at the level of the individual. Perfecting this prediction is integral to the search for disease modifying therapies. This study aimed to identify an imaging marker capable of reliably predicting real‐life clinical diagnosis in HD. Method A multivariate machine learning approach was applied to resting‐state and structural magnetic resonance imaging scans from 19 premanifest HD gene carriers (preHD, 8 of whom developed clinical disease in the 5 years postscanning) and 21 healthy controls. A classification model was developed using cross‐group comparisons between preHD and controls, and within the preHD group in relation to “estimated” and “actual” proximity to disease onset. Imaging measures were modeled individually, and combined, and permutation modeling robustly tested classification accuracy. Results Classification performance for preHDs versus controls was greatest when all measures were combined. The resulting polymarker predicted converters with high accuracy, including those who were not expected to manifest in that time scale based on the currently adopted statistical models. Interpretation We propose that a holistic multivariate machine learning treatment of brain abnormalities in the premanifest phase can be used to accurately identify those patients within 5 years of developing motor features of HD, with implications for prognostication and preclinical trials. Ann Neurol 2018;83:532–543 PMID:29405351

  8. Statistical molecular design of balanced compound libraries for QSAR modeling.

    PubMed

    Linusson, A; Elofsson, M; Andersson, I E; Dahlgren, M K

    2010-01-01

    A fundamental step in preclinical drug development is the computation of quantitative structure-activity relationship (QSAR) models, i.e. models that link chemical features of compounds with activities towards a target macromolecule associated with the initiation or progression of a disease. QSAR models are computed by combining information on the physicochemical and structural features of a library of congeneric compounds, typically assembled from two or more building blocks, and biological data from one or more in vitro assays. Since the models provide information on features affecting the compounds' biological activity they can be used as guides for further optimization. However, in order for a QSAR model to be relevant to the targeted disease, and drug development in general, the compound library used must contain molecules with balanced variation of the features spanning the chemical space believed to be important for interaction with the biological target. In addition, the assays used must be robust and deliver high quality data that are directly related to the function of the biological target and the associated disease state. In this review, we discuss and exemplify the concept of statistical molecular design (SMD) in the selection of building blocks and final synthetic targets (i.e. compounds to synthesize) to generate information-rich, balanced libraries for biological testing and computation of QSAR models.

  9. Time series modeling of pathogen-specific disease probabilities with subsampled data.

    PubMed

    Fisher, Leigh; Wakefield, Jon; Bauer, Cici; Self, Steve

    2017-03-01

    Many diseases arise due to exposure to one of multiple possible pathogens. We consider the situation in which disease counts are available over time from a study region, along with a measure of clinical disease severity, for example, mild or severe. In addition, we suppose a subset of the cases are lab tested in order to determine the pathogen responsible for disease. In such a context, we focus interest on modeling the probabilities of disease incidence given pathogen type. The time course of these probabilities is of great interest as is the association with time-varying covariates such as meteorological variables. In this set up, a natural Bayesian approach would be based on imputation of the unsampled pathogen information using Markov Chain Monte Carlo but this is computationally challenging. We describe a practical approach to inference that is easy to implement. We use an empirical Bayes procedure in a first step to estimate summary statistics. We then treat these summary statistics as the observed data and develop a Bayesian generalized additive model. We analyze data on hand, foot, and mouth disease (HFMD) in China in which there are two pathogens of primary interest, enterovirus 71 (EV71) and Coxackie A16 (CA16). We find that both EV71 and CA16 are associated with temperature, relative humidity, and wind speed, with reasonably similar functional forms for both pathogens. The important issue of confounding by time is modeled using a penalized B-spline model with a random effects representation. The level of smoothing is addressed by a careful choice of the prior on the tuning variance. © 2016, The International Biometric Society.

  10. Evaluation of the implementation of an integrated program for musculoskeletal system care.

    PubMed

    Larrañaga, Igor; Soto-Gordoa, Myriam; Arrospide, Arantzazu; Jauregi, María Luz; Millas, Jesús; San Vicente, Ricardo; Aguirrebeña, Jabier; Mar, Javier

    The chronic nature of musculoskeletal diseases requires an integrated care which involves the Primary Care and the specialities of Rheumatology, Traumatology and Rehabilitation. The aim of this study was to assess the implementation of an integrated organizational model in osteoporosis, low back pain, shoulder disease and knee disease using Deming's continuous improvement process and considering referrals and resource consumption. A simulation model was used in the planning to predict the evolution of musculoskeletal diseases resource consumption and to carry out a Budget Impact Analysis from 2012 to 2020 in the Goierri-Alto Urola region. In the checking stage the status of the process in 2014 was evaluated using statistical analysis to check the degree of achievement of the objectives for each speciality. Simulation models showed that population with musculoskeletal disease in Goierri-Alto Urola will increase a 4.4% by 2020. Because of that, the expenses for a conventional healthcare system will have increased a 5.9%. However, if the intervention reaches its objectives the budget would decrease an 8.5%. The statistical analysis evidenced a decline in referrals to Traumatology service and a reduction of successive consultations in all specialities. The implementation of the integrated organizational model in osteoporosis, low back pain, shoulder disease and knee disease is still at an early stage. However, the empowerment of Primary Care improved patient referrals and reduced the costs. Copyright © 2016 Elsevier España, S.L.U. and Sociedad Española de Reumatología y Colegio Mexicano de Reumatología. All rights reserved.

  11. Function-driven discovery of disease genes in zebrafish using an integrated genomics big data resource.

    PubMed

    Shim, Hongseok; Kim, Ji Hyun; Kim, Chan Yeong; Hwang, Sohyun; Kim, Hyojin; Yang, Sunmo; Lee, Ji Eun; Lee, Insuk

    2016-11-16

    Whole exome sequencing (WES) accelerates disease gene discovery using rare genetic variants, but further statistical and functional evidence is required to avoid false-discovery. To complement variant-driven disease gene discovery, here we present function-driven disease gene discovery in zebrafish (Danio rerio), a promising human disease model owing to its high anatomical and genomic similarity to humans. To facilitate zebrafish-based function-driven disease gene discovery, we developed a genome-scale co-functional network of zebrafish genes, DanioNet (www.inetbio.org/danionet), which was constructed by Bayesian integration of genomics big data. Rigorous statistical assessment confirmed the high prediction capacity of DanioNet for a wide variety of human diseases. To demonstrate the feasibility of the function-driven disease gene discovery using DanioNet, we predicted genes for ciliopathies and performed experimental validation for eight candidate genes. We also validated the existence of heterozygous rare variants in the candidate genes of individuals with ciliopathies yet not in controls derived from the UK10K consortium, suggesting that these variants are potentially involved in enhancing the risk of ciliopathies. These results showed that an integrated genomics big data for a model animal of diseases can expand our opportunity for harnessing WES data in disease gene discovery. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. Enhancing image classification models with multi-modal biomarkers

    NASA Astrophysics Data System (ADS)

    Caban, Jesus J.; Liao, David; Yao, Jianhua; Mollura, Daniel J.; Gochuico, Bernadette; Yoo, Terry

    2011-03-01

    Currently, most computer-aided diagnosis (CAD) systems rely on image analysis and statistical models to diagnose, quantify, and monitor the progression of a particular disease. In general, CAD systems have proven to be effective at providing quantitative measurements and assisting physicians during the decision-making process. As the need for more flexible and effective CADs continues to grow, questions about how to enhance their accuracy have surged. In this paper, we show how statistical image models can be augmented with multi-modal physiological values to create more robust, stable, and accurate CAD systems. In particular, this paper demonstrates how highly correlated blood and EKG features can be treated as biomarkers and used to enhance image classification models designed to automatically score subjects with pulmonary fibrosis. In our results, a 3-5% improvement was observed when comparing the accuracy of CADs that use multi-modal biomarkers with those that only used image features. Our results show that lab values such as Erythrocyte Sedimentation Rate and Fibrinogen, as well as EKG measurements such as QRS and I:40, are statistically significant and can provide valuable insights about the severity of the pulmonary fibrosis disease.

  13. Using multitype branching processes to quantify statistics of disease outbreaks in zoonotic epidemics

    USDA-ARS?s Scientific Manuscript database

    Despite the enormous relevance of zoonotic infections to world-wide public health, and despite much effort in modeling individual zoonoses, a fundamental understanding of the disease dynamics and the nature of outbreaks emanating from such a complex system is still lacking. We introduce a simple sto...

  14. A disease state fingerprint for evaluation of Alzheimer's disease.

    PubMed

    Mattila, Jussi; Koikkalainen, Juha; Virkki, Arho; Simonsen, Anja; van Gils, Mark; Waldemar, Gunhild; Soininen, Hilkka; Lötjönen, Jyrki

    2011-01-01

    Diagnostic processes of Alzheimer's disease (AD) are evolving. Knowledge about disease-specific biomarkers is constantly increasing and larger volumes of data are being measured from patients. To gain additional benefits from the collected data, a novel statistical modeling and data visualization system is proposed for supporting clinical diagnosis of AD. The proposed system computes an evidence-based estimate of a patient's AD state by comparing his or her heterogeneous neuropsychological, clinical, and biomarker data to previously diagnosed cases. The AD state in this context denotes a patient's degree of similarity to previously diagnosed disease population. A summary of patient data and results of the computation are displayed in a succinct Disease State Fingerprint (DSF) visualization. The visualization clearly discloses how patient data contributes to the AD state, facilitating rapid interpretation of the information. To model the AD state from complex and heterogeneous patient data, a statistical Disease State Index (DSI) method underlying the DSF has been developed. Using baseline data from the Alzheimer's Disease Neuroimaging Initiative (ADNI), the ability of the DSI to model disease progression from elderly healthy controls to AD and its ability to predict conversion from mild cognitive impairment (MCI) to AD were assessed. It was found that the DSI provides well-behaving AD state estimates, corresponding well with the actual diagnoses. For predicting conversion from MCI to AD, the DSI attains performance similar to state-of-the-art reference classifiers. The results suggest that the DSF establishes an effective decision support and data visualization framework for improving AD diagnostics, allowing clinicians to rapidly analyze large quantities of diverse patient data.

  15. Validating Lung Models Using the ASL 5000 Breathing Simulator.

    PubMed

    Dexter, Amanda; McNinch, Neil; Kaznoch, Destiny; Volsko, Teresa A

    2018-04-01

    This study sought to validate pediatric models with normal and altered pulmonary mechanics. PubMed and CINAHL databases were searched for studies directly measuring pulmonary mechanics of healthy infants and children, infants with severe bronchopulmonary dysplasia and neuromuscular disease. The ASL 5000 was used to construct models using tidal volume (VT), inspiratory time (TI), respiratory rate, resistance, compliance, and esophageal pressure gleaned from literature. Data were collected for a 1-minute period and repeated three times for each model. t tests compared modeled data with data abstracted from the literature. Repeated measures analyses evaluated model performance over multiple iterations. Statistical significance was established at a P value of less than 0.05. Maximum differences of means (experimental iteration mean - clinical standard mean) for TI and VT are the following: term infant without lung disease (TI = 0.09 s, VT = 0.29 mL), severe bronchopulmonary dysplasia (TI = 0.08 s, VT = 0.17 mL), child without lung disease (TI = 0.10 s, VT = 0.17 mL), and child with neuromuscular disease (TI = 0.09 s, VT = 0.57 mL). One-sample testing demonstrated statistically significant differences between clinical controls and VT and TI values produced by the ASL 5000 for each iteration and model (P < 0.01). The greatest magnitude of differences was negligible (VT < 1.6%, TI = 18%) and not clinically relevant. Inconsistencies occurred with the models constructed on the ASL 5000. It was deemed accurate for the study purposes. It is therefore essential to test models and evaluate magnitude of differences before use.

  16. New insights into the endophenotypic status of cognition in bipolar disorder: genetic modelling study of twins and siblings.

    PubMed

    Georgiades, Anna; Rijsdijk, Fruhling; Kane, Fergus; Rebollo-Mesa, Irene; Kalidindi, Sridevi; Schulze, Katja K; Stahl, Daniel; Walshe, Muriel; Sahakian, Barbara J; McDonald, Colm; Hall, Mei-Hua; Murray, Robin M; Kravariti, Eugenia

    2016-06-01

    Twin studies have lacked statistical power to apply advanced genetic modelling techniques to the search for cognitive endophenotypes for bipolar disorder. To quantify the shared genetic variability between bipolar disorder and cognitive measures. Structural equation modelling was performed on cognitive data collected from 331 twins/siblings of varying genetic relatedness, disease status and concordance for bipolar disorder. Using a parsimonious AE model, verbal episodic and spatial working memory showed statistically significant genetic correlations with bipolar disorder (rg = |0.23|-|0.27|), which lost statistical significance after covarying for affective symptoms. Using an ACE model, IQ and visual-spatial learning showed statistically significant genetic correlations with bipolar disorder (rg = |0.51|-|1.00|), which remained significant after covarying for affective symptoms. Verbal episodic and spatial working memory capture a modest fraction of the bipolar diathesis. IQ and visual-spatial learning may tap into genetic substrates of non-affective symptomatology in bipolar disorder. © The Royal College of Psychiatrists 2016.

  17. Assessing socioeconomic vulnerability to dengue fever in Cali, Colombia: statistical vs expert-based modeling

    PubMed Central

    2013-01-01

    Background As a result of changes in climatic conditions and greater resistance to insecticides, many regions across the globe, including Colombia, have been facing a resurgence of vector-borne diseases, and dengue fever in particular. Timely information on both (1) the spatial distribution of the disease, and (2) prevailing vulnerabilities of the population are needed to adequately plan targeted preventive intervention. We propose a methodology for the spatial assessment of current socioeconomic vulnerabilities to dengue fever in Cali, a tropical urban environment of Colombia. Methods Based on a set of socioeconomic and demographic indicators derived from census data and ancillary geospatial datasets, we develop a spatial approach for both expert-based and purely statistical-based modeling of current vulnerability levels across 340 neighborhoods of the city using a Geographic Information System (GIS). The results of both approaches are comparatively evaluated by means of spatial statistics. A web-based approach is proposed to facilitate the visualization and the dissemination of the output vulnerability index to the community. Results The statistical and the expert-based modeling approach exhibit a high concordance, globally, and spatially. The expert-based approach indicates a slightly higher vulnerability mean (0.53) and vulnerability median (0.56) across all neighborhoods, compared to the purely statistical approach (mean = 0.48; median = 0.49). Both approaches reveal that high values of vulnerability tend to cluster in the eastern, north-eastern, and western part of the city. These are poor neighborhoods with high percentages of young (i.e., < 15 years) and illiterate residents, as well as a high proportion of individuals being either unemployed or doing housework. Conclusions Both modeling approaches reveal similar outputs, indicating that in the absence of local expertise, statistical approaches could be used, with caution. By decomposing identified vulnerability “hotspots” into their underlying factors, our approach provides valuable information on both (1) the location of neighborhoods, and (2) vulnerability factors that should be given priority in the context of targeted intervention strategies. The results support decision makers to allocate resources in a manner that may reduce existing susceptibilities and strengthen resilience, and thus help to reduce the burden of vector-borne diseases. PMID:23945265

  18. Human Immunodeficiency Virus and the Enrolled Student: A Model Policy.

    ERIC Educational Resources Information Center

    Iowa State Dept. of Education, Des Moines.

    In the nearly 4 years since the initial publication of the model policy "Communicable Diseases and the Enrolled Student" in January 1986, the statistics, recommendations, and even the terminology of Acquired Immune Deficiency Syndrome (AIDS) have changed significantly. In light of the new information, the model policy, recommended for…

  19. Big data to smart data in Alzheimer's disease: Real-world examples of advanced modeling and simulation.

    PubMed

    Haas, Magali; Stephenson, Diane; Romero, Klaus; Gordon, Mark Forrest; Zach, Neta; Geerts, Hugo

    2016-09-01

    Many disease-modifying clinical development programs in Alzheimer's disease (AD) have failed to date, and development of new and advanced preclinical models that generate actionable knowledge is desperately needed. This review reports on computer-based modeling and simulation approach as a powerful tool in AD research. Statistical data-analysis techniques can identify associations between certain data and phenotypes, such as diagnosis or disease progression. Other approaches integrate domain expertise in a formalized mathematical way to understand how specific components of pathology integrate into complex brain networks. Private-public partnerships focused on data sharing, causal inference and pathway-based analysis, crowdsourcing, and mechanism-based quantitative systems modeling represent successful real-world modeling examples with substantial impact on CNS diseases. Similar to other disease indications, successful real-world examples of advanced simulation can generate actionable support of drug discovery and development in AD, illustrating the value that can be generated for different stakeholders. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  20. Senior Computational Scientist | Center for Cancer Research

    Cancer.gov

    The Basic Science Program (BSP) pursues independent, multidisciplinary research in basic and applied molecular biology, immunology, retrovirology, cancer biology, and human genetics. Research efforts and support are an integral part of the Center for Cancer Research (CCR) at the Frederick National Laboratory for Cancer Research (FNLCR). The Cancer & Inflammation Program (CIP), Basic Science Program, HLA Immunogenetics Section, under the leadership of Dr. Mary Carrington, studies the influence of human leukocyte antigens (HLA) and specific KIR/HLA genotypes on risk of and outcomes to infection, cancer, autoimmune disease, and maternal-fetal disease. Recent studies have focused on the impact of HLA gene expression in disease, the molecular mechanism regulating expression levels, and the functional basis for the effect of differential expression on disease outcome. The lab’s further focus is on the genetic basis for resistance/susceptibility to disease conferred by immunogenetic variation. KEY ROLES/RESPONSIBILITIES The Senior Computational Scientist will provide research support to the CIP-BSP-HLA Immunogenetics Section performing bio-statistical design, analysis and reporting of research projects conducted in the lab. This individual will be involved in the implementation of statistical models and data preparation. Successful candidate should have 5 or more years of competent, innovative biostatistics/bioinformatics research experience, beyond doctoral training Considerable experience with statistical software, such as SAS, R and S-Plus Sound knowledge, and demonstrated experience of theoretical and applied statistics Write program code to analyze data using statistical analysis software Contribute to the interpretation and publication of research results

  1. Challenges in developing methods for quantifying the effects of weather and climate on water-associated diseases: A systematic review

    PubMed Central

    Armstrong, Ben; Fleming, Lora E.; Elson, Richard; Kovats, Sari; Vardoulakis, Sotiris; Nichols, Gordon L.

    2017-01-01

    Infectious diseases attributable to unsafe water supply, sanitation and hygiene (e.g. Cholera, Leptospirosis, Giardiasis) remain an important cause of morbidity and mortality, especially in low-income countries. Climate and weather factors are known to affect the transmission and distribution of infectious diseases and statistical and mathematical modelling are continuously developing to investigate the impact of weather and climate on water-associated diseases. There have been little critical analyses of the methodological approaches. Our objective is to review and summarize statistical and modelling methods used to investigate the effects of weather and climate on infectious diseases associated with water, in order to identify limitations and knowledge gaps in developing of new methods. We conducted a systematic review of English-language papers published from 2000 to 2015. Search terms included concepts related to water-associated diseases, weather and climate, statistical, epidemiological and modelling methods. We found 102 full text papers that met our criteria and were included in the analysis. The most commonly used methods were grouped in two clusters: process-based models (PBM) and time series and spatial epidemiology (TS-SE). In general, PBM methods were employed when the bio-physical mechanism of the pathogen under study was relatively well known (e.g. Vibrio cholerae); TS-SE tended to be used when the specific environmental mechanisms were unclear (e.g. Campylobacter). Important data and methodological challenges emerged, with implications for surveillance and control of water-associated infections. The most common limitations comprised: non-inclusion of key factors (e.g. biological mechanism, demographic heterogeneity, human behavior), reporting bias, poor data quality, and collinearity in exposures. Furthermore, the methods often did not distinguish among the multiple sources of time-lags (e.g. patient physiology, reporting bias, healthcare access) between environmental drivers/exposures and disease detection. Key areas of future research include: disentangling the complex effects of weather/climate on each exposure-health outcome pathway (e.g. person-to-person vs environment-to-person), and linking weather data to individual cases longitudinally. PMID:28604791

  2. Challenges in developing methods for quantifying the effects of weather and climate on water-associated diseases: A systematic review.

    PubMed

    Lo Iacono, Giovanni; Armstrong, Ben; Fleming, Lora E; Elson, Richard; Kovats, Sari; Vardoulakis, Sotiris; Nichols, Gordon L

    2017-06-01

    Infectious diseases attributable to unsafe water supply, sanitation and hygiene (e.g. Cholera, Leptospirosis, Giardiasis) remain an important cause of morbidity and mortality, especially in low-income countries. Climate and weather factors are known to affect the transmission and distribution of infectious diseases and statistical and mathematical modelling are continuously developing to investigate the impact of weather and climate on water-associated diseases. There have been little critical analyses of the methodological approaches. Our objective is to review and summarize statistical and modelling methods used to investigate the effects of weather and climate on infectious diseases associated with water, in order to identify limitations and knowledge gaps in developing of new methods. We conducted a systematic review of English-language papers published from 2000 to 2015. Search terms included concepts related to water-associated diseases, weather and climate, statistical, epidemiological and modelling methods. We found 102 full text papers that met our criteria and were included in the analysis. The most commonly used methods were grouped in two clusters: process-based models (PBM) and time series and spatial epidemiology (TS-SE). In general, PBM methods were employed when the bio-physical mechanism of the pathogen under study was relatively well known (e.g. Vibrio cholerae); TS-SE tended to be used when the specific environmental mechanisms were unclear (e.g. Campylobacter). Important data and methodological challenges emerged, with implications for surveillance and control of water-associated infections. The most common limitations comprised: non-inclusion of key factors (e.g. biological mechanism, demographic heterogeneity, human behavior), reporting bias, poor data quality, and collinearity in exposures. Furthermore, the methods often did not distinguish among the multiple sources of time-lags (e.g. patient physiology, reporting bias, healthcare access) between environmental drivers/exposures and disease detection. Key areas of future research include: disentangling the complex effects of weather/climate on each exposure-health outcome pathway (e.g. person-to-person vs environment-to-person), and linking weather data to individual cases longitudinally.

  3. Epidemic modeling with discrete-space scheduled walkers: extensions and research opportunities

    PubMed Central

    2009-01-01

    Background This exploratory paper outlines an epidemic simulator built on an agent-based, data-driven model of the spread of a disease within an urban environment. An intent of the model is to provide insight into how a disease may reach a tipping point, spreading to an epidemic of uncontrollable proportions. Methods As a complement to analytical methods, simulation is arguably an effective means of gaining a better understanding of system-level disease dynamics within a population and offers greater utility in its modeling capabilities. Our investigation is based on this conjecture, supported by data-driven models that are reasonable, realistic and practical, in an attempt to demonstrate their efficacy in studying system-wide epidemic phenomena. An agent-based model (ABM) offers considerable flexibility in extending the study of the phenomena before, during and after an outbreak or catastrophe. Results An agent-based model was developed based on a paradigm of a 'discrete-space scheduled walker' (DSSW), modeling a medium-sized North American City of 650,000 discrete agents, built upon a conceptual framework of statistical reasoning (law of large numbers, statistical mechanics) as well as a correct-by-construction bias. The model addresses where, who, when and what elements, corresponding to network topography and agent characteristics, behaviours, and interactions upon that topography. The DSSW-ABM has an interface and associated scripts that allow for a variety of what-if scenarios modeling disease spread throughout the population, and for data to be collected and displayed via a web browser. Conclusion This exploratory paper also presents several research opportunities for exploiting data sources of a non-obvious and disparate nature for the purposes of epidemic modeling. There is an increasing amount and variety of data that will continue to contribute to the accuracy of agent-based models and improve their utility in modeling disease spread. The model developed here is well suited to diseases where there is not a predisposition for contraction within the population. One of the advantages of agent-based modeling is the ability to set up a rare event and develop policy as to how one may mitigate damages arising from it. PMID:19922684

  4. Epidemic modeling with discrete-space scheduled walkers: extensions and research opportunities.

    PubMed

    Borkowski, Maciej; Podaima, Blake W; McLeod, Robert D

    2009-11-18

    This exploratory paper outlines an epidemic simulator built on an agent-based, data-driven model of the spread of a disease within an urban environment. An intent of the model is to provide insight into how a disease may reach a tipping point, spreading to an epidemic of uncontrollable proportions. As a complement to analytical methods, simulation is arguably an effective means of gaining a better understanding of system-level disease dynamics within a population and offers greater utility in its modeling capabilities. Our investigation is based on this conjecture, supported by data-driven models that are reasonable, realistic and practical, in an attempt to demonstrate their efficacy in studying system-wide epidemic phenomena. An agent-based model (ABM) offers considerable flexibility in extending the study of the phenomena before, during and after an outbreak or catastrophe. An agent-based model was developed based on a paradigm of a 'discrete-space scheduled walker' (DSSW), modeling a medium-sized North American City of 650,000 discrete agents, built upon a conceptual framework of statistical reasoning (law of large numbers, statistical mechanics) as well as a correct-by-construction bias. The model addresses where, who, when and what elements, corresponding to network topography and agent characteristics, behaviours, and interactions upon that topography. The DSSW-ABM has an interface and associated scripts that allow for a variety of what-if scenarios modeling disease spread throughout the population, and for data to be collected and displayed via a web browser. This exploratory paper also presents several research opportunities for exploiting data sources of a non-obvious and disparate nature for the purposes of epidemic modeling. There is an increasing amount and variety of data that will continue to contribute to the accuracy of agent-based models and improve their utility in modeling disease spread. The model developed here is well suited to diseases where there is not a predisposition for contraction within the population. One of the advantages of agent-based modeling is the ability to set up a rare event and develop policy as to how one may mitigate damages arising from it.

  5. A model for family-based case-control studies of genetic imprinting and epistasis.

    PubMed

    Li, Xin; Sui, Yihan; Liu, Tian; Wang, Jianxin; Li, Yongci; Lin, Zhenwu; Hegarty, John; Koltun, Walter A; Wang, Zuoheng; Wu, Rongling

    2014-11-01

    Genetic imprinting, or called the parent-of-origin effect, has been recognized to play an important role in the formation and pathogenesis of human diseases. Although the epigenetic mechanisms that establish genetic imprinting have been a focus of many genetic studies, our knowledge about the number of imprinting genes and their chromosomal locations and interactions with other genes is still scarce, limiting precise inference of the genetic architecture of complex diseases. In this article, we present a statistical model for testing and estimating the effects of genetic imprinting on complex diseases using a commonly used case-control design with family structure. For each subject sampled from a case and control population, we not only genotype its own single nucleotide polymorphisms (SNPs) but also collect its parents' genotypes. By tracing the transmission pattern of SNP alleles from parental to offspring generation, the model allows the characterization of genetic imprinting effects based on Pearson tests of a 2 × 2 contingency table. The model is expanded to test the interactions between imprinting effects and additive, dominant and epistatic effects in a complex web of genetic interactions. Statistical properties of the model are investigated, and its practical usefulness is validated by a real data analysis. The model will provide a useful tool for genome-wide association studies aimed to elucidate the picture of genetic control over complex human diseases. © The Author 2013. Published by Oxford University Press. For Permissions, please email: journals.permissions@oup.com.

  6. A power comparison of generalized additive models and the spatial scan statistic in a case-control setting.

    PubMed

    Young, Robin L; Weinberg, Janice; Vieira, Verónica; Ozonoff, Al; Webster, Thomas F

    2010-07-19

    A common, important problem in spatial epidemiology is measuring and identifying variation in disease risk across a study region. In application of statistical methods, the problem has two parts. First, spatial variation in risk must be detected across the study region and, second, areas of increased or decreased risk must be correctly identified. The location of such areas may give clues to environmental sources of exposure and disease etiology. One statistical method applicable in spatial epidemiologic settings is a generalized additive model (GAM) which can be applied with a bivariate LOESS smoother to account for geographic location as a possible predictor of disease status. A natural hypothesis when applying this method is whether residential location of subjects is associated with the outcome, i.e. is the smoothing term necessary? Permutation tests are a reasonable hypothesis testing method and provide adequate power under a simple alternative hypothesis. These tests have yet to be compared to other spatial statistics. This research uses simulated point data generated under three alternative hypotheses to evaluate the properties of the permutation methods and compare them to the popular spatial scan statistic in a case-control setting. Case 1 was a single circular cluster centered in a circular study region. The spatial scan statistic had the highest power though the GAM method estimates did not fall far behind. Case 2 was a single point source located at the center of a circular cluster and Case 3 was a line source at the center of the horizontal axis of a square study region. Each had linearly decreasing logodds with distance from the point. The GAM methods outperformed the scan statistic in Cases 2 and 3. Comparing sensitivity, measured as the proportion of the exposure source correctly identified as high or low risk, the GAM methods outperformed the scan statistic in all three Cases. The GAM permutation testing methods provide a regression-based alternative to the spatial scan statistic. Across all hypotheses examined in this research, the GAM methods had competing or greater power estimates and sensitivities exceeding that of the spatial scan statistic.

  7. A power comparison of generalized additive models and the spatial scan statistic in a case-control setting

    PubMed Central

    2010-01-01

    Background A common, important problem in spatial epidemiology is measuring and identifying variation in disease risk across a study region. In application of statistical methods, the problem has two parts. First, spatial variation in risk must be detected across the study region and, second, areas of increased or decreased risk must be correctly identified. The location of such areas may give clues to environmental sources of exposure and disease etiology. One statistical method applicable in spatial epidemiologic settings is a generalized additive model (GAM) which can be applied with a bivariate LOESS smoother to account for geographic location as a possible predictor of disease status. A natural hypothesis when applying this method is whether residential location of subjects is associated with the outcome, i.e. is the smoothing term necessary? Permutation tests are a reasonable hypothesis testing method and provide adequate power under a simple alternative hypothesis. These tests have yet to be compared to other spatial statistics. Results This research uses simulated point data generated under three alternative hypotheses to evaluate the properties of the permutation methods and compare them to the popular spatial scan statistic in a case-control setting. Case 1 was a single circular cluster centered in a circular study region. The spatial scan statistic had the highest power though the GAM method estimates did not fall far behind. Case 2 was a single point source located at the center of a circular cluster and Case 3 was a line source at the center of the horizontal axis of a square study region. Each had linearly decreasing logodds with distance from the point. The GAM methods outperformed the scan statistic in Cases 2 and 3. Comparing sensitivity, measured as the proportion of the exposure source correctly identified as high or low risk, the GAM methods outperformed the scan statistic in all three Cases. Conclusions The GAM permutation testing methods provide a regression-based alternative to the spatial scan statistic. Across all hypotheses examined in this research, the GAM methods had competing or greater power estimates and sensitivities exceeding that of the spatial scan statistic. PMID:20642827

  8. Brain MRI analysis for Alzheimer's disease diagnosis using an ensemble system of deep convolutional neural networks.

    PubMed

    Islam, Jyoti; Zhang, Yanqing

    2018-05-31

    Alzheimer's disease is an incurable, progressive neurological brain disorder. Earlier detection of Alzheimer's disease can help with proper treatment and prevent brain tissue damage. Several statistical and machine learning models have been exploited by researchers for Alzheimer's disease diagnosis. Analyzing magnetic resonance imaging (MRI) is a common practice for Alzheimer's disease diagnosis in clinical research. Detection of Alzheimer's disease is exacting due to the similarity in Alzheimer's disease MRI data and standard healthy MRI data of older people. Recently, advanced deep learning techniques have successfully demonstrated human-level performance in numerous fields including medical image analysis. We propose a deep convolutional neural network for Alzheimer's disease diagnosis using brain MRI data analysis. While most of the existing approaches perform binary classification, our model can identify different stages of Alzheimer's disease and obtains superior performance for early-stage diagnosis. We conducted ample experiments to demonstrate that our proposed model outperformed comparative baselines on the Open Access Series of Imaging Studies dataset.

  9. Statistical shape modeling based renal volume measurement using tracked ultrasound

    NASA Astrophysics Data System (ADS)

    Pai Raikar, Vipul; Kwartowitz, David M.

    2017-03-01

    Autosomal dominant polycystic kidney disease (ADPKD) is the fourth most common cause of kidney transplant worldwide accounting for 7-10% of all cases. Although ADPKD usually progresses over many decades, accurate risk prediction is an important task.1 Identifying patients with progressive disease is vital to providing new treatments being developed and enable them to enter clinical trials for new therapy. Among other factors, total kidney volume (TKV) is a major biomarker predicting the progression of ADPKD. Consortium for Radiologic Imaging Studies in Polycystic Kidney Disease (CRISP)2 have shown that TKV is an early, and accurate measure of cystic burden and likely growth rate. It is strongly associated with loss of renal function.3 While ultrasound (US) has proven as an excellent tool for diagnosing the disease; monitoring short-term changes using ultrasound has been shown to not be accurate. This is attributed to high operator variability and reproducibility as compared to tomographic modalities such as CT and MR (Gold standard). Ultrasound has emerged as one of the standout modality for intra-procedural imaging and with methods for spatial localization has afforded us the ability to track 2D ultrasound in physical space which it is being used. In addition to this, the vast amount of recorded tomographic data can be used to generate statistical shape models that allow us to extract clinical value from archived image sets. In this work, we aim at improving the prognostic value of US in managing ADPKD by assessing the accuracy of using statistical shape model augmented US data, to predict TKV, with the end goal of monitoring short-term changes.

  10. Challenges in risk estimation using routinely collected clinical data: The example of estimating cervical cancer risks from electronic health-records.

    PubMed

    Landy, Rebecca; Cheung, Li C; Schiffman, Mark; Gage, Julia C; Hyun, Noorie; Wentzensen, Nicolas; Kinney, Walter K; Castle, Philip E; Fetterman, Barbara; Poitras, Nancy E; Lorey, Thomas; Sasieni, Peter D; Katki, Hormuzd A

    2018-06-01

    Electronic health-records (EHR) are increasingly used by epidemiologists studying disease following surveillance testing to provide evidence for screening intervals and referral guidelines. Although cost-effective, undiagnosed prevalent disease and interval censoring (in which asymptomatic disease is only observed at the time of testing) raise substantial analytic issues when estimating risk that cannot be addressed using Kaplan-Meier methods. Based on our experience analysing EHR from cervical cancer screening, we previously proposed the logistic-Weibull model to address these issues. Here we demonstrate how the choice of statistical method can impact risk estimates. We use observed data on 41,067 women in the cervical cancer screening program at Kaiser Permanente Northern California, 2003-2013, as well as simulations to evaluate the ability of different methods (Kaplan-Meier, Turnbull, Weibull and logistic-Weibull) to accurately estimate risk within a screening program. Cumulative risk estimates from the statistical methods varied considerably, with the largest differences occurring for prevalent disease risk when baseline disease ascertainment was random but incomplete. Kaplan-Meier underestimated risk at earlier times and overestimated risk at later times in the presence of interval censoring or undiagnosed prevalent disease. Turnbull performed well, though was inefficient and not smooth. The logistic-Weibull model performed well, except when event times didn't follow a Weibull distribution. We have demonstrated that methods for right-censored data, such as Kaplan-Meier, result in biased estimates of disease risks when applied to interval-censored data, such as screening programs using EHR data. The logistic-Weibull model is attractive, but the model fit must be checked against Turnbull non-parametric risk estimates. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  11. eHealth literacy in chronic disease patients: An item response theory analysis of the eHealth literacy scale (eHEALS).

    PubMed

    Paige, Samantha R; Krieger, Janice L; Stellefson, Michael; Alber, Julia M

    2017-02-01

    Chronic disease patients are affected by low computer and health literacy, which negatively affects their ability to benefit from access to online health information. To estimate reliability and confirm model specifications for eHealth Literacy Scale (eHEALS) scores among chronic disease patients using Classical Test (CTT) and Item Response Theory techniques. A stratified sample of Black/African American (N=341) and Caucasian (N=343) adults with chronic disease completed an online survey including the eHEALS. Item discrimination was explored using bi-variate correlations and Cronbach's alpha for internal consistency. A categorical confirmatory factor analysis tested a one-factor structure of eHEALS scores. Item characteristic curves, in-fit/outfit statistics, omega coefficient, and item reliability and separation estimates were computed. A 1-factor structure of eHEALS was confirmed by statistically significant standardized item loadings, acceptable model fit indices (CFI/TLI>0.90), and 70% variance explained by the model. Item response categories increased with higher theta levels, and there was evidence of acceptable reliability (ω=0.94; item reliability=89; item separation=8.54). eHEALS scores are a valid and reliable measure of self-reported eHealth literacy among Internet-using chronic disease patients. Providers can use eHEALS to help identify patients' eHealth literacy skills. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  12. Statistical physics of medical diagnostics: Study of a probabilistic model.

    PubMed

    Mashaghi, Alireza; Ramezanpour, Abolfazl

    2018-03-01

    We study a diagnostic strategy which is based on the anticipation of the diagnostic process by simulation of the dynamical process starting from the initial findings. We show that such a strategy could result in more accurate diagnoses compared to a strategy that is solely based on the direct implications of the initial observations. We demonstrate this by employing the mean-field approximation of statistical physics to compute the posterior disease probabilities for a given subset of observed signs (symptoms) in a probabilistic model of signs and diseases. A Monte Carlo optimization algorithm is then used to maximize an objective function of the sequence of observations, which favors the more decisive observations resulting in more polarized disease probabilities. We see how the observed signs change the nature of the macroscopic (Gibbs) states of the sign and disease probability distributions. The structure of these macroscopic states in the configuration space of the variables affects the quality of any approximate inference algorithm (so the diagnostic performance) which tries to estimate the sign-disease marginal probabilities. In particular, we find that the simulation (or extrapolation) of the diagnostic process is helpful when the disease landscape is not trivial and the system undergoes a phase transition to an ordered phase.

  13. Statistical physics of medical diagnostics: Study of a probabilistic model

    NASA Astrophysics Data System (ADS)

    Mashaghi, Alireza; Ramezanpour, Abolfazl

    2018-03-01

    We study a diagnostic strategy which is based on the anticipation of the diagnostic process by simulation of the dynamical process starting from the initial findings. We show that such a strategy could result in more accurate diagnoses compared to a strategy that is solely based on the direct implications of the initial observations. We demonstrate this by employing the mean-field approximation of statistical physics to compute the posterior disease probabilities for a given subset of observed signs (symptoms) in a probabilistic model of signs and diseases. A Monte Carlo optimization algorithm is then used to maximize an objective function of the sequence of observations, which favors the more decisive observations resulting in more polarized disease probabilities. We see how the observed signs change the nature of the macroscopic (Gibbs) states of the sign and disease probability distributions. The structure of these macroscopic states in the configuration space of the variables affects the quality of any approximate inference algorithm (so the diagnostic performance) which tries to estimate the sign-disease marginal probabilities. In particular, we find that the simulation (or extrapolation) of the diagnostic process is helpful when the disease landscape is not trivial and the system undergoes a phase transition to an ordered phase.

  14. A platform to integrate climate information and rural telemedicine in Malawi

    NASA Astrophysics Data System (ADS)

    Lowe, R.; Chadza, T.; Chirombo, J.; Fonda, C.; Muyepa, A.; Nkoloma, M.; Pietrosemoli, E.; Radicella, S. M.; Tompkins, A. M.; Zennaro, M.

    2012-04-01

    It is commonly accepted that climate plays a role in the transmission of many infectious diseases, particularly those transmitted by mosquitoes such as malaria, which is one of the most important causes of mortality and morbidity in developing countries. Due to time lags involved in the climate-disease transmission system, lagged observed climate variables could provide some predictive lead for forecasting disease epidemics. This lead time could be extended by using forecasts of the climate in disease prediction models. This project aims to implement a platform for the dissemination of climate-driven disease risk forecasts, using a telemedicine approach. A pilot project has been established in Malawi, where a 162 km wireless link has been installed, spanning from Blantyre City to remote health facilities in the district of Mangochi in the Southern region, bordering Lake Malawi. This long Wi-Fi technology allows rural health facilities to upload real-time disease cases as they occur to an online health information system (DHIS2); a national medical database repository administered by the Ministry of Health. This technology provides a real-time data logging system for disease incidence monitoring and facilitates the flow of information between local and national levels. This platform allows statistical and dynamical disease prediction models to be rapidly updated with real-time climate and epidemiological information. This permits health authorities to target timely interventions ahead of an imminent increase in malaria incidence. By integrating meteorological and health information systems in a statistical-dynamical prediction model, we show that a long-distance Wi-Fi link is a practical and inexpensive means to enable the rapid analysis of real-time information in order to target disease prevention and control measures and mobilise resources at the local level.

  15. [Mathematical modeling for conditionality of cardiovascular disease by housing conditions].

    PubMed

    Meshkov, N A

    2014-01-01

    There was studied the influence of living conditions (housing area per capita, availability of housing water supply, sewerage and central heating) on the morbidity of the cardiovascular diseases in child and adult population. With the method of regression analysis the morbidity rate was established to significantly decrease with the increase in the area of housing, constructed models are statistically significant, respectively, p = 0.01 and p = 0.02. There was revealed the relationship of the morbidity rate of cardiovascular diseases in children and adults with the supply with housing central heating (p = 0.02 and p = 0.009).

  16. Disease clusters, exact distributions of maxima, and P-values.

    PubMed

    Grimson, R C

    1993-10-01

    This paper presents combinatorial (exact) methods that are useful in the analysis of disease cluster data obtained from small environments, such as buildings and neighbourhoods. Maxwell-Boltzmann and Fermi-Dirac occupancy models are compared in terms of appropriateness of representation of disease incidence patterns (space and/or time) in these environments. The methods are illustrated by a statistical analysis of the incidence pattern of bone fractures in a setting wherein fracture clustering was alleged to be occurring. One of the methodological results derived in this paper is the exact distribution of the maximum cell frequency in occupancy models.

  17. Expected social utility of life time in the presence of a chronic disease.

    PubMed

    Mulder, P G; Hempenius, A L

    1993-10-01

    Interventive action aimed at reducing the incidence of an irreversible chronic noncommunicable disease in a population has various effects. Hopefully, it increases total longevity in the population and it causes the disease to develop later in time in a smaller portion of the population. In this paper a statistical model is built by which these effects can be estimated. A three dimensional probability density function that underlies this model is changed by the interventive action. It is shown how a three dimensional utility function can be defined to appropriately judge this change.

  18. [FROM STATISTICAL ASSOCIATIONS TO SCIENTIFIC CAUSALITY].

    PubMed

    Golan, Daniel; Linn, Shay

    2015-06-01

    The pathogenesis of most chronic diseases is complex and probably involves the interaction of multiple genetic and environmental risk factors. One way to learn about disease triggers is from statistically significant associations in epidemiological studies. However, associations do not necessarily prove causation. Associations can commonly result from bias, confounding and reverse causation. Several paradigms for causality inference have been developed. Henle-Koch postulates are mainly applied for infectious diseases. Austin Bradford Hill's criteria may serve as a practical tool to weigh the evidence regarding the probability that a single new risk factor for a given disease is indeed causal. These criteria are irrelevant for estimating the causal relationship between exposure to a risk factor and disease whenever biological causality has been previously established. Thus, it is highly probable that past exposure of an individual to definite carcinogens is related to his cancer, even without proving an association between this exposure and cancer in his group. For multifactorial diseases, Rothman's model of interacting sets of component causes can be applied.

  19. Computational Medicine: Translating Models to Clinical Care

    PubMed Central

    Winslow, Raimond L.; Trayanova, Natalia; Geman, Donald; Miller, Michael I.

    2013-01-01

    Because of the inherent complexity of coupled nonlinear biological systems, the development of computational models is necessary for achieving a quantitative understanding of their structure and function in health and disease. Statistical learning is applied to high-dimensional biomolecular data to create models that describe relationships between molecules and networks. Multiscale modeling links networks to cells, organs, and organ systems. Computational approaches are used to characterize anatomic shape and its variations in health and disease. In each case, the purposes of modeling are to capture all that we know about disease and to develop improved therapies tailored to the needs of individuals. We discuss advances in computational medicine, with specific examples in the fields of cancer, diabetes, cardiology, and neurology. Advances in translating these computational methods to the clinic are described, as well as challenges in applying models for improving patient health. PMID:23115356

  20. Twenty-Year Predictors of Peripheral Arterial Disease Compared With Coronary Heart Disease in the Scottish Heart Health Extended Cohort (SHHEC).

    PubMed

    Tunstall-Pedoe, Hugh; Peters, Sanne A E; Woodward, Mark; Struthers, Allan D; Belch, Jill J F

    2017-09-18

    Coronary heart disease and peripheral arterial disease (PAD) affect different vascular territories. Supplementing baseline findings with assays from stored serum, we compared their 20-year predictors. We randomly recruited 15 737 disease-free men and women aged 30 to 75 years across Scotland between 1984 and 1995 and followed them through 2009 for death and hospital diagnoses. Of these, 3098 developed coronary heart disease (19.7%), and 499 PAD (3.2%). Hazard ratios for 45 variables in the Cox model were adjusted for age and sex and for factors in the 2007 ASSIGN cardiovascular risk score. Forty-four of them were entered into parsimonious predictive models, tested by c-statistics and net reclassification improvements. Many hazard ratios diminished with adjustment and parsimonious modeling, leaving significant survivors. The hazard ratios were mostly higher in PAD. New parsimonious models increased the c-statistic and net reclassification improvements over ASSIGN variables alone but varied in their components and ranking. Coronary heart disease and PAD shared 7 of the 9 factors from ASSIGN: age, sex, family history, socioeconomic status, diabetes mellitus, tobacco smoking, and systolic blood pressure (but neither total nor high-density lipoprotein cholesterol); plus 4 new ones: NT-pro-BNP, cotinine, high-sensitivity C-reactive protein, and cystatin-C. The highest ranked hazard ratios for continuous factors in coronary heart disease were those for age, total cholesterol, high-sensitivity troponin, NT-pro-BNP, cotinine, apolipoprotein A, and waist circumference (plus 10 more); in PAD they were age, high-sensitivity C-reactive protein, systolic blood pressure, expired carbon monoxide, cotinine, socioeconomic status, and lipoprotein (a) (plus 5 more). The mixture of shared with disparate determinants for arterial disease in the heart and the legs implies nonidentical pathogenesis: cholesterol dominant in the former, and inflammation (high-sensitivity C-reactive protein, diabetes mellitus, smoking) in the latter. © 2017 The Authors. Published on behalf of the American Heart Association, Inc., by Wiley.

  1. Gene-Based Association Analysis for Censored Traits Via Fixed Effect Functional Regressions.

    PubMed

    Fan, Ruzong; Wang, Yifan; Yan, Qi; Ding, Ying; Weeks, Daniel E; Lu, Zhaohui; Ren, Haobo; Cook, Richard J; Xiong, Momiao; Swaroop, Anand; Chew, Emily Y; Chen, Wei

    2016-02-01

    Genetic studies of survival outcomes have been proposed and conducted recently, but statistical methods for identifying genetic variants that affect disease progression are rarely developed. Motivated by our ongoing real studies, here we develop Cox proportional hazard models using functional regression (FR) to perform gene-based association analysis of survival traits while adjusting for covariates. The proposed Cox models are fixed effect models where the genetic effects of multiple genetic variants are assumed to be fixed. We introduce likelihood ratio test (LRT) statistics to test for associations between the survival traits and multiple genetic variants in a genetic region. Extensive simulation studies demonstrate that the proposed Cox RF LRT statistics have well-controlled type I error rates. To evaluate power, we compare the Cox FR LRT with the previously developed burden test (BT) in a Cox model and sequence kernel association test (SKAT), which is based on mixed effect Cox models. The Cox FR LRT statistics have higher power than or similar power as Cox SKAT LRT except when 50%/50% causal variants had negative/positive effects and all causal variants are rare. In addition, the Cox FR LRT statistics have higher power than Cox BT LRT. The models and related test statistics can be useful in the whole genome and whole exome association studies. An age-related macular degeneration dataset was analyzed as an example. © 2016 WILEY PERIODICALS, INC.

  2. Gene-based Association Analysis for Censored Traits Via Fixed Effect Functional Regressions

    PubMed Central

    Fan, Ruzong; Wang, Yifan; Yan, Qi; Ding, Ying; Weeks, Daniel E.; Lu, Zhaohui; Ren, Haobo; Cook, Richard J; Xiong, Momiao; Swaroop, Anand; Chew, Emily Y.; Chen, Wei

    2015-01-01

    Summary Genetic studies of survival outcomes have been proposed and conducted recently, but statistical methods for identifying genetic variants that affect disease progression are rarely developed. Motivated by our ongoing real studies, we develop here Cox proportional hazard models using functional regression (FR) to perform gene-based association analysis of survival traits while adjusting for covariates. The proposed Cox models are fixed effect models where the genetic effects of multiple genetic variants are assumed to be fixed. We introduce likelihood ratio test (LRT) statistics to test for associations between the survival traits and multiple genetic variants in a genetic region. Extensive simulation studies demonstrate that the proposed Cox RF LRT statistics have well-controlled type I error rates. To evaluate power, we compare the Cox FR LRT with the previously developed burden test (BT) in a Cox model and sequence kernel association test (SKAT) which is based on mixed effect Cox models. The Cox FR LRT statistics have higher power than or similar power as Cox SKAT LRT except when 50%/50% causal variants had negative/positive effects and all causal variants are rare. In addition, the Cox FR LRT statistics have higher power than Cox BT LRT. The models and related test statistics can be useful in the whole genome and whole exome association studies. An age-related macular degeneration dataset was analyzed as an example. PMID:26782979

  3. A powerful score-based test statistic for detecting gene-gene co-association.

    PubMed

    Xu, Jing; Yuan, Zhongshang; Ji, Jiadong; Zhang, Xiaoshuai; Li, Hongkai; Wu, Xuesen; Xue, Fuzhong; Liu, Yanxun

    2016-01-29

    The genetic variants identified by Genome-wide association study (GWAS) can only account for a small proportion of the total heritability for complex disease. The existence of gene-gene joint effects which contains the main effects and their co-association is one of the possible explanations for the "missing heritability" problems. Gene-gene co-association refers to the extent to which the joint effects of two genes differ from the main effects, not only due to the traditional interaction under nearly independent condition but the correlation between genes. Generally, genes tend to work collaboratively within specific pathway or network contributing to the disease and the specific disease-associated locus will often be highly correlated (e.g. single nucleotide polymorphisms (SNPs) in linkage disequilibrium). Therefore, we proposed a novel score-based statistic (SBS) as a gene-based method for detecting gene-gene co-association. Various simulations illustrate that, under different sample sizes, marginal effects of causal SNPs and co-association levels, the proposed SBS has the better performance than other existed methods including single SNP-based and principle component analysis (PCA)-based logistic regression model, the statistics based on canonical correlations (CCU), kernel canonical correlation analysis (KCCU), partial least squares path modeling (PLSPM) and delta-square (δ (2)) statistic. The real data analysis of rheumatoid arthritis (RA) further confirmed its advantages in practice. SBS is a powerful and efficient gene-based method for detecting gene-gene co-association.

  4. Development of a Metabolic Biosignature for Detection of Early Lyme Disease

    PubMed Central

    Molins, Claudia R.; Ashton, Laura V.; Wormser, Gary P.; Hess, Ann M.; Delorey, Mark J.; Mahapatra, Sebabrata; Schriefer, Martin E.; Belisle, John T.

    2015-01-01

    Background. Early Lyme disease patients often present to the clinic prior to developing a detectable antibody response to Borrelia burgdorferi, the etiologic agent. Thus, existing 2-tier serology-based assays yield low sensitivities (29%–40%) for early infection. The lack of an accurate laboratory test for early Lyme disease contributes to misconceptions about diagnosis and treatment, and underscores the need for new diagnostic approaches. Methods. Retrospective serum samples from patients with early Lyme disease, other diseases, and healthy controls were analyzed for small molecule metabolites by liquid chromatography-mass spectrometry (LC-MS). A metabolomics data workflow was applied to select a biosignature for classifying early Lyme disease and non-Lyme disease patients. A statistical model of the biosignature was trained using the patients' LC-MS data, and subsequently applied as an experimental diagnostic tool with LC-MS data from additional patient sera. The accuracy of this method was compared with standard 2-tier serology. Results. Metabolic biosignature development selected 95 molecular features that distinguished early Lyme disease patients from healthy controls. Statistical modeling reduced the biosignature to 44 molecular features, and correctly classified early Lyme disease patients and healthy controls with a sensitivity of 88% (84%–95%), and a specificity of 95% (90%–100%). Importantly, the metabolic biosignature correctly classified 77%–95% of the of serology negative Lyme disease patients. Conclusions. The data provide proof-of-concept that metabolic profiling for early Lyme disease can achieve significantly greater (P < .0001) diagnostic sensitivity than current 2-tier serology, while retaining high specificity. PMID:25761869

  5. Detailed Analysis of the African Green Monkey Model of Nipah Virus Disease

    PubMed Central

    Johnston, Sara C.; Briese, Thomas; Bell, Todd M.; Pratt, William D.; Shamblin, Joshua D.; Esham, Heather L.; Donnelly, Ginger C.; Johnson, Joshua C.; Hensley, Lisa E.; Lipkin, W. Ian; Honko, Anna N.

    2015-01-01

    Henipaviruses are implicated in severe and frequently fatal pneumonia and encephalitis in humans. There are no approved vaccines or treatments available for human use, and testing of candidates requires the use of well-characterized animal models that mimic human disease. We performed a comprehensive and statistically-powered evaluation of the African green monkey model to define parameters critical to disease progression and the extent to which they correlate with human disease. African green monkeys were inoculated by the intratracheal route with 2.5×104 plaque forming units of the Malaysia strain of Nipah virus. Physiological data captured using telemetry implants and assessed in conjunction with clinical pathology were consistent with shock, and histopathology confirmed widespread tissue involvement associated with systemic vasculitis in animals that succumbed to acute disease. In addition, relapse encephalitis was identified in 100% of animals that survived beyond the acute disease phase. Our data suggest that disease progression in the African green monkey is comparable to the variable outcome of Nipah virus infection in humans. PMID:25706617

  6. A comparative analysis of chaotic particle swarm optimizations for detecting single nucleotide polymorphism barcodes.

    PubMed

    Chuang, Li-Yeh; Moi, Sin-Hua; Lin, Yu-Da; Yang, Cheng-Hong

    2016-10-01

    Evolutionary algorithms could overcome the computational limitations for the statistical evaluation of large datasets for high-order single nucleotide polymorphism (SNP) barcodes. Previous studies have proposed several chaotic particle swarm optimization (CPSO) methods to detect SNP barcodes for disease analysis (e.g., for breast cancer and chronic diseases). This work evaluated additional chaotic maps combined with the particle swarm optimization (PSO) method to detect SNP barcodes using a high-dimensional dataset. Nine chaotic maps were used to improve PSO method results and compared the searching ability amongst all CPSO methods. The XOR and ZZ disease models were used to compare all chaotic maps combined with PSO method. Efficacy evaluations of CPSO methods were based on statistical values from the chi-square test (χ 2 ). The results showed that chaotic maps could improve the searching ability of PSO method when population are trapped in the local optimum. The minor allele frequency (MAF) indicated that, amongst all CPSO methods, the numbers of SNPs, sample size, and the highest χ 2 value in all datasets were found in the Sinai chaotic map combined with PSO method. We used the simple linear regression results of the gbest values in all generations to compare the all methods. Sinai chaotic map combined with PSO method provided the highest β values (β≥0.32 in XOR disease model and β≥0.04 in ZZ disease model) and the significant p-value (p-value<0.001 in both the XOR and ZZ disease models). The Sinai chaotic map was found to effectively enhance the fitness values (χ 2 ) of PSO method, indicating that the Sinai chaotic map combined with PSO method is more effective at detecting potential SNP barcodes in both the XOR and ZZ disease models. Copyright © 2016 Elsevier B.V. All rights reserved.

  7. Presymptomatic and longitudinal neuroimaging in neurodegeneration--from snapshots to motion picture: a systematic review.

    PubMed

    Schuster, Christina; Elamin, Marwa; Hardiman, Orla; Bede, Peter

    2015-10-01

    Recent quantitative neuroimaging studies have been successful in capturing phenotype and genotype-specific changes in dementia syndromes, amyotrophic lateral sclerosis, Parkinson's disease and other neurodegenerative conditions. However, the majority of imaging studies are cross-sectional, despite the obvious superiority of longitudinal study designs in characterising disease trajectories, response to therapy, progression rates and evaluating the presymptomatic phase of neurodegenerative conditions. The aim of this work is to perform a systematic review of longitudinal imaging initiatives in neurodegeneration focusing on methodology, optimal statistical models, follow-up intervals, attrition rates, primary study outcomes and presymptomatic studies. Longitudinal imaging studies were identified from 'PubMed' and reviewed from 1990 to 2014. The search terms 'longitudinal', 'MRI', 'presymptomatic' and 'imaging' were utilised in combination with one of the following degenerative conditions; Alzheimer's disease, amyotrophic lateral sclerosis/motor neuron disease, frontotemporal dementia, Huntington's disease, multiple sclerosis, Parkinson's disease, ataxia, HIV, alcohol abuse/dependence. A total of 423 longitudinal imaging papers and 103 genotype-based presymptomatic studies were identified and systematically reviewed. Imaging techniques, follow-up intervals and attrition rates showed significant variation depending on the primary diagnosis. Commonly used statistical models included analysis of annualised percentage change, mixed and random effect models, and non-linear cumulative models with acceleration-deceleration components. Although longitudinal imaging studies have the potential to provide crucial insights into the presymptomatic phase and natural trajectory of neurodegenerative processes a standardised design is required to enable meaningful data interpretation. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.

  8. Risk assessment of vector-borne diseases for public health governance.

    PubMed

    Sedda, L; Morley, D W; Braks, M A H; De Simone, L; Benz, D; Rogers, D J

    2014-12-01

    In the context of public health, risk governance (or risk analysis) is a framework for the assessment and subsequent management and/or control of the danger posed by an identified disease threat. Generic frameworks in which to carry out risk assessment have been developed by various agencies. These include monitoring, data collection, statistical analysis and dissemination. Due to the inherent complexity of disease systems, however, the generic approach must be modified for individual, disease-specific risk assessment frameworks. The analysis was based on the review of the current risk assessments of vector-borne diseases adopted by the main Public Health organisations (OIE, WHO, ECDC, FAO, CDC etc…). Literature, legislation and statistical assessment of the risk analysis frameworks. This review outlines the need for the development of a general public health risk assessment method for vector-borne diseases, in order to guarantee that sufficient information is gathered to apply robust models of risk assessment. Stochastic (especially spatial) methods, often in Bayesian frameworks are now gaining prominence in standard risk assessment procedures because of their ability to assess accurately model uncertainties. Risk assessment needs to be addressed quantitatively wherever possible, and submitted with its quality assessment in order to enable successful public health measures to be adopted. In terms of current practice, often a series of different models and analyses are applied to the same problem, with results and outcomes that are difficult to compare because of the unknown model and data uncertainties. Therefore, the risk assessment areas in need of further research are identified in this article. Copyright © 2014 The Royal Society for Public Health. Published by Elsevier Ltd. All rights reserved.

  9. EpiModel: An R Package for Mathematical Modeling of Infectious Disease over Networks.

    PubMed

    Jenness, Samuel M; Goodreau, Steven M; Morris, Martina

    2018-04-01

    Package EpiModel provides tools for building, simulating, and analyzing mathematical models for the population dynamics of infectious disease transmission in R. Several classes of models are included, but the unique contribution of this software package is a general stochastic framework for modeling the spread of epidemics on networks. EpiModel integrates recent advances in statistical methods for network analysis (temporal exponential random graph models) that allow the epidemic modeling to be grounded in empirical data on contacts that can spread infection. This article provides an overview of both the modeling tools built into EpiModel , designed to facilitate learning for students new to modeling, and the application programming interface for extending package EpiModel , designed to facilitate the exploration of novel research questions for advanced modelers.

  10. EpiModel: An R Package for Mathematical Modeling of Infectious Disease over Networks

    PubMed Central

    Jenness, Samuel M.; Goodreau, Steven M.; Morris, Martina

    2018-01-01

    Package EpiModel provides tools for building, simulating, and analyzing mathematical models for the population dynamics of infectious disease transmission in R. Several classes of models are included, but the unique contribution of this software package is a general stochastic framework for modeling the spread of epidemics on networks. EpiModel integrates recent advances in statistical methods for network analysis (temporal exponential random graph models) that allow the epidemic modeling to be grounded in empirical data on contacts that can spread infection. This article provides an overview of both the modeling tools built into EpiModel, designed to facilitate learning for students new to modeling, and the application programming interface for extending package EpiModel, designed to facilitate the exploration of novel research questions for advanced modelers. PMID:29731699

  11. Application of a Novel Grey Self-Memory Coupling Model to Forecast the Incidence Rates of Two Notifiable Diseases in China: Dysentery and Gonorrhea

    PubMed Central

    Guo, Xiaojun; Liu, Sifeng; Wu, Lifeng; Tang, Lingling

    2014-01-01

    Objective In this study, a novel grey self-memory coupling model was developed to forecast the incidence rates of two notifiable infectious diseases (dysentery and gonorrhea); the effectiveness and applicability of this model was assessed based on its ability to predict the epidemiological trend of infectious diseases in China. Methods The linear model, the conventional GM(1,1) model and the GM(1,1) model with self-memory principle (SMGM(1,1) model) were used to predict the incidence rates of the two notifiable infectious diseases based on statistical incidence data. Both simulation accuracy and prediction accuracy were assessed to compare the predictive performances of the three models. The best-fit model was applied to predict future incidence rates. Results Simulation results show that the SMGM(1,1) model can take full advantage of the systematic multi-time historical data and possesses superior predictive performance compared with the linear model and the conventional GM(1,1) model. By applying the novel SMGM(1,1) model, we obtained the possible incidence rates of the two representative notifiable infectious diseases in China. Conclusion The disadvantages of the conventional grey prediction model, such as sensitivity to initial value, can be overcome by the self-memory principle. The novel grey self-memory coupling model can predict the incidence rates of infectious diseases more accurately than the conventional model, and may provide useful references for making decisions involving infectious disease prevention and control. PMID:25546054

  12. Application of a novel grey self-memory coupling model to forecast the incidence rates of two notifiable diseases in China: dysentery and gonorrhea.

    PubMed

    Guo, Xiaojun; Liu, Sifeng; Wu, Lifeng; Tang, Lingling

    2014-01-01

    In this study, a novel grey self-memory coupling model was developed to forecast the incidence rates of two notifiable infectious diseases (dysentery and gonorrhea); the effectiveness and applicability of this model was assessed based on its ability to predict the epidemiological trend of infectious diseases in China. The linear model, the conventional GM(1,1) model and the GM(1,1) model with self-memory principle (SMGM(1,1) model) were used to predict the incidence rates of the two notifiable infectious diseases based on statistical incidence data. Both simulation accuracy and prediction accuracy were assessed to compare the predictive performances of the three models. The best-fit model was applied to predict future incidence rates. Simulation results show that the SMGM(1,1) model can take full advantage of the systematic multi-time historical data and possesses superior predictive performance compared with the linear model and the conventional GM(1,1) model. By applying the novel SMGM(1,1) model, we obtained the possible incidence rates of the two representative notifiable infectious diseases in China. The disadvantages of the conventional grey prediction model, such as sensitivity to initial value, can be overcome by the self-memory principle. The novel grey self-memory coupling model can predict the incidence rates of infectious diseases more accurately than the conventional model, and may provide useful references for making decisions involving infectious disease prevention and control.

  13. A flexibly shaped space-time scan statistic for disease outbreak detection and monitoring.

    PubMed

    Takahashi, Kunihiko; Kulldorff, Martin; Tango, Toshiro; Yih, Katherine

    2008-04-11

    Early detection of disease outbreaks enables public health officials to implement disease control and prevention measures at the earliest possible time. A time periodic geographical disease surveillance system based on a cylindrical space-time scan statistic has been used extensively for disease surveillance along with the SaTScan software. In the purely spatial setting, many different methods have been proposed to detect spatial disease clusters. In particular, some spatial scan statistics are aimed at detecting irregularly shaped clusters which may not be detected by the circular spatial scan statistic. Based on the flexible purely spatial scan statistic, we propose a flexibly shaped space-time scan statistic for early detection of disease outbreaks. The performance of the proposed space-time scan statistic is compared with that of the cylindrical scan statistic using benchmark data. In order to compare their performances, we have developed a space-time power distribution by extending the purely spatial bivariate power distribution. Daily syndromic surveillance data in Massachusetts, USA, are used to illustrate the proposed test statistic. The flexible space-time scan statistic is well suited for detecting and monitoring disease outbreaks in irregularly shaped areas.

  14. Time series models of environmental exposures: Good predictions or good understanding.

    PubMed

    Barnett, Adrian G; Stephen, Dimity; Huang, Cunrui; Wolkewitz, Martin

    2017-04-01

    Time series data are popular in environmental epidemiology as they make use of the natural experiment of how changes in exposure over time might impact on disease. Many published time series papers have used parameter-heavy models that fully explained the second order patterns in disease to give residuals that have no short-term autocorrelation or seasonality. This is often achieved by including predictors of past disease counts (autoregression) or seasonal splines with many degrees of freedom. These approaches give great residuals, but add little to our understanding of cause and effect. We argue that modelling approaches should rely more on good epidemiology and less on statistical tests. This includes thinking about causal pathways, making potential confounders explicit, fitting a limited number of models, and not over-fitting at the cost of under-estimating the true association between exposure and disease. Copyright © 2017 Elsevier Inc. All rights reserved.

  15. Efficient Learning of Continuous-Time Hidden Markov Models for Disease Progression

    PubMed Central

    Liu, Yu-Ying; Li, Shuang; Li, Fuxin; Song, Le; Rehg, James M.

    2016-01-01

    The Continuous-Time Hidden Markov Model (CT-HMM) is an attractive approach to modeling disease progression due to its ability to describe noisy observations arriving irregularly in time. However, the lack of an efficient parameter learning algorithm for CT-HMM restricts its use to very small models or requires unrealistic constraints on the state transitions. In this paper, we present the first complete characterization of efficient EM-based learning methods for CT-HMM models. We demonstrate that the learning problem consists of two challenges: the estimation of posterior state probabilities and the computation of end-state conditioned statistics. We solve the first challenge by reformulating the estimation problem in terms of an equivalent discrete time-inhomogeneous hidden Markov model. The second challenge is addressed by adapting three approaches from the continuous time Markov chain literature to the CT-HMM domain. We demonstrate the use of CT-HMMs with more than 100 states to visualize and predict disease progression using a glaucoma dataset and an Alzheimer’s disease dataset. PMID:27019571

  16. Protein Biomarkers of New-Onset Cardiovascular Disease: A Prospective Study from the Systems Approach to Biomarker Research in Cardiovascular Disease (SABRe CVD) Initiative

    PubMed Central

    Yin, Xiaoyan; Subramanian, Subha; Hwang, Shih-Jen; O’Donnell, Christopher J.; Fox, Caroline S.; Courchesne, Paul; Muntendam, Pieter; Adourian, Aram; Juhasz, Peter; Larson, Martin G.; Levy, Daniel

    2014-01-01

    Objective Incorporation of novel plasma protein biomarkers may improve current models for prediction of atherosclerotic cardiovascular disease (ASCVD) risk. Approach and Results We utilized discovery mass spectrometry (MS) to determine plasma concentrations of 861 proteins in 135 myocardial infarction (MI) cases and 135 matched controls. We then measured 59markers by targeted MS in 336 ASCVD case-control pairs. Associations with MI or ASCVD were tested in single marker and multimarker analyses adjusted for established ASCVD risk factors. Twelve single markers from discovery MS were associated with MI incidence (at p<0.01) adjusting for clinical risk factors. Seven proteins in aggregate (cyclophilin A, CD5 antigen-like, cell surface glycoprotein MUC18, collagen-alpha 1 [XVIII] chain, salivary alpha-amylase 1, C-reactive protein, and multimerin-2) were highly associated with MI (p<0.0001) and significantly improved its prediction compared to a model with clinical risk factors alone (C-statistic of 0.71 vs. 0.84). Through targeted MS, twelve single proteins were predictors of ASCVD (at p<0.05) after adjusting for established risk factors. In multimarker analyses, four proteins in combination (alpha-1-acid glycoprotein 1, paraoxonase 1, tetranectin, and CD5 antigen-like, predicted incident ASCVD (p<0.0001) and moderately improved the C-statistic from the model with clinical covariates alone (C-statistic of 0.69 vs. 0.73). Conclusions Proteomics profiling identified single and multimarker protein panels that are associated with new onset ASCVD and may lead to a better understanding of underlying disease mechanisms. Our findings include many novel protein biomarkers that, if externally validated, may improve risk assessment for MI and ASCVD. PMID:24526693

  17. Endovascular Treatment of Diabetic Foot in a Selected Population of Patients with Below-the-Knee Disease: Is the Angiosome Model Effective?

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fossaceca, Rita, E-mail: rfossaceca@hotmail.com; Guzzardi, Giuseppe, E-mail: guz@libero.it; Cerini, Paolo, E-mail: cerini84@hotmail.it

    Purpose. To evaluate the efficacy of percutaneous transluminal angioplasty (PTA) in a selected population of diabetic patients with below-the-knee (BTK) disease and to analyze the reliability of the angiosome model. Methods. We made a retrospective analysis of the results of PTA performed in 201 diabetic patients with BTK-only disease treated at our institute from January 2005 to December 2011. We evaluated the postoperative technical success, and at 1, 6, and 12 months' follow-up, we assessed the rates and values of partial and complete ulcer healing, restenosis, major and minor amputation, limb salvage, and percutaneous oximetry (TcPO{sub 2}) (Student's t test).more » We used the angiosome model to compare different clinicolaboratory outcomes in patients treated by direct revascularization (DR) from patients treated with indirect revascularization (IR) technique by Student's t test and the {chi}{sup 2} test. Results. At a mean {+-} standard deviation follow-up of 17.5 {+-} 12 months, we observed a mortality rate of 3.5 %, a major amputation rate of 9.4 %, and a limb salvage rate of 87 % with a statistically significant increase of TcPO{sub 2} values at follow-up compared to baseline (p < 0.05). In 34 patients, treatment was performed with the IR technique and in 167 by DR; in both groups, there was a statistically significant increase of TcPO{sub 2} values at follow-up compared to baseline (p < 0.05), without statistically significant differences in therapeutic efficacy. Conclusion. PTA of the BTK-only disease is a safe and effective option. The DR technique is the first treatment option; we believe, however, that IR is similarly effective, with good results over time.« less

  18. Trends in Mortality After Primary Cytoreductive Surgery for Ovarian Cancer: A Systematic Review and Metaregression of Randomized Clinical Trials and Observational Studies.

    PubMed

    Di Donato, Violante; Kontopantelis, Evangelos; Aletti, Giovanni; Casorelli, Assunta; Piacenti, Ilaria; Bogani, Giorgio; Lecce, Francesca; Benedetti Panici, Pierluigi

    2017-06-01

    Primary cytoreductive surgery (PDS) followed by platinum-based chemotherapy is the cornerstone of treatment and the absence of residual tumor after PDS is universally considered the most important prognostic factor. The aim of the present analysis was to evaluate trend and predictors of 30-day mortality in patients undergoing primary cytoreduction for ovarian cancer. Literature was searched for records reporting 30-day mortality after PDS. All cohorts were rated for quality. Simple and multiple Poisson regression models were used to quantify the association between 30-day mortality and the following: overall or severe complications, proportion of patients with stage IV disease, median age, year of publication, and weighted surgical complexity index. Using the multiple regression model, we calculated the risk of perioperative mortality at different levels for statistically significant covariates of interest. Simple regression identified median age and proportion of patients with stage IV disease as statistically significant predictors of 30-day mortality. When included in the multiple Poisson regression model, both remained statistically significant, with an incidence rate ratio of 1.087 for median age and 1.017 for stage IV disease. Disease stage was a strong predictor, with the risk estimated to increase from 2.8% (95% confidence interval 2.02-3.66) for stage III to 16.1% (95% confidence interval 6.18-25.93) for stage IV, for a cohort with a median age of 65 years. Metaregression demonstrated that increased age and advanced clinical stage were independently associated with an increased risk of mortality, and the combined effects of both factors greatly increased the risk.

  19. Transformation of Summary Statistics from Linear Mixed Model Association on All-or-None Traits to Odds Ratio.

    PubMed

    Lloyd-Jones, Luke R; Robinson, Matthew R; Yang, Jian; Visscher, Peter M

    2018-04-01

    Genome-wide association studies (GWAS) have identified thousands of loci that are robustly associated with complex diseases. The use of linear mixed model (LMM) methodology for GWAS is becoming more prevalent due to its ability to control for population structure and cryptic relatedness and to increase power. The odds ratio (OR) is a common measure of the association of a disease with an exposure ( e.g. , a genetic variant) and is readably available from logistic regression. However, when the LMM is applied to all-or-none traits it provides estimates of genetic effects on the observed 0-1 scale, a different scale to that in logistic regression. This limits the comparability of results across studies, for example in a meta-analysis, and makes the interpretation of the magnitude of an effect from an LMM GWAS difficult. In this study, we derived transformations from the genetic effects estimated under the LMM to the OR that only rely on summary statistics. To test the proposed transformations, we used real genotypes from two large, publicly available data sets to simulate all-or-none phenotypes for a set of scenarios that differ in underlying model, disease prevalence, and heritability. Furthermore, we applied these transformations to GWAS summary statistics for type 2 diabetes generated from 108,042 individuals in the UK Biobank. In both simulation and real-data application, we observed very high concordance between the transformed OR from the LMM and either the simulated truth or estimates from logistic regression. The transformations derived and validated in this study improve the comparability of results from prospective and already performed LMM GWAS on complex diseases by providing a reliable transformation to a common comparative scale for the genetic effects. Copyright © 2018 by the Genetics Society of America.

  20. Spatial modelling of disease using data- and knowledge-driven approaches.

    PubMed

    Stevens, Kim B; Pfeiffer, Dirk U

    2011-09-01

    The purpose of spatial modelling in animal and public health is three-fold: describing existing spatial patterns of risk, attempting to understand the biological mechanisms that lead to disease occurrence and predicting what will happen in the medium to long-term future (temporal prediction) or in different geographical areas (spatial prediction). Traditional methods for temporal and spatial predictions include general and generalized linear models (GLM), generalized additive models (GAM) and Bayesian estimation methods. However, such models require both disease presence and absence data which are not always easy to obtain. Novel spatial modelling methods such as maximum entropy (MAXENT) and the genetic algorithm for rule set production (GARP) require only disease presence data and have been used extensively in the fields of ecology and conservation, to model species distribution and habitat suitability. Other methods, such as multicriteria decision analysis (MCDA), use knowledge of the causal factors of disease occurrence to identify areas potentially suitable for disease. In addition to their less restrictive data requirements, some of these novel methods have been shown to outperform traditional statistical methods in predictive ability (Elith et al., 2006). This review paper provides details of some of these novel methods for mapping disease distribution, highlights their advantages and limitations, and identifies studies which have used the methods to model various aspects of disease distribution. Copyright © 2011. Published by Elsevier Ltd.

  1. Update on work-related psychosocial factors and the development of ischemic heart disease: a systematic review.

    PubMed

    Pejtersen, Jan Hyld; Burr, Hermann; Hannerz, Harald; Fishta, Alba; Hurwitz Eller, Nanna

    2015-01-01

    The present review deals with the relationship between occupational psychosocial factors and the incidence of ischemic heart disease (IHD) with special regard to the statistical power of the findings. This review with 4 inclusion criteria is an update of a 2009 review of which the first 3 criteria were included in the original review: (1) STUDY: a prospective or case-control study if exposure was not self-reported (prognostic studies excluded); (2) OUTCOME: definite IHD determined externally; (3) EXPOSURE: psychosocial factors at work (excluding shift work, trauma, violence or accidents, and social capital); and (4) Statistical power: acceptable to detect a 20% increased risk in IHD. Eleven new papers met the inclusion criteria 1-3; a total of 44 papers were evaluated regarding inclusion criteria 4. Of 169 statistical analyses, only 10 analyses in 2 papers had acceptable statistical power. The results of the 2 papers pointed in the same direction, namely that only the control dimension of job strain explained the excess risk for myocardial infarction for job strain. The large number of underpowered studies and the focus on psychosocial models, such as the job strain models, make it difficult to determine to what extent psychosocial factors at work are risk factors of IHD. There is a need for considering statistical power when planning studies.

  2. Designing clinical trials to test disease-modifying agents: application to the treatment trials of Alzheimer's disease.

    PubMed

    Xiong, Chengjie; van Belle, Gerald; Miller, J Philip; Morris, John C

    2011-02-01

    Therapeutic trials of disease-modifying agents on Alzheimer's disease (AD) require novel designs and analyses involving switch of treatments for at least a portion of subjects enrolled. Randomized start and randomized withdrawal designs are two examples of such designs. Crucial design parameters such as sample size and the time of treatment switch are important to understand in designing such clinical trials. The purpose of this article is to provide methods to determine sample sizes and time of treatment switch as well as optimum statistical tests of treatment efficacy for clinical trials of disease-modifying agents on AD. A general linear mixed effects model is proposed to test the disease-modifying efficacy of novel therapeutic agents on AD. This model links the longitudinal growth from both the placebo arm and the treatment arm at the time of treatment switch for these in the delayed treatment arm or early withdrawal arm and incorporates the potential correlation on the rate of cognitive change before and after the treatment switch. Sample sizes and the optimum time for treatment switch of such trials as well as optimum test statistic for the treatment efficacy are determined according to the model. Assuming an evenly spaced longitudinal design over a fixed duration, the optimum treatment switching time in a randomized start or a randomized withdrawal trial is half way through the trial. With the optimum test statistic for the treatment efficacy and over a wide spectrum of model parameters, the optimum sample size allocations are fairly close to the simplest design with a sample size ratio of 1:1:1 among the treatment arm, the delayed treatment or early withdrawal arm, and the placebo arm. The application of the proposed methodology to AD provides evidence that much larger sample sizes are required to adequately power disease-modifying trials when compared with those for symptomatic agents, even when the treatment switch time and efficacy test are optimally chosen. The proposed method assumes that the only and immediate effect of treatment switch is on the rate of cognitive change. Crucial design parameters for the clinical trials of disease-modifying agents on AD can be optimally chosen. Government and industry officials as well as academia researchers should consider the optimum use of the clinical trials design for disease-modifying agents on AD in their effort to search for the treatments with the potential to modify the underlying pathophysiology of AD.

  3. Spatial statistical analysis of basal stem root disease under natural field epidemic of oil palm

    NASA Astrophysics Data System (ADS)

    Kamu, Assis; Phin, Chong Khim; Seman, Idris Abu; Wan, Hoong Hak; Mun, Ho Chong

    2015-02-01

    Oil palm or scientifically known as Elaeis guineensis Jacq. is the most important commodity crop in Malaysia and has greatly contributed to the economy growth of the country. As far as disease is concerned in the industry, Basal Stem Rot (BSR) caused by Ganoderma boninence remains the most important disease. BSR disease is the most widely studied with information available for oil palm disease in Malaysia. However, there is still limited study on the spatial as well as temporal pattern or distribution of the disease especially under natural field epidemic condition in oil palm plantation. The objective of this study is to spatially identify the pattern of BSR disease under natural field epidemic using two geospatial analytical techniques, which are quadrat analysis for the first order properties of partial pattern analysis and nearest-neighbor analysis (NNA) for the second order properties of partial pattern analysis. Two study sites were selected with different age of tree. Both sites are located in Tawau, Sabah and managed by the same company. The results showed that at least one of the point pattern analysis used which is NNA (i.e. the second order properties of partial pattern analysis) has confirmed the disease is complete spatial randomness. This suggests the spread of the disease is not from tree to tree and the age of palm does not play a significance role in determining the spatial pattern of the disease. From the spatial pattern of the disease, it would help in the disease management program and for the industry in the future. The statistical modelling is expected to help in identifying the right model to estimate the yield loss of oil palm due to BSR disease in the future.

  4. Biomechanisms of Comorbidity: Reviewing Integrative Analyses of Multi-omics Datasets and Electronic Health Records.

    PubMed

    Pouladi, N; Achour, I; Li, H; Berghout, J; Kenost, C; Gonzalez-Garay, M L; Lussier, Y A

    2016-11-10

    Disease comorbidity is a pervasive phenomenon impacting patients' health outcomes, disease management, and clinical decisions. This review presents past, current and future research directions leveraging both phenotypic and molecular information to uncover disease similarity underpinning the biology and etiology of disease comorbidity. We retrieved ~130 publications and retained 59, ranging from 2006 to 2015, that comprise a minimum number of five diseases and at least one type of biomolecule. We surveyed their methods, disease similarity metrics, and calculation of comorbidities in the electronic health records, if present. Among the surveyed studies, 44% generated or validated disease similarity metrics in context of comorbidity, with 60% being published in the last two years. As inputs, 87% of studies utilized intragenic loci and proteins while 13% employed RNA (mRNA, LncRNA or miRNA). Network modeling was predominantly used (35%) followed by statistics (28%) to impute similarity between these biomolecules and diseases. Studies with large numbers of biomolecules and diseases used network models or naïve overlap of disease-molecule associations, while machine learning, statistics, and information retrieval were utilized in smaller and moderate sized studies. Multiscale computations comprising shared function, network topology, and phenotypes were performed exclusively on proteins. This review highlighted the growing methods for identifying the molecular mechanisms underpinning comorbidities that leverage multiscale molecular information and patterns from electronic health records. The survey unveiled that intergenic polymorphisms have been overlooked for similarity imputation compared to their intragenic counterparts, offering new opportunities to bridge the mechanistic and similarity gaps of comorbidity.

  5. Indiana chronic disease management program risk stratification analysis.

    PubMed

    Li, Jingjin; Holmes, Ann M; Rosenman, Marc B; Katz, Barry P; Downs, Stephen M; Murray, Michael D; Ackermann, Ronald T; Inui, Thomas S

    2005-10-01

    The objective of this study was to compare the ability of risk stratification models derived from administrative data to classify groups of patients for enrollment in a tailored chronic disease management program. This study included 19,548 Medicaid patients with chronic heart failure or diabetes in the Indiana Medicaid data warehouse during 2001 and 2002. To predict costs (total claims paid) in FY 2002, we considered candidate predictor variables available in FY 2001, including patient characteristics, the number and type of prescription medications, laboratory tests, pharmacy charges, and utilization of primary, specialty, inpatient, emergency department, nursing home, and home health care. We built prospective models to identify patients with different levels of expenditure. Model fit was assessed using R statistics, whereas discrimination was assessed using the weighted kappa statistic, predictive ratios, and the area under the receiver operating characteristic curve. We found a simple least-squares regression model in which logged total charges in FY 2002 were regressed on the log of total charges in FY 2001, the number of prescriptions filled in FY 2001, and the FY 2001 eligibility category, performed as well as more complex models. This simple 3-parameter model had an R of 0.30 and, in terms in classification efficiency, had a sensitivity of 0.57, a specificity of 0.90, an area under the receiver operator curve of 0.80, and a weighted kappa statistic of 0.51. This simple model based on readily available administrative data stratified Medicaid members according to predicted future utilization as well as more complicated models.

  6. Tissue Chips to aid drug development and modeling for rare diseases

    PubMed Central

    Low, Lucie A.; Tagle, Danilo A.

    2016-01-01

    Introduction The technologies used to design, create and use microphysiological systems (MPS, “tissue chips” or “organs-on-chips”) have progressed rapidly in the last 5 years, and validation studies of the functional relevance of these platforms to human physiology, and response to drugs for individual model organ systems, are well underway. These studies are paving the way for integrated multi-organ systems that can model diseases and predict drug efficacy and toxicology of multiple organs in real-time, improving the potential for diagnostics and development of novel treatments of rare diseases in the future. Areas covered This review will briefly summarize the current state of tissue chip research and highlight model systems where these microfabricated (or bioengineered) devices are already being used to screen therapeutics, model disease states, and provide potential treatments in addition to helping elucidate the basic molecular and cellular phenotypes of rare diseases. Expert opinion Microphysiological systems hold great promise and potential for modeling rare disorders, as well as for their potential use to enhance the predictive power of new drug therapeutics, plus potentially increase the statistical power of clinical trials while removing the inherent risks of these trials in rare disease populations. PMID:28626620

  7. Brain age and other bodily 'ages': implications for neuropsychiatry.

    PubMed

    Cole, James H; Marioni, Riccardo E; Harris, Sarah E; Deary, Ian J

    2018-06-11

    As our brains age, we tend to experience cognitive decline and are at greater risk of neurodegenerative disease and dementia. Symptoms of chronic neuropsychiatric diseases are also exacerbated during ageing. However, the ageing process does not affect people uniformly; nor, in fact, does the ageing process appear to be uniform even within an individual. Here, we outline recent neuroimaging research into brain ageing and the use of other bodily ageing biomarkers, including telomere length, the epigenetic clock, and grip strength. Some of these techniques, using statistical approaches, have the ability to predict chronological age in healthy people. Moreover, they are now being applied to neurological and psychiatric disease groups to provide insights into how these diseases interact with the ageing process and to deliver individualised predictions about future brain and body health. We discuss the importance of integrating different types of biological measurements, from both the brain and the rest of the body, to build more comprehensive models of the biological ageing process. Finally, we propose seven steps for the field of brain-ageing research to take in coming years. This will help us reach the long-term goal of developing clinically applicable statistical models of biological processes to measure, track and predict brain and body health in ageing and disease.

  8. Statistical Modeling of Caregiver Burden and Distress among Informal Caregivers of Individuals with Amyotrophic Lateral Sclerosis, Alzheimer's Disease, and Cancer

    ERIC Educational Resources Information Center

    Cumming, John McClure

    2011-01-01

    Caregiver burden and distress have been associated with informal caregivers. Research findings on the specific aspects of the caregiving role that influence burden are mixed. Factors such as amount of time per day giving care and specific characteristics about the disease progression have been linked to caregiver burden and distress. Other…

  9. A conceptual disease model for adult Pompe disease.

    PubMed

    Kanters, Tim A; Redekop, W Ken; Rutten-Van Mölken, Maureen P M H; Kruijshaar, Michelle E; Güngör, Deniz; van der Ploeg, Ans T; Hakkaart, Leona

    2015-09-15

    Studies in orphan diseases are, by nature, confronted with small patient populations, meaning that randomized controlled trials will have limited statistical power. In order to estimate the effectiveness of treatments in orphan diseases and extrapolate effects into the future, alternative models might be needed. The purpose of this study is to develop a conceptual disease model for Pompe disease in adults (an orphan disease). This conceptual model describes the associations between the most important levels of health concepts for Pompe disease in adults, from biological parameters via physiological parameters, symptoms and functional indicators to health perceptions and final health outcomes as measured in terms of health-related quality of life. The structure of the Wilson-Cleary health outcomes model was used as a blueprint, and filled with clinically relevant aspects for Pompe disease based on literature and expert opinion. Multiple observations per patient from a Dutch cohort study in untreated patients were used to quantify the relationships between the different levels of health concepts in the model by means of regression analyses. Enzyme activity, muscle strength, respiratory function, fatigue, level of handicap, general health perceptions, mental and physical component scales and utility described the different levels of health concepts in the Wilson-Cleary model for Pompe disease. Regression analyses showed that functional status was affected by fatigue, muscle strength and respiratory function. Health perceptions were affected by handicap. In turn, self-reported quality of life was affected by health perceptions. We conceptualized a disease model that incorporated the mechanisms believed to be responsible for impaired quality of life in Pompe disease. The model provides a comprehensive overview of various aspects of Pompe disease in adults, which can be useful for both clinicians and policymakers to support their multi-faceted decision making.

  10. Early-Life State-of-Residence Characteristics and Later Life Hypertension, Diabetes, and Ischemic Heart Disease.

    PubMed

    Rehkopf, David H; Eisen, Ellen A; Modrek, Sepideh; Mokyr Horner, Elizabeth; Goldstein, Benjamin; Costello, Sadie; Cantley, Linda F; Slade, Martin D; Cullen, Mark R

    2015-08-01

    We examined how state characteristics in early life are associated with individual chronic disease later in life. We assessed early-life state of residence using the first 3 digits of social security numbers from blue- and white-collar workers from a US manufacturing company. Longitudinal data were available from 1997 to 2012, with 305 936 person-years of observation. Disease was assessed using medical claims. We modeled associations using pooled logistic regression with inverse probability of censoring weights. We found small but statistically significant associations between early-state-of-residence characteristics and later life hypertension, diabetes, and ischemic heart disease. The most consistent associations were with income inequality, percentage non-White, and education. These associations were similar after statistically controlling for individual socioeconomic and demographic characteristics and current state characteristics. Characteristics of the state in which an individual lives early in life are associated with prevalence of chronic disease later in life, with a strength of association equivalent to genetic associations found for these same health outcomes.

  11. Measuring outcomes of type 2 diabetes disease management program in an HMO setting.

    PubMed

    Ibrahim, Ibrahim Awad; Beich, Jeff; Sidorov, Jaan; Gabbay, Robert; Yu, Lucy

    2002-01-01

    There is a need to evaluate empirical disease management programs used in managing chronic diseases such as diabetes mellitus in managed care settings. We analyzed data from 252 patients with type 2 diabetes before and 1 year after enrollment in a disease management program. We examined clinical indicators such as HbA1C, HDL, LDL, total cholesterol, diastolic blood pressure, and BMI in addition to self-reported health status measured by SF-36 instrument. All clinical indicators showed statistically and clinically significant improvements. Only vitality and mental health showed statistically significant improvements in health status. Weak to moderate significant correlation between clinical indicators and health status was observed. Disease management can be effective at making significant clinical improvements for participants in a mixed-model HMO setting. No strong relationship between clinical indicators and health status was found. Future research is needed using a more specific health status measuring instrument and a randomized clinical trial design.

  12. A mechanistic spatio-temporal framework for modelling individual-to-individual transmission—With an application to the 2014-2015 West Africa Ebola outbreak

    PubMed Central

    McClelland, Amanda; Zelner, Jon; Streftaris, George; Funk, Sebastian; Metcalf, Jessica; Dalziel, Benjamin D.; Grenfell, Bryan T.

    2017-01-01

    In recent years there has been growing availability of individual-level spatio-temporal disease data, particularly due to the use of modern communicating devices with GPS tracking functionality. These detailed data have been proven useful for inferring disease transmission to a more refined level than previously. However, there remains a lack of statistically sound frameworks to model the underlying transmission dynamic in a mechanistic manner. Such a development is particularly crucial for enabling a general epidemic predictive framework at the individual level. In this paper we propose a new statistical framework for mechanistically modelling individual-to-individual disease transmission in a landscape with heterogeneous population density. Our methodology is first tested using simulated datasets, validating our inferential machinery. The methodology is subsequently applied to data that describes a regional Ebola outbreak in Western Africa (2014-2015). Our results show that the methods are able to obtain estimates of key epidemiological parameters that are broadly consistent with the literature, while revealing a significantly shorter distance of transmission. More importantly, in contrast to existing approaches, we are able to perform a more general model prediction that takes into account the susceptible population. Finally, our results show that, given reasonable scenarios, the framework can be an effective surrogate for susceptible-explicit individual models which are often computationally challenging. PMID:29084216

  13. A mechanistic spatio-temporal framework for modelling individual-to-individual transmission-With an application to the 2014-2015 West Africa Ebola outbreak.

    PubMed

    Lau, Max S Y; Gibson, Gavin J; Adrakey, Hola; McClelland, Amanda; Riley, Steven; Zelner, Jon; Streftaris, George; Funk, Sebastian; Metcalf, Jessica; Dalziel, Benjamin D; Grenfell, Bryan T

    2017-10-01

    In recent years there has been growing availability of individual-level spatio-temporal disease data, particularly due to the use of modern communicating devices with GPS tracking functionality. These detailed data have been proven useful for inferring disease transmission to a more refined level than previously. However, there remains a lack of statistically sound frameworks to model the underlying transmission dynamic in a mechanistic manner. Such a development is particularly crucial for enabling a general epidemic predictive framework at the individual level. In this paper we propose a new statistical framework for mechanistically modelling individual-to-individual disease transmission in a landscape with heterogeneous population density. Our methodology is first tested using simulated datasets, validating our inferential machinery. The methodology is subsequently applied to data that describes a regional Ebola outbreak in Western Africa (2014-2015). Our results show that the methods are able to obtain estimates of key epidemiological parameters that are broadly consistent with the literature, while revealing a significantly shorter distance of transmission. More importantly, in contrast to existing approaches, we are able to perform a more general model prediction that takes into account the susceptible population. Finally, our results show that, given reasonable scenarios, the framework can be an effective surrogate for susceptible-explicit individual models which are often computationally challenging.

  14. Accurate and robust genomic prediction of celiac disease using statistical learning.

    PubMed

    Abraham, Gad; Tye-Din, Jason A; Bhalala, Oneil G; Kowalczyk, Adam; Zobel, Justin; Inouye, Michael

    2014-02-01

    Practical application of genomic-based risk stratification to clinical diagnosis is appealing yet performance varies widely depending on the disease and genomic risk score (GRS) method. Celiac disease (CD), a common immune-mediated illness, is strongly genetically determined and requires specific HLA haplotypes. HLA testing can exclude diagnosis but has low specificity, providing little information suitable for clinical risk stratification. Using six European cohorts, we provide a proof-of-concept that statistical learning approaches which simultaneously model all SNPs can generate robust and highly accurate predictive models of CD based on genome-wide SNP profiles. The high predictive capacity replicated both in cross-validation within each cohort (AUC of 0.87-0.89) and in independent replication across cohorts (AUC of 0.86-0.9), despite differences in ethnicity. The models explained 30-35% of disease variance and up to ∼43% of heritability. The GRS's utility was assessed in different clinically relevant settings. Comparable to HLA typing, the GRS can be used to identify individuals without CD with ≥99.6% negative predictive value however, unlike HLA typing, fine-scale stratification of individuals into categories of higher-risk for CD can identify those that would benefit from more invasive and costly definitive testing. The GRS is flexible and its performance can be adapted to the clinical situation by adjusting the threshold cut-off. Despite explaining a minority of disease heritability, our findings indicate a genomic risk score provides clinically relevant information to improve upon current diagnostic pathways for CD and support further studies evaluating the clinical utility of this approach in CD and other complex diseases.

  15. Predicting trauma patient mortality: ICD [or ICD-10-AM] versus AIS based approaches.

    PubMed

    Willis, Cameron D; Gabbe, Belinda J; Jolley, Damien; Harrison, James E; Cameron, Peter A

    2010-11-01

    The International Classification of Diseases Injury Severity Score (ICISS) has been proposed as an International Classification of Diseases (ICD)-10-based alternative to mortality prediction tools that use Abbreviated Injury Scale (AIS) data, including the Trauma and Injury Severity Score (TRISS). To date, studies have not examined the performance of ICISS using Australian trauma registry data. This study aimed to compare the performance of ICISS with other mortality prediction tools in an Australian trauma registry. This was a retrospective review of prospectively collected data from the Victorian State Trauma Registry. A training dataset was created for model development and a validation dataset for evaluation. The multiplicative ICISS model was compared with a worst injury ICISS approach, Victorian TRISS (V-TRISS, using local coefficients), maximum AIS severity and a multivariable model including ICD-10-AM codes as predictors. Models were investigated for discrimination (C-statistic) and calibration (Hosmer-Lemeshow statistic). The multivariable approach had the highest level of discrimination (C-statistic 0.90) and calibration (H-L 7.65, P= 0.468). Worst injury ICISS, V-TRISS and maximum AIS had similar performance. The multiplicative ICISS produced the lowest level of discrimination (C-statistic 0.80) and poorest calibration (H-L 50.23, P < 0.001). The performance of ICISS may be affected by the data used to develop estimates, the ICD version employed, the methods for deriving estimates and the inclusion of covariates. In this analysis, a multivariable approach using ICD-10-AM codes was the best-performing method. A multivariable ICISS approach may therefore be a useful alternative to AIS-based methods and may have comparable predictive performance to locally derived TRISS models. © 2010 The Authors. ANZ Journal of Surgery © 2010 Royal Australasian College of Surgeons.

  16. Development of a metabolic biosignature for detection of early Lyme disease.

    PubMed

    Molins, Claudia R; Ashton, Laura V; Wormser, Gary P; Hess, Ann M; Delorey, Mark J; Mahapatra, Sebabrata; Schriefer, Martin E; Belisle, John T

    2015-06-15

    Early Lyme disease patients often present to the clinic prior to developing a detectable antibody response to Borrelia burgdorferi, the etiologic agent. Thus, existing 2-tier serology-based assays yield low sensitivities (29%-40%) for early infection. The lack of an accurate laboratory test for early Lyme disease contributes to misconceptions about diagnosis and treatment, and underscores the need for new diagnostic approaches. Retrospective serum samples from patients with early Lyme disease, other diseases, and healthy controls were analyzed for small molecule metabolites by liquid chromatography-mass spectrometry (LC-MS). A metabolomics data workflow was applied to select a biosignature for classifying early Lyme disease and non-Lyme disease patients. A statistical model of the biosignature was trained using the patients' LC-MS data, and subsequently applied as an experimental diagnostic tool with LC-MS data from additional patient sera. The accuracy of this method was compared with standard 2-tier serology. Metabolic biosignature development selected 95 molecular features that distinguished early Lyme disease patients from healthy controls. Statistical modeling reduced the biosignature to 44 molecular features, and correctly classified early Lyme disease patients and healthy controls with a sensitivity of 88% (84%-95%), and a specificity of 95% (90%-100%). Importantly, the metabolic biosignature correctly classified 77%-95% of the of serology negative Lyme disease patients. The data provide proof-of-concept that metabolic profiling for early Lyme disease can achieve significantly greater (P < .0001) diagnostic sensitivity than current 2-tier serology, while retaining high specificity. © The Author 2015. Published by Oxford University Press on behalf of the Infectious Diseases Society of America. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  17. Hypothesis testing of a change point during cognitive decline among Alzheimer's disease patients.

    PubMed

    Ji, Ming; Xiong, Chengjie; Grundman, Michael

    2003-10-01

    In this paper, we present a statistical hypothesis test for detecting a change point over the course of cognitive decline among Alzheimer's disease patients. The model under the null hypothesis assumes a constant rate of cognitive decline over time and the model under the alternative hypothesis is a general bilinear model with an unknown change point. When the change point is unknown, however, the null distribution of the test statistics is not analytically tractable and has to be simulated by parametric bootstrap. When the alternative hypothesis that a change point exists is accepted, we propose an estimate of its location based on the Akaike's Information Criterion. We applied our method to a data set from the Neuropsychological Database Initiative by implementing our hypothesis testing method to analyze Mini Mental Status Exam scores based on a random-slope and random-intercept model with a bilinear fixed effect. Our result shows that despite large amount of missing data, accelerated decline did occur for MMSE among AD patients. Our finding supports the clinical belief of the existence of a change point during cognitive decline among AD patients and suggests the use of change point models for the longitudinal modeling of cognitive decline in AD research.

  18. A diagnostic model for chronic hypersensitivity pneumonitis.

    PubMed

    Johannson, Kerri A; Elicker, Brett M; Vittinghoff, Eric; Assayag, Deborah; de Boer, Kaïssa; Golden, Jeffrey A; Jones, Kirk D; King, Talmadge E; Koth, Laura L; Lee, Joyce S; Ley, Brett; Wolters, Paul J; Collard, Harold R

    2016-10-01

    The objective of this study was to develop a diagnostic model that allows for a highly specific diagnosis of chronic hypersensitivity pneumonitis using clinical and radiological variables alone. Chronic hypersensitivity pneumonitis and other interstitial lung disease cases were retrospectively identified from a longitudinal database. High-resolution CT scans were blindly scored for radiographic features (eg, ground-glass opacity, mosaic perfusion) as well as the radiologist's diagnostic impression. Candidate models were developed then evaluated using clinical and radiographic variables and assessed by the cross-validated C-statistic. Forty-four chronic hypersensitivity pneumonitis and eighty other interstitial lung disease cases were identified. Two models were selected based on their statistical performance, clinical applicability and face validity. Key model variables included age, down feather and/or bird exposure, radiographic presence of ground-glass opacity and mosaic perfusion and moderate or high confidence in the radiographic impression of chronic hypersensitivity pneumonitis. Models were internally validated with good performance, and cut-off values were established that resulted in high specificity for a diagnosis of chronic hypersensitivity pneumonitis. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/

  19. Mapping the spatial distribution of Aedes aegypti and Aedes albopictus.

    PubMed

    Ding, Fangyu; Fu, Jingying; Jiang, Dong; Hao, Mengmeng; Lin, Gang

    2018-02-01

    Mosquito-borne infectious diseases, such as Rift Valley fever, Dengue, Chikungunya and Zika, have caused mass human death with the transnational expansion fueled by economic globalization. Simulating the distribution of the disease vectors is of great importance in formulating public health planning and disease control strategies. In the present study, we simulated the global distribution of Aedes aegypti and Aedes albopictus at a 5×5km spatial resolution with high-dimensional multidisciplinary datasets and machine learning methods Three relatively popular and robust machine learning models, including support vector machine (SVM), gradient boosting machine (GBM) and random forest (RF), were used. During the fine-tuning process based on training datasets of A. aegypti and A. albopictus, RF models achieved the highest performance with an area under the curve (AUC) of 0.973 and 0.974, respectively, followed by GBM (AUC of 0.971 and 0.972, respectively) and SVM (AUC of 0.963 and 0.964, respectively) models. The simulation difference between RF and GBM models was not statistically significant (p>0.05) based on the validation datasets, whereas statistically significant differences (p<0.05) were observed for RF and GBM simulations compared with SVM simulations. From the simulated maps derived from RF models, we observed that the distribution of A. albopictus was wider than that of A. aegypti along a latitudinal gradient. The discriminatory power of each factor in simulating the global distribution of the two species was also analyzed. Our results provided fundamental information for further study on disease transmission simulation and risk assessment. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. Malaria Disease Mapping in Malaysia based on Besag-York-Mollie (BYM) Model

    NASA Astrophysics Data System (ADS)

    Azah Samat, Nor; Mey, Liew Wan

    2017-09-01

    Disease mapping is the visual representation of the geographical distribution which give an overview info about the incidence of disease within a population through spatial epidemiology data. Based on the result of map, it helps in monitoring and planning resource needs at all levels of health care and designing appropriate interventions, tailored towards areas that deserve closer scrutiny or communities that lead to further investigations to identify important risk factors. Therefore, the choice of statistical model used for relative risk estimation is important because production of disease risk map relies on the model used. This paper proposes Besag-York-Mollie (BYM) model to estimate the relative risk for Malaria in Malaysia. The analysis involved using the number of Malaria cases that obtained from the Ministry of Health Malaysia. The outcomes of analysis are displayed through graph and map, including Malaria disease risk map that constructed according to the estimation of relative risk. The distribution of high and low risk areas of Malaria disease occurrences for all states in Malaysia can be identified in the risk map.

  1. Statistical strategy for anisotropic adventitia modelling in IVUS.

    PubMed

    Gil, Debora; Hernández, Aura; Rodriguez, Oriol; Mauri, Josepa; Radeva, Petia

    2006-06-01

    Vessel plaque assessment by analysis of intravascular ultrasound sequences is a useful tool for cardiac disease diagnosis and intervention. Manual detection of luminal (inner) and media-adventitia (external) vessel borders is the main activity of physicians in the process of lumen narrowing (plaque) quantification. Difficult definition of vessel border descriptors, as well as, shades, artifacts, and blurred signal response due to ultrasound physical properties trouble automated adventitia segmentation. In order to efficiently approach such a complex problem, we propose blending advanced anisotropic filtering operators and statistical classification techniques into a vessel border modelling strategy. Our systematic statistical analysis shows that the reported adventitia detection achieves an accuracy in the range of interobserver variability regardless of plaque nature, vessel geometry, and incomplete vessel borders.

  2. A case-control study of malignant and non-malignant respiratory disease among employees of a fiberglass manufacturing facility.

    PubMed Central

    Chiazze, L; Watkins, D K; Fryar, C

    1992-01-01

    A case-control study was conducted to determine the influence of non-workplace factors on risk of respiratory disease among workers at the Owens-Corning Fiberglas plant in Newark, Ohio. Cases and controls were drawn from a historical cohort mortality study conducted on behalf of the Thermal Insulation Manufacturers Association (TIMA) of workers employed at Newark for at least one year between 1 January 1940 and 31 December 1963 and followed up to the end of 1982. The TIMA study reported a statistically significant increase in respiratory cancer (compared with national death rates). Interviews were completed for 144 lung cancer cases and 299 matching controls and 102 non-malignant respiratory disease cases and 201 matching controls. Unadjusted odds ratios (ORs) were used to assess the association between lung cancer or non-malignant respiratory disease and birthplace, education, income, marital state, smoking with a duration of six months or more, age at which smoking first started, and duration of smoking. Only the smoking variables were statistically significant. For lung cancer, of the variables entered into a conditional logistic regression model, only the smoking OR of 23.4 (95% CI 3.2-172.9) was statistically significant. For non-malignant respiratory disease no variables entered into the final model were statistically significant. Results of the interview portion of our case-control study clearly indicate that smoking is the most important non-workplace factor for risk of lung cancer in this group of workers. Smoking does not seem to play as important a part, however, for non-malignant respiratory disease. Prevalence of cigarette smoking at the Newark plant was estimated for birth cohorts by calendar year. Corresponding data for the United States were compiled from national smoking surveys. Prevalence of cigarette smoking for Newark in 1955 appears to be sufficiently greater than the corresponding United States data in 1955 to suggest that some of the previously reported excess of lung cancer for Newark based on United States mortality may be accounted for by differences in the prevalence of cigarette smoking between white men in Newark and those in the United States as a whole. PMID:1599870

  3. Bayesian mixture modeling for blood sugar levels of diabetes mellitus patients (case study in RSUD Saiful Anwar Malang Indonesia)

    NASA Astrophysics Data System (ADS)

    Budi Astuti, Ani; Iriawan, Nur; Irhamah; Kuswanto, Heri; Sasiarini, Laksmi

    2017-10-01

    Bayesian statistics proposes an approach that is very flexible in the number of samples and distribution of data. Bayesian Mixture Model (BMM) is a Bayesian approach for multimodal models. Diabetes Mellitus (DM) is more commonly known in the Indonesian community as sweet pee. This disease is one type of chronic non-communicable diseases but it is very dangerous to humans because of the effects of other diseases complications caused. WHO reports in 2013 showed DM disease was ranked 6th in the world as the leading causes of human death. In Indonesia, DM disease continues to increase over time. These research would be studied patterns and would be built the BMM models of the DM data through simulation studies where the simulation data built on cases of blood sugar levels of DM patients in RSUD Saiful Anwar Malang. The results have been successfully demonstrated pattern of distribution of the DM data which has a normal mixture distribution. The BMM models have succeed to accommodate the real condition of the DM data based on the data driven concept.

  4. Clinical trials with velnacrine: (PROPP) the physician reference of predicted probabilities--a statistical model for the estimation of hepatotoxicity risk with velnacrine maleate.

    PubMed

    Hardiman, S; Miller, K; Murphy, M

    1993-01-01

    Safety observations during the clinical development of Mentane (velnacrine maleate) have included the occurrence of generally asymptomatic liver enzyme elevations confined to patients with Alzheimer's disease (AD). The clinical presentation of this reversible hepatocellular injury is analogous to that reported for tetrahydroaminoacridine (THA). Direct liver injury, possibly associated with the production of a toxic metabolite, would be consistent with reports of aberrant xenobiotic metabolism in Alzheimer's disease patients. Since a patient related aberration in drug metabolism was suspected, a biostatistical strategy was developed with the objective of predicting hepatotoxicity in individual patients prior to exposure to velnacrine maleate. The method used logistic regression techniques with variable selection restricted to those items which could be routinely and inexpensively accessed at screen evaluation for potential candidates for treatment. The model was to be predictive (a marker for eventual hepatotoxicity) rather than a causative model, and techniques employed "goodness of fit", percentage correct, and positive and negative predictive values. On the basis of demographic and baseline laboratory data from 942 patients, the PROPP statistic was developed (the Physician Reference Of Predicted Probabilities). Main effect variables included age, gender, and nine hematological and serum chemistry variables. The sensitivity of the current model is approximately 49%, specificity approximately 88%. Using prior probability estimates, however, in which the patient's likelihood of liver toxicity is presumed to be at least 30%, the positive predictive value ranged between 64-77%. Although the clinical utility of this statistic will require refinements and additional prospective confirmation, its potential existence speaks to the possibility of markers for idiosyncratic drug metabolism in patients with Alzheimer's disease.

  5. Health Statistics

    MedlinePlus

    ... births, deaths, marriages, and divorces are sometimes called "vital statistics." Researchers use statistics to see patterns of diseases in groups of people. This can help in figuring out who is at risk for certain diseases, finding ways to control diseases and deciding which diseases ...

  6. Testing Modeling Assumptions in the West Africa Ebola Outbreak

    NASA Astrophysics Data System (ADS)

    Burghardt, Keith; Verzijl, Christopher; Huang, Junming; Ingram, Matthew; Song, Binyang; Hasne, Marie-Pierre

    2016-10-01

    The Ebola virus in West Africa has infected almost 30,000 and killed over 11,000 people. Recent models of Ebola Virus Disease (EVD) have often made assumptions about how the disease spreads, such as uniform transmissibility and homogeneous mixing within a population. In this paper, we test whether these assumptions are necessarily correct, and offer simple solutions that may improve disease model accuracy. First, we use data and models of West African migration to show that EVD does not homogeneously mix, but spreads in a predictable manner. Next, we estimate the initial growth rate of EVD within country administrative divisions and find that it significantly decreases with population density. Finally, we test whether EVD strains have uniform transmissibility through a novel statistical test, and find that certain strains appear more often than expected by chance.

  7. The utility of liver function tests for mortality prediction within one year in primary care using the algorithm for liver function investigations (ALFI).

    PubMed

    McLernon, David J; Dillon, John F; Sullivan, Frank M; Roderick, Paul; Rosenberg, William M; Ryder, Stephen D; Donnan, Peter T

    2012-01-01

    Although liver function tests (LFTs) are routinely measured in primary care, raised levels in patients with no obvious liver disease may trigger a range of subsequent expensive and unnecessary management plans. The aim of this study was to develop and validate a prediction model to guide decision-making by general practitioners, which estimates risk of one year all-cause mortality in patients with no obvious liver disease. In this population-based historical cohort study, biochemistry data from patients in Tayside, Scotland, with LFTs performed in primary care were record-linked to secondary care and prescription databases to ascertain baseline characteristics, and to mortality data. Using this derivation cohort a survival model was developed to predict mortality. The model was assessed for calibration, discrimination (using the C-statistic) and performance, and validated using a separate cohort of Scottish primary care practices. From the derivation cohort (n = 95 977), 2.7% died within one year. Predictors of mortality included: age; male gender; social deprivation; history of cancer, renal disease, stroke, ischaemic heart disease or respiratory disease; statin use; and LFTs (albumin, transaminase, alkaline phosphatase, bilirubin, and gamma-glutamyltransferase). The C-statistic for the final model was 0.82 (95% CI 0.80-0.84), and was similar in the validation cohort (n = 11 653) 0.86 (0.79-0.90). As an example of performance, for a 10% predicted probability cut-off, sensitivity = 52.8%, specificity = 94.0%, PPV = 21.0%, NPV = 98.5%. For the model without LFTs the respective values were 43.8%, 92.8%, 15.6%, 98.1%. The Algorithm for Liver Function Investigations (ALFI) is the first model to successfully estimate the probability of all-cause mortality in patients with no apparent liver disease having LFTs in primary care. While LFTs added to the model's discrimination and sensitivity, the clinical utility of ALFI remains to be established since LFTs did not improve an already high NPV for short term mortality and only modestly improved a very low PPV.

  8. Radiation-epidemiological Study of Cerebrovascular Diseases in the Cohort of Russian Recovery Operation Workers of the Chernobyl Accident.

    PubMed

    Kashcheev, V V; Chekin, S Yu; Maksioutov, M A; Tumanov, K A; Menyaylo, A N; Kochergina, E V; Kashcheeva, P V; Gorsky, A I; Shchukina, N V; Karpenko, S V; Ivanov, V K

    2016-08-01

    The paper presents an analysis of the incidence of cerebrovascular diseases (CeVD) in the cohort of Russian workers involved in recovery tasks after the Chernobyl accident. The studied cohort consists of 53,772 recovery operation workers (liquidators) who arrived in the zone of the Chernobyl accident within the first year after this accident (26 April 1986-26 April 1987). The mean external whole body dose in the cohort was 0.161 Gy, while individual doses varied from 0.0001 Gy to 1.42 Gy. During the follow-up period 1986-2012, a total of 23,264 cases of CeVD were diagnosed as a result of annual health examinations. A Poisson regression model was applied for estimation of radiation risks and for an assessment of other risk factors of CeVD. The following factors were considered as risk factors for CeVD: the dose, duration of the liquidators' work in the Chernobyl zone, and the concomitant diseases (hypertension, ischemic heart disease, atherosclerosis, and diabetes). The baseline incidence of CeVD is statistically significantly (p < 0.001) associated with all studied concomitant diseases. The incidence of CeVD has revealed a statistically significant dose response with the lack of a latent period and with the average ERR/Gy = 0.45, 95% CI: (0.28, 0.62), p < 0.001. Radiation risks of CeVD statistically significantly (p = 0.03) varied with the duration of liquidators' stay in the Chernobyl zone; for those who stayed in the Chernobyl zone less than 6 wk, ERR/Gy = 0.64, 95% CI = (0.38; 0.93), p < 0.001. Among studied concomitant diseases, diabetes mellitus statistically significantly (p = 0.002) increases the radiation risk of CeVD: for liquidators with diagnosed diabetes, ERR/Gy = 1.29.

  9. Niche harmony search algorithm for detecting complex disease associated high-order SNP combinations.

    PubMed

    Tuo, Shouheng; Zhang, Junying; Yuan, Xiguo; He, Zongzhen; Liu, Yajun; Liu, Zhaowen

    2017-09-14

    Genome-wide association study is especially challenging in detecting high-order disease-causing models due to model diversity, possible low or even no marginal effect of the model, and extraordinary search and computations. In this paper, we propose a niche harmony search algorithm where joint entropy is utilized as a heuristic factor to guide the search for low or no marginal effect model, and two computationally lightweight scores are selected to evaluate and adapt to diverse of disease models. In order to obtain all possible suspected pathogenic models, niche technique merges with HS, which serves as a taboo region to avoid HS trapping into local search. From the resultant set of candidate SNP-combinations, we use G-test statistic for testing true positives. Experiments were performed on twenty typical simulation datasets in which 12 models are with marginal effect and eight ones are with no marginal effect. Our results indicate that the proposed algorithm has very high detection power for searching suspected disease models in the first stage and it is superior to some typical existing approaches in both detection power and CPU runtime for all these datasets. Application to age-related macular degeneration (AMD) demonstrates our method is promising in detecting high-order disease-causing models.

  10. Model-Based Linkage Analysis of a Quantitative Trait.

    PubMed

    Song, Yeunjoo E; Song, Sunah; Schnell, Audrey H

    2017-01-01

    Linkage Analysis is a family-based method of analysis to examine whether any typed genetic markers cosegregate with a given trait, in this case a quantitative trait. If linkage exists, this is taken as evidence in support of a genetic basis for the trait. Historically, linkage analysis was performed using a binary disease trait, but has been extended to include quantitative disease measures. Quantitative traits are desirable as they provide more information than binary traits. Linkage analysis can be performed using single-marker methods (one marker at a time) or multipoint (using multiple markers simultaneously). In model-based linkage analysis the genetic model for the trait of interest is specified. There are many software options for performing linkage analysis. Here, we use the program package Statistical Analysis for Genetic Epidemiology (S.A.G.E.). S.A.G.E. was chosen because it also includes programs to perform data cleaning procedures and to generate and test genetic models for a quantitative trait, in addition to performing linkage analysis. We demonstrate in detail the process of running the program LODLINK to perform single-marker analysis, and MLOD to perform multipoint analysis using output from SEGREG, where SEGREG was used to determine the best fitting statistical model for the trait.

  11. Big Data for Infectious Disease Surveillance and Modeling

    PubMed Central

    Bansal, Shweta; Chowell, Gerardo; Simonsen, Lone; Vespignani, Alessandro; Viboud, Cécile

    2016-01-01

    We devote a special issue of the Journal of Infectious Diseases to review the recent advances of big data in strengthening disease surveillance, monitoring medical adverse events, informing transmission models, and tracking patient sentiments and mobility. We consider a broad definition of big data for public health, one encompassing patient information gathered from high-volume electronic health records and participatory surveillance systems, as well as mining of digital traces such as social media, Internet searches, and cell-phone logs. We introduce nine independent contributions to this special issue and highlight several cross-cutting areas that require further research, including representativeness, biases, volatility, and validation, and the need for robust statistical and hypotheses-driven analyses. Overall, we are optimistic that the big-data revolution will vastly improve the granularity and timeliness of available epidemiological information, with hybrid systems augmenting rather than supplanting traditional surveillance systems, and better prospects for accurate infectious diseases models and forecasts. PMID:28830113

  12. Spatiotemporal Bayesian analysis of Lyme disease in New York state, 1990-2000.

    PubMed

    Chen, Haiyan; Stratton, Howard H; Caraco, Thomas B; White, Dennis J

    2006-07-01

    Mapping ordinarily increases our understanding of nontrivial spatial and temporal heterogeneities in disease rates. However, the large number of parameters required by the corresponding statistical models often complicates detailed analysis. This study investigates the feasibility of a fully Bayesian hierarchical regression approach to the problem and identifies how it outperforms two more popular methods: crude rate estimates (CRE) and empirical Bayes standardization (EBS). In particular, we apply a fully Bayesian approach to the spatiotemporal analysis of Lyme disease incidence in New York state for the period 1990-2000. These results are compared with those obtained by CRE and EBS in Chen et al. (2005). We show that the fully Bayesian regression model not only gives more reliable estimates of disease rates than the other two approaches but also allows for tractable models that can accommodate more numerous sources of variation and unknown parameters.

  13. Recent development of risk-prediction models for incident hypertension: An updated systematic review

    PubMed Central

    Xiao, Lei; Liu, Ya; Wang, Zuoguang; Li, Chuang; Jin, Yongxin; Zhao, Qiong

    2017-01-01

    Background Hypertension is a leading global health threat and a major cardiovascular disease. Since clinical interventions are effective in delaying the disease progression from prehypertension to hypertension, diagnostic prediction models to identify patient populations at high risk for hypertension are imperative. Methods Both PubMed and Embase databases were searched for eligible reports of either prediction models or risk scores of hypertension. The study data were collected, including risk factors, statistic methods, characteristics of study design and participants, performance measurement, etc. Results From the searched literature, 26 studies reporting 48 prediction models were selected. Among them, 20 reports studied the established models using traditional risk factors, such as body mass index (BMI), age, smoking, blood pressure (BP) level, parental history of hypertension, and biochemical factors, whereas 6 reports used genetic risk score (GRS) as the prediction factor. AUC ranged from 0.64 to 0.97, and C-statistic ranged from 60% to 90%. Conclusions The traditional models are still the predominant risk prediction models for hypertension, but recently, more models have begun to incorporate genetic factors as part of their model predictors. However, these genetic predictors need to be well selected. The current reported models have acceptable to good discrimination and calibration ability, but whether the models can be applied in clinical practice still needs more validation and adjustment. PMID:29084293

  14. MDD-carb: a combinatorial model for the identification of protein carbonylation sites with substrate motifs.

    PubMed

    Kao, Hui-Ju; Weng, Shun-Long; Huang, Kai-Yao; Kaunang, Fergie Joanda; Hsu, Justin Bo-Kai; Huang, Chien-Hsun; Lee, Tzong-Yi

    2017-12-21

    Carbonylation, which takes place through oxidation of reactive oxygen species (ROS) on specific residues, is an irreversibly oxidative modification of proteins. It has been reported that the carbonylation is related to a number of metabolic or aging diseases including diabetes, chronic lung disease, Parkinson's disease, and Alzheimer's disease. Due to the lack of computational methods dedicated to exploring motif signatures of protein carbonylation sites, we were motivated to exploit an iterative statistical method to characterize and identify carbonylated sites with motif signatures. By manually curating experimental data from research articles, we obtained 332, 144, 135, and 140 verified substrate sites for K (lysine), R (arginine), T (threonine), and P (proline) residues, respectively, from 241 carbonylated proteins. In order to examine the informative attributes for classifying between carbonylated and non-carbonylated sites, multifarious features including composition of twenty amino acids (AAC), composition of amino acid pairs (AAPC), position-specific scoring matrix (PSSM), and positional weighted matrix (PWM) were investigated in this study. Additionally, in an attempt to explore the motif signatures of carbonylation sites, an iterative statistical method was adopted to detect statistically significant dependencies of amino acid compositions between specific positions around substrate sites. Profile hidden Markov model (HMM) was then utilized to train a predictive model from each motif signature. Moreover, based on the method of support vector machine (SVM), we adopted it to construct an integrative model by combining the values of bit scores obtained from profile HMMs. The combinatorial model could provide an enhanced performance with evenly predictive sensitivity and specificity in the evaluation of cross-validation and independent testing. This study provides a new scheme for exploring potential motif signatures at substrate sites of protein carbonylation. The usefulness of the revealed motifs in the identification of carbonylated sites is demonstrated by their effective performance in cross-validation and independent testing. Finally, these substrate motifs were adopted to build an available online resource (MDD-Carb, http://csb.cse.yzu.edu.tw/MDDCarb/ ) and are also anticipated to facilitate the study of large-scale carbonylated proteomes.

  15. The power to detect linkage in complex disease by means of simple LOD-score analyses.

    PubMed Central

    Greenberg, D A; Abreu, P; Hodge, S E

    1998-01-01

    Maximum-likelihood analysis (via LOD score) provides the most powerful method for finding linkage when the mode of inheritance (MOI) is known. However, because one must assume an MOI, the application of LOD-score analysis to complex disease has been questioned. Although it is known that one can legitimately maximize the maximum LOD score with respect to genetic parameters, this approach raises three concerns: (1) multiple testing, (2) effect on power to detect linkage, and (3) adequacy of the approximate MOI for the true MOI. We evaluated the power of LOD scores to detect linkage when the true MOI was complex but a LOD score analysis assumed simple models. We simulated data from 14 different genetic models, including dominant and recessive at high (80%) and low (20%) penetrances, intermediate models, and several additive two-locus models. We calculated LOD scores by assuming two simple models, dominant and recessive, each with 50% penetrance, then took the higher of the two LOD scores as the raw test statistic and corrected for multiple tests. We call this test statistic "MMLS-C." We found that the ELODs for MMLS-C are >=80% of the ELOD under the true model when the ELOD for the true model is >=3. Similarly, the power to reach a given LOD score was usually >=80% that of the true model, when the power under the true model was >=60%. These results underscore that a critical factor in LOD-score analysis is the MOI at the linked locus, not that of the disease or trait per se. Thus, a limited set of simple genetic models in LOD-score analysis can work well in testing for linkage. PMID:9718328

  16. The power to detect linkage in complex disease by means of simple LOD-score analyses.

    PubMed

    Greenberg, D A; Abreu, P; Hodge, S E

    1998-09-01

    Maximum-likelihood analysis (via LOD score) provides the most powerful method for finding linkage when the mode of inheritance (MOI) is known. However, because one must assume an MOI, the application of LOD-score analysis to complex disease has been questioned. Although it is known that one can legitimately maximize the maximum LOD score with respect to genetic parameters, this approach raises three concerns: (1) multiple testing, (2) effect on power to detect linkage, and (3) adequacy of the approximate MOI for the true MOI. We evaluated the power of LOD scores to detect linkage when the true MOI was complex but a LOD score analysis assumed simple models. We simulated data from 14 different genetic models, including dominant and recessive at high (80%) and low (20%) penetrances, intermediate models, and several additive two-locus models. We calculated LOD scores by assuming two simple models, dominant and recessive, each with 50% penetrance, then took the higher of the two LOD scores as the raw test statistic and corrected for multiple tests. We call this test statistic "MMLS-C." We found that the ELODs for MMLS-C are >=80% of the ELOD under the true model when the ELOD for the true model is >=3. Similarly, the power to reach a given LOD score was usually >=80% that of the true model, when the power under the true model was >=60%. These results underscore that a critical factor in LOD-score analysis is the MOI at the linked locus, not that of the disease or trait per se. Thus, a limited set of simple genetic models in LOD-score analysis can work well in testing for linkage.

  17. [Establishment of diagnostic model to monitor minimal residual disease of acute promyelocytic leukemia by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry].

    PubMed

    Zhang, Lin-lin; Xu, Zhi-fang; Tan, Yan-hong; Chen, Xiu-hua; Xu, Ai-ning; Ren, Fang-gang; Wang, Hong-wei

    2013-01-01

    To screen the potential protein biomarkers in minimal residual disease (MRD) of the acute promyelocytic leukemia (APL) by comparison of differentially expressed serum protein between APL patients at diagnosis and after complete remission (CR) and healthy controls, and to establish and verify a diagnostic model. Serum proteins from 36 cases of primary APL, 29 cases of APL during complete remission and 32 healthy controls were purified by magnetic beads and then analyzed by matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS). The spectra were analyzed statistically using FlexAnalysis(TM) and ClinProt(TM) software. Two prediction model of primary APL/healthy control, primary APL/APL CR were developed. Thirty four statistically significant peptide peaks were obtained with the m/z value ranging from 1000 to 10 000 (P < 0.001) in primary APL/healthy control model. Seven statistically significant peptide peaks were obtained in primary APL/APL CR model (P < 0.001). Comparison of the protein profiles between the two models, three peptides with m/z 4642, 7764 and 9289 were considered as the protein biomarker of APL MRD. A diagnostic pattern for APL CR using m/z 4642 and 9289 was established. Blind validation yielded correct classification of 6 out of 8 cases. The MALDI-TOF MS analysis of APL patients serum protein can be used as a promising dynamic method for MRD detection and the two peptides with m/z 4642 and 9289 may be better biomarkers.

  18. A rat model system to study complex disease risks, fitness, aging, and longevity.

    PubMed

    Koch, Lauren Gerard; Britton, Steven L; Wisløff, Ulrik

    2012-02-01

    The association between low exercise capacity and all-cause morbidity and mortality is statistically strong yet mechanistically unresolved. By connecting clinical observation with a theoretical base, we developed a working hypothesis that variation in capacity for oxygen metabolism is the central mechanistic determinant between disease and health (aerobic hypothesis). As an unbiased test, we show that two-way artificial selective breeding of rats for low and high intrinsic endurance exercise capacity also produces rats that differ for numerous disease risks, including the metabolic syndrome, cardiovascular complications, premature aging, and reduced longevity. This contrasting animal model system may prove to be translationally superior relative to more widely used simplistic models for understanding geriatric biology and medicine. Copyright © 2012 Elsevier Inc. All rights reserved.

  19. Can multivariate models based on MOAKS predict OA knee pain? Data from the Osteoarthritis Initiative

    NASA Astrophysics Data System (ADS)

    Luna-Gómez, Carlos D.; Zanella-Calzada, Laura A.; Galván-Tejada, Jorge I.; Galván-Tejada, Carlos E.; Celaya-Padilla, José M.

    2017-03-01

    Osteoarthritis is the most common rheumatic disease in the world. Knee pain is the most disabling symptom in the disease, the prediction of pain is one of the targets in preventive medicine, this can be applied to new therapies or treatments. Using the magnetic resonance imaging and the grading scales, a multivariate model based on genetic algorithms is presented. Using a predictive model can be useful to associate minor structure changes in the joint with the future knee pain. Results suggest that multivariate models can be predictive with future knee chronic pain. All models; T0, T1 and T2, were statistically significant, all p values were < 0.05 and all AUC > 0.60.

  20. Modeling two strains of disease via aggregate-level infectivity curves.

    PubMed

    Romanescu, Razvan; Deardon, Rob

    2016-04-01

    Well formulated models of disease spread, and efficient methods to fit them to observed data, are powerful tools for aiding the surveillance and control of infectious diseases. Our project considers the problem of the simultaneous spread of two related strains of disease in a context where spatial location is the key driver of disease spread. We start our modeling work with the individual level models (ILMs) of disease transmission, and extend these models to accommodate the competing spread of the pathogens in a two-tier hierarchical population (whose levels we refer to as 'farm' and 'animal'). The postulated interference mechanism between the two strains is a period of cross-immunity following infection. We also present a framework for speeding up the computationally intensive process of fitting the ILM to data, typically done using Markov chain Monte Carlo (MCMC) in a Bayesian framework, by turning the inference into a two-stage process. First, we approximate the number of animals infected on a farm over time by infectivity curves. These curves are fit to data sampled from farms, using maximum likelihood estimation, then, conditional on the fitted curves, Bayesian MCMC inference proceeds for the remaining parameters. Finally, we use posterior predictive distributions of salient epidemic summary statistics, in order to assess the model fitted.

  1. Decision-making for foot-and-mouth disease control: Objectives matter

    USGS Publications Warehouse

    Probert, William J. M.; Shea, Katriona; Fonnesbeck, Christopher J.; Runge, Michael C.; Carpenter, Tim E.; Durr, Salome; Garner, M. Graeme; Harvey, Neil; Stevenson, Mark A.; Webb, Colleen T.; Werkman, Marleen; Tildesley, Michael J.; Ferrari, Matthew J.

    2016-01-01

    Formal decision-analytic methods can be used to frame disease control problems, the first step of which is to define a clear and specific objective. We demonstrate the imperative of framing clearly-defined management objectives in finding optimal control actions for control of disease outbreaks. We illustrate an analysis that can be applied rapidly at the start of an outbreak when there are multiple stakeholders involved with potentially multiple objectives, and when there are also multiple disease models upon which to compare control actions. The output of our analysis frames subsequent discourse between policy-makers, modellers and other stakeholders, by highlighting areas of discord among different management objectives and also among different models used in the analysis. We illustrate this approach in the context of a hypothetical foot-and-mouth disease (FMD) outbreak in Cumbria, UK using outputs from five rigorously-studied simulation models of FMD spread. We present both relative rankings and relative performance of controls within each model and across a range of objectives. Results illustrate how control actions change across both the base metric used to measure management success and across the statistic used to rank control actions according to said metric. This work represents a first step towards reconciling the extensive modelling work on disease control problems with frameworks for structured decision making.

  2. Outcomes and opportunities: a nurse-led model of chronic disease management in Australian general practice.

    PubMed

    Eley, Diann S; Patterson, Elizabeth; Young, Jacqui; Fahey, Paul P; Del Mar, Chris B; Hegney, Desley G; Synnott, Robyn L; Mahomed, Rosemary; Baker, Peter G; Scuffham, Paul A

    2013-01-01

    The Australian government's commitment to health service reform has placed general practice at the centre of its agenda to manage chronic disease. Concerns about the capacity of GPs to meet the growing chronic disease burden has stimulated the implementation and testing of new models of care that better utilise practice nurses (PN). This paper reports on a mixed-methods study nested within a larger study that trialled the feasibility and acceptability of a new model of nurse-led chronic disease management in three general practices. Patients over 18 years of age with type 2 diabetes, hypertension or stable ischaemic heart disease were randomised into PN-led or usual GP-led care. Primary outcomes were self-reported quality of life and perceptions of the model's feasibility and acceptability from the perspective of patients and GPs. Over the 12-month study quality of life decreased but the trend between groups was not statistically different. Qualitative data indicate that the PN-led model was acceptable and feasible to GPs and patients. It is possible to extend the scope of PN care to lead the routine clinical management of patients' stable chronic diseases. All GPs identified significant advantages to the model and elected to continue with the PN-led care after our study concluded.

  3. Detecting global and local hippocampal shape changes in Alzheimer's disease using statistical shape models.

    PubMed

    Shen, Kai-kai; Fripp, Jurgen; Mériaudeau, Fabrice; Chételat, Gaël; Salvado, Olivier; Bourgeat, Pierrick

    2012-02-01

    The hippocampus is affected at an early stage in the development of Alzheimer's disease (AD). With the use of structural magnetic resonance (MR) imaging, we can investigate the effect of AD on the morphology of the hippocampus. The hippocampal shape variations among a population can be usually described using statistical shape models (SSMs). Conventional SSMs model the modes of variations among the population via principal component analysis (PCA). Although these modes are representative of variations within the training data, they are not necessarily discriminative on labeled data or relevant to the differences between the subpopulations. We use the shape descriptors from SSM as features to classify AD from normal control (NC) cases. In this study, a Hotelling's T2 test is performed to select a subset of landmarks which are used in PCA. The resulting variation modes are used as predictors of AD from NC. The discrimination ability of these predictors is evaluated in terms of their classification performances with bagged support vector machines (SVMs). Restricting the model to landmarks with better separation between AD and NC increases the discrimination power of SSM. The predictors extracted on the subregions also showed stronger correlation with the memory-related measurements such as Logical Memory, Auditory Verbal Learning Test (AVLT) and the memory subscores of Alzheimer Disease Assessment Scale (ADAS). Crown Copyright © 2011. Published by Elsevier Inc. All rights reserved.

  4. Modeling to Optimize Terminal Stem Cell Differentiation

    PubMed Central

    Gallicano, G. Ian

    2013-01-01

    Embryonic stem cell (ESC), iPCs, and adult stem cells (ASCs) all are among the most promising potential treatments for heart failure, spinal cord injury, neurodegenerative diseases, and diabetes. However, considerable uncertainty in the production of ESC-derived terminally differentiated cell types has limited the efficiency of their development. To address this uncertainty, we and other investigators have begun to employ a comprehensive statistical model of ESC differentiation for determining the role of intracellular pathways (e.g., STAT3) in ESC differentiation and determination of germ layer fate. The approach discussed here applies the Baysian statistical model to cell/developmental biology combining traditional flow cytometry methodology and specific morphological observations with advanced statistical and probabilistic modeling and experimental design. The final result of this study is a unique tool and model that enhances the understanding of how and when specific cell fates are determined during differentiation. This model provides a guideline for increasing the production efficiency of therapeutically viable ESCs/iPSCs/ASC derived neurons or any other cell type and will eventually lead to advances in stem cell therapy. PMID:24278782

  5. Disease Ecology, Biodiversity, and the Latitudinal Gradient in Income

    PubMed Central

    Bonds, Matthew H.; Dobson, Andrew P.; Keenan, Donald C.

    2012-01-01

    While most of the world is thought to be on long-term economic growth paths, more than one-sixth of the world is roughly as poor today as their ancestors were hundreds of years ago. The majority of the extremely poor live in the tropics. The latitudinal gradient in income is highly suggestive of underlying biophysical drivers, of which disease conditions are an especially salient example. However, conclusions have been confounded by the simultaneous causality between income and disease, in addition to potentially spurious relationships. We use a simultaneous equations model to estimate the relative effects of vector-borne and parasitic diseases (VBPDs) and income on each other, controlling for other factors. Our statistical model indicates that VBPDs have systematically affected economic development, evident in contemporary levels of per capita income. The burden of VBDPs is, in turn, determined by underlying ecological conditions. In particular, the model predicts it to rise as biodiversity falls. Through these positive effects on human health, the model thus identifies measurable economic benefits of biodiversity. PMID:23300379

  6. Serum urate gene associations with incident gout, measured in the Framingham Heart Study, are modified by renal disease and not by body mass index.

    PubMed

    Reynolds, Richard J; Vazquez, Ana I; Srinivasasainagendra, Vinodh; Klimentidis, Yann C; Bridges, S Louis; Allison, David B; Singh, Jasvinder A

    2016-02-01

    We hypothesized that serum urate-associated SNPs, individually or collectively, interact with BMI and renal disease to contribute to risk of incident gout. We measured the incidence of gout and associated comorbidities using the original and offspring cohorts of the Framingham Heart Study. We used direct and imputed genotypes for eight validated serum urate loci. We fit binomial regression models of gout incidence as a function of the covariates, age, type 2 diabetes, sex, and all main and interaction effects of the eight serum urate SNPs with BMI and renal disease. Models were also fit with a genetic risk score for serum urate levels which corresponds to the sum of risk alleles at the eight SNPs. Model covariates, age (P = 5.95E-06), sex (P = 2.46E-39), diabetes (P = 2.34E-07), BMI (P = 1.14E-11) and the SNPs, rs1967017 (P = 9.54E-03), rs13129697 (P = 4.34E-07), rs2199936 (P = 7.28E-03) and rs675209 (P = 4.84E-02) were all associated with incident gout. No BMI by SNP or BMI by serum urate genetic risk score interactions were statistically significant, but renal disease by rs1106766 was statistically significant (P = 6.12E-03). We demonstrated that minor alleles of rs1106766 (intergenic, INHBC) were negatively associated with the risk of incident gout in subjects without renal disease, but not for individuals with renal disease. These analyses demonstrate that a significant component of the risk of gout may involve complex interplay between genes and environment.

  7. Serum urate gene associations with incident gout, measured in the Framingham Heart Study, are modified by renal disease and not by body mass index

    PubMed Central

    Reynolds, Richard J.; Vazquez, Ana I.; Srinivasasainagendra, Vinodh; Klimentidis, Yann C.; Bridges, S. Louis; Allison, David B.; Singh, Jasvinder A.

    2015-01-01

    Introduction/objectives We hypothesized that serum urate-associated SNPs, individually or collectively, interact with BMI and renal disease to contribute to risk for incident gout. Method We measured the incidence of gout and associated comorbidities using the Original and Offspring cohorts of the Framingham Heart Study. We used direct and imputed genotypes for eight validated serum urate loci. We fit binomial regression models of gout incidence as a function of the covariates, age, type 2 diabetes, sex, and all main and interaction effects of the eight serum urate SNPs with BMI and renal disease. Models were also fit with a genetic risk score for serum urate levels which corresponds to the sum of risk alleles at the 8 SNPs. Results Model covariates, age (P = 5.95E-06), sex (P = 2.46E-39), diabetes (P = 2.34E-07), BMI (P = 1.14E-11) and the SNPs, rs1967017 (P = 9.54E-03), rs13129697 (P = 4.34E-07), rs2199936 (P = 7.28E-03) and rs675209 (P = 4.84E-02) were all associated with incident gout. No BMI by SNP or BMI by serum urate genetic risk score (GRS) interactions were statistically significant, but renal disease by rs1106766 was statistically significant (P=6.12E-03). Conclusions We demonstrated that minor alleles of rs1106766 (intergenic, INHBC) were negatively associated with the risk of incident gout in subjects without renal disease, but not for individuals with renal disease. These analyses demonstrate that a significant component of the risk for gout may involve complex interplay between genes and environment. PMID:26427508

  8. The use of modelling to evaluate and adapt strategies for animal disease control.

    PubMed

    Saegerman, C; Porter, S R; Humblet, M F

    2011-08-01

    Disease is often associated with debilitating clinical signs, disorders or production losses in animals and/or humans, leading to severe socio-economic repercussions. This explains the high priority that national health authorities and international organisations give to selecting control strategies for and the eradication of specific diseases. When a control strategy is selected and implemented, an effective method of evaluating its efficacy is through modelling. To illustrate the usefulness of models in evaluating control strategies, the authors describe several examples in detail, including three examples of classification and regression tree modelling to evaluate and improve the early detection of disease: West Nile fever in equids, bovine spongiform encephalopathy (BSE) and multifactorial diseases, such as colony collapse disorder (CCD) in the United States. Also examined are regression modelling to evaluate skin test practices and the efficacy of an awareness campaign for bovine tuberculosis (bTB); mechanistic modelling to monitor the progress of a control strategy for BSE; and statistical nationwide modelling to analyse the spatio-temporal dynamics of bTB and search for potential risk factors that could be used to target surveillance measures more effectively. In the accurate application of models, an interdisciplinary rather than a multidisciplinary approach is required, with the fewest assumptions possible.

  9. FHSA-SED: Two-Locus Model Detection for Genome-Wide Association Study with Harmony Search Algorithm.

    PubMed

    Tuo, Shouheng; Zhang, Junying; Yuan, Xiguo; Zhang, Yuanyuan; Liu, Zhaowen

    2016-01-01

    Two-locus model is a typical significant disease model to be identified in genome-wide association study (GWAS). Due to intensive computational burden and diversity of disease models, existing methods have drawbacks on low detection power, high computation cost, and preference for some types of disease models. In this study, two scoring functions (Bayesian network based K2-score and Gini-score) are used for characterizing two SNP locus as a candidate model, the two criteria are adopted simultaneously for improving identification power and tackling the preference problem to disease models. Harmony search algorithm (HSA) is improved for quickly finding the most likely candidate models among all two-locus models, in which a local search algorithm with two-dimensional tabu table is presented to avoid repeatedly evaluating some disease models that have strong marginal effect. Finally G-test statistic is used to further test the candidate models. We investigate our method named FHSA-SED on 82 simulated datasets and a real AMD dataset, and compare it with two typical methods (MACOED and CSE) which have been developed recently based on swarm intelligent search algorithm. The results of simulation experiments indicate that our method outperforms the two compared algorithms in terms of detection power, computation time, evaluation times, sensitivity (TPR), specificity (SPC), positive predictive value (PPV) and accuracy (ACC). Our method has identified two SNPs (rs3775652 and rs10511467) that may be also associated with disease in AMD dataset.

  10. FHSA-SED: Two-Locus Model Detection for Genome-Wide Association Study with Harmony Search Algorithm

    PubMed Central

    Tuo, Shouheng; Zhang, Junying; Yuan, Xiguo; Zhang, Yuanyuan; Liu, Zhaowen

    2016-01-01

    Motivation Two-locus model is a typical significant disease model to be identified in genome-wide association study (GWAS). Due to intensive computational burden and diversity of disease models, existing methods have drawbacks on low detection power, high computation cost, and preference for some types of disease models. Method In this study, two scoring functions (Bayesian network based K2-score and Gini-score) are used for characterizing two SNP locus as a candidate model, the two criteria are adopted simultaneously for improving identification power and tackling the preference problem to disease models. Harmony search algorithm (HSA) is improved for quickly finding the most likely candidate models among all two-locus models, in which a local search algorithm with two-dimensional tabu table is presented to avoid repeatedly evaluating some disease models that have strong marginal effect. Finally G-test statistic is used to further test the candidate models. Results We investigate our method named FHSA-SED on 82 simulated datasets and a real AMD dataset, and compare it with two typical methods (MACOED and CSE) which have been developed recently based on swarm intelligent search algorithm. The results of simulation experiments indicate that our method outperforms the two compared algorithms in terms of detection power, computation time, evaluation times, sensitivity (TPR), specificity (SPC), positive predictive value (PPV) and accuracy (ACC). Our method has identified two SNPs (rs3775652 and rs10511467) that may be also associated with disease in AMD dataset. PMID:27014873

  11. Statistical Analysis of Big Data on Pharmacogenomics

    PubMed Central

    Fan, Jianqing; Liu, Han

    2013-01-01

    This paper discusses statistical methods for estimating complex correlation structure from large pharmacogenomic datasets. We selectively review several prominent statistical methods for estimating large covariance matrix for understanding correlation structure, inverse covariance matrix for network modeling, large-scale simultaneous tests for selecting significantly differently expressed genes and proteins and genetic markers for complex diseases, and high dimensional variable selection for identifying important molecules for understanding molecule mechanisms in pharmacogenomics. Their applications to gene network estimation and biomarker selection are used to illustrate the methodological power. Several new challenges of Big data analysis, including complex data distribution, missing data, measurement error, spurious correlation, endogeneity, and the need for robust statistical methods, are also discussed. PMID:23602905

  12. Identifying Potential Areas of Human Zika Infection in the City of Los Angeles, California by Use of Remote Sensing Imagery

    NASA Astrophysics Data System (ADS)

    Lee, J.

    2017-12-01

    As of April 2017, California is the third most prevalent state on the United States for Zika Infection and Southern California has an ever growing population of Aedes mosquitos. Zika is a disease which poses a significant risk to humans and other mammals due to its effects on pregnancy. This emerging disease is highly contagious due to its spread of infection primarily by Aedes aegypti mosquitos. Aedes mosquitos are able to breed in small rain collecting containers which allow the species to persevere in urban and semi urban environments. We hope to identify potential areas with risk of human infection within Los Angeles and its surrounding areas. This study integrates remote sensing, GIS, statistical, and environmental techniques to study favorable habitats for this particular species of mosquitos and their larvae. The study of the geographic and landscape factors which promote the larvae development allow for the disease spread to be analyzed and modeled. There are several goals in the development of this study. These include the coordination of statistical data with local epidemiology departments, identify workflows to improve efficiency, create models which can be utilized for disease prevention, and identify geographic risk factors for the spread of Zika.

  13. Primary Sclerosing Cholangitis Risk Estimate Tool (PREsTo) Predicts Outcomes in PSC: A Derivation & Validation Study Using Machine Learning.

    PubMed

    Eaton, John E; Vesterhus, Mette; McCauley, Bryan M; Atkinson, Elizabeth J; Schlicht, Erik M; Juran, Brian D; Gossard, Andrea A; LaRusso, Nicholas F; Gores, Gregory J; Karlsen, Tom H; Lazaridis, Konstantinos N

    2018-05-09

    Improved methods are needed to risk stratify and predict outcomes in patients with primary sclerosing cholangitis (PSC). Therefore, we sought to derive and validate a new prediction model and compare its performance to existing surrogate markers. The model was derived using 509 subjects from a multicenter North American cohort and validated in an international multicenter cohort (n=278). Gradient boosting, a machine based learning technique, was used to create the model. The endpoint was hepatic decompensation (ascites, variceal hemorrhage or encephalopathy). Subjects with advanced PSC or cholangiocarcinoma at baseline were excluded. The PSC risk estimate tool (PREsTo) consists of 9 variables: bilirubin, albumin, serum alkaline phosphatase (SAP) times the upper limit of normal (ULN), platelets, AST, hemoglobin, sodium, patient age and the number of years since PSC was diagnosed. Validation in an independent cohort confirms PREsTo accurately predicts decompensation (C statistic 0.90, 95% confidence interval (CI) 0.84-0.95) and performed well compared to MELD score (C statistic 0.72, 95% CI 0.57-0.84), Mayo PSC risk score (C statistic 0.85, 95% CI 0.77-0.92) and SAP < 1.5x ULN (C statistic 0.65, 95% CI 0.55-0.73). PREsTo continued to be accurate among individuals with a bilirubin < 2.0 mg/dL (C statistic 0.90, 95% CI 0.82-0.96) and when the score was re-applied at a later course in the disease (C statistic 0.82, 95% CI 0.64-0.95). PREsTo accurately predicts hepatic decompensation in PSC and exceeds the performance among other widely available, noninvasive prognostic scoring systems. This article is protected by copyright. All rights reserved. © 2018 by the American Association for the Study of Liver Diseases.

  14. Reproductive Risk Factors and Coronary Heart Disease in the Women’s Health Initiative Observational Study

    PubMed Central

    Parikh, Nisha I.; Jeppson, Rebecca P.; Berger, Jeffrey S.; Eaton, Charles B.; Kroenke, Candyce H.; LeBlanc, Erin S.; Lewis, Cora E.; Loucks, Eric B.; Parker, Donna R.; Rillamas-Sun, Eileen; Ryckman, Kelli K; Waring, Molly E.; Schenken, Robert S.; Johnson, Karen C; Edstedt-Bonamy, Anna-Karin; Allison, Matthew A.; Howard, Barbara V.

    2016-01-01

    Background Reproductive factors provide an early window into a woman’s coronary heart disease (CHD) risk, however their contribution to CHD risk stratification is uncertain. Methods and Results In the Women’s Health Initiative Observational Study, we constructed Cox proportional hazards models for CHD including age, pregnancy status, number of live births, age at menarche, menstrual irregularity, age at first birth, stillbirths, miscarriages, infertility ≥ 1 year, infertility cause, and breastfeeding. We next added each candidate reproductive factor to an established CHD risk factor model. A final model was then constructed with significant reproductive factors added to established CHD risk factors. Improvement in C-statistic, net reclassification index (or NRI with risk categories of <5%, 5–<10%, and ≥10% 10-year risk of CHD) and integrated discriminatory index (IDI) were assessed. Among 72,982 women [n=4607 CHD events, median follow-up=12.0 (IQR=8.3–13.7) years, mean (SD) age 63.2 (7.2) years], an age-adjusted reproductive risk factor model had a C-statistic of 0.675 for CHD. In a model adjusted for established CHD risk factors, younger age at first birth, number of still births, number of miscarriages and lack of breastfeeding were positively associated with CHD. Reproductive factors modestly improved model discrimination (C-statistic increased from 0.726 to 0.730; IDI=0.0013, p-value < 0.0001). Net reclassification for women with events was not improved (NRI events=0.007, p-value=0.18); and for women without events was marginally improved (NRI non-events=0.002, p-value=0.04) Conclusions Key reproductive factors are associated with CHD independently of established CHD risk factors, very modestly improve model discrimination and do not materially improve net reclassification. PMID:27143682

  15. Cluster detection methods applied to the Upper Cape Cod cancer data.

    PubMed

    Ozonoff, Al; Webster, Thomas; Vieira, Veronica; Weinberg, Janice; Ozonoff, David; Aschengrau, Ann

    2005-09-15

    A variety of statistical methods have been suggested to assess the degree and/or the location of spatial clustering of disease cases. However, there is relatively little in the literature devoted to comparison and critique of different methods. Most of the available comparative studies rely on simulated data rather than real data sets. We have chosen three methods currently used for examining spatial disease patterns: the M-statistic of Bonetti and Pagano; the Generalized Additive Model (GAM) method as applied by Webster; and Kulldorff's spatial scan statistic. We apply these statistics to analyze breast cancer data from the Upper Cape Cancer Incidence Study using three different latency assumptions. The three different latency assumptions produced three different spatial patterns of cases and controls. For 20 year latency, all three methods generally concur. However, for 15 year latency and no latency assumptions, the methods produce different results when testing for global clustering. The comparative analyses of real data sets by different statistical methods provides insight into directions for further research. We suggest a research program designed around examining real data sets to guide focused investigation of relevant features using simulated data, for the purpose of understanding how to interpret statistical methods applied to epidemiological data with a spatial component.

  16. Treatment effects model for assessing disease management: measuring outcomes and strengthening program management.

    PubMed

    Wendel, Jeanne; Dumitras, Diana

    2005-06-01

    This paper describes an analytical methodology for obtaining statistically unbiased outcomes estimates for programs in which participation decisions may be correlated with variables that impact outcomes. This methodology is particularly useful for intraorganizational program evaluations conducted for business purposes. In this situation, data is likely to be available for a population of managed care members who are eligible to participate in a disease management (DM) program, with some electing to participate while others eschew the opportunity. The most pragmatic analytical strategy for in-house evaluation of such programs is likely to be the pre-intervention/post-intervention design in which the control group consists of people who were invited to participate in the DM program, but declined the invitation. Regression estimates of program impacts may be statistically biased if factors that impact participation decisions are correlated with outcomes measures. This paper describes an econometric procedure, the Treatment Effects model, developed to produce statistically unbiased estimates of program impacts in this type of situation. Two equations are estimated to (a) estimate the impacts of patient characteristics on decisions to participate in the program, and then (b) use this information to produce a statistically unbiased estimate of the impact of program participation on outcomes. This methodology is well-established in economics and econometrics, but has not been widely applied in the DM outcomes measurement literature; hence, this paper focuses on one illustrative application.

  17. Statistics of Optical Coherence Tomography Data From Human Retina

    PubMed Central

    de Juan, Joaquín; Ferrone, Claudia; Giannini, Daniela; Huang, David; Koch, Giorgio; Russo, Valentina; Tan, Ou; Bruni, Carlo

    2010-01-01

    Optical coherence tomography (OCT) has recently become one of the primary methods for noninvasive probing of the human retina. The pseudoimage formed by OCT (the so-called B-scan) varies probabilistically across pixels due to complexities in the measurement technique. Hence, sensitive automatic procedures of diagnosis using OCT may exploit statistical analysis of the spatial distribution of reflectance. In this paper, we perform a statistical study of retinal OCT data. We find that the stretched exponential probability density function can model well the distribution of intensities in OCT pseudoimages. Moreover, we show a small, but significant correlation between neighbor pixels when measuring OCT intensities with pixels of about 5 µm. We then develop a simple joint probability model for the OCT data consistent with known retinal features. This model fits well the stretched exponential distribution of intensities and their spatial correlation. In normal retinas, fit parameters of this model are relatively constant along retinal layers, but varies across layers. However, in retinas with diabetic retinopathy, large spikes of parameter modulation interrupt the constancy within layers, exactly where pathologies are visible. We argue that these results give hope for improvement in statistical pathology-detection methods even when the disease is in its early stages. PMID:20304733

  18. Can upstaging of ductal carcinoma in situ be predicted at biopsy by histologic and mammographic features?

    NASA Astrophysics Data System (ADS)

    Shi, Bibo; Grimm, Lars J.; Mazurowski, Maciej A.; Marks, Jeffrey R.; King, Lorraine M.; Maley, Carlo C.; Hwang, E. Shelley; Lo, Joseph Y.

    2017-03-01

    Reducing the overdiagnosis and overtreatment associated with ductal carcinoma in situ (DCIS) requires accurate prediction of the invasive potential at cancer screening. In this work, we investigated the utility of pre-operative histologic and mammographic features to predict upstaging of DCIS. The goal was to provide intentionally conservative baseline performance using readily available data from radiologists and pathologists and only linear models. We conducted a retrospective analysis on 99 patients with DCIS. Of those 25 were upstaged to invasive cancer at the time of definitive surgery. Pre-operative factors including both the histologic features extracted from stereotactic core needle biopsy (SCNB) reports and the mammographic features annotated by an expert breast radiologist were investigated with statistical analysis. Furthermore, we built classification models based on those features in an attempt to predict the presence of an occult invasive component in DCIS, with generalization performance assessed by receiver operating characteristic (ROC) curve analysis. Histologic features including nuclear grade and DCIS subtype did not show statistically significant differences between cases with pure DCIS and with DCIS plus invasive disease. However, three mammographic features, i.e., the major axis length of DCIS lesion, the BI-RADS level of suspicion, and radiologist's assessment did achieve the statistical significance. Using those three statistically significant features as input, a linear discriminant model was able to distinguish patients with DCIS plus invasive disease from those with pure DCIS, with AUC-ROC equal to 0.62. Overall, mammograms used for breast screening contain useful information that can be perceived by radiologists and help predict occult invasive components in DCIS.

  19. Socio-Demographic and Clinical Characteristics are Not Clinically Useful Predictors of Refill Adherence in Patients with Hypertension

    PubMed Central

    Steiner, John F.; Ho, P. Michael; Beaty, Brenda L.; Dickinson, L. Miriam; Hanratty, Rebecca; Zeng, Chan; Tavel, Heather M.; Havranek, Edward P.; Davidson, Arthur J.; Magid, David J.; Estacio, Raymond O.

    2009-01-01

    Background Although many studies have identified patient characteristics or chronic diseases associated with medication adherence, the clinical utility of such predictors has rarely been assessed. We attempted to develop clinical prediction rules for adherence with antihypertensive medications in two health care delivery systems. Methods and Results Retrospective cohort studies of hypertension registries in an inner-city health care delivery system (N = 17176) and a health maintenance organization (N = 94297) in Denver, Colorado. Adherence was defined by acquisition of 80% or more of antihypertensive medications. A multivariable model in the inner-city system found that adherent patients (36.3% of the total) were more likely than non-adherent patients to be older, white, married, and acculturated in US society, to have diabetes or cerebrovascular disease, not to abuse alcohol or controlled substances, and to be prescribed less than three antihypertensive medications. Although statistically significant, all multivariate odds ratios were 1.7 or less, and the model did not accurately discriminate adherent from non-adherent patients (C-statistic = 0.606). In the health maintenance organization, where 72.1% of patients were adherent, significant but weak associations existed between adherence and older age, white race, the lack of alcohol abuse, and fewer antihypertensive medications. The multivariate model again failed to accurately discriminate adherent from non-adherent individuals (C-statistic = 0.576). Conclusions Although certain socio-demographic characteristics or clinical diagnoses are statistically associated with adherence to refills of antihypertensive medications, a combination of these characteristics is not sufficiently accurate to allow clinicians to predict whether their patients will be adherent with treatment. PMID:20031876

  20. Clinical Prediction Models for Cardiovascular Disease: Tufts Predictive Analytics and Comparative Effectiveness Clinical Prediction Model Database.

    PubMed

    Wessler, Benjamin S; Lai Yh, Lana; Kramer, Whitney; Cangelosi, Michael; Raman, Gowri; Lutz, Jennifer S; Kent, David M

    2015-07-01

    Clinical prediction models (CPMs) estimate the probability of clinical outcomes and hold the potential to improve decision making and individualize care. For patients with cardiovascular disease, there are numerous CPMs available although the extent of this literature is not well described. We conducted a systematic review for articles containing CPMs for cardiovascular disease published between January 1990 and May 2012. Cardiovascular disease includes coronary heart disease, heart failure, arrhythmias, stroke, venous thromboembolism, and peripheral vascular disease. We created a novel database and characterized CPMs based on the stage of development, population under study, performance, covariates, and predicted outcomes. There are 796 models included in this database. The number of CPMs published each year is increasing steadily over time. Seven hundred seventeen (90%) are de novo CPMs, 21 (3%) are CPM recalibrations, and 58 (7%) are CPM adaptations. This database contains CPMs for 31 index conditions, including 215 CPMs for patients with coronary artery disease, 168 CPMs for population samples, and 79 models for patients with heart failure. There are 77 distinct index/outcome pairings. Of the de novo models in this database, 450 (63%) report a c-statistic and 259 (36%) report some information on calibration. There is an abundance of CPMs available for a wide assortment of cardiovascular disease conditions, with substantial redundancy in the literature. The comparative performance of these models, the consistency of effects and risk estimates across models and the actual and potential clinical impact of this body of literature is poorly understood. © 2015 American Heart Association, Inc.

  1. Stochastic modelling, Bayesian inference, and new in vivo measurements elucidate the debated mtDNA bottleneck mechanism

    PubMed Central

    Johnston, Iain G; Burgstaller, Joerg P; Havlicek, Vitezslav; Kolbe, Thomas; Rülicke, Thomas; Brem, Gottfried; Poulton, Jo; Jones, Nick S

    2015-01-01

    Dangerous damage to mitochondrial DNA (mtDNA) can be ameliorated during mammalian development through a highly debated mechanism called the mtDNA bottleneck. Uncertainty surrounding this process limits our ability to address inherited mtDNA diseases. We produce a new, physically motivated, generalisable theoretical model for mtDNA populations during development, allowing the first statistical comparison of proposed bottleneck mechanisms. Using approximate Bayesian computation and mouse data, we find most statistical support for a combination of binomial partitioning of mtDNAs at cell divisions and random mtDNA turnover, meaning that the debated exact magnitude of mtDNA copy number depletion is flexible. New experimental measurements from a wild-derived mtDNA pairing in mice confirm the theoretical predictions of this model. We analytically solve a mathematical description of this mechanism, computing probabilities of mtDNA disease onset, efficacy of clinical sampling strategies, and effects of potential dynamic interventions, thus developing a quantitative and experimentally-supported stochastic theory of the bottleneck. DOI: http://dx.doi.org/10.7554/eLife.07464.001 PMID:26035426

  2. Evaluating the contribution of genetics and familial shared environment to common disease using the UK Biobank.

    PubMed

    Muñoz, María; Pong-Wong, Ricardo; Canela-Xandri, Oriol; Rawlik, Konrad; Haley, Chris S; Tenesa, Albert

    2016-09-01

    Genome-wide association studies have detected many loci underlying susceptibility to disease, but most of the genetic factors that contribute to disease susceptibility remain unknown. Here we provide evidence that part of the 'missing heritability' can be explained by an overestimation of heritability. We estimated the heritability of 12 complex human diseases using family history of disease in 1,555,906 individuals of white ancestry from the UK Biobank. Estimates using simple family-based statistical models were inflated on average by ∼47% when compared with those from structural equation modeling (SEM), which specifically accounted for shared familial environmental factors. In addition, heritabilities estimated using SNP data explained an average of 44.2% of the simple family-based estimates across diseases and an average of 57.3% of the SEM-estimated heritabilities, accounting for almost all of the SEM heritability for hypertension. Our results show that both genetics and familial environment make substantial contributions to familial clustering of disease.

  3. Interpreter of maladies: redescription mining applied to biomedical data analysis.

    PubMed

    Waltman, Peter; Pearlman, Alex; Mishra, Bud

    2006-04-01

    Comprehensive, systematic and integrated data-centric statistical approaches to disease modeling can provide powerful frameworks for understanding disease etiology. Here, one such computational framework based on redescription mining in both its incarnations, static and dynamic, is discussed. The static framework provides bioinformatic tools applicable to multifaceted datasets, containing genetic, transcriptomic, proteomic, and clinical data for diseased patients and normal subjects. The dynamic redescription framework provides systems biology tools to model complex sets of regulatory, metabolic and signaling pathways in the initiation and progression of a disease. As an example, the case of chronic fatigue syndrome (CFS) is considered, which has so far remained intractable and unpredictable in its etiology and nosology. The redescription mining approaches can be applied to the Centers for Disease Control and Prevention's Wichita (KS, USA) dataset, integrating transcriptomic, epidemiological and clinical data, and can also be used to study how pathways in the hypothalamic-pituitary-adrenal axis affect CFS patients.

  4. Syndromic surveillance models using Web data: the case of scarlet fever in the UK.

    PubMed

    Samaras, Loukas; García-Barriocanal, Elena; Sicilia, Miguel-Angel

    2012-03-01

    Recent research has shown the potential of Web queries as a source for syndromic surveillance, and existing studies show that these queries can be used as a basis for estimation and prediction of the development of a syndromic disease, such as influenza, using log linear (logit) statistical models. Two alternative models are applied to the relationship between cases and Web queries in this paper. We examine the applicability of using statistical methods to relate search engine queries with scarlet fever cases in the UK, taking advantage of tools to acquire the appropriate data from Google, and using an alternative statistical method based on gamma distributions. The results show that using logit models, the Pearson correlation factor between Web queries and the data obtained from the official agencies must be over 0.90, otherwise the prediction of the peak and the spread of the distributions gives significant deviations. In this paper, we describe the gamma distribution model and show that we can obtain better results in all cases using gamma transformations, and especially in those with a smaller correlation factor.

  5. TREAT (TREe-based Association Test)

    Cancer.gov

    TREAT is an R package for detecting complex joint effects in case-control studies. The test statistic is derived from a tree-structure model by recursive partitioning the data. Ultra-fast algorithm is designed to evaluate the significance of association between candidate gene and disease outcome

  6. A mathematical model for HIV and hepatitis C co-infection and its assessment from a statistical perspective.

    PubMed

    Castro Sanchez, Amparo Yovanna; Aerts, Marc; Shkedy, Ziv; Vickerman, Peter; Faggiano, Fabrizio; Salamina, Guiseppe; Hens, Niel

    2013-03-01

    The hepatitis C virus (HCV) and the human immunodeficiency virus (HIV) are a clear threat for public health, with high prevalences especially in high risk groups such as injecting drug users. People with HIV infection who are also infected by HCV suffer from a more rapid progression to HCV-related liver disease and have an increased risk for cirrhosis and liver cancer. Quantifying the impact of HIV and HCV co-infection is therefore of great importance. We propose a new joint mathematical model accounting for co-infection with the two viruses in the context of injecting drug users (IDUs). Statistical concepts and methods are used to assess the model from a statistical perspective, in order to get further insights in: (i) the comparison and selection of optional model components, (ii) the unknown values of the numerous model parameters, (iii) the parameters to which the model is most 'sensitive' and (iv) the combinations or patterns of values in the high-dimensional parameter space which are most supported by the data. Data from a longitudinal study of heroin users in Italy are used to illustrate the application of the proposed joint model and its statistical assessment. The parameters associated with contact rates (sharing syringes) and the transmission rates per syringe-sharing event are shown to play a major role. Copyright © 2013 Elsevier B.V. All rights reserved.

  7. Meta-analysis of diagnostic accuracy studies accounting for disease prevalence: alternative parameterizations and model selection.

    PubMed

    Chu, Haitao; Nie, Lei; Cole, Stephen R; Poole, Charles

    2009-08-15

    In a meta-analysis of diagnostic accuracy studies, the sensitivities and specificities of a diagnostic test may depend on the disease prevalence since the severity and definition of disease may differ from study to study due to the design and the population considered. In this paper, we extend the bivariate nonlinear random effects model on sensitivities and specificities to jointly model the disease prevalence, sensitivities and specificities using trivariate nonlinear random-effects models. Furthermore, as an alternative parameterization, we also propose jointly modeling the test prevalence and the predictive values, which reflect the clinical utility of a diagnostic test. These models allow investigators to study the complex relationship among the disease prevalence, sensitivities and specificities; or among test prevalence and the predictive values, which can reveal hidden information about test performance. We illustrate the proposed two approaches by reanalyzing the data from a meta-analysis of radiological evaluation of lymph node metastases in patients with cervical cancer and a simulation study. The latter illustrates the importance of carefully choosing an appropriate normality assumption for the disease prevalence, sensitivities and specificities, or the test prevalence and the predictive values. In practice, it is recommended to use model selection techniques to identify a best-fitting model for making statistical inference. In summary, the proposed trivariate random effects models are novel and can be very useful in practice for meta-analysis of diagnostic accuracy studies. Copyright 2009 John Wiley & Sons, Ltd.

  8. Predictive Big Data Analytics: A Study of Parkinson's Disease Using Large, Complex, Heterogeneous, Incongruent, Multi-Source and Incomplete Observations.

    PubMed

    Dinov, Ivo D; Heavner, Ben; Tang, Ming; Glusman, Gustavo; Chard, Kyle; Darcy, Mike; Madduri, Ravi; Pa, Judy; Spino, Cathie; Kesselman, Carl; Foster, Ian; Deutsch, Eric W; Price, Nathan D; Van Horn, John D; Ames, Joseph; Clark, Kristi; Hood, Leroy; Hampstead, Benjamin M; Dauer, William; Toga, Arthur W

    2016-01-01

    A unique archive of Big Data on Parkinson's Disease is collected, managed and disseminated by the Parkinson's Progression Markers Initiative (PPMI). The integration of such complex and heterogeneous Big Data from multiple sources offers unparalleled opportunities to study the early stages of prevalent neurodegenerative processes, track their progression and quickly identify the efficacies of alternative treatments. Many previous human and animal studies have examined the relationship of Parkinson's disease (PD) risk to trauma, genetics, environment, co-morbidities, or life style. The defining characteristics of Big Data-large size, incongruency, incompleteness, complexity, multiplicity of scales, and heterogeneity of information-generating sources-all pose challenges to the classical techniques for data management, processing, visualization and interpretation. We propose, implement, test and validate complementary model-based and model-free approaches for PD classification and prediction. To explore PD risk using Big Data methodology, we jointly processed complex PPMI imaging, genetics, clinical and demographic data. Collective representation of the multi-source data facilitates the aggregation and harmonization of complex data elements. This enables joint modeling of the complete data, leading to the development of Big Data analytics, predictive synthesis, and statistical validation. Using heterogeneous PPMI data, we developed a comprehensive protocol for end-to-end data characterization, manipulation, processing, cleaning, analysis and validation. Specifically, we (i) introduce methods for rebalancing imbalanced cohorts, (ii) utilize a wide spectrum of classification methods to generate consistent and powerful phenotypic predictions, and (iii) generate reproducible machine-learning based classification that enables the reporting of model parameters and diagnostic forecasting based on new data. We evaluated several complementary model-based predictive approaches, which failed to generate accurate and reliable diagnostic predictions. However, the results of several machine-learning based classification methods indicated significant power to predict Parkinson's disease in the PPMI subjects (consistent accuracy, sensitivity, and specificity exceeding 96%, confirmed using statistical n-fold cross-validation). Clinical (e.g., Unified Parkinson's Disease Rating Scale (UPDRS) scores), demographic (e.g., age), genetics (e.g., rs34637584, chr12), and derived neuroimaging biomarker (e.g., cerebellum shape index) data all contributed to the predictive analytics and diagnostic forecasting. Model-free Big Data machine learning-based classification methods (e.g., adaptive boosting, support vector machines) can outperform model-based techniques in terms of predictive precision and reliability (e.g., forecasting patient diagnosis). We observed that statistical rebalancing of cohort sizes yields better discrimination of group differences, specifically for predictive analytics based on heterogeneous and incomplete PPMI data. UPDRS scores play a critical role in predicting diagnosis, which is expected based on the clinical definition of Parkinson's disease. Even without longitudinal UPDRS data, however, the accuracy of model-free machine learning based classification is over 80%. The methods, software and protocols developed here are openly shared and can be employed to study other neurodegenerative disorders (e.g., Alzheimer's, Huntington's, amyotrophic lateral sclerosis), as well as for other predictive Big Data analytics applications.

  9. The conceptual basis of mathematics in cardiology IV: statistics and model fitting.

    PubMed

    Bates, Jason H T; Sobel, Burton E

    2003-06-01

    This is the fourth in a series of four articles developed for the readers of Coronary Artery Disease. Without language ideas cannot be articulated. What may not be so immediately obvious is that they cannot be formulated either. One of the essential languages of cardiology is mathematics. Unfortunately, medical education does not emphasize, and in fact, often neglects empowering physicians to think mathematically. Reference to statistics, conditional probability, multicompartmental modeling, algebra, calculus and transforms is common but often without provision of genuine conceptual understanding. At the University of Vermont College of Medicine, Professor Bates developed a course designed to address these deficiencies. The course covered mathematical principles pertinent to clinical cardiovascular and pulmonary medicine and research. It focused on fundamental concepts to facilitate formulation and grasp of ideas. This series of four articles was developed to make the material available for a wider audience. The articles will be published sequentially in Coronary Artery Disease. Beginning with fundamental axioms and basic algebraic manipulations they address algebra, function and graph theory, real and complex numbers, calculus and differential equations, mathematical modeling, linear system theory and integral transforms and statistical theory. The principles and concepts they address provide the foundation needed for in-depth study of any of these topics. Perhaps of even more importance, they should empower cardiologists and cardiovascular researchers to utilize the language of mathematics in assessing the phenomena of immediate pertinence to diagnosis, pathophysiology and therapeutics. The presentations are interposed with queries (by Coronary Artery Disease abbreviated as CAD) simulating the nature of interactions that occurred during the course itself. Each article concludes with one or more examples illustrating application of the concepts covered to cardiovascular medicine and biology.

  10. Periodontal Disease and Incident Lung Cancer Risk: A Meta-Analysis of Cohort Studies.

    PubMed

    Zeng, Xian-Tao; Xia, Ling-Yun; Zhang, Yong-Gang; Li, Sheng; Leng, Wei-Dong; Kwong, Joey S W

    2016-10-01

    Periodontal disease is linked to a number of systemic diseases such as cardiovascular diseases and diabetes mellitus. Recent evidence has suggested periodontal disease might be associated with lung cancer. However, their precise relationship is yet to be explored. Hence, this study aims to investigate the association of periodontal disease and risk of incident lung cancer using a meta-analytic approach. PubMed, Scopus, and ScienceDirect were searched up to June 10, 2015. Cohort and nested case-control studies investigating risk of lung cancer in patients with periodontal disease were included. Hazard ratios (HRs) were calculated, as were their 95% confidence intervals (CIs) using a fixed-effect inverse-variance model. Statistical heterogeneity was explored using the Q test as well as the I(2) statistic. Publication bias was assessed by visual inspection of funnel plots symmetry and Egger's test. Five cohort studies were included, involving 321,420 participants in this meta-analysis. Summary estimates based on adjusted data showed that periodontal disease was associated with a significant risk of lung cancer (HR = 1.24, 95% CI = 1.13 to 1.36; I(2) = 30%). No publication bias was detected. Subgroup analysis indicated that the association of periodontal disease and lung cancer remained significant in the female population. Evidence from cohort studies suggests that patients with periodontal disease are at increased risk of developing lung cancer.

  11. Forecasting disease risk for increased epidemic preparedness in public health

    NASA Technical Reports Server (NTRS)

    Myers, M. F.; Rogers, D. J.; Cox, J.; Flahault, A.; Hay, S. I.

    2000-01-01

    Emerging infectious diseases pose a growing threat to human populations. Many of the world's epidemic diseases (particularly those transmitted by intermediate hosts) are known to be highly sensitive to long-term changes in climate and short-term fluctuations in the weather. The application of environmental data to the study of disease offers the capability to demonstrate vector-environment relationships and potentially forecast the risk of disease outbreaks or epidemics. Accurate disease forecasting models would markedly improve epidemic prevention and control capabilities. This chapter examines the potential for epidemic forecasting and discusses the issues associated with the development of global networks for surveillance and prediction. Existing global systems for epidemic preparedness focus on disease surveillance using either expert knowledge or statistical modelling of disease activity and thresholds to identify times and areas of risk. Predictive health information systems would use monitored environmental variables, linked to a disease system, to be observed and provide prior information of outbreaks. The components and varieties of forecasting systems are discussed with selected examples, along with issues relating to further development.

  12. Forecasting Disease Risk for Increased Epidemic Preparedness in Public Health

    PubMed Central

    Myers, M.F.; Rogers, D.J.; Cox, J.; Flahault, A.; Hay, S.I.

    2011-01-01

    Emerging infectious diseases pose a growing threat to human populations. Many of the world’s epidemic diseases (particularly those transmitted by intermediate hosts) are known to be highly sensitive to long-term changes in climate and short-term fluctuations in the weather. The application of environmental data to the study of disease offers the capability to demonstrate vector–environment relationships and potentially forecast the risk of disease outbreaks or epidemics. Accurate disease forecasting models would markedly improve epidemic prevention and control capabilities. This chapter examines the potential for epidemic forecasting and discusses the issues associated with the development of global networks for surveillance and prediction. Existing global systems for epidemic preparedness focus on disease surveillance using either expert knowledge or statistical modelling of disease activity and thresholds to identify times and areas of risk. Predictive health information systems would use monitored environmental variables, linked to a disease system, to be observed and provide prior information of outbreaks. The components and varieties of forecasting systems are discussed with selected examples, along with issues relating to further development. PMID:10997211

  13. Laboratory-based versus non-laboratory-based method for assessment of cardiovascular disease risk: the NHANES I Follow-up Study cohort

    PubMed Central

    Gaziano, Thomas A; Young, Cynthia R; Fitzmaurice, Garrett; Atwood, Sidney; Gaziano, J Michael

    2008-01-01

    Summary Background Around 80% of all cardiovascular deaths occur in developing countries. Assessment of those patients at high risk is an important strategy for prevention. Since developing countries have limited resources for prevention strategies that require laboratory testing, we assessed if a risk prediction method that did not require any laboratory tests could be as accurate as one requiring laboratory information. Methods The National Health and Nutrition Examination Survey (NHANES) was a prospective cohort study of 14 407 US participants aged between 25–74 years at the time they were first examined (between 1971 and 1975). Our follow-up study population included participants with complete information on these surveys who did not report a history of cardiovascular disease (myocardial infarction, heart failure, stroke, angina) or cancer, yielding an analysis dataset N=6186. We compared how well either method could predict first-time fatal and non-fatal cardiovascular disease events in this cohort. For the laboratory-based model, which required blood testing, we used standard risk factors to assess risk of cardiovascular disease: age, systolic blood pressure, smoking status, total cholesterol, reported diabetes status, and current treatment for hypertension. For the non-laboratory-based model, we substituted body-mass index for cholesterol. Findings In the cohort of 6186, there were 1529 first-time cardiovascular events and 578 (38%) deaths due to cardiovascular disease over 21 years. In women, the laboratory-based model was useful for predicting events, with a c statistic of 0·829. The c statistic of the non-laboratory-based model was 0·831. In men, the results were similar (0·784 for the laboratory-based model and 0·783 for the non-laboratory-based model). Results were similar between the laboratory-based and non-laboratory-based models in both men and women when restricted to fatal events only. Interpretation A method that uses non-laboratory-based risk factors predicted cardiovascular events as accurately as one that relied on laboratory-based values. This approach could simplify risk assessment in situations where laboratory testing is inconvenient or unavailable. PMID:18342687

  14. Functional Logistic Regression Approach to Detecting Gene by Longitudinal Environmental Exposure Interaction in a Case-Control Study

    PubMed Central

    Wei, Peng; Tang, Hongwei; Li, Donghui

    2014-01-01

    Most complex human diseases are likely the consequence of the joint actions of genetic and environmental factors. Identification of gene-environment (GxE) interactions not only contributes to a better understanding of the disease mechanisms, but also improves disease risk prediction and targeted intervention. In contrast to the large number of genetic susceptibility loci discovered by genome-wide association studies, there have been very few successes in identifying GxE interactions which may be partly due to limited statistical power and inaccurately measured exposures. While existing statistical methods only consider interactions between genes and static environmental exposures, many environmental/lifestyle factors, such as air pollution and diet, change over time, and cannot be accurately captured at one measurement time point or by simply categorizing into static exposure categories. There is a dearth of statistical methods for detecting gene by time-varying environmental exposure interactions. Here we propose a powerful functional logistic regression (FLR) approach to model the time-varying effect of longitudinal environmental exposure and its interaction with genetic factors on disease risk. Capitalizing on the powerful functional data analysis framework, our proposed FLR model is capable of accommodating longitudinal exposures measured at irregular time points and contaminated by measurement errors, commonly encountered in observational studies. We use extensive simulations to show that the proposed method can control the Type I error and is more powerful than alternative ad hoc methods. We demonstrate the utility of this new method using data from a case-control study of pancreatic cancer to identify the windows of vulnerability of lifetime body mass index on the risk of pancreatic cancer as well as genes which may modify this association. PMID:25219575

  15. Functional logistic regression approach to detecting gene by longitudinal environmental exposure interaction in a case-control study.

    PubMed

    Wei, Peng; Tang, Hongwei; Li, Donghui

    2014-11-01

    Most complex human diseases are likely the consequence of the joint actions of genetic and environmental factors. Identification of gene-environment (G × E) interactions not only contributes to a better understanding of the disease mechanisms, but also improves disease risk prediction and targeted intervention. In contrast to the large number of genetic susceptibility loci discovered by genome-wide association studies, there have been very few successes in identifying G × E interactions, which may be partly due to limited statistical power and inaccurately measured exposures. Although existing statistical methods only consider interactions between genes and static environmental exposures, many environmental/lifestyle factors, such as air pollution and diet, change over time, and cannot be accurately captured at one measurement time point or by simply categorizing into static exposure categories. There is a dearth of statistical methods for detecting gene by time-varying environmental exposure interactions. Here, we propose a powerful functional logistic regression (FLR) approach to model the time-varying effect of longitudinal environmental exposure and its interaction with genetic factors on disease risk. Capitalizing on the powerful functional data analysis framework, our proposed FLR model is capable of accommodating longitudinal exposures measured at irregular time points and contaminated by measurement errors, commonly encountered in observational studies. We use extensive simulations to show that the proposed method can control the Type I error and is more powerful than alternative ad hoc methods. We demonstrate the utility of this new method using data from a case-control study of pancreatic cancer to identify the windows of vulnerability of lifetime body mass index on the risk of pancreatic cancer as well as genes that may modify this association. © 2014 Wiley Periodicals, Inc.

  16. Multiplatform serum metabolic phenotyping combined with pathway mapping to identify biochemical differences in smokers.

    PubMed

    Kaluarachchi, Manuja R; Boulangé, Claire L; Garcia-Perez, Isabel; Lindon, John C; Minet, Emmanuel F

    2016-10-01

    Determining perturbed biochemical functions associated with tobacco smoking should be helpful for establishing causal relationships between exposure and adverse events. A multiplatform comparison of serum of smokers (n = 55) and never-smokers (n = 57) using nuclear magnetic resonance spectroscopy, UPLC-MS and statistical modeling revealed clustering of the classes, distinguished by metabolic biomarkers. The identified metabolites were subjected to metabolic pathway enrichment, modeling adverse biological events using available databases. Perturbation of metabolites involved in chronic obstructive pulmonary disease, cardiovascular diseases and cancer were identified and discussed. Combining multiplatform metabolic phenotyping with knowledge-based mapping gives mechanistic insights into disease development, which can be applied to next-generation tobacco and nicotine products for comparative risk assessment.

  17. Genetic-based prediction of disease traits: prediction is very difficult, especially about the future†

    PubMed Central

    Schrodi, Steven J.; Mukherjee, Shubhabrata; Shan, Ying; Tromp, Gerard; Sninsky, John J.; Callear, Amy P.; Carter, Tonia C.; Ye, Zhan; Haines, Jonathan L.; Brilliant, Murray H.; Crane, Paul K.; Smelser, Diane T.; Elston, Robert C.; Weeks, Daniel E.

    2014-01-01

    Translation of results from genetic findings to inform medical practice is a highly anticipated goal of human genetics. The aim of this paper is to review and discuss the role of genetics in medically-relevant prediction. Germline genetics presages disease onset and therefore can contribute prognostic signals that augment laboratory tests and clinical features. As such, the impact of genetic-based predictive models on clinical decisions and therapy choice could be profound. However, given that (i) medical traits result from a complex interplay between genetic and environmental factors, (ii) the underlying genetic architectures for susceptibility to common diseases are not well-understood, and (iii) replicable susceptibility alleles, in combination, account for only a moderate amount of disease heritability, there are substantial challenges to constructing and implementing genetic risk prediction models with high utility. In spite of these challenges, concerted progress has continued in this area with an ongoing accumulation of studies that identify disease predisposing genotypes. Several statistical approaches with the aim of predicting disease have been published. Here we summarize the current state of disease susceptibility mapping and pharmacogenetics efforts for risk prediction, describe methods used to construct and evaluate genetic-based predictive models, and discuss applications. PMID:24917882

  18. A new scoring system in Cystic Fibrosis: statistical tools for database analysis - a preliminary report.

    PubMed

    Hafen, G M; Hurst, C; Yearwood, J; Smith, J; Dzalilov, Z; Robinson, P J

    2008-10-05

    Cystic fibrosis is the most common fatal genetic disorder in the Caucasian population. Scoring systems for assessment of Cystic fibrosis disease severity have been used for almost 50 years, without being adapted to the milder phenotype of the disease in the 21st century. The aim of this current project is to develop a new scoring system using a database and employing various statistical tools. This study protocol reports the development of the statistical tools in order to create such a scoring system. The evaluation is based on the Cystic Fibrosis database from the cohort at the Royal Children's Hospital in Melbourne. Initially, unsupervised clustering of the all data records was performed using a range of clustering algorithms. In particular incremental clustering algorithms were used. The clusters obtained were characterised using rules from decision trees and the results examined by clinicians. In order to obtain a clearer definition of classes expert opinion of each individual's clinical severity was sought. After data preparation including expert-opinion of an individual's clinical severity on a 3 point-scale (mild, moderate and severe disease), two multivariate techniques were used throughout the analysis to establish a method that would have a better success in feature selection and model derivation: 'Canonical Analysis of Principal Coordinates' and 'Linear Discriminant Analysis'. A 3-step procedure was performed with (1) selection of features, (2) extracting 5 severity classes out of a 3 severity class as defined per expert-opinion and (3) establishment of calibration datasets. (1) Feature selection: CAP has a more effective "modelling" focus than DA.(2) Extraction of 5 severity classes: after variables were identified as important in discriminating contiguous CF severity groups on the 3-point scale as mild/moderate and moderate/severe, Discriminant Function (DF) was used to determine the new groups mild, intermediate moderate, moderate, intermediate severe and severe disease. (3) Generated confusion tables showed a misclassification rate of 19.1% for males and 16.5% for females, with a majority of misallocations into adjacent severity classes particularly for males. Our preliminary data show that using CAP for detection of selection features and Linear DA to derive the actual model in a CF database might be helpful in developing a scoring system. However, there are several limitations, particularly more data entry points are needed to finalize a score and the statistical tools have further to be refined and validated, with re-running the statistical methods in the larger dataset.

  19. Characterizing the lung tissue mechanical properties using a micromechanical model of alveolar sac

    NASA Astrophysics Data System (ADS)

    Karami, Elham; Seify, Behzad; Moghadas, Hadi; Sabsalinejad, Masoomeh; Lee, Ting-Yim; Samani, Abbas

    2017-03-01

    According to statistics, lung disease is among the leading causes of death worldwide. As such, many research groups are developing powerful tools for understanding, diagnosis and treatment of various lung diseases. Recently, biomechanical modeling has emerged as an effective tool for better understanding of human physiology, disease diagnosis and computer assisted medical intervention. Mechanical properties of lung tissue are important requirements for methods developed for lung disease diagnosis and medical intervention. As such, the main objective of this study is to develop an effective tool for estimating the mechanical properties of normal and pathological lung parenchyma tissue based on its microstructure. For this purpose, a micromechanical model of the lung tissue was developed using finite element (FE) method, and the model was demonstrated to have application in estimating the mechanical properties of lung alveolar wall. The proposed model was developed by assembling truncated octahedron tissue units resembling the alveoli. A compression test was simulated using finite element method on the created geometry and the hyper-elastic parameters of the alveoli wall were calculated using reported alveolar wall stress-strain data and an inverse optimization framework. Preliminary results indicate that the proposed model can be potentially used to reconstruct microstructural images of lung tissue using macro-scale tissue response for normal and different pathological conditions. Such images can be used for effective diagnosis of lung diseases such as Chronic Obstructive Pulmonary Disease (COPD).

  20. Imaging plus X: multimodal models of neurodegenerative disease.

    PubMed

    Oxtoby, Neil P; Alexander, Daniel C

    2017-08-01

    This article argues that the time is approaching for data-driven disease modelling to take centre stage in the study and management of neurodegenerative disease. The snowstorm of data now available to the clinician defies qualitative evaluation; the heterogeneity of data types complicates integration through traditional statistical methods; and the large datasets becoming available remain far from the big-data sizes necessary for fully data-driven machine-learning approaches. The recent emergence of data-driven disease progression models provides a balance between imposed knowledge of disease features and patterns learned from data. The resulting models are both predictive of disease progression in individual patients and informative in terms of revealing underlying biological patterns. Largely inspired by observational models, data-driven disease progression models have emerged in the last few years as a feasible means for understanding the development of neurodegenerative diseases. These models have revealed insights into frontotemporal dementia, Huntington's disease, multiple sclerosis, Parkinson's disease and other conditions. For example, event-based models have revealed finer graded understanding of progression patterns; self-modelling regression and differential equation models have provided data-driven biomarker trajectories; spatiotemporal models have shown that brain shape changes, for example of the hippocampus, can occur before detectable neurodegeneration; and network models have provided some support for prion-like mechanistic hypotheses of disease propagation. The most mature results are in sporadic Alzheimer's disease, in large part because of the availability of the Alzheimer's disease neuroimaging initiative dataset. Results generally support the prevailing amyloid-led hypothetical model of Alzheimer's disease, while revealing finer detail and insight into disease progression. The emerging field of disease progression modelling provides a natural mechanism to integrate different kinds of information, for example from imaging, serum and cerebrospinal fluid markers and cognitive tests, to obtain new insights into progressive diseases. Such insights include fine-grained longitudinal patterns of neurodegeneration, from early stages, and the heterogeneity of these trajectories over the population. More pragmatically, such models enable finer precision in patient staging and stratification, prediction of progression rates and earlier and better identification of at-risk individuals. We argue that this will make disease progression modelling invaluable for recruitment and end-points in future clinical trials, potentially ameliorating the high failure rate in trials of, e.g., Alzheimer's disease therapies. We review the state of the art in these techniques and discuss the future steps required to translate the ideas to front-line application.

  1. An argument for mechanism-based statistical inference in cancer

    PubMed Central

    Ochs, Michael; Price, Nathan D.; Tomasetti, Cristian; Younes, Laurent

    2015-01-01

    Cancer is perhaps the prototypical systems disease, and as such has been the focus of extensive study in quantitative systems biology. However, translating these programs into personalized clinical care remains elusive and incomplete. In this perspective, we argue that realizing this agenda—in particular, predicting disease phenotypes, progression and treatment response for individuals—requires going well beyond standard computational and bioinformatics tools and algorithms. It entails designing global mathematical models over network-scale configurations of genomic states and molecular concentrations, and learning the model parameters from limited available samples of high-dimensional and integrative omics data. As such, any plausible design should accommodate: biological mechanism, necessary for both feasible learning and interpretable decision making; stochasticity, to deal with uncertainty and observed variation at many scales; and a capacity for statistical inference at the patient level. This program, which requires a close, sustained collaboration between mathematicians and biologists, is illustrated in several contexts, including learning bio-markers, metabolism, cell signaling, network inference and tumorigenesis. PMID:25381197

  2. Semi-quantitative assessment of disease risks at the human, livestock, wildlife interface for the Republic of Korea using a nationwide survey of experts: A model for other countries

    USGS Publications Warehouse

    Hwang, Jusun; Lee, Kyunglee; Walsh, Daniel P.; Kim, SangWha; Sleeman, Jonathan M.; Lee, Hang

    2018-01-01

    Wildlife-associated diseases and pathogens have increased in importance; however, management of a large number of diseases and diversity of hosts is prohibitively expensive. Thus, the determination of priority wildlife pathogens and risk factors for disease emergence is warranted. We used an online questionnaire survey to assess release and exposure risks, and consequences of wildlife-associated diseases and pathogens in the Republic of Korea (ROK). We also surveyed opinions on pathways for disease exposure, and risk factors for disease emergence and spread. For the assessment of risk, we employed a two-tiered, statistical K-means clustering algorithm to group diseases into three levels (high, medium and low) of perceived risk based on release and exposure risks, societal consequences and the level of uncertainty of the experts’ opinions. To examine the experts’ perceived risk of routes of introduction of pathogens and disease amplification and spread, we used a Bayesian, multivariate normal order-statistics model. Six diseases or pathogens, including four livestock and two wildlife diseases, were identified as having high risk with low uncertainty. Similarly, 13 diseases were characterized as having high risk with medium uncertainty with three of these attributed to livestock, six associated with human disease, and the remainder having the potential to affect human, livestock and wildlife (i.e., One Health). Lastly, four diseases were described as high risk with high certainty, and were associated solely with fish diseases. Experts identified migration of wildlife, international human movement and illegal importation of wildlife as the three routes posing the greatest risk of pathogen introduction into ROK. Proximity of humans, livestock and wildlife was the most significant risk factor for promoting the spread of wildlife-associated diseases and pathogens, followed by high density of livestock populations, habitat loss and environmental degradation, and climate change. This study provides useful information to decision makers responsible for allocating resources to address disease risks. This approach provided a rapid, cost-effective method of risk assessment of wildlife-associated diseases and pathogens for which the published literature is sparse.

  3. [Modelling the effect of local climatic variability on dengue transmission in Medellin (Colombia) by means of time series analysis].

    PubMed

    Rúa-Uribe, Guillermo L; Suárez-Acosta, Carolina; Chauca, José; Ventosilla, Palmira; Almanza, Rita

    2013-09-01

    Dengue fever is a major impact on public health vector-borne disease, and its transmission is influenced by entomological, sociocultural and economic factors. Additionally, climate variability plays an important role in the transmission dynamics. A large scientific consensus has indicated that the strong association between climatic variables and disease could be used to develop models to explain the incidence of the disease. To develop a model that provides a better understanding of dengue transmission dynamics in Medellin and predicts increases in the incidence of the disease. The incidence of dengue fever was used as dependent variable, and weekly climatic factors (maximum, mean and minimum temperature, relative humidity and precipitation) as independent variables. Expert Modeler was used to develop a model to better explain the behavior of the disease. Climatic variables with significant association to the dependent variable were selected through ARIMA models. The model explains 34% of observed variability. Precipitation was the climatic variable showing statistically significant association with the incidence of dengue fever, but with a 20 weeks delay. In Medellin, the transmission of dengue fever was influenced by climate variability, especially precipitation. The strong association dengue fever/precipitation allowed the construction of a model to help understand dengue transmission dynamics. This information will be useful to develop appropriate and timely strategies for dengue control.

  4. Towards a resource-based habitat approach for spatial modelling of vector-borne disease risks.

    PubMed

    Hartemink, Nienke; Vanwambeke, Sophie O; Purse, Bethan V; Gilbert, Marius; Van Dyck, Hans

    2015-11-01

    Given the veterinary and public health impact of vector-borne diseases, there is a clear need to assess the suitability of landscapes for the emergence and spread of these diseases. Current approaches for predicting disease risks neglect key features of the landscape as components of the functional habitat of vectors or hosts, and hence of the pathogen. Empirical-statistical methods do not explicitly incorporate biological mechanisms, whereas current mechanistic models are rarely spatially explicit; both methods ignore the way animals use the landscape (i.e. movement ecology). We argue that applying a functional concept for habitat, i.e. the resource-based habitat concept (RBHC), can solve these issues. The RBHC offers a framework to identify systematically the different ecological resources that are necessary for the completion of the transmission cycle and to relate these resources to (combinations of) landscape features and other environmental factors. The potential of the RBHC as a framework for identifying suitable habitats for vector-borne pathogens is explored and illustrated with the case of bluetongue virus, a midge-transmitted virus affecting ruminants. The concept facilitates the study of functional habitats of the interacting species (vectors as well as hosts) and provides new insight into spatial and temporal variation in transmission opportunities and exposure that ultimately determine disease risks. It may help to identify knowledge gaps and control options arising from changes in the spatial configuration of key resources across the landscape. The RBHC framework may act as a bridge between existing mechanistic and statistical modelling approaches. © 2014 The Authors. Biological Reviews published by John Wiley & Sons Ltd on behalf of Cambridge Philosophical Society.

  5. Étude statistique et dynamique de la propagation d'épidémies dans un réseau de petit mondeStatistical and dynamical study of the epidemics propagation in a small world network

    NASA Astrophysics Data System (ADS)

    Zekri, Nouredine; Clerc, Jean Pierre

    We study numerically in this work the statistical and dynamical properties of the clusters in a one dimensional small world model. The parameters chosen correspond to a realistic network of children of school age where a disease like measles can propagate. Extensive results on the statistical behavior of the clusters around the percolation threshold, as well as the evoltion with time, are discussed. To cite this article: N. Zekri, J.P. Clerc, C. R. Physique 3 (2002) 741-747.

  6. Staging Liver Fibrosis with Statistical Observers

    NASA Astrophysics Data System (ADS)

    Brand, Jonathan Frieman

    Chronic liver disease is a worldwide health problem, and hepatic fibrosis (HF) is one of the hallmarks of the disease. Pathology diagnosis of HF is based on textural change in the liver as a lobular collagen network that develops within portal triads. The scale of collagen lobules is characteristically on order of 1mm, which close to the resolution limit of in vivo Gd-enhanced MRI. In this work the methods to collect training and testing images for a Hotelling observer are covered. An observer based on local texture analysis is trained and tested using wet-tissue phantoms. The technique is used to optimize the MRI sequence based on task performance. The final method developed is a two stage model observer to classify fibrotic and healthy tissue in both phantoms and in vivo MRI images. The first stage observer tests for the presence of local texture. Test statistics from the first observer are used to train the second stage observer to globally sample the local observer results. A decision of the disease class is made for an entire MRI image slice using test statistics collected from the second observer. The techniques are tested on wet-tissue phantoms and in vivo clinical patient data.

  7. An electronic health record based model predicts statin adherence, LDL cholesterol, and cardiovascular disease in the United States Military Health System

    PubMed Central

    Lucas, Joseph E.; Bazemore, Taylor C.; Alo, Celan; Monahan, Patrick B.

    2017-01-01

    HMG-CoA reductase inhibitors (or “statins”) are important and commonly used medications to lower cholesterol and prevent cardiovascular disease. Nearly half of patients stop taking statin medications one year after they are prescribed leading to higher cholesterol, increased cardiovascular risk, and costs due to excess hospitalizations. Identifying which patients are at highest risk for not adhering to long-term statin therapy is an important step towards individualizing interventions to improve adherence. Electronic health records (EHR) are an increasingly common source of data that are challenging to analyze but have potential for generating more accurate predictions of disease risk. The aim of this study was to build an EHR based model for statin adherence and link this model to biologic and clinical outcomes in patients receiving statin therapy. We gathered EHR data from the Military Health System which maintains administrative data for active duty, retirees, and dependents of the United States armed forces military that receive health care benefits. Data were gathered from patients prescribed their first statin prescription in 2005 and 2006. Baseline billing, laboratory, and pharmacy claims data were collected from the two years leading up to the first statin prescription and summarized using non-negative matrix factorization. Follow up statin prescription refill data was used to define the adherence outcome (> 80 percent days covered). The subsequent factors to emerge from this model were then used to build cross-validated, predictive models of 1) overall disease risk using coalescent regression and 2) statin adherence (using random forest regression). The predicted statin adherence for each patient was subsequently used to correlate with cholesterol lowering and hospitalizations for cardiovascular disease during the 5 year follow up period using Cox regression. The analytical dataset included 138 731 individuals and 1840 potential baseline predictors that were reduced to 30 independent EHR “factors”. A random forest predictive model taking patient, statin prescription, predicted disease risk, and the EHR factors as potential inputs produced a cross-validated c-statistic of 0.736 for classifying statin non-adherence. The addition of the first refill to the model increased the c-statistic to 0.81. The predicted statin adherence was independently associated with greater cholesterol lowering (correlation = 0.14, p < 1e-20) and lower hospitalization for myocardial infarction, coronary artery disease, and stroke (hazard ratio = 0.84, p = 1.87E-06). Electronic health records data can be used to build a predictive model of statin adherence that also correlates with statins’ cardiovascular benefits. PMID:29155848

  8. Spatiotemporal Variation in Distance Dependent Animal Movement Contacts: One Size Doesn’t Fit All

    PubMed Central

    Brommesson, Peter; Wennergren, Uno; Lindström, Tom

    2016-01-01

    The structure of contacts that mediate transmission has a pronounced effect on the outbreak dynamics of infectious disease and simulation models are powerful tools to inform policy decisions. Most simulation models of livestock disease spread rely to some degree on predictions of animal movement between holdings. Typically, movements are more common between nearby farms than between those located far away from each other. Here, we assessed spatiotemporal variation in such distance dependence of animal movement contacts from an epidemiological perspective. We evaluated and compared nine statistical models, applied to Swedish movement data from 2008. The models differed in at what level (if at all), they accounted for regional and/or seasonal heterogeneities in the distance dependence of the contacts. Using a kernel approach to describe how probability of contacts between farms changes with distance, we developed a hierarchical Bayesian framework and estimated parameters by using Markov Chain Monte Carlo techniques. We evaluated models by three different approaches of model selection. First, we used Deviance Information Criterion to evaluate their performance relative to each other. Secondly, we estimated the log predictive posterior distribution, this was also used to evaluate their relative performance. Thirdly, we performed posterior predictive checks by simulating movements with each of the parameterized models and evaluated their ability to recapture relevant summary statistics. Independent of selection criteria, we found that accounting for regional heterogeneity improved model accuracy. We also found that accounting for seasonal heterogeneity was beneficial, in terms of model accuracy, according to two of three methods used for model selection. Our results have important implications for livestock disease spread models where movement is an important risk factor for between farm transmission. We argue that modelers should refrain from using methods to simulate animal movements that assume the same pattern across all regions and seasons without explicitly testing for spatiotemporal variation. PMID:27760155

  9. A rigorous statistical framework for spatio-temporal pollution prediction and estimation of its long-term impact on health.

    PubMed

    Lee, Duncan; Mukhopadhyay, Sabyasachi; Rushworth, Alastair; Sahu, Sujit K

    2017-04-01

    In the United Kingdom, air pollution is linked to around 40000 premature deaths each year, but estimating its health effects is challenging in a spatio-temporal study. The challenges include spatial misalignment between the pollution and disease data; uncertainty in the estimated pollution surface; and complex residual spatio-temporal autocorrelation in the disease data. This article develops a two-stage model that addresses these issues. The first stage is a spatio-temporal fusion model linking modeled and measured pollution data, while the second stage links these predictions to the disease data. The methodology is motivated by a new five-year study investigating the effects of multiple pollutants on respiratory hospitalizations in England between 2007 and 2011, using pollution and disease data relating to local and unitary authorities on a monthly time scale. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  10. Sequence-Based Prioritization of Nonsynonymous Single-Nucleotide Polymorphisms for the Study of Disease Mutations

    PubMed Central

    Jiang, Rui ; Yang, Hua ; Zhou, Linqi ; Kuo, C.-C. Jay ; Sun, Fengzhu ; Chen, Ting 

    2007-01-01

    The increasing demand for the identification of genetic variation responsible for common diseases has translated into a need for sophisticated methods for effectively prioritizing mutations occurring in disease-associated genetic regions. In this article, we prioritize candidate nonsynonymous single-nucleotide polymorphisms (nsSNPs) through a bioinformatics approach that takes advantages of a set of improved numeric features derived from protein-sequence information and a new statistical learning model called “multiple selection rule voting” (MSRV). The sequence-based features can maximize the scope of applications of our approach, and the MSRV model can capture subtle characteristics of individual mutations. Systematic validation of the approach demonstrates that this approach is capable of prioritizing causal mutations for both simple monogenic diseases and complex polygenic diseases. Further studies of familial Alzheimer diseases and diabetes show that the approach can enrich mutations underlying these polygenic diseases among the top of candidate mutations. Application of this approach to unclassified mutations suggests that there are 10 suspicious mutations likely to cause diseases, and there is strong support for this in the literature. PMID:17668383

  11. Prediction of First Cardiovascular Disease Event in Type 1 Diabetes Mellitus: The Steno Type 1 Risk Engine.

    PubMed

    Vistisen, Dorte; Andersen, Gregers Stig; Hansen, Christian Stevns; Hulman, Adam; Henriksen, Jan Erik; Bech-Nielsen, Henning; Jørgensen, Marit Eika

    2016-03-15

    Patients with type 1 diabetes mellitus are at increased risk of developing cardiovascular disease (CVD), but they are currently undertreated. There are no risk scores used on a regular basis in clinical practice for assessing the risk of CVD in type 1 diabetes mellitus. From 4306 clinically diagnosed adult patients with type 1 diabetes mellitus, we developed a prediction model for estimating the risk of first fatal or nonfatal CVD event (ischemic heart disease, ischemic stroke, heart failure, and peripheral artery disease). Detailed clinical data including lifestyle factors were linked to event data from validated national registers. The risk prediction model was developed by using a 2-stage approach. First, a nonparametric, data-driven approach was used to identify potentially informative risk factors and interactions (random forest and survival tree analysis). Second, based on results from the first step, Poisson regression analysis was used to derive the final model. The final CVD prediction model was externally validated in a different population of 2119 patients with type 1 diabetes mellitus. During a median follow-up of 6.8 years (interquartile range, 2.9-10.9) a total of 793 (18.4%) patients developed CVD. The final prediction model included age, sex, diabetes duration, systolic blood pressure, low-density lipoprotein cholesterol, hemoglobin A1c, albuminuria, glomerular filtration rate, smoking, and exercise. Discrimination was excellent for a 5-year CVD event with a C-statistic of 0.826 (95% confidence interval, 0.807-0.845) in the derivation data and a C-statistic of 0.803 (95% confidence interval, 0.767-0.839) in the validation data. The Hosmer-Lemeshow test showed good calibration (P>0.05) in both cohorts. This high-performing CVD risk model allows for the implementation of decision rules in a clinical setting. © 2016 American Heart Association, Inc.

  12. Computational modeling and statistical analyses on individual contact rate and exposure to disease in complex and confined transportation hubs

    NASA Astrophysics Data System (ADS)

    Wang, W. L.; Tsui, K. L.; Lo, S. M.; Liu, S. B.

    2018-01-01

    Crowded transportation hubs such as metro stations are thought as ideal places for the development and spread of epidemics. However, for the special features of complex spatial layout, confined environment with a large number of highly mobile individuals, it is difficult to quantify human contacts in such environments, wherein disease spreading dynamics were less explored in the previous studies. Due to the heterogeneity and dynamic nature of human interactions, increasing studies proved the importance of contact distance and length of contact in transmission probabilities. In this study, we show how detailed information on contact and exposure patterns can be obtained by statistical analyses on microscopic crowd simulation data. To be specific, a pedestrian simulation model-CityFlow was employed to reproduce individuals' movements in a metro station based on site survey data, values and distributions of individual contact rate and exposure in different simulation cases were obtained and analyzed. It is interesting that Weibull distribution fitted the histogram values of individual-based exposure in each case very well. Moreover, we found both individual contact rate and exposure had linear relationship with the average crowd densities of the environments. The results obtained in this paper can provide reference to epidemic study in complex and confined transportation hubs and refine the existing disease spreading models.

  13. Big Data for Infectious Disease Surveillance and Modeling.

    PubMed

    Bansal, Shweta; Chowell, Gerardo; Simonsen, Lone; Vespignani, Alessandro; Viboud, Cécile

    2016-12-01

    We devote a special issue of the Journal of Infectious Diseases to review the recent advances of big data in strengthening disease surveillance, monitoring medical adverse events, informing transmission models, and tracking patient sentiments and mobility. We consider a broad definition of big data for public health, one encompassing patient information gathered from high-volume electronic health records and participatory surveillance systems, as well as mining of digital traces such as social media, Internet searches, and cell-phone logs. We introduce nine independent contributions to this special issue and highlight several cross-cutting areas that require further research, including representativeness, biases, volatility, and validation, and the need for robust statistical and hypotheses-driven analyses. Overall, we are optimistic that the big-data revolution will vastly improve the granularity and timeliness of available epidemiological information, with hybrid systems augmenting rather than supplanting traditional surveillance systems, and better prospects for accurate infectious diseases models and forecasts. Published by Oxford University Press for the Infectious Diseases Society of America 2016. This work is written by (a) US Government employee(s) and is in the public domain in the US.

  14. Organ allocation for chronic liver disease: model for end-stage liver disease and beyond.

    PubMed

    Asrani, Sumeet K; Kim, W Ray

    2010-05-01

    Implementation of the model for end-stage liver disease (MELD) score has led to a reduction in waiting list registration and waitlist mortality. Prognostic models have been proposed to either refine or improve the current MELD-based liver allocation. The model for end-stage liver disease - sodium (MELDNa) incorporates serum sodium and has been shown to improve the predictive accuracy of the MELD score. However, laboratory variation and manipulation of serum sodium is a concern. Organ allocation in the United Kingdom is now based on a model that includes serum sodium. An updated MELD score is associated with a lower relative weight for serum creatinine coefficient and a higher relative weight for bilirubin coefficient, although the contribution of reweighting coefficients as compared with addition of variables is unclear. The D-MELD, the arithmetic product of donor age and preoperative MELD, proposes donor-recipient matching; however, inappropriate transplantation of high-risk donors is a concern. Finally, the net benefit model ranks patients according to the net survival benefit that they would derive from the transplant. However, complex statistical models are required and unmeasured characteristics may unduly affect the model. Despite their limitations, efforts to improve the current MELD-based organ allocation are encouraging.

  15. Accounting for Population Structure in Gene-by-Environment Interactions in Genome-Wide Association Studies Using Mixed Models.

    PubMed

    Sul, Jae Hoon; Bilow, Michael; Yang, Wen-Yun; Kostem, Emrah; Furlotte, Nick; He, Dan; Eskin, Eleazar

    2016-03-01

    Although genome-wide association studies (GWASs) have discovered numerous novel genetic variants associated with many complex traits and diseases, those genetic variants typically explain only a small fraction of phenotypic variance. Factors that account for phenotypic variance include environmental factors and gene-by-environment interactions (GEIs). Recently, several studies have conducted genome-wide gene-by-environment association analyses and demonstrated important roles of GEIs in complex traits. One of the main challenges in these association studies is to control effects of population structure that may cause spurious associations. Many studies have analyzed how population structure influences statistics of genetic variants and developed several statistical approaches to correct for population structure. However, the impact of population structure on GEI statistics in GWASs has not been extensively studied and nor have there been methods designed to correct for population structure on GEI statistics. In this paper, we show both analytically and empirically that population structure may cause spurious GEIs and use both simulation and two GWAS datasets to support our finding. We propose a statistical approach based on mixed models to account for population structure on GEI statistics. We find that our approach effectively controls population structure on statistics for GEIs as well as for genetic variants.

  16. Gene genealogies for genetic association mapping, with application to Crohn's disease

    PubMed Central

    Burkett, Kelly M.; Greenwood, Celia M. T.; McNeney, Brad; Graham, Jinko

    2013-01-01

    A gene genealogy describes relationships among haplotypes sampled from a population. Knowledge of the gene genealogy for a set of haplotypes is useful for estimation of population genetic parameters and it also has potential application in finding disease-predisposing genetic variants. As the true gene genealogy is unknown, Markov chain Monte Carlo (MCMC) approaches have been used to sample genealogies conditional on data at multiple genetic markers. We previously implemented an MCMC algorithm to sample from an approximation to the distribution of the gene genealogy conditional on haplotype data. Our approach samples ancestral trees, recombination and mutation rates at a genomic focal point. In this work, we describe how our sampler can be used to find disease-predisposing genetic variants in samples of cases and controls. We use a tree-based association statistic that quantifies the degree to which case haplotypes are more closely related to each other around the focal point than control haplotypes, without relying on a disease model. As the ancestral tree is a latent variable, so is the tree-based association statistic. We show how the sampler can be used to estimate the posterior distribution of the latent test statistic and corresponding latent p-values, which together comprise a fuzzy p-value. We illustrate the approach on a publicly-available dataset from a study of Crohn's disease that consists of genotypes at multiple SNP markers in a small genomic region. We estimate the posterior distribution of the tree-based association statistic and the recombination rate at multiple focal points in the region. Reassuringly, the posterior mean recombination rates estimated at the different focal points are consistent with previously published estimates. The tree-based association approach finds multiple sub-regions where the case haplotypes are more genetically related than the control haplotypes, and that there may be one or multiple disease-predisposing loci. PMID:24348515

  17. Using multitype branching processes to quantify statistics of disease outbreaks in zoonotic epidemics

    NASA Astrophysics Data System (ADS)

    Singh, Sarabjeet; Schneider, David J.; Myers, Christopher R.

    2014-03-01

    Branching processes have served as a model for chemical reactions, biological growth processes, and contagion (of disease, information, or fads). Through this connection, these seemingly different physical processes share some common universalities that can be elucidated by analyzing the underlying branching process. In this work we focus on coupled branching processes as a model of infectious diseases spreading from one population to another. An exceedingly important example of such coupled outbreaks are zoonotic infections that spill over from animal populations to humans. We derive several statistical quantities characterizing the first spillover event from animals to humans, including the probability of spillover, the first passage time distribution for human infection, and disease prevalence in the animal population at spillover. Large stochastic fluctuations in those quantities can make inference of the state of the system at the time of spillover difficult. Focusing on outbreaks in the human population, we then characterize the critical threshold for a large outbreak, the distribution of outbreak sizes, and associated scaling laws. These all show a strong dependence on the basic reproduction number in the animal population and indicate the existence of a novel multicritical point with altered scaling behavior. The coupling of animal and human infection dynamics has crucial implications, most importantly allowing for the possibility of large human outbreaks even when human-to-human transmission is subcritical.

  18. The economic consequences of neurosurgical disease in low- and middle-income countries.

    PubMed

    Rudolfson, Niclas; Dewan, Michael C; Park, Kee B; Shrime, Mark G; Meara, John G; Alkire, Blake C

    2018-05-18

    OBJECTIVE The objective of this study was to estimate the economic consequences of neurosurgical disease in low- and middle-income countries (LMICs). METHODS The authors estimated gross domestic product (GDP) losses and the broader welfare losses attributable to 5 neurosurgical disease categories in LMICs using two distinct economic models. The value of lost output (VLO) model projects annual GDP losses due to neurosurgical disease during 2015-2030, and is based on the WHO's "Projecting the Economic Cost of Ill-health" tool. The value of lost economic welfare (VLW) model estimates total welfare losses, which is based on the value of a statistical life and includes nonmarket losses such as the inherent value placed on good health, resulting from neurosurgical disease in 2015 alone. RESULTS The VLO model estimates the selected neurosurgical diseases will result in $4.4 trillion (2013 US dollars, purchasing power parity) in GDP losses during 2015-2030 in the 90 included LMICs. Economic losses are projected to disproportionately affect low- and lower-middle-income countries, risking up to a 0.6% and 0.54% loss of GDP, respectively, in 2030. The VLW model evaluated 127 LMICs, and estimates that these countries experienced $3 trillion (2013 US dollars, purchasing power parity) in economic welfare losses in 2015. Regardless of the model used, the majority of the losses can be attributed to stroke and traumatic brain injury. CONCLUSIONS The economic impact of neurosurgical diseases in LMICs is significant. The magnitude of economic losses due to neurosurgical diseases in LMICs provides further motivation beyond already compelling humanitarian reasons for action.

  19. [Application of statistics on chronic-diseases-relating observational research papers].

    PubMed

    Hong, Zhi-heng; Wang, Ping; Cao, Wei-hua

    2012-09-01

    To study the application of statistics on Chronic-diseases-relating observational research papers which were recently published in the Chinese Medical Association Magazines, with influential index above 0.5. Using a self-developed criterion, two investigators individually participated in assessing the application of statistics on Chinese Medical Association Magazines, with influential index above 0.5. Different opinions reached an agreement through discussion. A total number of 352 papers from 6 magazines, including the Chinese Journal of Epidemiology, Chinese Journal of Oncology, Chinese Journal of Preventive Medicine, Chinese Journal of Cardiology, Chinese Journal of Internal Medicine and Chinese Journal of Endocrinology and Metabolism, were reviewed. The rate of clear statement on the following contents as: research objectives, t target audience, sample issues, objective inclusion criteria and variable definitions were 99.43%, 98.57%, 95.43%, 92.86% and 96.87%. The correct rates of description on quantitative and qualitative data were 90.94% and 91.46%, respectively. The rates on correctly expressing the results, on statistical inference methods related to quantitative, qualitative data and modeling were 100%, 95.32% and 87.19%, respectively. 89.49% of the conclusions could directly response to the research objectives. However, 69.60% of the papers did not mention the exact names of the study design, statistically, that the papers were using. 11.14% of the papers were in lack of further statement on the exclusion criteria. Percentage of the papers that could clearly explain the sample size estimation only taking up as 5.16%. Only 24.21% of the papers clearly described the variable value assignment. Regarding the introduction on statistical conduction and on database methods, the rate was only 24.15%. 18.75% of the papers did not express the statistical inference methods sufficiently. A quarter of the papers did not use 'standardization' appropriately. As for the aspect of statistical inference, the rate of description on statistical testing prerequisite was only 24.12% while 9.94% papers did not even employ the statistical inferential method that should be used. The main deficiencies on the application of Statistics used in papers related to Chronic-diseases-related observational research were as follows: lack of sample-size determination, variable value assignment description not sufficient, methods on statistics were not introduced clearly or properly, lack of consideration for pre-requisition regarding the use of statistical inferences.

  20. The challenging use and interpretation of circulating biomarkers of exposure to persistent organic pollutants in environmental health: Comparison of lipid adjustment approaches in a case study related to endometriosis.

    PubMed

    Cano-Sancho, German; Labrune, Léa; Ploteau, Stéphane; Marchand, Philippe; Le Bizec, Bruno; Antignac, Jean-Philippe

    2018-06-01

    The gold-standard matrix for measuring the internal levels of persistent organic pollutants (POPs) is the adipose tissue, however in epidemiological studies the use of serum is preferred due to the low cost and higher accessibility. The interpretation of serum biomarkers is tightly related to the understanding of the underlying causal structure relating the POPs, serum lipids and the disease. Considering the extended benefits of using serum biomarkers we aimed to further examine if through statistical modelling we would be able to improve the use and interpretation of serum biomarkers in the study of endometriosis. Hence, we have conducted a systematic comparison of statistical approaches commonly used to lipid-adjust the circulating biomarkers of POPs based on existing methods, using data from a pilot case-control study focused on severe deep infiltrating endometriosis. The odds ratios (ORs) obtained from unconditional regression for those models with serum biomarkers were further compared to those obtained from adipose tissue. The results of this exploratory study did not support the use of blood biomarkers as proxy estimates of POPs in adipose tissue to implement in risk models for endometriosis with the available statistical approaches to correct for lipids. The current statistical approaches commonly used to lipid-adjust circulating POPs, do not fully represent the underlying biological complexity between POPs, lipids and disease (especially those directly or indirectly affecting or affected by lipid metabolism). Hence, further investigations are warranted to improve the use and interpretation of blood biomarkers under complex scenarios of lipid dynamics. Copyright © 2018 Elsevier Ltd. All rights reserved.

  1. Impacts of clustering on interacting epidemics.

    PubMed

    Wang, Bing; Cao, Lang; Suzuki, Hideyuki; Aihara, Kazuyuki

    2012-07-07

    Since community structures in real networks play a major role for the epidemic spread, we therefore explore two interacting diseases spreading in networks with community structures. As a network model with community structures, we propose a random clique network model composed of different orders of cliques. We further assume that each disease spreads only through one type of cliques; this assumption corresponds to the issue that two diseases spread inside communities and outside them. Considering the relationship between the susceptible-infected-recovered (SIR) model and the bond percolation theory, we apply this theory to clique random networks under the assumption that the occupation probability is clique-type dependent, which is consistent with the observation that infection rates inside a community and outside it are different, and obtain a number of statistical properties for this model. Two interacting diseases that compete the same hosts are also investigated, which leads to a natural generalization of analyzing an arbitrary number of infectious diseases. For two-disease dynamics, the clustering effect is hypersensitive to the cohesiveness and concentration of cliques; this illustrates the impacts of clustering and the composition of subgraphs in networks on epidemic behavior. The analysis of coexistence/bistability regions provides significant insight into the relationship between the network structure and the potential epidemic prevalence. Copyright © 2012 Elsevier Ltd. All rights reserved.

  2. Modeling Composite Assessment Data Using Item Response Theory

    PubMed Central

    Ueckert, Sebastian

    2018-01-01

    Composite assessments aim to combine different aspects of a disease in a single score and are utilized in a variety of therapeutic areas. The data arising from these evaluations are inherently discrete with distinct statistical properties. This tutorial presents the framework of the item response theory (IRT) for the analysis of this data type in a pharmacometric context. The article considers both conceptual (terms and assumptions) and practical questions (modeling software, data requirements, and model building). PMID:29493119

  3. Illness-death model: statistical perspective and differential equations.

    PubMed

    Brinks, Ralph; Hoyer, Annika

    2018-01-27

    The aim of this work is to relate the theory of stochastic processes with the differential equations associated with multistate (compartment) models. We show that the Kolmogorov Forward Differential Equations can be used to derive a relation between the prevalence and the transition rates in the illness-death model. Then, we prove mathematical well-definedness and epidemiological meaningfulness of the prevalence of the disease. As an application, we derive the incidence of diabetes from a series of cross-sections.

  4. Triple burden of disease and out of pocket healthcare expenditure of women in India

    PubMed Central

    Ladusingh, Laishram; Mohanty, Sanjay Kumar

    2018-01-01

    Aim Women, unlike men, are subjected to triple burden of disease, namely, non-communicable and communicable diseases and reproductive health related diseases. To assess prevalence of triple burden of disease of currently married women and to contrast out of pocket maternal care expenditure of these diseases in India. Subject and methods This study uses nationally representative unit level data from the 71st round (2014) of the National Sample Survey Organisation. Descriptive statistics and bivariate analysis are employed to assess triple burden of diseases by background of currently married women. Mean out of pocket (OOP) expenditure for healthcare care by demographic and household characteristics of women are also compared by type of diseases. Two parts model is adopted for assessment of determents of out of pocket healthcare expenditure of women. Results Overall medical and non-medical expenses of non–communicable disease are much higher than those of other disease and disability, reproductive health related and communicable diseases. OOP expenditure for treatment of non-communicable diseases, reproductive health and related diseases and other disease and disability are significantly higher than the inpatient treatment of communicable diseases and the differences are statistically significant. Conclusion Out of pocket expenditure for treatment of non-communicable diseases is the highest, followed by that of other diseases & disability, then reproductive health related diseases and the least is for communicable diseases. OOP expenditures for maternal healthcare in private health facilities are much higher than in public health facilities regardless of types of disease. Women from households having insurance of any member spent less than that of women from households not having health insurance. There is an urgent need to expand the outreach of the public health system in India to rural areas. PMID:29746506

  5. Teleosts as Model Organisms To Understand Host-Microbe Interactions.

    PubMed

    Lescak, Emily A; Milligan-Myhre, Kathryn C

    2017-08-01

    Host-microbe interactions are influenced by complex host genetics and environment. Studies across animal taxa have aided our understanding of how intestinal microbiota influence vertebrate development, disease, and physiology. However, traditional mammalian studies can be limited by the use of isogenic strains, husbandry constraints that result in small sample sizes and limited statistical power, reliance on indirect characterization of gut microbial communities from fecal samples, and concerns of whether observations in artificial conditions are actually reflective of what occurs in the wild. Fish models are able to overcome many of these limitations. The extensive variation in the physiology, ecology, and natural history of fish enriches studies of the evolution and ecology of host-microbe interactions. They share physiological and immunological features common among vertebrates, including humans, and harbor complex gut microbiota, which allows identification of the mechanisms driving microbial community assembly. Their accelerated life cycles and large clutch sizes and the ease of sampling both internal and external microbial communities make them particularly well suited for robust statistical studies of microbial diversity. Gnotobiotic techniques, genetic manipulation of the microbiota and host, and transparent juveniles enable novel insights into mechanisms underlying development of the digestive tract and disease states. Many diseases involve a complex combination of genes which are difficult to manipulate in homogeneous model organisms. By taking advantage of the natural genetic variation found in wild fish populations, as well as of the availability of powerful genetic tools, future studies should be able to identify conserved genes and pathways that contribute to human genetic diseases characterized by dysbiosis. Copyright © 2017 Lescak and Milligan-Myhre.

  6. Teleosts as Model Organisms To Understand Host-Microbe Interactions

    PubMed Central

    2017-01-01

    ABSTRACT Host-microbe interactions are influenced by complex host genetics and environment. Studies across animal taxa have aided our understanding of how intestinal microbiota influence vertebrate development, disease, and physiology. However, traditional mammalian studies can be limited by the use of isogenic strains, husbandry constraints that result in small sample sizes and limited statistical power, reliance on indirect characterization of gut microbial communities from fecal samples, and concerns of whether observations in artificial conditions are actually reflective of what occurs in the wild. Fish models are able to overcome many of these limitations. The extensive variation in the physiology, ecology, and natural history of fish enriches studies of the evolution and ecology of host-microbe interactions. They share physiological and immunological features common among vertebrates, including humans, and harbor complex gut microbiota, which allows identification of the mechanisms driving microbial community assembly. Their accelerated life cycles and large clutch sizes and the ease of sampling both internal and external microbial communities make them particularly well suited for robust statistical studies of microbial diversity. Gnotobiotic techniques, genetic manipulation of the microbiota and host, and transparent juveniles enable novel insights into mechanisms underlying development of the digestive tract and disease states. Many diseases involve a complex combination of genes which are difficult to manipulate in homogeneous model organisms. By taking advantage of the natural genetic variation found in wild fish populations, as well as of the availability of powerful genetic tools, future studies should be able to identify conserved genes and pathways that contribute to human genetic diseases characterized by dysbiosis. PMID:28439034

  7. EHR-based phenotyping: Bulk learning and evaluation.

    PubMed

    Chiu, Po-Hsiang; Hripcsak, George

    2017-06-01

    In data-driven phenotyping, a core computational task is to identify medical concepts and their variations from sources of electronic health records (EHR) to stratify phenotypic cohorts. A conventional analytic framework for phenotyping largely uses a manual knowledge engineering approach or a supervised learning approach where clinical cases are represented by variables encompassing diagnoses, medicinal treatments and laboratory tests, among others. In such a framework, tasks associated with feature engineering and data annotation remain a tedious and expensive exercise, resulting in poor scalability. In addition, certain clinical conditions, such as those that are rare and acute in nature, may never accumulate sufficient data over time, which poses a challenge to establishing accurate and informative statistical models. In this paper, we use infectious diseases as the domain of study to demonstrate a hierarchical learning method based on ensemble learning that attempts to address these issues through feature abstraction. We use a sparse annotation set to train and evaluate many phenotypes at once, which we call bulk learning. In this batch-phenotyping framework, disease cohort definitions can be learned from within the abstract feature space established by using multiple diseases as a substrate and diagnostic codes as surrogates. In particular, using surrogate labels for model training renders possible its subsequent evaluation using only a sparse annotated sample. Moreover, statistical models can be trained and evaluated, using the same sparse annotation, from within the abstract feature space of low dimensionality that encapsulates the shared clinical traits of these target diseases, collectively referred to as the bulk learning set. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Race and hormone receptor-positive breast cancer outcomes in a randomized chemotherapy trial.

    PubMed

    Sparano, Joseph A; Wang, Molin; Zhao, Fengmin; Stearns, Vered; Martino, Silvana; Ligibel, Jennifer A; Perez, Edith A; Saphner, Tom; Wolff, Antonio C; Sledge, George W; Wood, William C; Davidson, Nancy E

    2012-03-07

    The association between black race and worse outcomes in operable breast cancer reported in previous studies has been attributed to a higher incidence of more aggressive triple-negative disease, disparities in care, and comorbidities. We evaluated associations between black race and outcomes, by tumor hormone receptor and HER2 expression, in patients who were treated with contemporary adjuvant therapy. The effect of black race on disease-free and overall survival was evaluated using Cox proportional hazards models adjusted for multiple covariates in a clinical trial population that was treated with anthracycline- and taxane-containing chemotherapy. Categorical variables were compared using the Fisher exact test. All P values are two-sided. Of 4817 eligible patients, 405 (8.4%) were black. Compared with nonblack patients, black patients had a higher rate of triple-negative disease (31.9% vs 17.2%; P < .001) and a higher body mass index (median: 31.7 vs 27.4 kg/m(2); P < .001). Black race was statistically significantly associated with worse disease-free survival (5-year disease-free survival, black vs nonblack: 76.7% vs 84.5%; hazard ratio of recurrence or death = 1.58, 95% confidence interval = 1.19 to 2.10, P = .0015) and overall survival (5-year overall survival, black vs nonblack: 87.6% vs 91.9%; hazard ratio of death = 1.49, 95% confidence interval = 1.05 to 2.12, P = .025) in patients with hormone receptor-positive HER2-negative disease but not in patients with triple-negative or HER2-positive disease. In a model that included black race, hormone receptor-positive HER2-negative disease vs other subtypes, and their interaction, the interaction term was statistically significant for disease-free survival (P = .027) but not for overall survival (P = .086). Factors other than disparities in care or aggressive disease contribute to increased recurrence in black women with hormone receptor-positive breast cancer.

  9. Effect of social support on informed consent in older adults with Parkinson disease and their caregivers.

    PubMed

    Ford, M E; Kallen, M; Richardson, P; Matthiesen, E; Cox, V; Teng, E J; Cook, K F; Petersen, N J

    2008-01-01

    To evaluate the effects of social support on comprehension and recall of consent form information in a study of Parkinson disease patients and their caregivers. Comparison of comprehension and recall outcomes among participants who read and signed the consent form accompanied by a family member/friend versus those of participants who read and signed the consent form unaccompanied. Comprehension and recall of consent form information were measured at one week and one month respectively, using Part A of the Quality of Informed Consent Questionnaire (QuIC). The mean age of the sample of 143 participants was 71 years (SD = 8.6 years). Analysis of covariance was used to compare QuIC scores between the intervention group (n = 70) and control group (n = 73). In the 1-week model, no statistically significant intervention effect was found (p = 0.860). However, the intervention status by patient status interaction was statistically significant (p = 0.012). In the 1-month model, no statistically significant intervention effect was found (p = 0.480). Again, however, the intervention status by patient status interaction was statistically significant (p = 0.040). At both time periods, intervention group patients scored higher (better) on the QuIC than did intervention group caregivers, and control group patients scored lower (worse) on the QuIC than did control group caregivers. Social support played a significant role in enhancing comprehension and recall of consent form information among patients.

  10. A Powerful Approach to Estimating Annotation-Stratified Genetic Covariance via GWAS Summary Statistics.

    PubMed

    Lu, Qiongshi; Li, Boyang; Ou, Derek; Erlendsdottir, Margret; Powles, Ryan L; Jiang, Tony; Hu, Yiming; Chang, David; Jin, Chentian; Dai, Wei; He, Qidu; Liu, Zefeng; Mukherjee, Shubhabrata; Crane, Paul K; Zhao, Hongyu

    2017-12-07

    Despite the success of large-scale genome-wide association studies (GWASs) on complex traits, our understanding of their genetic architecture is far from complete. Jointly modeling multiple traits' genetic profiles has provided insights into the shared genetic basis of many complex traits. However, large-scale inference sets a high bar for both statistical power and biological interpretability. Here we introduce a principled framework to estimate annotation-stratified genetic covariance between traits using GWAS summary statistics. Through theoretical and numerical analyses, we demonstrate that our method provides accurate covariance estimates, thereby enabling researchers to dissect both the shared and distinct genetic architecture across traits to better understand their etiologies. Among 50 complex traits with publicly accessible GWAS summary statistics (N total ≈ 4.5 million), we identified more than 170 pairs with statistically significant genetic covariance. In particular, we found strong genetic covariance between late-onset Alzheimer disease (LOAD) and amyotrophic lateral sclerosis (ALS), two major neurodegenerative diseases, in single-nucleotide polymorphisms (SNPs) with high minor allele frequencies and in SNPs located in the predicted functional genome. Joint analysis of LOAD, ALS, and other traits highlights LOAD's correlation with cognitive traits and hints at an autoimmune component for ALS. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  11. Stargardt disease: towards developing a model to predict phenotype.

    PubMed

    Heathfield, Laura; Lacerda, Miguel; Nossek, Christel; Roberts, Lisa; Ramesar, Rajkumar S

    2013-10-01

    Stargardt disease is an ABCA4-associated retinopathy, which generally follows an autosomal recessive inheritance pattern and is a frequent cause of macular degeneration in childhood. ABCA4 displays significant allelic heterogeneity whereby different mutations can cause retinal diseases with varying severity and age of onset. A genotype-phenotype model has been proposed linking ABCA4 mutations, purported ABCA4 functional protein activity and severity of disease, as measured by degree of visual loss and the age of onset. It has, however, been difficult to verify this model statistically in observational studies, as the number of individuals sharing any particular mutation combination is typically low. Seven founder mutations have been identified in a large number of Caucasian Afrikaner patients in South Africa, making it possible to test the genotype-phenotype model. A generalised linear model was developed to predict and assess the relative pathogenic contribution of the seven mutations to the age of onset of Stargardt disease. It is shown that the pathogenicity of an individual mutation can differ significantly depending on the genetic context in which it occurs. The results reported here may be used to identify suitable candidates for inclusion in clinical trials, as well as guide the genetic counselling of affected individuals and families.

  12. Stargardt Disease: towards developing a model to predict phenotype

    PubMed Central

    Heathfield, Laura; Lacerda, Miguel; Nossek, Christel; Roberts, Lisa; Ramesar, Rajkumar S

    2013-01-01

    Stargardt disease is an ABCA4-associated retinopathy, which generally follows an autosomal recessive inheritance pattern and is a frequent cause of macular degeneration in childhood. ABCA4 displays significant allelic heterogeneity whereby different mutations can cause retinal diseases with varying severity and age of onset. A genotype–phenotype model has been proposed linking ABCA4 mutations, purported ABCA4 functional protein activity and severity of disease, as measured by degree of visual loss and the age of onset. It has, however, been difficult to verify this model statistically in observational studies, as the number of individuals sharing any particular mutation combination is typically low. Seven founder mutations have been identified in a large number of Caucasian Afrikaner patients in South Africa, making it possible to test the genotype–phenotype model. A generalised linear model was developed to predict and assess the relative pathogenic contribution of the seven mutations to the age of onset of Stargardt disease. It is shown that the pathogenicity of an individual mutation can differ significantly depending on the genetic context in which it occurs. The results reported here may be used to identify suitable candidates for inclusion in clinical trials, as well as guide the genetic counselling of affected individuals and families. PMID:23695285

  13. Validation of Statistical Models for Estimating Hospitalization Associated with Influenza and Other Respiratory Viruses

    PubMed Central

    Chan, King-Pan; Chan, Kwok-Hung; Wong, Wilfred Hing-Sang; Peiris, J. S. Malik; Wong, Chit-Ming

    2011-01-01

    Background Reliable estimates of disease burden associated with respiratory viruses are keys to deployment of preventive strategies such as vaccination and resource allocation. Such estimates are particularly needed in tropical and subtropical regions where some methods commonly used in temperate regions are not applicable. While a number of alternative approaches to assess the influenza associated disease burden have been recently reported, none of these models have been validated with virologically confirmed data. Even fewer methods have been developed for other common respiratory viruses such as respiratory syncytial virus (RSV), parainfluenza and adenovirus. Methods and Findings We had recently conducted a prospective population-based study of virologically confirmed hospitalization for acute respiratory illnesses in persons <18 years residing in Hong Kong Island. Here we used this dataset to validate two commonly used models for estimation of influenza disease burden, namely the rate difference model and Poisson regression model, and also explored the applicability of these models to estimate the disease burden of other respiratory viruses. The Poisson regression models with different link functions all yielded estimates well correlated with the virologically confirmed influenza associated hospitalization, especially in children older than two years. The disease burden estimates for RSV, parainfluenza and adenovirus were less reliable with wide confidence intervals. The rate difference model was not applicable to RSV, parainfluenza and adenovirus and grossly underestimated the true burden of influenza associated hospitalization. Conclusion The Poisson regression model generally produced satisfactory estimates in calculating the disease burden of respiratory viruses in a subtropical region such as Hong Kong. PMID:21412433

  14. Use of neural networks to model complex immunogenetic associations of disease: human leukocyte antigen impact on the progression of human immunodeficiency virus infection.

    PubMed

    Ioannidis, J P; McQueen, P G; Goedert, J J; Kaslow, R A

    1998-03-01

    Complex immunogenetic associations of disease involving a large number of gene products are difficult to evaluate with traditional statistical methods and may require complex modeling. The authors evaluated the performance of feed-forward backpropagation neural networks in predicting rapid progression to acquired immunodeficiency syndrome (AIDS) for patients with human immunodeficiency virus (HIV) infection on the basis of major histocompatibility complex variables. Networks were trained on data from patients from the Multicenter AIDS Cohort Study (n = 139) and then validated on patients from the DC Gay cohort (n = 102). The outcome of interest was rapid disease progression, defined as progression to AIDS in <6 years from seroconversion. Human leukocyte antigen (HLA) variables were selected as network inputs with multivariate regression and a previously described algorithm selecting markers with extreme point estimates for progression risk. Network performance was compared with that of logistic regression. Networks with 15 HLA inputs and a single hidden layer of five nodes achieved a sensitivity of 87.5% and specificity of 95.6% in the training set, vs. 77.0% and 76.9%, respectively, achieved by logistic regression. When validated on the DC Gay cohort, networks averaged a sensitivity of 59.1% and specificity of 74.3%, vs. 53.1% and 61.4%, respectively, for logistic regression. Neural networks offer further support to the notion that HIV disease progression may be dependent on complex interactions between different class I and class II alleles and transporters associated with antigen processing variants. The effect in the current models is of moderate magnitude, and more data as well as other host and pathogen variables may need to be considered to improve the performance of the models. Artificial intelligence methods may complement linear statistical methods for evaluating immunogenetic associations of disease.

  15. Fitting mechanistic epidemic models to data: A comparison of simple Markov chain Monte Carlo approaches.

    PubMed

    Li, Michael; Dushoff, Jonathan; Bolker, Benjamin M

    2018-07-01

    Simple mechanistic epidemic models are widely used for forecasting and parameter estimation of infectious diseases based on noisy case reporting data. Despite the widespread application of models to emerging infectious diseases, we know little about the comparative performance of standard computational-statistical frameworks in these contexts. Here we build a simple stochastic, discrete-time, discrete-state epidemic model with both process and observation error and use it to characterize the effectiveness of different flavours of Bayesian Markov chain Monte Carlo (MCMC) techniques. We use fits to simulated data, where parameters (and future behaviour) are known, to explore the limitations of different platforms and quantify parameter estimation accuracy, forecasting accuracy, and computational efficiency across combinations of modeling decisions (e.g. discrete vs. continuous latent states, levels of stochasticity) and computational platforms (JAGS, NIMBLE, Stan).

  16. Nonlinear dynamic mechanism of vocal tremor from voice analysis and model simulations

    NASA Astrophysics Data System (ADS)

    Zhang, Yu; Jiang, Jack J.

    2008-09-01

    Nonlinear dynamic analysis and model simulations are used to study the nonlinear dynamic characteristics of vocal folds with vocal tremor, which can typically be characterized by low-frequency modulation and aperiodicity. Tremor voices from patients with disorders such as paresis, Parkinson's disease, hyperfunction, and adductor spasmodic dysphonia show low-dimensional characteristics, differing from random noise. Correlation dimension analysis statistically distinguishes tremor voices from normal voices. Furthermore, a nonlinear tremor model is proposed to study the vibrations of the vocal folds with vocal tremor. Fractal dimensions and positive Lyapunov exponents demonstrate the evidence of chaos in the tremor model, where amplitude and frequency play important roles in governing vocal fold dynamics. Nonlinear dynamic voice analysis and vocal fold modeling may provide a useful set of tools for understanding the dynamic mechanism of vocal tremor in patients with laryngeal diseases.

  17. Dose-responses for mortality from cerebrovascular and heart diseases in atomic bomb survivors: 1950-2003.

    PubMed

    Schöllnberger, Helmut; Eidemüller, Markus; Cullings, Harry M; Simonetto, Cristoforo; Neff, Frauke; Kaiser, Jan Christian

    2018-03-01

    The scientific community faces important discussions on the validity of the linear no-threshold (LNT) model for radiation-associated cardiovascular diseases at low and moderate doses. In the present study, mortalities from cerebrovascular diseases (CeVD) and heart diseases from the latest data on atomic bomb survivors were analyzed. The analysis was performed with several radio-biologically motivated linear and nonlinear dose-response models. For each detrimental health outcome one set of models was identified that all fitted the data about equally well. This set was used for multi-model inference (MMI), a statistical method of superposing different models to allow risk estimates to be based on several plausible dose-response models rather than just relying on a single model of choice. MMI provides a more accurate determination of the dose response and a more comprehensive characterization of uncertainties. It was found that for CeVD, the dose-response curve from MMI is located below the linear no-threshold model at low and medium doses (0-1.4 Gy). At higher doses MMI predicts a higher risk compared to the LNT model. A sublinear dose-response was also found for heart diseases (0-3 Gy). The analyses provide no conclusive answer to the question whether there is a radiation risk below 0.75 Gy for CeVD and 2.6 Gy for heart diseases. MMI suggests that the dose-response curves for CeVD and heart diseases in the Lifespan Study are sublinear at low and moderate doses. This has relevance for radiotherapy treatment planning and for international radiation protection practices in general.

  18. Improved Statistics for Genome-Wide Interaction Analysis

    PubMed Central

    Ueki, Masao; Cordell, Heather J.

    2012-01-01

    Recently, Wu and colleagues [1] proposed two novel statistics for genome-wide interaction analysis using case/control or case-only data. In computer simulations, their proposed case/control statistic outperformed competing approaches, including the fast-epistasis option in PLINK and logistic regression analysis under the correct model; however, reasons for its superior performance were not fully explored. Here we investigate the theoretical properties and performance of Wu et al.'s proposed statistics and explain why, in some circumstances, they outperform competing approaches. Unfortunately, we find minor errors in the formulae for their statistics, resulting in tests that have higher than nominal type 1 error. We also find minor errors in PLINK's fast-epistasis and case-only statistics, although theory and simulations suggest that these errors have only negligible effect on type 1 error. We propose adjusted versions of all four statistics that, both theoretically and in computer simulations, maintain correct type 1 error rates under the null hypothesis. We also investigate statistics based on correlation coefficients that maintain similar control of type 1 error. Although designed to test specifically for interaction, we show that some of these previously-proposed statistics can, in fact, be sensitive to main effects at one or both loci, particularly in the presence of linkage disequilibrium. We propose two new “joint effects” statistics that, provided the disease is rare, are sensitive only to genuine interaction effects. In computer simulations we find, in most situations considered, that highest power is achieved by analysis under the correct genetic model. Such an analysis is unachievable in practice, as we do not know this model. However, generally high power over a wide range of scenarios is exhibited by our joint effects and adjusted Wu statistics. We recommend use of these alternative or adjusted statistics and urge caution when using Wu et al.'s originally-proposed statistics, on account of the inflated error rate that can result. PMID:22496670

  19. Surveillance of Arthropod Vector-Borne Infectious Diseases Using Remote Sensing Techniques: A Review

    PubMed Central

    Kalluri, Satya; Gilruth, Peter; Rogers, David; Szczur, Martha

    2007-01-01

    Epidemiologists are adopting new remote sensing techniques to study a variety of vector-borne diseases. Associations between satellite-derived environmental variables such as temperature, humidity, and land cover type and vector density are used to identify and characterize vector habitats. The convergence of factors such as the availability of multi-temporal satellite data and georeferenced epidemiological data, collaboration between remote sensing scientists and biologists, and the availability of sophisticated, statistical geographic information system and image processing algorithms in a desktop environment creates a fertile research environment. The use of remote sensing techniques to map vector-borne diseases has evolved significantly over the past 25 years. In this paper, we review the status of remote sensing studies of arthropod vector-borne diseases due to mosquitoes, ticks, blackflies, tsetse flies, and sandflies, which are responsible for the majority of vector-borne diseases in the world. Examples of simple image classification techniques that associate land use and land cover types with vector habitats, as well as complex statistical models that link satellite-derived multi-temporal meteorological observations with vector biology and abundance, are discussed here. Future improvements in remote sensing applications in epidemiology are also discussed. PMID:17967056

  20. Public Opinions Toward Diseases: Infodemiological Study on News Media Data

    PubMed Central

    ElTayeby, Omar; Zolnoori, Maryam

    2018-01-01

    Background Society always has limited resources to expend on health care, or anything else. What are the unmet medical needs? How do we allocate limited resources to maximize the health and welfare of the people? These challenging questions might be re-examined systematically within an infodemiological frame on a much larger scale, leveraging the latest advancement in information technology and data science. Objective We expanded our previous work by investigating news media data to reveal the coverage of different diseases and medical conditions, together with their sentiments and topics in news articles over three decades. We were motivated to do so since news media plays a significant role in politics and affects the public policy making. Methods We analyzed over 3.5 million archive news articles from Reuters media during the periods of 1996/1997, 2008 and 2016, using summary statistics, sentiment analysis, and topic modeling. Summary statistics illustrated the coverage of various diseases and medical conditions during the last 3 decades. Sentiment analysis and topic modeling helped us automatically detect the sentiments of news articles (ie, positive versus negative) and topics (ie, a series of keywords) associated with each disease over time. Results The percentages of news articles mentioning diseases and medical conditions were 0.44%, 0.57% and 0.81% in the three time periods, suggesting that news media or the public has gradually increased its interests in medicine since 1996. Certain diseases such as other malignant neoplasm (34%), other infectious diseases (20%), and influenza (11%) represented the most covered diseases. Two hundred and twenty-six diseases and medical conditions (97.8%) were found to have neutral or negative sentiments in the news articles. Using topic modeling, we identified meaningful topics on these diseases and medical conditions. For instance, the smoking theme appeared in the news articles on other malignant neoplasm only during 1996/1997. The topic phrases HIV and Zika virus were linked to other infectious diseases during 1996/1997 and 2016, respectively. Conclusions The multi-dimensional analysis of news media data allows the discovery of focus, sentiments and topics of news media in terms of diseases and medical conditions. These infodemiological discoveries could shed light on unmet medical needs and research priorities for future and provide guidance for the decision making in public policy. PMID:29739741

  1. Stochastic modelling of direct costs of pancreas disease (PD) in Norwegian farmed Atlantic salmon (Salmo salar L.).

    PubMed

    Aunsmo, Arnfinn; Valle, Paul Steinar; Sandberg, Marianne; Midtlyng, Paul Johan; Bruheim, Torkjel

    2010-02-01

    An economic model for estimating the direct costs of disease in industrial aquaculture was developed to include the following areas: biological losses, extraordinary costs, costs of treatment, costs of prevention and insurance pay-out. Direct costs of a pancreas disease (PD) outbreak in Norwegian farmed Atlantic salmon were estimated in the model, using probability distributions for the biological losses and expenditures associated with the disease. The biological effects of PD on mortality, growth, feed conversion and carcass quality and their correlations, together with costs of prevention were established using elicited data from an expert panel, and combined with basal losses in a control model. Extraordinary costs and costs associated with treatment were collected through a questionnaire sent to staff managing disease outbreaks. Norwegian national statistics for 2007 were used for prices and production costs in the model. Direct costs associated with a PD-outbreak in a site stocked with 500,000 smolts (vs. a similar site without the disease) were estimated to NOK (Norwegian kroner) 14.4 million (5% and 95% percentile: 10.5 and 17.8) (NOK=euro0.12 or $0.17 for 2007). Production was reduced to 70% (5% and 95% percentile: 57% and 81%) saleable biomass, and at an increased production cost of NOK 6.0 per kg (5% and 95% percentile: 3.5 and 8.7). Copyright 2009 Elsevier B.V. All rights reserved.

  2. Subject-enabled analytics model on measurement statistics in health risk expert system for public health informatics.

    PubMed

    Chung, Chi-Jung; Kuo, Yu-Chen; Hsieh, Yun-Yu; Li, Tsai-Chung; Lin, Cheng-Chieh; Liang, Wen-Miin; Liao, Li-Na; Li, Chia-Ing; Lin, Hsueh-Chun

    2017-11-01

    This study applied open source technology to establish a subject-enabled analytics model that can enhance measurement statistics of case studies with the public health data in cloud computing. The infrastructure of the proposed model comprises three domains: 1) the health measurement data warehouse (HMDW) for the case study repository, 2) the self-developed modules of online health risk information statistics (HRIStat) for cloud computing, and 3) the prototype of a Web-based process automation system in statistics (PASIS) for the health risk assessment of case studies with subject-enabled evaluation. The system design employed freeware including Java applications, MySQL, and R packages to drive a health risk expert system (HRES). In the design, the HRIStat modules enforce the typical analytics methods for biomedical statistics, and the PASIS interfaces enable process automation of the HRES for cloud computing. The Web-based model supports both modes, step-by-step analysis and auto-computing process, respectively for preliminary evaluation and real time computation. The proposed model was evaluated by computing prior researches in relation to the epidemiological measurement of diseases that were caused by either heavy metal exposures in the environment or clinical complications in hospital. The simulation validity was approved by the commercial statistics software. The model was installed in a stand-alone computer and in a cloud-server workstation to verify computing performance for a data amount of more than 230K sets. Both setups reached efficiency of about 10 5 sets per second. The Web-based PASIS interface can be used for cloud computing, and the HRIStat module can be flexibly expanded with advanced subjects for measurement statistics. The analytics procedure of the HRES prototype is capable of providing assessment criteria prior to estimating the potential risk to public health. Copyright © 2017 Elsevier B.V. All rights reserved.

  3. A phylogenetic transform enhances analysis of compositional microbiota data.

    PubMed

    Silverman, Justin D; Washburne, Alex D; Mukherjee, Sayan; David, Lawrence A

    2017-02-15

    Surveys of microbial communities (microbiota), typically measured as relative abundance of species, have illustrated the importance of these communities in human health and disease. Yet, statistical artifacts commonly plague the analysis of relative abundance data. Here, we introduce the PhILR transform, which incorporates microbial evolutionary models with the isometric log-ratio transform to allow off-the-shelf statistical tools to be safely applied to microbiota surveys. We demonstrate that analyses of community-level structure can be applied to PhILR transformed data with performance on benchmarks rivaling or surpassing standard tools. Additionally, by decomposing distance in the PhILR transformed space, we identified neighboring clades that may have adapted to distinct human body sites. Decomposing variance revealed that covariation of bacterial clades within human body sites increases with phylogenetic relatedness. Together, these findings illustrate how the PhILR transform combines statistical and phylogenetic models to overcome compositional data challenges and enable evolutionary insights relevant to microbial communities.

  4. Seasonality Impact on the Transmission Dynamics of Tuberculosis

    PubMed Central

    2016-01-01

    The statistical data of monthly pulmonary tuberculosis (TB) incidence cases from January 2004 to December 2012 show the seasonality fluctuations in Shaanxi of China. A seasonality TB epidemic model with periodic varying contact rate, reactivation rate, and disease-induced death rate is proposed to explore the impact of seasonality on the transmission dynamics of TB. Simulations show that the basic reproduction number of time-averaged autonomous systems may underestimate or overestimate infection risks in some cases, which may be up to the value of period. The basic reproduction number of the seasonality model is appropriately given, which determines the extinction and uniform persistence of TB disease. If it is less than one, then the disease-free equilibrium is globally asymptotically stable; if it is greater than one, the system at least has a positive periodic solution and the disease will persist. Moreover, numerical simulations demonstrate these theorem results. PMID:27042199

  5. Risk assessment model for development of advanced age-related macular degeneration.

    PubMed

    Klein, Michael L; Francis, Peter J; Ferris, Frederick L; Hamon, Sara C; Clemons, Traci E

    2011-12-01

    To design a risk assessment model for development of advanced age-related macular degeneration (AMD) incorporating phenotypic, demographic, environmental, and genetic risk factors. We evaluated longitudinal data from 2846 participants in the Age-Related Eye Disease Study. At baseline, these individuals had all levels of AMD, ranging from none to unilateral advanced AMD (neovascular or geographic atrophy). Follow-up averaged 9.3 years. We performed a Cox proportional hazards analysis with demographic, environmental, phenotypic, and genetic covariates and constructed a risk assessment model for development of advanced AMD. Performance of the model was evaluated using the C statistic and the Brier score and externally validated in participants in the Complications of Age-Related Macular Degeneration Prevention Trial. The final model included the following independent variables: age, smoking history, family history of AMD (first-degree member), phenotype based on a modified Age-Related Eye Disease Study simple scale score, and genetic variants CFH Y402H and ARMS2 A69S. The model did well on performance measures, with very good discrimination (C statistic = 0.872) and excellent calibration and overall performance (Brier score at 5 years = 0.08). Successful external validation was performed, and a risk assessment tool was designed for use with or without the genetic component. We constructed a risk assessment model for development of advanced AMD. The model performed well on measures of discrimination, calibration, and overall performance and was successfully externally validated. This risk assessment tool is available for online use.

  6. Fluctuations in epidemic modeling - disease extinction and control

    NASA Astrophysics Data System (ADS)

    Schwartz, Ira

    2009-03-01

    The analysis of infectious disease fluctuations has recently seen an increasing rise in the use of new tools and models from stochastic dynamics and statistical physics. Examples arise in modeling fluctuations of multi-strain diseases, in modeling adaptive social behavior and its impact on disease fluctuations, and in the analysis of disease extinction in finite population models. Proper stochastic model reduction [1] allows one to predict unobserved fluctuations from observed data in multi-strain models [2]. Degree alteration and power law behavior is predicted in adaptive network epidemic models [3,4]. And extinction rates derived from large fluctuation theory exhibit scaling with respect to distance to the bifurcation point of disease onset with an unusual exponent [5]. In addition to outbreak prediction, another main goal of epidemic modeling is one of eliminating the disease to extinction through various control mechanisms, such as vaccine implementation or quarantine. In this talk, a description will be presented of the fluctuational behavior of several epidemic models and their extinction rates. A general framework and analysis of the effect of non-Gaussian control actuations which enhance the rate to disease extinction will be described. In particular, in it is shown that even in the presence of a small Poisson distributed vaccination program, there is an exponentially enhanced rate to disease extinction. These ideas may lead to improved methods of controlling disease where random vaccinations are prevalent. [4pt] Recent papers:[0pt] [1] E. Forgoston and I. B. Schwartz, ``Escape Rates in a Stochastic Environment with Multiple Scales,'' arXiv:0809.1345 2008.[0pt] [2] L. B. Shaw, L. Billings, I. B. Schwartz, ``Using dimension reduction to improve outbreak predictability of multi-strain diseases,'' J. Math. Bio. 55, 1 2007.[0pt] [3] L. B. Shaw and I. B. Schwartz, ``Fluctuating epidemics on adaptive networks,'' Physical Review E 77, 066101 2008.[0pt] [4] L. B. Shaw and I. B. Schwartz, ``Noise induced dynamics in adaptivenetworks with applications to epidemiology,'' arXiv:0807.3455 2008.[0pt] [5] M. I. Dykman, I. B. Schwartz, A. S. Landsman, ``Disease Extinction in the Presence of Random Vaccination,'' Phys. Rev. Letts. 101, 078101 2008.

  7. An epidemiological modeling and data integration framework.

    PubMed

    Pfeifer, B; Wurz, M; Hanser, F; Seger, M; Netzer, M; Osl, M; Modre-Osprian, R; Schreier, G; Baumgartner, C

    2010-01-01

    In this work, a cellular automaton software package for simulating different infectious diseases, storing the simulation results in a data warehouse system and analyzing the obtained results to generate prediction models as well as contingency plans, is proposed. The Brisbane H3N2 flu virus, which has been spreading during the winter season 2009, was used for simulation in the federal state of Tyrol, Austria. The simulation-modeling framework consists of an underlying cellular automaton. The cellular automaton model is parameterized by known disease parameters and geographical as well as demographical conditions are included for simulating the spreading. The data generated by simulation are stored in the back room of the data warehouse using the Talend Open Studio software package, and subsequent statistical and data mining tasks are performed using the tool, termed Knowledge Discovery in Database Designer (KD3). The obtained simulation results were used for generating prediction models for all nine federal states of Austria. The proposed framework provides a powerful and easy to handle interface for parameterizing and simulating different infectious diseases in order to generate prediction models and improve contingency plans for future events.

  8. Current Risk Adjustment and Comorbidity Index Underperformance in Predicting Post-Acute Utilization and Hospital Readmissions After Joint Replacements: Implications for Comprehensive Care for Joint Replacement Model.

    PubMed

    Kumar, Amit; Karmarkar, Amol; Downer, Brian; Vashist, Amit; Adhikari, Deepak; Al Snih, Soham; Ottenbacher, Kenneth

    2017-11-01

    To compare the performances of 3 comorbidity indices, the Charlson Comorbidity Index, the Elixhauser Comorbidity Index, and the Centers for Medicare & Medicaid Services (CMS) risk adjustment model, Hierarchical Condition Category (HCC), in predicting post-acute discharge settings and hospital readmission for patients after joint replacement. A retrospective study of Medicare beneficiaries with total knee replacement (TKR) or total hip replacement (THR) discharged from hospitals in 2009-2011 (n = 607,349) was performed. Study outcomes were post-acute discharge setting and unplanned 30-, 60-, and 90-day hospital readmissions. Logistic regression models were built to compare the performance of the 3 comorbidity indices using C statistics. The base model included patient demographics and hospital use. Subsequent models included 1 of the 3 comorbidity indices. Additional multivariable logistic regression models were built to identify individual comorbid conditions associated with high risk of hospital readmissions. The 30-, 60-, and 90-day unplanned hospital readmission rates were 5.3%, 7.2%, and 8.5%, respectively. Patients were most frequently discharged to home health (46.3%), followed by skilled nursing facility (40.9%) and inpatient rehabilitation facility (12.7%). The C statistics for the base model in predicting post-acute discharge setting and 30-, 60-, and 90-day readmission in TKR and THR were between 0.63 and 0.67. Adding the Charlson Comorbidity Index, the Elixhauser Comorbidity Index, or HCC increased the C statistic minimally from the base model for predicting both discharge settings and hospital readmission. The health conditions most frequently associated with hospital readmission were diabetes mellitus, pulmonary disease, arrhythmias, and heart disease. The comorbidity indices and CMS-HCC demonstrated weak discriminatory ability to predict post-acute discharge settings and hospital readmission following joint replacement. © 2017, American College of Rheumatology.

  9. Branching processes in disease epidemics

    NASA Astrophysics Data System (ADS)

    Singh, Sarabjeet

    Branching processes have served as a model for chemical reactions, biological growth processes and contagion (of disease, information or fads). Through this connection, these seemingly different physical processes share some common universalities that can be elucidated by analyzing the underlying branching process. In this thesis, we focus on branching processes as a model for infectious diseases spreading between individuals belonging to different populations. The distinction between populations can arise from species separation (as in the case of diseases which jump across species) or spatial separation (as in the case of disease spreading between farms, cities, urban centers, etc). A prominent example of the former is zoonoses -- infectious diseases that spill from animals to humans -- whose specific examples include Nipah virus, monkeypox, HIV and avian influenza. A prominent example of the latter is infectious diseases of animals such as foot and mouth disease and bovine tuberculosis that spread between farms or cattle herds. Another example of the latter is infectious diseases of humans such as H1N1 that spread from one city to another through migration of infectious hosts. This thesis consists of three main chapters, an introduction and an appendix. The introduction gives a brief history of mathematics in modeling the spread of infectious diseases along with a detailed description of the most commonly used disease model -- the Susceptible-Infectious-Recovered (SIR) model. The introduction also describes how the stochastic formulation of the model reduces to a branching process in the limit of large population which is analyzed in detail. The second chapter describes a two species model of zoonoses with coupled SIR processes and proceeds into the calculation of statistics pertinent to cross species infection using multitype branching processes. The third chapter describes an SIR process driven by a Poisson process of infection spillovers. This is posed as a model of infectious diseases where a `reservoir' of infection exists that infects a susceptible host population at a constant rate. The final chapter of the thesis describes a general framework of modeling infectious diseases in a network of populations using multitype branching processes.

  10. Evaluation of chronic disease management on outcomes and cost of care for Medicaid beneficiaries.

    PubMed

    Zhang, Ning Jackie; Wan, Thomas T H; Rossiter, Louis F; Murawski, Matthew M; Patel, Urvashi B

    2008-05-01

    To evaluate the impacts of the chronic disease management program on the outcomes and cost of care for Virginia Medicaid beneficiaries. A total of 35,628 patients and their physicians and pharmacists received interventions for five chronic diseases and comorbidities from 1999 to 2001. Comparisons of medical utilization and clinical outcomes between experimental groups and control group were conducted using ANOVA and ANCOVA analyses. Findings indicate that the disease state management (DSM) program statistically significantly improved patient's drug compliance and quality of life while reducing (ER), hospital, and physician office visits and adverse events. The average cost per hospitalization would have been $42 higher without the interventions. A coordinated disease management program designed for Medicaid patients experiencing significant chronic diseases can substantially improve clinical outcomes and reduce unnecessary medical utilization, while lowering costs, although these results were not observed across all disease groups. The DSM model may be potentially useful for Medicaid programs in states or other countries. If the adoption of the DSM model is to be promoted, evidence of its effectiveness should be tested in broader settings and best practice standards are expected.

  11. Complex network theory for the identification and assessment of candidate protein targets.

    PubMed

    McGarry, Ken; McDonald, Sharon

    2018-06-01

    In this work we use complex network theory to provide a statistical model of the connectivity patterns of human proteins and their interaction partners. Our intention is to identify important proteins that may be predisposed to be potential candidates as drug targets for therapeutic interventions. Target proteins usually have more interaction partners than non-target proteins, but there are no hard-and-fast rules for defining the actual number of interactions. We devise a statistical measure for identifying hub proteins, we score our target proteins with gene ontology annotations. The important druggable protein targets are likely to have similar biological functions that can be assessed for their potential therapeutic value. Our system provides a statistical analysis of the local and distant neighborhood protein interactions of the potential targets using complex network measures. This approach builds a more accurate model of drug-to-target activity and therefore the likely impact on treating diseases. We integrate high quality protein interaction data from the HINT database and disease associated proteins from the DrugTarget database. Other sources include biological knowledge from Gene Ontology and drug information from DrugBank. The problem is a very challenging one since the data is highly imbalanced between target proteins and the more numerous nontargets. We use undersampling on the training data and build Random Forest classifier models which are used to identify previously unclassified target proteins. We validate and corroborate these findings from the available literature. Copyright © 2018 Elsevier Ltd. All rights reserved.

  12. Interval Timing Accuracy and Scalar Timing in C57BL/6 Mice

    PubMed Central

    Buhusi, Catalin V.; Aziz, Dyana; Winslow, David; Carter, Rickey E.; Swearingen, Joshua E.; Buhusi, Mona C.

    2010-01-01

    In many species, interval timing behavior is accurate—appropriate estimated durations—and scalar—errors vary linearly with estimated durations. While accuracy has been previously examined, scalar timing has not been yet clearly demonstrated in house mice (Mus musculus), raising concerns about mouse models of human disease. We estimated timing accuracy and precision in C57BL/6 mice, the most used background strain for genetic models of human disease, in a peak-interval procedure with multiple intervals. Both when timing two intervals (Experiment 1) or three intervals (Experiment 2), C57BL/6 mice demonstrated varying degrees of timing accuracy. Importantly, both at individual and group level, their precision varied linearly with the subjective estimated duration. Further evidence for scalar timing was obtained using an intraclass correlation statistic. This is the first report of consistent, reliable scalar timing in a sizable sample of house mice, thus validating the PI procedure as a valuable technique, the intraclass correlation statistic as a powerful test of the scalar property, and the C57BL/6 strain as a suitable background for behavioral investigations of genetically engineered mice modeling disorders of interval timing. PMID:19824777

  13. Irradiation-hyperthermia in canine hemangiopericytomas: large-animal model for therapeutic response.

    PubMed

    Richardson, R C; Anderson, V L; Voorhees, W D; Blevins, W E; Inskeep, T K; Janas, W; Shupe, R E; Babbs, C F

    1984-11-01

    Results of irradiation-hyperthermia treatment in 11 dogs with naturally occurring hemangiopericytoma were reported. Similarities of canine and human hemangiopericytomas were described. Orthovoltage X-irradiation followed by microwave-induced hyperthermia resulted in a 91% objective response rate. A statistical procedure was given to evaluate quantitatively the clinical behavior of locally invasive, nonmetastatic tumors in dogs that were undergoing therapy for control of local disease. The procedure used a small sample size and demonstrated distribution of the data on a scaled response as well as transformation of the data through classical parametric and nonparametric statistical methods. These statistical methods set confidence limits on the population mean and placed tolerance limits on a population percentage. Application of the statistical methods to human and animal clinical trials was apparent.

  14. A weighted U statistic for association analyses considering genetic heterogeneity.

    PubMed

    Wei, Changshuai; Elston, Robert C; Lu, Qing

    2016-07-20

    Converging evidence suggests that common complex diseases with the same or similar clinical manifestations could have different underlying genetic etiologies. While current research interests have shifted toward uncovering rare variants and structural variations predisposing to human diseases, the impact of heterogeneity in genetic studies of complex diseases has been largely overlooked. Most of the existing statistical methods assume the disease under investigation has a homogeneous genetic effect and could, therefore, have low power if the disease undergoes heterogeneous pathophysiological and etiological processes. In this paper, we propose a heterogeneity-weighted U (HWU) method for association analyses considering genetic heterogeneity. HWU can be applied to various types of phenotypes (e.g., binary and continuous) and is computationally efficient for high-dimensional genetic data. Through simulations, we showed the advantage of HWU when the underlying genetic etiology of a disease was heterogeneous, as well as the robustness of HWU against different model assumptions (e.g., phenotype distributions). Using HWU, we conducted a genome-wide analysis of nicotine dependence from the Study of Addiction: Genetics and Environments dataset. The genome-wide analysis of nearly one million genetic markers took 7h, identifying heterogeneous effects of two new genes (i.e., CYP3A5 and IKBKB) on nicotine dependence. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

  15. Addressing issues associated with evaluating prediction models for survival endpoints based on the concordance statistic.

    PubMed

    Wang, Ming; Long, Qi

    2016-09-01

    Prediction models for disease risk and prognosis play an important role in biomedical research, and evaluating their predictive accuracy in the presence of censored data is of substantial interest. The standard concordance (c) statistic has been extended to provide a summary measure of predictive accuracy for survival models. Motivated by a prostate cancer study, we address several issues associated with evaluating survival prediction models based on c-statistic with a focus on estimators using the technique of inverse probability of censoring weighting (IPCW). Compared to the existing work, we provide complete results on the asymptotic properties of the IPCW estimators under the assumption of coarsening at random (CAR), and propose a sensitivity analysis under the mechanism of noncoarsening at random (NCAR). In addition, we extend the IPCW approach as well as the sensitivity analysis to high-dimensional settings. The predictive accuracy of prediction models for cancer recurrence after prostatectomy is assessed by applying the proposed approaches. We find that the estimated predictive accuracy for the models in consideration is sensitive to NCAR assumption, and thus identify the best predictive model. Finally, we further evaluate the performance of the proposed methods in both settings of low-dimensional and high-dimensional data under CAR and NCAR through simulations. © 2016, The International Biometric Society.

  16. Effect of Chronic Kidney Diseases on Mortality among Digoxin Users Treated for Non-Valvular Atrial Fibrillation: A Nationwide Register-Based Retrospective Cohort Study.

    PubMed

    Sessa, Maurizio; Mascolo, Annamaria; Andersen, Mikkel Porsborg; Rosano, Giuseppe; Rossi, Francesco; Capuano, Annalisa; Torp-Pedersen, Christian

    2016-01-01

    This study investigated the impact of chronic kidney disease on all-causes and cardiovascular mortality in patients with atrial fibrillation treated with digoxin. All patients with non-valvular atrial fibrillation and/or atrial flutter as hospitalization diagnosis from January 1, 1997 to December 31, 2012 were identified in Danish nationwide administrative registries. Cox proportional hazard model was used to compare the adjusted risk of all-causes and cardiovascular mortality among patients with and without chronic kidney disease and among patients with different chronic kidney disease stages within 180 days and 2 years from the first digoxin prescription. We identified 37,981 patients receiving digoxin; 1884 patients had the diagnosis of chronic kidney disease. Cox regression analysis showed no statistically significant differences in all-causes (Hazard Ratio, HR 0.89; 95% confident interval, CI 0.78-1.03) and cardiovascular mortality (HR 0.88; 95%CI 0.74-1.05) among patients with and without chronic kidney disease within 180 days of follow-up period. No statistically significant differences was found using a 2 years follow-up period neither for all causes mortality (HR 0.90; 95%CI 0.79-1.03), nor for cardiovascular mortality (HR 0.87; 95%CI 0.74-1.02). No statistically significant differences was found comparing patients with and without estimated Glomerular Filtration Rate <30ml/min/1.73m2 and patients with different stages of chronic kidney disease, for all-causes and cardiovascular mortality within 180 days and 2 years from the first digoxin prescription. This study suggest no direct effect of chronic kidney disease and chronic kidney disease stages on all-causes and cardiovascular mortality within both 180 days and 2 years from the first digoxin prescription in patients treatment-naïve with digoxin for non-valvular atrial fibrillation.

  17. The use of machine learning for the identification of peripheral artery disease and future mortality risk.

    PubMed

    Ross, Elsie Gyang; Shah, Nigam H; Dalman, Ronald L; Nead, Kevin T; Cooke, John P; Leeper, Nicholas J

    2016-11-01

    A key aspect of the precision medicine effort is the development of informatics tools that can analyze and interpret "big data" sets in an automated and adaptive fashion while providing accurate and actionable clinical information. The aims of this study were to develop machine learning algorithms for the identification of disease and the prognostication of mortality risk and to determine whether such models perform better than classical statistical analyses. Focusing on peripheral artery disease (PAD), patient data were derived from a prospective, observational study of 1755 patients who presented for elective coronary angiography. We employed multiple supervised machine learning algorithms and used diverse clinical, demographic, imaging, and genomic information in a hypothesis-free manner to build models that could identify patients with PAD and predict future mortality. Comparison was made to standard stepwise linear regression models. Our machine-learned models outperformed stepwise logistic regression models both for the identification of patients with PAD (area under the curve, 0.87 vs 0.76, respectively; P = .03) and for the prediction of future mortality (area under the curve, 0.76 vs 0.65, respectively; P = .10). Both machine-learned models were markedly better calibrated than the stepwise logistic regression models, thus providing more accurate disease and mortality risk estimates. Machine learning approaches can produce more accurate disease classification and prediction models. These tools may prove clinically useful for the automated identification of patients with highly morbid diseases for which aggressive risk factor management can improve outcomes. Copyright © 2016 Society for Vascular Surgery. Published by Elsevier Inc. All rights reserved.

  18. A DNA vaccine for Crimean-Congo hemorrhagic fever protects against disease and death in two lethal mouse models

    PubMed Central

    Fitzpatrick, Collin J.; Suschak, John J.; Richards, Michelle J.; Badger, Catherine V.; Six, Carolyn M.; Martin, Jacqueline D.; Hannaman, Drew; Zivcec, Marko; Bergeron, Eric; Koehler, Jeffrey W.; Schmaljohn, Connie S.

    2017-01-01

    Crimean-Congo hemorrhagic fever virus (CCHFV) is a tick-borne virus capable of causing a severe hemorrhagic fever disease in humans. There are currently no licensed vaccines to prevent CCHFV-associated disease. We developed a DNA vaccine expressing the M-segment glycoprotein precursor gene of CCHFV and assessed its immunogenicity and protective efficacy in two lethal mouse models of disease: type I interferon receptor knockout (IFNAR-/-) mice; and a novel transiently immune suppressed (IS) mouse model. Vaccination of mice by muscle electroporation of the M-segment DNA vaccine elicited strong antigen-specific humoral immune responses with neutralizing titers after three vaccinations in both IFNAR-/- and IS mouse models. To compare the protective efficacy of the vaccine in the two models, groups of vaccinated mice (7–10 per group) were intraperitoneally (IP) challenged with a lethal dose of CCHFV strain IbAr 10200. Weight loss was markedly reduced in CCHFV DNA-vaccinated mice as compared to controls. Furthermore, whereas all vector-control vaccinated mice succumbed to disease by day 5, the DNA vaccine protected >60% of the animals from lethal disease. Mice from both models developed comparable levels of antibodies, but the IS mice had a more balanced Th1/Th2 response to vaccination. There were no statistical differences in the protective efficacies of the vaccine in the two models. Our results provide the first comparison of these two mouse models for assessing a vaccine against CCHFV and offer supportive data indicating that a DNA vaccine expressing the glycoprotein genes of CCHFV elicits protective immunity against CCHFV. PMID:28922426

  19. A DNA vaccine for Crimean-Congo hemorrhagic fever protects against disease and death in two lethal mouse models.

    PubMed

    Garrison, Aura R; Shoemaker, Charles J; Golden, Joseph W; Fitzpatrick, Collin J; Suschak, John J; Richards, Michelle J; Badger, Catherine V; Six, Carolyn M; Martin, Jacqueline D; Hannaman, Drew; Zivcec, Marko; Bergeron, Eric; Koehler, Jeffrey W; Schmaljohn, Connie S

    2017-09-01

    Crimean-Congo hemorrhagic fever virus (CCHFV) is a tick-borne virus capable of causing a severe hemorrhagic fever disease in humans. There are currently no licensed vaccines to prevent CCHFV-associated disease. We developed a DNA vaccine expressing the M-segment glycoprotein precursor gene of CCHFV and assessed its immunogenicity and protective efficacy in two lethal mouse models of disease: type I interferon receptor knockout (IFNAR-/-) mice; and a novel transiently immune suppressed (IS) mouse model. Vaccination of mice by muscle electroporation of the M-segment DNA vaccine elicited strong antigen-specific humoral immune responses with neutralizing titers after three vaccinations in both IFNAR-/- and IS mouse models. To compare the protective efficacy of the vaccine in the two models, groups of vaccinated mice (7-10 per group) were intraperitoneally (IP) challenged with a lethal dose of CCHFV strain IbAr 10200. Weight loss was markedly reduced in CCHFV DNA-vaccinated mice as compared to controls. Furthermore, whereas all vector-control vaccinated mice succumbed to disease by day 5, the DNA vaccine protected >60% of the animals from lethal disease. Mice from both models developed comparable levels of antibodies, but the IS mice had a more balanced Th1/Th2 response to vaccination. There were no statistical differences in the protective efficacies of the vaccine in the two models. Our results provide the first comparison of these two mouse models for assessing a vaccine against CCHFV and offer supportive data indicating that a DNA vaccine expressing the glycoprotein genes of CCHFV elicits protective immunity against CCHFV.

  20. Diurnal fluctuations in brain volume: Statistical analyses of MRI from large populations.

    PubMed

    Nakamura, Kunio; Brown, Robert A; Narayanan, Sridar; Collins, D Louis; Arnold, Douglas L

    2015-09-01

    We investigated fluctuations in brain volume throughout the day using statistical modeling of magnetic resonance imaging (MRI) from large populations. We applied fully automated image analysis software to measure the brain parenchymal fraction (BPF), defined as the ratio of the brain parenchymal volume and intracranial volume, thus accounting for variations in head size. The MRI data came from serial scans of multiple sclerosis (MS) patients in clinical trials (n=755, 3269 scans) and from subjects participating in the Alzheimer's Disease Neuroimaging Initiative (ADNI, n=834, 6114 scans). The percent change in BPF was modeled with a linear mixed effect (LME) model, and the model was applied separately to the MS and ADNI datasets. The LME model for the MS datasets included random subject effects (intercept and slope over time) and fixed effects for the time-of-day, time from the baseline scan, and trial, which accounted for trial-related effects (for example, different inclusion criteria and imaging protocol). The model for ADNI additionally included the demographics (baseline age, sex, subject type [normal, mild cognitive impairment, or Alzheimer's disease], and interaction between subject type and time from baseline). There was a statistically significant effect of time-of-day on the BPF change in MS clinical trial datasets (-0.180 per day, that is, 0.180% of intracranial volume, p=0.019) as well as the ADNI dataset (-0.438 per day, that is, 0.438% of intracranial volume, p<0.0001), showing that the brain volume is greater in the morning. Linearly correcting the BPF values with the time-of-day reduced the required sample size to detect a 25% treatment effect (80% power and 0.05 significance level) on change in brain volume from 2 time-points over a period of 1year by 2.6%. Our results have significant implications for future brain volumetric studies, suggesting that there is a potential acquisition time bias that should be randomized or statistically controlled to account for the day-to-day brain volume fluctuations. Copyright © 2015 Elsevier Inc. All rights reserved.

  1. Predictive Big Data Analytics: A Study of Parkinson’s Disease Using Large, Complex, Heterogeneous, Incongruent, Multi-Source and Incomplete Observations

    PubMed Central

    Dinov, Ivo D.; Heavner, Ben; Tang, Ming; Glusman, Gustavo; Chard, Kyle; Darcy, Mike; Madduri, Ravi; Pa, Judy; Spino, Cathie; Kesselman, Carl; Foster, Ian; Deutsch, Eric W.; Price, Nathan D.; Van Horn, John D.; Ames, Joseph; Clark, Kristi; Hood, Leroy; Hampstead, Benjamin M.; Dauer, William; Toga, Arthur W.

    2016-01-01

    Background A unique archive of Big Data on Parkinson’s Disease is collected, managed and disseminated by the Parkinson’s Progression Markers Initiative (PPMI). The integration of such complex and heterogeneous Big Data from multiple sources offers unparalleled opportunities to study the early stages of prevalent neurodegenerative processes, track their progression and quickly identify the efficacies of alternative treatments. Many previous human and animal studies have examined the relationship of Parkinson’s disease (PD) risk to trauma, genetics, environment, co-morbidities, or life style. The defining characteristics of Big Data–large size, incongruency, incompleteness, complexity, multiplicity of scales, and heterogeneity of information-generating sources–all pose challenges to the classical techniques for data management, processing, visualization and interpretation. We propose, implement, test and validate complementary model-based and model-free approaches for PD classification and prediction. To explore PD risk using Big Data methodology, we jointly processed complex PPMI imaging, genetics, clinical and demographic data. Methods and Findings Collective representation of the multi-source data facilitates the aggregation and harmonization of complex data elements. This enables joint modeling of the complete data, leading to the development of Big Data analytics, predictive synthesis, and statistical validation. Using heterogeneous PPMI data, we developed a comprehensive protocol for end-to-end data characterization, manipulation, processing, cleaning, analysis and validation. Specifically, we (i) introduce methods for rebalancing imbalanced cohorts, (ii) utilize a wide spectrum of classification methods to generate consistent and powerful phenotypic predictions, and (iii) generate reproducible machine-learning based classification that enables the reporting of model parameters and diagnostic forecasting based on new data. We evaluated several complementary model-based predictive approaches, which failed to generate accurate and reliable diagnostic predictions. However, the results of several machine-learning based classification methods indicated significant power to predict Parkinson’s disease in the PPMI subjects (consistent accuracy, sensitivity, and specificity exceeding 96%, confirmed using statistical n-fold cross-validation). Clinical (e.g., Unified Parkinson's Disease Rating Scale (UPDRS) scores), demographic (e.g., age), genetics (e.g., rs34637584, chr12), and derived neuroimaging biomarker (e.g., cerebellum shape index) data all contributed to the predictive analytics and diagnostic forecasting. Conclusions Model-free Big Data machine learning-based classification methods (e.g., adaptive boosting, support vector machines) can outperform model-based techniques in terms of predictive precision and reliability (e.g., forecasting patient diagnosis). We observed that statistical rebalancing of cohort sizes yields better discrimination of group differences, specifically for predictive analytics based on heterogeneous and incomplete PPMI data. UPDRS scores play a critical role in predicting diagnosis, which is expected based on the clinical definition of Parkinson’s disease. Even without longitudinal UPDRS data, however, the accuracy of model-free machine learning based classification is over 80%. The methods, software and protocols developed here are openly shared and can be employed to study other neurodegenerative disorders (e.g., Alzheimer’s, Huntington’s, amyotrophic lateral sclerosis), as well as for other predictive Big Data analytics applications. PMID:27494614

  2. Cognitive control related network analysis: A novel way to measure neuron fiber connection of Alzheimer's disease.

    PubMed

    Changle Zhang; Tao Chai; Na Gao; Ma, Heather T

    2017-07-01

    Effective measurement of cognitive impairment caused by Alzheimer's disease (AD) will provide a chance for early medical intervention and delay the disease onset. Diffusion tensor imaging (DTI) provides a non-intrusive examination of cranial nerve diseases which can help us observe the microstructure of neuron fibers. Cognitive control network (CCN) consists of the brain regions that highly related to human self-control. In this study, hub-and-spoke model which was widely used in transportation and sociology area had been employed to analyze the relationship of CCN and other regions under its control, cognitive control related network (CCRN) was built by applying this model. Local and global graph theoretical parameters were calculated and went through statistical analysis. Significant difference had been found in the scale of local as well as global which may represent the impairment of cognitive control ability. This result may provide a potential bio-marker for the loss of connection caused by Alzheimer's disease.

  3. Dengue: recent past and future threats

    PubMed Central

    Rogers, David J.

    2015-01-01

    This article explores four key questions about statistical models developed to describe the recent past and future of vector-borne diseases, with special emphasis on dengue: (1) How many variables should be used to make predictions about the future of vector-borne diseases?(2) Is the spatial resolution of a climate dataset an important determinant of model accuracy?(3) Does inclusion of the future distributions of vectors affect predictions of the futures of the diseases they transmit?(4) Which are the key predictor variables involved in determining the distributions of vector-borne diseases in the present and future?Examples are given of dengue models using one, five or 10 meteorological variables and at spatial resolutions of from one-sixth to two degrees. Model accuracy is improved with a greater number of descriptor variables, but is surprisingly unaffected by the spatial resolution of the data. Dengue models with a reduced set of climate variables derived from the HadCM3 global circulation model predictions for the 1980s are improved when risk maps for dengue's two main vectors (Aedes aegypti and Aedes albopictus) are also included as predictor variables; disease and vector models are projected into the future using the global circulation model predictions for the 2020s, 2040s and 2080s. The Garthwaite–Koch corr-max transformation is presented as a novel way of showing the relative contribution of each of the input predictor variables to the map predictions. PMID:25688021

  4. An agent-based approach for modeling dynamics of contagious disease spread

    PubMed Central

    Perez, Liliana; Dragicevic, Suzana

    2009-01-01

    Background The propagation of communicable diseases through a population is an inherent spatial and temporal process of great importance for modern society. For this reason a spatially explicit epidemiologic model of infectious disease is proposed for a greater understanding of the disease's spatial diffusion through a network of human contacts. Objective The objective of this study is to develop an agent-based modelling approach the integrates geographic information systems (GIS) to simulate the spread of a communicable disease in an urban environment, as a result of individuals' interactions in a geospatial context. Methods The methodology for simulating spatiotemporal dynamics of communicable disease propagation is presented and the model is implemented using measles outbreak in an urban environment as a case study. Individuals in a closed population are explicitly represented by agents associated to places where they interact with other agents. They are endowed with mobility, through a transportation network allowing them to move between places within the urban environment, in order to represent the spatial heterogeneity and the complexity involved in infectious diseases diffusion. The model is implemented on georeferenced land use dataset from Metro Vancouver and makes use of census data sets from Statistics Canada for the municipality of Burnaby, BC, Canada study site. Results The results provide insights into the application of the model to calculate ratios of susceptible/infected in specific time frames and urban environments, due to its ability to depict the disease progression based on individuals' interactions. It is demonstrated that the dynamic spatial interactions within the population lead to high numbers of exposed individuals who perform stationary activities in areas after they have finished commuting. As a result, the sick individuals are concentrated in geographical locations like schools and universities. Conclusion The GIS-agent based model designed for this study can be easily customized to study the disease spread dynamics of any other communicable disease by simply adjusting the modeled disease timeline and/or the infection model and modifying the transmission process. This type of simulations can help to improve comprehension of disease spread dynamics and to take better steps towards the prevention and control of an epidemic outbreak. PMID:19656403

  5. Role of Statistical Random-Effects Linear Models in Personalized Medicine.

    PubMed

    Diaz, Francisco J; Yeh, Hung-Wen; de Leon, Jose

    2012-03-01

    Some empirical studies and recent developments in pharmacokinetic theory suggest that statistical random-effects linear models are valuable tools that allow describing simultaneously patient populations as a whole and patients as individuals. This remarkable characteristic indicates that these models may be useful in the development of personalized medicine, which aims at finding treatment regimes that are appropriate for particular patients, not just appropriate for the average patient. In fact, published developments show that random-effects linear models may provide a solid theoretical framework for drug dosage individualization in chronic diseases. In particular, individualized dosages computed with these models by means of an empirical Bayesian approach may produce better results than dosages computed with some methods routinely used in therapeutic drug monitoring. This is further supported by published empirical and theoretical findings that show that random effects linear models may provide accurate representations of phase III and IV steady-state pharmacokinetic data, and may be useful for dosage computations. These models have applications in the design of clinical algorithms for drug dosage individualization in chronic diseases; in the computation of dose correction factors; computation of the minimum number of blood samples from a patient that are necessary for calculating an optimal individualized drug dosage in therapeutic drug monitoring; measure of the clinical importance of clinical, demographic, environmental or genetic covariates; study of drug-drug interactions in clinical settings; the implementation of computational tools for web-site-based evidence farming; design of pharmacogenomic studies; and in the development of a pharmacological theory of dosage individualization.

  6. Accounting for disease modifying therapy in models of clinical progression in multiple sclerosis.

    PubMed

    Healy, Brian C; Engler, David; Gholipour, Taha; Weiner, Howard; Bakshi, Rohit; Chitnis, Tanuja

    2011-04-15

    Identifying predictors of clinical progression in patients with relapsing-remitting multiple sclerosis (RRMS) is complicated in the era of disease modifying therapy (DMT) because patients follow many different DMT regimens. To investigate predictors of progression in a treated RRMS sample, a cohort of RRMS patients was prospectively followed in the Comprehensive Longitudinal Investigation of Multiple Sclerosis at the Brigham and Women's Hospital (CLIMB). Enrollment criteria were exposure to either interferon-β (IFN-β, n=164) or glatiramer acetate (GA, n=114) for at least 6 months prior to study entry. Baseline demographic and clinical features were used as candidate predictors of longitudinal clinical change on the Expanded Disability Status Scale (EDSS). We compared three approaches to account for DMT effects in statistical modeling. In all approaches, we analyzed all patients together and stratified based on baseline DMT. Model 1 used all available longitudinal EDSS scores, even those after on-study DMT changes. Model 2 used only clinical observations prior to changing DMT. Model 3 used causal statistical models to identify predictors of clinical change. When all patients were considered using Model 1, patients with a motor symptom as the first relapse had significantly larger change in EDSS scores during follow-up (p=0.04); none of the other clinical or demographic variables significantly predicted change. In Models 2 and 3, results were generally unchanged. DMT modeling choice had a modest impact on the variables classified as predictors of EDSS score change. Importantly, however, interpretation of these predictors is dependent upon modeling choice. Copyright © 2011 Elsevier B.V. All rights reserved.

  7. Personalizing oncology treatments by predicting drug efficacy, side-effects, and improved therapy: mathematics, statistics, and their integration.

    PubMed

    Agur, Zvia; Elishmereni, Moran; Kheifetz, Yuri

    2014-01-01

    Despite its great promise, personalized oncology still faces many hurdles, and it is increasingly clear that targeted drugs and molecular biomarkers alone yield only modest clinical benefit. One reason is the complex relationships between biomarkers and the patient's response to drugs, obscuring the true weight of the biomarkers in the overall patient's response. This complexity can be disentangled by computational models that integrate the effects of personal biomarkers into a simulator of drug-patient dynamic interactions, for predicting the clinical outcomes. Several computational tools have been developed for personalized oncology, notably evidence-based tools for simulating pharmacokinetics, Bayesian-estimated tools for predicting survival, etc. We describe representative statistical and mathematical tools, and discuss their merits, shortcomings and preliminary clinical validation attesting to their potential. Yet, the individualization power of mathematical models alone, or statistical models alone, is limited. More accurate and versatile personalization tools can be constructed by a new application of the statistical/mathematical nonlinear mixed effects modeling (NLMEM) approach, which until recently has been used only in drug development. Using these advanced tools, clinical data from patient populations can be integrated with mechanistic models of disease and physiology, for generating personal mathematical models. Upon a more substantial validation in the clinic, this approach will hopefully be applied in personalized clinical trials, P-trials, hence aiding the establishment of personalized medicine within the main stream of clinical oncology. © 2014 Wiley Periodicals, Inc.

  8. Diagnostic index: an open-source tool to classify TMJ OA condyles

    NASA Astrophysics Data System (ADS)

    Paniagua, Beatriz; Pascal, Laura; Prieto, Juan; Vimort, Jean Baptiste; Gomes, Liliane; Yatabe, Marilia; Ruellas, Antonio Carlos; Budin, Francois; Pieper, Steve; Styner, Martin; Benavides, Erika; Cevidanes, Lucia

    2017-03-01

    Osteoarthritis (OA) of temporomandibular joints (TMJ) occurs in about 40% of the patients who present TMJ disorders. Despite its prevalence, OA diagnosis and treatment remain controversial since there are no clear symptoms of the disease, especially in early stages. Quantitative tools based on 3D imaging of the TMJ condyle have the potential to help characterize TMJ OA changes. The goals of the tools proposed in this study are to ultimately develop robust imaging markers for diagnosis and assessment of treatment efficacy. This work proposes to identify differences among asymptomatic controls and different clinical phenotypes of TMJ OA by means of Statistical Shape Modeling (SSM), obtained via clinical expert consensus. From three different grouping schemes (with 3, 5 and 7 groups), our best results reveal that that the majority (74.5%) of the classifications occur in agreement with the groups assigned by consensus between our clinical experts. Our findings suggest the existence of different disease-based phenotypic morphologies in TMJ OA. Our preliminary findings with statistical shape modeling based biomarkers may provide a quantitative staging of the disease. The methodology used in this study is included in an open source image analysis toolbox, to ensure reproducibility and appropriate distribution and dissemination of the solution proposed.

  9. Association of the insertion allele of the common ACE gene polymorphism with type 2 diabetes mellitus among Kuwaiti cardiovascular disease patients.

    PubMed

    Al-Serri, Ahmad; Ismael, Fatma G; Al-Bustan, Suzanne A; Al-Rashdan, Ibrahim

    2015-12-01

    The D allele of the common angiotensin-converting enzyme (ACE) I/D gene polymorphism (rs4646994) predisposes to type 2 diabetes mellitus (T2DM) and cardiovascular disease (CVD). However, results on which allele predisposes to disease susceptibility remain controversial in Asian populations. This study was performed to evaluate the association of the common ACE I/D gene polymorphism with both T2DM and CVD susceptibility in an Arab population. We genotyped the ACE I/D polymorphisms by direct allele-specific PCR in 183 healthy controls and 400 CVD patients with diabetes (n=204) and without (n=196). Statistical analysis comparing between the different groups were conducted using R statistic package "SNPassoc". Two genetic models were used: the additive and co-dominant models. The I allele was found to be associated with T2DM (OR=1.84, p=0.00009) after adjusting for age, sex and body mass index. However, there was no association with CVD susceptibility (p>0.05). The ACE I allele is found to be associated with T2DM; however, no association was observed with CVD. The inconsistency between studies is suggested to be attributed to genetic diversity due to the existence of sub-populations found in Asian populations. © The Author(s) 2015.

  10. Risk models for post-endoscopic retrograde cholangiopancreatography pancreatitis (PEP): smoking and chronic liver disease are predictors of protection against PEP.

    PubMed

    DiMagno, Matthew J; Spaete, Joshua P; Ballard, Darren D; Wamsteker, Erik-Jan; Saini, Sameer D

    2013-08-01

    We investigated which variables independently associated with protection against or development of postendoscopic retrograde cholangiopancreatography (ERCP) pancreatitis (PEP) and severity of PEP. Subsequently, we derived predictive risk models for PEP. In a case-control design, 6505 patients had 8264 ERCPs, 211 patients had PEP, and 22 patients had severe PEP. We randomly selected 348 non-PEP controls. We examined 7 established- and 9 investigational variables. In univariate analysis, 7 variables predicted PEP: younger age, female sex, suspected sphincter of Oddi dysfunction (SOD), pancreatic sphincterotomy, moderate-difficult cannulation (MDC), pancreatic stent placement, and lower Charlson score. Protective variables were current smoking, former drinking, diabetes, and chronic liver disease (CLD, biliary/transplant complications). Multivariate analysis identified seven independent variables for PEP, three protective (current smoking, CLD-biliary, CLD-transplant/hepatectomy complications) and 4 predictive (younger age, suspected SOD, pancreatic sphincterotomy, MDC). Pre- and post-ERCP risk models of 7 variables have a C-statistic of 0.74. Removing age (seventh variable) did not significantly affect the predictive value (C-statistic of 0.73) and reduced model complexity. Severity of PEP did not associate with any variables by multivariate analysis. By using the newly identified protective variables with 3 predictive variables, we derived 2 risk models with a higher predictive value for PEP compared to prior studies.

  11. Statistical analysis plan for the family-led rehabilitation after stroke in India (ATTEND) trial: A multicenter randomized controlled trial of a new model of stroke rehabilitation compared to usual care.

    PubMed

    Billot, Laurent; Lindley, Richard I; Harvey, Lisa A; Maulik, Pallab K; Hackett, Maree L; Murthy, Gudlavalleti Vs; Anderson, Craig S; Shamanna, Bindiganavale R; Jan, Stephen; Walker, Marion; Forster, Anne; Langhorne, Peter; Verma, Shweta J; Felix, Cynthia; Alim, Mohammed; Gandhi, Dorcas Bc; Pandian, Jeyaraj Durai

    2017-02-01

    Background In low- and middle-income countries, few patients receive organized rehabilitation after stroke, yet the burden of chronic diseases such as stroke is increasing in these countries. Affordable models of effective rehabilitation could have a major impact. The ATTEND trial is evaluating a family-led caregiver delivered rehabilitation program after stroke. Objective To publish the detailed statistical analysis plan for the ATTEND trial prior to trial unblinding. Methods Based upon the published registration and protocol, the blinded steering committee and management team, led by the trial statistician, have developed a statistical analysis plan. The plan has been informed by the chosen outcome measures, the data collection forms and knowledge of key baseline data. Results The resulting statistical analysis plan is consistent with best practice and will allow open and transparent reporting. Conclusions Publication of the trial statistical analysis plan reduces potential bias in trial reporting, and clearly outlines pre-specified analyses. Clinical Trial Registrations India CTRI/2013/04/003557; Australian New Zealand Clinical Trials Registry ACTRN1261000078752; Universal Trial Number U1111-1138-6707.

  12. Pattern Recognition Analysis of Age-Related Retinal Ganglion Cell Signatures in the Human Eye

    PubMed Central

    Yoshioka, Nayuta; Zangerl, Barbara; Nivison-Smith, Lisa; Khuu, Sieu K.; Jones, Bryan W.; Pfeiffer, Rebecca L.; Marc, Robert E.; Kalloniatis, Michael

    2017-01-01

    Purpose To characterize macular ganglion cell layer (GCL) changes with age and provide a framework to assess changes in ocular disease. This study used data clustering to analyze macular GCL patterns from optical coherence tomography (OCT) in a large cohort of subjects without ocular disease. Methods Single eyes of 201 patients evaluated at the Centre for Eye Health (Sydney, Australia) were retrospectively enrolled (age range, 20–85); 8 × 8 grid locations obtained from Spectralis OCT macular scans were analyzed with unsupervised classification into statistically separable classes sharing common GCL thickness and change with age. The resulting classes and gridwise data were fitted with linear and segmented linear regression curves. Additionally, normalized data were analyzed to determine regression as a percentage. Accuracy of each model was examined through comparison of predicted 50-year-old equivalent macular GCL thickness for the entire cohort to a true 50-year-old reference cohort. Results Pattern recognition clustered GCL thickness across the macula into five to eight spatially concentric classes. F-test demonstrated segmented linear regression to be the most appropriate model for macular GCL change. The pattern recognition–derived and normalized model revealed less difference between the predicted macular GCL thickness and the reference cohort (average ± SD 0.19 ± 0.92 and −0.30 ± 0.61 μm) than a gridwise model (average ± SD 0.62 ± 1.43 μm). Conclusions Pattern recognition successfully identified statistically separable macular areas that undergo a segmented linear reduction with age. This regression model better predicted macular GCL thickness. The various unique spatial patterns revealed by pattern recognition combined with core GCL thickness data provide a framework to analyze GCL loss in ocular disease. PMID:28632847

  13. Early increase and late decrease of purkinje cell dendritic spine density in prion-infected organotypic mouse cerebellar cultures.

    PubMed

    Campeau, Jody L; Wu, Gengshu; Bell, John R; Rasmussen, Jay; Sim, Valerie L

    2013-01-01

    Prion diseases are infectious neurodegenerative diseases associated with the accumulation of protease-resistant prion protein, neuronal loss, spongiform change and astrogliosis. In the mouse model, the loss of dendritic spines is one of the earliest pathological changes observed in vivo, occurring 4-5 weeks after the first detection of protease-resistant prion protein in the brain. While there are cell culture models of prion infection, most do not recapitulate the neuropathology seen in vivo. Only the recently developed prion organotypic slice culture assay has been reported to undergo neuronal loss and the development of some aspects of prion pathology, namely small vacuolar degeneration and tubulovesicular bodies. Given the rapid replication of prions in this system, with protease-resistant prion protein detectable by 21 days, we investigated whether the dendritic spine loss and altered dendritic morphology seen in prion disease might also develop within the lifetime of this culture system. Indeed, six weeks after first detection of protease-resistant prion protein in tga20 mouse cerebellar slice cultures infected with RML prion strain, we found a statistically significant loss of Purkinje cell dendritic spines and altered dendritic morphology in infected cultures, analogous to that seen in vivo. In addition, we found a transient but statistically significant increase in Purkinje cell dendritic spine density during infection, at the time when protease-resistant prion protein was first detectable in culture. Our findings support the use of this slice culture system as one which recapitulates prion disease pathology and one which may facilitate study of the earliest stages of prion disease pathogenesis.

  14. A Bayesian Model for the Prediction and Early Diagnosis of Alzheimer's Disease.

    PubMed

    Alexiou, Athanasios; Mantzavinos, Vasileios D; Greig, Nigel H; Kamal, Mohammad A

    2017-01-01

    Alzheimer's disease treatment is still an open problem. The diversity of symptoms, the alterations in common pathophysiology, the existence of asymptomatic cases, the different types of sporadic and familial Alzheimer's and their relevance with other types of dementia and comorbidities, have already created a myth-fear against the leading disease of the twenty first century. Many failed latest clinical trials and novel medications have revealed the early diagnosis as the most critical treatment solution, even though scientists tested the amyloid hypothesis and few related drugs. Unfortunately, latest studies have indicated that the disease begins at the very young ages thus making it difficult to determine the right time of proper treatment. By taking into consideration all these multivariate aspects and unreliable factors against an appropriate treatment, we focused our research on a non-classic statistical evaluation of the most known and accepted Alzheimer's biomarkers. Therefore, in this paper, the code and few experimental results of a computational Bayesian tool have being reported, dedicated to the correlation and assessment of several Alzheimer's biomarkers to export a probabilistic medical prognostic process. This new statistical software is executable in the Bayesian software Winbugs, based on the latest Alzheimer's classification and the formulation of the known relative probabilities of the various biomarkers, correlated with Alzheimer's progression, through a set of discrete distributions. A user-friendly web page has been implemented for the supporting of medical doctors and researchers, to upload Alzheimer's tests and receive statistics on the occurrence of Alzheimer's disease development or presence, due to abnormal testing in one or more biomarkers.

  15. Analysis of Exhaled Breath Volatile Organic Compounds in Inflammatory Bowel Disease: A Pilot Study.

    PubMed

    Hicks, Lucy C; Huang, Juzheng; Kumar, Sacheen; Powles, Sam T; Orchard, Timothy R; Hanna, George B; Williams, Horace R T

    2015-09-01

    Distinguishing between the inflammatory bowel diseases [IBD], Crohn's disease [CD] and ulcerative colitis [UC], is important for determining management and prognosis. Selected ion flow tube mass spectrometry [SIFT-MS] may be used to analyse volatile organic compounds [VOCs] in exhaled breath: these may be altered in disease states, and distinguishing breath VOC profiles can be identified. The aim of this pilot study was to identify, quantify, and analyse VOCs present in the breath of IBD patients and controls, potentially providing insights into disease pathogenesis and complementing current diagnostic algorithms. SIFT-MS breath profiling of 56 individuals [20 UC, 18 CD, and 18 healthy controls] was undertaken. Multivariate analysis included principal components analysis and partial least squares discriminant analysis with orthogonal signal correction [OSC-PLS-DA]. Receiver operating characteristic [ROC] analysis was performed for each comparative analysis using statistically significant VOCs. OSC-PLS-DA modelling was able to distinguish both CD and UC from healthy controls and from one other with good sensitivity and specificity. ROC analysis using combinations of statistically significant VOCs [dimethyl sulphide, hydrogen sulphide, hydrogen cyanide, ammonia, butanal, and nonanal] gave integrated areas under the curve of 0.86 [CD vs healthy controls], 0.74 [UC vs healthy controls], and 0.83 [CD vs UC]. Exhaled breath VOC profiling was able to distinguish IBD patients from controls, as well as to separate UC from CD, using both multivariate and univariate statistical techniques. Copyright © 2015 European Crohn’s and Colitis Organisation (ECCO). Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  16. Community air pollution and mortality: Analysis of 1980 data from US metropolitan areas. 1: Particulate air pollution

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lipfert, F.W.

    1992-11-01

    1980 data from up to 149 metropolitan areas were used to define cross-sectional associations between community air pollution and excess human mortality. The regression model proposed by Oezkaynak and Thurston, which accounted for age, race, education, poverty, and population density, was evaluated and several new models were developed. The new models also accounted for population change, drinking water hardness, and smoking, and included a more detailed description of race. Cause-of-death categories analyzed include all causes, all non-external causes, major cardiovascular diseases, and chronic obstructive pulmonary diseases (COPD). Both annual mortality rates and their logarithms were analyzed. The data on particulatesmore » were averaged across all monitoring stations available for each SMSA and the TSP data were restricted to the year 1980. The associations between mortality and air pollution were found to be dependent on the socioeconomic factors included in the models, the specific locations included din the data set, and the type of statistical model used. Statistically significant associations were found between TSP and mortality due to non-external causes with log-linear models, but not with a linear model, and between TS and COPD mortality for both linear and log-linear models. When the sulfate contribution to TSP was subtracted, the relationship with COPD mortality was strengthened. Scatter plots and quintile analyses suggested a TSP threshold for COPD mortality at around 65 ug/m{sup 3} (annual average). SO{sub 4}{sup {minus}2}, Mn, PM{sup 15}, and PM{sub 2.5} were not significantly associated with mortality using the new models.« less

  17. Understanding amyloid aggregation by statistical analysis of atomic force microscopy images

    NASA Astrophysics Data System (ADS)

    Adamcik, Jozef; Jung, Jin-Mi; Flakowski, Jérôme; de Los Rios, Paolo; Dietler, Giovanni; Mezzenga, Raffaele

    2010-06-01

    The aggregation of proteins is central to many aspects of daily life, including food processing, blood coagulation, eye cataract formation disease and prion-related neurodegenerative infections. However, the physical mechanisms responsible for amyloidosis-the irreversible fibril formation of various proteins that is linked to disorders such as Alzheimer's, Creutzfeldt-Jakob and Huntington's diseases-have not yet been fully elucidated. Here, we show that different stages of amyloid aggregation can be examined by performing a statistical polymer physics analysis of single-molecule atomic force microscopy images of heat-denatured β-lactoglobulin fibrils. The atomic force microscopy analysis, supported by theoretical arguments, reveals that the fibrils have a multistranded helical shape with twisted ribbon-like structures. Our results also indicate a possible general model for amyloid fibril assembly and illustrate the potential of this approach for investigating fibrillar systems.

  18. A spatial scan statistic for compound Poisson data.

    PubMed

    Rosychuk, Rhonda J; Chang, Hsing-Ming

    2013-12-20

    The topic of spatial cluster detection gained attention in statistics during the late 1980s and early 1990s. Effort has been devoted to the development of methods for detecting spatial clustering of cases and events in the biological sciences, astronomy and epidemiology. More recently, research has examined detecting clusters of correlated count data associated with health conditions of individuals. Such a method allows researchers to examine spatial relationships of disease-related events rather than just incident or prevalent cases. We introduce a spatial scan test that identifies clusters of events in a study region. Because an individual case may have multiple (repeated) events, we base the test on a compound Poisson model. We illustrate our method for cluster detection on emergency department visits, where individuals may make multiple disease-related visits. Copyright © 2013 John Wiley & Sons, Ltd.

  19. Semi-quantitative assessment of disease risks at the human, livestock, wildlife interface for the Republic of Korea using a nationwide survey of experts: A model for other countries.

    PubMed

    Hwang, J; Lee, K; Walsh, D; Kim, S W; Sleeman, J M; Lee, H

    2018-02-01

    Wildlife-associated diseases and pathogens have increased in importance; however, management of a large number of diseases and diversity of hosts is prohibitively expensive. Thus, the determination of priority wildlife pathogens and risk factors for disease emergence is warranted. We used an online questionnaire survey to assess release and exposure risks, and consequences of wildlife-associated diseases and pathogens in the Republic of Korea (ROK). We also surveyed opinions on pathways for disease exposure, and risk factors for disease emergence and spread. For the assessment of risk, we employed a two-tiered, statistical K-means clustering algorithm to group diseases into three levels (high, medium and low) of perceived risk based on release and exposure risks, societal consequences and the level of uncertainty of the experts' opinions. To examine the experts' perceived risk of routes of introduction of pathogens and disease amplification and spread, we used a Bayesian, multivariate normal order-statistics model. Six diseases or pathogens, including four livestock and two wildlife diseases, were identified as having high risk with low uncertainty. Similarly, 13 diseases were characterized as having high risk with medium uncertainty with three of these attributed to livestock, six associated with human disease, and the remainder having the potential to affect human, livestock and wildlife (i.e., One Health). Lastly, four diseases were described as high risk with high certainty, and were associated solely with fish diseases. Experts identified migration of wildlife, international human movement and illegal importation of wildlife as the three routes posing the greatest risk of pathogen introduction into ROK. Proximity of humans, livestock and wildlife was the most significant risk factor for promoting the spread of wildlife-associated diseases and pathogens, followed by high density of livestock populations, habitat loss and environmental degradation, and climate change. This study provides useful information to decision makers responsible for allocating resources to address disease risks. This approach provided a rapid, cost-effective method of risk assessment of wildlife-associated diseases and pathogens for which the published literature is sparse. Published 2017. This article is a U.S. Government work and is in the public domain in the USA.

  20. Additive prognostic value of interleukin-6 at peak phase of dobutamine stress echocardiography in patients with coronary artery disease. A 6-year follow-up study.

    PubMed

    Ikonomidis, Ignatios; Athanassopoulos, George; Stamatelopoulos, Kimon; Lekakis, John; Revela, Ioanna; Venetsanou, Kiriaki; Marinou, Margarita; Monaco, Claudia; Cokkinos, Dennis V; Nihoyannopoulos, Petros

    2008-08-01

    Interleukin-6 (IL-6) and tissue factor (TF) are elevated after myocardial ischemia during dobutamine stress echo (DSE). We examined the incremental prognostic value of IL-6 or TF measured during DSE over echocardiographic and clinical factors in patients with chronic coronary artery disease (CAD). We studied 106 patients with angiographically documented CAD. IL-6 and TF were measured at rest, peak, and during recovery. A wall motion score index was calculated. Fifty-seven (54%) patients had ischemia at DSE. During follow-up (63.7 +/- 20 months), 36 patients (33%) had an adverse event (12 cardiac deaths, 24 acute coronary events). Patients with events had a higher peak IL-6 (P = .02) but similar rest and recovery IL-6 than those without. Patients with peak IL-6 > or =3.14 pg/mL (upper tertile) had a hazard ratio of 2.7 (95% CI 1.44-5.37) (P < .01 for an adverse event). The addition of peak wall motion score index in a multivariable model including risk factors, ejection fraction, revascularization, and multivessel disease increased the model's c statistic from 0.66 to 0.70 (P = .04). The addition of peak IL-6 further increased the model's c statistic to 0.75 (P = .04). Tissue factor was not related with cardiac events. Interleuikin-6 levels measured during the peak phase of DSE incrementally contribute to risk stratification in patients with chronic CAD.

  1. Plasma Lipidomic Profiles Improve on Traditional Risk Factors for the Prediction of Cardiovascular Events in Type 2 Diabetes Mellitus.

    PubMed

    Alshehry, Zahir H; Mundra, Piyushkumar A; Barlow, Christopher K; Mellett, Natalie A; Wong, Gerard; McConville, Malcolm J; Simes, John; Tonkin, Andrew M; Sullivan, David R; Barnes, Elizabeth H; Nestel, Paul J; Kingwell, Bronwyn A; Marre, Michel; Neal, Bruce; Poulter, Neil R; Rodgers, Anthony; Williams, Bryan; Zoungas, Sophia; Hillis, Graham S; Chalmers, John; Woodward, Mark; Meikle, Peter J

    2016-11-22

    Clinical lipid measurements do not show the full complexity of the altered lipid metabolism associated with diabetes mellitus or cardiovascular disease. Lipidomics enables the assessment of hundreds of lipid species as potential markers for disease risk. Plasma lipid species (310) were measured by a targeted lipidomic analysis with liquid chromatography electrospray ionization-tandem mass spectrometry on a case-cohort (n=3779) subset from the ADVANCE trial (Action in Diabetes and Vascular Disease: Preterax and Diamicron-MR Controlled Evaluation). The case-cohort was 61% male with a mean age of 67 years. All participants had type 2 diabetes mellitus with ≥1 additional cardiovascular risk factors, and 35% had a history of macrovascular disease. Weighted Cox regression was used to identify lipid species associated with future cardiovascular events (nonfatal myocardial infarction, nonfatal stroke, and cardiovascular death) and cardiovascular death during a 5-year follow-up period. Multivariable models combining traditional risk factors with lipid species were optimized with the Akaike information criteria. C statistics and NRIs were calculated within a 5-fold cross-validation framework. Sphingolipids, phospholipids (including lyso- and ether- species), cholesteryl esters, and glycerolipids were associated with future cardiovascular events and cardiovascular death. The addition of 7 lipid species to a base model (14 traditional risk factors and medications) to predict cardiovascular events increased the C statistic from 0.680 (95% confidence interval [CI], 0.678-0.682) to 0.700 (95% CI, 0.698-0.702; P<0.0001) with a corresponding continuous NRI of 0.227 (95% CI, 0.219-0.235). The prediction of cardiovascular death was improved with the incorporation of 4 lipid species into the base model, showing an increase in the C statistic from 0.740 (95% CI, 0.738-0.742) to 0.760 (95% CI, 0.757-0.762; P<0.0001) and a continuous net reclassification index of 0.328 (95% CI, 0.317-0.339). The results were validated in a subcohort with type 2 diabetes mellitus (n=511) from the LIPID trial (Long-Term Intervention With Pravastatin in Ischemic Disease). The improvement in the prediction of cardiovascular events, above traditional risk factors, demonstrates the potential of plasma lipid species as biomarkers for cardiovascular risk stratification in diabetes mellitus. URL: https://clinicaltrials.gov. Unique identifier: NCT00145925. © 2016 American Heart Association, Inc.

  2. Modeling Effects of Temperature, Soil, Moisture, Nutrition and Variety As Determinants of Severity of Pythium Damping-Off and Root Disease in Subterranean Clover

    PubMed Central

    You, Ming P.; Rensing, Kelly; Renton, Michael; Barbetti, Martin J.

    2017-01-01

    Subterranean clover (Trifolium subterraneum) is a critical pasture legume in Mediterranean regions of southern Australia and elsewhere, including Mediterranean-type climatic regions in Africa, Asia, Australia, Europe, North America, and South America. Pythium damping-off and root disease caused by Pythium irregulare is a significant threat to subterranean clover in Australia and a study was conducted to define how environmental factors (viz. temperature, soil type, moisture and nutrition) as well as variety, influence the extent of damping-off and root disease as well as subterranean clover productivity under challenge by this pathogen. Relationships were statistically modeled using linear and generalized linear models and boosted regression trees. Modeling found complex relationships between explanatory variables and the extent of Pythium damping-off and root rot. Linear modeling identified high-level (4 or 5-way) significant interactions for each dependent variable (dry shoot and root weight, emergence, tap and lateral root disease index). Furthermore, all explanatory variables (temperature, soil, moisture, nutrition, variety) were found significant as part of some interaction within these models. A significant five-way interaction between all explanatory variables was found for both dry shoot and root dry weights, and a four way interaction between temperature, soil, moisture, and nutrition was found for both tap and lateral root disease index. A second approach to modeling using boosted regression trees provided support for and helped clarify the complex nature of the relationships found in linear models. All explanatory variables showed at least 5% relative influence on each of the five dependent variables. All models indicated differences due to soil type, with the sand-based soil having either higher weights, greater emergence, or lower disease indices; while lowest weights and less emergence, as well as higher disease indices, were found for loam soil and low temperature. There was more severe tap and lateral root rot disease in higher moisture situations. PMID:29184544

  3. Quantitative analysis of Porcine Reproductive and Respiratory Syndrome (PRRS) viremia profiles from experimental infection: a statistical modelling approach

    USDA-ARS?s Scientific Manuscript database

    Porcine reproductive and respiratory syndrome (PRRS) is the most economically significant viral disease facing the global swine industry. Viremia profiles of PRRS virus challenged pigs reflect the severity and progression of the infection within the host and provide crucial information for subsequen...

  4. Relapse prediction in Graves´ disease: Towards mathematical modeling of clinical, immune and genetic markers.

    PubMed

    Langenstein, Christoph; Schork, Diana; Badenhoop, Klaus; Herrmann, Eva

    2016-12-01

    Graves' disease (GD) is an important and prevalent thyroid autoimmune disorder. Standard therapy for GD consists of antithyroid drugs (ATD) with treatment periods of around 12 months but relapse is frequent. Since predictors for relapse are difficult to identify the individual decision making for optimal treatment is often arbitrary. After reviewing the literature on this topic we summarize important factors involved in GD and with respect to their potential for relapse prediction from markers before and after treatment. This information was used to design a mathematical model integrating thyroid hormone parameters, thyroid size, antibody titers and a complex algorithm encompassing genetic predisposition, environmental exposures and current immune activity in order to arrive at a prognostic index for relapse risk after treatment. In the search for a tool to analyze and predict relapse in GD mathematical modeling is a promising approach. In analogy to mathematical modeling approaches in other diseases such as viral infections, we developed a differential equation model on the basis of published clinical trials in patients with GD. Although our model needs further evaluation to be applicable in a clinical context, it provides a perspective for an important contribution to a final statistical prediction model.

  5. Spatial distribution of unspecified chronic kidney disease in El Salvador by crop area cultivated and ambient temperature.

    PubMed

    VanDervort, Darcy R; López, Dina L; Orantes, Carlos M; Rodríguez, David S

    2014-04-01

    Chronic kidney disease of unknown etiology is occurring in various geographic areas worldwide. Cases lack typical risk factors associated with chronic kidney disease, such as diabetes and hypertension. It is epidemic in El Salvador, Central America, where it is diagnosed with increasing frequency in young, otherwise-healthy male farmworkers. Suspected causes include agrochemical use (especially in sugarcane fields), physical heat stress, and heavy metal exposure. To evaluate the geographic relationship between unspecified chronic kidney disease (unCKD) and nondiabetic chronic renal failure (ndESRD) hospital admissions in El Salvador with the proximity to cultivated crops and ambient temperatures. Data on unCKD and ndESRD were compared with environmental variables, crop area cultivated (indicator of agrochemical use) and high ambient temperatures. Using geographically weighted regression analysis, two model sets were created using reported municipal hospital admission rates are per thousand population for unCKD 2006-2010 and rates of ndESRD 2005-2010 [corrected]. These were assessed against local percent of land cultivated by crop (sugarcane, coffee, corn, cotton, sorghum, and beans) and mean maximum ambient temperature, with Moran's indices determining data clustering. Two-dimensional geographic models illustrated parameter spatial distribution. Bivariate geographically weighted regressions showed statistically significant correlations between percent area of sugarcane, corn, cotton, coffee, and bean cultivation, as well as mean maximum ambient temperature with both unCKD and ndESRD hospital admission rates. Percent area of sugarcane cultivation had greatest statistical weight (p ≤ 0.001; Rp2 = 0.77 for unCKD). The most statistically significant multivariate geographically weighted regression model for unCKD included percent area of sugarcane, cotton and corn cultivation (p ≤ 0.001; Rp2 = 0.80), while, for ndESRD, it included the percent area of sugarcane, corn, cotton and coffee cultivation (Rp2 = 0.52). Univariate unCKD and ndESRD Moran's I (0.20 and 0.33, respectively) indicated some degree of clustering. Ambient temperature did not improve multivariate geographically-weighted regression models for unCKD or ndESRD. Local bivariate Moran's indices with relatively high positive values and statistical significance (0.3-1.0, p ≤0.05) indicated positive clustering between unCKD hospital admission rates and percent area of sugarcane as well as cotton cultivation. The greatest positive response for clustering values did not consistently plot near the highest temperatures; there were some positive clusters in regions of lower temperatures. Clusters of ndESRD were also observed, some in areas of relatively low chronic kidney disease incidence in western El Salvador. High temperatures do not appear to strongly influence occurrence of unCKDu proxies. CKDu in El Salvador may arise from proximity to agriculture to which agrochemicals are applied, especially in sugarcane cultivation. The findings of this preliminary ecological study suggest that more research is needed to assess and quantify presence of specific agrochemicals in high-CKDu areas.

  6. Single-variant and multi-variant trend tests for genetic association with next-generation sequencing that are robust to sequencing error.

    PubMed

    Kim, Wonkuk; Londono, Douglas; Zhou, Lisheng; Xing, Jinchuan; Nato, Alejandro Q; Musolf, Anthony; Matise, Tara C; Finch, Stephen J; Gordon, Derek

    2012-01-01

    As with any new technology, next-generation sequencing (NGS) has potential advantages and potential challenges. One advantage is the identification of multiple causal variants for disease that might otherwise be missed by SNP-chip technology. One potential challenge is misclassification error (as with any emerging technology) and the issue of power loss due to multiple testing. Here, we develop an extension of the linear trend test for association that incorporates differential misclassification error and may be applied to any number of SNPs. We call the statistic the linear trend test allowing for error, applied to NGS, or LTTae,NGS. This statistic allows for differential misclassification. The observed data are phenotypes for unrelated cases and controls, coverage, and the number of putative causal variants for every individual at all SNPs. We simulate data considering multiple factors (disease mode of inheritance, genotype relative risk, causal variant frequency, sequence error rate in cases, sequence error rate in controls, number of loci, and others) and evaluate type I error rate and power for each vector of factor settings. We compare our results with two recently published NGS statistics. Also, we create a fictitious disease model based on downloaded 1000 Genomes data for 5 SNPs and 388 individuals, and apply our statistic to those data. We find that the LTTae,NGS maintains the correct type I error rate in all simulations (differential and non-differential error), while the other statistics show large inflation in type I error for lower coverage. Power for all three methods is approximately the same for all three statistics in the presence of non-differential error. Application of our statistic to the 1000 Genomes data suggests that, for the data downloaded, there is a 1.5% sequence misclassification rate over all SNPs. Finally, application of the multi-variant form of LTTae,NGS shows high power for a number of simulation settings, although it can have lower power than the corresponding single-variant simulation results, most probably due to our specification of multi-variant SNP correlation values. In conclusion, our LTTae,NGS addresses two key challenges with NGS disease studies; first, it allows for differential misclassification when computing the statistic; and second, it addresses the multiple-testing issue in that there is a multi-variant form of the statistic that has only one degree of freedom, and provides a single p value, no matter how many loci. Copyright © 2013 S. Karger AG, Basel.

  7. Single variant and multi-variant trend tests for genetic association with next generation sequencing that are robust to sequencing error

    PubMed Central

    Kim, Wonkuk; Londono, Douglas; Zhou, Lisheng; Xing, Jinchuan; Nato, Andrew; Musolf, Anthony; Matise, Tara C.; Finch, Stephen J.; Gordon, Derek

    2013-01-01

    As with any new technology, next generation sequencing (NGS) has potential advantages and potential challenges. One advantage is the identification of multiple causal variants for disease that might otherwise be missed by SNP-chip technology. One potential challenge is misclassification error (as with any emerging technology) and the issue of power loss due to multiple testing. Here, we develop an extension of the linear trend test for association that incorporates differential misclassification error and may be applied to any number of SNPs. We call the statistic the linear trend test allowing for error, applied to NGS, or LTTae,NGS. This statistic allows for differential misclassification. The observed data are phenotypes for unrelated cases and controls, coverage, and the number of putative causal variants for every individual at all SNPs. We simulate data considering multiple factors (disease mode of inheritance, genotype relative risk, causal variant frequency, sequence error rate in cases, sequence error rate in controls, number of loci, and others) and evaluate type I error rate and power for each vector of factor settings. We compare our results with two recently published NGS statistics. Also, we create a fictitious disease model, based on downloaded 1000 Genomes data for 5 SNPs and 388 individuals, and apply our statistic to that data. We find that the LTTae,NGS maintains the correct type I error rate in all simulations (differential and non-differential error), while the other statistics show large inflation in type I error for lower coverage. Power for all three methods is approximately the same for all three statistics in the presence of non-differential error. Application of our statistic to the 1000 Genomes data suggests that, for the data downloaded, there is a 1.5% sequence misclassification rate over all SNPs. Finally, application of the multi-variant form of LTTae,NGS shows high power for a number of simulation settings, although it can have lower power than the corresponding single variant simulation results, most probably due to our specification of multi-variant SNP correlation values. In conclusion, our LTTae,NGS addresses two key challenges with NGS disease studies; first, it allows for differential misclassification when computing the statistic; and second, it addresses the multiple-testing issue in that there is a multi-variant form of the statistic that has only one degree of freedom, and provides a single p-value, no matter how many loci. PMID:23594495

  8. graph-GPA: A graphical model for prioritizing GWAS results and investigating pleiotropic architecture.

    PubMed

    Chung, Dongjun; Kim, Hang J; Zhao, Hongyu

    2017-02-01

    Genome-wide association studies (GWAS) have identified tens of thousands of genetic variants associated with hundreds of phenotypes and diseases, which have provided clinical and medical benefits to patients with novel biomarkers and therapeutic targets. However, identification of risk variants associated with complex diseases remains challenging as they are often affected by many genetic variants with small or moderate effects. There has been accumulating evidence suggesting that different complex traits share common risk basis, namely pleiotropy. Recently, several statistical methods have been developed to improve statistical power to identify risk variants for complex traits through a joint analysis of multiple GWAS datasets by leveraging pleiotropy. While these methods were shown to improve statistical power for association mapping compared to separate analyses, they are still limited in the number of phenotypes that can be integrated. In order to address this challenge, in this paper, we propose a novel statistical framework, graph-GPA, to integrate a large number of GWAS datasets for multiple phenotypes using a hidden Markov random field approach. Application of graph-GPA to a joint analysis of GWAS datasets for 12 phenotypes shows that graph-GPA improves statistical power to identify risk variants compared to statistical methods based on smaller number of GWAS datasets. In addition, graph-GPA also promotes better understanding of genetic mechanisms shared among phenotypes, which can potentially be useful for the development of improved diagnosis and therapeutics. The R implementation of graph-GPA is currently available at https://dongjunchung.github.io/GGPA/.

  9. Is Going Beyond Rasch Analysis Necessary to Assess the Construct Validity of a Motor Function Scale?

    PubMed

    Guillot, Tiffanie; Roche, Sylvain; Rippert, Pascal; Hamroun, Dalil; Iwaz, Jean; Ecochard, René; Vuillerot, Carole

    2018-04-03

    To examine whether a Rasch analysis is sufficient to establish the construct validity of the Motor Function Measure (MFM) and discuss whether weighting the MFM item scores would improve the MFM construct validity. Observational cross-sectional multicenter study. Twenty-three physical medicine departments, neurology departments, or reference centers for neuromuscular diseases. Patients (N=911) aged 6 to 60 years with Charcot-Marie-Tooth disease (CMT), facioscapulohumeral dystrophy (FSHD), or myotonic dystrophy type 1 (DM1). None. Comparison of the goodness-of-fit of the confirmatory factor analysis (CFA) model vs that of a modified multidimensional Rasch model on MFM item scores in each considered disease. The CFA model showed good fit to the data and significantly better goodness of fit than the modified multidimensional Rasch model regardless of the disease (P<.001). Statistically significant differences in item standardized factor loadings were found between DM1, CMT, and FSHD in only 6 of 32 items (items 6, 27, 2, 7, 9 and 17). For multidimensional scales designed to measure patient abilities in various diseases, a Rasch analysis might not be the most convenient, whereas a CFA is able to establish the scale construct validity and provide weights to adapt the item scores to a specific disease. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

  10. Sparse SPM: Group Sparse-dictionary learning in SPM framework for resting-state functional connectivity MRI analysis.

    PubMed

    Lee, Young-Beom; Lee, Jeonghyeon; Tak, Sungho; Lee, Kangjoo; Na, Duk L; Seo, Sang Won; Jeong, Yong; Ye, Jong Chul

    2016-01-15

    Recent studies of functional connectivity MR imaging have revealed that the default-mode network activity is disrupted in diseases such as Alzheimer's disease (AD). However, there is not yet a consensus on the preferred method for resting-state analysis. Because the brain is reported to have complex interconnected networks according to graph theoretical analysis, the independency assumption, as in the popular independent component analysis (ICA) approach, often does not hold. Here, rather than using the independency assumption, we present a new statistical parameter mapping (SPM)-type analysis method based on a sparse graph model where temporal dynamics at each voxel position are described as a sparse combination of global brain dynamics. In particular, a new concept of a spatially adaptive design matrix has been proposed to represent local connectivity that shares the same temporal dynamics. If we further assume that local network structures within a group are similar, the estimation problem of global and local dynamics can be solved using sparse dictionary learning for the concatenated temporal data across subjects. Moreover, under the homoscedasticity variance assumption across subjects and groups that is often used in SPM analysis, the aforementioned individual and group analyses using sparse dictionary learning can be accurately modeled by a mixed-effect model, which also facilitates a standard SPM-type group-level inference using summary statistics. Using an extensive resting fMRI data set obtained from normal, mild cognitive impairment (MCI), and Alzheimer's disease patient groups, we demonstrated that the changes in the default mode network extracted by the proposed method are more closely correlated with the progression of Alzheimer's disease. Copyright © 2015 Elsevier Inc. All rights reserved.

  11. Impact of a Post-Discharge Integrated Disease Management Program on COPD Hospital Readmissions.

    PubMed

    Russo, Ashlee N; Sathiyamoorthy, Gayathri; Lau, Chris; Saygin, Didem; Han, Xiaozhen; Wang, Xiao-Feng; Rice, Richard; Aboussouan, Loutfi S; Stoller, James K; Hatipoğlu, Umur

    2017-11-01

    Readmission following a hospitalization for COPD is associated with significant health-care expenditure. A multicomponent COPD post-discharge integrated disease management program was implemented at the Cleveland Clinic to improve the care of patients with COPD and reduce readmissions. This retrospective study reports our experience with the program. Groups of subjects who were exposed to different components of the program were compared regarding their readmission rates. Multivariate logistic regression analysis was performed to build predictive models for 30- and 90-d readmission. One hundred sixty subjects completed a 90-d follow-up, of which, 67 attended the exacerbation clinic, 16 subjects received care coordination, 51 subjects completed both, and 26 subjects did not participate in any component despite referral. Thirty- and 90-d readmission rates for the entire group were 18.1 and 46.2%, respectively. Thirty- and 90-d readmission rates for the individual groups were: exacerbation clinic, 11.9 and 35.8%; care coordination, 25.0 and 50.0%; both, 19.6 and 41.2%; and neither, 26.9 and 80.8%, respectively. The model with the best predictive ability for 30-d readmission risk included the number of hospitalizations within the previous year and use of noninvasive ventilation (C statistic of 0.84). The model for 90-d readmission risk included receiving any component of the post-discharge integrated disease management program, the number of hospitalizations, and primary care physician visits within the previous year (C statistic of 0.87). Receiving any component of a post-discharge integrated disease management program was associated with reduced 90-d readmission rate. Previous health-care utilization and lung function impairment were strong predictors of readmission. Copyright © 2017 by Daedalus Enterprises.

  12. External Validation of a Tool Predicting 7-Year Risk of Developing Cardiovascular Disease, Type 2 Diabetes or Chronic Kidney Disease.

    PubMed

    Rauh, Simone P; Rutters, Femke; van der Heijden, Amber A W A; Luimes, Thomas; Alssema, Marjan; Heymans, Martijn W; Magliano, Dianna J; Shaw, Jonathan E; Beulens, Joline W; Dekker, Jacqueline M

    2018-02-01

    Chronic cardiometabolic diseases, including cardiovascular disease (CVD), type 2 diabetes (T2D) and chronic kidney disease (CKD), share many modifiable risk factors and can be prevented using combined prevention programs. Valid risk prediction tools are needed to accurately identify individuals at risk. We aimed to validate a previously developed non-invasive risk prediction tool for predicting the combined 7-year-risk for chronic cardiometabolic diseases. The previously developed tool is stratified for sex and contains the predictors age, BMI, waist circumference, use of antihypertensives, smoking, family history of myocardial infarction/stroke, and family history of diabetes. This tool was externally validated, evaluating model performance using area under the receiver operating characteristic curve (AUC)-assessing discrimination-and Hosmer-Lemeshow goodness-of-fit (HL) statistics-assessing calibration. The intercept was recalibrated to improve calibration performance. The risk prediction tool was validated in 3544 participants from the Australian Diabetes, Obesity and Lifestyle Study (AusDiab). Discrimination was acceptable, with an AUC of 0.78 (95% CI 0.75-0.81) in men and 0.78 (95% CI 0.74-0.81) in women. Calibration was poor (HL statistic: p < 0.001), but improved considerably after intercept recalibration. Examination of individual outcomes showed that in men, AUC was highest for CKD (0.85 [95% CI 0.78-0.91]) and lowest for T2D (0.69 [95% CI 0.65-0.74]). In women, AUC was highest for CVD (0.88 [95% CI 0.83-0.94)]) and lowest for T2D (0.71 [95% CI 0.66-0.75]). Validation of our previously developed tool showed robust discriminative performance across populations. Model recalibration is recommended to account for different disease rates. Our risk prediction tool can be useful in large-scale prevention programs for identifying those in need of further risk profiling because of their increased risk for chronic cardiometabolic diseases.

  13. Genome-wide expression profiling of five mouse models identifies similarities and differences with human psoriasis.

    PubMed

    Swindell, William R; Johnston, Andrew; Carbajal, Steve; Han, Gangwen; Wohn, Christian; Lu, Jun; Xing, Xianying; Nair, Rajan P; Voorhees, John J; Elder, James T; Wang, Xiao-Jing; Sano, Shigetoshi; Prens, Errol P; DiGiovanni, John; Pittelkow, Mark R; Ward, Nicole L; Gudjonsson, Johann E

    2011-04-04

    Development of a suitable mouse model would facilitate the investigation of pathomechanisms underlying human psoriasis and would also assist in development of therapeutic treatments. However, while many psoriasis mouse models have been proposed, no single model recapitulates all features of the human disease, and standardized validation criteria for psoriasis mouse models have not been widely applied. In this study, whole-genome transcriptional profiling is used to compare gene expression patterns manifested by human psoriatic skin lesions with those that occur in five psoriasis mouse models (K5-Tie2, imiquimod, K14-AREG, K5-Stat3C and K5-TGFbeta1). While the cutaneous gene expression profiles associated with each mouse phenotype exhibited statistically significant similarity to the expression profile of psoriasis in humans, each model displayed distinctive sets of similarities and differences in comparison to human psoriasis. For all five models, correspondence to the human disease was strong with respect to genes involved in epidermal development and keratinization. Immune and inflammation-associated gene expression, in contrast, was more variable between models as compared to the human disease. These findings support the value of all five models as research tools, each with identifiable areas of convergence to and divergence from the human disease. Additionally, the approach used in this paper provides an objective and quantitative method for evaluation of proposed mouse models of psoriasis, which can be strategically applied in future studies to score strengths of mouse phenotypes relative to specific aspects of human psoriasis.

  14. Impact of scale of aggregation on associations of cardiovascular hospitalization and socio-economic disadvantage

    PubMed Central

    2017-01-01

    Background There are numerous studies that show an increased incidence of cardiovascular disease with increasing levels of socio-economic disadvantage. Exposures that might influence the relationship include elements of the built environment and social systems that shape lifestyle risk behaviors. In Canberra (the Australian capital city) there has been a particular housing policy to create ‘mixed-tenure’ neighborhoods so that small pockets of disadvantage are surrounded by more affluent residences (known as a ‘salt-and-pepper’ pattern). This may contribute to a scatter of higher incidence rates in very small areas in this population that may be obscured if aggregated data are used. This study explored the effect of changing the scale of the spatial units used in small area disease modelling, aiming to understand the impact of this issue and the implications for local public health surveillance. Methods The residence location of hospitalized individuals were aggregated to two differently scaled area units. First, the Australian Bureau of Statistics Statistical Area 2 (SA2) which is normally used as the basis for deidentification and release of health data. Second, these data were aggregated to a smaller level: the Statistical Area 1 (SA1). Generalized Additive Models with penalized regression splines were used to assess the association of age-sex-standardized rates for cardiovascular disease hospital admissions with disadvantage. Results The relationships observed were different between the two types of spatial units. The SA1 level exposure-response curve for rates against the disadvantage index extended in a linear fashion above the midrange level, while that found at SA2-level suggested a curvilinear form with no evidence that rates increased with higher disadvantage beyond the midrange. Conclusion Our result supports findings of other work that has found disadvantage increases risk of cardiovascular disease. The shape of the curves suggest a difference in associations of cardiovascular disease rates with disadvantage scores between SA1 versus SA2. From these results it can be concluded that scale of analysis does influence the understanding of geographical patterns of socio-economic disadvantage and cardiovascular disease morbidity. Health surveillance and interventions in Canberra should take into account the impact of the scale of aggregation on the association between disadvantage and cardiovascular disease observed. PMID:29182618

  15. Predicting Rotator Cuff Tears Using Data Mining and Bayesian Likelihood Ratios

    PubMed Central

    Lu, Hsueh-Yi; Huang, Chen-Yuan; Su, Chwen-Tzeng; Lin, Chen-Chiang

    2014-01-01

    Objectives Rotator cuff tear is a common cause of shoulder diseases. Correct diagnosis of rotator cuff tears can save patients from further invasive, costly and painful tests. This study used predictive data mining and Bayesian theory to improve the accuracy of diagnosing rotator cuff tears by clinical examination alone. Methods In this retrospective study, 169 patients who had a preliminary diagnosis of rotator cuff tear on the basis of clinical evaluation followed by confirmatory MRI between 2007 and 2011 were identified. MRI was used as a reference standard to classify rotator cuff tears. The predictor variable was the clinical assessment results, which consisted of 16 attributes. This study employed 2 data mining methods (ANN and the decision tree) and a statistical method (logistic regression) to classify the rotator cuff diagnosis into “tear” and “no tear” groups. Likelihood ratio and Bayesian theory were applied to estimate the probability of rotator cuff tears based on the results of the prediction models. Results Our proposed data mining procedures outperformed the classic statistical method. The correction rate, sensitivity, specificity and area under the ROC curve of predicting a rotator cuff tear were statistical better in the ANN and decision tree models compared to logistic regression. Based on likelihood ratios derived from our prediction models, Fagan's nomogram could be constructed to assess the probability of a patient who has a rotator cuff tear using a pretest probability and a prediction result (tear or no tear). Conclusions Our predictive data mining models, combined with likelihood ratios and Bayesian theory, appear to be good tools to classify rotator cuff tears as well as determine the probability of the presence of the disease to enhance diagnostic decision making for rotator cuff tears. PMID:24733553

  16. Mathematical models used to inform study design or surveillance systems in infectious diseases: a systematic review.

    PubMed

    Herzog, Sereina A; Blaizot, Stéphanie; Hens, Niel

    2017-12-18

    Mathematical models offer the possibility to investigate the infectious disease dynamics over time and may help in informing design of studies. A systematic review was performed in order to determine to what extent mathematical models have been incorporated into the process of planning studies and hence inform study design for infectious diseases transmitted between humans and/or animals. We searched Ovid Medline and two trial registry platforms (Cochrane, WHO) using search terms related to infection, mathematical model, and study design from the earliest dates to October 2016. Eligible publications and registered trials included mathematical models (compartmental, individual-based, or Markov) which were described and used to inform the design of infectious disease studies. We extracted information about the investigated infection, population, model characteristics, and study design. We identified 28 unique publications but no registered trials. Focusing on compartmental and individual-based models we found 12 observational/surveillance studies and 11 clinical trials. Infections studied were equally animal and human infectious diseases for the observational/surveillance studies, while all but one between humans for clinical trials. The mathematical models were used to inform, amongst other things, the required sample size (n = 16), the statistical power (n = 9), the frequency at which samples should be taken (n = 6), and from whom (n = 6). Despite the fact that mathematical models have been advocated to be used at the planning stage of studies or surveillance systems, they are used scarcely. With only one exception, the publications described theoretical studies, hence, not being utilised in real studies.

  17. Extension of the lod score: the mod score.

    PubMed

    Clerget-Darpoux, F

    2001-01-01

    In 1955 Morton proposed the lod score method both for testing linkage between loci and for estimating the recombination fraction between them. If a disease is controlled by a gene at one of these loci, the lod score computation requires the prior specification of an underlying model that assigns the probabilities of genotypes from the observed phenotypes. To address the case of linkage studies for diseases with unknown mode of inheritance, we suggested (Clerget-Darpoux et al., 1986) extending the lod score function to a so-called mod score function. In this function, the variables are both the recombination fraction and the disease model parameters. Maximizing the mod score function over all these parameters amounts to maximizing the probability of marker data conditional on the disease status. Under the absence of linkage, the mod score conforms to a chi-square distribution, with extra degrees of freedom in comparison to the lod score function (MacLean et al., 1993). The mod score is asymptotically maximum for the true disease model (Clerget-Darpoux and Bonaïti-Pellié, 1992; Hodge and Elston, 1994). Consequently, the power to detect linkage through mod score will be highest when the space of models where the maximization is performed includes the true model. On the other hand, one must avoid overparametrization of the model space. For example, when the approach is applied to affected sibpairs, only two constrained disease model parameters should be used (Knapp et al., 1994) for the mod score maximization. It is also important to emphasize the existence of a strong correlation between the disease gene location and the disease model. Consequently, there is poor resolution of the location of the susceptibility locus when the disease model at this locus is unknown. Of course, this is true regardless of the statistics used. The mod score may also be applied in a candidate gene strategy to model the potential effect of this gene in the disease. Since, however, it ignores the information provided both by disease segregation and by linkage disequilibrium between the marker alleles and the functional disease alleles, its power of discrimination between genetic models is weak. The MASC method (Clerget-Darpoux et al., 1988) has been designed to address more efficiently the objectives of a candidate gene approach.

  18. Omnibus Risk Assessment via Accelerated Failure Time Kernel Machine Modeling

    PubMed Central

    Sinnott, Jennifer A.; Cai, Tianxi

    2013-01-01

    Summary Integrating genomic information with traditional clinical risk factors to improve the prediction of disease outcomes could profoundly change the practice of medicine. However, the large number of potential markers and possible complexity of the relationship between markers and disease make it difficult to construct accurate risk prediction models. Standard approaches for identifying important markers often rely on marginal associations or linearity assumptions and may not capture non-linear or interactive effects. In recent years, much work has been done to group genes into pathways and networks. Integrating such biological knowledge into statistical learning could potentially improve model interpretability and reliability. One effective approach is to employ a kernel machine (KM) framework, which can capture nonlinear effects if nonlinear kernels are used (Scholkopf and Smola, 2002; Liu et al., 2007, 2008). For survival outcomes, KM regression modeling and testing procedures have been derived under a proportional hazards (PH) assumption (Li and Luan, 2003; Cai et al., 2011). In this paper, we derive testing and prediction methods for KM regression under the accelerated failure time model, a useful alternative to the PH model. We approximate the null distribution of our test statistic using resampling procedures. When multiple kernels are of potential interest, it may be unclear in advance which kernel to use for testing and estimation. We propose a robust Omnibus Test that combines information across kernels, and an approach for selecting the best kernel for estimation. The methods are illustrated with an application in breast cancer. PMID:24328713

  19. Hormone replacement therapy is associated with gastro-oesophageal reflux disease: a retrospective cohort study.

    PubMed

    Close, Helen; Mason, James M; Wilson, Douglas; Hungin, A Pali S

    2012-05-29

    Oestrogen and progestogen have the potential to influence gastro-intestinal motility; both are key components of hormone replacement therapy (HRT). Results of observational studies in women taking HRT rely on self-reporting of gastro-oesophageal symptoms and the aetiology of gastro-oesophageal reflux disease (GORD) remains unclear. This study investigated the association between HRT and GORD in menopausal women using validated general practice records. 51,182 menopausal women were identified using the UK General Practice Research Database between 1995-2004. Of these, 8,831 were matched with and without hormone use. Odds ratios (ORs) were calculated for GORD and proton-pump inhibitor (PPI) use in hormone and non-hormone users, adjusting for age, co-morbidities, and co-pharmacy. In unadjusted analysis, all forms of hormone use (oestrogen-only, tibolone, combined HRT and progestogen) were statistically significantly associated with GORD. In adjusted models, this association remained statistically significant for oestrogen-only treatment (OR 1.49; 1.18-1.89). Unadjusted analysis showed a statistically significant association between PPI use and oestrogen-only and combined HRT treatment. When adjusted for covariates, oestrogen-only treatment was significant (OR 1.34; 95% CI 1.03-1.74). Findings from the adjusted model demonstrated the greater use of PPI by progestogen users (OR 1.50; 1.01-2.22). This first large cohort study of the association between GORD and HRT found a statistically significant association between oestrogen-only hormone and GORD and PPI use. This should be further investigated using prospective follow-up to validate the strength of association and describe its clinical significance.

  20. Mining a clinical data warehouse to discover disease-finding associations using co-occurrence statistics.

    PubMed

    Cao, Hui; Markatou, Marianthi; Melton, Genevieve B; Chiang, Michael F; Hripcsak, George

    2005-01-01

    This paper applies co-occurrence statistics to discover disease-finding associations in a clinical data warehouse. We used two methods, chi2 statistics and the proportion confidence interval (PCI) method, to measure the dependence of pairs of diseases and findings, and then used heuristic cutoff values for association selection. An intrinsic evaluation showed that 94 percent of disease-finding associations obtained by chi2 statistics and 76.8 percent obtained by the PCI method were true associations. The selected associations were used to construct knowledge bases of disease-finding relations (KB-chi2, KB-PCI). An extrinsic evaluation showed that both KB-chi2 and KB-PCI could assist in eliminating clinically non-informative and redundant findings from problem lists generated by our automated problem list summarization system.

  1. Species distribution models: A comparison of statistical approaches for livestock and disease epidemics.

    PubMed

    Hollings, Tracey; Robinson, Andrew; van Andel, Mary; Jewell, Chris; Burgman, Mark

    2017-01-01

    In livestock industries, reliable up-to-date spatial distribution and abundance records for animals and farms are critical for governments to manage and respond to risks. Yet few, if any, countries can afford to maintain comprehensive, up-to-date agricultural census data. Statistical modelling can be used as a proxy for such data but comparative modelling studies have rarely been undertaken for livestock populations. Widespread species, including livestock, can be difficult to model effectively due to complex spatial distributions that do not respond predictably to environmental gradients. We assessed three machine learning species distribution models (SDM) for their capacity to estimate national-level farm animal population numbers within property boundaries: boosted regression trees (BRT), random forests (RF) and K-nearest neighbour (K-NN). The models were built from a commercial livestock database and environmental and socio-economic predictor data for New Zealand. We used two spatial data stratifications to test (i) support for decision making in an emergency response situation, and (ii) the ability for the models to predict to new geographic regions. The performance of the three model types varied substantially, but the best performing models showed very high accuracy. BRTs had the best performance overall, but RF performed equally well or better in many simulations; RFs were superior at predicting livestock numbers for all but very large commercial farms. K-NN performed poorly relative to both RF and BRT in all simulations. The predictions of both multi species and single species models for farms and within hypothetical quarantine zones were very close to observed data. These models are generally applicable for livestock estimation with broad applications in disease risk modelling, biosecurity, policy and planning.

  2. Species distribution models: A comparison of statistical approaches for livestock and disease epidemics

    PubMed Central

    Robinson, Andrew; van Andel, Mary; Jewell, Chris; Burgman, Mark

    2017-01-01

    In livestock industries, reliable up-to-date spatial distribution and abundance records for animals and farms are critical for governments to manage and respond to risks. Yet few, if any, countries can afford to maintain comprehensive, up-to-date agricultural census data. Statistical modelling can be used as a proxy for such data but comparative modelling studies have rarely been undertaken for livestock populations. Widespread species, including livestock, can be difficult to model effectively due to complex spatial distributions that do not respond predictably to environmental gradients. We assessed three machine learning species distribution models (SDM) for their capacity to estimate national-level farm animal population numbers within property boundaries: boosted regression trees (BRT), random forests (RF) and K-nearest neighbour (K-NN). The models were built from a commercial livestock database and environmental and socio-economic predictor data for New Zealand. We used two spatial data stratifications to test (i) support for decision making in an emergency response situation, and (ii) the ability for the models to predict to new geographic regions. The performance of the three model types varied substantially, but the best performing models showed very high accuracy. BRTs had the best performance overall, but RF performed equally well or better in many simulations; RFs were superior at predicting livestock numbers for all but very large commercial farms. K-NN performed poorly relative to both RF and BRT in all simulations. The predictions of both multi species and single species models for farms and within hypothetical quarantine zones were very close to observed data. These models are generally applicable for livestock estimation with broad applications in disease risk modelling, biosecurity, policy and planning. PMID:28837685

  3. Hispanics/Latinos & Cardiovascular Disease: Statistical Fact Sheet

    MedlinePlus

    Statistical Fact Sheet 2013 Update Hispanics/Latinos & Cardiovascular Diseases Cardiovascular Disease (CVD) (ICD/10 codes I00-I99, Q20-Q28) (ICD/9 codes 390-459, 745-747)  Among Mexican-American adults age 20 ...

  4. Ultrasound criteria and guided fine-needle aspiration diagnostic yields in small animal peritoneal, mesenteric and omental disease.

    PubMed

    Feeney, Daniel A; Ober, Christopher P; Snyder, Laura A; Hill, Sara A; Jessen, Carl R

    2013-01-01

    Peritoneal, mesenteric, and omental diseases are important causes of morbidity and mortality in humans and animals, although information in the veterinary literature is limited. The purposes of this retrospective study were to determine whether objectively applied ultrasound interpretive criteria are statistically useful in differentiating among cytologically defined normal, inflammatory, and neoplastic peritoneal conditions in dogs and cats. A second goal was to determine the cytologically interpretable yield on ultrasound-guided, fine-needle sampling of peritoneal, mesenteric, or omental structures. Sonographic criteria agreed upon by the authors were retrospectively and independently applied by two radiologists to the available ultrasound images without knowledge of the cytologic diagnosis and statistically compared to the ultrasound-guided, fine-needle aspiration cytologic interpretations. A total of 72 dogs and 49 cats with abdominal peritoneal, mesenteric, or omental (peritoneal) surface or effusive disease and 17 dogs and 3 cats with no cytologic evidence of inflammation or neoplasia were included. The optimized, ultrasound criteria-based statistical model created independently for each radiologist yielded an equation-based diagnostic category placement accuracy of 63.2-69.9% across the two involved radiologists. Regional organ-associated masses or nodules as well as aggregated bowel and peritoneal thickening were more associated with peritoneal neoplasia whereas localized, severely complex fluid collections were more associated with inflammatory peritoneal disease. The cytologically interpretable yield for ultrasound-guided fine-needle sampling was 72.3% with no difference between species, making this a worthwhile clinical procedure. © 2013 Veterinary Radiology & Ultrasound.

  5. Forecasting incidence of dengue in Rajasthan, using time series analyses.

    PubMed

    Bhatnagar, Sunil; Lal, Vivek; Gupta, Shiv D; Gupta, Om P

    2012-01-01

    To develop a prediction model for dengue fever/dengue haemorrhagic fever (DF/DHF) using time series data over the past decade in Rajasthan and to forecast monthly DF/DHF incidence for 2011. Seasonal autoregressive integrated moving average (SARIMA) model was used for statistical modeling. During January 2001 to December 2010, the reported DF/DHF cases showed a cyclical pattern with seasonal variation. SARIMA (0,0,1) (0,1,1) 12 model had the lowest normalized Bayesian information criteria (BIC) of 9.426 and mean absolute percentage error (MAPE) of 263.361 and appeared to be the best model. The proportion of variance explained by the model was 54.3%. Adequacy of the model was established through Ljung-Box test (Q statistic 4.910 and P-value 0.996), which showed no significant correlation between residuals at different lag times. The forecast for the year 2011 showed a seasonal peak in the month of October with an estimated 546 cases. Application of SARIMA model may be useful for forecast of cases and impending outbreaks of DF/DHF and other infectious diseases, which exhibit seasonal pattern.

  6. Contemporary model for cardiovascular risk prediction in people with type 2 diabetes.

    PubMed

    Kengne, Andre Pascal; Patel, Anushka; Marre, Michel; Travert, Florence; Lievre, Michel; Zoungas, Sophia; Chalmers, John; Colagiuri, Stephen; Grobbee, Diederick E; Hamet, Pavel; Heller, Simon; Neal, Bruce; Woodward, Mark

    2011-06-01

    Existing cardiovascular risk prediction equations perform non-optimally in different populations with diabetes. Thus, there is a continuing need to develop new equations that will reliably estimate cardiovascular disease (CVD) risk and offer flexibility for adaptation in various settings. This report presents a contemporary model for predicting cardiovascular risk in people with type 2 diabetes mellitus. A 4.5-year follow-up of the Action in Diabetes and Vascular disease: preterax and diamicron-MR controlled evaluation (ADVANCE) cohort was used to estimate coefficients for significant predictors of CVD using Cox models. Similar Cox models were used to fit the 4-year risk of CVD in 7168 participants without previous CVD. The model's applicability was tested on the same sample and another dataset. A total of 473 major cardiovascular events were recorded during follow-up. Age at diagnosis, known duration of diabetes, sex, pulse pressure, treated hypertension, atrial fibrillation, retinopathy, HbA1c, urinary albumin/creatinine ratio and non-HDL cholesterol at baseline were significant predictors of cardiovascular events. The model developed using these predictors displayed an acceptable discrimination (c-statistic: 0.70) and good calibration during internal validation. The external applicability of the model was tested on an independent cohort of individuals with type 2 diabetes, where similar discrimination was demonstrated (c-statistic: 0.69). Major cardiovascular events in contemporary populations with type 2 diabetes can be predicted on the basis of routinely measured clinical and biological variables. The model presented here can be used to quantify risk and guide the intensity of treatment in people with diabetes.

  7. Antibody-dependent enhancement of severe dengue disease in humans*

    PubMed Central

    Katzelnick, Leah C.; Gresh, Lionel; Halloran, M. Elizabeth; Mercado, Juan Carlos; Kuan, Guillermina; Gordon, Aubree; Balmaseda, Angel; Harris, Eva

    2018-01-01

    For dengue viruses (DENV1-4), a specific range of antibody titer has been shown to enhance viral replication in vitro and severe disease in animal models. Although suspected, such antibody-dependent enhancement (ADE) of severe disease has not been shown to occur in humans. Using multiple statistical approaches to study a long-term pediatric cohort in Nicaragua, we show that risk of severe dengue disease is highest within a narrow range of pre-existing anti-DENV antibody titers. By contrast, we observe protection from all symptomatic dengue disease at high antibody titers. Thus, immune correlates of severe dengue must be evaluated separately from correlates of protection against symptomatic disease. These results have implications for studies of dengue pathogenesis and for vaccine development, because enhancement, not just lack of protection, is of concern. PMID:29097492

  8. Is it growing exponentially fast? -- Impact of assuming exponential growth for characterizing and forecasting epidemics with initial near-exponential growth dynamics.

    PubMed

    Chowell, Gerardo; Viboud, Cécile

    2016-10-01

    The increasing use of mathematical models for epidemic forecasting has highlighted the importance of designing models that capture the baseline transmission characteristics in order to generate reliable epidemic forecasts. Improved models for epidemic forecasting could be achieved by identifying signature features of epidemic growth, which could inform the design of models of disease spread and reveal important characteristics of the transmission process. In particular, it is often taken for granted that the early growth phase of different growth processes in nature follow early exponential growth dynamics. In the context of infectious disease spread, this assumption is often convenient to describe a transmission process with mass action kinetics using differential equations and generate analytic expressions and estimates of the reproduction number. In this article, we carry out a simulation study to illustrate the impact of incorrectly assuming an exponential-growth model to characterize the early phase (e.g., 3-5 disease generation intervals) of an infectious disease outbreak that follows near-exponential growth dynamics. Specifically, we assess the impact on: 1) goodness of fit, 2) bias on the growth parameter, and 3) the impact on short-term epidemic forecasts. Designing transmission models and statistical approaches that more flexibly capture the profile of epidemic growth could lead to enhanced model fit, improved estimates of key transmission parameters, and more realistic epidemic forecasts.

  9. Statistical Enrichment of Epigenetic States Around Triplet Repeats that Can Undergo Expansions

    PubMed Central

    Essebier, Alexandra; Vera Wolf, Patricia; Cao, Minh Duc; Carroll, Bernard J.; Balasubramanian, Sureshkumar; Bodén, Mikael

    2016-01-01

    More than 30 human genetic diseases are linked to tri-nucleotide repeat expansions. There is no known mechanism that explains repeat expansions in full, but changes in the epigenetic state of the associated locus has been implicated in the disease pathology for a growing number of examples. A comprehensive comparative analysis of the genomic features associated with diverse repeat expansions has been lacking. Here, in an effort to decipher the propensity of repeats to undergo expansion and result in a disease state, we determine the genomic coordinates of tri-nucleotide repeat tracts at base pair resolution and computationally establish epigenetic profiles around them. Using three complementary statistical tests, we reveal that several epigenetic states are enriched around repeats that are associated with disease, even in cells that do not harbor expansion, relative to a carefully stratified background. Analysis of over one hundred cell types reveals that epigenetic states generally tend to vary widely between genic regions and cell types. However, there is qualified consistency in the epigenetic signatures of repeats associated with disease suggesting that changes to the chromatin and the DNA around an expanding repeat locus are likely to be similar. These epigenetic signatures may be exploited further to develop models that could explain the propensity of repeats to undergo expansions. PMID:27013954

  10. Effects of noise on a computational model for disease states of mood disorders

    NASA Astrophysics Data System (ADS)

    Tobias Huber, Martin; Krieg, Jürgen-Christian; Braun, Hans Albert; Moss, Frank

    2000-03-01

    Nonlinear dynamics are currently proposed to explain the progressive course of recurrent mood disorders starting with isolated episodes and ending with accelerated irregular (``chaotic") mood fluctuations. Such a low-dimensional disease model is attractive because of its principal accordance with biological disease models, i.e. the kindling and biological rhythms model. However, most natural systems are nonlinear and noisy and several studies in the neuro- and physical sciences have demonstrated interesting cooperative behaviors arising from interacting random and deterministic dynamics. Here, we consider the effects of noise on a recent neurodynamical model for the timecourse of affective disorders (Huber et al.: Biological Psychiatry 1999;46:256-262). We describe noise effects on temporal patterns and mean episode frequencies of various in computo disease states. Our simulations demonstrate that noise can cause unstructured randomness or can maximize periodic order. The frequency of episode occurence can increase with noise but it can also remain unaffected or even can decrease. We show further that noise can make visible bifurcations before they would normally occur under deterministic conditions and we quantify this behavior with a recently developed statistical method. All these effects depend critically on both, the dynamic state and the noise intensity. Implications for neurobiology and course of mood disorders are discussed.

  11. The Burden of Gastroesophageal Reflux Disease on Patients' Daily Lives: A Cross-Sectional Study Conducted in a Primary Care Setting in Serbia.

    PubMed

    Bjelović, Miloš; Babić, Tamara; Dragicević, Igor; Corac, Aleksandar; Goran Trajković

    2015-01-01

    Recent data from the studies conducted in the Western countries have proved that patients with gastroesophageal reflux disease have significantly impaired health-related quality of life compared to general population. The study is aimed at evaluating the burden of reflux symptoms on patients'health-related quality of life. The study involved 1,593 patients with diagnosed gastroesophageal reflux disease.The Serbian version of a generic self-administered Centers for Disease Control and Prevention questionnaire was used. Statistical analyses included descriptive statistics, Pearson chi-square test and a multiple regression model. Among all participants, 43.9% reported fair or poor health. Mean value of unhealthy days during the past 30 days was 10.4 days, physically unhealthy days 6.4 days, mentally unhealthy days 5.3 days and activity limitation days 4.3 days. Furthermore, 24.8% participants reported having ≥ 14 unhealthy days, 14.9% had 14 physically unhealthy days, 11.8% reported 14 mentally unhealthy days, and 9.4% had ≥ 14 activity limitation days. This study addressed complex relationships between reflux symptoms and patients'impaired everyday lives.

  12. The increasing financial impact of chronic kidney disease in australia.

    PubMed

    Tucker, Patrick S; Kingsley, Michael I; Morton, R Hugh; Scanlan, Aaron T; Dalbo, Vincent J

    2014-01-01

    The aim of this investigation was to determine and compare current and projected expenditure associated with chronic kidney disease (CKD), renal replacement therapy (RRT), and cardiovascular disease (CVD) in Australia. Data published by Australia and New Zealand Dialysis and Transplant Registry, Australian Institute of Health and Welfare, and World Bank were used to compare CKD-, RRT-, and CVD-related expenditure and prevalence rates. Prevalence and expenditure predictions were made using a linear regression model. Direct statistical comparisons of rates of annual increase utilised indicator variables in combined regressions. Statistical significance was set at P < 0.05. Dollar amounts were adjusted for inflation prior to analysis. Between 2012 and 2020, prevalence, per-patient expenditure, and total disease expenditure associated with CKD and RRT are estimated to increase significantly more rapidly than CVD. RRT prevalence is estimated to increase by 29%, compared to 7% in CVD. Average annual RRT per-patient expenditure is estimated to increase by 16%, compared to 8% in CVD. Total CKD- and RRT-related expenditure had been estimated to increase by 37%, compared to 14% in CVD. Per-patient, CKD produces a considerably greater financial impact on Australia's healthcare system, compared to CVD. Research focusing on novel preventative/therapeutic interventions is warranted.

  13. A threshold method for immunological correlates of protection

    PubMed Central

    2013-01-01

    Background Immunological correlates of protection are biological markers such as disease-specific antibodies which correlate with protection against disease and which are measurable with immunological assays. It is common in vaccine research and in setting immunization policy to rely on threshold values for the correlate where the accepted threshold differentiates between individuals who are considered to be protected against disease and those who are susceptible. Examples where thresholds are used include development of a new generation 13-valent pneumococcal conjugate vaccine which was required in clinical trials to meet accepted thresholds for the older 7-valent vaccine, and public health decision making on vaccination policy based on long-term maintenance of protective thresholds for Hepatitis A, rubella, measles, Japanese encephalitis and others. Despite widespread use of such thresholds in vaccine policy and research, few statistical approaches have been formally developed which specifically incorporate a threshold parameter in order to estimate the value of the protective threshold from data. Methods We propose a 3-parameter statistical model called the a:b model which incorporates parameters for a threshold and constant but different infection probabilities below and above the threshold estimated using profile likelihood or least squares methods. Evaluation of the estimated threshold can be performed by a significance test for the existence of a threshold using a modified likelihood ratio test which follows a chi-squared distribution with 3 degrees of freedom, and confidence intervals for the threshold can be obtained by bootstrapping. The model also permits assessment of relative risk of infection in patients achieving the threshold or not. Goodness-of-fit of the a:b model may be assessed using the Hosmer-Lemeshow approach. The model is applied to 15 datasets from published clinical trials on pertussis, respiratory syncytial virus and varicella. Results Highly significant thresholds with p-values less than 0.01 were found for 13 of the 15 datasets. Considerable variability was seen in the widths of confidence intervals. Relative risks indicated around 70% or better protection in 11 datasets and relevance of the estimated threshold to imply strong protection. Goodness-of-fit was generally acceptable. Conclusions The a:b model offers a formal statistical method of estimation of thresholds differentiating susceptible from protected individuals which has previously depended on putative statements based on visual inspection of data. PMID:23448322

  14. A method for modeling co-occurrence propensity of clinical codes with application to ICD-10-PCS auto-coding.

    PubMed

    Subotin, Michael; Davis, Anthony R

    2016-09-01

    Natural language processing methods for medical auto-coding, or automatic generation of medical billing codes from electronic health records, generally assign each code independently of the others. They may thus assign codes for closely related procedures or diagnoses to the same document, even when they do not tend to occur together in practice, simply because the right choice can be difficult to infer from the clinical narrative. We propose a method that injects awareness of the propensities for code co-occurrence into this process. First, a model is trained to estimate the conditional probability that one code is assigned by a human coder, given than another code is known to have been assigned to the same document. Then, at runtime, an iterative algorithm is used to apply this model to the output of an existing statistical auto-coder to modify the confidence scores of the codes. We tested this method in combination with a primary auto-coder for International Statistical Classification of Diseases-10 procedure codes, achieving a 12% relative improvement in F-score over the primary auto-coder baseline. The proposed method can be used, with appropriate features, in combination with any auto-coder that generates codes with different levels of confidence. The promising results obtained for International Statistical Classification of Diseases-10 procedure codes suggest that the proposed method may have wider applications in auto-coding. © The Author 2016. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  15. Performance of Disease-Specific Scoring Models in Intensive Care Patients with Severe Liver Diseases.

    PubMed

    El-Ghannam, Maged T; Hassanien, Moataz H; El-Talkawy, Mohamed D; Saleem, Abdel Aziz A; Sabry, Amal I; Abu Taleb, Hoda M

    2017-06-01

    Egypt has the highest prevalence of Hepatitis C Virus (HCV) in the world, estimated nationally at 14.7%. HCV treatment consumes 20% ($80 million) of Egypt's annual health budget. Outcomes of cirrhotic patients admitted to the ICU may, in fact, largely depend on differences in the state of the disease, criteria and indications for admission, resource utilization, and intensity of treatment. The aim of the present study was to evaluate the efficacy of liver specific scoring models in predicting the outcome of critically ill cirrhotic patients in the ICU as it may help in prioritization of high risk patients and preservation of ICU resources. Over one year, a total of 777 patients with End Stage Liver Disease (ESLD) due to HCV infection were included in this retrospective non-randomized human study. All statistical analyses were performed by the statistical software SPSS version 22.0 (SPSS, Chicago, IL, USA). Child Turcotte Pugh (CTP) score, MELD score, MELD-Na, MESO, iMELD, Refit MELD and Refit MELD-Na were calculated on ICU admission. ICU admission was mainly due to Gastrointestinal (GI) bleeding and Hepatic Encephalopathy (HE). Overall mortality was 27%. Age and sex showed no statistical difference between survivors and non survivors. Significantly higher mean values were observed for all models among individuals who died compared to survivors. MELD-Na was the most specific compared to the other scores. MELD-Na was highly predictive of mortality at an optimized cut-off value of 20.4 (AURC=0.789±0.03-CI 95%=0.711-0.865) while original MELD was highly predictive of mortality at an optimized cut-off value of 17.4 (AURC=0.678±0.01-CI 95%=0.613-0.682) denoting the importance of adding serum sodium to the original MELD. INR, serum creatinine, bilirubin, white blood cells count and hyponatremia were significantly higher in non survivors compared to survivors, while hypoalbuminemia showed no statistical difference. The advent of Hepatorenal Syndrome (HRS) and Spontaneous Bacterial Peritonitis (SBP) carried worse prognosis. Hyponatremia and number of transfused blood bags were additional independent predictors of mortality. In cirrhosis of liver, due to HCV infection, patients who died during their ICU stay displayed significantly higher values on all prognostic scores at admission. The addition of sodium to MELD score greatly improves the predictive accuracy of mortality. MELD-Na showed the highest predictive value of all scores.

  16. Two-stage Bayesian model to evaluate the effect of air pollution on chronic respiratory diseases using drug prescriptions.

    PubMed

    Blangiardo, Marta; Finazzi, Francesco; Cameletti, Michela

    2016-08-01

    Exposure to high levels of air pollutant concentration is known to be associated with respiratory problems which can translate into higher morbidity and mortality rates. The link between air pollution and population health has mainly been assessed considering air quality and hospitalisation or mortality data. However, this approach limits the analysis to individuals characterised by severe conditions. In this paper we evaluate the link between air pollution and respiratory diseases using general practice drug prescriptions for chronic respiratory diseases, which allow to draw conclusions based on the general population. We propose a two-stage statistical approach: in the first stage we specify a space-time model to estimate the monthly NO2 concentration integrating several data sources characterised by different spatio-temporal resolution; in the second stage we link the concentration to the β2-agonists prescribed monthly by general practices in England and we model the prescription rates through a small area approach. Copyright © 2016 Elsevier Ltd. All rights reserved.

  17. Gene-environment studies: any advantage over environmental studies?

    PubMed

    Bermejo, Justo Lorenzo; Hemminki, Kari

    2007-07-01

    Gene-environment studies have been motivated by the likely existence of prevalent low-risk genes that interact with common environmental exposures. The present study assessed the statistical advantage of the simultaneous consideration of genes and environment to investigate the effect of environmental risk factors on disease. In particular, we contemplated the possibility that several genes modulate the environmental effect. Environmental exposures, genotypes and phenotypes were simulated according to a wide range of parameter settings. Different models of gene-gene-environment interaction were considered. For each parameter combination, we estimated the probability of detecting the main environmental effect, the power to identify the gene-environment interaction and the frequency of environmentally affected individuals at which environmental and gene-environment studies show the same statistical power. The proportion of cases in the population attributable to the modeled risk factors was also calculated. Our data indicate that environmental exposures with weak effects may account for a significant proportion of the population prevalence of the disease. A general result was that, if the environmental effect was restricted to rare genotypes, the power to detect the gene-environment interaction was higher than the power to identify the main environmental effect. In other words, when few individuals contribute to the overall environmental effect, individual contributions are large and result in easily identifiable gene-environment interactions. Moreover, when multiple genes interacted with the environment, the statistical benefit of gene-environment studies was limited to those studies that included major contributors to the gene-environment interaction. The advantage of gene-environment over plain environmental studies also depends on the inheritance mode of the involved genes, on the study design and, to some extend, on the disease prevalence.

  18. Numeric, Agent-based or System Dynamics Model? Which Modeling Approach is the Best for Vast Population Simulation?

    PubMed

    Cimler, Richard; Tomaskova, Hana; Kuhnova, Jitka; Dolezal, Ondrej; Pscheidl, Pavel; Kuca, Kamil

    2018-01-01

    Alzheimer's disease is one of the most common mental illnesses. It is posited that more than 25% of the population is affected by some mental disease during their lifetime. Treatment of each patient draws resources from the economy concerned. Therefore, it is important to quantify the potential economic impact. Agent-based, system dynamics and numerical approaches to dynamic modeling of the population of the European Union and its patients with Alzheimer's disease are presented in this article. Simulations, their characteristics, and the results from different modeling tools are compared. The results of these approaches are compared with EU population growth predictions from the statistical office of the EU by Eurostat. The methodology of a creation of the models is described and all three modeling approaches are compared. The suitability of each modeling approach for the population modeling is discussed. In this case study, all three approaches gave us the results corresponding with the EU population prediction. Moreover, we were able to predict the number of patients with AD and, based on the modeling method, we were also able to monitor different characteristics of the population. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.

  19. Disease Mapping for Stomach Cancer in Libya Based on Besag– York– Mollié (BYM) Model

    PubMed

    Alhdiri, Maryam Ahmed Salem; Samat, Nor Azah; Mohamed, Zulkifley

    2017-06-25

    Globally, Cancer is the ever-increasing health problem and most common cause of medical deaths. In Libya, it is an important health concern, especially in the setting of an aging population and limited healthcare facilities. Therefore, the goal of this research is to map of the county’ cancer incidence rate using the Bayesian method and identify the high-risk regions (for the first time in a decade). In the field of disease mapping, very little has been done to address the issue of analyzing sparse cancer diseases in Libya. Standardized Morbidity Ratio or SMR is known as a traditional approach to measure the relative risk of the disease, which is the ratio of observed and expected number of accounts in a region that has the greatest uncertainty if the disease is rare or small geographical region. Therefore, to solve some of SMR’s problems, we used statistical smoothing or Bayesian models to estimate the relative risk for stomach cancer incidence in Libya in 2007 based on the BYM model. This research begins with a short offer of the SMR and Bayesian model with BYM model, which we applied to stomach cancer incidence in Libya. We compared all of the results using maps and tables. We found that BYM model is potentially beneficial, because it gives better relative risk estimates compared to SMR method. As well as, it has can overcome the classical method problem when there is no observed stomach cancer in a region. Creative Commons Attribution License

  20. Coupled disease-behavior dynamics on complex networks: A review

    NASA Astrophysics Data System (ADS)

    Wang, Zhen; Andrews, Michael A.; Wu, Zhi-Xi; Wang, Lin; Bauch, Chris T.

    2015-12-01

    It is increasingly recognized that a key component of successful infection control efforts is understanding the complex, two-way interaction between disease dynamics and human behavioral and social dynamics. Human behavior such as contact precautions and social distancing clearly influence disease prevalence, but disease prevalence can in turn alter human behavior, forming a coupled, nonlinear system. Moreover, in many cases, the spatial structure of the population cannot be ignored, such that social and behavioral processes and/or transmission of infection must be represented with complex networks. Research on studying coupled disease-behavior dynamics in complex networks in particular is growing rapidly, and frequently makes use of analysis methods and concepts from statistical physics. Here, we review some of the growing literature in this area. We contrast network-based approaches to homogeneous-mixing approaches, point out how their predictions differ, and describe the rich and often surprising behavior of disease-behavior dynamics on complex networks, and compare them to processes in statistical physics. We discuss how these models can capture the dynamics that characterize many real-world scenarios, thereby suggesting ways that policy makers can better design effective prevention strategies. We also describe the growing sources of digital data that are facilitating research in this area. Finally, we suggest pitfalls which might be faced by researchers in the field, and we suggest several ways in which the field could move forward in the coming years.

  1. Lung necrosis and neutrophils reflect common pathways of susceptibility to Mycobacterium tuberculosis in genetically diverse, immune-competent mice

    PubMed Central

    Niazi, Muhammad K. K.; Dhulekar, Nimit; Schmidt, Diane; Major, Samuel; Cooper, Rachel; Abeijon, Claudia; Gatti, Daniel M.; Kramnik, Igor; Yener, Bulent; Gurcan, Metin; Beamer, Gillian

    2015-01-01

    ABSTRACT Pulmonary tuberculosis (TB) is caused by Mycobacterium tuberculosis in susceptible humans. Here, we infected Diversity Outbred (DO) mice with ∼100 bacilli by aerosol to model responses in a highly heterogeneous population. Following infection, ‘supersusceptible’, ‘susceptible’ and ‘resistant’ phenotypes emerged. TB disease (reduced survival, weight loss, high bacterial load) correlated strongly with neutrophils, neutrophil chemokines, tumor necrosis factor (TNF) and cell death. By contrast, immune cytokines were weak correlates of disease. We next applied statistical and machine learning approaches to our dataset of cytokines and chemokines from lungs and blood. Six molecules from the lung: TNF, CXCL1, CXCL2, CXCL5, interferon-γ (IFN-γ), interleukin 12 (IL-12); and two molecules from blood – IL-2 and TNF – were identified as being important by applying both statistical and machine learning methods. Using molecular features to generate tree classifiers, CXCL1, CXCL2 and CXCL5 distinguished four classes (supersusceptible, susceptible, resistant and non-infected) from each other with approximately 77% accuracy using completely independent experimental data. By contrast, models based on other molecules were less accurate. Low to no IFN-γ, IL-12, IL-2 and IL-10 successfully discriminated non-infected mice from infected mice but failed to discriminate disease status amongst supersusceptible, susceptible and resistant M.-tuberculosis-infected DO mice. Additional analyses identified CXCL1 as a promising peripheral biomarker of disease and of CXCL1 production in the lungs. From these results, we conclude that: (1) DO mice respond variably to M. tuberculosis infection and will be useful to identify pathways involving necrosis and neutrophils; (2) data from DO mice is suited for machine learning methods to build, validate and test models with independent data based solely on molecular biomarkers; (3) low levels of immunological cytokines best indicate a lack of exposure to M. tuberculosis but cannot distinguish infection from disease. PMID:26204894

  2. Public Opinions Toward Diseases: Infodemiological Study on News Media Data.

    PubMed

    Huang, Ming; ElTayeby, Omar; Zolnoori, Maryam; Yao, Lixia

    2018-05-08

    Society always has limited resources to expend on health care, or anything else. What are the unmet medical needs? How do we allocate limited resources to maximize the health and welfare of the people? These challenging questions might be re-examined systematically within an infodemiological frame on a much larger scale, leveraging the latest advancement in information technology and data science. We expanded our previous work by investigating news media data to reveal the coverage of different diseases and medical conditions, together with their sentiments and topics in news articles over three decades. We were motivated to do so since news media plays a significant role in politics and affects the public policy making. We analyzed over 3.5 million archive news articles from Reuters media during the periods of 1996/1997, 2008 and 2016, using summary statistics, sentiment analysis, and topic modeling. Summary statistics illustrated the coverage of various diseases and medical conditions during the last 3 decades. Sentiment analysis and topic modeling helped us automatically detect the sentiments of news articles (ie, positive versus negative) and topics (ie, a series of keywords) associated with each disease over time. The percentages of news articles mentioning diseases and medical conditions were 0.44%, 0.57% and 0.81% in the three time periods, suggesting that news media or the public has gradually increased its interests in medicine since 1996. Certain diseases such as other malignant neoplasm (34%), other infectious diseases (20%), and influenza (11%) represented the most covered diseases. Two hundred and twenty-six diseases and medical conditions (97.8%) were found to have neutral or negative sentiments in the news articles. Using topic modeling, we identified meaningful topics on these diseases and medical conditions. For instance, the smoking theme appeared in the news articles on other malignant neoplasm only during 1996/1997. The topic phrases HIV and Zika virus were linked to other infectious diseases during 1996/1997 and 2016, respectively. The multi-dimensional analysis of news media data allows the discovery of focus, sentiments and topics of news media in terms of diseases and medical conditions. These infodemiological discoveries could shed light on unmet medical needs and research priorities for future and provide guidance for the decision making in public policy. ©Ming Huang, Omar ElTayeby, Maryam Zolnoori, Lixia Yao. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 08.05.2018.

  3. Environmental and socio-economic risk modelling for Chagas disease in Bolivia.

    PubMed

    Mischler, Paula; Kearney, Michael; McCarroll, Jennifer C; Scholte, Ronaldo G C; Vounatsou, Penelope; Malone, John B

    2012-09-01

    Accurately defining disease distributions and calculating disease risk is an important step in the control and prevention of diseases. Geographical information systems (GIS) and remote sensing technologies, with maximum entropy (Maxent) ecological niche modelling computer software, were used to create predictive risk maps for Chagas disease in Bolivia. Prevalence rates were calculated from 2007 to 2009 household infection survey data for Bolivia, while environmental data were compiled from the Worldclim database and MODIS satellite imagery. Socio-economic data were obtained from the Bolivian National Institute of Statistics. Disease models identified altitudes at 500-3,500 m above the mean sea level (MSL), low annual precipitation (45-250 mm), and higher diurnal range of temperature (10-19 °C; peak 16 °C) as compatible with the biological requirements of the insect vectors. Socio-economic analyses demonstrated the importance of improved housing materials and water source. Home adobe wall materials and having to fetch drinking water from rivers or wells without pump were found to be highly related to distribution of the disease by the receiver operator characteristic (ROC) area under the curve (AUC) (0.69 AUC, 0.67 AUC and 0.62 AUC, respectively), while areas with hardwood floors demonstrated a direct negative relationship (-0.71 AUC). This study demonstrates that Maxent modelling can be used in disease prevalence and incidence studies to provide governmental agencies with an easily learned, understandable method to define areas as either high, moderate or low risk for the disease. This information may be used in resource planning, targeting and implementation. However, access to high-resolution, sub-municipality socio-economic data (e.g. census tracts) would facilitate elucidation of the relative influence of poverty-related factors on regional disease dynamics.

  4. Mining a clinical data warehouse to discover disease-finding associations using co-occurrence statistics

    PubMed Central

    Cao, Hui; Markatou, Marianthi; Melton, Genevieve B.; Chiang, Michael F.; Hripcsak, George

    2005-01-01

    This paper applies co-occurrence statistics to discover disease-finding associations in a clinical data warehouse. We used two methods, χ2 statistics and the proportion confidence interval (PCI) method, to measure the dependence of pairs of diseases and findings, and then used heuristic cutoff values for association selection. An intrinsic evaluation showed that 94 percent of disease-finding associations obtained by χ2 statistics and 76.8 percent obtained by the PCI method were true associations. The selected associations were used to construct knowledge bases of disease-finding relations (KB-χ2, KB-PCI). An extrinsic evaluation showed that both KB-χ2 and KB-PCI could assist in eliminating clinically non-informative and redundant findings from problem lists generated by our automated problem list summarization system. PMID:16779011

  5. The Association of Statin Use with Age-Related Macular Degeneration Progression: The Age-Related Eye Disease Study 2 Report Number 9.

    PubMed

    Al-Holou, Shaza N; Tucker, William R; Agrón, Elvira; Clemons, Traci E; Cukras, Catherine; Ferris, Frederick L; Chew, Emily Y

    2015-12-01

    To evaluate the association of statin use with progression of age-related macular degeneration (AMD). Preplanned, prospective cohort study within a controlled clinical trial of oral supplementation for age-related eye diseases. Age-Related Eye Disease Study 2 (AREDS2) participants, aged 50 to 85 years. Factors, including age, gender, smoking status, aspirin use, and history of diabetes, hypertension, heart disease, angina, and stroke-all known to be associated with statin use-were included in a logistic regression model to estimate propensity scores for each participant. Age-adjusted proportional hazards regression models, with and without propensity score matching, were performed to evaluate the association of statin use with progression to late AMD. Analyses adjusting for the competing risk of death were also performed. Baseline and annual stereoscopic fundus photographs were assessed centrally by masked graders for the development of late AMD, either neovascular AMD or geographic atrophy (GA). Of the 3791 participants (2462 with bilateral large drusen and 1329 with unilateral late AMD at baseline), 1659 (43.8%) were statin users. The overall analysis, with no matching of propensity scores and no adjustment for death as a competing risk, showed that statin use was not associated with progression to late AMD (hazard ratio [HR], 1.08; 95% confidence interval [CI], 0.83-1.41; P = 0.56). When matched for propensity scores and adjusted for death as a competing risk, the result was not statistically significant (HR, 0.81; 95% CI, 0.55-1.20; P = 0.29). Furthermore, subgroup analyses of persons with or without late AMD at baseline and the various components of late AMD (neovascular AMD, central GA, or any GA) also showed no statistically significant association of statin use with progression to AMD. Statin use was not statistically significantly associated with progression to late AMD in the AREDS2 participants, and these findings are consistent with findings in the majority of previous studies. Statins have been demonstrated to reduce the risk of cardiovascular disease, but our data do not provide evidence of a beneficial effect on slowing AMD progression. Published by Elsevier Inc.

  6. Comparing colon cancer outcomes: The impact of low hospital case volume and case-mix adjustment.

    PubMed

    Fischer, C; Lingsma, H F; van Leersum, N; Tollenaar, R A E M; Wouters, M W; Steyerberg, E W

    2015-08-01

    When comparing performance across hospitals it is essential to consider the noise caused by low hospital case volume and to perform adequate case-mix adjustment. We aimed to quantify the role of noise and case-mix adjustment on standardized postoperative mortality and anastomotic leakage (AL) rates. We studied 13,120 patients who underwent colon cancer resection in 85 Dutch hospitals. We addressed differences between hospitals in postoperative mortality and AL, using fixed (ignoring noise) and random effects (incorporating noise) logistic regression models with general and additional, disease specific, case-mix adjustment. Adding disease specific variables improved the performance of the case-mix adjustment models for postoperative mortality (c-statistic increased from 0.77 to 0.81). The overall variation in standardized mortality ratios was similar, but some individual hospitals changed considerably. For the standardized AL rates the performance of the adjustment models was poor (c-statistic 0.59 and 0.60) and overall variation was small. Most of the observed variation between hospitals was actually noise. Noise had a larger effect on hospital performance than extended case-mix adjustment, although some individual hospital outcome rates were affected by more detailed case-mix adjustment. To compare outcomes between hospitals it is crucial to consider noise due to low hospital case volume with a random effects model. Copyright © 2015 Elsevier Ltd. All rights reserved.

  7. A flexible spatial scan statistic with a restricted likelihood ratio for detecting disease clusters.

    PubMed

    Tango, Toshiro; Takahashi, Kunihiko

    2012-12-30

    Spatial scan statistics are widely used tools for detection of disease clusters. Especially, the circular spatial scan statistic proposed by Kulldorff (1997) has been utilized in a wide variety of epidemiological studies and disease surveillance. However, as it cannot detect noncircular, irregularly shaped clusters, many authors have proposed different spatial scan statistics, including the elliptic version of Kulldorff's scan statistic. The flexible spatial scan statistic proposed by Tango and Takahashi (2005) has also been used for detecting irregularly shaped clusters. However, this method sets a feasible limitation of a maximum of 30 nearest neighbors for searching candidate clusters because of heavy computational load. In this paper, we show a flexible spatial scan statistic implemented with a restricted likelihood ratio proposed by Tango (2008) to (1) eliminate the limitation of 30 nearest neighbors and (2) to have surprisingly much less computational time than the original flexible spatial scan statistic. As a side effect, it is shown to be able to detect clusters with any shape reasonably well as the relative risk of the cluster becomes large via Monte Carlo simulation. We illustrate the proposed spatial scan statistic with data on mortality from cerebrovascular disease in the Tokyo Metropolitan area, Japan. Copyright © 2012 John Wiley & Sons, Ltd.

  8. Modelling the Contributions of Malaria, HIV, Malnutrition and Rainfall to the Decline in Paediatric Invasive Non-typhoidal Salmonella Disease in Malawi.

    PubMed

    Feasey, Nicholas A; Everett, Dean; Faragher, E Brian; Roca-Feltrer, Arantxa; Kang'ombe, Arthur; Denis, Brigitte; Kerac, Marko; Molyneux, Elizabeth; Molyneux, Malcolm; Jahn, Andreas; Gordon, Melita A; Heyderman, Robert S

    2015-01-01

    Nontyphoidal Salmonellae (NTS) are responsible for a huge burden of bloodstream infection in Sub-Saharan African children. Recent reports of a decline in invasive NTS (iNTS) disease from Kenya and The Gambia have emphasised an association with malaria control. Following a similar decline in iNTS disease in Malawi, we have used 9 years of continuous longitudinal data to model the interrelationships between iNTS disease, malaria, HIV and malnutrition. Trends in monthly numbers of childhood iNTS disease presenting at Queen's Hospital, Blantyre, Malawi from 2002 to 2010 were reviewed in the context of longitudinal monthly data describing malaria slide-positivity among paediatric febrile admissions, paediatric HIV prevalence, nutritional rehabilitation unit admissions and monthly rainfall over the same 9 years, using structural equation models (SEM). Analysis of 3,105 iNTS episodes identified from 49,093 blood cultures, showed an 11.8% annual decline in iNTS (p < 0.001). SEM analysis produced a stable model with good fit, revealing direct and statistically significant seasonal effects of malaria and malnutrition on the prevalence of iNTS disease. When these data were smoothed to eliminate seasonal cyclic changes, these associations remained strong and there were additional significant effects of HIV prevalence. These data suggest that the overall decline in iNTS disease observed in Malawi is attributable to multiple public health interventions leading to reductions in malaria, HIV and acute malnutrition. Understanding the impacts of public health programmes on iNTS disease is essential to plan and evaluate interventions.

  9. A weighted U-statistic for genetic association analyses of sequencing data.

    PubMed

    Wei, Changshuai; Li, Ming; He, Zihuai; Vsevolozhskaya, Olga; Schaid, Daniel J; Lu, Qing

    2014-12-01

    With advancements in next-generation sequencing technology, a massive amount of sequencing data is generated, which offers a great opportunity to comprehensively investigate the role of rare variants in the genetic etiology of complex diseases. Nevertheless, the high-dimensional sequencing data poses a great challenge for statistical analysis. The association analyses based on traditional statistical methods suffer substantial power loss because of the low frequency of genetic variants and the extremely high dimensionality of the data. We developed a Weighted U Sequencing test, referred to as WU-SEQ, for the high-dimensional association analysis of sequencing data. Based on a nonparametric U-statistic, WU-SEQ makes no assumption of the underlying disease model and phenotype distribution, and can be applied to a variety of phenotypes. Through simulation studies and an empirical study, we showed that WU-SEQ outperformed a commonly used sequence kernel association test (SKAT) method when the underlying assumptions were violated (e.g., the phenotype followed a heavy-tailed distribution). Even when the assumptions were satisfied, WU-SEQ still attained comparable performance to SKAT. Finally, we applied WU-SEQ to sequencing data from the Dallas Heart Study (DHS), and detected an association between ANGPTL 4 and very low density lipoprotein cholesterol. © 2014 WILEY PERIODICALS, INC.

  10. On the difficulty to delimit disease risk hot spots

    NASA Astrophysics Data System (ADS)

    Charras-Garrido, M.; Azizi, L.; Forbes, F.; Doyle, S.; Peyrard, N.; Abrial, D.

    2013-06-01

    Representing the health state of a region is a helpful tool to highlight spatial heterogeneity and localize high risk areas. For ease of interpretation and to determine where to apply control procedures, we need to clearly identify and delineate homogeneous regions in terms of disease risk, and in particular disease risk hot spots. However, even if practical purposes require the delineation of different risk classes, such a classification does not correspond to a reality and is thus difficult to estimate. Working with grouped data, a first natural choice is to apply disease mapping models. We apply a usual disease mapping model, producing continuous estimations of the risks that requires a post-processing classification step to obtain clearly delimited risk zones. We also apply a risk partition model that build a classification of the risk levels in a one step procedure. Working with point data, we will focus on the scan statistic clustering method. We illustrate our article with a real example concerning the bovin spongiform encephalopathy (BSE) an animal disease whose zones at risk are well known by the epidemiologists. We show that in this difficult case of a rare disease and a very heterogeneous population, the different methods provide risk zones that are globally coherent. But, related to the dichotomy between the need and the reality, the exact delimitation of the risk zones, as well as the corresponding estimated risks are quite different.

  11. A BAYESIAN SPATIAL AND TEMPORAL MODELING APPROACH TO MAPPING GEOGRAPHIC VARIATION IN MORTALITY RATES FOR SUBNATIONAL AREAS WITH R-INLA.

    PubMed

    Khana, Diba; Rossen, Lauren M; Hedegaard, Holly; Warner, Margaret

    2018-01-01

    Hierarchical Bayes models have been used in disease mapping to examine small scale geographic variation. State level geographic variation for less common causes of mortality outcomes have been reported however county level variation is rarely examined. Due to concerns about statistical reliability and confidentiality, county-level mortality rates based on fewer than 20 deaths are suppressed based on Division of Vital Statistics, National Center for Health Statistics (NCHS) statistical reliability criteria, precluding an examination of spatio-temporal variation in less common causes of mortality outcomes such as suicide rates (SRs) at the county level using direct estimates. Existing Bayesian spatio-temporal modeling strategies can be applied via Integrated Nested Laplace Approximation (INLA) in R to a large number of rare causes of mortality outcomes to enable examination of spatio-temporal variations on smaller geographic scales such as counties. This method allows examination of spatiotemporal variation across the entire U.S., even where the data are sparse. We used mortality data from 2005-2015 to explore spatiotemporal variation in SRs, as one particular application of the Bayesian spatio-temporal modeling strategy in R-INLA to predict year and county-specific SRs. Specifically, hierarchical Bayesian spatio-temporal models were implemented with spatially structured and unstructured random effects, correlated time effects, time varying confounders and space-time interaction terms in the software R-INLA, borrowing strength across both counties and years to produce smoothed county level SRs. Model-based estimates of SRs were mapped to explore geographic variation.

  12. Aggregation and Disaggregation of Senile Plaques in Alzheimer Disease

    NASA Astrophysics Data System (ADS)

    Cruz, L.; Urbanc, B.; Buldyrev, S. V.; Christie, R.; Gomez-Isla, T.; Havlin, S.; McNamara, M.; Stanley, H. E.; Hyman, B. T.

    1997-07-01

    We quantitatively analyzed, using laser scanning confocal microscopy, the three-dimensional structure of individual senile plaques in Alzheimer disease. We carried out the quantitative analysis using statistical methods to gain insights about the processes that govern Aβ peptide deposition. Our results show that plaques are complex porous structures with characteristic pore sizes. We interpret plaque morphology in the context of a new dynamical model based on competing aggregation and disaggregation processes in kinetic steady-state equilibrium with an additional diffusion process allowing Aβ deposits to diffuse over the surface of plaques.

  13. Generalized Linear Models of Home Activity for Automatic Detection of Mild Cognitive Impairment in Older Adults*

    PubMed Central

    Akl, Ahmad; Snoek, Jasper; Mihailidis, Alex

    2015-01-01

    With a globally aging population, the burden of care of cognitively impaired older adults is becoming increasingly concerning. Instances of Alzheimer’s disease and other forms of dementia are becoming ever more frequent. Earlier detection of cognitive impairment offers significant benefits, but remains difficult to do in practice. In this paper, we develop statistical models of the behavior of older adults within their homes using sensor data in order to detect the early onset of cognitive decline. Specifically, we use inhomogenous Poisson processes to model the presence of subjects within different rooms throughout the day in the home using unobtrusive sensing technologies. We compare the distributions learned from cognitively intact and impaired subjects using information theoretic tools and observe statistical differences between the two populations which we believe can be used to help detect the onset of cognitive decline. PMID:25570050

  14. Generalized Linear Models of home activity for automatic detection of mild cognitive impairment in older adults.

    PubMed

    Akl, Ahmad; Snoek, Jasper; Mihailidis, Alex

    2014-01-01

    With a globally aging population, the burden of care of cognitively impaired older adults is becoming increasingly concerning. Instances of Alzheimer's disease and other forms of dementia are becoming ever more frequent. Earlier detection of cognitive impairment offers significant benefits, but remains difficult to do in practice. In this paper, we develop statistical models of the behavior of older adults within their homes using sensor data in order to detect the early onset of cognitive decline. Specifically, we use inhomogenous Poisson processes to model the presence of subjects within different rooms throughout the day in the home using unobtrusive sensing technologies. We compare the distributions learned from cognitively intact and impaired subjects using information theoretic tools and observe statistical differences between the two populations which we believe can be used to help detect the onset of cognitive decline.

  15. Oral contraceptives and the risk of gallbladder disease: a comparative safety study

    PubMed Central

    Etminan, Mahyar; Delaney, Joseph A.C.; Bressler, Brian; Brophy, James M.

    2011-01-01

    Background Recent concerns have been raised about the risk of gallbladder disease associated with the use of drospirenone, a fourth-generation progestin used in oral contraceptives. We conducted a study to determine the magnitude of this risk compared with other formulations of oral contraceptives. Methods We conducted a retrospective cohort study using the IMS LifeLink Health Plan Claims Database. We included women who were using an oral contraceptive containing ethinyl estradiol combined with a progestin during 1997–2009. To be eligible, women had to have been taking the oral contraceptive continuously for at least six months. We computed adjusted rate ratios (RRs) for gallbladder disease using a Cox proportional hazards model. In the primary analysis, gallbladder disease was defined as cholecystectomy; in a secondary analysis, it was defined as hospital admission secondary to gallbladder disease. Results We included 2 721 014 women in the cohort, 27 087 of whom underwent surgical or laparoscopic cholecystectomy during the follow-up period. Compared with levonorgestrel, an older second-generation progestin, a small, statistically significant increase in the risk of gallbladder disease was associated with desogestrel (adjusted RR 1.05, 95% confidence interval [CI] 1.01–1.09), drospirenone (adjusted RR 1.20, 95% CI 1.16–1.26) and norethindrone (adjusted RR 1.10, 95% CI 1.06–1.14). No statistically significant increase in risk was associated with the other formulations of oral contraceptive (ethynodiol diacetate, norgestrel and norgestimate). Interpretation In a large cohort of women using oral contraceptives, we found a small, statistically significant increase in the risk of gallbladder disease associated with desogestrel, drospirenone and norethindrone compared with levonorgestrel. However, the small effect sizes compounded with the possibility of residual biases in this observational study make it unlikely that these differences are clinically significant. PMID:21502354

  16. Border screening vs. community level disease control for infectious diseases: Timing and effectiveness

    NASA Astrophysics Data System (ADS)

    Kim, Sehjeong; Chang, Dong Eui

    2017-06-01

    There have been many studies of the border screening using a simple math model or a statistical analysis to investigate the ineffectiveness of border screening during 2003 and 2009 pandemics. However, the use of border screening is still a controversial issue. It is due to focusing only on the functionality of border screening without considering the timing to use. In this paper, we attempt to qualitatively answer whether the use of border screening is a desirable action during a disease pandemic. Thus, a novel mathematical model with a transition probability of status change during flight and border screening is developed. A condition to check a timing of the border screening is established in terms of a lower bound of the basic reproduction number. If the lower bound is greater than one, which indicates a pandemic, then the border screening may not be effective and the disease persists. In this case, a community level control strategy should be conducted.

  17. Antibiotics in Animal Products

    NASA Astrophysics Data System (ADS)

    Falcão, Amílcar C.

    The administration of antibiotics to animals to prevent or treat diseases led us to be concerned about the impact of these antibiotics on human health. In fact, animal products could be a potential vehicle to transfer drugs to humans. Using appropri ated mathematical and statistical models, one can predict the kinetic profile of drugs and their metabolites and, consequently, develop preventive procedures regarding drug transmission (i.e., determination of appropriate withdrawal periods). Nevertheless, in the present chapter the mathematical and statistical concepts for data interpretation are strictly given to allow understanding of some basic pharma-cokinetic principles and to illustrate the determination of withdrawal periods

  18. Complex Adaptive System Models and the Genetic Analysis of Plasma HDL-Cholesterol Concentration

    PubMed Central

    Rea, Thomas J.; Brown, Christine M.; Sing, Charles F.

    2006-01-01

    Despite remarkable advances in diagnosis and therapy, ischemic heart disease (IHD) remains a leading cause of morbidity and mortality in industrialized countries. Recent efforts to estimate the influence of genetic variation on IHD risk have focused on predicting individual plasma high-density lipoprotein cholesterol (HDL-C) concentration. Plasma HDL-C concentration (mg/dl), a quantitative risk factor for IHD, has a complex multifactorial etiology that involves the actions of many genes. Single gene variations may be necessary but are not individually sufficient to predict a statistically significant increase in risk of disease. The complexity of phenotype-genotype-environment relationships involved in determining plasma HDL-C concentration has challenged commonly held assumptions about genetic causation and has led to the question of which combination of variations, in which subset of genes, in which environmental strata of a particular population significantly improves our ability to predict high or low risk phenotypes. We document the limitations of inferences from genetic research based on commonly accepted biological models, consider how evidence for real-world dynamical interactions between HDL-C determinants challenges the simplifying assumptions implicit in traditional linear statistical genetic models, and conclude by considering research options for evaluating the utility of genetic information in predicting traits with complex etiologies. PMID:17146134

  19. Role of Statistical Random-Effects Linear Models in Personalized Medicine

    PubMed Central

    Diaz, Francisco J; Yeh, Hung-Wen; de Leon, Jose

    2012-01-01

    Some empirical studies and recent developments in pharmacokinetic theory suggest that statistical random-effects linear models are valuable tools that allow describing simultaneously patient populations as a whole and patients as individuals. This remarkable characteristic indicates that these models may be useful in the development of personalized medicine, which aims at finding treatment regimes that are appropriate for particular patients, not just appropriate for the average patient. In fact, published developments show that random-effects linear models may provide a solid theoretical framework for drug dosage individualization in chronic diseases. In particular, individualized dosages computed with these models by means of an empirical Bayesian approach may produce better results than dosages computed with some methods routinely used in therapeutic drug monitoring. This is further supported by published empirical and theoretical findings that show that random effects linear models may provide accurate representations of phase III and IV steady-state pharmacokinetic data, and may be useful for dosage computations. These models have applications in the design of clinical algorithms for drug dosage individualization in chronic diseases; in the computation of dose correction factors; computation of the minimum number of blood samples from a patient that are necessary for calculating an optimal individualized drug dosage in therapeutic drug monitoring; measure of the clinical importance of clinical, demographic, environmental or genetic covariates; study of drug-drug interactions in clinical settings; the implementation of computational tools for web-site-based evidence farming; design of pharmacogenomic studies; and in the development of a pharmacological theory of dosage individualization. PMID:23467392

  20. Ecological covariates based predictive model of malaria risk in the state of Chhattisgarh, India.

    PubMed

    Kumar, Rajesh; Dash, Chinmaya; Rani, Khushbu

    2017-09-01

    Malaria being an endemic disease in the state of Chhattisgarh and ecologically dependent mosquito-borne disease, the study is intended to identify the ecological covariates of malaria risk in districts of the state and to build a suitable predictive model based on those predictors which could assist developing a weather based early warning system. This secondary data based analysis used one month lagged district level malaria positive cases as response variable and ecological covariates as independent variables which were tested with fixed effect panelled negative binomial regression models. Interactions among the covariates were explored using two way factorial interaction in the model. Although malaria risk in the state possesses perennial characteristics, higher parasitic incidence was observed during the rainy and winter seasons. The univariate analysis indicated that the malaria incidence risk was statistically significant associated with rainfall, maximum humidity, minimum temperature, wind speed, and forest cover ( p  < 0.05). The efficient predictive model include the forest cover [IRR-1.033 (1.024-1.042)], maximum humidity [IRR-1.016 (1.013-1.018)], and two-way factorial interactions between district specific averaged monthly minimum temperature and monthly minimum temperature, monthly minimum temperature was statistically significant [IRR-1.44 (1.231-1.695)] whereas the interaction term has a protective effect [IRR-0.982 (0.974-0.990)] against malaria infections. Forest cover, maximum humidity, minimum temperature and wind speed emerged as potential covariates to be used in predictive models for modelling the malaria risk in the state which could be efficiently used for early warning systems in the state.

  1. External validation of ADO, DOSE, COTE and CODEX at predicting death in primary care patients with COPD using standard and machine learning approaches.

    PubMed

    Morales, Daniel R; Flynn, Rob; Zhang, Jianguo; Trucco, Emmanuel; Quint, Jennifer K; Zutis, Kris

    2018-05-01

    Several models for predicting the risk of death in people with chronic obstructive pulmonary disease (COPD) exist but have not undergone large scale validation in primary care. The objective of this study was to externally validate these models using statistical and machine learning approaches. We used a primary care COPD cohort identified using data from the UK Clinical Practice Research Datalink. Age-standardised mortality rates were calculated for the population by gender and discrimination of ADO (age, dyspnoea, airflow obstruction), COTE (COPD-specific comorbidity test), DOSE (dyspnoea, airflow obstruction, smoking, exacerbations) and CODEX (comorbidity, dyspnoea, airflow obstruction, exacerbations) at predicting death over 1-3 years measured using logistic regression and a support vector machine learning (SVM) method of analysis. The age-standardised mortality rate was 32.8 (95%CI 32.5-33.1) and 25.2 (95%CI 25.4-25.7) per 1000 person years for men and women respectively. Complete data were available for 54879 patients to predict 1-year mortality. ADO performed the best (c-statistic of 0.730) compared with DOSE (c-statistic 0.645), COTE (c-statistic 0.655) and CODEX (c-statistic 0.649) at predicting 1-year mortality. Discrimination of ADO and DOSE improved at predicting 1-year mortality when combined with COTE comorbidities (c-statistic 0.780 ADO + COTE; c-statistic 0.727 DOSE + COTE). Discrimination did not change significantly over 1-3 years. Comparable results were observed using SVM. In primary care, ADO appears superior at predicting death in COPD. Performance of ADO and DOSE improved when combined with COTE comorbidities suggesting better models may be generated with additional data facilitated using novel approaches. Copyright © 2018. Published by Elsevier Ltd.

  2. A phylogenetic transform enhances analysis of compositional microbiota data

    PubMed Central

    Silverman, Justin D; Washburne, Alex D; Mukherjee, Sayan; David, Lawrence A

    2017-01-01

    Surveys of microbial communities (microbiota), typically measured as relative abundance of species, have illustrated the importance of these communities in human health and disease. Yet, statistical artifacts commonly plague the analysis of relative abundance data. Here, we introduce the PhILR transform, which incorporates microbial evolutionary models with the isometric log-ratio transform to allow off-the-shelf statistical tools to be safely applied to microbiota surveys. We demonstrate that analyses of community-level structure can be applied to PhILR transformed data with performance on benchmarks rivaling or surpassing standard tools. Additionally, by decomposing distance in the PhILR transformed space, we identified neighboring clades that may have adapted to distinct human body sites. Decomposing variance revealed that covariation of bacterial clades within human body sites increases with phylogenetic relatedness. Together, these findings illustrate how the PhILR transform combines statistical and phylogenetic models to overcome compositional data challenges and enable evolutionary insights relevant to microbial communities. DOI: http://dx.doi.org/10.7554/eLife.21887.001 PMID:28198697

  3. Statistical approaches to account for missing values in accelerometer data: Applications to modeling physical activity.

    PubMed

    Yue Xu, Selene; Nelson, Sandahl; Kerr, Jacqueline; Godbole, Suneeta; Patterson, Ruth; Merchant, Gina; Abramson, Ian; Staudenmayer, John; Natarajan, Loki

    2018-04-01

    Physical inactivity is a recognized risk factor for many chronic diseases. Accelerometers are increasingly used as an objective means to measure daily physical activity. One challenge in using these devices is missing data due to device nonwear. We used a well-characterized cohort of 333 overweight postmenopausal breast cancer survivors to examine missing data patterns of accelerometer outputs over the day. Based on these observed missingness patterns, we created psuedo-simulated datasets with realistic missing data patterns. We developed statistical methods to design imputation and variance weighting algorithms to account for missing data effects when fitting regression models. Bias and precision of each method were evaluated and compared. Our results indicated that not accounting for missing data in the analysis yielded unstable estimates in the regression analysis. Incorporating variance weights and/or subject-level imputation improved precision by >50%, compared to ignoring missing data. We recommend that these simple easy-to-implement statistical tools be used to improve analysis of accelerometer data.

  4. A positive perspective of knowledge, attitude, and practices for health-promoting behaviors of adolescents with congenital heart disease.

    PubMed

    Huang, Hui-Ru; Chen, Chi-Wen; Chen, Chin-Mi; Yang, Hsiao-Ling; Su, Wen-Jen; Wang, Jou-Kou; Tsai, Pei-Kwei

    2018-03-01

    Health-promoting behaviors could serve as a major strategy to optimize long-term outcomes for adolescents with congenital heart disease. The associations assessed from a positive perspective of knowledge, attitudes, and practice model would potentially cultivate health-promoting behaviors during adolescence. The purpose of this study was to examine the relationships between disease knowledge, resilience, family functioning, and health-promoting behaviors in adolescents with congenital heart disease. A total of 320 adolescents with congenital heart disease who were aged 12-18 years were recruited from pediatric cardiology outpatient departments, and participated in a cross-sectional survey. The participants completed the Leuven Knowledge Questionnaire for Congenital Heart Disease; Haase Adolescent Resilience in Illness Scale; Family Adaptability, Partnership, Growth, Affection, and Resolve; and Adolescent Health Promotion scales. The collected data were analyzed using descriptive statistics and three multiple regression models. Greater knowledge of prevention of complications and higher resilience had a more powerful effect in enhancing health-promoting behaviors. Having symptoms and moderate or severe family dysfunction were significantly more negatively predictive of health-promoting behaviors than not having symptoms and positive family function. The third model explained 40% of the variance in engaging in health-promoting behaviors among adolescents with congenital heart disease. The findings of this study provide new insights into the role of disease knowledge, resilience, and family functioning in the health-promoting behavior of adolescents with congenital heart disease. Continued efforts are required to plan family care programs that promote the acquisition of sufficient disease knowledge and the development of resilience for adolescents with congenital heart disease.

  5. Expansion of the Lyme Disease Vector Ixodes Scapularis in Canada Inferred from CMIP5 Climate Projections

    PubMed Central

    McPherson, Michelle; García-García, Almudena; Cuesta-Valero, Francisco José; Hansen-Ketchum, Patti; MacDougall, Donna; Ogden, Nicholas Hume

    2017-01-01

    Background: A number of studies have assessed possible climate change impacts on the Lyme disease vector, Ixodes scapularis. However, most have used surface air temperature from only one climate model simulation and/or one emission scenario, representing only one possible climate future. Objectives: We quantified effects of different Representative Concentration Pathway (RCP) and climate model outputs on the projected future changes in the basic reproduction number (R0) of I. scapularis to explore uncertainties in future R0 estimates. Methods: We used surface air temperature generated by a complete set of General Circulation Models from the Coupled Model Intercomparison Project Phase 5 (CMIP5) to hindcast historical (1971–2000), and to forecast future effects of climate change on the R0 of I. scapularis for the periods 2011–2040 and 2041–2070. Results: Increases in the multimodel mean R0 values estimated for both future periods, relative to 1971–2000, were statistically significant under all RCP scenarios for all of Nova Scotia, areas of New Brunswick and Quebec, Ontario south of 47°N, and Manitoba south of 52°N. When comparing RCP scenarios, only the estimated R0 mean values between RCP6.0 and RCP8.5 showed statistically significant differences for any future time period. Conclusion: Our results highlight the potential for climate change to have an effect on future Lyme disease risk in Canada even if the Paris Agreement’s goal to keep global warming below 2°C is achieved, although mitigation reducing emissions from RCP8.5 levels to those of RCP6.0 or less would be expected to slow tick invasion after the 2030s. https://doi.org/10.1289/EHP57 PMID:28599266

  6. Modelling Future Cardiovascular Disease Mortality in the United States: National Trends and Racial and Ethnic Disparities

    PubMed Central

    Pearson-Stuttard, Jonathan; Guzman-Castillo, Maria; Penalvo, Jose L.; Rehm, Colin D.; Afshin, Ashkan; Danaei, Goodarz; Kypridemos, Chris; Gaziano, Tom; Mozaffarian, Dariush; Capewell, Simon; O’Flaherty, Martin

    2016-01-01

    Background Accurate forecasting of cardiovascular disease (CVD) mortality is crucial to guide policy and programming efforts. Prior forecasts have often not incorporated past trends in rates of reduction in CVD mortality. This creates uncertainties about future trends in CVD mortality and disparities. Methods and Results To forecast US CVD mortality and disparities to 2030, we developed a hierarchical Bayesian model to determine and incorporate prior age, period and cohort (APC) effects from 1979–2012, stratified by age, gender and race; which we combined with expected demographic shifts to 2030. Data sources included the National Vital Statistics System, SEER single year population estimates, and US Bureau of Statistics 2012 National Population projections. We projected coronary disease and stroke deaths to 2030, first based on constant APC effects at 2012 values, as most commonly done (conventional); and then using more rigorous projections incorporating expected trends in APC effects (trend-based). We primarily evaluated absolute mortality. The conventional model projected total coronary and stroke deaths by 2030 to increase by approximately 18% (67,000 additional coronary deaths/year) and 50% (64,000 additional stroke deaths/year). Conversely, the trend-based model projected that coronary mortality would fall by 2030 by approximately 27% (79,000 fewer deaths/year); and stroke mortality would remain unchanged (200 fewer deaths/year). Health disparities will be improved in stroke deaths, but not coronary deaths. Conclusions After accounting for prior mortality trends and expected demographic shifts, total US coronary deaths are expected to decline, while stroke mortality will remain relatively constant. Health disparities in stroke, but not coronary, deaths will be improved but not eliminated. These APC approaches offer more plausible predictions than conventional estimates. PMID:26846769

  7. The usefulness of administrative databases for identifying disease cohorts is increased with a multivariate model.

    PubMed

    van Walraven, Carl; Austin, Peter C; Manuel, Douglas; Knoll, Greg; Jennings, Allison; Forster, Alan J

    2010-12-01

    Administrative databases commonly use codes to indicate diagnoses. These codes alone are often inadequate to accurately identify patients with particular conditions. In this study, we determined whether we could quantify the probability that a person has a particular disease-in this case renal failure-using other routinely collected information available in an administrative data set. This would allow the accurate identification of a disease cohort in an administrative database. We determined whether patients in a randomly selected 100,000 hospitalizations had kidney disease (defined as two or more sequential serum creatinines or the single admission creatinine indicating a calculated glomerular filtration rate less than 60 mL/min/1.73 m²). The independent association of patient- and hospitalization-level variables with renal failure was measured using a multivariate logistic regression model in a random 50% sample of the patients. The model was validated in the remaining patients. Twenty thousand seven hundred thirteen patients had kidney disease (20.7%). A diagnostic code of kidney disease was strongly associated with kidney disease (relative risk: 34.4), but the accuracy of the code was poor (sensitivity: 37.9%; specificity: 98.9%). Twenty-nine patient- and hospitalization-level variables entered the kidney disease model. This model had excellent discrimination (c-statistic: 90.1%) and accurately predicted the probability of true renal failure. The probability threshold that maximized sensitivity and specificity for the identification of true kidney disease was 21.3% (sensitivity: 80.0%; specificity: 82.2%). Multiple variables available in administrative databases can be combined to quantify the probability that a person has a particular disease. This process permits accurate identification of a disease cohort in an administrative database. These methods may be extended to other diagnoses or procedures and could both facilitate and clarify the use of administrative databases for research and quality improvement. Copyright © 2010 Elsevier Inc. All rights reserved.

  8. OS104. Are preeclampsia and adverse obstetrical outcomes predictors for longterm cardiovascular disease?

    PubMed

    Sia, W W; Tsuyuki, R; Pertman, S; Hui, W

    2012-07-01

    Epidemiologic studies suggest that pregnancy complications such as preeclampsia, gestational diabetes, preterm delivery and low birth weight independently increase maternal risk for future development of cardiovascular disease. To further investigate whether preeclampsia, gestational diabetes, and adverse obstetrical outcomes such as placental abruption, intrauterine growth restriction and preterm delivery, are independent risk factors for longterm cardiovascular disease. This was a case-control study where 252 parous women (cases) with coronary artery disease were matched with a parous woman within 5 years of age with no known coronary artery disease (controls). Participants were recruited from the Royal Alexandra Hospital in cardiac catheterization lab recovery room in Edmonton, Canada. Women with significant angiographic coronary artery stenosis were eligible as cases and those without were eligible as controls. Participants were interviewed on their pregnancy histories and traditional cardiovascular risk factors, such as hypertension, diabetes etc. Descriptive statistics, chi-square tests and conditional regression analysis were performed. We recruited 244 cases and 246 controls. The average age was 66.3 and 65.8 respectively. Cases were more likely obese, had more pregnancies as well as traditional cardiovascular risk factors than controls. Adverse pregnancy outcomes were similar between the two groups except gestational hypertension. However, it was not statistically significant in the conditional logistic regression model. Independent risk factors for future cardiovascular diseases were: dyslipidemia (OR 12.8), hypertension (3.0), and being a current (OR 7.4) or former smoker (1.8). Adverse pregnancy outcomes In this study, adverse pregnancy outcomes were not independently associated with cardiovascular disorders. Our study was limited by recall bias, and ascertainment of diagnosis.Our study supports that dyslipidemia, hypertensiion and smoking increase future cardiovascular disease. Further studies are needed to examine a postpartum intervention model to proactively manage cardiovascular risk factors, such as lipids, in these at-risk women. Copyright © 2012. Published by Elsevier B.V.

  9. Seasonality and Coronary Heart Disease Deaths in United States Firefighters

    PubMed Central

    Mbanu, Ibeawuchi; Wellenius, Gregory A.; Mittleman, Murray A.; Peeples, Lynne; Stallings, Leonard A.; Kales, Stefanos N.

    2013-01-01

    United States firefighters have a high on-duty fatality rate and coronary heart disease is the leading cause. Seasonality affects the incidence of cardiovascular events in the general population, but its effects on firefighters are unknown. We statistically examined the seasonal and annual variation of all on-duty coronary heart disease deaths among US firefighters between 1994 and 2004 using the chi-square distribution and Poisson regression model of the monthly fatality counts. We also examined the effect of ambient temperature (apparent as well as wind chill temperature) on coronary heart disease fatalities during the study span using a time-stratified, case-crossover study design. When grouped by season, we observed the distribution of the 449 coronary heart disease fatalities to show a relative peak in winter (32%) and relative nadir in spring (21%). This pattern was significantly different (p=0.005) from the expected distribution under the null hypothesis where season has no effect. The pattern persisted in additional analyses, stratifying the deaths by the type of duty in which the firefighters were engaged at the time of their deaths. In the Poisson regression model of the monthly fatality counts, the overall goodness-of-fit between the actual and predicted case counts was excellent ( χ42 = 16.63; p = 0.002). Two distinct peaks were detected, one in January-February and the other in August-September. Overall, temperature was not associated with increased risk of on-duty death. After allowing for different effects of temperature in mild/hot versus cold periods, a 1°C increase was not protective in cold weather, nor did it increase the risk of death in warmer weather. The findings of this study reveal statistical evidence for excess coronary heart disease deaths among firefighters during winter; however, the temporal pattern coronary heart disease deaths was not linked to temperature variation. We also found the seasonal pattern to be independent of duty-related risks. PMID:17701682

  10. Radiation Risk of Cardiovascular Diseases in the Cohort of Russian Emergency Workers of the Chernobyl Accident.

    PubMed

    Kashcheev, V V; Chekin, S Yu; Karpenko, S V; Maksioutov, M A; Menyaylo, A N; Tumanov, K A; Kochergina, E V; Kashcheeva, P V; Gorsky, A I; Shchukina, N V; Lovachev, S S; Vlasov, O K; Ivanov, V K

    2017-07-01

    This paper continues a series of publications that analyze the impact of radiation on incidence of circulatory system diseases in the cohort of Russian recovery operation workers (liquidators) and presents the results of the analysis of cardiovascular disease (CVD) incidence. The studied cohort consists of 53,772 liquidators who arrived in the Chernobyl accident zone within the first year after the accident (26 April 1986 to 26 April 1987). The individual doses varied from 0.0001 Gy to 1.42 Gy, and the mean external whole body dose in the cohort was 0.161 Gy. A total of 27,456 cases of CVD were diagnosed during the follow-up period 1986-2012 as a result of annual health examinations. A Poisson regression model was applied to estimate radiation risks and other risk factors associated with CVD. The following factors were identified as risk factors for CVD: the dose, duration of the liquidators' work in the Chernobyl zone, and concomitant diseases (diabetes mellitus, hypertension, overweight, and alcohol dependence). The baseline incidence of CVD is statistically significantly (p < 0.001) associated with all studied concomitant diseases. The incidence of CVD has revealed a statistically significant dose response with the lack of a latent period and with the average ERR Gy = 0.47, 95% CI = 0.31, 0.63, p < 0.001. Radiation risks of CVD statistically significantly (p = 0.01) varied with the duration of liquidators' stay in the Chernobyl zone; for those who stayed in the Chernobyl zone less than 6 wk, ERR/Gy = 0.80, 95% CI = 0.53; 1.08, p < 0.001.

  11. The extension of total gain (TG) statistic in survival models: properties and applications.

    PubMed

    Choodari-Oskooei, Babak; Royston, Patrick; Parmar, Mahesh K B

    2015-07-01

    The results of multivariable regression models are usually summarized in the form of parameter estimates for the covariates, goodness-of-fit statistics, and the relevant p-values. These statistics do not inform us about whether covariate information will lead to any substantial improvement in prediction. Predictive ability measures can be used for this purpose since they provide important information about the practical significance of prognostic factors. R (2)-type indices are the most familiar forms of such measures in survival models, but they all have limitations and none is widely used. In this paper, we extend the total gain (TG) measure, proposed for a logistic regression model, to survival models and explore its properties using simulations and real data. TG is based on the binary regression quantile plot, otherwise known as the predictiveness curve. Standardised TG ranges from 0 (no explanatory power) to 1 ('perfect' explanatory power). The results of our simulations show that unlike many of the other R (2)-type predictive ability measures, TG is independent of random censoring. It increases as the effect of a covariate increases and can be applied to different types of survival models, including models with time-dependent covariate effects. We also apply TG to quantify the predictive ability of multivariable prognostic models developed in several disease areas. Overall, TG performs well in our simulation studies and can be recommended as a measure to quantify the predictive ability in survival models.

  12. Predicting survival of de novo metastatic breast cancer in Asian women: systematic review and validation study.

    PubMed

    Miao, Hui; Hartman, Mikael; Bhoo-Pathy, Nirmala; Lee, Soo-Chin; Taib, Nur Aishah; Tan, Ern-Yu; Chan, Patrick; Moons, Karel G M; Wong, Hoong-Seam; Goh, Jeremy; Rahim, Siti Mastura; Yip, Cheng-Har; Verkooijen, Helena M

    2014-01-01

    In Asia, up to 25% of breast cancer patients present with distant metastases at diagnosis. Given the heterogeneous survival probabilities of de novo metastatic breast cancer, individual outcome prediction is challenging. The aim of the study is to identify existing prognostic models for patients with de novo metastatic breast cancer and validate them in Asia. We performed a systematic review to identify prediction models for metastatic breast cancer. Models were validated in 642 women with de novo metastatic breast cancer registered between 2000 and 2010 in the Singapore Malaysia Hospital Based Breast Cancer Registry. Survival curves for low, intermediate and high-risk groups according to each prognostic score were compared by log-rank test and discrimination of the models was assessed by concordance statistic (C-statistic). We identified 16 prediction models, seven of which were for patients with brain metastases only. Performance status, estrogen receptor status, metastatic site(s) and disease-free interval were the most common predictors. We were able to validate nine prediction models. The capacity of the models to discriminate between poor and good survivors varied from poor to fair with C-statistics ranging from 0.50 (95% CI, 0.48-0.53) to 0.63 (95% CI, 0.60-0.66). The discriminatory performance of existing prediction models for de novo metastatic breast cancer in Asia is modest. Development of an Asian-specific prediction model is needed to improve prognostication and guide decision making.

  13. Increased mortality attributed to Chagas disease: a systematic review and meta-analysis.

    PubMed

    Cucunubá, Zulma M; Okuwoga, Omolade; Basáñez, María-Gloria; Nouvellet, Pierre

    2016-01-27

    The clinical outcomes associated with Chagas disease remain poorly understood. In addition to the burden of morbidity, the burden of mortality due to Trypanosoma cruzi infection can be substantial, yet its quantification has eluded rigorous scrutiny. This is partly due to considerable heterogeneity between studies, which can influence the resulting estimates. There is a pressing need for accurate estimates of mortality due to Chagas disease that can be used to improve mathematical modelling, burden of disease evaluations, and cost-effectiveness studies. A systematic literature review was conducted to select observational studies comparing mortality in populations with and without a diagnosis of Chagas disease using the PubMed, MEDLINE, EMBASE, Web of Science and LILACS databases, without restrictions on language or date of publication. The primary outcome of interest was mortality (as all-cause mortality, sudden cardiac death, heart transplant or cardiovascular deaths). Data were analysed using a random-effects model to obtain the relative risk (RR) of mortality, the attributable risk percent (ARP), and the annual mortality rates (AMR). The statistic I(2) (proportion of variance in the meta-analysis due to study heterogeneity) was calculated. Sensitivity analyses and publication bias test were also conducted. Twenty five studies were selected for quantitative analysis, providing data on 10,638 patients, 53,346 patient-years of follow-up, and 2739 events. Pooled estimates revealed that Chagas disease patients have significantly higher AMR compared with non-Chagas disease patients (0.18 versus 0.10; RR = 1.74, 95% CI 1.49-2.03). Substantial heterogeneity was found among studies (I(2) = 67.3%). The ARP above background mortality was 42.5%. Through a sub-analysis patients were classified by clinical group (severe, moderate, asymptomatic). While RR did not differ significantly between clinical groups, important differences in AMR were found: AMR = 0.43 in Chagas vs. 0.29 in non-Chagas patients (RR = 1.40, 95% CI 1.21-1.62) in the severe group; AMR = 0.16 (Chagas) vs. 0.08 (non-Chagas) (RR = 2.10, 95% CI 1.52-2.91) in the moderate group, and AMR = 0.02 vs. 0.01 (RR = 1.42, 95% CI 1.14-1.77) in the asymptomatic group. Meta-regression showed no evidence of study-level covariates on the effect size. Publication bias was not statistically significant (Egger's test p=0.08). The results indicate a statistically significant excess of mortality due to Chagas disease that is shared among both symptomatic and asymptomatic populations.

  14. Clinical profiles associated with influenza disease in the ferret model.

    PubMed

    Stark, Gregory V; Long, James P; Ortiz, Diana I; Gainey, Melicia; Carper, Benjamin A; Feng, Jingyu; Miller, Stephen M; Bigger, John E; Vela, Eric M

    2013-01-01

    Influenza A viruses continue to pose a threat to human health; thus, various vaccines and prophylaxis continue to be developed. Testing of these products requires various animal models including mice, guinea pigs, and ferrets. However, because ferrets are naturally susceptible to infection with human influenza viruses and because the disease state resembles that of human influenza, these animals have been widely used as a model to study influenza virus pathogenesis. In this report, a statistical analysis was performed to evaluate data involving 269 ferrets infected with seasonal influenza, swine influenza, and highly pathogenic avian influenza (HPAI) from 16 different studies over a five year period. The aim of the analyses was to better qualify the ferret model by identifying relationships among important animal model parameters (endpoints) and variables of interest, which include survival, time-to-death, changes in body temperature and weight, and nasal wash samples containing virus, in addition to significant changes from baseline in selected hematology and clinical chemistry parameters. The results demonstrate that a disease clinical profile, consisting of various changes in the biological parameters tested, is associated with various influenza A infections in ferrets. Additionally, the analysis yielded correlates of protection associated with HPAI disease in ferrets. In all, the results from this study further validate the use of the ferret as a model to study influenza A pathology and to evaluate product efficacy.

  15. Spatio-temporal surveillance of water based infectious disease (malaria) in Rawalpindi, Pakistan using geostatistical modeling techniques.

    PubMed

    Ahmad, Sheikh Saeed; Aziz, Neelam; Butt, Amna; Shabbir, Rabia; Erum, Summra

    2015-09-01

    One of the features of medical geography that has made it so useful in health research is statistical spatial analysis, which enables the quantification and qualification of health events. The main objective of this research was to study the spatial distribution patterns of malaria in Rawalpindi district using spatial statistical techniques to identify the hot spots and the possible risk factor. Spatial statistical analyses were done in ArcGIS, and satellite images for land use classification were processed in ERDAS Imagine. Four hundred and fifty water samples were also collected from the study area to identify the presence or absence of any microbial contamination. The results of this study indicated that malaria incidence varied according to geographical location, with eco-climatic condition and showing significant positive spatial autocorrelation. Hotspots or location of clusters were identified using Getis-Ord Gi* statistic. Significant clustering of malaria incidence occurred in rural central part of the study area including Gujar Khan, Kaller Syedan, and some part of Kahuta and Rawalpindi Tehsil. Ordinary least square (OLS) regression analysis was conducted to analyze the relationship of risk factors with the disease cases. Relationship of different land cover with the disease cases indicated that malaria was more related with agriculture, low vegetation, and water class. Temporal variation of malaria cases showed significant positive association with the meteorological variables including average monthly rainfall and temperature. The results of the study further suggested that water supply and sewage system and solid waste collection system needs a serious attention to prevent any outbreak in the study area.

  16. Nutritional epidemiology: New perspectives for understanding the diet-disease relationship?

    PubMed

    Boeing, H

    2013-05-01

    Nutritional epidemiology is a subdiscipline of epidemiology and provides specific knowledge to nutritional science. It provides data about the diet-disease relationships that is transformed by Public Health Nutrition into the practise of prevention. The specific contributions of nutritional epidemiology include dietary assessment, description of nutritional exposure and statistical modelling of the diet-disease relationship. In all these areas, substantial progress has been made over the last years and is described in this article. Dietary assessment is moving away from the food frequency questionnaire (FFQ) as main dietary assessment instrument in large-scale epidemiological studies towards the use of short-term quantitative instruments due to the potential of gross measurement errors. Web-based instruments for self-administration are therefore evaluated of being able to replace the costly interviewer conducted 24-h-recalls. Much interest is also directed towards the technique of taking and analysing photographs of all meals ingested, which might improve the dietary assessment in terms of precision. The description of nutritional exposure could greatly benefit from standardisation of the coding of foods across studies in order to improve comparability. For the investigations of bioactive substances as reflecting nutritional intake and status, the investigation of concentration measurements in body fluids as potential biomarkers will benefit from the new high-throughput technologies of mass spectrometry. Statistical modelling of the dietary data and the diet-disease relationships can refer to complex programmes that convert quantitative short-term measurements into habitual intakes of individuals and correct for the errors in the estimates of the diet-disease relationships by taking data from validation studies with biomarkers into account. For dietary data, substitution modelling should be preferred over simple adding modelling. More attention should also be put on the investigation of non-linear relationships. The increasing complexity of the conduct and analysis of nutritional epidemiological studies is calling for a distinct and advanced training programme for the young scientists moving into this area. This will also guarantee that in the future an increasing number of high-level manuscripts will show up in this and other journals in respect of nutritional epidemiological topics.

  17. Dopamine Transporter Neuroimaging as an Enrichment Biomarker in Early Parkinson's Disease Clinical Trials: A Disease Progression Modeling Analysis

    PubMed Central

    Nicholas, Timothy; Tsai, Kuenhi; Macha, Sreeraj; Sinha, Vikram; Stone, Julie; Corrigan, Brian; Bani, Massimo; Muglia, Pierandrea; Watson, Ian A.; Kern, Volker D.; Sheveleva, Elena; Marek, Kenneth; Stephenson, Diane T.; Romero, Klaus

    2017-01-01

    Abstract Given the recognition that disease‐modifying therapies should focus on earlier Parkinson's disease stages, trial enrollment based purely on clinical criteria poses significant challenges. The goal herein was to determine the utility of dopamine transporter neuroimaging as an enrichment biomarker in early motor Parkinson's disease clinical trials. Patient‐level longitudinal data of 672 subjects with early‐stage Parkinson's disease in the Parkinson's Progression Markers Initiative (PPMI) observational study and the Parkinson Research Examination of CEP‐1347 Trial (PRECEPT) clinical trial were utilized in a linear mixed‐effects model analysis. The rate of worsening in the motor scores between subjects with or without a scan without evidence of dopamine transporter deficit was different both statistically and clinically. The average difference in the change from baseline of motor scores at 24 months between biomarker statuses was –3.16 (90% confidence interval [CI] = –0.96 to –5.42) points. Dopamine transporter imaging could identify subjects with a steeper worsening of the motor scores, allowing trial enrichment and 24% reduction of sample size. PMID:28749580

  18. An Environmental Data Set for Vector-Borne Disease Modeling and Epidemiology

    PubMed Central

    Chabot-Couture, Guillaume; Nigmatulina, Karima; Eckhoff, Philip

    2014-01-01

    Understanding the environmental conditions of disease transmission is important in the study of vector-borne diseases. Low- and middle-income countries bear a significant portion of the disease burden; but data about weather conditions in those countries can be sparse and difficult to reconstruct. Here, we describe methods to assemble high-resolution gridded time series data sets of air temperature, relative humidity, land temperature, and rainfall for such areas; and we test these methods on the island of Madagascar. Air temperature and relative humidity were constructed using statistical interpolation of weather station measurements; the resulting median 95th percentile absolute errors were 2.75°C and 16.6%. Missing pixels from the MODIS11 remote sensing land temperature product were estimated using Fourier decomposition and time-series analysis; thus providing an alternative to the 8-day and 30-day aggregated products. The RFE 2.0 remote sensing rainfall estimator was characterized by comparing it with multiple interpolated rainfall products, and we observed significant differences in temporal and spatial heterogeneity relevant to vector-borne disease modeling. PMID:24755954

  19. Performance of non-invasive models of fibrosis in predicting mild to moderate fibrosis in patients with non-alcoholic fatty liver disease.

    PubMed

    Siddiqui, Mohammad S; Patidar, Kavish R; Boyett, Sherry; Luketic, Velimir A; Puri, Puneet; Sanyal, Arun J

    2016-04-01

    In non-alcoholic fatty liver disease, presence of fibrosis is predictive of long-term liver-related complications. Currently, there are no reliable and non-invasive means of quantifying fibrosis in those with non-alcoholic fatty liver disease. Therefore, we aimed to evaluate the performance of a panel of non-invasive models in predicting fibrosis in non-alcoholic fatty liver disease. The accuracy of FibroMeter non-alcoholic fatty liver disease, fibrosis 4 and four other non-invasive models in predicting fibrosis in those with biopsy proven non-alcoholic fatty liver disease was compared. These models were constructed post hoc in patients who had necessary clinical information collected within 2 months of a liver biopsy. The areas under receiver operating characteristics curves were compared for each model using Delong analysis. Optimum cut-off for each model and fibrosis stage were calculated using the Youden index. The area under receiver operating characteristics curves for F ≥ 1 fibrosis for fibrosis 4 and FibroMeter non-alcoholic fatty liver disease was 0.821 and 0.801 respectively. For F ≥ 3, the area under receiver operating characteristics curves was 0.866 for fibrosis 4 and 0.862 for FibroMeter non-alcoholic fatty liver disease. Delong analysis showed the area under receiver operating characteristics curves was statistically different for fibrosis 4 and FibroMeter non-alcoholic fatty liver disease compared with BARD, BAAT and aspartate aminotransferase:alanine aminotransferase ratio for F ≥ 1 and F ≥ 3. Area under receiver operating characteristics curves were significantly different for fibrosis 4 and FibroMeter non-alcoholic fatty liver disease for F ≥ 3 compared with non-alcoholic fatty liver disease fibrosis score. At a fixed sensitivity of 90%, FibroMeter non-alcoholic fatty liver disease had the highest specificity for F ≥ 1 (52.4%) and F ≥ 3 (63.8%). In contrast, at a fixed specificity of 90%, fibrosis 4 outperformed other models with a sensitivity of 60.2% for F ≥ 1 and 70.6% for F ≥ 3 fibrosis. These non-invasive models of fibrosis can predict varying degrees of fibrosis from routinely collected clinical information in non-alcoholic fatty liver disease. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  20. Design and analysis issues in gene and environment studies

    PubMed Central

    2012-01-01

    Both nurture (environmental) and nature (genetic factors) play an important role in human disease etiology. Traditionally, these effects have been thought of as independent. This perspective is ill informed for non-mendelian complex disorders which result as an interaction between genetics and environment. To understand health and disease we must study how nature and nurture interact. Recent advances in human genomics and high-throughput biotechnology make it possible to study large numbers of genetic markers and gene products simultaneously to explore their interactions with environment. The purpose of this review is to discuss design and analytic issues for gene-environment interaction studies in the “-omics” era, with a focus on environmental and genetic epidemiological studies. We present an expanded environmental genomic disease paradigm. We discuss several study design issues for gene-environmental interaction studies, including confounding and selection bias, measurement of exposures and genotypes. We discuss statistical issues in studying gene-environment interactions in different study designs, such as choices of statistical models, assumptions regarding biological factors, and power and sample size considerations, especially in genome-wide gene-environment studies. Future research directions are also discussed. PMID:23253229

  1. Design and analysis issues in gene and environment studies.

    PubMed

    Liu, Chen-yu; Maity, Arnab; Lin, Xihong; Wright, Robert O; Christiani, David C

    2012-12-19

    Both nurture (environmental) and nature (genetic factors) play an important role in human disease etiology. Traditionally, these effects have been thought of as independent. This perspective is ill informed for non-mendelian complex disorders which result as an interaction between genetics and environment. To understand health and disease we must study how nature and nurture interact. Recent advances in human genomics and high-throughput biotechnology make it possible to study large numbers of genetic markers and gene products simultaneously to explore their interactions with environment. The purpose of this review is to discuss design and analytic issues for gene-environment interaction studies in the "-omics" era, with a focus on environmental and genetic epidemiological studies. We present an expanded environmental genomic disease paradigm. We discuss several study design issues for gene-environmental interaction studies, including confounding and selection bias, measurement of exposures and genotypes. We discuss statistical issues in studying gene-environment interactions in different study designs, such as choices of statistical models, assumptions regarding biological factors, and power and sample size considerations, especially in genome-wide gene-environment studies. Future research directions are also discussed.

  2. Big heart data: advancing health informatics through data sharing in cardiovascular imaging.

    PubMed

    Suinesiaputra, Avan; Medrano-Gracia, Pau; Cowan, Brett R; Young, Alistair A

    2015-07-01

    The burden of heart disease is rapidly worsening due to the increasing prevalence of obesity and diabetes. Data sharing and open database resources for heart health informatics are important for advancing our understanding of cardiovascular function, disease progression and therapeutics. Data sharing enables valuable information, often obtained at considerable expense and effort, to be reused beyond the specific objectives of the original study. Many government funding agencies and journal publishers are requiring data reuse, and are providing mechanisms for data curation and archival. Tools and infrastructure are available to archive anonymous data from a wide range of studies, from descriptive epidemiological data to gigabytes of imaging data. Meta-analyses can be performed to combine raw data from disparate studies to obtain unique comparisons or to enhance statistical power. Open benchmark datasets are invaluable for validating data analysis algorithms and objectively comparing results. This review provides a rationale for increased data sharing and surveys recent progress in the cardiovascular domain. We also highlight the potential of recent large cardiovascular epidemiological studies enabling collaborative efforts to facilitate data sharing, algorithms benchmarking, disease modeling and statistical atlases.

  3. Deriving the expected utility of a predictive model when the utilities are uncertain.

    PubMed

    Cooper, Gregory F; Visweswaran, Shyam

    2005-01-01

    Predictive models are often constructed from clinical databases with the goal of eventually helping make better clinical decisions. Evaluating models using decision theory is therefore natural. When constructing a model using statistical and machine learning methods, however, we are often uncertain about precisely how the model will be used. Thus, decision-independent measures of classification performance, such as the area under an ROC curve, are popular. As a complementary method of evaluation, we investigate techniques for deriving the expected utility of a model under uncertainty about the model's utilities. We demonstrate an example of the application of this approach to the evaluation of two models that diagnose coronary artery disease.

  4. Reliability, reference values and predictor variables of the ulnar sensory nerve in disease free adults.

    PubMed

    Ruediger, T M; Allison, S C; Moore, J M; Wainner, R S

    2014-09-01

    The purposes of this descriptive and exploratory study were to examine electrophysiological measures of ulnar sensory nerve function in disease free adults to determine reliability, determine reference values computed with appropriate statistical methods, and examine predictive ability of anthropometric variables. Antidromic sensory nerve conduction studies of the ulnar nerve using surface electrodes were performed on 100 volunteers. Reference values were computed from optimally transformed data. Reliability was computed from 30 subjects. Multiple linear regression models were constructed from four predictor variables. Reliability was greater than 0.85 for all paired measures. Responses were elicited in all subjects; reference values for sensory nerve action potential (SNAP) amplitude from above elbow stimulation are 3.3 μV and decrement across-elbow less than 46%. No single predictor variable accounted for more than 15% of the variance in the response. Electrophysiologic measures of the ulnar sensory nerve are reliable. Absent SNAP responses are inconsistent with disease free individuals. Reference values recommended in this report are based on appropriate transformations of non-normally distributed data. No strong statistical model of prediction could be derived from the limited set of predictor variables. Reliability analyses combined with relatively low level of measurement error suggest that ulnar sensory reference values may be used with confidence. Copyright © 2014 Elsevier Masson SAS. All rights reserved.

  5. Spatio-temporal Genetic Structuring of Leishmania major in Tunisia by Microsatellite Analysis

    PubMed Central

    Harrabi, Myriam; Bettaieb, Jihène; Ghawar, Wissem; Toumi, Amine; Zaâtour, Amor; Yazidi, Rihab; Chaâbane, Sana; Chalghaf, Bilel; Hide, Mallorie; Bañuls, Anne-Laure; Ben Salah, Afif

    2015-01-01

    In Tunisia, cases of zoonotic cutaneous leishmaniasis caused by Leishmania major are increasing and spreading from the south-west to new areas in the center. To improve the current knowledge on L. major evolution and population dynamics, we performed multi-locus microsatellite typing of human isolates from Tunisian governorates where the disease is endemic (Gafsa, Kairouan and Sidi Bouzid governorates) and collected during two periods: 1991–1992 and 2008–2012. Analysis (F-statistics and Bayesian model-based approach) of the genotyping results of isolates collected in Sidi Bouzid in 1991–1992 and 2008–2012 shows that, over two decades, in the same area, Leishmania parasites evolved by generating genetically differentiated populations. The genetic patterns of 2008–2012 isolates from the three governorates indicate that L. major populations did not spread gradually from the south to the center of Tunisia, according to a geographical gradient, suggesting that human activities might be the source of the disease expansion. The genotype analysis also suggests previous (Bayesian model-based approach) and current (F-statistics) flows of genotypes between governorates and districts. Human activities as well as reservoir dynamics and the effects of environmental changes could explain how the disease progresses. This study provides new insights into the evolution and spread of L. major in Tunisia that might improve our understanding of the parasite flow between geographically and temporally distinct populations. PMID:26302440

  6. Mortality in East African shorthorn zebu cattle under one year: predictors of infectious-disease mortality.

    PubMed

    Thumbi, Samuel M; Bronsvoort, Mark B M de C; Kiara, Henry; Toye, P G; Poole, Jane; Ndila, Mary; Conradie, Ilana; Jennings, Amy; Handel, Ian G; Coetzer, J A W; Steyl, Johan; Hanotte, Olivier; Woolhouse, Mark E J

    2013-09-08

    Infectious livestock diseases remain a major threat to attaining food security and are a source of economic and livelihood losses for people dependent on livestock for their livelihood. Knowledge of the vital infectious diseases that account for the majority of deaths is crucial in determining disease control strategies and in the allocation of limited funds available for disease control. Here we have estimated the mortality rates in zebu cattle raised in a smallholder mixed farming system during their first year of life, identified the periods of increased risk of death and the risk factors for calf mortality, and through analysis of post-mortem data, determined the aetiologies of calf mortality in this population. A longitudinal cohort study of 548 zebu cattle was conducted between 2007 and 2010. Each calf was followed during its first year of life or until lost from the study. Calves were randomly selected from 20 sub-locations and recruited within a week of birth from different farms over a 45 km radius area centered on Busia in the Western part of Kenya. The data comprised of 481.1 calf years of observation. Clinical examinations, sample collection and analysis were carried out at 5 week intervals, from birth until one year old. Cox proportional hazard models with frailty terms were used for the statistical analysis of risk factors. A standardized post-mortem examination was conducted on all animals that died during the study and appropriate samples collected. The all-cause mortality rate was estimated at 16.1 (13.0-19.2; 95% CI) per 100 calf years at risk. The Cox models identified high infection intensity with Theileria spp., the most lethal of which causes East Coast Fever disease, infection with Trypanosome spp., and helminth infections as measured by Strongyle spp. eggs per gram of faeces as the three important infections statistically associated with infectious disease mortality in these calves. Analysis of post-mortem data identified East Coast Fever as the main cause of death accounting for 40% of all deaths, haemonchosis 12% and heartwater disease 7%. The findings demonstrate the impact of endemic parasitic diseases in indigenous animals expected to be well adapted against disease pressures. Additionally, agreement between results of Cox models using data from simple diagnostic procedures and results from post-mortem analysis underline the potential use such diagnostic data to reduce calf mortality. The control strategies for the identified infectious diseases have been discussed.

  7. A risk adjustment approach to estimating the burden of skin disease in the United States.

    PubMed

    Lim, Henry W; Collins, Scott A B; Resneck, Jack S; Bolognia, Jean; Hodge, Julie A; Rohrer, Thomas A; Van Beek, Marta J; Margolis, David J; Sober, Arthur J; Weinstock, Martin A; Nerenz, David R; Begolka, Wendy Smith; Moyano, Jose V

    2018-01-01

    Direct insurance claims tabulation and risk adjustment statistical methods can be used to estimate health care costs associated with various diseases. In this third manuscript derived from the new national Burden of Skin Disease Report from the American Academy of Dermatology, a risk adjustment method that was based on modeling the average annual costs of individuals with or without specific diseases, and specifically tailored for 24 skin disease categories, was used to estimate the economic burden of skin disease. The results were compared with the claims tabulation method used in the first 2 parts of this project. The risk adjustment method estimated the direct health care costs of skin diseases to be $46 billion in 2013, approximately $15 billion less than estimates using claims tabulation. For individual skin diseases, the risk adjustment cost estimates ranged from 11% to 297% of those obtained using claims tabulation for the 10 most costly skin disease categories. Although either method may be used for purposes of estimating the costs of skin disease, the choice of method will affect the end result. These findings serve as an important reference for future discussions about the method chosen in health care payment models to estimate both the cost of skin disease and the potential cost impact of care changes. Copyright © 2017 American Academy of Dermatology, Inc. Published by Elsevier Inc. All rights reserved.

  8. ON MODEL SELECTION STRATEGIES TO IDENTIFY GENES UNDERLYING BINARY TRAITS USING GENOME-WIDE ASSOCIATION DATA.

    PubMed

    Wu, Zheyang; Zhao, Hongyu

    2012-01-01

    For more fruitful discoveries of genetic variants associated with diseases in genome-wide association studies, it is important to know whether joint analysis of multiple markers is more powerful than the commonly used single-marker analysis, especially in the presence of gene-gene interactions. This article provides a statistical framework to rigorously address this question through analytical power calculations for common model search strategies to detect binary trait loci: marginal search, exhaustive search, forward search, and two-stage screening search. Our approach incorporates linkage disequilibrium, random genotypes, and correlations among score test statistics of logistic regressions. We derive analytical results under two power definitions: the power of finding all the associated markers and the power of finding at least one associated marker. We also consider two types of error controls: the discovery number control and the Bonferroni type I error rate control. After demonstrating the accuracy of our analytical results by simulations, we apply them to consider a broad genetic model space to investigate the relative performances of different model search strategies. Our analytical study provides rapid computation as well as insights into the statistical mechanism of capturing genetic signals under different genetic models including gene-gene interactions. Even though we focus on genetic association analysis, our results on the power of model selection procedures are clearly very general and applicable to other studies.

  9. ON MODEL SELECTION STRATEGIES TO IDENTIFY GENES UNDERLYING BINARY TRAITS USING GENOME-WIDE ASSOCIATION DATA

    PubMed Central

    Wu, Zheyang; Zhao, Hongyu

    2013-01-01

    For more fruitful discoveries of genetic variants associated with diseases in genome-wide association studies, it is important to know whether joint analysis of multiple markers is more powerful than the commonly used single-marker analysis, especially in the presence of gene-gene interactions. This article provides a statistical framework to rigorously address this question through analytical power calculations for common model search strategies to detect binary trait loci: marginal search, exhaustive search, forward search, and two-stage screening search. Our approach incorporates linkage disequilibrium, random genotypes, and correlations among score test statistics of logistic regressions. We derive analytical results under two power definitions: the power of finding all the associated markers and the power of finding at least one associated marker. We also consider two types of error controls: the discovery number control and the Bonferroni type I error rate control. After demonstrating the accuracy of our analytical results by simulations, we apply them to consider a broad genetic model space to investigate the relative performances of different model search strategies. Our analytical study provides rapid computation as well as insights into the statistical mechanism of capturing genetic signals under different genetic models including gene-gene interactions. Even though we focus on genetic association analysis, our results on the power of model selection procedures are clearly very general and applicable to other studies. PMID:23956610

  10. Genome-Wide Expression Profiling of Five Mouse Models Identifies Similarities and Differences with Human Psoriasis

    PubMed Central

    Swindell, William R.; Johnston, Andrew; Carbajal, Steve; Han, Gangwen; Wohn, Christian; Lu, Jun; Xing, Xianying; Nair, Rajan P.; Voorhees, John J.; Elder, James T.; Wang, Xiao-Jing; Sano, Shigetoshi; Prens, Errol P.; DiGiovanni, John; Pittelkow, Mark R.; Ward, Nicole L.; Gudjonsson, Johann E.

    2011-01-01

    Development of a suitable mouse model would facilitate the investigation of pathomechanisms underlying human psoriasis and would also assist in development of therapeutic treatments. However, while many psoriasis mouse models have been proposed, no single model recapitulates all features of the human disease, and standardized validation criteria for psoriasis mouse models have not been widely applied. In this study, whole-genome transcriptional profiling is used to compare gene expression patterns manifested by human psoriatic skin lesions with those that occur in five psoriasis mouse models (K5-Tie2, imiquimod, K14-AREG, K5-Stat3C and K5-TGFbeta1). While the cutaneous gene expression profiles associated with each mouse phenotype exhibited statistically significant similarity to the expression profile of psoriasis in humans, each model displayed distinctive sets of similarities and differences in comparison to human psoriasis. For all five models, correspondence to the human disease was strong with respect to genes involved in epidermal development and keratinization. Immune and inflammation-associated gene expression, in contrast, was more variable between models as compared to the human disease. These findings support the value of all five models as research tools, each with identifiable areas of convergence to and divergence from the human disease. Additionally, the approach used in this paper provides an objective and quantitative method for evaluation of proposed mouse models of psoriasis, which can be strategically applied in future studies to score strengths of mouse phenotypes relative to specific aspects of human psoriasis. PMID:21483750

  11. Do Different Mental Models Influence Cybersecurity Behavior? Evaluations via Statistical Reasoning Performance.

    PubMed

    Brase, Gary L; Vasserman, Eugene Y; Hsu, William

    2017-01-01

    Cybersecurity research often describes people as understanding internet security in terms of metaphorical mental models (e.g., disease risk, physical security risk, or criminal behavior risk). However, little research has directly evaluated if this is an accurate or productive framework. To assess this question, two experiments asked participants to respond to a statistical reasoning task framed in one of four different contexts (cybersecurity, plus the above alternative models). Each context was also presented using either percentages or natural frequencies, and these tasks were followed by a behavioral likelihood rating. As in previous research, consistent use of natural frequencies promoted correct Bayesian reasoning. There was little indication, however, that any of the alternative mental models generated consistently better understanding or reasoning over the actual cybersecurity context. There was some evidence that different models had some effects on patterns of responses, including the behavioral likelihood ratings, but these effects were small, as compared to the effect of the numerical format manipulation. This points to a need to improve the content of actual internet security warnings, rather than working to change the models users have of warnings.

  12. Do Different Mental Models Influence Cybersecurity Behavior? Evaluations via Statistical Reasoning Performance

    PubMed Central

    Brase, Gary L.; Vasserman, Eugene Y.; Hsu, William

    2017-01-01

    Cybersecurity research often describes people as understanding internet security in terms of metaphorical mental models (e.g., disease risk, physical security risk, or criminal behavior risk). However, little research has directly evaluated if this is an accurate or productive framework. To assess this question, two experiments asked participants to respond to a statistical reasoning task framed in one of four different contexts (cybersecurity, plus the above alternative models). Each context was also presented using either percentages or natural frequencies, and these tasks were followed by a behavioral likelihood rating. As in previous research, consistent use of natural frequencies promoted correct Bayesian reasoning. There was little indication, however, that any of the alternative mental models generated consistently better understanding or reasoning over the actual cybersecurity context. There was some evidence that different models had some effects on patterns of responses, including the behavioral likelihood ratings, but these effects were small, as compared to the effect of the numerical format manipulation. This points to a need to improve the content of actual internet security warnings, rather than working to change the models users have of warnings. PMID:29163304

  13. Cluster and propensity based approximation of a network

    PubMed Central

    2013-01-01

    Background The models in this article generalize current models for both correlation networks and multigraph networks. Correlation networks are widely applied in genomics research. In contrast to general networks, it is straightforward to test the statistical significance of an edge in a correlation network. It is also easy to decompose the underlying correlation matrix and generate informative network statistics such as the module eigenvector. However, correlation networks only capture the connections between numeric variables. An open question is whether one can find suitable decompositions of the similarity measures employed in constructing general networks. Multigraph networks are attractive because they support likelihood based inference. Unfortunately, it is unclear how to adjust current statistical methods to detect the clusters inherent in many data sets. Results Here we present an intuitive and parsimonious parametrization of a general similarity measure such as a network adjacency matrix. The cluster and propensity based approximation (CPBA) of a network not only generalizes correlation network methods but also multigraph methods. In particular, it gives rise to a novel and more realistic multigraph model that accounts for clustering and provides likelihood based tests for assessing the significance of an edge after controlling for clustering. We present a novel Majorization-Minimization (MM) algorithm for estimating the parameters of the CPBA. To illustrate the practical utility of the CPBA of a network, we apply it to gene expression data and to a bi-partite network model for diseases and disease genes from the Online Mendelian Inheritance in Man (OMIM). Conclusions The CPBA of a network is theoretically appealing since a) it generalizes correlation and multigraph network methods, b) it improves likelihood based significance tests for edge counts, c) it directly models higher-order relationships between clusters, and d) it suggests novel clustering algorithms. The CPBA of a network is implemented in Fortran 95 and bundled in the freely available R package PropClust. PMID:23497424

  14. Environmental correlates to behavioral health outcomes in Alzheimer's special care units.

    PubMed

    Zeisel, John; Silverstein, Nina M; Hyde, Joan; Levkoff, Sue; Lawton, M Powell; Holmes, William

    2003-10-01

    We systematically measured the associations between environmental design features of nursing home special care units and the incidence of aggression, agitation, social withdrawal, depression, and psychotic problems among persons living there who have Alzheimer's disease or a related disorder. We developed and tested a model of critical health-related environmental design features in settings for people with Alzheimer's disease. We used hierarchical linear modeling statistical techniques to assess associations between seven environmental design features and behavioral health measures for 427 residents in 15 special care units. Behavioral health measures included the Cohen-Mansfield physical agitation, verbal agitation, and aggressive behavior scales, the Multidimensional Observation Scale for Elderly Subjects depression and social withdrawal scales, and BEHAVE-AD (psychotic symptom list) misidentification and paranoid delusions scales. Statistical controls were included for the influence of, among others, cognitive status, need for assistance with activities of daily living, prescription drug use, amount of Alzheimer's staff training, and staff-to-resident ratio. Although hierarchical linear modeling minimizes the risk of Type II-false positive-error, this exploratory study also pays special attention to avoiding Type I error-the failure to recognize possible relationships between behavioral health characteristics and independent variables. We found associations between each behavioral health measure and particular environmental design features, as well as between behavioral health measures and both resident and nonenvironmental facility variables. This research demonstrates the potential that environment has for contributing to the improvement of Alzheimer's symptoms. A balanced combination of pharmacologic, behavioral, and environmental approaches is likely to be most effective in improving the health, behavior, and quality of life of people with Alzheimer's disease.

  15. CoNVaQ: a web tool for copy number variation-based association studies.

    PubMed

    Larsen, Simon Jonas; do Canto, Luisa Matos; Rogatto, Silvia Regina; Baumbach, Jan

    2018-05-18

    Copy number variations (CNVs) are large segments of the genome that are duplicated or deleted. Structural variations in the genome have been linked to many complex diseases. Similar to how genome-wide association studies (GWAS) have helped discover single-nucleotide polymorphisms linked to disease phenotypes, the extension of GWAS to CNVs has aided the discovery of structural variants associated with human traits and diseases. We present CoNVaQ, an easy-to-use web-based tool for CNV-based association studies. The web service allows users to upload two sets of CNV segments and search for genomic regions where the occurrence of CNVs is significantly associated with the phenotype. CoNVaQ provides two models: a simple statistical model using Fisher's exact test and a novel query-based model matching regions to user-defined queries. For each region, the method computes a global q-value statistic by repeated permutation of samples among the populations. We demonstrate our platform by using it to analyze a data set of HPV-positive and HPV-negative penile cancer patients. CoNVaQ provides a simple workflow for performing CNV-based association studies. It is made available as a web platform in order to provide a user-friendly workflow for biologists and clinicians to carry out CNV data analysis without installing any software. Through the web interface, users are also able to analyze their results to find overrepresented GO terms and pathways. In addition, our method is also available as a package for the R programming language. CoNVaQ is available at https://convaq.compbio.sdu.dk .

  16. Parameter inference in small world network disease models with approximate Bayesian Computational methods

    NASA Astrophysics Data System (ADS)

    Walker, David M.; Allingham, David; Lee, Heung Wing Joseph; Small, Michael

    2010-02-01

    Small world network models have been effective in capturing the variable behaviour of reported case data of the SARS coronavirus outbreak in Hong Kong during 2003. Simulations of these models have previously been realized using informed “guesses” of the proposed model parameters and tested for consistency with the reported data by surrogate analysis. In this paper we attempt to provide statistically rigorous parameter distributions using Approximate Bayesian Computation sampling methods. We find that such sampling schemes are a useful framework for fitting parameters of stochastic small world network models where simulation of the system is straightforward but expressing a likelihood is cumbersome.

  17. Leading indicators of mosquito-borne disease elimination.

    PubMed

    O'Regan, Suzanne M; Lillie, Jonathan W; Drake, John M

    Mosquito-borne diseases contribute significantly to the global disease burden. High-profile elimination campaigns are currently underway for many parasites, e.g., Plasmodium spp., the causal agent of malaria. Sustaining momentum near the end of elimination programs is often difficult to achieve and consequently quantitative tools that enable monitoring the effectiveness of elimination activities after the initial reduction of cases has occurred are needed. Documenting progress in vector-borne disease elimination is a potentially important application for the theory of critical transitions. Non-parametric approaches that are independent of model-fitting would advance infectious disease forecasting significantly. In this paper, we consider compartmental Ross-McDonald models that are slowly forced through a critical transition through gradually deployed control measures. We derive expressions for the behavior of candidate indicators, including the autocorrelation coefficient, variance, and coefficient of variation in the number of human cases during the approach to elimination. We conducted a simulation study to test the performance of each summary statistic as an early warning system of mosquito-borne disease elimination. Variance and coefficient of variation were highly predictive of elimination but autocorrelation performed poorly as an indicator in some control contexts. Our results suggest that tipping points (bifurcations) in mosquito-borne infectious disease systems may be foreshadowed by characteristic temporal patterns of disease prevalence.

  18. Reproducible detection of disease-associated markers from gene expression data.

    PubMed

    Omae, Katsuhiro; Komori, Osamu; Eguchi, Shinto

    2016-08-18

    Detection of disease-associated markers plays a crucial role in gene screening for biological studies. Two-sample test statistics, such as the t-statistic, are widely used to rank genes based on gene expression data. However, the resultant gene ranking is often not reproducible among different data sets. Such irreproducibility may be caused by disease heterogeneity. When we divided data into two subsets, we found that the signs of the two t-statistics were often reversed. Focusing on such instability, we proposed a sign-sum statistic that counts the signs of the t-statistics for all possible subsets. The proposed method excludes genes affected by heterogeneity, thereby improving the reproducibility of gene ranking. We compared the sign-sum statistic with the t-statistic by a theoretical evaluation of the upper confidence limit. Through simulations and applications to real data sets, we show that the sign-sum statistic exhibits superior performance. We derive the sign-sum statistic for getting a robust gene ranking. The sign-sum statistic gives more reproducible ranking than the t-statistic. Using simulated data sets we show that the sign-sum statistic excludes hetero-type genes well. Also for the real data sets, the sign-sum statistic performs well in a viewpoint of ranking reproducibility.

  19. A surface hydrology model for regional vector borne disease models

    NASA Astrophysics Data System (ADS)

    Tompkins, Adrian; Asare, Ernest; Bomblies, Arne; Amekudzi, Leonard

    2016-04-01

    Small, sun-lit temporary pools that form during the rainy season are important breeding sites for many key mosquito vectors responsible for the transmission of malaria and other diseases. The representation of this surface hydrology in mathematical disease models is challenging, due to their small-scale, dependence on the terrain and the difficulty of setting soil parameters. Here we introduce a model that represents the temporal evolution of the aggregate statistics of breeding sites in a single pond fractional coverage parameter. The model is based on a simple, geometrical assumption concerning the terrain, and accounts for the processes of surface runoff, pond overflow, infiltration and evaporation. Soil moisture, soil properties and large-scale terrain slope are accounted for using a calibration parameter that sets the equivalent catchment fraction. The model is calibrated and then evaluated using in situ pond measurements in Ghana and ultra-high (10m) resolution explicit simulations for a village in Niger. Despite the model's simplicity, it is shown to reproduce the variability and mean of the pond aggregate water coverage well for both locations and validation techniques. Example malaria simulations for Uganda will be shown using this new scheme with a generic calibration setting, evaluated using district malaria case data. Possible methods for implementing regional calibration will be briefly discussed.

  20. Detecting inflammation and fibrosis in bowel wall with photoacoustic imaging in a Crohn's disease animal model

    NASA Astrophysics Data System (ADS)

    Xu, Guan; Johnson, Laura A.; Hu, Jack; Dillman, Jonathan R.; Higgins, Peter D. R.; Wang, Xueding

    2015-03-01

    Crohn's disease (CD) is an autoimmune disease affecting 700,000 people in the United States. This condition may cause obstructing intestinal narrowings (strictures) due to inflammation, fibrosis (deposition of collagen), or a combination of both. Utilizing the unique strong optical absorption of hemoglobin at 532 nm and collagen at 1370 nm, this study investigated the feasibility of non-invasively characterizing intestinal strictures using photoacoustic imaging (PAI). Three normal controls, ten pure inflammation and 9 inflammation plus fibrosis rat bowel wall samples were imaged. Statistical analysis of the PA measurements has shown the capability of discriminating the purely inflammatory from mixed inflammatory and fibrotic strictures.

  1. Hormone replacement therapy is associated with gastro-oesophageal reflux disease: a retrospective cohort study

    PubMed Central

    2012-01-01

    Background Oestrogen and progestogen have the potential to influence gastro-intestinal motility; both are key components of hormone replacement therapy (HRT). Results of observational studies in women taking HRT rely on self-reporting of gastro-oesophageal symptoms and the aetiology of gastro-oesophageal reflux disease (GORD) remains unclear. This study investigated the association between HRT and GORD in menopausal women using validated general practice records. Methods 51,182 menopausal women were identified using the UK General Practice Research Database between 1995–2004. Of these, 8,831 were matched with and without hormone use. Odds ratios (ORs) were calculated for GORD and proton-pump inhibitor (PPI) use in hormone and non-hormone users, adjusting for age, co-morbidities, and co-pharmacy. Results In unadjusted analysis, all forms of hormone use (oestrogen-only, tibolone, combined HRT and progestogen) were statistically significantly associated with GORD. In adjusted models, this association remained statistically significant for oestrogen-only treatment (OR 1.49; 1.18–1.89). Unadjusted analysis showed a statistically significant association between PPI use and oestrogen-only and combined HRT treatment. When adjusted for covariates, oestrogen-only treatment was significant (OR 1.34; 95% CI 1.03–1.74). Findings from the adjusted model demonstrated the greater use of PPI by progestogen users (OR 1.50; 1.01–2.22). Conclusions This first large cohort study of the association between GORD and HRT found a statistically significant association between oestrogen-only hormone and GORD and PPI use. This should be further investigated using prospective follow-up to validate the strength of association and describe its clinical significance. PMID:22642788

  2. Empirical Bayes Estimation of Semi-parametric Hierarchical Mixture Models for Unbiased Characterization of Polygenic Disease Architectures

    PubMed Central

    Nishino, Jo; Kochi, Yuta; Shigemizu, Daichi; Kato, Mamoru; Ikari, Katsunori; Ochi, Hidenori; Noma, Hisashi; Matsui, Kota; Morizono, Takashi; Boroevich, Keith A.; Tsunoda, Tatsuhiko; Matsui, Shigeyuki

    2018-01-01

    Genome-wide association studies (GWAS) suggest that the genetic architecture of complex diseases consists of unexpectedly numerous variants with small effect sizes. However, the polygenic architectures of many diseases have not been well characterized due to lack of simple and fast methods for unbiased estimation of the underlying proportion of disease-associated variants and their effect-size distribution. Applying empirical Bayes estimation of semi-parametric hierarchical mixture models to GWAS summary statistics, we confirmed that schizophrenia was extremely polygenic [~40% of independent genome-wide SNPs are risk variants, most within odds ratio (OR = 1.03)], whereas rheumatoid arthritis was less polygenic (~4 to 8% risk variants, significant portion reaching OR = 1.05 to 1.1). For rheumatoid arthritis, stratified estimations revealed that expression quantitative loci in blood explained large genetic variance, and low- and high-frequency derived alleles were prone to be risk and protective, respectively, suggesting a predominance of deleterious-risk and advantageous-protective mutations. Despite genetic correlation, effect-size distributions for schizophrenia and bipolar disorder differed across allele frequency. These analyses distinguished disease polygenic architectures and provided clues for etiological differences in complex diseases. PMID:29740473

  3. Genetic Factors Affecting Late-Onset Alzheimer's Disease Susceptibility.

    PubMed

    Rezazadeh, Maryam; Khorrami, Aziz; Yeghaneh, Tarlan; Talebi, Mahnaz; Kiani, Seyed Jalal; Heshmati, Yaser; Gharesouran, Jalal

    2016-03-01

    Alzheimer's disease is considered a progressive brain disease in the older population. Late-onset Alzheimer's disease (LOAD) as a multifactorial dementia has a polygenic inheritance. Age, environment, and lifestyle along with a growing number of genetic factors have been reported as risk factors for LOAD. Our aim was to present results of LOAD association studies that have been done in northwestern Iran, and we also explored possible interactions with apolipoprotein E (APOE) status. We re-evaluated the association of these markers in dominant, recessive, and additive models. In all, 160 LOAD and 163 healthy control subjects of Azeri Turkish ethnicity were studied. The Chi-square test with Yates' correction and Fisher's exact test were used for statistical analysis. A Bonferroni-corrected p value, based on the number of statistical tests, was considered significant. Our results confirmed that chemokine receptor type 2 (CCR2), estrogen receptor 1 (ESR1), toll-like receptor 2 (TLR2), tumor necrosis factor alpha (TNF α), APOE, bridging integrator 1 (BIN1), and phosphatidylinositol-binding clathrin assembly protein (PICALM) are LOAD susceptibility loci in Azeri Turk ancestry populations. Among them, variants of CCR2, ESR1, TNF α, and APOE revealed associations in three different genetic models. After adjusting for APOE, the association (both allelic and genotypic) with CCR2, BIN1, and ESRα (PvuII) was evident only among subjects without the APOE ε4, whereas the association with CCR5, without Bonferroni correction, was significant only among subjects carrying the APOE ε4 allele. This result is an evidence of a synergistic and antagonistic effect of APOE on variant associations with LOAD.

  4. Sodium intake prediction with health promotion model constructs in rural hypertensive patients.

    PubMed

    Kamran, Aziz; Sharifirad, Gholamreza; Shafaeei, Yousef; Azadbakht, Leila

    2015-01-01

    Hypertension is the most common cause of cardiovascular disease, and the growing epidemic is a serious warning to pay more attention to this disease. The aims of this study were to examine the relationships between the health promotion model (HPM) constructs and sodium intake, and to determine the predictive power of the HPM constructs as the possible mediators of sodium intake in rural Iranian hypertensive patients. This cross-sectional study was conducted on 671 hypertensive patients in Ardabil, Iran in 2013. The data were obtained during a 25-40 min face-to-face conversation by validated and reliable instruments. The nutritional data were assessed with Nutritionist version 4 (N4) software. Descriptive statistics, Spearman's correlations were calculated using SPSS Statistics version 18.0. Structural equation modeling was conducted using AMOS version 18. Sodium intake was negatively correlated with perceived benefits (r = -0.707; P < 0.01), perceived self-efficacy (r = -0.719; P < 0.01), situational influences (r = -0.590; P < 0.01), interpersonal influences (r = -0.637; P < 0.01), commitment to action (r = -0.605; P < 0.01), affects related behavior (r = -0.499; P < 0.01), and positively associated with the perceived barriers score (r = 0.563; P < 0.01). The structural equation modeling showed that the model explained 63.0% of the variation in sodium intake. HPM constructs were significantly associated with sodium intake and dietary perceptions based on HPM constructs can predict acceptable rate of the variation of sodium intake. Therefore, we suggest using this model constructs to improve the effectiveness of nutritional interventions.

  5. Omnibus risk assessment via accelerated failure time kernel machine modeling.

    PubMed

    Sinnott, Jennifer A; Cai, Tianxi

    2013-12-01

    Integrating genomic information with traditional clinical risk factors to improve the prediction of disease outcomes could profoundly change the practice of medicine. However, the large number of potential markers and possible complexity of the relationship between markers and disease make it difficult to construct accurate risk prediction models. Standard approaches for identifying important markers often rely on marginal associations or linearity assumptions and may not capture non-linear or interactive effects. In recent years, much work has been done to group genes into pathways and networks. Integrating such biological knowledge into statistical learning could potentially improve model interpretability and reliability. One effective approach is to employ a kernel machine (KM) framework, which can capture nonlinear effects if nonlinear kernels are used (Scholkopf and Smola, 2002; Liu et al., 2007, 2008). For survival outcomes, KM regression modeling and testing procedures have been derived under a proportional hazards (PH) assumption (Li and Luan, 2003; Cai, Tonini, and Lin, 2011). In this article, we derive testing and prediction methods for KM regression under the accelerated failure time (AFT) model, a useful alternative to the PH model. We approximate the null distribution of our test statistic using resampling procedures. When multiple kernels are of potential interest, it may be unclear in advance which kernel to use for testing and estimation. We propose a robust Omnibus Test that combines information across kernels, and an approach for selecting the best kernel for estimation. The methods are illustrated with an application in breast cancer. © 2013, The International Biometric Society.

  6. Application of a Lifestyle-Based Tool to Estimate Premature Cardiovascular Disease Events in Young Adults: The Coronary Artery Risk Development in Young Adults (CARDIA) Study.

    PubMed

    Gooding, Holly C; Ning, Hongyan; Gillman, Matthew W; Shay, Christina; Allen, Norrina; Goff, David C; Lloyd-Jones, Donald; Chiuve, Stephanie

    2017-09-01

    Few tools exist for assessing the risk for early atherosclerotic cardiovascular disease (ASCVD) events in young adults. To assess the performance of the Healthy Heart Score (HHS), a lifestyle-based tool that estimates ASCVD events in older adults, for ASCVD events occurring before 55 years of age. This prospective cohort study included 4893 US adults aged 18 to 30 years from the Coronary Artery Risk Development in Young Adults (CARDIA) study. Participants underwent measurement of lifestyle factors from March 25, 1985, through June 7, 1986, and were followed up for a median of 27.1 years (interquartile range, 26.9-27.2 years). Data for this study were analyzed from February 24 through December 12, 2016. The HHS includes age, smoking status, body mass index, alcohol intake, exercise, and a diet score composed of self-reported daily intake of cereal fiber, fruits and/or vegetables, nuts, sugar-sweetened beverages, and red and/or processed meats. The HHS in the CARDIA study was calculated using sex-specific equations produced by its derivation cohorts. The ability of the HHS to assess the 25-year risk for ASCVD (death from coronary heart disease, nonfatal myocardial infarction, and fatal or nonfatal ischemic stroke) in the total sample, in race- and sex-specific subgroups, and in those with and without clinical ASCVD risk factors at baseline. Model discrimination was assessed with the Harrell C statistic; model calibration, with Greenwood-Nam-D'Agostino statistics. The study population of 4893 participants included 2205 men (45.1%) and 2688 women (54.9%) with a mean (SD) age at baseline of 24.8 (3.6) years; 2483 (50.7%) were black; and 427 (8.7%) had at least 1 clinical ASCVD risk factor (hypertension, hyperlipidemia, or diabetes types 1 and 2). Among these participants, 64 premature ASCVD events occurred in women and 99 in men. The HHS showed moderate discrimination for ASCVD risk assessment in this diverse population of mostly healthy young adults (C statistic, 0.71; 95% CI, 0.66-0.76); it performed better in men (C statistic, 0.74; 95% CI, 0.68-0.79) than in women (C statistic, 0.69; 95% CI, 0.62-0.75); in white (C statistic, 0.77; 95% CI, 0.71-0.84) than in black (C statistic, 0.66; 95% CI, 0.60-0.72) participants; and in those without (C statistic, 0.71; 95% CI, 0.66-0.76) vs with (C statistic, 0.64; 95% CI, 0.55-0.73) clinical risk factors at baseline. The HHS was adequately calibrated overall and within each subgroup. The HHS, when measured in younger persons without ASCVD risk factors, performs moderately well in assessing risk for ASCVD events by early middle age. Its reliance on self-reported, modifiable lifestyle factors makes it an attractive tool for risk assessment and counseling for early ASCVD prevention.

  7. Prediction of new onset of end stage renal disease in Chinese patients with type 2 diabetes mellitus - a population-based retrospective cohort study.

    PubMed

    Wan, Eric Yuk Fai; Fong, Daniel Yee Tak; Fung, Colman Siu Cheung; Yu, Esther Yee Tak; Chin, Weng Yee; Chan, Anca Ka Chun; Lam, Cindy Lo Kuen

    2017-08-01

    Since diabetes mellitus (DM) is the leading cause of end stage renal disease (ESRD), this study aimed to develop a 5-year ESRD risk prediction model among Chinese patients with Type 2 DM (T2DM) in primary care. A retrospective cohort study was conducted on 149,333 Chinese adult T2DM primary care patients without ESRD in 2010. Using the derivation cohort over a median of 5 years follow-up, the gender-specific models including the interaction effect between predictors and age were derived using Cox regression with a forward stepwise approach. Harrell's C-statistic and calibration plot were applied to the validation cohort to assess discrimination and calibration of the models. Prediction models showed better discrimination with Harrell's C-statistics of 0.866 (males) and 0.862 (females) and calibration power from the plots than other established models. The predictors included age, usages of anti-hypertensive drugs, anti-glucose drugs, and Hemogloblin A1c, blood pressure, urine albumin/creatinine ratio (ACR) and estimated glomerular filtration rate (eGFR). Specific predictors for male were smoking and presence of sight threatening diabetic retinopathy while additional predictors for female included longer duration of diabetes and quadratic effect of body mass index. Interaction factors with age showed a greater weighting of insulin and urine ACR in younger males, and eGFR in younger females. Our newly developed gender-specific models provide a more accurate 5-year ESRD risk predictions for Chinese diabetic primary care patients than other existing models. The models included several modifiable risk factors that clinicians can use to counsel patients, and to target at in the delivery of care to patients.

  8. (Draft) Community air pollution and mortality: Analysis of 1980 data from US metropolitan areas

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lipfert, F.W.

    1992-11-01

    1980 data from up to 149 metropolitan areas were used to define cross-sectional associations between community air pollution and ``excess`` human mortality. The regression model proposed by Ozkaynak and Thurston (1987), which accounted for age, race, education, poverty, and population density, was evaluated and several new models were developed. The new models also accounted for migration, drinking water hardness, and smoking, and included a more detailed description of race. Cause-of-death categories analyzed include all causes, all ``non-external`` causes, major cardiovascular diseases, and chronic obstructive pulmonary diseases (COPD). Both annual mortality rates and their logarithms were analyzed. Air quality data weremore » obtained from the EPA AIRS database (TSP, SO{sub 4}{sup =}, Mn, and ozone) and from the inhalable particulate network (PM{sub 15}, PM{sub 2.5} and SO{sub 4}{sup =}, for 63{sup 4} locations). The data on particulates were averaged across all monitoring stations available for each SMSA and the TSP data were restricted to the year 1980. The associations between mortality and air pollution were found to be dependent on the socioeconomic factors included in the models, the specific locations included in the data set, and the type of statistical model used. Statistically significant associations were found as follows: between TSP and mortality due to non-external causes with log-linear models, but not with a linear model betweenestimated 10-year average (1980--90) ozone levels and 1980 non-external and cardiovascular deaths; and between TSP and COPD mortality for both linear and log-linear models. When the sulfate contribution to TSP was subtracted, the relationship with COPD mortality was strengthened.« less

  9. (Draft) Community air pollution and mortality: Analysis of 1980 data from US metropolitan areas

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lipfert, F.W.

    1992-11-01

    1980 data from up to 149 metropolitan areas were used to define cross-sectional associations between community air pollution and excess'' human mortality. The regression model proposed by Ozkaynak and Thurston (1987), which accounted for age, race, education, poverty, and population density, was evaluated and several new models were developed. The new models also accounted for migration, drinking water hardness, and smoking, and included a more detailed description of race. Cause-of-death categories analyzed include all causes, all non-external'' causes, major cardiovascular diseases, and chronic obstructive pulmonary diseases (COPD). Both annual mortality rates and their logarithms were analyzed. Air quality data weremore » obtained from the EPA AIRS database (TSP, SO[sub 4][sup =], Mn, and ozone) and from the inhalable particulate network (PM[sub 15], PM[sub 2.5] and SO[sub 4][sup =], for 63[sup 4] locations). The data on particulates were averaged across all monitoring stations available for each SMSA and the TSP data were restricted to the year 1980. The associations between mortality and air pollution were found to be dependent on the socioeconomic factors included in the models, the specific locations included in the data set, and the type of statistical model used. Statistically significant associations were found as follows: between TSP and mortality due to non-external causes with log-linear models, but not with a linear model betweenestimated 10-year average (1980--90) ozone levels and 1980 non-external and cardiovascular deaths; and between TSP and COPD mortality for both linear and log-linear models. When the sulfate contribution to TSP was subtracted, the relationship with COPD mortality was strengthened.« less

  10. [Suicide due to mental diseases based on the Vital Statistics Survey Death Form].

    PubMed

    Takizawa, Tohru

    2012-06-01

    Mental diseases such as schizophrenia and depression put patients at risk for suicide. It is extremely important to understand that one way of preventing suicide is to determine the actual mental state of the individual. The purpose of this study was to analyze the true mental state of suicide victims reported in the vital statistics. This study investigated the vital statistics of 30,299 suicide victims in Japan in 2008. The use of these basic statistics for non-statistical purposes was approved by the Japanese Ministry of Health, Labour and Welfare. The method involved reviewing the Vital Statistics Survey Death Form at the Ministry of Health, Labour and Welfare as well as analyzing their Online Reporting of Vital Statistics. Furthermore, this study was able to validate 29,799 of the 30,299 suicides (98.3%) that occurred in 2008. Mental diseases were validated not only from the "Cause of death" section as marked on the death certificate, but also by information found in sections for "Additional items for death by external cause" and "Other special remarks." RESULTS; From the Vital Statistics Survey Death Form and Online Reporting of Vital Statistics, 2964 individuals with either a mental disease or mental disorder were identified. Of the 2964 identified individuals, 55 had dementia (of which 13 were dementia in Alzheimer's disease), 116 had alcohol dependence/psychotic disorder, 550 had schizophrenia, 101 had bipolar affective disorder, 1,913 has had a depressive episode, 13 had obsessive-compulsive disorder, 22 had adjustment disorders, 14 had eating disorders, 49 had nonorganic sleep disorders, 24 had personality disorder, and 6 had pervasive developmental disorders. In addition, 125 individuals had more than one mental disease. The national police statistics from 2008 show that 1,368 suicide victims had schizophrenia and 6,490 had depression. These figures show quite a difference between the results of this study and the police statistics. Further, there have been controversies regarding autopsies of suicide victims. Thus, further investigation into the cause of death is of great importance.

  11. Drug-disease modeling in the pharmaceutical industry - where mechanistic systems pharmacology and statistical pharmacometrics meet.

    PubMed

    Helmlinger, Gabriel; Al-Huniti, Nidal; Aksenov, Sergey; Peskov, Kirill; Hallow, Karen M; Chu, Lulu; Boulton, David; Eriksson, Ulf; Hamrén, Bengt; Lambert, Craig; Masson, Eric; Tomkinson, Helen; Stanski, Donald

    2017-11-15

    Modeling & simulation (M&S) methodologies are established quantitative tools, which have proven to be useful in supporting the research, development (R&D), regulatory approval, and marketing of novel therapeutics. Applications of M&S help design efficient studies and interpret their results in context of all available data and knowledge to enable effective decision-making during the R&D process. In this mini-review, we focus on two sets of modeling approaches: population-based models, which are well-established within the pharmaceutical industry today, and fall under the discipline of clinical pharmacometrics (PMX); and systems dynamics models, which encompass a range of models of (patho-)physiology amenable to pharmacological intervention, of signaling pathways in biology, and of substance distribution in the body (today known as physiologically-based pharmacokinetic models) - which today may be collectively referred to as quantitative systems pharmacology models (QSP). We next describe the convergence - or rather selected integration - of PMX and QSP approaches into 'middle-out' drug-disease models, which retain selected mechanistic aspects, while remaining parsimonious, fit-for-purpose, and able to address variability and the testing of covariates. We further propose development opportunities for drug-disease systems models, to increase their utility and applicability throughout the preclinical and clinical spectrum of pharmaceutical R&D. Copyright © 2017 Elsevier B.V. All rights reserved.

  12. Bayesian Tracking of Emerging Epidemics Using Ensemble Optimal Statistical Interpolation

    PubMed Central

    Cobb, Loren; Krishnamurthy, Ashok; Mandel, Jan; Beezley, Jonathan D.

    2014-01-01

    We present a preliminary test of the Ensemble Optimal Statistical Interpolation (EnOSI) method for the statistical tracking of an emerging epidemic, with a comparison to its popular relative for Bayesian data assimilation, the Ensemble Kalman Filter (EnKF). The spatial data for this test was generated by a spatial susceptible-infectious-removed (S-I-R) epidemic model of an airborne infectious disease. Both tracking methods in this test employed Poisson rather than Gaussian noise, so as to handle epidemic data more accurately. The EnOSI and EnKF tracking methods worked well on the main body of the simulated spatial epidemic, but the EnOSI was able to detect and track a distant secondary focus of infection that the EnKF missed entirely. PMID:25113590

  13. Spatial clustering of average risks and risk trends in Bayesian disease mapping.

    PubMed

    Anderson, Craig; Lee, Duncan; Dean, Nema

    2017-01-01

    Spatiotemporal disease mapping focuses on estimating the spatial pattern in disease risk across a set of nonoverlapping areal units over a fixed period of time. The key aim of such research is to identify areas that have a high average level of disease risk or where disease risk is increasing over time, thus allowing public health interventions to be focused on these areas. Such aims are well suited to the statistical approach of clustering, and while much research has been done in this area in a purely spatial setting, only a handful of approaches have focused on spatiotemporal clustering of disease risk. Therefore, this paper outlines a new modeling approach for clustering spatiotemporal disease risk data, by clustering areas based on both their mean risk levels and the behavior of their temporal trends. The efficacy of the methodology is established by a simulation study, and is illustrated by a study of respiratory disease risk in Glasgow, Scotland. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Environmental risk of leptospirosis infections in the Netherlands: Spatial modelling of environmental risk factors of leptospirosis in the Netherlands.

    PubMed

    Rood, Ente J J; Goris, Marga G A; Pijnacker, Roan; Bakker, Mirjam I; Hartskeerl, Rudy A

    2017-01-01

    Leptospirosis is a globally emerging zoonotic disease, associated with various climatic, biotic and abiotic factors. Mapping and quantifying geographical variations in the occurrence of leptospirosis and the surrounding environment offer innovative methods to study disease transmission and to identify associations between the disease and the environment. This study aims to investigate geographic variations in leptospirosis incidence in the Netherlands and to identify associations with environmental factors driving the emergence of the disease. Individual case data derived over the period 1995-2012 in the Netherlands were geocoded and aggregated by municipality. Environmental covariate data were extracted for each municipality and stored in a spatial database. Spatial clusters were identified using kernel density estimations and quantified using local autocorrelation statistics. Associations between the incidence of leptospirosis and the local environment were determined using Simultaneous Autoregressive Models (SAR) explicitly modelling spatial dependence of the model residuals. Leptospirosis incidence rates were found to be spatially clustered, showing a marked spatial pattern. Fitting a spatial autoregressive model significantly improved model fit and revealed significant association between leptospirosis and the coverage of arable land, built up area, grassland and sabulous clay soils. The incidence of leptospirosis in the Netherlands could effectively be modelled using a combination of soil and land-use variables accounting for spatial dependence of incidence rates per municipality. The resulting spatially explicit risk predictions provide an important source of information which will benefit clinical awareness on potential leptospirosis infections in endemic areas.

  15. Environmental risk of leptospirosis infections in the Netherlands: Spatial modelling of environmental risk factors of leptospirosis in the Netherlands

    PubMed Central

    Goris, Marga G. A.; Pijnacker, Roan; Bakker, Mirjam I.; Hartskeerl, Rudy A.

    2017-01-01

    Leptospirosis is a globally emerging zoonotic disease, associated with various climatic, biotic and abiotic factors. Mapping and quantifying geographical variations in the occurrence of leptospirosis and the surrounding environment offer innovative methods to study disease transmission and to identify associations between the disease and the environment. This study aims to investigate geographic variations in leptospirosis incidence in the Netherlands and to identify associations with environmental factors driving the emergence of the disease. Individual case data derived over the period 1995–2012 in the Netherlands were geocoded and aggregated by municipality. Environmental covariate data were extracted for each municipality and stored in a spatial database. Spatial clusters were identified using kernel density estimations and quantified using local autocorrelation statistics. Associations between the incidence of leptospirosis and the local environment were determined using Simultaneous Autoregressive Models (SAR) explicitly modelling spatial dependence of the model residuals. Leptospirosis incidence rates were found to be spatially clustered, showing a marked spatial pattern. Fitting a spatial autoregressive model significantly improved model fit and revealed significant association between leptospirosis and the coverage of arable land, built up area, grassland and sabulous clay soils. The incidence of leptospirosis in the Netherlands could effectively be modelled using a combination of soil and land-use variables accounting for spatial dependence of incidence rates per municipality. The resulting spatially explicit risk predictions provide an important source of information which will benefit clinical awareness on potential leptospirosis infections in endemic areas. PMID:29065186

  16. Evaluating the performance of infectious disease forecasts: A comparison of climate-driven and seasonal dengue forecasts for Mexico.

    PubMed

    Johansson, Michael A; Reich, Nicholas G; Hota, Aditi; Brownstein, John S; Santillana, Mauricio

    2016-09-26

    Dengue viruses, which infect millions of people per year worldwide, cause large epidemics that strain healthcare systems. Despite diverse efforts to develop forecasting tools including autoregressive time series, climate-driven statistical, and mechanistic biological models, little work has been done to understand the contribution of different components to improved prediction. We developed a framework to assess and compare dengue forecasts produced from different types of models and evaluated the performance of seasonal autoregressive models with and without climate variables for forecasting dengue incidence in Mexico. Climate data did not significantly improve the predictive power of seasonal autoregressive models. Short-term and seasonal autocorrelation were key to improving short-term and long-term forecasts, respectively. Seasonal autoregressive models captured a substantial amount of dengue variability, but better models are needed to improve dengue forecasting. This framework contributes to the sparse literature of infectious disease prediction model evaluation, using state-of-the-art validation techniques such as out-of-sample testing and comparison to an appropriate reference model.

  17. Evaluating the performance of infectious disease forecasts: A comparison of climate-driven and seasonal dengue forecasts for Mexico

    PubMed Central

    Johansson, Michael A.; Reich, Nicholas G.; Hota, Aditi; Brownstein, John S.; Santillana, Mauricio

    2016-01-01

    Dengue viruses, which infect millions of people per year worldwide, cause large epidemics that strain healthcare systems. Despite diverse efforts to develop forecasting tools including autoregressive time series, climate-driven statistical, and mechanistic biological models, little work has been done to understand the contribution of different components to improved prediction. We developed a framework to assess and compare dengue forecasts produced from different types of models and evaluated the performance of seasonal autoregressive models with and without climate variables for forecasting dengue incidence in Mexico. Climate data did not significantly improve the predictive power of seasonal autoregressive models. Short-term and seasonal autocorrelation were key to improving short-term and long-term forecasts, respectively. Seasonal autoregressive models captured a substantial amount of dengue variability, but better models are needed to improve dengue forecasting. This framework contributes to the sparse literature of infectious disease prediction model evaluation, using state-of-the-art validation techniques such as out-of-sample testing and comparison to an appropriate reference model. PMID:27665707

  18. Integration of Spatial and Social Network Analysis in Disease Transmission Studies.

    PubMed

    Emch, Michael; Root, Elisabeth D; Giebultowicz, Sophia; Ali, Mohammad; Perez-Heydrich, Carolina; Yunus, Mohammad

    2012-01-01

    This study presents a case study of how social network and spatial analytical methods can be used simultaneously for disease transmission modeling. The paper first reviews strategies employed in previous studies and then offers the example of transmission of two bacterial diarrheal diseases in rural Bangladesh. The goal is to understand how diseases vary socially above and beyond the effects of the local neighborhood context. Patterns of cholera and shigellosis incidence are analyzed in space and within kinship-based social networks in Matlab, Bangladesh. Data include a spatially referenced longitudinal demographic database which consists of approximately 200,000 people and laboratory-confirmed cholera and shigellosis cases from 1983 to 2003. Matrices are created of kinship ties between households using a complete network design and distance matrices are also created to model spatial relationships. Moran's I statistics are calculated to measure clustering within both social and spatial matrices. Combined spatial effects-spatial disturbance models are built to simultaneously analyze spatial and social effects while controlling for local environmental context. Results indicate that cholera and shigellosis always clusters in space and only sometimes within social networks. This suggests that the local environment is most important for understanding transmission of both diseases however kinship-based social networks also influence their transmission. Simultaneous spatial and social network analysis can help us better understand disease transmission and this study has offered several strategies on how.

  19. Integration of Spatial and Social Network Analysis in Disease Transmission Studies

    PubMed Central

    Root, Elisabeth D; Giebultowicz, Sophia; Ali, Mohammad; Perez-Heydrich, Carolina; Yunus, Mohammad

    2013-01-01

    This study presents a case study of how social network and spatial analytical methods can be used simultaneously for disease transmission modeling. The paper first reviews strategies employed in previous studies and then offers the example of transmission of two bacterial diarrheal diseases in rural Bangladesh. The goal is to understand how diseases vary socially above and beyond the effects of the local neighborhood context. Patterns of cholera and shigellosis incidence are analyzed in space and within kinship-based social networks in Matlab, Bangladesh. Data include a spatially referenced longitudinal demographic database which consists of approximately 200,000 people and laboratory-confirmed cholera and shigellosis cases from 1983 to 2003. Matrices are created of kinship ties between households using a complete network design and distance matrices are also created to model spatial relationships. Moran's I statistics are calculated to measure clustering within both social and spatial matrices. Combined spatial effects-spatial disturbance models are built to simultaneously analyze spatial and social effects while controlling for local environmental context. Results indicate that cholera and shigellosis always clusters in space and only sometimes within social networks. This suggests that the local environment is most important for understanding transmission of both diseases however kinship-based social networks also influence their transmission. Simultaneous spatial and social network analysis can help us better understand disease transmission and this study has offered several strategies on how. PMID:24163443

  20. Evaluating the Appropriateness of Downscaled Climate Information for Projecting Risks of Salmonella.

    PubMed

    Guentchev, Galina S; Rood, Richard B; Ammann, Caspar M; Barsugli, Joseph J; Ebi, Kristie; Berrocal, Veronica; O'Neill, Marie S; Gronlund, Carina J; Vigh, Jonathan L; Koziol, Ben; Cinquini, Luca

    2016-02-29

    Foodborne diseases have large economic and societal impacts worldwide. To evaluate how the risks of foodborne diseases might change in response to climate change, credible and usable climate information tailored to the specific application question is needed. Global Climate Model (GCM) data generally need to, both, be downscaled to the scales of the application to be usable, and represent, well, the key characteristics that inflict health impacts. This study presents an evaluation of temperature-based heat indices for the Washington D.C. area derived from statistically downscaled GCM simulations for 1971-2000--a necessary step in establishing the credibility of these data. The indices approximate high weekly mean temperatures linked previously to occurrences of Salmonella infections. Due to bias-correction, included in the Asynchronous Regional Regression Model (ARRM) and the Bias Correction Constructed Analogs (BCCA) downscaling methods, the observed 30-year means of the heat indices were reproduced reasonably well. In April and May, however, some of the statistically downscaled data misrepresent the increase in the number of hot days towards the summer months. This study demonstrates the dependence of the outcomes to the selection of downscaled climate data and the potential for misinterpretation of future estimates of Salmonella infections.

  1. Spreading of diseases through comorbidity networks across life and gender

    NASA Astrophysics Data System (ADS)

    Chmiel, Anna; Klimek, Peter; Thurner, Stefan

    2014-11-01

    The state of health of patients is typically not characterized by a single disease alone but by multiple (comorbid) medical conditions. These comorbidities may depend strongly on age and gender. We propose a specific phenomenological comorbidity network of human diseases that is based on medical claims data of the entire population of Austria. The network is constructed from a two-layer multiplex network, where in one layer the links represent the conditional probability for a comorbidity, and in the other the links contain the respective statistical significance. We show that the network undergoes dramatic structural changes across the lifetime of patients. Disease networks for children consist of a single, strongly interconnected cluster. During adolescence and adulthood further disease clusters emerge that are related to specific classes of diseases, such as circulatory, mental, or genitourinary disorders. For people over 65 these clusters start to merge, and highly connected hubs dominate the network. These hubs are related to hypertension, chronic ischemic heart diseases, and chronic obstructive pulmonary diseases. We introduce a simple diffusion model to understand the spreading of diseases on the disease network at the population level. For the first time we are able to show that patients predominantly develop diseases that are in close network proximity to disorders that they already suffer. The model explains more than 85% of the variance of all disease incidents in the population. The presented methodology could be of importance for anticipating age-dependent disease profiles for entire populations, and for design and validation of prevention strategies.

  2. Health Monitors for Chronic Disease by Gait Analysis with Mobile Phones

    PubMed Central

    Juen, Joshua; Cheng, Qian; Prieto-Centurion, Valentin; Krishnan, Jerry A.

    2014-01-01

    Abstract We have developed GaitTrack, a phone application to detect health status while the smartphone is carried normally. GaitTrack software monitors walking patterns, using only accelerometers embedded in phones to record spatiotemporal motion, without the need for sensors external to the phone. Our software transforms smartphones into health monitors, using eight parameters of phone motion transformed into body motion by the gait model. GaitTrack is designed to detect health status while the smartphone is carried during normal activities, namely, free-living walking. The current method for assessing free-living walking is medical accelerometers, so we present evidence that mobile phones running our software are more accurate. We then show our gait model is more accurate than medical pedometers for counting steps of patients with chronic disease. Our gait model was evaluated in a pilot study involving 30 patients with chronic lung disease. The six-minute walk test (6MWT) is a major assessment for chronic heart and lung disease, including congestive heart failure and especially chronic obstructive pulmonary disease (COPD), affecting millions of persons. The 6MWT consists of walking back and forth along a measured distance for 6 minutes. The gait model using linear regression performed with 94.13% accuracy in measuring walk distance, compared with the established standard of direct observation. We also evaluated a different statistical model using the same gait parameters to predict health status through lung function. This gait model has high accuracy when applied to demographic cohorts, for example, 89.22% accuracy testing the cohort of 12 female patients with ages 50–64 years. PMID:24694291

  3. University of North Carolina Caries Risk Assessment Study: comparisons of high risk prediction, any risk prediction, and any risk etiologic models.

    PubMed

    Beck, J D; Weintraub, J A; Disney, J A; Graves, R C; Stamm, J W; Kaste, L M; Bohannan, H M

    1992-12-01

    The purpose of this analysis is to compare three different statistical models for predicting children likely to be at risk of developing dental caries over a 3-yr period. Data are based on 4117 children who participated in the University of North Carolina Caries Risk Assessment Study, a longitudinal study conducted in the Aiken, South Carolina, and Portland, Maine areas. The three models differed with respect to either the types of variables included or the definition of disease outcome. The two "Prediction" models included both risk factor variables thought to cause dental caries and indicator variables that are associated with dental caries, but are not thought to be causal for the disease. The "Etiologic" model included only etiologic factors as variables. A dichotomous outcome measure--none or any 3-yr increment, was used in the "Any Risk Etiologic model" and the "Any Risk Prediction Model". Another outcome, based on a gradient measure of disease, was used in the "High Risk Prediction Model". The variables that are significant in these models vary across grades and sites, but are more consistent among the Etiologic model than the Predictor models. However, among the three sets of models, the Any Risk Prediction Models have the highest sensitivity and positive predictive values, whereas the High Risk Prediction Models have the highest specificity and negative predictive values. Considerations in determining model preference are discussed.

  4. Modelling seasonal effects of temperature and precipitation on honey bee winter mortality in a temperate climate.

    PubMed

    Switanek, Matthew; Crailsheim, Karl; Truhetz, Heimo; Brodschneider, Robert

    2017-02-01

    Insect pollinators are essential to global food production. For this reason, it is alarming that honey bee (Apis mellifera) populations across the world have recently seen increased rates of mortality. These changes in colony mortality are often ascribed to one or more factors including parasites, diseases, pesticides, nutrition, habitat dynamics, weather and/or climate. However, the effect of climate on colony mortality has never been demonstrated. Therefore, in this study, we focus on longer-term weather conditions and/or climate's influence on honey bee winter mortality rates across Austria. Statistical correlations between monthly climate variables and winter mortality rates were investigated. Our results indicate that warmer and drier weather conditions in the preceding year were accompanied by increased winter mortality. We subsequently built a statistical model to predict colony mortality using temperature and precipitation data as predictors. Our model reduces the mean absolute error between predicted and observed colony mortalities by 9% and is statistically significant at the 99.9% confidence level. This is the first study to show clear evidence of a link between climate variability and honey bee winter mortality. Copyright © 2016 British Geological Survey, NERC. Published by Elsevier B.V. All rights reserved.

  5. A Statistical Analysis of Brain Morphology Using Wild Bootstrapping

    PubMed Central

    Ibrahim, Joseph G.; Tang, Niansheng; Rowe, Daniel B.; Hao, Xuejun; Bansal, Ravi; Peterson, Bradley S.

    2008-01-01

    Methods for the analysis of brain morphology, including voxel-based morphology and surface-based morphometries, have been used to detect associations between brain structure and covariates of interest, such as diagnosis, severity of disease, age, IQ, and genotype. The statistical analysis of morphometric measures usually involves two statistical procedures: 1) invoking a statistical model at each voxel (or point) on the surface of the brain or brain subregion, followed by mapping test statistics (e.g., t test) or their associated p values at each of those voxels; 2) correction for the multiple statistical tests conducted across all voxels on the surface of the brain region under investigation. We propose the use of new statistical methods for each of these procedures. We first use a heteroscedastic linear model to test the associations between the morphological measures at each voxel on the surface of the specified subregion (e.g., cortical or subcortical surfaces) and the covariates of interest. Moreover, we develop a robust test procedure that is based on a resampling method, called wild bootstrapping. This procedure assesses the statistical significance of the associations between a measure of given brain structure and the covariates of interest. The value of this robust test procedure lies in its computationally simplicity and in its applicability to a wide range of imaging data, including data from both anatomical and functional magnetic resonance imaging (fMRI). Simulation studies demonstrate that this robust test procedure can accurately control the family-wise error rate. We demonstrate the application of this robust test procedure to the detection of statistically significant differences in the morphology of the hippocampus over time across gender groups in a large sample of healthy subjects. PMID:17649909

  6. Can hepatic resection provide a long-term cure for patients with intrahepatic cholangiocarcinoma?

    PubMed

    Spolverato, Gaya; Vitale, Alessandro; Cucchetti, Alessandro; Popescu, Irinel; Marques, Hugo P; Aldrighetti, Luca; Gamblin, T Clark; Maithel, Shishir K; Sandroussi, Charbel; Bauer, Todd W; Shen, Feng; Poultsides, George A; Marsh, J Wallis; Pawlik, Timothy M

    2015-11-15

    A patient can be considered statistically cured from a specific disease when their mortality rate returns to the same level as that of the general population. In the current study, the authors sought to assess the probability of being statistically cured from intrahepatic cholangiocarcinoma (ICC) by hepatic resection. A total of 584 patients who underwent surgery with curative intent for ICC between 1990 and 2013 at 1 of 12 participating institutions were identified. A nonmixture cure model was adopted to compare mortality after hepatic resection with the mortality expected for the general population matched by sex and age. The median, 1-year, 3-year, and 5-year disease-free survival was 10 months, 44%, 18%, and 11%, respectively; the corresponding overall survival was 27 months, 75%, 37%, and 22%, respectively. The probability of being cured of ICC was 9.7% (95% confidence interval, 6.1%-13.4%). The mortality of patients undergoing surgery for ICC was higher than that of the general population until year 10, at which time patients alive without tumor recurrence can be considered cured with 99% certainty. Multivariate analysis demonstrated that cure probabilities ranged from 25.8% (time to cure, 9.8 years) in patients with a single, well-differentiated ICC measuring ≤5 cm that was without vascular/periductal invasion and lymph nodes metastases versus <0.1% (time to cure, 12.6 years) among patients with all 6 of these risk factors. A model with which to calculate cure fraction and time to cure was developed. The cure model indicated that statistical cure was possible in patients undergoing hepatic resection for ICC. The overall probability of cure was approximately 10% and varied based on several tumor-specific factors. Cancer 2015;121:3998-4006. © 2015 American Cancer Society. © 2015 American Cancer Society.

  7. Statistical analyses of the relative risk.

    PubMed Central

    Gart, J J

    1979-01-01

    Let P1 be the probability of a disease in one population and P2 be the probability of a disease in a second population. The ratio of these quantities, R = P1/P2, is termed the relative risk. We consider first the analyses of the relative risk from retrospective studies. The relation between the relative risk and the odds ratio (or cross-product ratio) is developed. The odds ratio can be considered a parameter of an exponential model possessing sufficient statistics. This permits the development of exact significance tests and confidence intervals in the conditional space. Unconditional tests and intervals are also considered briefly. The consequences of misclassification errors and ignoring matching or stratifying are also considered. The various methods are extended to combination of results over the strata. Examples of case-control studies testing the association between HL-A frequencies and cancer illustrate the techniques. The parallel analyses of prospective studies are given. If P1 and P2 are small with large samples sizes the appropriate model is a Poisson distribution. This yields a exponential model with sufficient statistics. Exact conditional tests and confidence intervals can then be developed. Here we consider the case where two populations are compared adjusting for sex differences as well as for the strata (or covariate) differences such as age. The methods are applied to two examples: (1) testing in the two sexes the ratio of relative risks of skin cancer in people living in different latitudes, and (2) testing over time the ratio of the relative risks of cancer in two cities, one of which fluoridated its drinking water and one which did not. PMID:540589

  8. Incidence rates of occupational diseases in the Dutch construction sector, 2010-2014.

    PubMed

    van der Molen, Henk F; de Vries, Sanne C; Stocks, S Jill; Warning, Jan; Frings-Dresen, Monique H W

    2016-05-01

    To estimate incidence and trends in incidence of occupational diseases (ODs) in the Dutch construction sector. In a dynamic prospective cohort over a 5-year period (2010-2014), ODs assessed by occupational physicians (OPs) participating in a voluntary construction workers health surveillance (WHS) were reported to the Netherlands Centre for Occupational Diseases (NCOD). ODs were defined as a disease with a specific clinical diagnosis (International Classification of Diseases) that was predominantly caused by work-related factors as assessed by an OP. Annual incidences were determined for the total number of ODs and six frequently occurring OD groups. Trends in incidence were estimated using a multilevel negative binominal regression model. In 2014 the incidence of all OD was 12 964 per 100 000 workers and there was no significant change in incidence between 2010 and 2014 (3%; 95% CI -2% to +9%). Hearing loss (8125 per 100 000 workers) and musculoskeletal disorders (2081 per 100 000 workers) were the most frequently occurring ODs. Noise-induced hearing loss (+7%; 95% CI 1% to 13%) and contact dermatitis (+19%; 95% CI 6% to 33%) showed increasing trends. There was no statistically significant change in the incidence of low back pain, arthrosis, repetitive strain injuries, distress/burnout and chronic obstructive pulmonary disease/asthma. In total, 13% of workers participating in WHS in the Dutch construction industry during 2014 had an OD diagnosed and reported by an OP. Over a 5-year period the annual incidence of reported ODs showed a statistically non-significant increase. Incidences in noise-induced hearing loss and contact dermatitis showed statistically significant increasing trends, 7% and 19%, respectively. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/

  9. Asthma in Adults Fact Sheet

    MedlinePlus

    ... American Lung Association Epidemiology and Statistics Unit using SPSS software. Center for Disease Control and Prevention. Behavioral ... American Lung Association Epidemiology and Statistics Unit using SPSS software. Centers for Disease Control and Prevention. National ...

  10. Systematic drug repositioning for a wide range of diseases with integrative analyses of phenotypic and molecular data.

    PubMed

    Iwata, Hiroaki; Sawada, Ryusuke; Mizutani, Sayaka; Yamanishi, Yoshihiro

    2015-02-23

    Drug repositioning, or the application of known drugs to new indications, is a challenging issue in pharmaceutical science. In this study, we developed a new computational method to predict unknown drug indications for systematic drug repositioning in a framework of supervised network inference. We defined a descriptor for each drug-disease pair based on the phenotypic features of drugs (e.g., medicinal effects and side effects) and various molecular features of diseases (e.g., disease-causing genes, diagnostic markers, disease-related pathways, and environmental factors) and constructed a statistical model to predict new drug-disease associations for a wide range of diseases in the International Classification of Diseases. Our results show that the proposed method outperforms previous methods in terms of accuracy and applicability, and its performance does not depend on drug chemical structure similarity. Finally, we performed a comprehensive prediction of a drug-disease association network consisting of 2349 drugs and 858 diseases and described biologically meaningful examples of newly predicted drug indications for several types of cancers and nonhereditary diseases.

  11. Coupled disease-behavior dynamics on complex networks: A review.

    PubMed

    Wang, Zhen; Andrews, Michael A; Wu, Zhi-Xi; Wang, Lin; Bauch, Chris T

    2015-12-01

    It is increasingly recognized that a key component of successful infection control efforts is understanding the complex, two-way interaction between disease dynamics and human behavioral and social dynamics. Human behavior such as contact precautions and social distancing clearly influence disease prevalence, but disease prevalence can in turn alter human behavior, forming a coupled, nonlinear system. Moreover, in many cases, the spatial structure of the population cannot be ignored, such that social and behavioral processes and/or transmission of infection must be represented with complex networks. Research on studying coupled disease-behavior dynamics in complex networks in particular is growing rapidly, and frequently makes use of analysis methods and concepts from statistical physics. Here, we review some of the growing literature in this area. We contrast network-based approaches to homogeneous-mixing approaches, point out how their predictions differ, and describe the rich and often surprising behavior of disease-behavior dynamics on complex networks, and compare them to processes in statistical physics. We discuss how these models can capture the dynamics that characterize many real-world scenarios, thereby suggesting ways that policy makers can better design effective prevention strategies. We also describe the growing sources of digital data that are facilitating research in this area. Finally, we suggest pitfalls which might be faced by researchers in the field, and we suggest several ways in which the field could move forward in the coming years. Copyright © 2015 Elsevier B.V. All rights reserved.

  12. [Construction of haplotype and haplotype block based on tag single nucleotide polymorphisms and their applications in association studies].

    PubMed

    Gu, Ming-liang; Chu, Jia-you

    2007-12-01

    Human genome has structures of haplotype and haplotype block which provide valuable information on human evolutionary history and may lead to the development of more efficient strategies to identify genetic variants that increase susceptibility to complex diseases. Haplotype block can be divided into discrete blocks of limited haplotype diversity. In each block, a small fraction of ptag SNPsq can be used to distinguish a large fraction of the haplotypes. These tag SNPs can be potentially useful for construction of haplotype and haplotype block, and association studies in complex diseases. There are two general classes of methods to construct haplotype and haplotype blocks based on genotypes on large pedigrees and statistical algorithms respectively. The author evaluate several construction methods to assess the power of different association tests with a variety of disease models and block-partitioning criteria. The advantages, limitations and applications of each method and the application in the association studies are discussed equitably. With the completion of the HapMap and development of statistical algorithms for addressing haplotype reconstruction, ideas of construction of haplotype based on combination of mathematics, physics, and computer science etc will have profound impacts on population genetics, location and cloning for susceptible genes in complex diseases, and related domain with life science etc.

  13. A dynamic spatio-temporal model for spatial data

    USGS Publications Warehouse

    Hefley, Trevor J.; Hooten, Mevin B.; Hanks, Ephraim M.; Russell, Robin; Walsh, Daniel P.

    2017-01-01

    Analyzing spatial data often requires modeling dependencies created by a dynamic spatio-temporal data generating process. In many applications, a generalized linear mixed model (GLMM) is used with a random effect to account for spatial dependence and to provide optimal spatial predictions. Location-specific covariates are often included as fixed effects in a GLMM and may be collinear with the spatial random effect, which can negatively affect inference. We propose a dynamic approach to account for spatial dependence that incorporates scientific knowledge of the spatio-temporal data generating process. Our approach relies on a dynamic spatio-temporal model that explicitly incorporates location-specific covariates. We illustrate our approach with a spatially varying ecological diffusion model implemented using a computationally efficient homogenization technique. We apply our model to understand individual-level and location-specific risk factors associated with chronic wasting disease in white-tailed deer from Wisconsin, USA and estimate the location the disease was first introduced. We compare our approach to several existing methods that are commonly used in spatial statistics. Our spatio-temporal approach resulted in a higher predictive accuracy when compared to methods based on optimal spatial prediction, obviated confounding among the spatially indexed covariates and the spatial random effect, and provided additional information that will be important for containing disease outbreaks.

  14. Meta-analysis of neutropenia or leukopenia as a prognostic factor in patients with malignant disease undergoing chemotherapy.

    PubMed

    Shitara, Kohei; Matsuo, Keitaro; Oze, Isao; Mizota, Ayako; Kondo, Chihiro; Nomura, Motoo; Yokota, Tomoya; Takahari, Daisuke; Ura, Takashi; Muro, Kei

    2011-08-01

    We performed a systematic review and meta-analysis to determine the impact of neutropenia or leukopenia experienced during chemotherapy on survival. Eligible studies included prospective or retrospective analyses that evaluated neutropenia or leukopenia as a prognostic factor for overall survival or disease-free survival. Statistical analyses were conducted to calculate a summary hazard ratio and 95% confidence interval (CI) using random-effects or fixed-effects models based on the heterogeneity of the included studies. Thirteen trials were selected for the meta-analysis, with a total of 9,528 patients. The hazard ratio of death was 0.69 (95% CI, 0.64-0.75) for patients with higher-grade neutropenia or leukopenia compared to patients with lower-grade or lack of cytopenia. Our analysis was also stratified by statistical method (any statistical method to decrease lead-time bias; time-varying analysis or landmark analysis), but no differences were observed. Our results indicate that neutropenia or leukopenia experienced during chemotherapy is associated with improved survival in patients with advanced cancer or hematological malignancies undergoing chemotherapy. Future prospective analyses designed to investigate the potential impact of chemotherapy dose adjustment coupled with monitoring of neutropenia or leukopenia on survival are warranted.

  15. Assessing electronic health record systems in emergency departments: Using a decision analytic Bayesian model.

    PubMed

    Ben-Assuli, Ofir; Leshno, Moshe

    2016-09-01

    In the last decade, health providers have implemented information systems to improve accuracy in medical diagnosis and decision-making. This article evaluates the impact of an electronic health record on emergency department physicians' diagnosis and admission decisions. A decision analytic approach using a decision tree was constructed to model the admission decision process to assess the added value of medical information retrieved from the electronic health record. Using a Bayesian statistical model, this method was evaluated on two coronary artery disease scenarios. The results show that the cases of coronary artery disease were better diagnosed when the electronic health record was consulted and led to more informed admission decisions. Furthermore, the value of medical information required for a specific admission decision in emergency departments could be quantified. The findings support the notion that physicians and patient healthcare can benefit from implementing electronic health record systems in emergency departments. © The Author(s) 2015.

  16. Predicting Survival of De Novo Metastatic Breast Cancer in Asian Women: Systematic Review and Validation Study

    PubMed Central

    Miao, Hui; Hartman, Mikael; Bhoo-Pathy, Nirmala; Lee, Soo-Chin; Taib, Nur Aishah; Tan, Ern-Yu; Chan, Patrick; Moons, Karel G. M.; Wong, Hoong-Seam; Goh, Jeremy; Rahim, Siti Mastura; Yip, Cheng-Har; Verkooijen, Helena M.

    2014-01-01

    Background In Asia, up to 25% of breast cancer patients present with distant metastases at diagnosis. Given the heterogeneous survival probabilities of de novo metastatic breast cancer, individual outcome prediction is challenging. The aim of the study is to identify existing prognostic models for patients with de novo metastatic breast cancer and validate them in Asia. Materials and Methods We performed a systematic review to identify prediction models for metastatic breast cancer. Models were validated in 642 women with de novo metastatic breast cancer registered between 2000 and 2010 in the Singapore Malaysia Hospital Based Breast Cancer Registry. Survival curves for low, intermediate and high-risk groups according to each prognostic score were compared by log-rank test and discrimination of the models was assessed by concordance statistic (C-statistic). Results We identified 16 prediction models, seven of which were for patients with brain metastases only. Performance status, estrogen receptor status, metastatic site(s) and disease-free interval were the most common predictors. We were able to validate nine prediction models. The capacity of the models to discriminate between poor and good survivors varied from poor to fair with C-statistics ranging from 0.50 (95% CI, 0.48–0.53) to 0.63 (95% CI, 0.60–0.66). Conclusion The discriminatory performance of existing prediction models for de novo metastatic breast cancer in Asia is modest. Development of an Asian-specific prediction model is needed to improve prognostication and guide decision making. PMID:24695692

  17. Role of socioeconomic status measures in long-term mortality risk prediction after myocardial infarction.

    PubMed

    Molshatzki, Noa; Drory, Yaacov; Myers, Vicki; Goldbourt, Uri; Benyamini, Yael; Steinberg, David M; Gerber, Yariv

    2011-07-01

    The relationship of risk factors to outcomes has traditionally been assessed by measures of association such as odds ratio or hazard ratio and their statistical significance from an adjusted model. However, a strong, highly significant association does not guarantee a gain in stratification capacity. Using recently developed model performance indices, we evaluated the incremental discriminatory power of individual and neighborhood socioeconomic status (SES) measures after myocardial infarction (MI). Consecutive patients aged ≤65 years (N=1178) discharged from 8 hospitals in central Israel after incident MI in 1992 to 1993 were followed-up through 2005. A basic model (demographic variables, traditional cardiovascular risk factors, and disease severity indicators) was compared with an extended model including SES measures (education, income, employment, living with a steady partner, and neighborhood SES) in terms of Harrell c statistic, integrated discrimination improvement (IDI), and net reclassification improvement (NRI). During the 13-year follow-up, 326 (28%) patients died. Cox proportional hazards models showed that all SES measures were significantly and independently associated with mortality. Furthermore, compared with the basic model, the extended model yielded substantial gains (all P<0.001) in c statistic (0.723 to 0.757), NRI (15.2%), IDI (5.9%), and relative IDI (32%). Improvement was observed both for sensitivity (classification of events) and specificity (classification of nonevents). This study illustrates the additional insights that can be gained from considering the IDI and NRI measures of model performance and suggests that, among community patients with incident MI, incorporating SES measures into a clinical-based model substantially improves long-term mortality risk prediction.

  18. Predicting Readmission at Early Hospitalization Using Electronic Clinical Data: An Early Readmission Risk Score.

    PubMed

    Tabak, Ying P; Sun, Xiaowu; Nunez, Carlos M; Gupta, Vikas; Johannes, Richard S

    2017-03-01

    Identifying patients at high risk for readmission early during hospitalization may aid efforts in reducing readmissions. We sought to develop an early readmission risk predictive model using automated clinical data available at hospital admission. We developed an early readmission risk model using a derivation cohort and validated the model with a validation cohort. We used a published Acute Laboratory Risk of Mortality Score as an aggregated measure of clinical severity at admission and the number of hospital discharges in the previous 90 days as a measure of disease progression. We then evaluated the administrative data-enhanced model by adding principal and secondary diagnoses and other variables. We examined the c-statistic change when additional variables were added to the model. There were 1,195,640 adult discharges from 70 hospitals with 39.8% male and the median age of 63 years (first and third quartile: 43, 78). The 30-day readmission rate was 11.9% (n=142,211). The early readmission model yielded a graded relationship of readmission and the Acute Laboratory Risk of Mortality Score and the number of previous discharges within 90 days. The model c-statistic was 0.697 with good calibration. When administrative variables were added to the model, the c-statistic increased to 0.722. Automated clinical data can generate a readmission risk score early at hospitalization with fair discrimination. It may have applied value to aid early care transition. Adding administrative data increases predictive accuracy. The administrative data-enhanced model may be used for hospital comparison and outcome research.

  19. Use of 3D models of congenital heart disease as an education tool for cardiac nurses.

    PubMed

    Biglino, Giovanni; Capelli, Claudio; Koniordou, Despina; Robertshaw, Di; Leaver, Lindsay-Kay; Schievano, Silvia; Taylor, Andrew M; Wray, Jo

    2017-01-01

    Nurse education and training are key to providing congenital heart disease (CHD) patients with consistent high standards of care as well as enabling career progression. One approach for improving educational experience is the use of 3D patient-specific models. To gather pilot data to assess the feasibility of using 3D models of CHD during a training course for cardiac nurses; to evaluate the potential of 3D models in this context, from the nurses' perspective; and to identify possible improvements to optimise their use for teaching. A cross-sectional survey. A national training week for cardiac nurses. One hundred cardiac nurses (of which 65 pediatric and 35 adult). Nurses were shown nine CHD models within the context of a specialized course, following a lecture on the process of making the models themselves, starting from medical imaging. Participants were asked about their general learning experience, if models were more/less informative than diagrams/drawings and lesion-specific/generic models, and their overall reaction to the models. Possible differences between adult and pediatric nurses were investigated. Written feedback was subjected to content analysis and quantitative data were analyzed using nonparametric statistics. Generally models were well liked and nurses considered them more informative than diagrams. Nurses found that 3D models helped in the appreciation of overall anatomy (86%), spatial orientation (70%), and anatomical complexity after treatment (66%). There was no statistically significant difference between adult and pediatric nurses' responses. Thematic analysis highlighted the need for further explanation, use of labels and use of colors to highlight the lesion of interest amongst improvements for optimizing 3D models for teaching/training purposes. 3D patient-specific models are useful tools for training adult and pediatric cardiac nurses and are particularly helpful for understanding CHD anatomy after repair. © 2016 Wiley Periodicals, Inc.

  20. A scan statistic to extract causal gene clusters from case-control genome-wide rare CNV data.

    PubMed

    Nishiyama, Takeshi; Takahashi, Kunihiko; Tango, Toshiro; Pinto, Dalila; Scherer, Stephen W; Takami, Satoshi; Kishino, Hirohisa

    2011-05-26

    Several statistical tests have been developed for analyzing genome-wide association data by incorporating gene pathway information in terms of gene sets. Using these methods, hundreds of gene sets are typically tested, and the tested gene sets often overlap. This overlapping greatly increases the probability of generating false positives, and the results obtained are difficult to interpret, particularly when many gene sets show statistical significance. We propose a flexible statistical framework to circumvent these problems. Inspired by spatial scan statistics for detecting clustering of disease occurrence in the field of epidemiology, we developed a scan statistic to extract disease-associated gene clusters from a whole gene pathway. Extracting one or a few significant gene clusters from a global pathway limits the overall false positive probability, which results in increased statistical power, and facilitates the interpretation of test results. In the present study, we applied our method to genome-wide association data for rare copy-number variations, which have been strongly implicated in common diseases. Application of our method to a simulated dataset demonstrated the high accuracy of this method in detecting disease-associated gene clusters in a whole gene pathway. The scan statistic approach proposed here shows a high level of accuracy in detecting gene clusters in a whole gene pathway. This study has provided a sound statistical framework for analyzing genome-wide rare CNV data by incorporating topological information on the gene pathway.

  1. Prediction of late disease recurrence and extended adjuvant letrozole benefit by the HOXB13/IL17BR biomarker.

    PubMed

    Sgroi, Dennis C; Carney, Erin; Zarrella, Elizabeth; Steffel, Lauren; Binns, Shemeica N; Finkelstein, Dianne M; Szymonifka, Jackie; Bhan, Atul K; Shepherd, Lois E; Zhang, Yi; Schnabel, Catherine A; Erlander, Mark G; Ingle, James N; Porter, Peggy; Muss, Hyman B; Pritchard, Katherine I; Tu, Dongsheng; Rimm, David L; Goss, Paul E

    2013-07-17

    Biomarkers to optimize extended adjuvant endocrine therapy for women with estrogen receptor (ER)-positive breast cancer are limited. The HOXB13/IL17BR (H/I) biomarker predicts recurrence risk in ER-positive, lymph node-negative breast cancer patients. H/I was evaluated in MA.17 trial for prognostic performance for late recurrence and treatment benefit from extended adjuvant letrozole. A prospective-retrospective, nested case-control design of 83 recurrences matched to 166 nonrecurrences from letrozole- and placebo-treated patients within MA.17 was conducted. Expression of H/I within primary tumors was determined by reverse-transcription polymerase chain reaction with a prespecified cutpoint. The predictive ability of H/I for ascertaining benefit from letrozole was determined using multivariable conditional logistic regression including standard clinicopathological factors as covariates. All statistical tests were two-sided. High H/I was statistically significantly associated with a decrease in late recurrence in patients receiving extended letrozole therapy (odds ratio [OR] = 0.35; 95% confidence interval [CI] = 0.16 to 0.75; P = .007). In an adjusted model with standard clinicopathological factors, high H/I remained statistically significantly associated with patient benefit from letrozole (OR = 0.33; 95% CI = 0.15 to 0.73; P = .006). Reduction in the absolute risk of recurrence at 5 years was 16.5% for patients with high H/I (P = .007). The interaction between H/I and letrozole treatment was statistically significant (P = .03). In the absence of extended letrozole therapy, high H/I identifies a subgroup of ER-positive patients disease-free after 5 years of tamoxifen who are at risk for late recurrence. When extended endocrine therapy with letrozole is prescribed, high H/I predicts benefit from therapy and a decreased probability of late disease recurrence.

  2. Evaluation of Penalized and Nonpenalized Methods for Disease Prediction with Large-Scale Genetic Data.

    PubMed

    Won, Sungho; Choi, Hosik; Park, Suyeon; Lee, Juyoung; Park, Changyi; Kwon, Sunghoon

    2015-01-01

    Owing to recent improvement of genotyping technology, large-scale genetic data can be utilized to identify disease susceptibility loci and this successful finding has substantially improved our understanding of complex diseases. However, in spite of these successes, most of the genetic effects for many complex diseases were found to be very small, which have been a big hurdle to build disease prediction model. Recently, many statistical methods based on penalized regressions have been proposed to tackle the so-called "large P and small N" problem. Penalized regressions including least absolute selection and shrinkage operator (LASSO) and ridge regression limit the space of parameters, and this constraint enables the estimation of effects for very large number of SNPs. Various extensions have been suggested, and, in this report, we compare their accuracy by applying them to several complex diseases. Our results show that penalized regressions are usually robust and provide better accuracy than the existing methods for at least diseases under consideration.

  3. Certolizumab pegol in a heterogeneous population of patients with moderate-to-severe rheumatoid arthritis

    PubMed Central

    Soriano, Enrique R; Dellepiane, Analia; Salvatierra, Gabriela; Benítez, Cristian Alejandro; Salinas, Rodrigo Garcia; Baruzzo, Carlos

    2018-01-01

    Aim: To determine the efficacy and safety of certolizumab pegol for the treatment of rheumatoid arthritis in a real-world setting. Materials & methods: Patients with moderate-to-severe rheumatoid arthritis who initiated therapy with certolizumab were followed for 12 weeks. Response was assessed with Disease Activity Score of 28 joints, European Ligue Against Rheumatism criteria and Simplified Disease Activity Index. Predictors of response were analyzed with binary logistic regression models. Results: Statistically significant decreases in tender and swollen joint counts, laboratory parameters and use of corticosteroids and disease-modifying antirheumatic drugs were found. Disease activity also significantly diminished. Higher Disease Activity Score of 28 joints at baseline was the main predictor of response. No severe adverse events were reported. Conclusion: Certolizumab was effective and well tolerated, particularly in the subpopulation with higher inflammatory burden at baseline. PMID:29682324

  4. Concepts and data model for a co-operative neurovascular database.

    PubMed

    Mansmann, U; Taylor, W; Porter, P; Bernarding, J; Jäger, H R; Lasjaunias, P; Terbrugge, K; Meisel, J

    2001-08-01

    Problems of clinical management of neurovascular diseases are very complex. This is caused by the chronic character of the diseases, a long history of symptoms and diverse treatments. If patients are to benefit from treatment, then treatment decisions have to rely on reliable and accurate knowledge of the natural history of the disease and the various treatments. Recent developments in statistical methodology and experience from electronic patient records are used to establish an information infrastructure based on a centralized register. A protocol to collect data on neurovascular diseases with technical as well as logistical aspects of implementing a database for neurovascular diseases are described. The database is designed as a co-operative tool of audit and research available to co-operating centres. When a database is linked to a systematic patient follow-up, it can be used to study prognosis. Careful analysis of patient outcome is valuable for decision-making.

  5. Discrimination of inflammatory bowel disease using Raman spectroscopy and linear discriminant analysis methods

    NASA Astrophysics Data System (ADS)

    Ding, Hao; Cao, Ming; DuPont, Andrew W.; Scott, Larry D.; Guha, Sushovan; Singhal, Shashideep; Younes, Mamoun; Pence, Isaac; Herline, Alan; Schwartz, David; Xu, Hua; Mahadevan-Jansen, Anita; Bi, Xiaohong

    2016-03-01

    Inflammatory bowel disease (IBD) is an idiopathic disease that is typically characterized by chronic inflammation of the gastrointestinal tract. Recently much effort has been devoted to the development of novel diagnostic tools that can assist physicians for fast, accurate, and automated diagnosis of the disease. Previous research based on Raman spectroscopy has shown promising results in differentiating IBD patients from normal screening cases. In the current study, we examined IBD patients in vivo through a colonoscope-coupled Raman system. Optical diagnosis for IBD discrimination was conducted based on full-range spectra using multivariate statistical methods. Further, we incorporated several feature selection methods in machine learning into the classification model. The diagnostic performance for disease differentiation was significantly improved after feature selection. Our results showed that improved IBD diagnosis can be achieved using Raman spectroscopy in combination with multivariate analysis and feature selection.

  6. Counteracting structural errors in ensemble forecast of influenza outbreaks.

    PubMed

    Pei, Sen; Shaman, Jeffrey

    2017-10-13

    For influenza forecasts generated using dynamical models, forecast inaccuracy is partly attributable to the nonlinear growth of error. As a consequence, quantification of the nonlinear error structure in current forecast models is needed so that this growth can be corrected and forecast skill improved. Here, we inspect the error growth of a compartmental influenza model and find that a robust error structure arises naturally from the nonlinear model dynamics. By counteracting these structural errors, diagnosed using error breeding, we develop a new forecast approach that combines dynamical error correction and statistical filtering techniques. In retrospective forecasts of historical influenza outbreaks for 95 US cities from 2003 to 2014, overall forecast accuracy for outbreak peak timing, peak intensity and attack rate, are substantially improved for predicted lead times up to 10 weeks. This error growth correction method can be generalized to improve the forecast accuracy of other infectious disease dynamical models.Inaccuracy of influenza forecasts based on dynamical models is partly due to nonlinear error growth. Here the authors address the error structure of a compartmental influenza model, and develop a new improved forecast approach combining dynamical error correction and statistical filtering techniques.

  7. [The application of the prospective space-time statistic in early warning of infectious disease].

    PubMed

    Yin, Fei; Li, Xiao-Song; Feng, Zi-Jian; Ma, Jia-Qi

    2007-06-01

    To investigate the application of prospective space-time scan statistic in the early stage of detecting infectious disease outbreaks. The prospective space-time scan statistic was tested by mimicking daily prospective analyses of bacillary dysentery data of Chengdu city in 2005 (3212 cases in 102 towns and villages). And the results were compared with that of purely temporal scan statistic. The prospective space-time scan statistic could give specific messages both in spatial and temporal. The results of June indicated that the prospective space-time scan statistic could timely detect the outbreaks that started from the local site, and the early warning message was powerful (P = 0.007). When the merely temporal scan statistic for detecting the outbreak was sent two days later, and the signal was less powerful (P = 0.039). The prospective space-time scan statistic could make full use of the spatial and temporal information in infectious disease data and could timely and effectively detect the outbreaks that start from the local sites. The prospective space-time scan statistic could be an important tool for local and national CDC to set up early detection surveillance systems.

  8. White Paper: Landscape on Technical and Conceptual Requirements and Competence Framework in Drug/Disease Modeling and Simulation

    PubMed Central

    Vlasakakis, G; Comets, E; Keunecke, A; Gueorguieva, I; Magni, P; Terranova, N; Della Pasqua, O; de Lange, E C; Kloft, C

    2013-01-01

    Pharmaceutical sciences experts and regulators acknowledge that pharmaceutical development as well as drug usage requires more than scientific advancements to cope with current attrition rates/therapeutic failures. Drug disease modeling and simulation (DDM&S) creates a paradigm to enable an integrated and higher-level understanding of drugs, (diseased)systems, and their interactions (systems pharmacology) through mathematical/statistical models (pharmacometrics)1—hence facilitating decision making during drug development and therapeutic usage of medicines. To identify gaps and challenges in DDM&S, an inventory of skills and competencies currently available in academia, industry, and clinical practice was obtained through survey. The survey outcomes revealed benefits, weaknesses, and hurdles for the implementation of DDM&S. In addition, the survey indicated that no consensus exists about the knowledge, skills, and attributes required to perform DDM&S activities effectively. Hence, a landscape of technical and conceptual requirements for DDM&S was identified and serves as a basis for developing a framework of competencies to guide future education and training in DDM&S. PMID:23887723

  9. An application of Mean Escape Time and metapopulation on forestry catastrophe insurance

    NASA Astrophysics Data System (ADS)

    Li, Jiangcheng; Zhang, Chunmin; Liu, Jifa; Li, Zhen; Yang, Xuan

    2018-04-01

    A forestry catastrophe insurance model due to forestry pest infestations and disease epidemics is developed by employing metapopulation dynamics and statistics properties of Mean Escape Time (MET). The probability of outbreak of forestry catastrophe loss and the catastrophe loss payment time with MET are respectively investigated. Forestry loss data in China is used for model simulation. Experimental results are concluded as: (1) The model with analytical results is shown to be a better fit; (2) Within the condition of big area of patches and structure of patches, high system factor, low extinction rate, high multiplicative noises, and additive noises with a high cross-correlated strength range, an outbreak of forestry catastrophe loss or catastrophe loss payment due to forestry pest infestations and disease epidemics could occur; (3) An optimal catastrophe loss payment time MET due to forestry pest infestations and disease epidemics can be identified by taking proper value of multiplicative noises and limits the additive noises on a low range of value, and cross-correlated strength at a high range of value.

  10. Intractable Ménière's disease. Modelling of the treatment by means of statistical analysis.

    PubMed

    Sanchez-Ferrandiz, Noelia; Fernandez-Gonzalez, Secundino; Guillen-Grima, Francisco; Perez-Fernandez, Nicolas

    2010-08-01

    To evaluate the value of different variables of the clinical history, auditory and vestibular tests and handicap measurements to define intractable or disabling Ménière's disease. This is a prospective study with 212 patients of which 155 were treated with intratympanic gentamicin and considered to be suffering a medically intractable Ménière's disease. Age and sex adjustments were performed with the 11 variables selected. Discriminant analysis was performed either using the aforementioned variables or following the stepwise method. Different variables needed to be sex and/or age adjusted and both data were included in the discriminant function. Two different mathematical formulas were obtained and four models were analyzed. With the model selected, diagnostic accuracy is 77.7%, sensitivity is 94.9% and specificity is 52.8%. After discriminant analysis we found that the most informative variables were the number of vertigo spells, the speech discrimination score, the time constant of the VOR and a measure of handicap, the "dizziness index". Copyright 2009 Elsevier Ireland Ltd. All rights reserved.

  11. Prevalence of diseases and statistical power of the Japan Nurses' Health Study.

    PubMed

    Fujita, Toshiharu; Hayashi, Kunihiko; Katanoda, Kota; Matsumura, Yasuhiro; Lee, Jung Su; Takagi, Hirofumi; Suzuki, Shosuke; Mizunuma, Hideki; Aso, Takeshi

    2007-10-01

    The Japan Nurses' Health Study (JNHS) is a long-term, large-scale cohort study investigating the effects of various lifestyle factors and healthcare habits on the health of Japanese women. Based on currently limited statistical data regarding the incidence of disease among Japanese women, our initial sample size was tentatively set at 50,000 during the design phase. The actual number of women who agreed to participate in follow-up surveys was approximately 18,000. Taking into account the actual sample size and new information on disease frequency obtained during the baseline component, we established the prevalence of past diagnoses of target diseases, predicted their incidence, and calculated the statistical power for JNHS follow-up surveys. For all diseases except ovarian cancer, the prevalence of a past diagnosis increased markedly with age, and incidence rates could be predicted based on the degree of increase in prevalence between two adjacent 5-yr age groups. The predicted incidence rate for uterine myoma, hypercholesterolemia, and hypertension was > or =3.0 (per 1,000 women, per year), while the rate of thyroid disease, hepatitis, gallstone disease, and benign breast tumor was predicted to be > or =1.0. For these diseases, the statistical power to detect risk factors with a relative risk of 1.5 or more within ten years, was 70% or higher.

  12. Tower of London test: a comparison between conventional statistic approach and modelling based on artificial neural network in differentiating fronto-temporal dementia from Alzheimer's disease.

    PubMed

    Franceschi, Massimo; Caffarra, Paolo; Savarè, Rita; Cerutti, Renata; Grossi, Enzo

    2011-01-01

    The early differentiation of Alzheimer's disease (AD) from frontotemporal dementia (FTD) may be difficult. The Tower of London (ToL), thought to assess executive functions such as planning and visuo-spatial working memory, could help in this purpose. Twentytwo Dementia Centers consecutively recruited patients with early FTD or AD. ToL performances of these groups were analyzed using both the conventional statistical approaches and the Artificial Neural Networks (ANNs) modelling. Ninety-four non aphasic FTD and 160 AD patients were recruited. ToL Accuracy Score (AS) significantly (p < 0.05) differentiated FTD from AD patients. However, the discriminant validity of AS checked by ROC curve analysis, yielded no significant results in terms of sensitivity and specificity (AUC 0.63). The performances of the 12 Success Subscores (SS) together with age, gender and schooling years were entered into advanced ANNs developed by Semeion Institute. The best ANNs were selected and submitted to ROC curves. The non-linear model was able to discriminate FTD from AD with an average AUC for 7 independent trials of 0.82. The use of hidden information contained in the different items of ToL and the non linear processing of the data through ANNs allows a high discrimination between FTD and AD in individual patients.

  13. Constructing Ebola transmission chains from West Africa and estimating model parameters using internet sources.

    PubMed

    Pettey, W B P; Carter, M E; Toth, D J A; Samore, M H; Gundlapalli, A V

    2017-07-01

    During the recent Ebola crisis in West Africa, individual person-level details of disease onset, transmissions, and outcomes such as survival or death were reported in online news media. We set out to document disease transmission chains for Ebola, with the goal of generating a timely account that could be used for surveillance, mathematical modeling, and public health decision-making. By accessing public web pages only, such as locally produced newspapers and blogs, we created a transmission chain involving two Ebola clusters in West Africa that compared favorably with other published transmission chains, and derived parameters for a mathematical model of Ebola disease transmission that were not statistically different from those derived from published sources. We present a protocol for responsibly gleaning epidemiological facts, transmission model parameters, and useful details from affected communities using mostly indigenously produced sources. After comparing our transmission parameters to published parameters, we discuss additional benefits of our method, such as gaining practical information about the affected community, its infrastructure, politics, and culture. We also briefly compare our method to similar efforts that used mostly non-indigenous online sources to generate epidemiological information.

  14. Novel Biomarker Signature That May Predict Aggressive Disease in African American Men With Prostate Cancer

    PubMed Central

    Yamoah, Kosj; Johnson, Michael H.; Choeurng, Voleak; Faisal, Farzana A.; Yousefi, Kasra; Haddad, Zaid; Ross, Ashley E.; Alshalafa, Mohammed; Den, Robert; Lal, Priti; Feldman, Michael; Dicker, Adam P.; Klein, Eric A.; Davicioni, Elai; Rebbeck, Timothy R.; Schaeffer, Edward M.

    2015-01-01

    Purpose We studied the ethnicity-specific expression of prostate cancer (PC) –associated biomarkers to evaluate whether genetic/biologic factors affect ethnic disparities in PC pathogenesis and disease progression. Patients and Methods A total of 154 African American (AA) and 243 European American (EA) patients from four medical centers were matched according to the Cancer of the Prostate Risk Assessment postsurgical score within each institution. The distribution of mRNA expression levels of 20 validated biomarkers reported to be associated with PC initiation and progression was compared with ethnicity using false discovery rate, adjusted Wilcoxon-Mann-Whitney, and logistic regression models. A conditional logistic regression model was used to evaluate the interaction between ethnicity and biomarkers for predicting clinicopathologic outcomes. Results Of the 20 biomarkers examined, six showed statistically significant differential expression in AA compared with EA men in one or more statistical models. These include ERG (P < .001), AMACR (P < .001), SPINK1 (P = .001), NKX3-1 (P = .03), GOLM1 (P = .03), and androgen receptor (P = .04). Dysregulation of AMACR (P = .036), ERG (P = .036), FOXP1 (P = .041), and GSTP1 (P = .049) as well as loss-of-function mutations for tumor suppressors NKX3-1 (P = .025) and RB1 (P = .037) predicted risk of pathologic T3 disease in an ethnicity-dependent manner. Dysregulation of GOLM1 (P = .037), SRD5A2 (P = .023), and MKi67 (P = .023) predicted clinical outcomes, including 3-year biochemical recurrence and metastasis at 5 years. A greater proportion of AA men than EA men had triple-negative (ERG-negative/ETS-negative/SPINK1-negative) disease (51% v 35%; P = .002). Conclusion We have identified a subset of PC biomarkers that predict the risk of clinicopathologic outcomes in an ethnicity-dependent manner. These biomarkers may explain in part the biologic contribution to ethnic disparity in PC outcomes between EA and AA men. PMID:26195723

  15. Novel Biomarker Signature That May Predict Aggressive Disease in African American Men With Prostate Cancer.

    PubMed

    Yamoah, Kosj; Johnson, Michael H; Choeurng, Voleak; Faisal, Farzana A; Yousefi, Kasra; Haddad, Zaid; Ross, Ashley E; Alshalafa, Mohammed; Den, Robert; Lal, Priti; Feldman, Michael; Dicker, Adam P; Klein, Eric A; Davicioni, Elai; Rebbeck, Timothy R; Schaeffer, Edward M

    2015-09-01

    We studied the ethnicity-specific expression of prostate cancer (PC) -associated biomarkers to evaluate whether genetic/biologic factors affect ethnic disparities in PC pathogenesis and disease progression. A total of 154 African American (AA) and 243 European American (EA) patients from four medical centers were matched according to the Cancer of the Prostate Risk Assessment postsurgical score within each institution. The distribution of mRNA expression levels of 20 validated biomarkers reported to be associated with PC initiation and progression was compared with ethnicity using false discovery rate, adjusted Wilcoxon-Mann-Whitney, and logistic regression models. A conditional logistic regression model was used to evaluate the interaction between ethnicity and biomarkers for predicting clinicopathologic outcomes. Of the 20 biomarkers examined, six showed statistically significant differential expression in AA compared with EA men in one or more statistical models. These include ERG (P < .001), AMACR (P < .001), SPINK1 (P = .001), NKX3-1 (P = .03), GOLM1 (P = .03), and androgen receptor (P = .04). Dysregulation of AMACR (P = .036), ERG (P = .036), FOXP1 (P = .041), and GSTP1 (P = .049) as well as loss-of-function mutations for tumor suppressors NKX3-1 (P = .025) and RB1 (P = .037) predicted risk of pathologic T3 disease in an ethnicity-dependent manner. Dysregulation of GOLM1 (P = .037), SRD5A2 (P = .023), and MKi67 (P = .023) predicted clinical outcomes, including 3-year biochemical recurrence and metastasis at 5 years. A greater proportion of AA men than EA men had triple-negative (ERG-negative/ETS-negative/SPINK1-negative) disease (51% v 35%; P = .002). We have identified a subset of PC biomarkers that predict the risk of clinicopathologic outcomes in an ethnicity-dependent manner. These biomarkers may explain in part the biologic contribution to ethnic disparity in PC outcomes between EA and AA men. © 2015 by American Society of Clinical Oncology.

  16. The role of psychosocial stress at work for the development of cardiovascular diseases: a systematic review.

    PubMed

    Backé, Eva-Maria; Seidler, Andreas; Latza, Ute; Rossnagel, Karin; Schumann, Barbara

    2012-01-01

    A systematic review was carried out to assess evidence for the association between different models of stress at work, and cardiovascular morbidity and mortality. A literature search was conducted using five databases (MEDLINE, Cochrane Library, EMBASE, PSYNDEX and PsycINFO). Inclusion criteria for studies were the following: self-reported stress for individual workplaces, prospective study design and incident disease (myocardial infarction, stroke, angina pectoris, high blood pressure). Evaluation, according to the criteria of the Scottish Intercollegiate Guidelines Network, was done by two readers. In case of disagreement, a third reader was involved. Twenty-six publications were included, describing 40 analyses out of 20 cohorts. The risk estimates for work stress were associated with a statistically significant increased risk of cardiovascular disease in 13 out of the 20 cohorts. Associations were significant for 7 out of 13 cohorts applying the demand-control model, all three cohorts using the effort-reward model and 3 out of 6 cohorts investigating other models. Most significant results came from analyses considering only men. Results for the association between job stress and cardiovascular diseases in women were not clear. Associations were weaker in participants above the age of 55. In accordance with other systematic reviews, this review stresses the importance of psychosocial factors at work in the aetiology of cardiovascular diseases. Besides individual measures to manage stress and to cope with demanding work situations, organisational changes at the workplace need to be considered to find options to reduce occupational risk factors for cardiovascular diseases.

  17. Detection by voxel-wise statistical analysis of significant changes in regional cerebral glucose uptake in an APP/PS1 transgenic mouse model of Alzheimer's disease.

    PubMed

    Dubois, Albertine; Hérard, Anne-Sophie; Delatour, Benoît; Hantraye, Philippe; Bonvento, Gilles; Dhenain, Marc; Delzescaux, Thierry

    2010-06-01

    Biomarkers and technologies similar to those used in humans are essential for the follow-up of Alzheimer's disease (AD) animal models, particularly for the clarification of mechanisms and the screening and validation of new candidate treatments. In humans, changes in brain metabolism can be detected by 1-deoxy-2-[(18)F] fluoro-D-glucose PET (FDG-PET) and assessed in a user-independent manner with dedicated software, such as Statistical Parametric Mapping (SPM). FDG-PET can be carried out in small animals, but its resolution is low as compared to the size of rodent brain structures. In mouse models of AD, changes in cerebral glucose utilization are usually detected by [(14)C]-2-deoxyglucose (2DG) autoradiography, but this requires prior manual outlining of regions of interest (ROI) on selected sections. Here, we evaluate the feasibility of applying the SPM method to 3D autoradiographic data sets mapping brain metabolic activity in a transgenic mouse model of AD. We report the preliminary results obtained with 4 APP/PS1 (64+/-1 weeks) and 3 PS1 (65+/-2 weeks) mice. We also describe new procedures for the acquisition and use of "blockface" photographs and provide the first demonstration of their value for the 3D reconstruction and spatial normalization of post mortem mouse brain volumes. Despite this limited sample size, our results appear to be meaningful, consistent, and more comprehensive than findings from previously published studies based on conventional ROI-based methods. The establishment of statistical significance at the voxel level, rather than with a user-defined ROI, makes it possible to detect more reliably subtle differences in geometrically complex regions, such as the hippocampus. Our approach is generic and could be easily applied to other biomarkers and extended to other species and applications. Copyright 2010 Elsevier Inc. All rights reserved.

  18. [Use of the methods of mathematical modeling for evaluation of the data of cerebrospinal fluid examination in patients with bacterial meningoencephalitis].

    PubMed

    Iarosh, O A; Iarosh, A A

    1991-01-01

    As many as 300 patients of different age groups underwent a probability statistical analysis of cytosis and CSF protein depending on the outcome of bacterial meningoencephalitis. The clinical and CSF interrelations discovered reflect the function of the blood-brain barrier and can be used as an additional test for predicting the disease outcome.

  19. A multilevel model for cardiovascular disease prevalence in the US and its application to micro area prevalence estimates.

    PubMed

    Congdon, Peter

    2009-01-30

    Estimates of disease prevalence for small areas are increasingly required for the allocation of health funds according to local need. Both individual level and geographic risk factors are likely to be relevant to explaining prevalence variations, and in turn relevant to the procedure for small area prevalence estimation. Prevalence estimates are of particular importance for major chronic illnesses such as cardiovascular disease. A multilevel prevalence model for cardiovascular outcomes is proposed that incorporates both survey information on patient risk factors and the effects of geographic location. The model is applied to derive micro area prevalence estimates, specifically estimates of cardiovascular disease for Zip Code Tabulation Areas in the USA. The model incorporates prevalence differentials by age, sex, ethnicity and educational attainment from the 2005 Behavioral Risk Factor Surveillance System survey. Influences of geographic context are modelled at both county and state level, with the county effects relating to poverty and urbanity. State level influences are modelled using a random effects approach that allows both for spatial correlation and spatial isolates. To assess the importance of geographic variables, three types of model are compared: a model with person level variables only; a model with geographic effects that do not interact with person attributes; and a full model, allowing for state level random effects that differ by ethnicity. There is clear evidence that geographic effects improve statistical fit. Geographic variations in disease prevalence partly reflect the demographic composition of area populations. However, prevalence variations may also show distinct geographic 'contextual' effects. The present study demonstrates by formal modelling methods that improved explanation is obtained by allowing for distinct geographic effects (for counties and states) and for interaction between geographic and person variables. Thus an appropriate methodology to estimate prevalence at small area level should include geographic effects as well as person level demographic variables.

  20. A multilevel model for cardiovascular disease prevalence in the US and its application to micro area prevalence estimates

    PubMed Central

    Congdon, Peter

    2009-01-01

    Background Estimates of disease prevalence for small areas are increasingly required for the allocation of health funds according to local need. Both individual level and geographic risk factors are likely to be relevant to explaining prevalence variations, and in turn relevant to the procedure for small area prevalence estimation. Prevalence estimates are of particular importance for major chronic illnesses such as cardiovascular disease. Methods A multilevel prevalence model for cardiovascular outcomes is proposed that incorporates both survey information on patient risk factors and the effects of geographic location. The model is applied to derive micro area prevalence estimates, specifically estimates of cardiovascular disease for Zip Code Tabulation Areas in the USA. The model incorporates prevalence differentials by age, sex, ethnicity and educational attainment from the 2005 Behavioral Risk Factor Surveillance System survey. Influences of geographic context are modelled at both county and state level, with the county effects relating to poverty and urbanity. State level influences are modelled using a random effects approach that allows both for spatial correlation and spatial isolates. Results To assess the importance of geographic variables, three types of model are compared: a model with person level variables only; a model with geographic effects that do not interact with person attributes; and a full model, allowing for state level random effects that differ by ethnicity. There is clear evidence that geographic effects improve statistical fit. Conclusion Geographic variations in disease prevalence partly reflect the demographic composition of area populations. However, prevalence variations may also show distinct geographic 'contextual' effects. The present study demonstrates by formal modelling methods that improved explanation is obtained by allowing for distinct geographic effects (for counties and states) and for interaction between geographic and person variables. Thus an appropriate methodology to estimate prevalence at small area level should include geographic effects as well as person level demographic variables. PMID:19183458

  1. Clinicopathological predictors for progression of chronic kidney disease in nephrosclerosis: a biopsy-based cohort study.

    PubMed

    Yamanouchi, Masayuki; Hoshino, Junichi; Ubara, Yoshifumi; Takaichi, Kenmei; Kinowaki, Keiichi; Fujii, Takeshi; Ohashi, Kenichi; Mise, Koki; Toyama, Tadashi; Hara, Akinori; Shimizu, Miho; Furuichi, Kengo; Wada, Takashi

    2018-05-19

    Biopsy-based studies on nephrosclerosis are lacking and the clinicopathological predictors for progression of chronic kidney disease (CKD) are not well established. We retrospectively assessed 401 patients with biopsy-proven nephrosclerosis in Japan. Progression of CKD was defined as new-onset end-stage renal disease, decrease of estimated glomerular filtration rate (eGFR) by  ≥50% or doubling of serum creatinine, and the sub-distribution hazard ratio (SHR) with 95% confidence interval (CI) for CKD progression was determined for various clinical and histological characteristics in competing risks analysis. The incremental value of pathological information for predicting CKD progression was assessed by calculating Harrell's C-statistics, the Akaike information criterion (AIC), net reclassification improvement and integrated discrimination improvement. During a median follow-up period of 5.3 years, 117 patients showed progression of CKD and 10 patients died before the defined kidney event. Multivariable sub-distribution hazards model identified serum albumin (SHR 0.48; 95% CI 0.35-0.67), hemoglobin A1c (SHR 0.71; 95% CI 0.54-0.94), eGFR (SHR 0.98; 95% CI 0.97-0.99), urinary albumin/creatinine ratio (UACR) (SHR 1.18; 95% CI 1.08-1.29), percentage of segmental/global glomerulosclerosis (%GS) (SHR 1.01; 95% CI 1.00-1.02) and interstitial fibrosis and tubular atrophy (IFTA) (SHR 1.52; 95% CI 1.20-1.92) as risk factors for CKD progression. The C-statistic of a model with only clinical variables was improved by adding %GS (0.790 versus 0.796, P < 0.01) and IFTA (0.790 versus 0.811, P < 0.01). The reclassification statistic was also improved after adding the biopsy data to the clinical data. The model including IFTA was superior, with the lowest AIC. The study implies that in addition to the traditional markers of eGFR and UACR, we may explore the markers of serum albumin and hemoglobin A1c, which are widely available but not routinely measured in patients with nephrosclerosis, and the biopsy data, especially the data on the severity of interstitial damage, for the better prediction of CKD progression in patients with nephrosclerosis.

  2. Media coverage and hospital notifications: Correlation analysis and optimal media impact duration to manage a pandemic.

    PubMed

    Yan, Qinling; Tang, Sanyi; Gabriele, Sandra; Wu, Jianhong

    2016-02-07

    News reporting has the potential to modify a community's knowledge of emerging infectious diseases and affect peoples' attitudes and behavior. Here we developed a quantitative approach to evaluate the effects of media on such behavior. Statistically significant correlations between the number of new hospital notifications, during the 2009 A/H1N1 influenza epidemic in the Shaanxi province of China, and the number of daily news items added to eight major websites were found from Pearson correlation and cross-correlation analyses. We also proposed a novel model to examine the implication for transmission dynamics of these correlations. The model incorporated the media impact function into the intensity of infection, and enhanced the traditional epidemic SEIR model with the addition of media dynamics. We used a nonlinear least squares estimation to identify the best-fit parameter values in the model from the observed data. We also carried out the uncertainty and sensitivity analyses to determine key parameters during early phase of the disease outbreak for the final outcome of the outbreak with media impact. The findings confirm the importance of responses by individuals to the media reports, with behavior changes having important consequence for the emerging infectious disease control. Therefore, for mitigating emerging infectious diseases, media reports should be focused on how to guide people's behavioral changes, which are critical for limiting the spread of disease. Copyright © 2015 Elsevier Ltd. All rights reserved.

  3. [Study on early intervention of compound nutrition for cognitive dysfunction in Alzheimer's disease].

    PubMed

    Wang, Chao; Xie, Wei; Zhu, Jinfeng; Dang, Rui; Wang, Decai

    2014-01-01

    To observe the early prevention effect of the compound nutrients recipe for cognitive dysfunction of Alzheimer' s disease model-APP-PSN transgenic mouse. 36 APP-PSN transgenic mice aged two months randomly were divided into the intervention group supplied with compound recipe in the diet and the control group fed based feed, the former had high dose and low dose, 12 APP-PSN transgenic negative mice aged two months as the negative control were fed based feed. After 3 months' intervention, four groups' cognitive functions were evaluated using the Morris water maze, active avoidance experiment and jumping stair experiment. There was not statistically different between all the four groups for the weight and food intake. Compared with the control group, Morris water maze's incubation period of the intervention group was lower obviously, and jumping stair experiment's incubation period of the intervention group was higher obviously. In the active avoidance experiment, the high and low dose intervention group' s conditioned response accounted about 46.67% and 45.00% respectively, and the control group's conditioned response accounted about 20.83%. The differences of the three behavioral experiments between control group and intervention group had the statistical significance (P < 0.05), so the same as between control group and negative control group (P < 0.05). And there was no difference between intervention group and negative control group for the three behavioral experiments. The early supplementation with compound nutrition could postpone the occurrence and development of Alzheimer' s disease mice model's cognitive dysfunction.

  4. Statistical Analysis Aiming at Predicting Respiratory Tract Disease Hospital Admissions from Environmental Variables in the City of São Paulo

    PubMed Central

    de Sousa Zanotti Stagliorio Coêlho, Micheline; Luiz Teixeira Gonçalves, Fabio; do Rosário Dias de Oliveira Latorre, Maria

    2010-01-01

    This study is aimed at creating a stochastic model, named Brazilian Climate and Health Model (BCHM), through Poisson regression, in order to predict the occurrence of hospital respiratory admissions (for children under thirteen years of age) as a function of air pollutants, meteorological variables, and thermal comfort indices (effective temperatures, ET). The data used in this study were obtained from the city of São Paulo, Brazil, between 1997 and 2000. The respiratory tract diseases were divided into three categories: URI (Upper Respiratory tract diseases), LRI (Lower Respiratory tract diseases), and IP (Influenza and Pneumonia). The overall results of URI, LRI, and IP show clear correlation with SO2 and CO, PM10 and O3, and PM10, respectively, and the ETw4 (Effective Temperature) for all the three disease groups. It is extremely important to warn the government of the most populated city in Brazil about the outcome of this study, providing it with valuable information in order to help it better manage its resources on behalf of the whole population of the city of Sao Paulo, especially those with low incomes. PMID:20706674

  5. Analysis of Serum Interleukin (IL)-1β and IL-18 in Systemic Lupus Erythematosus.

    PubMed

    Mende, Rachel; Vincent, Fabien B; Kandane-Rathnayake, Rangi; Koelmeyer, Rachel; Lin, Emily; Chang, Janet; Hoi, Alberta Y; Morand, Eric F; Harris, James; Lang, Tali

    2018-01-01

    Systemic lupus erythematosus (SLE) is a chronic multisystem autoimmune disease characterized by biological and clinical heterogeneity. The interleukin (IL)-1 superfamily is a group of innate cytokines that contribute to pathogenesis in many autoimmune diseases. IL-1β and IL-18 are two members that have been shown to play a role in murine lupus-like models, but their role in human SLE remains poorly understood. Here, IL-1β and IL-18 were quantified by enzyme-linked immunosorbent assay in the serum of healthy controls (HCs) and SLE patients from a prospectively followed cohort. Disease activity and organ damage were assessed using SLE disease activity index 2000 (SLEDAI-2K) and SLE damage index scores (SDI), respectively. 184 SLE patients (mean age 44.9 years, 91% female, 56% double-stranded deoxyribonucleic acid positive) were compared to 52 HC. SLE patients had median [IQR] SLEDAI-2K of 4 [2,6], and SDI of 1 [0-2]. Serum IL-18 levels were statistically significantly higher in SLE patients compared to HCs. Univariable linear regression analyses showed that patients with active renal disease or irreversible organ damage had statistically significantly elevated serum IL-18 levels. The association between serum IL-18 and active renal disease was confirmed in multivariable analysis after adjusting for ethnicity and organ damage. High baseline serum IL-18 levels were associated with organ damage at the subsequent visit. Serum IL-1β levels were not significantly elevated in SLE patients when compared to HCs and had no association with overall or organ-specific disease activity or organ damage in cross-sectional and longitudinal analyses. Our data suggest that serum IL-18 and IL-1β have different clinical implications in SLE, with IL-18 being potentially associated with active renal disease.

  6. Visualizing statistical significance of disease clusters using cartograms.

    PubMed

    Kronenfeld, Barry J; Wong, David W S

    2017-05-15

    Health officials and epidemiological researchers often use maps of disease rates to identify potential disease clusters. Because these maps exaggerate the prominence of low-density districts and hide potential clusters in urban (high-density) areas, many researchers have used density-equalizing maps (cartograms) as a basis for epidemiological mapping. However, we do not have existing guidelines for visual assessment of statistical uncertainty. To address this shortcoming, we develop techniques for visual determination of statistical significance of clusters spanning one or more districts on a cartogram. We developed the techniques within a geovisual analytics framework that does not rely on automated significance testing, and can therefore facilitate visual analysis to detect clusters that automated techniques might miss. On a cartogram of the at-risk population, the statistical significance of a disease cluster is determinate from the rate, area and shape of the cluster under standard hypothesis testing scenarios. We develop formulae to determine, for a given rate, the area required for statistical significance of a priori and a posteriori designated regions under certain test assumptions. Uniquely, our approach enables dynamic inference of aggregate regions formed by combining individual districts. The method is implemented in interactive tools that provide choropleth mapping, automated legend construction and dynamic search tools to facilitate cluster detection and assessment of the validity of tested assumptions. A case study of leukemia incidence analysis in California demonstrates the ability to visually distinguish between statistically significant and insignificant regions. The proposed geovisual analytics approach enables intuitive visual assessment of statistical significance of arbitrarily defined regions on a cartogram. Our research prompts a broader discussion of the role of geovisual exploratory analyses in disease mapping and the appropriate framework for visually assessing the statistical significance of spatial clusters.

  7. Collagen morphology and texture analysis: from statistics to classification

    PubMed Central

    Mostaço-Guidolin, Leila B.; Ko, Alex C.-T.; Wang, Fei; Xiang, Bo; Hewko, Mark; Tian, Ganghong; Major, Arkady; Shiomi, Masashi; Sowa, Michael G.

    2013-01-01

    In this study we present an image analysis methodology capable of quantifying morphological changes in tissue collagen fibril organization caused by pathological conditions. Texture analysis based on first-order statistics (FOS) and second-order statistics such as gray level co-occurrence matrix (GLCM) was explored to extract second-harmonic generation (SHG) image features that are associated with the structural and biochemical changes of tissue collagen networks. Based on these extracted quantitative parameters, multi-group classification of SHG images was performed. With combined FOS and GLCM texture values, we achieved reliable classification of SHG collagen images acquired from atherosclerosis arteries with >90% accuracy, sensitivity and specificity. The proposed methodology can be applied to a wide range of conditions involving collagen re-modeling, such as in skin disorders, different types of fibrosis and muscular-skeletal diseases affecting ligaments and cartilage. PMID:23846580

  8. Constructing Rigorous and Broad Biosurveillance Networks for Detecting Emerging Zoonotic Outbreaks

    PubMed Central

    Brown, Mac; Moore, Leslie; McMahon, Benjamin; Powell, Dennis; LaBute, Montiago; Hyman, James M.; Rivas, Ariel; Jankowski, Mark; Berendzen, Joel; Loeppky, Jason; Manore, Carrie; Fair, Jeanne

    2015-01-01

    Determining optimal surveillance networks for an emerging pathogen is difficult since it is not known beforehand what the characteristics of a pathogen will be or where it will emerge. The resources for surveillance of infectious diseases in animals and wildlife are often limited and mathematical modeling can play a supporting role in examining a wide range of scenarios of pathogen spread. We demonstrate how a hierarchy of mathematical and statistical tools can be used in surveillance planning help guide successful surveillance and mitigation policies for a wide range of zoonotic pathogens. The model forecasts can help clarify the complexities of potential scenarios, and optimize biosurveillance programs for rapidly detecting infectious diseases. Using the highly pathogenic zoonotic H5N1 avian influenza 2006-2007 epidemic in Nigeria as an example, we determined the risk for infection for localized areas in an outbreak and designed biosurveillance stations that are effective for different pathogen strains and a range of possible outbreak locations. We created a general multi-scale, multi-host stochastic SEIR epidemiological network model, with both short and long-range movement, to simulate the spread of an infectious disease through Nigerian human, poultry, backyard duck, and wild bird populations. We chose parameter ranges specific to avian influenza (but not to a particular strain) and used a Latin hypercube sample experimental design to investigate epidemic predictions in a thousand simulations. We ranked the risk of local regions by the number of times they became infected in the ensemble of simulations. These spatial statistics were then complied into a potential risk map of infection. Finally, we validated the results with a known outbreak, using spatial analysis of all the simulation runs to show the progression matched closely with the observed location of the farms infected in the 2006-2007 epidemic. PMID:25946164

  9. Deep Artificial Neural Networks for the Diagnostic of Caries Using Socioeconomic and Nutritional Features as Determinants: Data from NHANES 2013⁻2014.

    PubMed

    Zanella-Calzada, Laura A; Galván-Tejada, Carlos E; Chávez-Lamas, Nubia M; Rivas-Gutierrez, Jesús; Magallanes-Quintanar, Rafael; Celaya-Padilla, Jose M; Galván-Tejada, Jorge I; Gamboa-Rosales, Hamurabi

    2018-06-18

    Oral health represents an essential component in the quality of life of people, being a determinant factor in general health since it may affect the risk of suffering other conditions, such as chronic diseases. Oral diseases have become one of the main public health problems, where dental caries is the condition that most affects oral health worldwide, occurring in about 90% of the global population. This condition has been considered a challenge because of its high prevalence, besides being a chronic but preventable disease which can be caused depending on the consumption of certain nutritional elements interacting simultaneously with different factors, such as socioeconomic factors. Based on this problem, an analysis of a set of 189 dietary and demographic determinants is performed in this work, in order to find the relationship between these factors and the oral situation of a set of subjects. The oral situation refers to the presence and absence/restorations of caries. The methodology is performed constructing a dense artificial neural network (ANN), as a computer-aided diagnosis tool, looking for a generalized model that allows for classifying subjects. As validation, the classification model was evaluated through a statistical analysis based on a cross validation, calculating the accuracy, loss function, receiving operating characteristic (ROC) curve and area under the curve (AUC) parameters. The results obtained were statistically significant, obtaining an accuracy ≃ 0.69 and AUC values of 0.69 and 0.75. Based on these results, it is possible to conclude that the classification model developed through the deep ANN is able to classify subjects with absence of caries from subjects with presence or restorations with high accuracy, according to their demographic and dietary factors.

  10. Applications and Comparisons of Four Time Series Models in Epidemiological Surveillance Data

    PubMed Central

    Young, Alistair A.; Li, Xiaosong

    2014-01-01

    Public health surveillance systems provide valuable data for reliable predication of future epidemic events. This paper describes a study that used nine types of infectious disease data collected through a national public health surveillance system in mainland China to evaluate and compare the performances of four time series methods, namely, two decomposition methods (regression and exponential smoothing), autoregressive integrated moving average (ARIMA) and support vector machine (SVM). The data obtained from 2005 to 2011 and in 2012 were used as modeling and forecasting samples, respectively. The performances were evaluated based on three metrics: mean absolute error (MAE), mean absolute percentage error (MAPE), and mean square error (MSE). The accuracy of the statistical models in forecasting future epidemic disease proved their effectiveness in epidemiological surveillance. Although the comparisons found that no single method is completely superior to the others, the present study indeed highlighted that the SVMs outperforms the ARIMA model and decomposition methods in most cases. PMID:24505382

  11. Current Mathematical Models for Analyzing Anti-Malarial Antibody Data with an Eye to Malaria Elimination and Eradication

    PubMed Central

    Sepúlveda, Nuno; Stresman, Gillian; White, Michael T.; Drakeley, Chris J.

    2015-01-01

    The last decade has witnessed a steady reduction of the malaria burden worldwide. With various countries targeting disease elimination in the near future, the popular parasite infection or entomological inoculation rates are becoming less and less informative of the underlying malaria burden due to a reduced number of infected individuals or mosquitoes at the time of sampling. To overcome such problem, alternative measures based on antibodies against specific malaria antigens have gained recent interest in malaria epidemiology due to the possibility of estimating past disease exposure in absence of infected individuals. This paper aims then to review current mathematical models and corresponding statistical approaches used in antibody data analysis. The application of these models is illustrated with three data sets from Equatorial Guinea, Brazilian Amazonia region, and western Kenyan highlands. A brief discussion is also carried out on the future challenges of using these models in the context of malaria elimination. PMID:26770994

  12. Self-reported heart disease among Arab and Chaldean American women residing in southeast Michigan.

    PubMed

    Jamil, Hikmet; Fakhouri, Monty; Dallo, Florence; Templin, Thomas; Khoury, Radwan; Fakhouri, Haifa

    2008-01-01

    This study estimates the prevalence of heart disease among Arab and Chaldean American women and examines the association between Arab and Chaldean ethnicity and heart disease among a sample of women. This was a cross-sectional study of a convenience sample of 2084 Arab, Chaldean, and African American women aged > or = 18 years who completed a survey that was distributed at churches, mosques, and small businesses in southeast Michigans. Logistic regression was used to estimate odds ratios and 95% confidence intervals for the association between ethnicity and self-reported heart disease before and after adjusting for demographic, socioeconomic status, health care, chronic conditions, and health behavior variables. A sample of 2084 Arab, Chaldean, and African American women 18 years of age and older. The overall prevalence of heart disease was 5.1%. Estimates were higher for Arabs (7.1%), lower for Chaldeans (6.6%), and lowest among African Americans (1.8%). In the unadjusted model, Chaldeans and Arabs were four times more likely to have heart disease than were African Americans. However, in the fully adjusted model, the association between Chaldean or Arab ethnicity and heart disease was no longer statistically significant. Arab or Chaldean ethnicity was not significantly associated with self-reported heart disease among women, which suggests that other factors account for this relationship. Future studies should collect more detailed socioeconomic status, acculturation, and health behavior information.

  13. I-Maculaweb: A Tool to Support Data Reuse in Ophthalmology

    PubMed Central

    Bonetto, Monica; Nicolò, Massimo; Gazzarata, Roberta; Fraccaro, Paolo; Rosa, Raffaella; Musetti, Donatella; Musolino, Maria; Traverso, Carlo E.

    2016-01-01

    This paper intends to present a Web-based application to collect and manage clinical data and clinical trials together in a unique tool. I-maculaweb is a user-friendly Web-application designed to manage, share, and analyze clinical data from patients affected by degenerative and vascular diseases of the macula. The unique and innovative scientific and technological elements of this project are the integration with individual and population data, relevant for degenerative and vascular diseases of the macula. Clinical records can also be extracted for statistical purposes and used for clinical decision support systems. I-maculaweb is based on an existing multilevel and multiscale data management model, which includes general principles that are suitable for several different clinical domains. The database structure has been specifically built to respect laterality, a key aspect in ophthalmology. Users can add and manage patient records, follow-up visits, treatment, diagnoses, and clinical history. There are two different modalities to extract records: one for the patient’s own center, in which personal details are shown and the other for statistical purposes, where all center’s anonymized data are visible. The Web-platform allows effective management, sharing, and reuse of information within primary care and clinical research. Clear and precise clinical data will improve understanding of real-life management of degenerative and vascular diseases of the macula as well as increasing precise epidemiologic and statistical data. Furthermore, this Web-based application can be easily employed as an electronic clinical research file in clinical studies. PMID:27170913

  14. Differing Interpretations of Report Terminology Between Primary Care Physicians and Radiologists.

    PubMed

    Gunn, Andrew J; Tuttle, Mitch C; Flores, Efren J; Mangano, Mark D; Bennett, Susan E; Sahani, Dushyant V; Choy, Garry; Boland, Giles W

    2016-12-01

    The lexicons of the radiologist and the referring physician may not be synonymous, which could cause confusion with radiology reporting. To further explore this possibility, we surveyed radiologists and primary care physicians (PCPs) regarding their respective interpretations of report terminology. A survey was distributed to radiologists and PCPs through an internal listserv. Respondents were asked to provide an interpretation of the statistical likelihood of the presence of metastatic disease based upon the terminology used within a hypothetical radiology report. Ten common modifying terms were evaluated. Potential responses for the statistical likelihoods included 0%-25%, 26%-50%, 51%-75%, 76%-99%, and 100%. Differences between the groups were evaluated using either a χ 2 test or Fisher exact test, as appropriate. The phrases "diagnostic for metastatic disease" and "represents metastatic disease" were selected by a high percentage of both groups as conferring a 100% likelihood of "true metastatic disease." The phrases "cannot exclude metastatic disease" and "may represent metastatic disease" were selected by a high proportion of both groups as conferring a 0% likelihood of "true metastatic disease." Radiologists assigned a higher statistical likelihood to the terms "diagnostic for metastatic disease" (P = .016), "represents metastatic disease" (P = .004), "suspicious for metastatic disease" (P = .04), "consistent with metastatic disease" (P < .0001), and "compatible with metastatic disease" (P = .003). A qualitative agreement among radiologists and PCPs exists concerning the significance of the evaluated terminology, although radiologists assigned a higher statistical likelihood than PCPs for several phrases. Copyright © 2016 American College of Radiology. Published by Elsevier Inc. All rights reserved.

  15. Statistical modelling coupled with LC-MS analysis to predict human upper intestinal absorption of phytochemical mixtures.

    PubMed

    Selby-Pham, Sophie N B; Howell, Kate S; Dunshea, Frank R; Ludbey, Joel; Lutz, Adrian; Bennett, Louise

    2018-04-15

    A diet rich in phytochemicals confers benefits for health by reducing the risk of chronic diseases via regulation of oxidative stress and inflammation (OSI). For optimal protective bio-efficacy, the time required for phytochemicals and their metabolites to reach maximal plasma concentrations (T max ) should be synchronised with the time of increased OSI. A statistical model has been reported to predict T max of individual phytochemicals based on molecular mass and lipophilicity. We report the application of the model for predicting the absorption profile of an uncharacterised phytochemical mixture, herein referred to as the 'functional fingerprint'. First, chemical profiles of phytochemical extracts were acquired using liquid chromatography mass spectrometry (LC-MS), then the molecular features for respective components were used to predict their plasma absorption maximum, based on molecular mass and lipophilicity. This method of 'functional fingerprinting' of plant extracts represents a novel tool for understanding and optimising the health efficacy of plant extracts. Copyright © 2017 Elsevier Ltd. All rights reserved.

  16. Quantitative Amyloid Imaging in Autosomal Dominant Alzheimer's Disease: Results from the DIAN Study Group.

    PubMed

    Su, Yi; Blazey, Tyler M; Owen, Christopher J; Christensen, Jon J; Friedrichsen, Karl; Joseph-Mathurin, Nelly; Wang, Qing; Hornbeck, Russ C; Ances, Beau M; Snyder, Abraham Z; Cash, Lisa A; Koeppe, Robert A; Klunk, William E; Galasko, Douglas; Brickman, Adam M; McDade, Eric; Ringman, John M; Thompson, Paul M; Saykin, Andrew J; Ghetti, Bernardino; Sperling, Reisa A; Johnson, Keith A; Salloway, Stephen P; Schofield, Peter R; Masters, Colin L; Villemagne, Victor L; Fox, Nick C; Förster, Stefan; Chen, Kewei; Reiman, Eric M; Xiong, Chengjie; Marcus, Daniel S; Weiner, Michael W; Morris, John C; Bateman, Randall J; Benzinger, Tammie L S

    2016-01-01

    Amyloid imaging plays an important role in the research and diagnosis of dementing disorders. Substantial variation in quantitative methods to measure brain amyloid burden exists in the field. The aim of this work is to investigate the impact of methodological variations to the quantification of amyloid burden using data from the Dominantly Inherited Alzheimer's Network (DIAN), an autosomal dominant Alzheimer's disease population. Cross-sectional and longitudinal [11C]-Pittsburgh Compound B (PiB) PET imaging data from the DIAN study were analyzed. Four candidate reference regions were investigated for estimation of brain amyloid burden. A regional spread function based technique was also investigated for the correction of partial volume effects. Cerebellar cortex, brain-stem, and white matter regions all had stable tracer retention during the course of disease. Partial volume correction consistently improves sensitivity to group differences and longitudinal changes over time. White matter referencing improved statistical power in the detecting longitudinal changes in relative tracer retention; however, the reason for this improvement is unclear and requires further investigation. Full dynamic acquisition and kinetic modeling improved statistical power although it may add cost and time. Several technical variations to amyloid burden quantification were examined in this study. Partial volume correction emerged as the strategy that most consistently improved statistical power for the detection of both longitudinal changes and across-group differences. For the autosomal dominant Alzheimer's disease population with PiB imaging, utilizing brainstem as a reference region with partial volume correction may be optimal for current interventional trials. Further investigation of technical issues in quantitative amyloid imaging in different study populations using different amyloid imaging tracers is warranted.

  17. Quantitative Amyloid Imaging in Autosomal Dominant Alzheimer’s Disease: Results from the DIAN Study Group

    PubMed Central

    Su, Yi; Blazey, Tyler M.; Owen, Christopher J.; Christensen, Jon J.; Friedrichsen, Karl; Joseph-Mathurin, Nelly; Wang, Qing; Hornbeck, Russ C.; Ances, Beau M.; Snyder, Abraham Z.; Cash, Lisa A.; Koeppe, Robert A.; Klunk, William E.; Galasko, Douglas; Brickman, Adam M.; McDade, Eric; Ringman, John M.; Thompson, Paul M.; Saykin, Andrew J.; Ghetti, Bernardino; Sperling, Reisa A.; Johnson, Keith A.; Salloway, Stephen P.; Schofield, Peter R.; Masters, Colin L.; Villemagne, Victor L.; Fox, Nick C.; Förster, Stefan; Chen, Kewei; Reiman, Eric M.; Xiong, Chengjie; Marcus, Daniel S.; Weiner, Michael W.; Morris, John C.; Bateman, Randall J.; Benzinger, Tammie L. S.

    2016-01-01

    Amyloid imaging plays an important role in the research and diagnosis of dementing disorders. Substantial variation in quantitative methods to measure brain amyloid burden exists in the field. The aim of this work is to investigate the impact of methodological variations to the quantification of amyloid burden using data from the Dominantly Inherited Alzheimer’s Network (DIAN), an autosomal dominant Alzheimer’s disease population. Cross-sectional and longitudinal [11C]-Pittsburgh Compound B (PiB) PET imaging data from the DIAN study were analyzed. Four candidate reference regions were investigated for estimation of brain amyloid burden. A regional spread function based technique was also investigated for the correction of partial volume effects. Cerebellar cortex, brain-stem, and white matter regions all had stable tracer retention during the course of disease. Partial volume correction consistently improves sensitivity to group differences and longitudinal changes over time. White matter referencing improved statistical power in the detecting longitudinal changes in relative tracer retention; however, the reason for this improvement is unclear and requires further investigation. Full dynamic acquisition and kinetic modeling improved statistical power although it may add cost and time. Several technical variations to amyloid burden quantification were examined in this study. Partial volume correction emerged as the strategy that most consistently improved statistical power for the detection of both longitudinal changes and across-group differences. For the autosomal dominant Alzheimer’s disease population with PiB imaging, utilizing brainstem as a reference region with partial volume correction may be optimal for current interventional trials. Further investigation of technical issues in quantitative amyloid imaging in different study populations using different amyloid imaging tracers is warranted. PMID:27010959

  18. Cardiovascular disease and non-steroidal anti-inflammatory drug prescribing in the midst of evolving guidelines.

    PubMed

    Pham, Timothy T; Miller, Michael J; Harrison, Donald L; Lloyd, Ann E; Crosby, Kimberly M; Johnson, Jeremy L

    2013-12-01

    Responding to safety concerns, the American Heart Association (AHA) published guidelines for non-steroidal anti-inflammatory drug (NSAID) use in patients with pre-existing cardiovascular disease (CVD) during 2005 and revised them in 2007. In the revision, a stepped approach to pain management recommended non-selective NSAIDs over highly selective NSAIDs. This research evaluated NSAID prescribing during and after guideline dissemination. A cross-sectional sample of 8666 adult, community-based practice visits with one NSAID prescription representing approximately 305 million visits from the National Ambulatory Medical Care Survey (NAMCS) from 2005 to 2010 was studied. Multivariable logistic regression controlling for patient, provider and visit characteristics assessed the associations between diagnosis of CVD and NSAID type prescribed during each calendar year. Visits were stratified by arthritis diagnosis to model short-term/intermittent and long-term NSAID use. Approximately one-third (36.8%) of visits involving a NSAID prescription included at least one of four diagnoses for CVD (i.e. hypertension, congestive heart failure, ischaemic heart disease or cerebrovascular disease). Visits involving a CVD diagnosis had increased odds of a prescription for celecoxib, a highly selective NSAIDs, overall [adjusted odds ratio (AOR) = 1.29, 95% confidence interval (CI): 1.06-1.57] and in the subgroup of visits without an arthritis diagnosis (AOR = 1.45, 95% CI: 1.11-1.89). Results were not statistically significant for visits with an arthritis diagnosis (AOR = 1.10, 95% CI: 0.47-2.57). When analysed by year, the relationship was statistically significant in 2005 and 2006, but not statistically significant in each subsequent year. National prescribing trends suggest partial implementation of AHA guidelines for NSAID prescribing in CVD from 2005 to 2010. © 2012 John Wiley & Sons Ltd.

  19. The Association of Statin Use with Age-Related Macular Degeneration Progression The Age-Related Eye Disease Study 2 Report Number 9

    PubMed Central

    Al-Holou, Shaza N.; Tucker, William R.; Agrón, Elvira; Clemons, Traci E.; Cukras, Catherine; Ferris, Frederick L.; Chew, Emily Y.

    2015-01-01

    Objective/purpose To evaluate the association of statin use with progression of age-related macular degeneration (AMD). Design Preplanned, prospective cohort study within a controlled clinical trial of oral supplementation for age-related eye diseases. Subjects Age-Related Eye Disease Study 2 participants, aged 50 to 85 years. Methods Factors, including age, gender, smoking status, aspirin use, and history of diabetes, hypertension, heart disease, angina, and stroke, all known to be associated with statin use, were included in a logistic regression model to estimate propensity scores for each participant. Age-adjusted proportional hazards regression models, with and without propensity score matching, were performed to evaluate the association of statin use with progression to late AMD. Analyses were also performed adjusting for the competing risk of death. Main Outcome Measures Baseline and annual stereoscopic fundus photographs were assessed centrally by masked graders for the development of late AMD, either neovascular AMD or geographic atrophy (GA). Results Of the 3791 participants (2462 with bilateral large drusen and 1329 with unilateral late AMD at baseline), 1659 (43.8%) were statin users. The overall analysis, with no matching of propensity scores and no adjustment for death as a competing risk, showed that statin use was not associated with progression to late AMD (hazard ratios [HR] of 1.08, 95% confidence intervals [CI] of 0.83–1.41, P=0.56). When matched for propensity scores and adjusted for death as a competing risk, the result was not statistically significant with HR: 0.81, 95% CI: 0.55–1.20, P=0.29. Further subgroup analyses of persons with or without late AMD at baseline to the various components of late AMD (neovascular, central geographic atrophy, or any geographic atrophy) also showed no statistically significant association of statin use with progression to AMD. Conclusions Statin use was not statistically significantly associated with the progression to late AMD in the AREDS2 participants, and these findings are consistent with the findings in the majority of previous studies. Statins have been demonstrated to reduce the risks of cardiovascular disease, but our data do not provide evidence of a beneficial effect on slowing AMD progression. PMID:26435335

  20. Validation of an Experimental Model to Study Less Severe Chronic Renal Failure.

    PubMed

    Fernandes-Charpiot, Ida Mária Maximina; Caldas, Heloisa Cristina; Mendes, Glória Elisa Florido; Gomes de Sá Neto, Luiz; Oliveira, Henrique Lacativa; Baptista, Maria Alice Sperto Ferreira; Abbud-Filho, Mario

    2016-10-01

    The 5/6 nephrectomy, mimics the stages of human chronic renal failure (CRF), but the procedure causes severe renal functional and morphological damage that could interfere with the evaluation of therapies for slowing the progression of the disease. This study summarizes the results of renal function, histology, and immunohistochemical findings in rats undergoing a 2/3 nephrectomy. The rats were distributed in groups according to the type of nephrectomy: CRF5/6: induced by a 5/6 renal mass reduction and CRF2/3: less severe CRF. The body weight and blood pressure were monitored, and the serum creatinine (SCr), creatinine clearance (CCr), urine osmolality, and 24-h proteinuria (PT24h) were measured. CRF progression was evaluated by the rate of decline of CCr (RCCr). Histology and immunohistochemistry were performed in the remnant kidneys. Statistical analysis was done by unpaired t-test, and a P-value < 0.05 was taken as a statistical significance. Compared to the CRF5/6 group, the CRF2/3 model had a lower SCr, PT24h, CCr, and variations of the SCr from baseline. The disease progression was also significantly slower. The renal histopathological findings revealed fewer chronic lesions in rats with CRF2/3. Similarly, we observed less macrophage accumulation as well as lower proliferative activity and expression of fibronectin and a-smooth muscle-actin in the CRF2/3 model. The CRF2/3 model presented with a pattern of less severe CRF, functionally and morphologically, compared to the classical CRF5/6 model, and the CRF2/3 model may be useful for evaluating therapeutic interventions that target the early stages of CRF.

  1. A point-based prediction model for cardiovascular risk in orthotopic liver transplantation: The CAR-OLT score.

    PubMed

    VanWagner, Lisa B; Ning, Hongyan; Whitsett, Maureen; Levitsky, Josh; Uttal, Sarah; Wilkins, John T; Abecassis, Michael M; Ladner, Daniela P; Skaro, Anton I; Lloyd-Jones, Donald M

    2017-12-01

    Cardiovascular disease (CVD) complications are important causes of morbidity and mortality after orthotopic liver transplantation (OLT). There is currently no preoperative risk-assessment tool that allows physicians to estimate the risk for CVD events following OLT. We sought to develop a point-based prediction model (risk score) for CVD complications after OLT, the Cardiovascular Risk in Orthotopic Liver Transplantation risk score, among a cohort of 1,024 consecutive patients aged 18-75 years who underwent first OLT in a tertiary-care teaching hospital (2002-2011). The main outcome measures were major 1-year CVD complications, defined as death from a CVD cause or hospitalization for a major CVD event (myocardial infarction, revascularization, heart failure, atrial fibrillation, cardiac arrest, pulmonary embolism, and/or stroke). The bootstrap method yielded bias-corrected 95% confidence intervals for the regression coefficients of the final model. Among 1,024 first OLT recipients, major CVD complications occurred in 329 (32.1%). Variables selected for inclusion in the model (using model optimization strategies) included preoperative recipient age, sex, race, employment status, education status, history of hepatocellular carcinoma, diabetes, heart failure, atrial fibrillation, pulmonary or systemic hypertension, and respiratory failure. The discriminative performance of the point-based score (C statistic = 0.78, bias-corrected C statistic = 0.77) was superior to other published risk models for postoperative CVD morbidity and mortality, and it had appropriate calibration (Hosmer-Lemeshow P = 0.33). The point-based risk score can identify patients at risk for CVD complications after OLT surgery (available at www.carolt.us); this score may be useful for identification of candidates for further risk stratification or other management strategies to improve CVD outcomes after OLT. (Hepatology 2017;66:1968-1979). © 2017 by the American Association for the Study of Liver Diseases.

  2. Predictions of space radiation fatality risk for exploration missions

    NASA Astrophysics Data System (ADS)

    Cucinotta, Francis A.; To, Khiet; Cacao, Eliedonna

    2017-05-01

    In this paper we describe revisions to the NASA Space Cancer Risk (NSCR) model focusing on updates to probability distribution functions (PDF) representing the uncertainties in the radiation quality factor (QF) model parameters and the dose and dose-rate reduction effectiveness factor (DDREF). We integrate recent heavy ion data on liver, colorectal, intestinal, lung, and Harderian gland tumors with other data from fission neutron experiments into the model analysis. In an earlier work we introduced distinct QFs for leukemia and solid cancer risk predictions, and here we consider liver cancer risks separately because of the higher RBE's reported in mouse experiments compared to other tumors types, and distinct risk factors for liver cancer for astronauts compared to the U.S. population. The revised model is used to make predictions of fatal cancer and circulatory disease risks for 1-year deep space and International Space Station (ISS) missions, and a 940 day Mars mission. We analyzed the contribution of the various model parameter uncertainties to the overall uncertainty, which shows that the uncertainties in relative biological effectiveness (RBE) factors at high LET due to statistical uncertainties and differences across tissue types and mouse strains are the dominant uncertainty. NASA's exposure limits are approached or exceeded for each mission scenario considered. Two main conclusions are made: 1) Reducing the current estimate of about a 3-fold uncertainty to a 2-fold or lower uncertainty will require much more expansive animal carcinogenesis studies in order to reduce statistical uncertainties and understand tissue, sex and genetic variations. 2) Alternative model assumptions such as non-targeted effects, increased tumor lethality and decreased latency at high LET, and non-cancer mortality risks from circulatory diseases could significantly increase risk estimates to several times higher than the NASA limits.

  3. Omics approaches to individual variation: modeling networks and the virtual patient.

    PubMed

    Lehrach, Hans

    2016-09-01

    Every human is unique. We differ in our genomes, environment, behavior, disease history, and past and current medical treatment-a complex catalog of differences that often leads to variations in the way each of us responds to a particular therapy. We argue here that true personalization of drug therapies will rely on "virtual patient" models based on a detailed characterization of the individual patient by molecular, imaging, and sensor techniques. The models will be based, wherever possible, on the molecular mechanisms of disease processes and drug action but can also expand to hybrid models including statistics/machine learning/artificial intelligence-based elements trained on available data to address therapeutic areas or therapies for which insufficient information on mechanisms is available. Depending on the disease, its mechanisms, and the therapy, virtual patient models can be implemented at a fairly high level of abstraction, with molecular models representing cells, cell types, or organs relevant to the clinical question, interacting not only with each other but also the environment. In the future, "virtual patient/in-silico self" models may not only become a central element of our health care system, reducing otherwise unavoidable mistakes and unnecessary costs, but also act as "guardian angels" accompanying us through life to protect us against dangers and to help us to deal intelligently with our own health and wellness.

  4. Omics approaches to individual variation: modeling networks and the virtual patient

    PubMed Central

    Lehrach, Hans

    2016-01-01

    Every human is unique. We differ in our genomes, environment, behavior, disease history, and past and current medical treatment—a complex catalog of differences that often leads to variations in the way each of us responds to a particular therapy. We argue here that true personalization of drug therapies will rely on “virtual patient” models based on a detailed characterization of the individual patient by molecular, imaging, and sensor techniques. The models will be based, wherever possible, on the molecular mechanisms of disease processes and drug action but can also expand to hybrid models including statistics/machine learning/artificial intelligence-based elements trained on available data to address therapeutic areas or therapies for which insufficient information on mechanisms is available. Depending on the disease, its mechanisms, and the therapy, virtual patient models can be implemented at a fairly high level of abstraction, with molecular models representing cells, cell types, or organs relevant to the clinical question, interacting not only with each other but also the environment. In the future, “virtual patient/in-silico self” models may not only become a central element of our health care system, reducing otherwise unavoidable mistakes and unnecessary costs, but also act as “guardian angels” accompanying us through life to protect us against dangers and to help us to deal intelligently with our own health and wellness. PMID:27757060

  5. Two-locus maximum lod score analysis of a multifactorial trait: joint consideration of IDDM2 and IDDM4 with IDDM1 in type 1 diabetes.

    PubMed

    Cordell, H J; Todd, J A; Bennett, S T; Kawaguchi, Y; Farrall, M

    1995-10-01

    To investigate the genetic component of multifactorial diseases such as type 1 (insulin-dependent) diabetes mellitus (IDDM), models involving the joint action of several disease loci are important. These models can give increased power to detect an effect and a greater understanding of etiological mechanisms. Here, we present an extension of the maximum lod score method of N. Risch, which allows the simultaneous detection and modeling of two unlinked disease loci. Genetic constraints on the identical-by-descent sharing probabilities, analogous to the "triangle" restrictions in the single-locus method, are derived, and the size and power of the test statistics are investigated. The method is applied to affected-sib-pair data, and the joint effects of IDDM1 (HLA) and IDDM2 (the INS VNTR) and of IDDM1 and IDDM4 (FGF3-linked) are assessed with relation to the development of IDDM. In the presence of genetic heterogeneity, there is seen to be a significant advantage in analyzing more than one locus simultaneously. Analysis of these families indicates that the effects at IDDM1 and IDDM2 are well described by a multiplicative genetic model, while those at IDDM1 and IDDM4 follow a heterogeneity model.

  6. Two-locus maximum lod score analysis of a multifactorial trait: joint consideration of IDDM2 and IDDM4 with IDDM1 in type 1 diabetes.

    PubMed Central

    Cordell, H J; Todd, J A; Bennett, S T; Kawaguchi, Y; Farrall, M

    1995-01-01

    To investigate the genetic component of multifactorial diseases such as type 1 (insulin-dependent) diabetes mellitus (IDDM), models involving the joint action of several disease loci are important. These models can give increased power to detect an effect and a greater understanding of etiological mechanisms. Here, we present an extension of the maximum lod score method of N. Risch, which allows the simultaneous detection and modeling of two unlinked disease loci. Genetic constraints on the identical-by-descent sharing probabilities, analogous to the "triangle" restrictions in the single-locus method, are derived, and the size and power of the test statistics are investigated. The method is applied to affected-sib-pair data, and the joint effects of IDDM1 (HLA) and IDDM2 (the INS VNTR) and of IDDM1 and IDDM4 (FGF3-linked) are assessed with relation to the development of IDDM. In the presence of genetic heterogeneity, there is seen to be a significant advantage in analyzing more than one locus simultaneously. Analysis of these families indicates that the effects at IDDM1 and IDDM2 are well described by a multiplicative genetic model, while those at IDDM1 and IDDM4 follow a heterogeneity model. PMID:7573054

  7. Integrative pipeline for profiling DNA copy number and inferring tumor phylogeny.

    PubMed

    Urrutia, Eugene; Chen, Hao; Zhou, Zilu; Zhang, Nancy R; Jiang, Yuchao

    2018-06-15

    Copy number variation is an important and abundant source of variation in the human genome, which has been associated with a number of diseases, especially cancer. Massively parallel next-generation sequencing allows copy number profiling with fine resolution. Such efforts, however, have met with mixed successes, with setbacks arising partly from the lack of reliable analytical methods to meet the diverse and unique challenges arising from the myriad experimental designs and study goals in genetic studies. In cancer genomics, detection of somatic copy number changes and profiling of allele-specific copy number (ASCN) are complicated by experimental biases and artifacts as well as normal cell contamination and cancer subclone admixture. Furthermore, careful statistical modeling is warranted to reconstruct tumor phylogeny by both somatic ASCN changes and single nucleotide variants. Here we describe a flexible computational pipeline, MARATHON, which integrates multiple related statistical software for copy number profiling and downstream analyses in disease genetic studies. MARATHON is publicly available at https://github.com/yuchaojiang/MARATHON. Supplementary data are available at Bioinformatics online.

  8. Occupational skin diseases in Korea.

    PubMed

    Ahn, Yeon-Soon; Kim, Min-Gi

    2010-12-01

    Skin disease is the most common occupational disease, but the reported number is small in Korea due to a difficulty of detection and diagnosis in time. We described various official statistics and data from occupational skin disease surveillance system, epidemiological surveys and cases published in scientific journals. Until 1981, 2,222 cases of occupational skin disease were reported by Korean employee's regular medical check-up, accounting for 4.9% of the total occupational diseases. There was no subsequent official statistics to figure out occupational skin diseases till 1998. From 1999, the Korea Occupational Safety and Health Agency (KOSHA) published the number of occupational skin diseases through the statistics of Cause Investigation for Industrial Accidents. A total of 301 cases were reported from 1999 to 2007. Recent one study showed the figures of compensated occupational skin diseases. Many of them belonged to daily-paid workers in the public service, especially forestry workers. Also, it described the interesting cases such as vitiligo and trichloroethylene-induced Stevens-Johnson Syndrome. Skin diseases are still important though the number of cases has decreased, and therefore it is recommended to grasp the status of occupational skin diseases through continuous surveillance system and to make policy protecting high-risk group.

  9. Into the environment of mosquito-borne disease: A spatial analysis of vector distribution using traditional and remotely sensed methods

    NASA Astrophysics Data System (ADS)

    Brown, Heidi E.

    Spatially explicit information is increasingly available for infectious disease modeling. However, such information is reluctantly or inappropriately incorporated. My dissertation research uses spatially explicit data to assess relationships between landscape and mosquito species distribution and discusses challenges regarding accurate predictive risk modeling. The goal of my research is to use remotely sensed environmental information and spatial statistical methods to better understand mosquito-borne disease epidemiology for improvement of public health responses. In addition to reviewing the progress of spatial infectious disease modeling, I present four research projects. I begin by evaluating the biases in surveillance data and build up to predictive modeling of mosquito species presence. In the first study I explore how mosquito surveillance trap types influence estimations of mosquito populations. Then. I use county-based human surveillance data and landscape variables to identify risk factors for West Nile virus disease. The third study uses satellite-based vegetation indices to identify spatial variation among West Nile virus vectors in an urban area and relates the variability to virus transmission dynamics. Finally, I explore how information from three satellite sensors of differing spatial and spectral resolution can be used to identify and distinguish mosquito habitat across central Connecticut wetlands. Analyses presented here constitute improvements to the prediction of mosquito distribution and therefore identification of disease risk factors. Current methods for mosquito surveillance data collection are labor intensive and provide an extremely limited, incomplete picture of the species composition and abundance. Human surveillance data offers additional challenges with respect to reporting bias and resolution, but is nonetheless informative in identifying environmental risk factors and disease transmission dynamics. Remotely sensed imagery supports mosquito and human disease surveillance data by providing spatially explicit, line resolution information about environmental factors relevant to vector-borne disease processes. Together, surveillance and remotely sensed environmental data facilitate improved description and modeling of disease transmission. Remote sensing can be used to develop predictive maps of mosquito distribution in relation to disease risk. This has implications for increased accuracy of mosquito control efforts. The projects presented in this dissertation enhance current public health capacities by examining the applications of spatial modeling with respect to mosquito-borne disease.

  10. The development of a stochastic mathematical model of Alzheimer’s disease to help improve the design of clinical trials of potential treatments

    PubMed Central

    Ower, Alison K.; de Wolf, Frank; Anderson, Roy M.

    2018-01-01

    Alzheimer’s disease (AD) is a neurodegenerative disorder characterised by a slow progressive deterioration of cognitive capacity. Drugs are urgently needed for the treatment of AD and unfortunately almost all clinical trials of AD drug candidates have failed or been discontinued to date. Mathematical, computational and statistical tools can be employed in the construction of clinical trial simulators to assist in the improvement of trial design and enhance the chances of success of potential new therapies. Based on the analysis of a set of clinical data provided by the Alzheimer's Disease Neuroimaging Initiative (ADNI) we developed a simple stochastic mathematical model to simulate the development and progression of Alzheimer’s in a longitudinal cohort study. We show how this modelling framework could be used to assess the effect and the chances of success of hypothetical treatments that are administered at different stages and delay disease development. We demonstrate that the detection of the true efficacy of an AD treatment can be very challenging, even if the treatment is highly effective. An important reason behind the inability to detect signals of efficacy in a clinical trial in this therapy area could be the high between- and within-individual variability in the measurement of diagnostic markers and endpoints, which consequently results in the misdiagnosis of an individual’s disease state. PMID:29377891

  11. The development of a stochastic mathematical model of Alzheimer's disease to help improve the design of clinical trials of potential treatments.

    PubMed

    Hadjichrysanthou, Christoforos; Ower, Alison K; de Wolf, Frank; Anderson, Roy M

    2018-01-01

    Alzheimer's disease (AD) is a neurodegenerative disorder characterised by a slow progressive deterioration of cognitive capacity. Drugs are urgently needed for the treatment of AD and unfortunately almost all clinical trials of AD drug candidates have failed or been discontinued to date. Mathematical, computational and statistical tools can be employed in the construction of clinical trial simulators to assist in the improvement of trial design and enhance the chances of success of potential new therapies. Based on the analysis of a set of clinical data provided by the Alzheimer's Disease Neuroimaging Initiative (ADNI) we developed a simple stochastic mathematical model to simulate the development and progression of Alzheimer's in a longitudinal cohort study. We show how this modelling framework could be used to assess the effect and the chances of success of hypothetical treatments that are administered at different stages and delay disease development. We demonstrate that the detection of the true efficacy of an AD treatment can be very challenging, even if the treatment is highly effective. An important reason behind the inability to detect signals of efficacy in a clinical trial in this therapy area could be the high between- and within-individual variability in the measurement of diagnostic markers and endpoints, which consequently results in the misdiagnosis of an individual's disease state.

  12. Dopamine Transporter Neuroimaging as an Enrichment Biomarker in Early Parkinson's Disease Clinical Trials: A Disease Progression Modeling Analysis.

    PubMed

    Conrado, Daniela J; Nicholas, Timothy; Tsai, Kuenhi; Macha, Sreeraj; Sinha, Vikram; Stone, Julie; Corrigan, Brian; Bani, Massimo; Muglia, Pierandrea; Watson, Ian A; Kern, Volker D; Sheveleva, Elena; Marek, Kenneth; Stephenson, Diane T; Romero, Klaus

    2018-01-01

    Given the recognition that disease-modifying therapies should focus on earlier Parkinson's disease stages, trial enrollment based purely on clinical criteria poses significant challenges. The goal herein was to determine the utility of dopamine transporter neuroimaging as an enrichment biomarker in early motor Parkinson's disease clinical trials. Patient-level longitudinal data of 672 subjects with early-stage Parkinson's disease in the Parkinson's Progression Markers Initiative (PPMI) observational study and the Parkinson Research Examination of CEP-1347 Trial (PRECEPT) clinical trial were utilized in a linear mixed-effects model analysis. The rate of worsening in the motor scores between subjects with or without a scan without evidence of dopamine transporter deficit was different both statistically and clinically. The average difference in the change from baseline of motor scores at 24 months between biomarker statuses was -3.16 (90% confidence interval [CI] = -0.96 to -5.42) points. Dopamine transporter imaging could identify subjects with a steeper worsening of the motor scores, allowing trial enrichment and 24% reduction of sample size. Published 2017. This article is a U.S. Government work and is in the public domain in the USA. Clinical and Translational Science published by Wiley Periodicals, Inc. on behalf of American Society for Clinical Pharmacology and Therapeutics.

  13. Accounting for undetected compounds in statistical analyses of mass spectrometry 'omic studies.

    PubMed

    Taylor, Sandra L; Leiserowitz, Gary S; Kim, Kyoungmi

    2013-12-01

    Mass spectrometry is an important high-throughput technique for profiling small molecular compounds in biological samples and is widely used to identify potential diagnostic and prognostic compounds associated with disease. Commonly, this data generated by mass spectrometry has many missing values resulting when a compound is absent from a sample or is present but at a concentration below the detection limit. Several strategies are available for statistically analyzing data with missing values. The accelerated failure time (AFT) model assumes all missing values result from censoring below a detection limit. Under a mixture model, missing values can result from a combination of censoring and the absence of a compound. We compare power and estimation of a mixture model to an AFT model. Based on simulated data, we found the AFT model to have greater power to detect differences in means and point mass proportions between groups. However, the AFT model yielded biased estimates with the bias increasing as the proportion of observations in the point mass increased while estimates were unbiased with the mixture model except if all missing observations came from censoring. These findings suggest using the AFT model for hypothesis testing and mixture model for estimation. We demonstrated this approach through application to glycomics data of serum samples from women with ovarian cancer and matched controls.

  14. Mining heart disease risk factors in clinical text with named entity recognition and distributional semantic models.

    PubMed

    Urbain, Jay

    2015-12-01

    We present the design, and analyze the performance of a multi-stage natural language processing system employing named entity recognition, Bayesian statistics, and rule logic to identify and characterize heart disease risk factor events in diabetic patients over time. The system was originally developed for the 2014 i2b2 Challenges in Natural Language in Clinical Data. The system's strengths included a high level of accuracy for identifying named entities associated with heart disease risk factor events. The system's primary weakness was due to inaccuracies when characterizing the attributes of some events. For example, determining the relative time of an event with respect to the record date, whether an event is attributable to the patient's history or the patient's family history, and differentiating between current and prior smoking status. We believe these inaccuracies were due in large part to the lack of an effective approach for integrating context into our event detection model. To address these inaccuracies, we explore the addition of a distributional semantic model for characterizing contextual evidence of heart disease risk factor events. Using this semantic model, we raise our initial 2014 i2b2 Challenges in Natural Language of Clinical data F1 score of 0.838 to 0.890 and increased precision by 10.3% without use of any lexicons that might bias our results. Copyright © 2015 Elsevier Inc. All rights reserved.

  15. Interacting opinion and disease dynamics in multiplex networks: Discontinuous phase transition and nonmonotonic consensus times

    NASA Astrophysics Data System (ADS)

    Velásquez-Rojas, Fátima; Vazquez, Federico

    2017-05-01

    Opinion formation and disease spreading are among the most studied dynamical processes on complex networks. In real societies, it is expected that these two processes depend on and affect each other. However, little is known about the effects of opinion dynamics over disease dynamics and vice versa, since most studies treat them separately. In this work we study the dynamics of the voter model for opinion formation intertwined with that of the contact process for disease spreading, in a population of agents that interact via two types of connections, social and contact. These two interacting dynamics take place on two layers of networks, coupled through a fraction q of links present in both networks. The probability that an agent updates its state depends on both the opinion and disease states of the interacting partner. We find that the opinion dynamics has striking consequences on the statistical properties of disease spreading. The most important is that the smooth (continuous) transition from a healthy to an endemic phase observed in the contact process, as the infection probability increases beyond a threshold, becomes abrupt (discontinuous) in the two-layer system. Therefore, disregarding the effects of social dynamics on epidemics propagation may lead to a misestimation of the real magnitude of the spreading. Also, an endemic-healthy discontinuous transition is found when the coupling q overcomes a threshold value. Furthermore, we show that the disease dynamics delays the opinion consensus, leading to a consensus time that varies nonmonotonically with q in a large range of the model's parameters. A mean-field approach reveals that the coupled dynamics of opinions and disease can be approximately described by the dynamics of the voter model decoupled from that of the contact process, with effective probabilities of opinion and disease transmission.

  16. A meta-analysis of interleukin-10-1082 promoter genetic polymorphism associated with atherosclerotic risk.

    PubMed

    Chao, Li; Lei, Huang; Fei, Jin

    2014-01-01

    This meta-analysis was conducted to assess the relationship between interleukin-10-1082 G/A single nucleotide polymorphism with atherosclerosis (AS) risk. The databases of PubMed, EMBASE, Chinese National Knowledge Infrastructure and Wan-Fang were searched from January 2000 to January 2014. 16 studies (involving 7779 cases and 7271 controls) were finally included. Each eligible study was scored for quality assessment. We adopted the most probably appropriate genetic model (recessive model) after carefully calculation. Between study heterogeneity was explored by subgroup analysis and publication bias was estimated by Begg's funnel plot and Egger's regression test. Statistically significant association was observed between AA genotype with overall AS risk, being mainly in coronary heart disease and stroke subgroups among Asian population, and peripheral artery disease (PAD) subgroup among Caucasians. Interleukin-10-1082 AA genotype is associated with increased overall AS risk. AA carriers of Asians seem to be more susceptible to coronary artery disease and stroke, and Caucasians are more susceptible to PAD.

  17. A framework for the interpretation of de novo mutation in human disease

    PubMed Central

    Samocha, Kaitlin E.; Robinson, Elise B.; Sanders, Stephan J.; Stevens, Christine; Sabo, Aniko; McGrath, Lauren M.; Kosmicki, Jack A.; Rehnström, Karola; Mallick, Swapan; Kirby, Andrew; Wall, Dennis P.; MacArthur, Daniel G.; Gabriel, Stacey B.; dePristo, Mark; Purcell, Shaun M.; Palotie, Aarno; Boerwinkle, Eric; Buxbaum, Joseph D.; Cook, Edwin H.; Gibbs, Richard A.; Schellenberg, Gerard D.; Sutcliffe, James S.; Devlin, Bernie; Roeder, Kathryn; Neale, Benjamin M.; Daly, Mark J.

    2014-01-01

    Spontaneously arising (‘de novo’) mutations play an important role in medical genetics. For diseases with extensive locus heterogeneity – such as autism spectrum disorders (ASDs) – the signal from de novo mutations (DNMs) is distributed across many genes, making it difficult to distinguish disease-relevant mutations from background variation. We provide a statistical framework for the analysis of DNM excesses per gene and gene set by calibrating a model of de novo mutation. We applied this framework to DNMs collected from 1,078 ASD trios and – while affirming a significant role for loss-of-function (LoF) mutations – found no excess of de novo LoF mutations in cases with IQ above 100, suggesting that the role of DNMs in ASD may reside in fundamental neurodevelopmental processes. We also used our model to identify ~1,000 genes that are significantly lacking functional coding variation in non-ASD samples and are enriched for de novo LoF mutations identified in ASD cases. PMID:25086666

  18. Design and validation of a model to predict early mortality in haemodialysis patients.

    PubMed

    Mauri, Joan M; Clèries, Montse; Vela, Emili

    2008-05-01

    Mortality and morbidity rates are higher in patients receiving haemodialysis therapy than in the general population. Detection of risk factors related to early death in these patients could be of aid for clinical and administrative decision making. Objectives. The aims of this study were (1) to identify risk factors (comorbidity and variables specific to haemodialysis) associated with death in the first year following the start of haemodialysis and (2) to design and validate a prognostic model to quantify the probability of death for each patient. An analysis was carried out on all patients starting haemodialysis treatment in Catalonia during the period 1997-2003 (n = 5738). The data source was the Renal Registry of Catalonia, a mandatory population registry. Patients were randomly divided into two samples: 60% (n = 3455) of the total were used to develop the prognostic model and the remaining 40% (n = 2283) to validate the model. Logistic regression analysis was used to construct the model. One-year mortality in the total study population was 16.5%. The predictive model included the following variables: age, sex, primary renal disease, grade of functional autonomy, chronic obstructive pulmonary disease, malignant processes, chronic liver disease, cardiovascular disease, initial vascular access and malnutrition. The analyses showed adequate calibration for both the sample to develop the model and the validation sample (Hosmer-Lemeshow statistic 0.97 and P = 0.49, respectively) as well as adequate discrimination (ROC curve 0.78 in both cases). Risk factors implicated in mortality at one year following the start of haemodialysis have been determined and a prognostic model designed. The validated, easy-to-apply model quantifies individual patient risk attributable to various factors, some of them amenable to correction by directed interventions.

  19. Improving spatio-temporal model estimation of satellite-derived PM2.5 concentrations: Implications for public health

    NASA Astrophysics Data System (ADS)

    Barik, M. G.; Al-Hamdan, M. Z.; Crosson, W. L.; Yang, C. A.; Coffield, S. R.

    2017-12-01

    Satellite-derived environmental data, available in a range of spatio-temporal scales, are contributing to the growing use of health impact assessments of air pollution in the public health sector. Models developed using correlation of Moderate Resolution Imaging Spectrometer (MODIS) Aerosol Optical Depth (AOD) with ground measurements of fine particulate matter less than 2.5 microns (PM2.5) are widely applied to measure PM2.5 spatial and temporal variability. In the public health sector, associations of PM2.5 with respiratory and cardiovascular diseases are often investigated to quantify air quality impacts on these health concerns. In order to improve predictability of PM2.5 estimation using correlation models, we have included meteorological variables, higher-resolution AOD products and instantaneous PM2.5 observations into statistical estimation models. Our results showed that incorporation of high-resolution (1-km) Multi-Angle Implementation of Atmospheric Correction (MAIAC)-generated MODIS AOD, meteorological variables and instantaneous PM2.5 observations improved model performance in various parts of California (CA), USA, where single variable AOD-based models showed relatively weak performance. In this study, we further asked whether these improved models actually would be more successful for exploring associations of public health outcomes with estimated PM2.5. To answer this question, we geospatially investigated model-estimated PM2.5's relationship with respiratory and cardiovascular diseases such as asthma, high blood pressure, coronary heart disease, heart attack and stroke in CA using health data from the Centers for Disease Control and Prevention (CDC)'s Wide-ranging Online Data for Epidemiologic Research (WONDER) and the Behavioral Risk Factor Surveillance System (BRFSS). PM2.5 estimation from these improved models have the potential to improve our understanding of associations between public health concerns and air quality.

  20. Assessing risk factors for periodontitis using regression

    NASA Astrophysics Data System (ADS)

    Lobo Pereira, J. A.; Ferreira, Maria Cristina; Oliveira, Teresa

    2013-10-01

    Multivariate statistical analysis is indispensable to assess the associations and interactions between different factors and the risk of periodontitis. Among others, regression analysis is a statistical technique widely used in healthcare to investigate and model the relationship between variables. In our work we study the impact of socio-demographic, medical and behavioral factors on periodontal health. Using regression, linear and logistic models, we can assess the relevance, as risk factors for periodontitis disease, of the following independent variables (IVs): Age, Gender, Diabetic Status, Education, Smoking status and Plaque Index. The multiple linear regression analysis model was built to evaluate the influence of IVs on mean Attachment Loss (AL). Thus, the regression coefficients along with respective p-values will be obtained as well as the respective p-values from the significance tests. The classification of a case (individual) adopted in the logistic model was the extent of the destruction of periodontal tissues defined by an Attachment Loss greater than or equal to 4 mm in 25% (AL≥4mm/≥25%) of sites surveyed. The association measures include the Odds Ratios together with the correspondent 95% confidence intervals.

  1. Do non-targeted effects increase or decrease low dose risk in relation to the linear-non-threshold (LNT) model?☆

    PubMed Central

    Little, M.P.

    2011-01-01

    In this paper we review the evidence for departure from linearity for malignant and non-malignant disease and in the light of this assess likely mechanisms, and in particular the potential role for non-targeted effects. Excess cancer risks observed in the Japanese atomic bomb survivors and in many medically and occupationally exposed groups exposed at low or moderate doses are generally statistically compatible. For most cancer sites the dose–response in these groups is compatible with linearity over the range observed. The available data on biological mechanisms do not provide general support for the idea of a low dose threshold or hormesis. This large body of evidence does not suggest, indeed is not statistically compatible with, any very large threshold in dose for cancer, or with possible hormetic effects, and there is little evidence of the sorts of non-linearity in response implied by non-DNA-targeted effects. There are also excess risks of various types of non-malignant disease in the Japanese atomic bomb survivors and in other groups. In particular, elevated risks of cardiovascular disease, respiratory disease and digestive disease are observed in the A-bomb data. In contrast with cancer, there is much less consistency in the patterns of risk between the various exposed groups; for example, radiation-associated respiratory and digestive diseases have not been seen in these other (non-A-bomb) groups. Cardiovascular risks have been seen in many exposed populations, particularly in medically exposed groups, but in contrast with cancer there is much less consistency in risk between studies: risks per unit dose in epidemiological studies vary over at least two orders of magnitude, possibly a result of confounding and effect modification by well known (but unobserved) risk factors. In the absence of a convincing mechanistic explanation of epidemiological evidence that is, at present, less than persuasive, a cause-and-effect interpretation of the reported statistical associations for cardiovascular disease is unreliable but cannot be excluded. Inflammatory processes are the most likely mechanism by which radiation could modify the atherosclerotic disease process. If there is to be modification by low doses of ionizing radiation of cardiovascular disease through this mechanism, a role for non-DNA-targeted effects cannot be excluded. PMID:20105434

  2. Spatio-temporal patterns of Barmah Forest virus disease in Queensland, Australia.

    PubMed

    Naish, Suchithra; Hu, Wenbiao; Mengersen, Kerrie; Tong, Shilu

    2011-01-01

    Barmah Forest virus (BFV) disease is a common and wide-spread mosquito-borne disease in Australia. This study investigated the spatio-temporal patterns of BFV disease in Queensland, Australia using geographical information system (GIS) tools and geostatistical analysis. We calculated the incidence rates and standardised incidence rates of BFV disease. Moran's I statistic was used to assess the spatial autocorrelation of BFV incidences. Spatial dynamics of BFV disease was examined using semi-variogram analysis. Interpolation techniques were applied to visualise and display the spatial distribution of BFV disease in statistical local areas (SLAs) throughout Queensland. Mapping of BFV disease by SLAs reveals the presence of substantial spatio-temporal variation over time. Statistically significant differences in BFV incidence rates were identified among age groups (χ(2) = 7587, df = 7327,p<0.01). There was a significant positive spatial autocorrelation of BFV incidence for all four periods, with the Moran's I statistic ranging from 0.1506 to 0.2901 (p<0.01). Semi-variogram analysis and smoothed maps created from interpolation techniques indicate that the pattern of spatial autocorrelation was not homogeneous across the state. This is the first study to examine spatial and temporal variation in the incidence rates of BFV disease across Queensland using GIS and geostatistics. The BFV transmission varied with age and gender, which may be due to exposure rates or behavioural risk factors. There are differences in the spatio-temporal patterns of BFV disease which may be related to local socio-ecological and environmental factors. These research findings may have implications in the BFV disease control and prevention programs in Queensland.

  3. Is demography destiny? Application of machine learning techniques to accurately predict population health outcomes from a minimal demographic dataset.

    PubMed

    Luo, Wei; Nguyen, Thin; Nichols, Melanie; Tran, Truyen; Rana, Santu; Gupta, Sunil; Phung, Dinh; Venkatesh, Svetha; Allender, Steve

    2015-01-01

    For years, we have relied on population surveys to keep track of regional public health statistics, including the prevalence of non-communicable diseases. Because of the cost and limitations of such surveys, we often do not have the up-to-date data on health outcomes of a region. In this paper, we examined the feasibility of inferring regional health outcomes from socio-demographic data that are widely available and timely updated through national censuses and community surveys. Using data for 50 American states (excluding Washington DC) from 2007 to 2012, we constructed a machine-learning model to predict the prevalence of six non-communicable disease (NCD) outcomes (four NCDs and two major clinical risk factors), based on population socio-demographic characteristics from the American Community Survey. We found that regional prevalence estimates for non-communicable diseases can be reasonably predicted. The predictions were highly correlated with the observed data, in both the states included in the derivation model (median correlation 0.88) and those excluded from the development for use as a completely separated validation sample (median correlation 0.85), demonstrating that the model had sufficient external validity to make good predictions, based on demographics alone, for areas not included in the model development. This highlights both the utility of this sophisticated approach to model development, and the vital importance of simple socio-demographic characteristics as both indicators and determinants of chronic disease.

  4. A Comprehensive Study of Retinal Vessel Classification Methods in Fundus Images

    PubMed Central

    Miri, Maliheh; Amini, Zahra; Rabbani, Hossein; Kafieh, Raheleh

    2017-01-01

    Nowadays, it is obvious that there is a relationship between changes in the retinal vessel structure and diseases such as diabetic, hypertension, stroke, and the other cardiovascular diseases in adults as well as retinopathy of prematurity in infants. Retinal fundus images provide non-invasive visualization of the retinal vessel structure. Applying image processing techniques in the study of digital color fundus photographs and analyzing their vasculature is a reliable approach for early diagnosis of the aforementioned diseases. Reduction in the arteriolar–venular ratio of retina is one of the primary signs of hypertension, diabetic, and cardiovascular diseases which can be calculated by analyzing the fundus images. To achieve a precise measuring of this parameter and meaningful diagnostic results, accurate classification of arteries and veins is necessary. Classification of vessels in fundus images faces with some challenges that make it difficult. In this paper, a comprehensive study of the proposed methods for classification of arteries and veins in fundus images is presented. Considering that these methods are evaluated on different datasets and use different evaluation criteria, it is not possible to conduct a fair comparison of their performance. Therefore, we evaluate the classification methods from modeling perspective. This analysis reveals that most of the proposed approaches have focused on statistics, and geometric models in spatial domain and transform domain models have received less attention. This could suggest the possibility of using transform models, especially data adaptive ones, for modeling of the fundus images in future classification approaches. PMID:28553578

  5. How allele frequency and study design affect association test statistics with misrepresentation errors.

    PubMed

    Escott-Price, Valentina; Ghodsi, Mansoureh; Schmidt, Karl Michael

    2014-04-01

    We evaluate the effect of genotyping errors on the type-I error of a general association test based on genotypes, showing that, in the presence of errors in the case and control samples, the test statistic asymptotically follows a scaled non-central $\\chi ^2$ distribution. We give explicit formulae for the scaling factor and non-centrality parameter for the symmetric allele-based genotyping error model and for additive and recessive disease models. They show how genotyping errors can lead to a significantly higher false-positive rate, growing with sample size, compared with the nominal significance levels. The strength of this effect depends very strongly on the population distribution of the genotype, with a pronounced effect in the case of rare alleles, and a great robustness against error in the case of large minor allele frequency. We also show how these results can be used to correct $p$-values.

  6. The effects of iterative reconstruction in CT on low-contrast liver lesion volumetry: a phantom study

    NASA Astrophysics Data System (ADS)

    Li, Qin; Berman, Benjamin P.; Schumacher, Justin; Liang, Yongguang; Gavrielides, Marios A.; Yang, Hao; Zhao, Binsheng; Petrick, Nicholas

    2017-03-01

    Tumor volume measured from computed tomography images is considered a biomarker for disease progression or treatment response. The estimation of the tumor volume depends on the imaging system parameters selected, as well as lesion characteristics. In this study, we examined how different image reconstruction methods affect the measurement of lesions in an anthropomorphic liver phantom with a non-uniform background. Iterative statistics-based and model-based reconstructions, as well as filtered back-projection, were evaluated and compared in this study. Statistics-based and filtered back-projection yielded similar estimation performance, while model-based yielded higher precision but lower accuracy in the case of small lesions. Iterative reconstructions exhibited higher signal-to-noise ratio but slightly lower contrast of the lesion relative to the background. A better understanding of lesion volumetry performance as a function of acquisition parameters and lesion characteristics can lead to its incorporation as a routine sizing tool.

  7. Calling in sick: impacts of fever on intra-urban human mobility.

    PubMed

    Perkins, T Alex; Paz-Soldan, Valerie A; Stoddard, Steven T; Morrison, Amy C; Forshey, Brett M; Long, Kanya C; Halsey, Eric S; Kochel, Tadeusz J; Elder, John P; Kitron, Uriel; Scott, Thomas W; Vazquez-Prokopec, Gonzalo M

    2016-07-13

    Pathogens inflict a wide variety of disease manifestations on their hosts, yet the impacts of disease on the behaviour of infected hosts are rarely studied empirically and are seldom accounted for in mathematical models of transmission dynamics. We explored the potential impacts of one of the most common disease manifestations, fever, on a key determinant of pathogen transmission, host mobility, in residents of the Amazonian city of Iquitos, Peru. We did so by comparing two groups of febrile individuals (dengue-positive and dengue-negative) with an afebrile control group. A retrospective, semi-structured interview allowed us to quantify multiple aspects of mobility during the two-week period preceding each interview. We fitted nested models of each aspect of mobility to data from interviews and compared models using likelihood ratio tests to determine whether there were statistically distinguishable differences in mobility attributable to fever or its aetiology. Compared with afebrile individuals, febrile study participants spent more time at home, visited fewer locations, and, in some cases, visited locations closer to home and spent less time at certain types of locations. These multifaceted impacts are consistent with the possibility that disease-mediated changes in host mobility generate dynamic and complex changes in host contact network structure. © 2016 The Author(s).

  8. Calling in sick: impacts of fever on intra-urban human mobility

    PubMed Central

    Perkins, T. Alex; Paz-Soldan, Valerie A.; Stoddard, Steven T.; Morrison, Amy C.; Forshey, Brett M.; Long, Kanya C.; Halsey, Eric S.; Kochel, Tadeusz J.; Elder, John P.; Kitron, Uriel; Scott, Thomas W.; Vazquez-Prokopec, Gonzalo M.

    2016-01-01

    Pathogens inflict a wide variety of disease manifestations on their hosts, yet the impacts of disease on the behaviour of infected hosts are rarely studied empirically and are seldom accounted for in mathematical models of transmission dynamics. We explored the potential impacts of one of the most common disease manifestations, fever, on a key determinant of pathogen transmission, host mobility, in residents of the Amazonian city of Iquitos, Peru. We did so by comparing two groups of febrile individuals (dengue-positive and dengue-negative) with an afebrile control group. A retrospective, semi-structured interview allowed us to quantify multiple aspects of mobility during the two-week period preceding each interview. We fitted nested models of each aspect of mobility to data from interviews and compared models using likelihood ratio tests to determine whether there were statistically distinguishable differences in mobility attributable to fever or its aetiology. Compared with afebrile individuals, febrile study participants spent more time at home, visited fewer locations, and, in some cases, visited locations closer to home and spent less time at certain types of locations. These multifaceted impacts are consistent with the possibility that disease-mediated changes in host mobility generate dynamic and complex changes in host contact network structure. PMID:27412286

  9. Smoking, health-related quality of life and economic evaluation.

    PubMed

    López-Nicolás, Ángel; Trapero-Bertran, Marta; Muñoz, Celia

    2018-06-01

    The economic evaluation of tobacco control policies requires the adoption of assumptions about the impact of changes in smoking status on health-related quality of life (HRQoL). Estimates for such impacts are necessary for different populations. This paper aims to test whether smoking status has an independent effect on HRQoL over and above the effect derived from the increased likelihood of suffering a tobacco related disease, and to calculate utility values for the Spanish population. Using data from the Spanish Encuesta Nacional de Salud of 2011-12, we estimate statistical models for HRQoL as measured by the EQ-5D-5L instrument as a function of smoking status. We include a comprehensive set of controls for biological, clinical, lifestyle and socioeconomic characteristics. Smoking status has an independent, statistically significant effect on HRQoL. However, the size of the effect is small. The typical smoking related diseases, such as lung cancer, are associated with a reduction in HRQoL about 5 times larger than the difference between current smokers and never smokers. Attributing substantive HRQoL gains to quitting smoking as well as accounting for the concomitant HRQoL gain derived from a smaller likelihood of contracting tobacco related diseases might lead to an overestimation of the benefits of tobacco control policies. Nonetheless, the relatively large drops in HRQoL associated with being diagnosed with diseases that might be causally linked to tobacco suggest that such diseases should not be omitted from the economic evaluations of tobacco control policies.

  10. Association of chronic disease prevalence and quality of life with suicide-related ideation and suicide attempt among Korean adults

    PubMed Central

    Joshi, Pankaj; Song, Han-Byol; Lee, Sang-Ah

    2017-01-01

    Aims: The aim of this study is to find the association of chronic disease prevalence (CDP) with suicide-related ideation (SI) and suicide attempt (SA) and to determine the combined effect of CDP and quality of life (QoL) with SI or SA. Design: This was a cross-sectional study. Materials and Methods: The data were collected from the nationally representative Korea National Health and Nutrition Examination Survey IV and V (2007–2012). For the analysis, a total of 35,075 adult participants were selected as the final sample, which included 5773 participants with SI and 331 with SA. Statistical Analysis: Multiple logistic regression models were used to examine the odds ratio after adjusting for age, sex, marital status, education, occupation, and household income. Results and Conclusion: SI was positively associated with selected CDP, such as cardiovascular disease (CVD), stroke, ischemic heart disease (IHD), cancer, diabetes, renal failure, and depression, except hypertension. Subjects with CVD, IHD, renal failure, and depression were found likely to have increased odds for SA as compared to non-SA controls. Lower QoL strongly affected SI and SA. Furthermore, the likelihood of SI increased for depressed and cancer subjects who had low QoL in comparison to subjects with high QoL and without chronic disease. Similarly, statistically, significant interaction was observed between lower QoL and depression in relation to SA compared to non-SA controls. These data suggest that suicide-related behavior could be predicted by the prevalence of chronic disease and low QoL. PMID:29085096

  11. A method for age-matched OCT angiography deviation mapping in the assessment of disease- related changes to the radial peripapillary capillaries.

    PubMed

    Pinhas, Alexander; Linderman, Rachel; Mo, Shelley; Krawitz, Brian D; Geyman, Lawrence S; Carroll, Joseph; Rosen, Richard B; Chui, Toco Y

    2018-01-01

    To present a method for age-matched deviation mapping in the assessment of disease-related changes to the radial peripapillary capillaries (RPCs). We reviewed 4.5x4.5mm en face peripapillary OCT-A scans of 133 healthy control eyes (133 subjects, mean 41.5 yrs, range 11-82 yrs) and 4 eyes with distinct retinal pathologies, obtained using spectral-domain optical coherence tomography angiography. Statistical analysis was performed to evaluate the impact of age on RPC perfusion densities. RPC density group mean and standard deviation maps were generated for each decade of life. Deviation maps were created for the diseased eyes based on these maps. Large peripapillary vessel (LPV; noncapillary vessel) perfusion density was also studied for impact of age. Average healthy RPC density was 42.5±1.47%. ANOVA and pairwise Tukey-Kramer tests showed that RPC density in the ≥60yr group was significantly lower compared to RPC density in all younger decades of life (p<0.01). Average healthy LPV density was 21.5±3.07%. Linear regression models indicated that LPV density decreased with age, however ANOVA and pairwise Tukey-Kramer tests did not reach statistical significance. Deviation mapping enabled us to quantitatively and visually elucidate the significance of RPC density changes in disease. It is important to consider changes that occur with aging when analyzing RPC and LPV density changes in disease. RPC density, coupled with age-matched deviation mapping techniques, represents a potentially clinically useful method in detecting changes to peripapillary perfusion in disease.

  12. Multidimensional structure-function relationships in human β-cardiac myosin from population-scale genetic variation

    PubMed Central

    Homburger, Julian R.; Green, Eric M.; Caleshu, Colleen; Sunitha, Margaret S.; Taylor, Rebecca E.; Ruppel, Kathleen M.; Metpally, Raghu Prasad Rao; Colan, Steven D.; Michels, Michelle; Day, Sharlene M.; Olivotto, Iacopo; Bustamante, Carlos D.; Dewey, Frederick E.; Ho, Carolyn Y.; Spudich, James A.; Ashley, Euan A.

    2016-01-01

    Myosin motors are the fundamental force-generating elements of muscle contraction. Variation in the human β-cardiac myosin heavy chain gene (MYH7) can lead to hypertrophic cardiomyopathy (HCM), a heritable disease characterized by cardiac hypertrophy, heart failure, and sudden cardiac death. How specific myosin variants alter motor function or clinical expression of disease remains incompletely understood. Here, we combine structural models of myosin from multiple stages of its chemomechanical cycle, exome sequencing data from two population cohorts of 60,706 and 42,930 individuals, and genetic and phenotypic data from 2,913 patients with HCM to identify regions of disease enrichment within β-cardiac myosin. We first developed computational models of the human β-cardiac myosin protein before and after the myosin power stroke. Then, using a spatial scan statistic modified to analyze genetic variation in protein 3D space, we found significant enrichment of disease-associated variants in the converter, a kinetic domain that transduces force from the catalytic domain to the lever arm to accomplish the power stroke. Focusing our analysis on surface-exposed residues, we identified a larger region significantly enriched for disease-associated variants that contains both the converter domain and residues on a single flat surface on the myosin head described as the myosin mesa. Notably, patients with HCM with variants in the enriched regions have earlier disease onset than patients who have HCM with variants elsewhere. Our study provides a model for integrating protein structure, large-scale genetic sequencing, and detailed phenotypic data to reveal insight into time-shifted protein structures and genetic disease. PMID:27247418

  13. When human walking becomes random walking: fractal analysis and modeling of gait rhythm fluctuations

    NASA Astrophysics Data System (ADS)

    Hausdorff, Jeffrey M.; Ashkenazy, Yosef; Peng, Chang-K.; Ivanov, Plamen Ch.; Stanley, H. Eugene; Goldberger, Ary L.

    2001-12-01

    We present a random walk, fractal analysis of the stride-to-stride fluctuations in the human gait rhythm. The gait of healthy young adults is scale-free with long-range correlations extending over hundreds of strides. This fractal scaling changes characteristically with maturation in children and older adults and becomes almost completely uncorrelated with certain neurologic diseases. Stochastic modeling of the gait rhythm dynamics, based on transitions between different “neural centers”, reproduces distinctive statistical properties of the gait pattern. By tuning one model parameter, the hopping (transition) range, the model can describe alterations in gait dynamics from childhood to adulthood - including a decrease in the correlation and volatility exponents with maturation.

  14. Evaluating the Appropriateness of Downscaled Climate Information for Projecting Risks of Salmonella

    PubMed Central

    Guentchev, Galina S.; Rood, Richard B.; Ammann, Caspar M.; Barsugli, Joseph J.; Ebi, Kristie; Berrocal, Veronica; O’Neill, Marie S.; Gronlund, Carina J.; Vigh, Jonathan L.; Koziol, Ben; Cinquini, Luca

    2016-01-01

    Foodborne diseases have large economic and societal impacts worldwide. To evaluate how the risks of foodborne diseases might change in response to climate change, credible and usable climate information tailored to the specific application question is needed. Global Climate Model (GCM) data generally need to, both, be downscaled to the scales of the application to be usable, and represent, well, the key characteristics that inflict health impacts. This study presents an evaluation of temperature-based heat indices for the Washington D.C. area derived from statistically downscaled GCM simulations for 1971–2000—a necessary step in establishing the credibility of these data. The indices approximate high weekly mean temperatures linked previously to occurrences of Salmonella infections. Due to bias-correction, included in the Asynchronous Regional Regression Model (ARRM) and the Bias Correction Constructed Analogs (BCCA) downscaling methods, the observed 30-year means of the heat indices were reproduced reasonably well. In April and May, however, some of the statistically downscaled data misrepresent the increase in the number of hot days towards the summer months. This study demonstrates the dependence of the outcomes to the selection of downscaled climate data and the potential for misinterpretation of future estimates of Salmonella infections. PMID:26938544

  15. BagMOOV: A novel ensemble for heart disease prediction bootstrap aggregation with multi-objective optimized voting.

    PubMed

    Bashir, Saba; Qamar, Usman; Khan, Farhan Hassan

    2015-06-01

    Conventional clinical decision support systems are based on individual classifiers or simple combination of these classifiers which tend to show moderate performance. This research paper presents a novel classifier ensemble framework based on enhanced bagging approach with multi-objective weighted voting scheme for prediction and analysis of heart disease. The proposed model overcomes the limitations of conventional performance by utilizing an ensemble of five heterogeneous classifiers: Naïve Bayes, linear regression, quadratic discriminant analysis, instance based learner and support vector machines. Five different datasets are used for experimentation, evaluation and validation. The datasets are obtained from publicly available data repositories. Effectiveness of the proposed ensemble is investigated by comparison of results with several classifiers. Prediction results of the proposed ensemble model are assessed by ten fold cross validation and ANOVA statistics. The experimental evaluation shows that the proposed framework deals with all type of attributes and achieved high diagnosis accuracy of 84.16 %, 93.29 % sensitivity, 96.70 % specificity, and 82.15 % f-measure. The f-ratio higher than f-critical and p value less than 0.05 for 95 % confidence interval indicate that the results are extremely statistically significant for most of the datasets.

  16. Bayesian Analysis for Risk Assessment of Selected Medical Events in Support of the Integrated Medical Model Effort

    NASA Technical Reports Server (NTRS)

    Gilkey, Kelly M.; Myers, Jerry G.; McRae, Michael P.; Griffin, Elise A.; Kallrui, Aditya S.

    2012-01-01

    The Exploration Medical Capability project is creating a catalog of risk assessments using the Integrated Medical Model (IMM). The IMM is a software-based system intended to assist mission planners in preparing for spaceflight missions by helping them to make informed decisions about medical preparations and supplies needed for combating and treating various medical events using Probabilistic Risk Assessment. The objective is to use statistical analyses to inform the IMM decision tool with estimated probabilities of medical events occurring during an exploration mission. Because data regarding astronaut health are limited, Bayesian statistical analysis is used. Bayesian inference combines prior knowledge, such as data from the general U.S. population, the U.S. Submarine Force, or the analog astronaut population located at the NASA Johnson Space Center, with observed data for the medical condition of interest. The posterior results reflect the best evidence for specific medical events occurring in flight. Bayes theorem provides a formal mechanism for combining available observed data with data from similar studies to support the quantification process. The IMM team performed Bayesian updates on the following medical events: angina, appendicitis, atrial fibrillation, atrial flutter, dental abscess, dental caries, dental periodontal disease, gallstone disease, herpes zoster, renal stones, seizure, and stroke.

  17. Community Characteristics and Mortality: The Relative Strength of Association of Different Community Characteristics

    PubMed Central

    Roberts, Eric; McCleary, Rachael; Buttorff, Christine; Gaskin, Darrell J.

    2014-01-01

    Objectives. We compared the strength of association between average 5-year county-level mortality rates and area-level measures, including air quality, sociodemographic characteristics, violence, and economic distress. Methods. We obtained mortality data from the National Vital Statistics System and linked it to socioeconomic and demographic data from the Census Bureau, air quality data, violent crime statistics, and loan delinquency data. We modeled 5-year average mortality rates (1998–2002) for all-cause, cancer, heart disease, stroke, and respiratory diseases as a function of county-level characteristics using ordinary least squares regression models. We limited analyses to counties with population of 100 000 or greater (n = 458). Results. Demographic and socioeconomic characteristics, particularly the percentage older than 65 years and near poor, were top predictors of all-cause and condition-specific mortality, as were a high concentration of construction and service workers. We found weaker associations for air quality, mortgage delinquencies, and violent crimes. Protective characteristics included the percentage of Hispanics, Asians, and married residents. Conclusions. Multiple factors influence county-level mortality. Although county demographic and socioeconomic characteristics are important, there are independent, although weaker, associations of other environmental characteristics. Future studies should investigate these factors to better understand community mortality risk. PMID:25033152

  18. Effectiveness of self-management promotion educational program among diabetic patients based on health belief model

    PubMed Central

    Jalilian, Farzad; Motlagh, Fazel Zinat; Solhi, Mahnaz; Gharibnavaz, Hasan

    2014-01-01

    Introduction: Diabetes is a chronic disease; it can cause serious complications. Diabetes self-management is essential for prevention of disease complications. This study was conducted to evaluate self-management promotion educational program intervention efficiency among diabetic patients in Iran and health belief model (HBM) was applied as a theoretical framework. Materials and Methods: Overall, 120 Type 2 diabetic patients referred to rural health centers in Gachsaran, Iran participated in this study as randomly divided into intervention and control group. This was a longitudinal randomized pre- and post-test series control group design panel study to implement a behavior modification based intervention to promotion self-management among diabetic patients. Cross-tabulation and t-test by using SPSS statistical package, version 16 was used for the statistical analysis. Results: Mean age was 55.07 years (SD = 9.94, range: 30-70). Our result shows significant improvements in average response for susceptibility, severity, benefit and self-management among intervention group. Additionally, after intervention, average response of the barrier to self-management was decreased among intervention group. Conclusion: Our result showed education program based on HBM was improve of self-management and seems implementing these programs can be effective in the and prevention of diabetes complications. PMID:24741654

  19. Morning and Evening Home Blood Pressure and Risks of Incident Stroke and Coronary Artery Disease in the Japanese General Practice Population: The Japan Morning Surge-Home Blood Pressure Study.

    PubMed

    Hoshide, Satoshi; Yano, Yuichiro; Haimoto, Hajime; Yamagiwa, Kayo; Uchiba, Kiyoshi; Nagasaka, Shoichiro; Matsui, Yoshio; Nakamura, Akira; Fukutomi, Motoki; Eguchi, Kazuo; Ishikawa, Joji; Kario, Kazuomi

    2016-07-01

    Our aim is to determine the optimal time schedule for home blood pressure (BP) monitoring that best predicts stroke and coronary artery disease in general practice. The Japan Morning Surge-Home Blood Pressure (J-HOP) study is a nationwide practice-based study that included 4310 Japanese with a history of or risk factors for cardiovascular disease, or both (mean age, 65 years; 79% used antihypertensive medication). Home BP measures were taken twice daily (morning and evening) over 14 days at baseline. During a mean follow-up of 4 years (16 929 person-years), 74 stroke and 77 coronary artery disease events occurred. Morning systolic BP (SBP) improved the discrimination of incident stroke (C statistics, 0.802; 95% confidence interval, 0.692-0.911) beyond traditional risk factors including office SBP (0.756; 0.646-0.866), whereas the changes were smaller with evening SBP (0.764; 0.653-0.874). The addition of evening SBP to the model (including traditional risk factors plus morning SBP) significantly reduced the discrimination of incident stroke (C statistics difference, -0.008; 95% confidence interval: -0.015 to -0.008; P=0.03). The category-free net reclassification improvement (0.3606; 95% confidence interval, 0.1317-0.5896), absolute integrated discrimination improvement (0.015; SE, 0.005), and relative integrated discrimination improvement (58.3%; all P<0.01) with the addition of morning SBP to the model (including traditional risk factors) were greater than those with evening SBP and with combined morning and evening SBP. Neither morning nor evening SBP improved coronary artery disease risk prediction. Morning home SBP itself should be evaluated to ensure best stroke prediction in clinical practice, at least in Japan. This should be confirmed in the different ethnic groups. URL: http://www.umin.ac.jp/ctr/. Unique identifier: UMIN000000894. © 2016 American Heart Association, Inc.

  20. Cost-effectiveness analysis of a randomized trial comparing care models for chronic kidney disease.

    PubMed

    Hopkins, Robert B; Garg, Amit X; Levin, Adeera; Molzahn, Anita; Rigatto, Claudio; Singer, Joel; Soltys, George; Soroka, Steven; Parfrey, Patrick S; Barrett, Brendan J; Goeree, Ron

    2011-06-01

    Potential cost and effectiveness of a nephrologist/nurse-based multifaceted intervention for stage 3 to 4 chronic kidney disease are not known. This study examines the cost-effectiveness of a chronic disease management model for chronic kidney disease. Cost and cost-effectiveness were prospectively gathered alongside a multicenter trial. The Canadian Prevention of Renal and Cardiovascular Endpoints Trial (CanPREVENT) randomized 236 patients to receive usual care (controls) and another 238 patients to multifaceted nurse/nephrologist-supported care that targeted factors associated with development of kidney and cardiovascular disease (intervention). Cost and outcomes over 2 years were examined to determine the incremental cost-effectiveness of the intervention. Base-case analysis included disease-related costs, and sensitivity analysis included all costs. Consideration of all costs produced statistically significant differences. A lower number of days in hospital explained most of the cost difference. For both base-case and sensitivity analyses with all costs included, the intervention group required fewer resources and had higher quality of life. The direction of the results was unchanged to inclusion of various types of costs, consideration of payer or societal perspective, changes to the discount rate, and levels of GFR. The nephrologist/nurse-based multifaceted intervention represents good value for money because it reduces costs without reducing quality of life for patients with chronic kidney disease.

Top