Sample records for negative binomial probability

  1. Zero-truncated negative binomial - Erlang distribution

    NASA Astrophysics Data System (ADS)

    Bodhisuwan, Winai; Pudprommarat, Chookait; Bodhisuwan, Rujira; Saothayanun, Luckhana

    2017-11-01

    The zero-truncated negative binomial-Erlang distribution is introduced. It is developed from the negative binomial-Erlang distribution. In this work, the probability mass function is derived and some properties are included. The parameters of the zero-truncated negative binomial-Erlang distribution are estimated by maximum likelihood estimation. Finally, the proposed distribution is applied to real data, a count data set on methamphetamine in Bangkok, Thailand. Based on the results, the zero-truncated negative binomial-Erlang distribution provided a better fit than the zero-truncated Poisson, zero-truncated negative binomial, zero-truncated generalized negative binomial and zero-truncated Poisson-Lindley distributions for these data.
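
    The zero-truncation idea can be illustrated with a plain zero-truncated negative binomial (not the negative binomial-Erlang mixture of this record): the ordinary pmf is renormalized by 1 - P(X = 0) and the parameters are found by maximum likelihood. A minimal sketch; the counts and parameter names are made up for illustration.

    ```python
    # Minimal sketch of a zero-truncated negative binomial fit (plain NB, not the
    # NB-Erlang mixture described in the record); counts and parameters are illustrative.
    import numpy as np
    from scipy import stats, optimize

    def zt_nbinom_logpmf(x, r, p):
        """log P(X = x | X > 0) for a negative binomial truncated at zero."""
        log_pmf = stats.nbinom.logpmf(x, r, p)
        log_norm = np.log1p(-stats.nbinom.pmf(0, r, p))  # log(1 - P(X = 0))
        return log_pmf - log_norm

    def fit_zt_nbinom(counts):
        """Maximum likelihood estimates of (r, p) from strictly positive counts."""
        def nll(theta):
            r = np.exp(theta[0])                       # keep r > 0
            p = 1.0 / (1.0 + np.exp(-theta[1]))        # keep 0 < p < 1
            return -np.sum(zt_nbinom_logpmf(counts, r, p))
        res = optimize.minimize(nll, x0=[0.0, 0.0], method="Nelder-Mead")
        return np.exp(res.x[0]), 1.0 / (1.0 + np.exp(-res.x[1]))

    counts = np.array([1, 1, 2, 1, 3, 5, 1, 2, 8, 1, 4, 2])  # hypothetical positive counts
    r_hat, p_hat = fit_zt_nbinom(counts)
    print(r_hat, p_hat)
    ```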

  2. Use of the negative binomial-truncated Poisson distribution in thunderstorm prediction

    NASA Technical Reports Server (NTRS)

    Cohen, A. C.

    1971-01-01

    A probability model is presented for the distribution of thunderstorms over a small area given that thunderstorm events (1 or more thunderstorms) are occurring over a larger area. The model incorporates the negative binomial and truncated Poisson distributions. Probability tables for Cape Kennedy for spring, summer, and fall months and seasons are presented. The computer program used to compute these probabilities is appended.

  3. The Difference Calculus and the Negative Binomial Distribution

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bowman, Kimiko O.; Shenton, L. R.

    2007-01-01

    In a previous paper we state the dominant term in the third central moment of the maximum likelihood estimator of the parameter k in the negative binomial probability function where the probability generating function is (p + 1 - pt)^(-k). A partial sum of the series Σ 1/(k + x)^3 is involved, where x is a negative binomial random variate. In expectation this sum can only be found numerically using the computer. Here we give a simple definite integral in (0,1) for the generalized case. This means that now we do have a valid expression for √β_11(k) and √β_11(p). In addition we use the finite difference operator Δ, and E = 1 + Δ, to set up formulas for low-order moments. Other examples of the operators are quoted relating to the orthogonal set of polynomials associated with the negative binomial probability function used as a weight function.

  4. Binomial test statistics using Psi functions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bowman, Kimiko O.

    2007-01-01

    For the negative binomial model (probability generating function (p + 1 - pt)^(-k)), a logarithmic derivative is the psi-function difference ψ(k + x) - ψ(k); this and its derivatives lead to a test statistic to decide on the validity of a specified model. The test statistic uses a data base, so a comparison between theory and application is available. Note that the test function is not dominated by outliers. Applications to (i) Fisher's tick data, (ii) accidents data, and (iii) Weldon's dice data are included.
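
    The logarithmic-derivative term ψ(k + x) - ψ(k) is just a digamma-function difference and can be evaluated directly; a small sketch with an illustrative k and made-up counts (this is not the full test statistic of the record).

    ```python
    # Evaluate the psi-function difference psi(k + x) - psi(k) over observed counts;
    # k and the counts are illustrative values, not taken from the record.
    import numpy as np
    from scipy.special import digamma

    k = 1.8
    counts = np.array([0, 1, 1, 2, 4, 0, 3, 7, 1, 2])

    psi_diff = digamma(k + counts) - digamma(k)  # one term per observation
    print(psi_diff.mean())  # average contribution, the kind of quantity a score-type test sums
    ```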

  5. Distribution pattern of public transport passenger in Yogyakarta, Indonesia

    NASA Astrophysics Data System (ADS)

    Narendra, Alfa; Malkhamah, Siti; Sopha, Bertha Maya

    2018-03-01

    The arrival and departure distribution patterns of Trans Jogja bus passengers are fundamental models for simulation. The purpose of this paper is to build models of passenger flows. This research used passenger data from January to May 2014. No policy change to the operating system currently affects the nature of this pattern: the roads, buses, land uses, schedule, and people are relatively the same. The data were then categorized by direction, day, and location, and each category was fitted to several well-known discrete distributions. The candidate distributions were compared by their AIC and BIC values, and the negative binomial distribution was found to have the smallest AIC and BIC. Probability mass function (PMF) plots of these models were compared to draw a generic model from the categorical negative binomial distribution models. The accepted generic negative binomial distribution has a parameter value of 0.7064 and a mu of 1.4504. The minimum and maximum passenger values of the distribution are 0 and 41.
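
    The AIC/BIC-based selection among candidate discrete distributions can be sketched as follows, comparing a Poisson and a negative binomial fit by maximum likelihood; the counts are made up and do not reproduce the Trans Jogja data or the full candidate set.

    ```python
    # Compare Poisson and negative binomial fits to count data by AIC and BIC;
    # the counts below are illustrative, not the Trans Jogja passenger data.
    import numpy as np
    from scipy import stats, optimize

    counts = np.array([0, 1, 0, 2, 3, 0, 1, 5, 0, 0, 2, 1, 4, 0, 1, 7, 2, 0, 1, 3])
    n = len(counts)

    # Poisson: the MLE of the rate is the sample mean.
    lam = counts.mean()
    ll_pois = stats.poisson.logpmf(counts, lam).sum()
    aic_pois = 2 * 1 - 2 * ll_pois
    bic_pois = np.log(n) * 1 - 2 * ll_pois

    # Negative binomial: numerical MLE of (size r, prob p).
    def nll(theta):
        r, p = np.exp(theta[0]), 1 / (1 + np.exp(-theta[1]))
        return -stats.nbinom.logpmf(counts, r, p).sum()
    res = optimize.minimize(nll, x0=[0.0, 0.0], method="Nelder-Mead")
    ll_nb = -res.fun
    aic_nb = 2 * 2 - 2 * ll_nb
    bic_nb = np.log(n) * 2 - 2 * ll_nb

    print("Poisson  AIC %.1f  BIC %.1f" % (aic_pois, bic_pois))
    print("Neg-bin  AIC %.1f  BIC %.1f" % (aic_nb, bic_nb))
    ```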

  6. Probabilistic assessment of precipitation-triggered landslides using historical records of landslide occurrence, Seattle, Washington

    USGS Publications Warehouse

    Coe, J.A.; Michael, J.A.; Crovelli, R.A.; Savage, W.Z.; Laprade, W.T.; Nashem, W.D.

    2004-01-01

    Ninety years of historical landslide records were used as input to the Poisson and binomial probability models. Results from these models show that, for precipitation-triggered landslides, approximately 9 percent of the area of Seattle has annual exceedance probabilities of 1 percent or greater. Application of the Poisson model for estimating the future occurrence of individual landslides results in a worst-case scenario map, with a maximum annual exceedance probability of 25 percent on a hillslope near Duwamish Head in West Seattle. Application of the binomial model for estimating the future occurrence of a year with one or more landslides results in a map with a maximum annual exceedance probability of 17 percent (also near Duwamish Head). Slope and geology both play a role in localizing the occurrence of landslides in Seattle. A positive correlation exists between slope and mean exceedance probability, with probability tending to increase as slope increases. Sixty-four percent of all historical landslide locations are within 150 m (500 ft, horizontal distance) of the Esperance Sand/Lawton Clay contact, but within this zone, no positive or negative correlation exists between exceedance probability and distance to the contact.
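
    The two exceedance-probability calculations have simple closed forms: the Poisson model gives 1 - exp(-λ) for the annual rate λ of individual landslides, and the binomial model uses the fraction of years with at least one landslide. A sketch with hypothetical counts, not the Seattle record.

    ```python
    # Annual exceedance probability of one or more landslides in a grid cell,
    # from a 90-year record; the counts are hypothetical, not the Seattle data.
    import numpy as np

    years = 90
    n_landslides = 25        # individual landslides recorded in the cell
    years_with_event = 20    # years in which at least one landslide occurred

    # Poisson model: rate of individual landslides per year.
    lam = n_landslides / years
    p_exceed_poisson = 1.0 - np.exp(-lam)

    # Binomial model: fraction of years with at least one landslide.
    p_exceed_binomial = years_with_event / years

    print(p_exceed_poisson, p_exceed_binomial)
    ```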

  7. Exploring the effects of roadway characteristics on the frequency and severity of head-on crashes: case studies from Malaysian federal roads.

    PubMed

    Hosseinpour, Mehdi; Yahaya, Ahmad Shukri; Sadullah, Ahmad Farhan

    2014-01-01

    Head-on crashes are among the most severe collision types and of great concern to road safety authorities, which justifies greater effort to reduce both the frequency and severity of this collision type. To this end, it is necessary to first identify the factors associated with crash occurrence. This can be done by developing crash prediction models that relate crash outcomes to a set of contributing factors. This study identifies the factors affecting both the frequency and severity of head-on crashes that occurred on 448 segments of five federal roads in Malaysia. Data on road characteristics and crash history were collected on the study segments during a 4-year period between 2007 and 2010. The frequency of head-on crashes was modeled by developing and comparing seven count-data models: Poisson, standard negative binomial (NB), random-effect negative binomial, hurdle Poisson, hurdle negative binomial, zero-inflated Poisson, and zero-inflated negative binomial. To model crash severity given that a head-on crash had occurred, a random-effect generalized ordered probit model (REGOPM) was used. With respect to crash frequency, the random-effect negative binomial (RENB) model was found to outperform the other models according to goodness-of-fit measures. Based on the results of the model, horizontal curvature, terrain type, heavy-vehicle traffic, and access points were found to be positively related to the frequency of head-on crashes, while posted speed limit and shoulder width decreased the crash frequency. With regard to crash severity, the results of the REGOPM showed that horizontal curvature, paved shoulder width, terrain type, and side friction were associated with more severe crashes, whereas land use, access points, and presence of a median reduced the probability of severe crashes. Based on the results of this study, some potential countermeasures were proposed to minimize the risk of head-on crashes. Copyright © 2013 Elsevier Ltd. All rights reserved.
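
    A basic version of the count-model comparison can be sketched with statsmodels, fitting Poisson and standard negative binomial regressions to simulated segment data and comparing AIC; the covariates and data here are placeholders, and the random-effect, hurdle, and zero-inflated variants are not shown.

    ```python
    # Fit Poisson and negative binomial crash-frequency models and compare AIC;
    # the segment data are simulated, not the Malaysian federal-road data.
    import numpy as np
    import statsmodels.api as sm

    rng = np.random.default_rng(1)
    n = 448
    curvature = rng.uniform(0, 1, n)
    aadt = rng.uniform(1, 10, n)          # traffic exposure, arbitrary units
    mu = np.exp(-1.0 + 1.2 * curvature + 0.15 * aadt)
    crashes = rng.negative_binomial(n=2.0, p=2.0 / (2.0 + mu))  # overdispersed counts

    X = sm.add_constant(np.column_stack([curvature, aadt]))
    poisson_fit = sm.Poisson(crashes, X).fit(disp=False)
    nb_fit = sm.NegativeBinomial(crashes, X).fit(disp=False)

    print("Poisson AIC:", poisson_fit.aic)
    print("NB AIC:     ", nb_fit.aic)
    ```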

  8. Football goal distributions and extremal statistics

    NASA Astrophysics Data System (ADS)

    Greenhough, J.; Birch, P. C.; Chapman, S. C.; Rowlands, G.

    2002-12-01

    We analyse the distributions of the number of goals scored by home teams, away teams, and the total scored in the match, in domestic football games from 169 countries between 1999 and 2001. The probability density functions (PDFs) of goals scored are too heavy-tailed to be fitted over their entire ranges by Poisson or negative binomial distributions which would be expected for uncorrelated processes. Log-normal distributions cannot include zero scores and here we find that the PDFs are consistent with those arising from extremal statistics. In addition, we show that it is sufficient to model English top division and FA Cup matches in the seasons of 1970/71-2000/01 on Poisson or negative binomial distributions, as reported in analyses of earlier seasons, and that these are not consistent with extremal statistics.

  9. Selecting the right statistical model for analysis of insect count data by using information theoretic measures.

    PubMed

    Sileshi, G

    2006-10-01

    Researchers and regulatory agencies often make statistical inferences from insect count data using modelling approaches that assume homogeneous variance. Such models do not allow for formal appraisal of variability, which in its different forms is the subject of interest in ecology. Therefore, the objectives of this paper were to (i) compare models suitable for handling variance heterogeneity and (ii) select optimal models to ensure valid statistical inferences from insect count data. The log-normal, standard Poisson, Poisson corrected for overdispersion, zero-inflated Poisson, negative binomial and zero-inflated negative binomial models were compared using six count datasets on foliage-dwelling insects and five families of soil-dwelling insects. Akaike's and Schwarz Bayesian information criteria were used for comparing the various models. Over 50% of the counts were zeros even in locally abundant species such as Ootheca bennigseni Weise, Mesoplatys ochroptera Stål and Diaecoderus spp. The Poisson model after correction for overdispersion and the standard negative binomial model provided a better description of the probability distribution for seven of the 11 insects than the log-normal, standard Poisson, zero-inflated Poisson or zero-inflated negative binomial models. It is concluded that excess zeros and variance heterogeneity are common phenomena in insect count data. If not properly modelled, these properties can invalidate normal-distribution assumptions, resulting in biased estimation of ecological effects and jeopardizing the integrity of the scientific inferences. Therefore, it is recommended that statistical models appropriate for handling these data properties be selected using objective criteria to ensure efficient statistical inference.

  10. Mental health status and healthcare utilization among community dwelling older adults.

    PubMed

    Adepoju, Omolola; Lin, Szu-Hsuan; Mileski, Michael; Kruse, Clemens Scott; Mask, Andrew

    2018-04-27

    Shifts in mental health utilization patterns are necessary to allow for meaningful access to care for vulnerable populations. There have been long-standing issues in how mental health care is provided, which has limited how efficacious that care is for those seeking it. The objective was to assess the relationship between mental health status and healthcare utilization among adults ≥65 years. A negative binomial regression model was used to assess the relationship between mental health status and healthcare utilization related to office-based physician visits, while a two-part model, consisting of logistic regression and negative binomial regression, was used to separately model emergency visits and inpatient services. The receipt of care in office-based settings was marginally higher for subjects with mental health difficulties. Both the probabilities and counts of inpatient hospitalizations were similar across mental health categories. The count of ER visits was similar across mental health categories; however, the probability of having an emergency department visit was marginally higher for older adults who reported mental health difficulties in 2012. These findings are encouraging and lend promise to recent initiatives addressing gaps in mental healthcare services.
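
    The two-part structure described here (a logistic model for any use, then a count model for utilization among users) can be sketched with statsmodels on simulated data; the variable names and effect sizes are placeholders, not estimates from the survey.

    ```python
    # Two-part model: logistic regression for any ER use, negative binomial
    # regression for the number of visits among users; data are simulated.
    import numpy as np
    import statsmodels.api as sm

    rng = np.random.default_rng(7)
    n = 500
    mh_difficulty = rng.integers(0, 2, n)            # 1 = reports mental health difficulties
    age = rng.uniform(65, 90, n)

    X = sm.add_constant(np.column_stack([mh_difficulty, age]))

    # Part 1: probability of any emergency department visit.
    p_any = 1 / (1 + np.exp(-(-2.0 + 0.4 * mh_difficulty + 0.01 * age)))
    any_visit = rng.binomial(1, p_any)
    logit_fit = sm.Logit(any_visit, X).fit(disp=False)

    # Part 2: number of visits, modeled only for those with at least one visit.
    mu = np.exp(0.2 + 0.1 * mh_difficulty)
    visits = 1 + rng.poisson(mu)                     # crude positive counts for illustration
    users = any_visit == 1
    nb_fit = sm.NegativeBinomial(visits[users], X[users]).fit(disp=False)

    print(logit_fit.params)
    print(nb_fit.params)
    ```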

  11. Modeling the distribution of colonial species to improve estimation of plankton concentration in ballast water

    NASA Astrophysics Data System (ADS)

    Rajakaruna, Harshana; VandenByllaardt, Julie; Kydd, Jocelyn; Bailey, Sarah

    2018-03-01

    The International Maritime Organization (IMO) has set limits on allowable plankton concentrations in ballast water discharge to minimize aquatic invasions globally. Previous guidance on ballast water sampling and compliance decision thresholds was based on the assumption that probability distributions of plankton are Poisson when spatially homogeneous, or negative binomial when heterogeneous. We propose a hierarchical probability model, which incorporates distributions at the level of particles (i.e., discrete individuals plus colonies per unit volume) and also within particles (i.e., individuals per particle), to estimate the average plankton concentration in ballast water. We examined the performance of the models using data for plankton in the size class ≥ 10 μm and < 50 μm, collected from five different depths of a ballast tank of a commercial ship in three independent surveys. We show that the data fit the negative binomial and the hierarchical probability models equally well, with both models performing better than the Poisson model at the scale of our sampling. The hierarchical probability model, which accounts for both the individuals and the colonies in a sample, reduces the uncertainty associated with concentration estimation and improves the power to reject a ship's compliance when the ship does not truly comply with the standard. We show examples of how to test ballast water compliance using the above models.

  12. Reliability of environmental sampling culture results using the negative binomial intraclass correlation coefficient.

    PubMed

    Aly, Sharif S; Zhao, Jianyang; Li, Ben; Jiang, Jiming

    2014-01-01

    The Intraclass Correlation Coefficient (ICC) is commonly used to estimate the similarity between quantitative measures obtained from different sources. Overdispersed data are traditionally transformed so that a linear mixed model (LMM)-based ICC can be estimated; a common transformation is the natural logarithm. The reliability of environmental sampling of fecal slurry on freestall pens has been estimated for Mycobacterium avium subsp. paratuberculosis using natural-logarithm-transformed culture results. Recently, the negative binomial ICC was defined based on a generalized linear mixed model for negative binomial distributed data. The current study reports on the negative binomial ICC estimate, which includes fixed effects, using culture results of environmental samples. Simulations using a wide variety of inputs and negative binomial distribution parameters (r; p) showed better performance of the new negative binomial ICC compared to the LMM-based ICC, even when the negative binomial data were logarithm- or square-root-transformed. A second comparison that targeted a wider range of ICC values showed that the mean of the estimated ICC closely approximated the true ICC.

  13. Identifiability in N-mixture models: a large-scale screening test with bird data.

    PubMed

    Kéry, Marc

    2018-02-01

    Binomial N-mixture models have proven very useful in ecology, conservation, and monitoring: they allow estimation and modeling of abundance separately from detection probability using simple counts. Recently, doubts about parameter identifiability have been voiced. I conducted a large-scale screening test with 137 bird data sets from 2,037 sites. I found virtually no identifiability problems for Poisson and zero-inflated Poisson (ZIP) binomial N-mixture models, but negative-binomial (NB) models had problems in 25% of all data sets. The corresponding multinomial N-mixture models had no problems. Parameter estimates under Poisson and ZIP binomial and multinomial N-mixture models were extremely similar. Identifiability problems became a little more frequent with smaller sample sizes (267 and 50 sites), but were unaffected by whether the models did or did not include covariates. Hence, binomial N-mixture model parameters with Poisson and ZIP mixtures typically appeared identifiable. In contrast, NB mixtures were often unidentifiable, which is worrying since these were often selected by Akaike's information criterion. Identifiability of binomial N-mixture models should always be checked. If problems are found, simpler models, integrated models that combine different observation models or the use of external information via informative priors or penalized likelihoods, may help. © 2017 by the Ecological Society of America.

  14. Estimating abundance while accounting for rarity, correlated behavior, and other sources of variation in counts

    USGS Publications Warehouse

    Dorazio, Robert M.; Martin, Julien; Edwards, Holly H.

    2013-01-01

    The class of N-mixture models allows abundance to be estimated from repeated, point count surveys while adjusting for imperfect detection of individuals. We developed an extension of N-mixture models to account for two commonly observed phenomena in point count surveys: rarity and lack of independence induced by unmeasurable sources of variation in the detectability of individuals. Rarity increases the number of locations with zero detections in excess of those expected under simple models of abundance (e.g., Poisson or negative binomial). Correlated behavior of individuals and other phenomena, though difficult to measure, increases the variation in detection probabilities among surveys. Our extension of N-mixture models includes a hurdle model of abundance and a beta-binomial model of detectability that accounts for additional (extra-binomial) sources of variation in detections among surveys. As an illustration, we fit this model to repeated point counts of the West Indian manatee, which was observed in a pilot study using aerial surveys. Our extension of N-mixture models provides increased flexibility. The effects of different sets of covariates may be estimated for the probability of occurrence of a species, for its mean abundance at occupied locations, and for its detectability.

  15. Estimating abundance while accounting for rarity, correlated behavior, and other sources of variation in counts.

    PubMed

    Dorazio, Robert M; Martin, Julien; Edwards, Holly H

    2013-07-01

    The class of N-mixture models allows abundance to be estimated from repeated, point count surveys while adjusting for imperfect detection of individuals. We developed an extension of N-mixture models to account for two commonly observed phenomena in point count surveys: rarity and lack of independence induced by unmeasurable sources of variation in the detectability of individuals. Rarity increases the number of locations with zero detections in excess of those expected under simple models of abundance (e.g., Poisson or negative binomial). Correlated behavior of individuals and other phenomena, though difficult to measure, increases the variation in detection probabilities among surveys. Our extension of N-mixture models includes a hurdle model of abundance and a beta-binomial model of detectability that accounts for additional (extra-binomial) sources of variation in detections among surveys. As an illustration, we fit this model to repeated point counts of the West Indian manatee, which was observed in a pilot study using aerial surveys. Our extension of N-mixture models provides increased flexibility. The effects of different sets of covariates may be estimated for the probability of occurrence of a species, for its mean abundance at occupied locations, and for its detectability.
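
    The beta-binomial detection component, which allows extra-binomial variation in detection probability across surveys, is available directly in scipy; a small sketch comparing binomial and beta-binomial detection probabilities for a site with N individuals (the numbers are illustrative).

    ```python
    # Beta-binomial vs. binomial detection: same mean detection probability,
    # but the beta-binomial allows extra variation across surveys. Values are illustrative.
    from scipy import stats

    N = 20                      # individuals present at a site
    p_mean = 0.3                # mean detection probability
    a, b = 3.0, 7.0             # beta parameters with mean a / (a + b) = 0.3

    k = 6                       # detections on one survey
    print(stats.binom.pmf(k, N, p_mean))       # binomial detection model
    print(stats.betabinom.pmf(k, N, a, b))     # beta-binomial (overdispersed) detection model
    ```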

  16. Distinguishing between Binomial, Hypergeometric and Negative Binomial Distributions

    ERIC Educational Resources Information Center

    Wroughton, Jacqueline; Cole, Tarah

    2013-01-01

    Recognizing the differences between three discrete distributions (Binomial, Hypergeometric and Negative Binomial) can be challenging for students. We present an activity designed to help students differentiate among these distributions. In addition, we present assessment results in the form of pre- and post-tests that were designed to assess the…
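
    The distinction can be shown numerically with scipy: the same notion of a "success", but three different sampling schemes. The numbers below are arbitrary classroom-style values.

    ```python
    # Binomial: successes in n fixed trials with replacement.
    # Hypergeometric: successes in draws without replacement from a finite population.
    # Negative binomial: failures observed before the r-th success.
    from scipy import stats

    n, p = 10, 0.3
    print(stats.binom.pmf(3, n, p))             # P(3 successes in 10 trials)

    M, K, draws = 50, 15, 10                    # population 50, of which 15 are successes
    print(stats.hypergeom.pmf(3, M, K, draws))  # P(3 successes in 10 draws w/o replacement)

    r = 3
    print(stats.nbinom.pmf(7, r, p))            # P(7 failures before the 3rd success)
    ```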

  17. CUMBIN - CUMULATIVE BINOMIAL PROGRAMS

    NASA Technical Reports Server (NTRS)

    Bowerman, P. N.

    1994-01-01

    The cumulative binomial program, CUMBIN, is one of a set of three programs which calculate cumulative binomial probability distributions for arbitrary inputs. The three programs, CUMBIN, NEWTONP (NPO-17556), and CROSSER (NPO-17557), can be used independently of one another. CUMBIN can be used by statisticians and users of statistical procedures, test planners, designers, and numerical analysts. The program has been used for reliability/availability calculations. CUMBIN calculates the probability that a system of n components has at least k operating if the probability that any one is operating is p and the components are independent. Equivalently, this is the reliability of a k-out-of-n system having independent components with common reliability p. CUMBIN can evaluate the incomplete beta distribution for two positive integer arguments. CUMBIN can also evaluate the cumulative F distribution and the negative binomial distribution, and can determine the sample size in a test design. CUMBIN is designed to work well with all integer values 0 < k <= n. To run the program, the user simply runs the executable version and inputs the information requested by the program. The program is not designed to weed out incorrect inputs, so the user must take care to make sure the inputs are correct. Once all input has been entered, the program calculates and lists the result. The CUMBIN program is written in C. It was developed on an IBM AT with a numeric co-processor using Microsoft C 5.0. Because the source code is written using standard C structures and functions, it should compile correctly with most C compilers. The program format is interactive. It has been implemented under DOS 3.2 and has a memory requirement of 26K. CUMBIN was developed in 1988.
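
    The core CUMBIN calculation, the reliability of a k-out-of-n system of independent components with common reliability p, is a cumulative binomial tail and can be cross-checked against the regularized incomplete beta function; the sketch below is a re-implementation in Python, not the CUMBIN C source.

    ```python
    # Reliability of a k-out-of-n system: probability that at least k of n
    # independent components (each with reliability p) are operating.
    from scipy import stats, special

    n, k, p = 10, 8, 0.95

    # Cumulative binomial tail: P(X >= k) with X ~ Binomial(n, p).
    rel_tail = stats.binom.sf(k - 1, n, p)

    # Same quantity via the regularized incomplete beta function.
    rel_beta = special.betainc(k, n - k + 1, p)

    print(rel_tail, rel_beta)  # both ~0.9885 for these values
    ```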

  18. Application of a hurdle negative binomial count data model to demand for bass fishing in the southeastern United States.

    PubMed

    Bilgic, Abdulbaki; Florkowski, Wojciech J

    2007-06-01

    This paper identifies factors that influence the demand for a bass fishing trip taken in the southeastern United States using a hurdle negative binomial count data model. The probability of fishing for a bass is estimated in the first stage and the fishing trip frequency is estimated in the second stage for individuals reporting bass fishing trips in the Southeast. The applied approach allows the decomposition of the effects of factors responsible for the decision to take a trip and the trip number. Calculated partial and total elasticities indicate a highly inelastic demand for the number of fishing trips as trip costs increase. However, the demand can be expected to increase if anglers experience a success measured by the number of caught fish or their size. Benefit estimates based on alternative estimation methods differ substantially, suggesting the need for testing each modeling approach applied in empirical studies.

  19. Meta-analysis of studies with bivariate binary outcomes: a marginal beta-binomial model approach

    PubMed Central

    Chen, Yong; Hong, Chuan; Ning, Yang; Su, Xiao

    2018-01-01

    When conducting a meta-analysis of studies with bivariate binary outcomes, challenges arise when the within-study correlation and between-study heterogeneity should be taken into account. In this paper, we propose a marginal beta-binomial model for the meta-analysis of studies with binary outcomes. This model is based on the composite likelihood approach, and has several attractive features compared to the existing models such as bivariate generalized linear mixed model (Chu and Cole, 2006) and Sarmanov beta-binomial model (Chen et al., 2012). The advantages of the proposed marginal model include modeling the probabilities in the original scale, not requiring any transformation of probabilities or any link function, having closed-form expression of likelihood function, and no constraints on the correlation parameter. More importantly, since the marginal beta-binomial model is only based on the marginal distributions, it does not suffer from potential misspecification of the joint distribution of bivariate study-specific probabilities. Such misspecification is difficult to detect and can lead to biased inference using current methods. We compare the performance of the marginal beta-binomial model with the bivariate generalized linear mixed model and the Sarmanov beta-binomial model by simulation studies. Interestingly, the results show that the marginal beta-binomial model performs better than the Sarmanov beta-binomial model, whether or not the true model is Sarmanov beta-binomial, and the marginal beta-binomial model is more robust than the bivariate generalized linear mixed model under model misspecifications. Two meta-analyses of diagnostic accuracy studies and a meta-analysis of case-control studies are conducted for illustration. PMID:26303591

  20. Censored Hurdle Negative Binomial Regression (Case Study: Neonatorum Tetanus Case in Indonesia)

    NASA Astrophysics Data System (ADS)

    Yuli Rusdiana, Riza; Zain, Ismaini; Wulan Purnami, Santi

    2017-06-01

    Hurdle negative binomial regression is a method that can be used for a discrete dependent variable with excess zeros and under- or overdispersion. It uses a two-part approach. The first part, the zero hurdle model, estimates the zero elements of the dependent variable, and the second part, called the truncated negative binomial model, estimates the nonzero elements (non-negative integers). The discrete dependent variable in such cases is censored for some values; the type of censoring studied in this research is right censoring. This study aims to obtain the parameter estimators of hurdle negative binomial regression for a right-censored dependent variable. Parameters are estimated by maximum likelihood estimation (MLE). The hurdle negative binomial regression model for a right-censored dependent variable is applied to the number of tetanus neonatorum cases in Indonesia. The data are count data containing zero values in some observations and various other values elsewhere. This study also aims to obtain the parameter estimators and a test statistic for the censored hurdle negative binomial model. Based on the regression results, the factors that influence tetanus neonatorum cases in Indonesia are the percentage of baby health care coverage and neonatal visits.

  1. Meta-analysis of studies with bivariate binary outcomes: a marginal beta-binomial model approach.

    PubMed

    Chen, Yong; Hong, Chuan; Ning, Yang; Su, Xiao

    2016-01-15

    When conducting a meta-analysis of studies with bivariate binary outcomes, challenges arise when the within-study correlation and between-study heterogeneity should be taken into account. In this paper, we propose a marginal beta-binomial model for the meta-analysis of studies with binary outcomes. This model is based on the composite likelihood approach and has several attractive features compared with the existing models such as bivariate generalized linear mixed model (Chu and Cole, 2006) and Sarmanov beta-binomial model (Chen et al., 2012). The advantages of the proposed marginal model include modeling the probabilities in the original scale, not requiring any transformation of probabilities or any link function, having closed-form expression of likelihood function, and no constraints on the correlation parameter. More importantly, because the marginal beta-binomial model is only based on the marginal distributions, it does not suffer from potential misspecification of the joint distribution of bivariate study-specific probabilities. Such misspecification is difficult to detect and can lead to biased inference using current methods. We compare the performance of the marginal beta-binomial model with the bivariate generalized linear mixed model and the Sarmanov beta-binomial model by simulation studies. Interestingly, the results show that the marginal beta-binomial model performs better than the Sarmanov beta-binomial model, whether or not the true model is Sarmanov beta-binomial, and the marginal beta-binomial model is more robust than the bivariate generalized linear mixed model under model misspecifications. Two meta-analyses of diagnostic accuracy studies and a meta-analysis of case-control studies are conducted for illustration. Copyright © 2015 John Wiley & Sons, Ltd.

  2. Choosing a Transformation in Analyses of Insect Counts from Contagious Distributions with Low Means

    Treesearch

    W.D. Pepper; S.J. Zarnoch; G.L. DeBarr; P. de Groot; C.D. Tangren

    1997-01-01

    Guidelines based on computer simulation are suggested for choosing a transformation of insect counts from negative binomial distributions with low mean counts and high levels of contagion. Typical values and ranges of negative binomial model parameters were determined by fitting the model to data from 19 entomological field studies. Random sampling of negative binomial...

  3. Binomial leap methods for simulating stochastic chemical kinetics.

    PubMed

    Tian, Tianhai; Burrage, Kevin

    2004-12-01

    This paper discusses efficient simulation methods for stochastic chemical kinetics. Based on the tau-leap and midpoint tau-leap methods of Gillespie [D. T. Gillespie, J. Chem. Phys. 115, 1716 (2001)], binomial random variables are used in these leap methods rather than Poisson random variables. The motivation for this approach is to improve the efficiency of the Poisson leap methods by using larger stepsizes. Unlike Poisson random variables, whose range of sample values is from zero to infinity, binomial random variables have a finite range of sample values. This probabilistic property has been used to restrict possible reaction numbers and to avoid negative molecular numbers in stochastic simulations when a larger stepsize is used. In this approach a binomial random variable is defined for a single reaction channel in order to keep the reaction number of this channel below the number of molecules that undergo this reaction channel. A sampling technique is also designed for the total reaction number of a reactant species that undergoes two or more reaction channels. Samples for the total reaction number are not greater than the molecular number of this species. In addition, probability properties of the binomial random variables provide stepsize conditions for restricting reaction numbers in a chosen time interval. These stepsize conditions are important properties of robust leap control strategies. Numerical results indicate that the proposed binomial leap methods can be applied to a wide range of chemical reaction systems with very good accuracy and significant improvement in efficiency over existing approaches. (c) 2004 American Institute of Physics.
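
    For a single reaction channel the key idea, drawing a binomial rather than a Poisson number of firings so the count can never exceed the available molecules, can be sketched for a simple decay reaction S → ∅; this is a simplified illustration, not the full multi-channel algorithm of the paper.

    ```python
    # Binomial tau-leap for a single decay reaction S -> 0 with rate constant c.
    # The number of firings in a leap of length tau is drawn from Binomial(n, c*tau),
    # so it can never exceed the current molecule count n. A simplified sketch.
    import numpy as np

    rng = np.random.default_rng(0)

    def binomial_leap_decay(n0, c, tau, t_end):
        t, n = 0.0, n0
        trajectory = [(t, n)]
        while t < t_end and n > 0:
            prob = min(1.0, c * tau)          # firing probability per molecule in one leap
            k = rng.binomial(n, prob)         # number of reactions in this leap (<= n)
            n -= k
            t += tau
            trajectory.append((t, n))
        return trajectory

    traj = binomial_leap_decay(n0=1000, c=0.1, tau=0.05, t_end=5.0)
    print(traj[-1])   # molecules remaining at the end of the simulation
    ```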

  4. Statistical inference involving binomial and negative binomial parameters.

    PubMed

    García-Pérez, Miguel A; Núñez-Antón, Vicente

    2009-05-01

    Statistical inference about two binomial parameters implies that they are both estimated by binomial sampling. There are occasions in which one aims at testing the equality of two binomial parameters before and after the occurrence of the first success along a sequence of Bernoulli trials. In these cases, the binomial parameter before the first success is estimated by negative binomial sampling whereas that after the first success is estimated by binomial sampling, and both estimates are related. This paper derives statistical tools to test two hypotheses, namely, that both binomial parameters equal some specified value and that both parameters are equal though unknown. Simulation studies are used to show that in small samples both tests are accurate in keeping the nominal Type-I error rates, and also to determine sample size requirements to detect large, medium, and small effects with adequate power. Additional simulations also show that the tests are sufficiently robust to certain violations of their assumptions.

  5. Negative Binomial Process Count and Mixture Modeling.

    PubMed

    Zhou, Mingyuan; Carin, Lawrence

    2015-02-01

    The seemingly disjoint problems of count and mixture modeling are united under the negative binomial (NB) process. A gamma process is employed to model the rate measure of a Poisson process, whose normalization provides a random probability measure for mixture modeling and whose marginalization leads to an NB process for count modeling. A draw from the NB process consists of a Poisson distributed finite number of distinct atoms, each of which is associated with a logarithmic distributed number of data samples. We reveal relationships between various count- and mixture-modeling distributions and construct a Poisson-logarithmic bivariate distribution that connects the NB and Chinese restaurant table distributions. Fundamental properties of the models are developed, and we derive efficient Bayesian inference. It is shown that with augmentation and normalization, the NB process and gamma-NB process can be reduced to the Dirichlet process and hierarchical Dirichlet process, respectively. These relationships highlight theoretical, structural, and computational advantages of the NB process. A variety of NB processes, including the beta-geometric, beta-NB, marked-beta-NB, marked-gamma-NB and zero-inflated-NB processes, with distinct sharing mechanisms, are also constructed. These models are applied to topic modeling, with connections made to existing algorithms under Poisson factor analysis. Example results show the importance of inferring both the NB dispersion and probability parameters.

  6. Variability in results from negative binomial models for Lyme disease measured at different spatial scales.

    PubMed

    Tran, Phoebe; Waller, Lance

    2015-01-01

    Lyme disease has been the subject of many studies due to increasing incidence rates year after year and the severe complications that can arise in later stages of the disease. Negative binomial models have been used to model Lyme disease in the past with some success. However, there has been little focus on the reliability and consistency of these models when they are used to study Lyme disease at multiple spatial scales. This study seeks to explore how sensitive/consistent negative binomial models are when they are used to study Lyme disease at different spatial scales (at the regional and sub-regional levels). The study area includes the thirteen states in the Northeastern United States with the highest Lyme disease incidence during the 2002-2006 period. Lyme disease incidence at county level for the period of 2002-2006 was linked with several previously identified key landscape and climatic variables in a negative binomial regression model for the Northeastern region and two smaller sub-regions (the New England sub-region and the Mid-Atlantic sub-region). This study found that negative binomial models, indeed, were sensitive/inconsistent when used at different spatial scales. We discuss various plausible explanations for such behavior of negative binomial models. Further investigation of the inconsistency and sensitivity of negative binomial models when used at different spatial scales is important for not only future Lyme disease studies and Lyme disease risk assessment/management but any study that requires use of this model type in a spatial context. Copyright © 2014 Elsevier Inc. All rights reserved.

  7. Marginalized zero-inflated negative binomial regression with application to dental caries

    PubMed Central

    Preisser, John S.; Das, Kalyan; Long, D. Leann; Divaris, Kimon

    2015-01-01

    The zero-inflated negative binomial regression model (ZINB) is often employed in diverse fields such as dentistry, health care utilization, highway safety, and medicine to examine relationships between exposures of interest and overdispersed count outcomes exhibiting many zeros. The regression coefficients of ZINB have latent class interpretations for a susceptible subpopulation at risk for the disease/condition under study with counts generated from a negative binomial distribution and for a non-susceptible subpopulation that provides only zero counts. The ZINB parameters, however, are not well-suited for estimating overall exposure effects, specifically, in quantifying the effect of an explanatory variable in the overall mixture population. In this paper, a marginalized zero-inflated negative binomial regression (MZINB) model for independent responses is proposed to model the population marginal mean count directly, providing straightforward inference for overall exposure effects based on maximum likelihood estimation. Through simulation studies, the finite sample performance of MZINB is compared to marginalized zero-inflated Poisson, Poisson, and negative binomial regression. The MZINB model is applied in the evaluation of a school-based fluoride mouthrinse program on dental caries in 677 children. PMID:26568034
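
    A standard (non-marginalized) zero-inflated negative binomial fit, the model that MZINB re-parameterizes, can be sketched with statsmodels' count models; the data below are simulated and the marginalized parameterization of the paper is not implemented here.

    ```python
    # Standard zero-inflated negative binomial fit with statsmodels; simulated data,
    # not the dental-caries data, and not the marginalized (MZINB) parameterization.
    import numpy as np
    import statsmodels.api as sm
    from statsmodels.discrete.count_model import ZeroInflatedNegativeBinomialP

    rng = np.random.default_rng(3)
    n = 600
    exposure = rng.binomial(1, 0.5, n)                     # e.g. a program indicator
    mu = np.exp(0.5 + 0.4 * exposure)
    structural_zero = rng.binomial(1, 0.35, n)             # non-susceptible subpopulation
    counts = np.where(structural_zero == 1, 0,
                      rng.negative_binomial(2.0, 2.0 / (2.0 + mu)))

    X = sm.add_constant(exposure)
    zinb = ZeroInflatedNegativeBinomialP(counts, X, exog_infl=X, p=2).fit(disp=False, maxiter=200)
    print(zinb.summary())
    ```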

  8. Zero-state Markov switching count-data models: an empirical assessment.

    PubMed

    Malyshkina, Nataliya V; Mannering, Fred L

    2010-01-01

    In this study, a two-state Markov switching count-data model is proposed as an alternative to zero-inflated models to account for the preponderance of zeros sometimes observed in transportation count data, such as the number of accidents occurring on a roadway segment over some period of time. For this accident-frequency case, zero-inflated models assume the existence of two states: one of the states is a zero-accident count state, which has accident probabilities that are so low that they cannot be statistically distinguished from zero, and the other state is a normal-count state, in which counts can be non-negative integers that are generated by some counting process, for example, a Poisson or negative binomial. While zero-inflated models have come under some criticism with regard to accident-frequency applications - one fact is undeniable - in many applications they provide a statistically superior fit to the data. The Markov switching approach we propose seeks to overcome some of the criticism associated with the zero-accident state of the zero-inflated model by allowing individual roadway segments to switch between zero and normal-count states over time. An important advantage of this Markov switching approach is that it allows for the direct statistical estimation of the specific roadway-segment state (i.e., zero-accident or normal-count state) whereas traditional zero-inflated models do not. To demonstrate the applicability of this approach, a two-state Markov switching negative binomial model (estimated with Bayesian inference) and standard zero-inflated negative binomial models are estimated using five-year accident frequencies on Indiana interstate highway segments. It is shown that the Markov switching model is a viable alternative and results in a superior statistical fit relative to the zero-inflated models.

  9. Modeling Tetanus Neonatorum case using the regression of negative binomial and zero-inflated negative binomial

    NASA Astrophysics Data System (ADS)

    Amaliana, Luthfatul; Sa'adah, Umu; Wayan Surya Wardhani, Ni

    2017-12-01

    Tetanus Neonatorum is an infectious disease that can be prevented by immunization. The number of Tetanus Neonatorum cases in East Java Province is the highest in Indonesia up to 2015. Tetanus Neonatorum data exhibit overdispersion and a large proportion of zeros. Negative Binomial (NB) regression is an alternative method when overdispersion occurs in Poisson regression. However, data containing overdispersion and zero-inflation are more appropriately analyzed using Zero-Inflated Negative Binomial (ZINB) regression. The purposes of this study are: (1) to model Tetanus Neonatorum cases in East Java Province, with a 71.05 percent proportion of zeros, by using NB and ZINB regression, and (2) to obtain the best model. The results of this study indicate that ZINB is better than NB regression, with a smaller AIC.

  10. Simulation on Poisson and negative binomial models of count road accident modeling

    NASA Astrophysics Data System (ADS)

    Sapuan, M. S.; Razali, A. M.; Zamzuri, Z. H.; Ibrahim, K.

    2016-11-01

    Accident count data have often been shown to exhibit overdispersion; on the other hand, the data might contain excess zero counts. A simulation study was conducted to create scenarios in which accidents happen at a T-junction, with the assumption that the dependent variable of the generated data follows a certain distribution, namely the Poisson or negative binomial distribution, with sample sizes from n = 30 to n = 500. The study objective was accomplished by fitting Poisson regression, negative binomial regression and hurdle negative binomial models to the simulated data. Model fit was compared, and the simulation results show that, for each sample size, not every model fits the data well even when the data are generated from its own distribution, especially when the sample size is larger. Furthermore, larger sample sizes produce more zero accident counts in the dataset.

  11. Analysis of generalized negative binomial distributions attached to hyperbolic Landau levels

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chhaiba, Hassan, E-mail: chhaiba.hassan@gmail.com; Demni, Nizar, E-mail: nizar.demni@univ-rennes1.fr; Mouayn, Zouhair, E-mail: mouayn@fstbm.ac.ma

    2016-07-15

    To each hyperbolic Landau level of the Poincaré disc is attached a generalized negative binomial distribution. In this paper, we compute the moment generating function of this distribution and supply its atomic decomposition as a perturbation of the negative binomial distribution by a finitely supported measure. Using the Mandel parameter, we also discuss the nonclassical nature of the associated coherent states. Next, we derive a Lévy-Khintchine-type representation of its characteristic function when the latter does not vanish and deduce that it is quasi-infinitely divisible except for the lowest hyperbolic Landau level corresponding to the negative binomial distribution. By considering the total variation of the obtained quasi-Lévy measure, we introduce a new infinitely divisible distribution for which we derive the characteristic function.

  12. Library Book Circulation and the Beta-Binomial Distribution.

    ERIC Educational Resources Information Center

    Gelman, E.; Sichel, H. S.

    1987-01-01

    Argues that library book circulation is a binomial rather than a Poisson process, and that individual book popularities are continuous beta distributions. Three examples demonstrate the superiority of beta over negative binomial distribution, and it is suggested that a bivariate-binomial process would be helpful in predicting future book…

  13. Comparison of multiplicity distributions to the negative binomial distribution in muon-proton scattering

    NASA Astrophysics Data System (ADS)

    Arneodo, M.; Arvidson, A.; Aubert, J. J.; Badełek, B.; Beaufays, J.; Bee, C. P.; Benchouk, C.; Berghoff, G.; Bird, I.; Blum, D.; Böhm, E.; de Bouard, X.; Brasse, F. W.; Braun, H.; Broll, C.; Brown, S.; Brück, H.; Calen, H.; Chima, J. S.; Ciborowski, J.; Clifft, R.; Coignet, G.; Combley, F.; Coughlan, J.; D'Agostini, G.; Dahlgren, S.; Dengler, F.; Derado, I.; Dreyer, T.; Drees, J.; Düren, M.; Eckardt, V.; Edwards, A.; Edwards, M.; Ernst, T.; Eszes, G.; Favier, J.; Ferrero, M. I.; Figiel, J.; Flauger, W.; Foster, J.; Ftáčnik, J.; Gabathuler, E.; Gajewski, J.; Gamet, R.; Gayler, J.; Geddes, N.; Grafström, P.; Grard, F.; Haas, J.; Hagberg, E.; Hasert, F. J.; Hayman, P.; Heusse, P.; Jaffré, M.; Jachołkowska, A.; Janata, F.; Jancsó, G.; Johnson, A. S.; Kabuss, E. M.; Kellner, G.; Korbel, V.; Krüger, J.; Kullander, S.; Landgraf, U.; Lanske, D.; Loken, J.; Long, K.; Maire, M.; Malecki, P.; Manz, A.; Maselli, S.; Mohr, W.; Montanet, F.; Montgomery, H. E.; Nagy, E.; Nassalski, J.; Norton, P. R.; Oakham, F. G.; Osborne, A. M.; Pascaud, C.; Pawlik, B.; Payre, P.; Peroni, C.; Peschel, H.; Pessard, H.; Pettinghale, J.; Pietrzyk, B.; Pietrzyk, U.; Pönsgen, B.; Pötsch, M.; Renton, P.; Ribarics, P.; Rith, K.; Rondio, E.; Sandacz, A.; Scheer, M.; Schlagböhmer, A.; Schiemann, H.; Schmitz, N.; Schneegans, M.; Schneider, A.; Scholz, M.; Schröder, T.; Schultze, K.; Sloan, T.; Stier, H. E.; Studt, M.; Taylor, G. N.; Thénard, J. M.; Thompson, J. C.; de La Torre, A.; Toth, J.; Urban, L.; Urban, L.; Wallucks, W.; Whalley, M.; Wheeler, S.; Williams, W. S. C.; Wimpenny, S. J.; Windmolders, R.; Wolf, G.

    1987-09-01

    The multiplicity distributions of charged hadrons produced in deep inelastic muon-proton scattering at 280 GeV are analysed in various rapidity intervals, as a function of the total hadronic centre of mass energy W ranging from 4 to 20 GeV. Multiplicity distributions for the backward and forward hemispheres are also analysed separately. The data can be well parameterized by negative binomial distributions, extending their range of applicability to the case of lepton-proton scattering. The energy and rapidity dependence of the parameters is presented, and a smooth transition from the negative binomial distribution via Poissonian to the ordinary binomial is observed.

  14. Void probability as a function of the void's shape and scale-invariant models

    NASA Technical Reports Server (NTRS)

    Elizalde, E.; Gaztanaga, E.

    1991-01-01

    The dependence of counts in cells on the shape of the cell for the large-scale galaxy distribution is studied. A very concrete prediction can be made concerning the void distribution for scale-invariant models. The prediction is tested on a sample of the CfA catalog, and good agreement is found. It is observed that the probability of a cell being occupied is higher for some elongated cells. A phenomenological scale-invariant model for the observed distribution of the counts in cells, an extension of the negative binomial distribution, is presented in order to illustrate how this dependence can be quantitatively determined. An original, intuitive derivation of this model is presented.

  15. Regular exercise and related factors in patients with Parkinson's disease: Applying zero-inflated negative binomial modeling of exercise count data.

    PubMed

    Lee, JuHee; Park, Chang Gi; Choi, Moonki

    2016-05-01

    This study was conducted to identify risk factors that influence regular exercise among patients with Parkinson's disease in Korea. Parkinson's disease is prevalent in the elderly, and may lead to a sedentary lifestyle. Exercise can enhance physical and psychological health. However, patients with Parkinson's disease are less likely to exercise than are other populations due to physical disability. A secondary data analysis and cross-sectional descriptive study were conducted. A convenience sample of 106 patients with Parkinson's disease was recruited at an outpatient neurology clinic of a tertiary hospital in Korea. Demographic characteristics, disease-related characteristics (including disease duration and motor symptoms), self-efficacy for exercise, balance, and exercise level were investigated. Negative binomial regression and zero-inflated negative binomial regression for exercise count data were utilized to determine factors involved in exercise. The mean age of participants was 65.85 ± 8.77 years, and the mean duration of Parkinson's disease was 7.23 ± 6.02 years. Most participants indicated that they engaged in regular exercise (80.19%). Approximately half of participants exercised at least 5 days per week for 30 min, as recommended (51.9%). Motor symptoms were a significant predictor of exercise in the count model, and self-efficacy for exercise was a significant predictor of exercise in the zero model. Severity of motor symptoms was related to frequency of exercise. Self-efficacy contributed to the probability of exercise. Symptom management and improvement of self-efficacy for exercise are important to encourage regular exercise in patients with Parkinson's disease. Copyright © 2015 Elsevier Inc. All rights reserved.

  16. Sampling--how big a sample?

    PubMed

    Aitken, C G

    1999-07-01

    It is thought that, in a consignment of discrete units, a certain proportion of the units contain illegal material. A sample of the consignment is to be inspected. Various methods for the determination of the sample size are compared. The consignment will be considered as a random sample from some super-population of units, a certain proportion of which contain drugs. For large consignments, a probability distribution, known as the beta distribution, for the proportion of the consignment which contains illegal material is obtained. This distribution is based on prior beliefs about the proportion. Under certain specific conditions the beta distribution gives the same numerical results as an approach based on the binomial distribution. The binomial distribution provides a probability for the number of units in a sample which contain illegal material, conditional on knowing the proportion of the consignment which contains illegal material. This is in contrast to the beta distribution which provides probabilities for the proportion of a consignment which contains illegal material, conditional on knowing the number of units in the sample which contain illegal material. The interpretation when the beta distribution is used is much more intuitively satisfactory. It is also much more flexible in its ability to cater for prior beliefs which may vary given the different circumstances of different crimes. For small consignments, a distribution, known as the beta-binomial distribution, for the number of units in the consignment which are found to contain illegal material, is obtained, based on prior beliefs about the number of units in the consignment which are thought to contain illegal material. As with the beta and binomial distributions for large samples, it is shown that, in certain specific conditions, the beta-binomial and hypergeometric distributions give the same numerical results. However, the beta-binomial distribution, as with the beta distribution, has a more intuitively satisfactory interpretation and greater flexibility. The beta and the beta-binomial distributions provide methods for the determination of the minimum sample size to be taken from a consignment in order to satisfy a certain criterion. The criterion requires the specification of a proportion and a probability.
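
    The large-consignment case reduces to a beta posterior for the proportion. A minimal sketch, assuming a uniform Beta(1,1) prior (one of several prior choices the paper discusses): find the smallest sample size such that, if every sampled unit contains drugs, the posterior probability that more than a given proportion of the consignment contains drugs reaches a target level.

    ```python
    # Smallest sample size n such that, if all n sampled units contain drugs,
    # P(proportion theta > theta0) >= target under a Beta(1, 1) prior on theta.
    from scipy import stats

    theta0 = 0.5      # proportion of the consignment we want to exceed
    target = 0.99     # required posterior probability

    n = 1
    while stats.beta.sf(theta0, 1 + n, 1) < target:   # posterior is Beta(1 + n, 1) after n "positives"
        n += 1
    print(n)   # e.g. 6 for theta0 = 0.5, target = 0.99
    ```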

  17. C-5A Cargo Deck Low-Frequency Vibration Environment

    DTIC Science & Technology

    1975-02-01

    [OCR fragments of the report's front matter and table of contents: sample vibration calculations for the normal and binomial distributions; conclusions; references; a sample calculation for the binomial distribution (vertical acceleration, right rear cargo deck).] From the introduction: measured acceleration peaks through the end of taxi could be used directly to compile the probability of occurrence of specific values of acceleration using the binomial distribution.

  18. [Application of negative binomial regression and modified Poisson regression in the research of risk factors for injury frequency].

    PubMed

    Cao, Qingqing; Wu, Zhenqiang; Sun, Ying; Wang, Tiezhu; Han, Tengwei; Gu, Chaomei; Sun, Yehuan

    2011-11-01

    To explore the application of negative binomial regression and modified Poisson regression in analyzing the factors influencing injury frequency and the risk factors leading to increased injury frequency. A total of 2917 primary and secondary school students were selected from Hefei by a cluster random sampling method and surveyed by questionnaire. The count data on injury events were used to fit modified Poisson regression and negative binomial regression models. The risk factors associated with increased unintentional injury frequency among juvenile students were explored, so as to examine the efficiency of these two models in studying the factors influencing injury frequency. The Poisson model showed over-dispersion (P < 0.0001) according to the Lagrange multiplier test; therefore, the over-dispersed data were fitted with modified Poisson regression and negative binomial regression models, both of which fitted better. Both showed that male gender, younger age, a father working away from the hometown, a guardian's education level above junior high school, and smoking might be associated with higher injury frequencies. For clustered frequency data on injury events, both modified Poisson regression analysis and negative binomial regression analysis can be used; however, based on our data, the modified Poisson regression fitted better, and this model could give a more accurate interpretation of the relevant factors affecting injury frequency.
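
    A hedged sketch of the two approaches on simulated injury counts: a negative binomial regression, and a "modified Poisson" regression understood here as a Poisson GLM with robust (sandwich) standard errors; the covariates and effect sizes are placeholders, not the survey data.

    ```python
    # Negative binomial regression and modified Poisson regression (Poisson GLM with
    # robust sandwich standard errors) on simulated injury-frequency data.
    import numpy as np
    import statsmodels.api as sm

    rng = np.random.default_rng(11)
    n = 1000
    male = rng.binomial(1, 0.5, n)
    smoker = rng.binomial(1, 0.2, n)
    mu = np.exp(-0.5 + 0.3 * male + 0.4 * smoker)
    injuries = rng.negative_binomial(1.5, 1.5 / (1.5 + mu))     # overdispersed counts

    X = sm.add_constant(np.column_stack([male, smoker]))

    nb_fit = sm.NegativeBinomial(injuries, X).fit(disp=False)
    modified_poisson = sm.GLM(injuries, X, family=sm.families.Poisson()).fit(cov_type="HC0")

    print(nb_fit.params)
    print(modified_poisson.params, modified_poisson.bse)   # robust standard errors
    ```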

  19. Spatiotemporal hurdle models for zero-inflated count data: Exploring trends in emergency department visits.

    PubMed

    Neelon, Brian; Chang, Howard H; Ling, Qiang; Hastings, Nicole S

    2016-12-01

    Motivated by a study exploring spatiotemporal trends in emergency department use, we develop a class of two-part hurdle models for the analysis of zero-inflated areal count data. The models consist of two components-one for the probability of any emergency department use and one for the number of emergency department visits given use. Through a hierarchical structure, the models incorporate both patient- and region-level predictors, as well as spatially and temporally correlated random effects for each model component. The random effects are assigned multivariate conditionally autoregressive priors, which induce dependence between the components and provide spatial and temporal smoothing across adjacent spatial units and time periods, resulting in improved inferences. To accommodate potential overdispersion, we consider a range of parametric specifications for the positive counts, including truncated negative binomial and generalized Poisson distributions. We adopt a Bayesian inferential approach, and posterior computation is handled conveniently within standard Bayesian software. Our results indicate that the negative binomial and generalized Poisson hurdle models vastly outperform the Poisson hurdle model, demonstrating that overdispersed hurdle models provide a useful approach to analyzing zero-inflated spatiotemporal data. © The Author(s) 2014.

  20. Assessment of some important factors affecting the singing-ground survey

    USGS Publications Warehouse

    Tautin, J.

    1982-01-01

    A brief history of the procedures used to analyze singing-ground survey data is outlined. Some weaknesses associated with the analytical procedures are discussed, and preliminary results of efforts to improve the procedures are presented. The most significant finding to date is that counts made by new observers need not be omitted when calculating an index of the woodcock population. Also, the distribution of woodcock heard singing, with respect to time after sunset, affirms the appropriateness of recommended starting times for counting woodcock. Woodcock count data fit the negative binomial probability distribution.

  1. Void probability as a function of the void's shape and scale-invariant models. [in studies of spacial galactic distribution

    NASA Technical Reports Server (NTRS)

    Elizalde, E.; Gaztanaga, E.

    1992-01-01

    The dependence of counts in cells on the shape of the cell for the large-scale galaxy distribution is studied. A very concrete prediction can be made concerning the void distribution for scale-invariant models. The prediction is tested on a sample of the CfA catalog, and good agreement is found. It is observed that the probability of a cell being occupied is higher for some elongated cells. A phenomenological scale-invariant model for the observed distribution of the counts in cells, an extension of the negative binomial distribution, is presented in order to illustrate how this dependence can be quantitatively determined. An original, intuitive derivation of this model is presented.

  2. Macro-level pedestrian and bicycle crash analysis: Incorporating spatial spillover effects in dual state count models.

    PubMed

    Cai, Qing; Lee, Jaeyoung; Eluru, Naveen; Abdel-Aty, Mohamed

    2016-08-01

    This study attempts to explore the viability of dual-state models (i.e., zero-inflated and hurdle models) for traffic analysis zones (TAZs) based pedestrian and bicycle crash frequency analysis. Additionally, spatial spillover effects are explored in the models by employing exogenous variables from neighboring zones. The dual-state models such as zero-inflated negative binomial and hurdle negative binomial models (with and without spatial effects) are compared with the conventional single-state model (i.e., negative binomial). The model comparison for pedestrian and bicycle crashes revealed that the models that considered observed spatial effects perform better than the models that did not consider the observed spatial effects. Across the models with spatial spillover effects, the dual-state models especially zero-inflated negative binomial model offered better performance compared to single-state models. Moreover, the model results clearly highlighted the importance of various traffic, roadway, and sociodemographic characteristics of the TAZ as well as neighboring TAZs on pedestrian and bicycle crash frequency. Copyright © 2016 Elsevier Ltd. All rights reserved.

  3. A comparison of different statistical methods analyzing hypoglycemia data using bootstrap simulations.

    PubMed

    Jiang, Honghua; Ni, Xiao; Huster, William; Heilmann, Cory

    2015-01-01

    Hypoglycemia has long been recognized as a major barrier to achieving normoglycemia with intensive diabetic therapies. It is a common safety concern for diabetes patients. Therefore, it is important to apply appropriate statistical methods when analyzing hypoglycemia data. Here, we carried out bootstrap simulations to investigate the performance of four commonly used statistical models (Poisson, negative binomial, analysis of covariance [ANCOVA], and rank ANCOVA) based on data from a diabetes clinical trial. The zero-inflated Poisson (ZIP) model and the zero-inflated negative binomial (ZINB) model were also evaluated. Simulation results showed that the Poisson model inflated type I error, while the negative binomial model was overly conservative. However, after adjusting for dispersion, both the Poisson and negative binomial models yielded slightly inflated type I errors that were close to the nominal level, and reasonable power. Reasonable control of type I error was associated with the ANCOVA model. The rank ANCOVA model was associated with the greatest power and with reasonable control of type I error. Inflated type I error was observed with the ZIP and ZINB models.
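
    As a small illustration of the kind of comparison described above (a simplified sketch, not the paper's bootstrap of real trial data), the code below simulates overdispersed counts with no true treatment effect and estimates how often a plain Poisson model rejects at the 5% level; the sample sizes, dispersion, and seed are arbitrary assumptions.

    ```python
    import numpy as np
    import statsmodels.api as sm

    rng = np.random.default_rng(7)
    n_per_arm, n_sims, rejections = 100, 500, 0
    for _ in range(n_sims):
        trt = np.repeat([0, 1], n_per_arm)        # two arms, no true effect
        mu = 2.0                                  # same mean count in both arms
        y = rng.negative_binomial(n=1.0, p=1.0 / (1.0 + mu), size=2 * n_per_arm)
        X = sm.add_constant(trt)
        fit = sm.GLM(y, X, family=sm.families.Poisson()).fit()
        rejections += fit.pvalues[1] < 0.05       # Wald test of the treatment term
    print("empirical type I error:", rejections / n_sims)
    ```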

  4. Zero adjusted models with applications to analysing helminths count data.

    PubMed

    Chipeta, Michael G; Ngwira, Bagrey M; Simoonga, Christopher; Kazembe, Lawrence N

    2014-11-27

    It is common in public health and epidemiology that the outcome of interest is a count of event occurrences. Analysing these data using classical linear models is mostly inappropriate, even after transformation of the outcome variables, because of overdispersion. Zero-adjusted mixture count models such as zero-inflated and hurdle count models are applied to count data when over-dispersion and excess zeros exist. The main objective of the current paper is to apply such models to analyse risk factors associated with human helminths (S. haematobium), particularly in a case where there is a high proportion of zero counts. The data were collected during a community-based randomised controlled trial assessing the impact of mass drug administration (MDA) with praziquantel in Malawi, and a school-based cross-sectional epidemiology survey in Zambia. Count data models including traditional (Poisson and negative binomial) models, zero-modified models (zero-inflated Poisson and zero-inflated negative binomial) and hurdle models (Poisson logit hurdle and negative binomial logit hurdle) were fitted and compared. Using the Akaike information criterion (AIC), the negative binomial logit hurdle (NBLH) and zero-inflated negative binomial (ZINB) models showed the best performance in both datasets. These models also captured the zero counts better than the other models. This paper showed that the zero-modified NBLH and ZINB models are more appropriate methods for the analysis of data with excess zeros. The choice between hurdle and zero-inflated models should be based on the aim and endpoints of the study.

  5. WHAMII - An enumeration and insertion procedure with binomial bounds for the stochastic time-constrained traveling salesman problem

    NASA Technical Reports Server (NTRS)

    Dahl, Roy W.; Keating, Karen; Salamone, Daryl J.; Levy, Laurence; Nag, Barindra; Sanborn, Joan A.

    1987-01-01

    This paper presents an algorithm (WHAMII) designed to solve the Artificial Intelligence Design Challenge at the 1987 AIAA Guidance, Navigation and Control Conference. The problem under consideration is a stochastic generalization of the traveling salesman problem in which travel costs can incur a penalty with a given probability. The variability in travel costs leads to a probability constraint with respect to violating the budget allocation. Given the small size of the problem (eleven cities), an approach is considered that combines partial tour enumeration with a heuristic city insertion procedure. For computational efficiency during both the enumeration and insertion procedures, precalculated binomial probabilities are used to determine an upper bound on the actual probability of violating the budget constraint for each tour. The actual probability is calculated for the final best tour, and additional insertions are attempted until the actual probability exceeds the bound.

  6. Partitioning Detectability Components in Populations Subject to Within-Season Temporary Emigration Using Binomial Mixture Models

    PubMed Central

    O’Donnell, Katherine M.; Thompson, Frank R.; Semlitsch, Raymond D.

    2015-01-01

    Detectability of individual animals is highly variable and nearly always < 1; imperfect detection must be accounted for to reliably estimate population sizes and trends. Hierarchical models can simultaneously estimate abundance and effective detection probability, but there are several different mechanisms that cause variation in detectability. Neglecting temporary emigration can lead to biased population estimates because availability and conditional detection probability are confounded. In this study, we extend previous hierarchical binomial mixture models to account for multiple sources of variation in detectability. The state process of the hierarchical model describes ecological mechanisms that generate spatial and temporal patterns in abundance, while the observation model accounts for the imperfect nature of counting individuals due to temporary emigration and false absences. We illustrate our model’s potential advantages, including the allowance of temporary emigration between sampling periods, with a case study of southern red-backed salamanders Plethodon serratus. We fit our model and a standard binomial mixture model to counts of terrestrial salamanders surveyed at 40 sites during 3–5 surveys each spring and fall 2010–2012. Our models generated similar parameter estimates to standard binomial mixture models. Aspect was the best predictor of salamander abundance in our case study; abundance increased as aspect became more northeasterly. Increased time-since-rainfall strongly decreased salamander surface activity (i.e. availability for sampling), while higher amounts of woody cover objects and rocks increased conditional detection probability (i.e. probability of capture, given an animal is exposed to sampling). By explicitly accounting for both components of detectability, we increased congruence between our statistical modeling and our ecological understanding of the system. We stress the importance of choosing survey locations and protocols that maximize species availability and conditional detection probability to increase population parameter estimate reliability. PMID:25775182

  7. Single- and multiple-pulse noncoherent detection statistics associated with partially developed speckle.

    PubMed

    Osche, G R

    2000-08-20

    Single- and multiple-pulse detection statistics are presented for aperture-averaged direct detection optical receivers operating against partially developed speckle fields. A partially developed speckle field arises when the probability density function of the received intensity does not follow negative exponential statistics. The case of interest here is the target surface that exhibits diffuse as well as specular components in the scattered radiation. An approximate expression is derived for the integrated intensity at the aperture, which leads to single- and multiple-pulse discrete probability density functions for the case of a Poisson signal in Poisson noise with an additive coherent component. In the absence of noise, the single-pulse discrete density function is shown to reduce to a generalized negative binomial distribution. The radar concept of integration loss is discussed in the context of direct detection optical systems where it is shown that, given an appropriate set of system parameters, multiple-pulse processing can be more efficient than single-pulse processing over a finite range of the integration parameter n.

  8. A Mixed-Effects Heterogeneous Negative Binomial Model for Postfire Conifer Regeneration in Northeastern California, USA

    Treesearch

    Justin S. Crotteau; Martin W. Ritchie; J. Morgan Varner

    2014-01-01

    Many western USA fire regimes are typified by mixed-severity fire, which compounds the variability inherent to natural regeneration densities in associated forests. Tree regeneration data are often discrete and nonnegative; accordingly, we fit a series of Poisson and negative binomial variation models to conifer seedling counts across four distinct burn severities and...

  9. Smisc - A collection of miscellaneous functions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Landon Sego, PNNL

    2015-08-31

    A collection of functions for statistical computing and data manipulation. These include routines for rapidly aggregating heterogeneous matrices, manipulating file names, loading R objects, sourcing multiple R files, formatting datetimes, multi-core parallel computing, stream editing, specialized plotting, etc. The package index includes: allMissing (identifies missing rows or columns in a data frame or matrix); as.numericSilent (silent wrapper for coercing a vector to numeric); comboList (produces all possible combinations of a set of linear model predictors); cumMax (computes the maximum of a vector up to the current index); cumsumNA (computes the cumulative sum of a vector without propagating NAs); d2binom, p2binom (probability functions for the sum of two independent binomials); dataIn (a flexible way to import data into R); dbb, pbb, qbb, rbb (the Beta-Binomial distribution); df2list (row-wise conversion of a data frame to a list); dfplapply (parallelized single-row processing of a data frame); dframeEquiv (examines the equivalence of two data frames or matrices); dkbinom, pkbinom (probability functions for the sum of k independent binomials); factor2character (converts all factor variables in a data frame to character variables); findDepMat (identifies linearly dependent rows or columns in a matrix); formatDT (converts date or datetime strings into alternate formats); getExtension, getPath, grabLast (filename manipulations: remove or extract the extension or path); ifelse1 (non-vectorized version of ifelse); integ (simple numerical integration routine); interactionPlot (two-way interaction plot with error bars); linearMap (linear mapping of a numerical vector or scalar); list2df (converts a list to a data frame); loadObject (loads and returns the object(s) in an ".Rdata" file); more (displays the contents of a file at the R terminal); movAvg2 (calculates the moving average using a two-sided window); openDevice (opens a graphics device based on the filename extension); padZero (pads a vector of numbers with zeros); parseJob (parses a collection of elements into (almost) equal-sized groups); pcbinom (a continuous version of the binomial cdf); plapply (simple parallelization of lapply); plotFun (plots one or more functions on a single plot); PowerData (an example of power data); pvar (prints the name and value of one or more objects); and numerous others (space limits reporting).

  10. Accident prediction model for public highway-rail grade crossings.

    PubMed

    Lu, Pan; Tolliver, Denver

    2016-05-01

    Considerable research has focused on roadway accident frequency analysis, but relatively little research has examined safety evaluation at highway-rail grade crossings. Highway-rail grade crossings are critical spatial locations of utmost importance for transportation safety because traffic crashes at highway-rail grade crossings are often catastrophic with serious consequences. The Poisson regression model has long been employed as a starting point for analyzing vehicle accident frequency. The most commonly applied variations of the Poisson model are the negative binomial and zero-inflated Poisson models. These models are used to deal with common crash data issues such as over-dispersion (sample variance is larger than the sample mean) and preponderance of zeros (low sample mean and small sample size). On rare occasions traffic crash data have been shown to be under-dispersed (sample variance is smaller than the sample mean) and traditional distributions such as Poisson or negative binomial cannot handle under-dispersion well. The objective of this study is to investigate and compare various alternate highway-rail grade crossing accident frequency models that can handle the under-dispersion issue. The contributions of the paper are two-fold: (1) the application of probability models to deal with under-dispersion issues and (2) insights regarding vehicle crashes at public highway-rail grade crossings. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. Estimating cavity tree and snag abundance using negative binomial regression models and nearest neighbor imputation methods

    Treesearch

    Bianca N.I. Eskelson; Hailemariam Temesgen; Tara M. Barrett

    2009-01-01

    Cavity tree and snag abundance data are highly variable and contain many zero observations. We predict cavity tree and snag abundance from variables that are readily available from forest cover maps or remotely sensed data using negative binomial (NB), zero-inflated NB, and zero-altered NB (ZANB) regression models as well as nearest neighbor (NN) imputation methods....

  12. I Remember You: Independence and the Binomial Model

    ERIC Educational Resources Information Center

    Levine, Douglas W.; Rockhill, Beverly

    2006-01-01

    We focus on the problem of ignoring statistical independence. A binomial experiment is used to determine whether judges could match, based on looks alone, dogs to their owners. The experimental design introduces dependencies such that the probability of a given judge correctly matching a dog and an owner changes from trial to trial. We show how…

  13. Poisson and negative binomial item count techniques for surveys with sensitive question.

    PubMed

    Tian, Guo-Liang; Tang, Man-Lai; Wu, Qin; Liu, Yin

    2017-04-01

    Although the item count technique is useful in surveys with sensitive questions, the privacy of those respondents who possess the sensitive characteristic of interest may not be well protected due to a defect in its original design. In this article, we propose two new survey designs (namely the Poisson item count technique and negative binomial item count technique) which replace several independent Bernoulli random variables required by the original item count technique with a single Poisson or negative binomial random variable, respectively. The proposed models not only provide a closed-form variance estimate and a confidence interval within [0, 1] for the sensitive proportion, but also simplify the survey design of the original item count technique. Most importantly, the new designs do not leak respondents' privacy. Empirical results show that the proposed techniques perform satisfactorily in the sense that they yield accurate parameter estimates and confidence intervals.

  14. Visualizing and Understanding Probability and Statistics: Graphical Simulations Using Excel

    ERIC Educational Resources Information Center

    Gordon, Sheldon P.; Gordon, Florence S.

    2009-01-01

    The authors describe a collection of dynamic interactive simulations for teaching and learning most of the important ideas and techniques of introductory statistics and probability. The modules cover such topics as randomness, simulations of probability experiments such as coin flipping, dice rolling and general binomial experiments, a simulation…

  15. Orchestrating Semiotic Leaps from Tacit to Cultural Quantitative Reasoning--The Case of Anticipating Experimental Outcomes of a Quasi-Binomial Random Generator

    ERIC Educational Resources Information Center

    Abrahamson, Dor

    2009-01-01

    This article reports on a case study from a design-based research project that investigated how students make sense of the disciplinary tools they are taught to use, and specifically, what personal, interpersonal, and material resources support this process. The probability topic of binomial distribution was selected due to robust documentation of…

  16. Data mining of tree-based models to analyze freeway accident frequency.

    PubMed

    Chang, Li-Yen; Chen, Wen-Chieh

    2005-01-01

    Statistical models, such as Poisson or negative binomial regression models, have been employed to analyze vehicle accident frequency for many years. However, these models have their own model assumptions and pre-defined underlying relationship between dependent and independent variables. If these assumptions are violated, the model could lead to erroneous estimation of accident likelihood. Classification and Regression Tree (CART), one of the most widely applied data mining techniques, has been commonly employed in business administration, industry, and engineering. CART does not require any pre-defined underlying relationship between target (dependent) variable and predictors (independent variables) and has been shown to be a powerful tool, particularly for dealing with prediction and classification problems. This study collected the 2001-2002 accident data of National Freeway 1 in Taiwan. A CART model and a negative binomial regression model were developed to establish the empirical relationship between traffic accidents and highway geometric variables, traffic characteristics, and environmental factors. The CART findings indicated that the average daily traffic volume and precipitation variables were the key determinants for freeway accident frequencies. By comparing the prediction performance between the CART and the negative binomial regression models, this study demonstrates that CART is a good alternative method for analyzing freeway accident frequencies.

  17. Distribution-free Inference of Zero-inflated Binomial Data for Longitudinal Studies.

    PubMed

    He, H; Wang, W J; Hu, J; Gallop, R; Crits-Christoph, P; Xia, Y L

    2015-10-01

    Count responses with structural zeros are very common in medical and psychosocial research, especially in alcohol and HIV research, and the zero-inflated Poisson (ZIP) and zero-inflated negative binomial (ZINB) models are widely used for modeling such outcomes. However, as alcohol drinking outcomes such as days of drinking are counts within a given period, their distributions are bounded above by an upper limit (the total days in the period) and thus, in the presence of structural zeros, inherently follow a binomial or zero-inflated binomial (ZIB) distribution rather than a Poisson or ZIP distribution. In this paper, we develop a new semiparametric approach for modeling ZIB-like count responses for cross-sectional as well as longitudinal data. We illustrate this approach with both simulated and real study data.

  18. Dispersion and sampling of adult Dermacentor andersoni in rangeland in Western North America.

    PubMed

    Rochon, K; Scoles, G A; Lysyk, T J

    2012-03-01

    A fixed precision sampling plan was developed for off-host populations of adult Rocky Mountain wood tick, Dermacentor andersoni (Stiles) based on data collected by dragging at 13 locations in Alberta, Canada; Washington; and Oregon. In total, 222 site-date combinations were sampled. Each site-date combination was considered a sample, and each sample ranged in size from 86 to 250 10 m2 quadrats. Analysis of simulated quadrats ranging in size from 10 to 50 m2 indicated that the most precise sample unit was the 10 m2 quadrat. Samples taken when abundance < 0.04 ticks per 10 m2 were more likely to not depart significantly from statistical randomness than samples taken when abundance was greater. Data were grouped into ten abundance classes and assessed for fit to the Poisson and negative binomial distributions. The Poisson distribution fit only data in abundance classes < 0.02 ticks per 10 m2, while the negative binomial distribution fit data from all abundance classes. A negative binomial distribution with common k = 0.3742 fit data in eight of the 10 abundance classes. Both the Taylor and Iwao mean-variance relationships were fit and used to predict sample sizes for a fixed level of precision. Sample sizes predicted using the Taylor model tended to underestimate actual sample sizes, while sample sizes estimated using the Iwao model tended to overestimate actual sample sizes. Using a negative binomial with common k provided estimates of required sample sizes closest to empirically calculated sample sizes.

  19. The Sequential Probability Ratio Test: An efficient alternative to exact binomial testing for Clean Water Act 303(d) evaluation.

    PubMed

    Chen, Connie; Gribble, Matthew O; Bartroff, Jay; Bay, Steven M; Goldstein, Larry

    2017-05-01

    The United States's Clean Water Act stipulates in section 303(d) that states must identify impaired water bodies for which total maximum daily loads (TMDLs) of pollution inputs into water bodies are developed. Decision-making procedures about how to list, or delist, water bodies as impaired, or not, per Clean Water Act 303(d) differ across states. In states such as California, whether or not a particular monitoring sample suggests that water quality is impaired can be regarded as a binary outcome variable, and California's current regulatory framework invokes a version of the exact binomial test to consolidate evidence across samples and assess whether the overall water body complies with the Clean Water Act. Here, we contrast the performance of California's exact binomial test with one potential alternative, the Sequential Probability Ratio Test (SPRT). The SPRT uses a sequential testing framework, testing samples as they become available and evaluating evidence as it emerges, rather than measuring all the samples and calculating a test statistic at the end of the data collection process. Through simulations and theoretical derivations, we demonstrate that the SPRT on average requires fewer samples to be measured to have comparable Type I and Type II error rates as the current fixed-sample binomial test. Policymakers might consider efficient alternatives such as SPRT to current procedure. Copyright © 2017 Elsevier Ltd. All rights reserved.
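
    The sequential idea can be sketched in a few lines: accumulate the log-likelihood ratio over binary exceedance indicators and stop as soon as it crosses a Wald boundary. The acceptable and impaired exceedance rates, error targets, and sample values below are hypothetical illustrations, not the regulatory thresholds discussed in the paper.

    ```python
    import numpy as np

    def binomial_sprt(samples, p0=0.05, p1=0.25, alpha=0.05, beta=0.20):
        """Wald SPRT on binary exceedance indicators (1 = sample exceeds the standard)."""
        upper = np.log((1 - beta) / alpha)   # cross above: conclude "impaired"
        lower = np.log(beta / (1 - alpha))   # cross below: conclude "not impaired"
        llr = 0.0
        for n, x in enumerate(samples, start=1):
            llr += x * np.log(p1 / p0) + (1 - x) * np.log((1 - p1) / (1 - p0))
            if llr >= upper:
                return "impaired", n
            if llr <= lower:
                return "not impaired", n
        return "continue sampling", len(samples)

    # Twelve monitoring samples, three of which exceed the water-quality standard
    print(binomial_sprt([0, 0, 1, 0, 0, 0, 1, 0, 0, 0, 1, 0]))
    ```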

  20. Accounting for non-independent detection when estimating abundance of organisms with a Bayesian approach

    USGS Publications Warehouse

    Martin, Julien; Royle, J. Andrew; MacKenzie, Darryl I.; Edwards, Holly H.; Kery, Marc; Gardner, Beth

    2011-01-01

    Summary 1. Binomial mixture models use repeated count data to estimate abundance. They are becoming increasingly popular because they provide a simple and cost-effective way to account for imperfect detection. However, these models assume that individuals are detected independently of each other. This assumption may often be violated in the field. For instance, manatees (Trichechus manatus latirostris) may surface in turbid water (i.e. become available for detection during aerial surveys) in a correlated manner (i.e. in groups). However, correlated behaviour, affecting the non-independence of individual detections, may also be relevant in other systems (e.g. correlated patterns of singing in birds and amphibians). 2. We extend binomial mixture models to account for correlated behaviour and therefore to account for non-independent detection of individuals. We simulated correlated behaviour using beta-binomial random variables. Our approach can be used to simultaneously estimate abundance, detection probability and a correlation parameter. 3. Fitting binomial mixture models to data that followed a beta-binomial distribution resulted in an overestimation of abundance even for moderate levels of correlation. In contrast, the beta-binomial mixture model performed considerably better in our simulation scenarios. We also present a goodness-of-fit procedure to evaluate the fit of beta-binomial mixture models. 4. We illustrate our approach by fitting both binomial and beta-binomial mixture models to aerial survey data of manatees in Florida. We found that the binomial mixture model did not fit the data, whereas there was no evidence of lack of fit for the beta-binomial mixture model. This example helps illustrate the importance of using simulations and assessing goodness-of-fit when analysing ecological data with N-mixture models. Indeed, both the simulations and the goodness-of-fit procedure highlighted the limitations of the standard binomial mixture model for aerial manatee surveys. 5. Overestimation of abundance by binomial mixture models owing to non-independent detections is problematic for ecological studies, but also for conservation. For example, in the case of endangered species, it could lead to inappropriate management decisions, such as downlisting. These issues will be increasingly relevant as more ecologists apply flexible N-mixture models to ecological data.

  1. An examination of sources of sensitivity of consumer surplus estimates in travel cost models.

    PubMed

    Blaine, Thomas W; Lichtkoppler, Frank R; Bader, Timothy J; Hartman, Travis J; Lucente, Joseph E

    2015-03-15

    We examine the sensitivity of estimates of recreation demand using the Travel Cost Method (TCM) to four factors. Three of the four have been routinely and widely discussed in the TCM literature: a) Poisson versus negative binomial regression; b) application of the Englin correction to account for endogenous stratification; c) truncation of the data set to eliminate outliers. A fourth issue we address has not been widely modeled: the potential effect on recreation demand of the interaction between income and travel cost. We provide a straightforward comparison of all four factors, analyzing the impact of each on regression parameters and consumer surplus estimates. Truncation has a modest effect on estimates obtained from the Poisson models but a radical effect on the estimates obtained by way of the negative binomial. Inclusion of an income-travel cost interaction term generally produces a more conservative but not a statistically significantly different estimate of consumer surplus in both Poisson and negative binomial models. It also generates broader confidence intervals. Application of truncation, the Englin correction and the income-travel cost interaction produced the most conservative estimates of consumer surplus and eliminated the statistical difference between the Poisson and the negative binomial. Use of the income-travel cost interaction term reveals that for visitors who face relatively low travel costs, the relationship between income and travel demand is negative, while it is positive for those who face high travel costs. This helps explain the ambiguous findings regarding the role of income widely observed in the TCM literature. Our results suggest that policies that reduce access to publicly owned resources inordinately impact local low-income recreationists and are contrary to environmental justice. Copyright © 2014 Elsevier Ltd. All rights reserved.

  2. A Unifying Probability Example.

    ERIC Educational Resources Information Center

    Maruszewski, Richard F., Jr.

    2002-01-01

    Presents an example from probability and statistics that ties together several topics including the mean and variance of a discrete random variable, the binomial distribution and its particular mean and variance, the sum of independent random variables, the mean and variance of the sum, and the central limit theorem. Uses Excel to illustrate these…

  3. Solving the problem of negative populations in approximate accelerated stochastic simulations using the representative reaction approach.

    PubMed

    Kadam, Shantanu; Vanka, Kumar

    2013-02-15

    Methods based on the stochastic formulation of chemical kinetics have the potential to accurately reproduce the dynamical behavior of various biochemical systems of interest. However, the computational expense makes them impractical for the study of real systems. Attempts to render these methods practical have led to the development of accelerated methods, where the reaction numbers are modeled by Poisson random numbers. However, for certain systems, such methods give rise to physically unrealistic negative numbers for species populations. The methods which make use of binomial variables, in place of Poisson random numbers, have since become popular, and have been partially successful in addressing this problem. In this manuscript, the development of two new computational methods, based on the representative reaction approach (RRA), has been discussed. The new methods endeavor to solve the problem of negative numbers, by making use of tools like the stochastic simulation algorithm and the binomial method, in conjunction with the RRA. It is found that these newly developed methods perform better than other binomial methods used for stochastic simulations, in resolving the problem of negative populations. Copyright © 2012 Wiley Periodicals, Inc.

  4. Application of the Hyper-Poisson Generalized Linear Model for Analyzing Motor Vehicle Crashes.

    PubMed

    Khazraee, S Hadi; Sáez-Castillo, Antonio Jose; Geedipally, Srinivas Reddy; Lord, Dominique

    2015-05-01

    The hyper-Poisson distribution can handle both over- and underdispersion, and its generalized linear model formulation allows the dispersion of the distribution to be observation-specific and dependent on model covariates. This study's objective is to examine the potential applicability of a newly proposed generalized linear model framework for the hyper-Poisson distribution in analyzing motor vehicle crash count data. The hyper-Poisson generalized linear model was first fitted to intersection crash data from Toronto, characterized by overdispersion, and then to crash data from railway-highway crossings in Korea, characterized by underdispersion. The results of this study are promising. When fitted to the Toronto data set, the goodness-of-fit measures indicated that the hyper-Poisson model with a variable dispersion parameter provided a statistical fit as good as the traditional negative binomial model. The hyper-Poisson model was also successful in handling the underdispersed data from Korea; the model performed as well as the gamma probability model and the Conway-Maxwell-Poisson model previously developed for the same data set. The advantages of the hyper-Poisson model studied in this article are noteworthy. Unlike the negative binomial model, which has difficulties in handling underdispersed data, the hyper-Poisson model can handle both over- and underdispersed crash data. Although not a major issue for the Conway-Maxwell-Poisson model, the effect of each variable on the expected mean of crashes is easily interpretable in the case of this new model. © 2014 Society for Risk Analysis.

  5. A Random Variable Transformation Process.

    ERIC Educational Resources Information Center

    Scheuermann, Larry

    1989-01-01

    Provides a short BASIC program, RANVAR, which generates random variates for various theoretical probability distributions. The seven variates include: uniform, exponential, normal, binomial, Poisson, Pascal, and triangular. (MVL)

  6. Detection of influenza-like illness aberrations by directly monitoring Pearson residuals of fitted negative binomial regression models.

    PubMed

    Chan, Ta-Chien; Teng, Yung-Chu; Hwang, Jing-Shiang

    2015-02-21

    Emerging novel influenza outbreaks have increasingly been a threat to the public and a major concern of public health departments. Real-time data in seamless surveillance systems such as health insurance claims data for influenza-like illnesses (ILI) are ready for analysis, making it highly desirable to develop practical techniques to analyze such readymade data for outbreak detection so that the public can receive timely influenza epidemic warnings. This study proposes a simple and effective approach to analyze area-based health insurance claims data including outpatient and emergency department (ED) visits for early detection of any aberrations of ILI. The health insurance claims data during 2004-2009 from a national health insurance research database were used for developing early detection methods. The proposed approach fitted the daily new ILI visits and monitored the Pearson residuals directly for aberration detection. First, negative binomial regression was used for both outpatient and ED visits to adjust for potentially influential factors such as holidays, weekends, seasons, temporal dependence and temperature. Second, if the Pearson residuals exceeded 1.96, aberration signals were issued. The empirical validation of the model was done in 2008 and 2009. In addition, we designed a simulation study to compare the time of outbreak detection, non-detection probability and false alarm rate between the proposed method and modified CUSUM. The model successfully detected the aberrations of 2009 pandemic (H1N1) influenza virus in northern, central and southern Taiwan. The proposed approach was more sensitive in identifying aberrations in ED visits than those in outpatient visits. Simulation studies demonstrated that the proposed approach could detect the aberrations earlier, and with lower non-detection probability and mean false alarm rate in detecting aberrations compared to modified CUSUM methods. The proposed simple approach was able to filter out temporal trends, adjust for temperature, and issue warning signals for the first wave of the influenza epidemic in a timely and accurate manner.
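
    A minimal sketch of the detection rule described above: fit a negative binomial regression to daily visit counts with a couple of adjustment covariates and flag days whose Pearson residuals exceed 1.96. The simulated counts, covariates, and dispersion value are illustrative assumptions; the paper's full adjustment set (holidays, season, temporal dependence) is omitted here.

    ```python
    import numpy as np
    import pandas as pd
    import statsmodels.api as sm

    # Simulated daily ILI visit counts with weekend and temperature effects
    rng = np.random.default_rng(1)
    n_days = 365
    temp = 20 + 8 * np.sin(2 * np.pi * np.arange(n_days) / 365)
    weekend = (np.arange(n_days) % 7 >= 5).astype(float)
    mu = np.exp(3.0 - 0.03 * temp + 0.4 * weekend)
    y = rng.negative_binomial(n=5, p=5 / (5 + mu))      # overdispersed counts

    X = sm.add_constant(pd.DataFrame({"temp": temp, "weekend": weekend}))
    fit = sm.GLM(y, X, family=sm.families.NegativeBinomial(alpha=0.2)).fit()

    # Issue an aberration signal whenever the Pearson residual exceeds 1.96
    signals = np.where(fit.resid_pearson > 1.96)[0]
    print("days flagged:", signals[:10])
    ```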

  7. Fitting statistical distributions to sea duck count data: implications for survey design and abundance estimation

    USGS Publications Warehouse

    Zipkin, Elise F.; Leirness, Jeffery B.; Kinlan, Brian P.; O'Connell, Allan F.; Silverman, Emily D.

    2014-01-01

    Determining appropriate statistical distributions for modeling animal count data is important for accurate estimation of abundance, distribution, and trends. In the case of sea ducks along the U.S. Atlantic coast, managers want to estimate local and regional abundance to detect and track population declines, to define areas of high and low use, and to predict the impact of future habitat change on populations. In this paper, we used a modified marked point process to model survey data that recorded flock sizes of Common eiders, Long-tailed ducks, and Black, Surf, and White-winged scoters. The data come from an experimental aerial survey, conducted by the United States Fish & Wildlife Service (USFWS) Division of Migratory Bird Management, during which east-west transects were flown along the Atlantic Coast from Maine to Florida during the winters of 2009–2011. To model the number of flocks per transect (the points), we compared the fit of four statistical distributions (zero-inflated Poisson, zero-inflated geometric, zero-inflated negative binomial and negative binomial) to data on the number of species-specific sea duck flocks that were recorded for each transect flown. To model the flock sizes (the marks), we compared the fit of flock size data for each species to seven statistical distributions: positive Poisson, positive negative binomial, positive geometric, logarithmic, discretized lognormal, zeta and Yule–Simon. Akaike’s Information Criterion and Vuong’s closeness tests indicated that the negative binomial and discretized lognormal were the best distributions for all species for the points and marks, respectively. These findings have important implications for estimating sea duck abundances as the discretized lognormal is a more skewed distribution than the Poisson and negative binomial, which are frequently used to model avian counts; the lognormal is also less heavy-tailed than the power law distributions (e.g., zeta and Yule–Simon), which are becoming increasingly popular for group size modeling. Choosing appropriate statistical distributions for modeling flock size data is fundamental to accurately estimating population summaries, determining required survey effort, and assessing and propagating uncertainty through decision-making processes.
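
    As a toy illustration of the distribution-comparison step (synthetic counts, not the survey data), the sketch below fits a Poisson and a negative binomial to per-transect flock counts by maximum likelihood and compares them by AIC; the simulated parameters and starting values are arbitrary.

    ```python
    import numpy as np
    from scipy import stats
    from scipy.optimize import minimize

    rng = np.random.default_rng(3)
    counts = rng.negative_binomial(n=1.5, p=0.3, size=400)   # synthetic transect counts

    # Poisson: the MLE of the rate is the sample mean
    lam = counts.mean()
    aic_pois = 2 * 1 - 2 * stats.poisson.logpmf(counts, lam).sum()

    # Negative binomial: maximise the log-likelihood over (size, prob)
    def nb_negloglik(params):
        r, p = params
        return -stats.nbinom.logpmf(counts, r, p).sum()

    res = minimize(nb_negloglik, x0=[1.0, 0.5],
                   bounds=[(1e-6, None), (1e-6, 1 - 1e-6)])
    aic_nb = 2 * 2 + 2 * res.fun
    print(f"AIC Poisson {aic_pois:.1f} vs AIC negative binomial {aic_nb:.1f}")
    ```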

  8. Enumerative and binomial sequential sampling plans for the multicolored Asian lady beetle (Coleoptera: Coccinellidae) in wine grapes.

    PubMed

    Galvan, T L; Burkness, E C; Hutchison, W D

    2007-06-01

    To develop a practical integrated pest management (IPM) system for the multicolored Asian lady beetle, Harmonia axyridis (Pallas) (Coleoptera: Coccinellidae), in wine grapes, we assessed the spatial distribution of H. axyridis and developed eight sampling plans to estimate adult density or infestation level in grape clusters. We used 49 data sets collected from commercial vineyards in 2004 and 2005, in Minnesota and Wisconsin. Enumerative plans were developed using two precision levels (0.10 and 0.25); the six binomial plans reflected six unique action thresholds (3, 7, 12, 18, 22, and 31% of cluster samples infested with at least one H. axyridis). The spatial distribution of H. axyridis in wine grapes was aggregated, independent of cultivar and year, but it was more randomly distributed as mean density declined. The average sample number (ASN) for each sampling plan was determined using resampling software. For research purposes, an enumerative plan with a precision level of 0.10 (SE/X) resulted in a mean ASN of 546 clusters. For IPM applications, the enumerative plan with a precision level of 0.25 resulted in a mean ASN of 180 clusters. In contrast, the binomial plans resulted in much lower ASNs and provided high probabilities of arriving at correct "treat or no-treat" decisions, making these plans more efficient for IPM applications. For a tally threshold of one adult per cluster, the operating characteristic curves for the six action thresholds provided binomial sequential sampling plans with mean ASNs of only 19-26 clusters, and probabilities of making correct decisions between 83 and 96%. The benefits of the binomial sampling plans are discussed within the context of improving IPM programs for wine grapes.

  9. A Statistical Tool for Risk Assessment as Function of Number of Retrieved Lymph Nodes from Rectal Cancer Patients.

    PubMed

    Wu, Zhenyu; Qin, Guoyou; Zhao, Naiqing; Jia, Huixun; Zheng, Xueying

    2018-05-16

    Although a minimum of 12 lymph nodes (LNs) has been recommended for colorectal cancer, considerable debate remains for rectal cancer patients. An inadequate number of examined LNs can lead to under-staging and, as a consequence, inappropriate treatment. We describe a statistical tool that estimates the probability of false-negative nodes. A total of 26,778 rectal adenocarcinoma patients with tumour stage (T stage) 1-3, diagnosed between 2004 and 2013, who did not receive neoadjuvant therapies and had at least one histologically assessed LN, were extracted from the Surveillance, Epidemiology and End Results (SEER) database. A statistical tool using the beta-binomial distribution was developed to estimate the probability that a patient with occult nodal disease is falsely classified as node-negative, as a function of the total number of LNs examined and T stage. The probability of falsely identifying a patient as node-negative decreased with an increasing number of nodes examined for each stage. It was estimated to be 72%, 66% and 52% for T1, T2 and T3 patients, respectively, with a single node examined. To confirm the absence of occult nodal disease with 90% confidence, 5, 9, and 29 nodes need to be examined for patients in stages T1, T2, and T3, respectively. The false-negative rate of the examined lymph nodes in rectal cancer was verified to depend preoperatively on the clinical tumour stage. A more accurate nodal staging score was developed to recommend a threshold for the minimum number of examined nodes at the desired level of confidence. This article is protected by copyright. All rights reserved.
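
    The beta-binomial calculation behind such a tool can be sketched directly: if the per-node probability of harbouring disease follows a Beta(a, b) distribution, the chance that all n examined nodes appear negative in a truly node-positive patient is the beta-binomial probability of zero successes. The parameter values below are purely illustrative; the paper estimates stage-specific values from the SEER data.

    ```python
    import numpy as np
    from scipy.special import betaln

    def prob_all_nodes_negative(n_examined, a, b):
        """P(all n examined nodes test negative) under a Beta(a, b) per-node
        positivity probability, i.e. the beta-binomial P(X = 0)."""
        return np.exp(betaln(a, b + n_examined) - betaln(a, b))

    a, b = 0.8, 2.0          # hypothetical shape parameters, for illustration only
    for n in (1, 5, 10, 20, 30):
        print(n, round(prob_all_nodes_negative(n, a, b), 3))
    ```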

  10. The Detection of Signals in Impulsive Noise.

    DTIC Science & Technology

    1983-06-01

    Only a legible fragment of the abstract survives in this record: if the noise has a symmetric distribution, sgn(x_i) will be -1 with probability 1/2 and +1 with probability 1/2, so the sum of such sign observations can be treated as a binomial random variable.

  11. A chi-square goodness-of-fit test for non-identically distributed random variables: with application to empirical Bayes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Conover, W.J.; Cox, D.D.; Martz, H.F.

    1997-12-01

    When using parametric empirical Bayes estimation methods for estimating the binomial or Poisson parameter, the validity of the assumed beta or gamma conjugate prior distribution is an important diagnostic consideration. Chi-square goodness-of-fit tests of the beta or gamma prior hypothesis are developed for use when the binomial sample sizes or Poisson exposure times vary. Nine examples illustrate the application of the methods, using real data from such diverse applications as the loss of feedwater flow rates in nuclear power plants, the probability of failure to run on demand and the failure rates of the high pressure coolant injection systems at US commercial boiling water reactors, the probability of failure to run on demand of emergency diesel generators in US commercial nuclear power plants, the rate of failure of aircraft air conditioners, baseball batting averages, the probability of testing positive for toxoplasmosis, and the probability of tumors in rats. The tests are easily applied in practice by means of corresponding Mathematica® computer programs which are provided.

  12. Modeling number of claims and prediction of total claim amount

    NASA Astrophysics Data System (ADS)

    Acar, Aslıhan Şentürk; Karabey, Uǧur

    2017-07-01

    In this study we focus on annual number of claims of a private health insurance data set which belongs to a local insurance company in Turkey. In addition to Poisson model and negative binomial model, zero-inflated Poisson model and zero-inflated negative binomial model are used to model the number of claims in order to take into account excess zeros. To investigate the impact of different distributional assumptions for the number of claims on the prediction of total claim amount, predictive performances of candidate models are compared by using root mean square error (RMSE) and mean absolute error (MAE) criteria.

  13. Modelling parasite aggregation: disentangling statistical and ecological approaches.

    PubMed

    Yakob, Laith; Soares Magalhães, Ricardo J; Gray, Darren J; Milinovich, Gabriel; Wardrop, Nicola; Dunning, Rebecca; Barendregt, Jan; Bieri, Franziska; Williams, Gail M; Clements, Archie C A

    2014-05-01

    The overdispersion in macroparasite infection intensity among host populations is commonly simulated using a constant negative binomial aggregation parameter. We describe an alternative to utilising the negative binomial approach and demonstrate important disparities in intervention efficacy projections that can come about from opting for pattern-fitting models that are not process-explicit. We present model output in the context of the epidemiology and control of soil-transmitted helminths due to the significant public health burden imposed by these parasites, but our methods are applicable to other infections with demonstrable aggregation in parasite numbers among hosts. Copyright © 2014. Published by Elsevier Ltd.

  14. FluBreaks: early epidemic detection from Google flu trends.

    PubMed

    Pervaiz, Fahad; Pervaiz, Mansoor; Abdur Rehman, Nabeel; Saif, Umar

    2012-10-04

    The Google Flu Trends service was launched in 2008 to track changes in the volume of online search queries related to flu-like symptoms. Over the last few years, the trend data produced by this service has shown a consistent relationship with the actual number of flu reports collected by the US Centers for Disease Control and Prevention (CDC), often identifying increases in flu cases weeks in advance of CDC records. However, contrary to popular belief, Google Flu Trends is not an early epidemic detection system. Instead, it is designed as a baseline indicator of the trend, or changes, in the number of disease cases. To evaluate whether these trends can be used as a basis for an early warning system for epidemics. We present the first detailed algorithmic analysis of how Google Flu Trends can be used as a basis for building a fully automated system for early warning of epidemics in advance of methods used by the CDC. Based on our work, we present a novel early epidemic detection system, called FluBreaks (dritte.org/flubreaks), based on Google Flu Trends data. We compared the accuracy and practicality of three types of algorithms: normal distribution algorithms, Poisson distribution algorithms, and negative binomial distribution algorithms. We explored the relative merits of these methods, and related our findings to changes in Internet penetration and population size for the regions in Google Flu Trends providing data. Across our performance metrics of percentage true-positives (RTP), percentage false-positives (RFP), percentage overlap (OT), and percentage early alarms (EA), Poisson- and negative binomial-based algorithms performed better in all except RFP. Poisson-based algorithms had average values of 99%, 28%, 71%, and 76% for RTP, RFP, OT, and EA, respectively, whereas negative binomial-based algorithms had average values of 97.8%, 17.8%, 60%, and 55% for RTP, RFP, OT, and EA, respectively. Moreover, the EA was also affected by the region's population size. Regions with larger populations (regions 4 and 6) had higher values of EA than region 10 (which had the smallest population) for negative binomial- and Poisson-based algorithms. The difference was 12.5% and 13.5% on average in negative binomial- and Poisson-based algorithms, respectively. We present the first detailed comparative analysis of popular early epidemic detection algorithms on Google Flu Trends data. We note that realizing this opportunity requires moving beyond the cumulative sum and historical limits method-based normal distribution approaches, traditionally employed by the CDC, to negative binomial- and Poisson-based algorithms to deal with potentially noisy search query data from regions with varying population and Internet penetrations. Based on our work, we have developed FluBreaks, an early warning system for flu epidemics using Google Flu Trends.

  15. Football fever: goal distributions and non-Gaussian statistics

    NASA Astrophysics Data System (ADS)

    Bittner, E.; Nußbaumer, A.; Janke, W.; Weigel, M.

    2009-02-01

    Analyzing football score data with statistical techniques, we investigate how the not purely random, but highly co-operative nature of the game is reflected in averaged properties such as the probability distributions of scored goals for the home and away teams. As it turns out, the tails of the distributions, in particular, are not well described by the Poissonian or binomial model resulting from the assumption of uncorrelated random events. Instead, a good effective description of the data is provided by less basic distributions such as the negative binomial one or the probability densities of extreme value statistics. To understand this behavior from a microscopical point of view, however, no waiting time problem or extremal process need be invoked. Instead, modifying the Bernoulli random process underlying the Poissonian model to include a simple component of self-affirmation seems to describe the data surprisingly well and allows us to understand the observed deviation from Gaussian statistics. The phenomenological distributions used before can be understood as special cases within this framework. We analyzed historical football score data from many leagues in Europe as well as from international tournaments, including data from all past tournaments of the “FIFA World Cup” series, and found the proposed models to be applicable rather universally. In particular, here we analyze the results of the German women’s premier football league and consider the two separate German men’s premier leagues in the East and West during the Cold War era as well as the unified league after 1990, to see how scoring in football and the component of self-affirmation depend on cultural and political circumstances.

  16. On Models for Binomial Data with Random Numbers of Trials

    PubMed Central

    Comulada, W. Scott; Weiss, Robert E.

    2010-01-01

    Summary A binomial outcome is a count s of the number of successes out of the total number of independent trials n = s + f, where f is a count of the failures. The n are random variables not fixed by design in many studies. Joint modeling of (s, f) can provide additional insight into the science and into the probability π of success that cannot be directly incorporated by the logistic regression model. Observations where n = 0 are excluded from the binomial analysis yet may be important to understanding how π is influenced by covariates. Correlation between s and f may exist and be of direct interest. We propose Bayesian multivariate Poisson models for the bivariate response (s, f), correlated through random effects. We extend our models to the analysis of longitudinal and multivariate longitudinal binomial outcomes. Our methodology was motivated by two disparate examples, one from teratology and one from an HIV tertiary intervention study. PMID:17688514

  17. Poisson, Poisson-gamma and zero-inflated regression models of motor vehicle crashes: balancing statistical fit and theory.

    PubMed

    Lord, Dominique; Washington, Simon P; Ivan, John N

    2005-01-01

    There has been considerable research conducted over the last 20 years focused on predicting motor vehicle crashes on transportation facilities. The range of statistical models commonly applied includes binomial, Poisson, Poisson-gamma (or negative binomial), zero-inflated Poisson and negative binomial models (ZIP and ZINB), and multinomial probability models. Given the range of possible modeling approaches and the host of assumptions with each modeling approach, making an intelligent choice for modeling motor vehicle crash data is difficult. There is little discussion in the literature comparing different statistical modeling approaches, identifying which statistical models are most appropriate for modeling crash data, and providing a strong justification from basic crash principles. In the recent literature, it has been suggested that the motor vehicle crash process can successfully be modeled by assuming a dual-state data-generating process, which implies that entities (e.g., intersections, road segments, pedestrian crossings, etc.) exist in one of two states: perfectly safe and unsafe. As a result, the ZIP and ZINB are two models that have been applied to account for the preponderance of "excess" zeros frequently observed in crash count data. The objective of this study is to provide defensible guidance on how to appropriately model crash data. We first examine the motor vehicle crash process using theoretical principles and a basic understanding of the crash process. It is shown that the fundamental crash process follows a Bernoulli trial with unequal probability of independent events, also known as Poisson trials. We examine the evolution of statistical models as they apply to the motor vehicle crash process, and indicate how well they statistically approximate the crash process. We also present the theory behind dual-state process count models, and note why they have become popular for modeling crash data. A simulation experiment is then conducted to demonstrate how crash data give rise to the "excess" zeros frequently observed in crash data. It is shown that the Poisson and other mixed probabilistic structures are approximations assumed for modeling the motor vehicle crash process. Furthermore, it is demonstrated that under certain (fairly common) circumstances excess zeros are observed, and that these circumstances arise from low exposure and/or inappropriate selection of time/space scales and not an underlying dual-state process. In conclusion, carefully selecting the time/space scales for analysis, including an improved set of explanatory variables and/or unobserved heterogeneity effects in count regression models, or applying small-area statistical methods (observations with low exposure) represent the most defensible modeling approaches for datasets with a preponderance of zeros.
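
    The point that excess zeros can arise from low exposure and unobserved heterogeneity, rather than from a dual-state process, can be illustrated with a few lines of simulation; the gamma heterogeneity and sample size below are arbitrary illustrative choices.

    ```python
    import numpy as np

    # Crash counts at low-exposure sites with heterogeneous (gamma) means
    rng = np.random.default_rng(11)
    mu = rng.gamma(shape=0.3, scale=2.0, size=2000)
    counts = rng.poisson(mu)

    observed_zeros = (counts == 0).mean()
    poisson_zeros = np.exp(-counts.mean())   # zero share implied by a single-mean Poisson
    print(f"observed zeros {observed_zeros:.2f} vs Poisson-implied {poisson_zeros:.2f}")
    ```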

  18. Analysis of multiple tank car releases in train accidents.

    PubMed

    Liu, Xiang; Liu, Chang; Hong, Yili

    2017-10-01

    There are annually over two million carloads of hazardous materials transported by rail in the United States. The American railroads use large blocks of tank cars to transport petroleum crude oil and other flammable liquids from production to consumption sites. Being different from roadway transport of hazardous materials, a train accident can potentially result in the derailment and release of multiple tank cars, which may result in significant consequences. The prior literature predominantly assumes that the occurrence of multiple tank car releases in a train accident is a series of independent Bernoulli processes, and thus uses the binomial distribution to estimate the total number of tank car releases given the number of tank cars derailing or damaged. This paper shows that the traditional binomial model can incorrectly estimate multiple tank car release probability by magnitudes in certain circumstances, thereby significantly affecting railroad safety and risk analysis. To bridge this knowledge gap, this paper proposes a novel, alternative Correlated Binomial (CB) model that accounts for the possible correlations of multiple tank car releases in the same train. We test three distinct correlation structures in the CB model, and find that they all outperform the conventional binomial model based on empirical tank car accident data. The analysis shows that considering tank car release correlations would result in a significantly improved fit of the empirical data than otherwise. Consequently, it is prudent to consider alternative modeling techniques when analyzing the probability of multiple tank car releases in railroad accidents. Copyright © 2017 Elsevier Ltd. All rights reserved.
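
    One simple way to see why release correlation matters is to compare an independent binomial with an exchangeable (beta-binomial) alternative that has the same marginal release probability. This is only a sketch of the general idea, not necessarily one of the correlation structures tested in the paper, and the parameter values are illustrative.

    ```python
    import numpy as np
    from scipy.stats import binom, betabinom

    n_derailed = 10      # tank cars derailed in the accident
    p_release = 0.2      # marginal per-car release probability
    rho = 0.3            # assumed intra-train release correlation

    # Beta-binomial with the same mean and intraclass correlation rho
    a = p_release * (1 - rho) / rho
    b = (1 - p_release) * (1 - rho) / rho

    print("P(>=5 releases), independent:", round(binom.sf(4, n_derailed, p_release), 4))
    print("P(>=5 releases), correlated :", round(betabinom.sf(4, n_derailed, a, b), 4))
    ```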

  19. A big data approach to the development of mixed-effects models for seizure count data.

    PubMed

    Tharayil, Joseph J; Chiang, Sharon; Moss, Robert; Stern, John M; Theodore, William H; Goldenholz, Daniel M

    2017-05-01

    Our objective was to develop a generalized linear mixed model for predicting seizure count that is useful in the design and analysis of clinical trials. This model also may benefit the design and interpretation of seizure-recording paradigms. Most existing seizure count models do not include children, and there is currently no consensus regarding the most suitable model that can be applied to children and adults. Therefore, an additional objective was to develop a model that accounts for both adult and pediatric epilepsy. Using data from SeizureTracker.com, a patient-reported seizure diary tool with >1.2 million recorded seizures across 8 years, we evaluated the appropriateness of Poisson, negative binomial, zero-inflated negative binomial, and modified negative binomial models for seizure count data based on minimization of the Bayesian information criterion. Generalized linear mixed-effects models were used to account for demographic and etiologic covariates and for autocorrelation structure. Holdout cross-validation was used to evaluate predictive accuracy in simulating seizure frequencies. For both adults and children, we found that a negative binomial model with autocorrelation over 1 day was optimal. Using holdout cross-validation, the proposed model was found to provide accurate simulation of seizure counts for patients with up to four seizures per day. The optimal model can be used to generate more realistic simulated patient data with very few input parameters. The availability of a parsimonious, realistic virtual patient model can be of great utility in simulations of phase II/III clinical trials, epilepsy monitoring units, outpatient biosensors, and mobile Health (mHealth) applications. Wiley Periodicals, Inc. © 2017 International League Against Epilepsy.

  20. Application of binomial and multinomial probability statistics to the sampling design process of a global grain tracing and recall system

    USDA-ARS?s Scientific Manuscript database

    Small, coded, pill-sized tracers embedded in grain are proposed as a method for grain traceability. A sampling process for a grain traceability system was designed and investigated by applying probability statistics using a science-based sampling approach to collect an adequate number of tracers fo...

  1. Points on the Path to Probability.

    ERIC Educational Resources Information Center

    Kiernan, James F.

    2001-01-01

    Presents the problem of points and the development of the binomial triangle, or Pascal's triangle. Examines various attempts to solve this problem to give students insight into the nature of mathematical discovery. (KHR)

  2. A binomial stochastic kinetic approach to the Michaelis-Menten mechanism

    NASA Astrophysics Data System (ADS)

    Lente, Gábor

    2013-05-01

    This Letter presents a new method that gives an analytical approximation of the exact solution of the stochastic Michaelis-Menten mechanism without computationally demanding matrix operations. The method is based on solving the deterministic rate equations and then using the results as guiding variables of calculating probability values using binomial distributions. This principle can be generalized to a number of different kinetic schemes and is expected to be very useful in the evaluation of measurements focusing on the catalytic activity of one or a few individual enzyme molecules.
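
    As a rough sketch of the principle described, the fragment below integrates the deterministic Michaelis-Menten rate equations and then treats the deterministic conversion fraction as the success probability of a binomial over the initial substrate molecules. The rate constants, molecule counts, and the exact choice of guiding variable are assumptions made for illustration, not the Letter's parametrization.

```python
import numpy as np
from scipy.integrate import solve_ivp
from scipy.stats import binom

# Illustrative rate constants and initial counts (not taken from the Letter)
k1, k_1, k2 = 1.0, 0.5, 0.3   # E + S <-> ES -> E + P
E0, S0 = 5, 50                # molecule counts

def mm_odes(t, y):
    s, es = y
    e = E0 - es
    return [-k1 * e * s + k_1 * es, k1 * e * s - (k_1 + k2) * es]

sol = solve_ivp(mm_odes, (0, 20), [S0, 0.0], dense_output=True)

def product_pmf(t):
    """Approximate distribution of the product count at time t: a binomial over the
    S0 substrate molecules, with conversion fraction taken from the ODE solution."""
    s, es = sol.sol(t)
    frac_converted = 1.0 - (s + es) / S0
    return binom(S0, float(np.clip(frac_converted, 0.0, 1.0)))

pmf = product_pmf(5.0)
print("P(at least 10 product molecules at t = 5) ~", pmf.sf(9))
```

    The attraction of this style of approximation is that a single ODE solve replaces the large master-equation matrix that an exact stochastic treatment would otherwise require.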

  3. The Binomial Model in Fluctuation Analysis of Quantal Neurotransmitter Release

    PubMed Central

    Quastel, D. M. J.

    1997-01-01

    The mathematics of the binomial model for quantal neurotransmitter release is considered in general terms, to explore what information might be extractable from statistical aspects of data. For an array of N statistically independent release sites, each with a release probability p, the compound binomial always pertains, with ⟨m⟩ = N⟨p⟩, p′ ≡ 1 - var(m)/⟨m⟩ = ⟨p⟩(1 + cv_p²) and n′ ≡ ⟨m⟩/p′ = N/(1 + cv_p²), where m is the output/stimulus and cv_p² is var(p)/⟨p⟩². Unless n′ is invariant with ambient conditions or stimulation paradigms, the simple binomial (cv_p = 0) is untenable and n′ is neither N nor the number of "active" sites or sites with a quantum available. At each site p = p_o p_A, where p_o is the output probability if a site is "eligible" or "filled" despite previous quantal discharge, and p_A (eligibility probability) depends at least on the replenishment rate, p_o, and interstimulus time. Assuming stochastic replenishment, a simple algorithm allows calculation of the full statistical composition of outputs for any hypothetical combinations of p_o's and refill rates, for any stimulation paradigm and spontaneous release. A rise in n′ (reduced cv_p) tends to occur whenever p_o varies widely between sites, with a raised stimulation frequency or factors tending to increase p_o's. Unlike ⟨m⟩ and var(m) at equilibrium, output changes early in trains of stimuli, and covariances, potentially provide information about whether changes in ⟨m⟩ reflect change in n′ or in p′. Formulae are derived for variance and third moments of postsynaptic responses, which depend on the quantal mix in the signals. A new, easily computed function, the area product, gives noise-unbiased variance of a series of synaptic signals and its peristimulus time distribution, which is modified by the unit channel composition of quantal responses and if the signals reflect mixed responses from synapses with different quantal time course. PMID:9017200

  4. Bayesian inference for disease prevalence using negative binomial group testing

    PubMed Central

    Pritchard, Nicholas A.; Tebbs, Joshua M.

    2011-01-01

    Group testing, also known as pooled testing, and inverse sampling are both widely used methods of data collection when the goal is to estimate a small proportion. Taking a Bayesian approach, we consider the new problem of estimating disease prevalence from group testing when inverse (negative binomial) sampling is used. Using different distributions to incorporate prior knowledge of disease incidence and different loss functions, we derive closed form expressions for posterior distributions and resulting point and credible interval estimators. We then evaluate our new estimators, on Bayesian and classical grounds, and apply our methods to a West Nile Virus data set. PMID:21259308

  5. Use of negative binomial distribution to describe the presence of Anisakis in Thyrsites atun.

    PubMed

    Peña-Rehbein, Patricio; De los Ríos-Escalante, Patricio

    2012-01-01

    Nematodes of the genus Anisakis have marine fishes as intermediate hosts. One of these hosts is Thyrsites atun, an important fishery resource in Chile between 38 and 41° S. This paper describes the frequency and number of Anisakis nematodes in the internal organs of Thyrsites atun. An analysis based on spatial distribution models showed that the parasites tend to be clustered. The variation in the number of parasites per host could be described by the negative binomial distribution. The maximum observed number of parasites was nine parasites per host. The environmental and zoonotic aspects of the study are also discussed.

  6. Impact of cigarette smoking on utilization of nursing home services.

    PubMed

    Warner, Kenneth E; McCammon, Ryan J; Fries, Brant E; Langa, Kenneth M

    2013-11-01

    Few studies have examined the effects of smoking on nursing home utilization, generally using poor data on smoking status. No previous study has distinguished utilization for recent from long-term quitters. Using the Health and Retirement Study, we assessed nursing home utilization by never-smokers, long-term quitters (quit >3 years), recent quitters (quit ≤3 years), and current smokers. We used logistic regression to evaluate the likelihood of a nursing home admission. For those with an admission, we used negative binomial regression on the number of nursing home nights. Finally, we employed zero-inflated negative binomial regression to estimate nights for the full sample. Controlling for other variables, compared with never-smokers, long-term quitters have an odds ratio (OR) for nursing home admission of 1.18 (95% CI: 1.07-1.2), current smokers 1.39 (1.23-1.57), and recent quitters 1.55 (1.29-1.87). The probability of admission rises rapidly with age and is lower for African Americans and Hispanics, more affluent respondents, respondents with a spouse present in the home, and respondents with a living child. Given admission, smoking status is not associated with length of stay (LOS). LOS is longer for older respondents and women and shorter for more affluent respondents and those with spouses present. Compared with otherwise identical never-smokers, former and current smokers have a significantly increased risk of nursing home admission. That recent quitters are at greatest risk of admission is consistent with evidence that many stop smoking because they are sick, often due to smoking.

  7. Novel formulation of the ℳ model through the Generalized-K distribution for atmospheric optical channels.

    PubMed

    Garrido-Balsells, José María; Jurado-Navas, Antonio; Paris, José Francisco; Castillo-Vazquez, Miguel; Puerta-Notario, Antonio

    2015-03-09

    In this paper, a novel and deeper physical interpretation of the recently published Málaga or ℳ statistical distribution is provided. This distribution, which has gained wide acceptance in the scientific community, models the optical irradiance scintillation induced by atmospheric turbulence. Here, the analytical expressions previously published are modified to express them as a mixture of the known Generalized-K and discrete Binomial and Negative Binomial distributions. In particular, the probability density function (pdf) of the ℳ model is now obtained as a linear combination of Generalized-K pdfs, in which the coefficients depend directly on the parameters of the ℳ distribution. In this way, the Málaga model can be physically interpreted as a superposition of different optical sub-channels, each described by the corresponding Generalized-K fading model and weighted by the ℳ-dependent coefficients. The expressions proposed here are simpler than the equations of the original ℳ model and are validated by numerical simulations, generating ℳ-distributed random sequences and their associated histograms. This novel interpretation of the Málaga statistical distribution provides a valuable tool for analyzing the performance of atmospheric optical channels for every turbulence condition.

  8. Bycatch, bait, anglers, and roads: quantifying vector activity and propagule introduction risk across lake ecosystems.

    PubMed

    Drake, D Andrew R; Mandrak, Nicholas E

    2014-06-01

    Long implicated in the invasion process, live-bait anglers are highly mobile species vectors with frequent overland transport of fishes. To test hypotheses about the role of anglers in propagule transport, we developed a social-ecological model quantifying the opportunity for species transport beyond the invaded range resulting from bycatch during commercial bait operations, incidental transport, and release to lake ecosystems by anglers. We combined a gravity model with a stochastic, agent-based simulation, representing a 1-yr iteration of live-bait angling and the dynamics of propagule transport at fine spatiotemporal scales (i.e., probability of introducing n propagules per lake per year). A baseline scenario involving round goby (Neogobius melanostomus) indicated that most angling trips were benign; irrespective of lake visitation, anglers failed to purchase and transport propagules (benign trips, median probability P = 0.99912). However, given the large number of probability trials (4.2 million live-bait angling events per year), even the rarest sequence of events (uptake, movement, and deposition of propagules) is anticipated to occur. Risky trips (modal P = 0.00088 trips per year; approximately 1 in 1136) were sufficient to introduce a substantial number of propagules (modal values, Poisson model = 3715 propagules among 1288 lakes per year; zero-inflated negative binomial model = 6722 propagules among 1292 lakes per year). Two patterns of lake-specific introduction risk emerged. Large lakes supporting substantial angling activity experienced propagule pressure likely to surpass demographic barriers to establishment (top 2.5% of lakes with modal outcomes of five to 76 propagules per year; 303 high-risk lakes with three or more propagules, per year). Small or remote lakes were less likely to receive propagules; however, most risk distributions were leptokurtic with a long right tail, indicating the rare occurrence of high propagule loads to most waterbodies. Infestation simulations indicated that the number of high-risk waterbodies could be as great as 1318 (zero-inflated negative binomial), whereas a 90% reduction in bycatch from baseline would reduce the modal number of high risk lakes to zero. Results indicate that the combination of invasive bycatch and live-bait anglers warrants management concern as a species vector, but that risk is confined to a subset of individuals and recipient sites that may be effectively managed with targeted strategies.

  9. Neighborhood educational disparities in active commuting among women: the effect of distance between the place of residence and the place of work/study (an ACTI-Cités study).

    PubMed

    Perchoux, Camille; Nazare, Julie-Anne; Benmarhnia, Tarik; Salze, Paul; Feuillet, Thierry; Hercberg, Serge; Hess, Franck; Menai, Mehdi; Weber, Christiane; Charreire, Hélène; Enaux, Christophe; Oppert, Jean-Michel; Simon, Chantal

    2017-06-12

    Active transportation has been associated with favorable health outcomes. Previous research highlighted the influence of neighborhood educational level on active transportation. However, little is known regarding the effect of commuting distance on social disparities in active commuting. In this regard, women have been poorly studied. The objective of this paper was to evaluate the relationship between neighborhood educational level and active commuting, and to assess whether the commuting distance modifies this relationship in adult women. This cross-sectional study is based on a subsample of women from the Nutrinet-Santé web-cohort (N = 1169). Binomial, log-binomial and negative binomial regressions were used to assess the associations between neighborhood education level and (i) the likelihood of reporting any active commuting time, and (ii) the share of commuting time made by active transportation modes. Potential effect measure modification of distance to work on the previous associations was assessed both on the additive and the multiplicative scales. Neighborhood education level was positively associated with the probability of reporting any active commuting time (relative risk = 1.774; p < 0.05) and the share of commuting time spent active (relative risk = 1.423; p < 0.05). The impact of neighborhood education was greater at long distances to work for both outcomes. Our results suggest that neighborhood educational disparities in active commuting tend to increase with commuting distance among women. Further research is needed to provide geographically driven guidance for health promotion intervention aiming at reducing disparities in active transportation among socioeconomic groups.

  10. Selecting a distributional assumption for modelling relative densities of benthic macroinvertebrates

    USGS Publications Warehouse

    Gray, B.R.

    2005-01-01

    The selection of a distributional assumption suitable for modelling macroinvertebrate density data is typically challenging. Macroinvertebrate data often exhibit substantially larger variances than expected under a standard count assumption, that of the Poisson distribution. Such overdispersion may derive from multiple sources, including heterogeneity of habitat (historically and spatially), differing life histories for organisms collected within a single collection in space and time, and autocorrelation. Taken to extreme, heterogeneity of habitat may be argued to explain the frequent large proportions of zero observations in macroinvertebrate data. Sampling locations may consist of habitats defined qualitatively as either suitable or unsuitable. The former category may yield random or stochastic zeroes and the latter structural zeroes. Heterogeneity among counts may be accommodated by treating the count mean itself as a random variable, while extra zeroes may be accommodated using zero-modified count assumptions, including zero-inflated and two-stage (or hurdle) approaches. These and linear assumptions (following log- and square root-transformations) were evaluated using 9 years of mayfly density data from a 52 km, ninth-order reach of the Upper Mississippi River (n = 959). The data exhibited substantial overdispersion relative to that expected under a Poisson assumption (i.e. variance:mean ratio = 23 ≫ 1), and 43% of the sampling locations yielded zero mayflies. Based on the Akaike Information Criterion (AIC), count models were improved most by treating the count mean as a random variable (via a Poisson-gamma distributional assumption) and secondarily by zero modification (i.e. improvements in AIC values = 9184 units and 47-48 units, respectively). Zeroes were underestimated by the Poisson, log-transform and square root-transform models, slightly by the standard negative binomial model but not by the zero-modified models (61%, 24%, 32%, 7%, and 0%, respectively). However, the zero-modified Poisson models underestimated small counts (1 ≤ y ≤ 4) and overestimated intermediate counts (7 ≤ y ≤ 23). Counts greater than zero were estimated well by zero-modified negative binomial models, while counts greater than one were also estimated well by the standard negative binomial model. Based on AIC and percent zero estimation criteria, the two-stage and zero-inflated models performed similarly. The above inferences were largely confirmed when the models were used to predict values from a separate, evaluation data set (n = 110). An exception was that, using the evaluation data set, the standard negative binomial model appeared superior to its zero-modified counterparts using the AIC (but not percent zero criteria). This and other evidence suggest that a negative binomial distributional assumption should be routinely considered when modelling benthic macroinvertebrate data from low flow environments. Whether negative binomial models should themselves be routinely examined for extra zeroes requires, from a statistical perspective, more investigation. However, this question may best be answered by ecological arguments that may be specific to the sampled species and locations. © 2004 Elsevier B.V. All rights reserved.

  11. Discrimination of numerical proportions: A comparison of binomial and Gaussian models.

    PubMed

    Raidvee, Aire; Lember, Jüri; Allik, Jüri

    2017-01-01

    Observers discriminated the numerical proportion of two sets of elements (N = 9, 13, 33, and 65) that differed either by color or orientation. According to the standard Thurstonian approach, the accuracy of proportion discrimination is determined by irreducible noise in the nervous system that stochastically transforms the number of presented visual elements onto a continuum of psychological states representing numerosity. As an alternative to this customary approach, we propose a Thurstonian-binomial model, which assumes discrete perceptual states, each of which is associated with a certain visual element. It is shown that the probability β with which each visual element can be noticed and registered by the perceptual system can explain numerical proportion discrimination data at least as well as the continuous Thurstonian-Gaussian model, and better if the greater parsimony of the Thurstonian-binomial model is taken into account using AIC model selection. We conclude that the Gaussian and binomial models represent two different fundamental principles, internal noise versus using only a fraction of the available information, both of which are plausible descriptions of visual perception.

  12. Interrelationships Between Receiver/Relative Operating Characteristics Display, Binomial, Logit, and Bayes' Rule Probability of Detection Methodologies

    NASA Technical Reports Server (NTRS)

    Generazio, Edward R.

    2014-01-01

    Unknown risks are introduced into failure critical systems when probability of detection (POD) capabilities are accepted without a complete understanding of the statistical method applied and the interpretation of the statistical results. The presence of this risk in the nondestructive evaluation (NDE) community is revealed in common statements about POD. These statements are often interpreted in a variety of ways and therefore, the very existence of the statements identifies the need for a more comprehensive understanding of POD methodologies. Statistical methodologies have data requirements to be met, procedures to be followed, and requirements for validation or demonstration of adequacy of the POD estimates. Risks are further enhanced due to the wide range of statistical methodologies used for determining the POD capability. Receiver/Relative Operating Characteristics (ROC) Display, simple binomial, logistic regression, and Bayes' rule POD methodologies are widely used in determining POD capability. This work focuses on Hit-Miss data to reveal the framework of the interrelationships between Receiver/Relative Operating Characteristics Display, simple binomial, logistic regression, and Bayes' Rule methodologies as they are applied to POD. Knowledge of these interrelationships leads to an intuitive and global understanding of the statistical data, procedural and validation requirements for establishing credible POD estimates.

  13. A comparison of LMC and SDL complexity measures on binomial distributions

    NASA Astrophysics Data System (ADS)

    Piqueira, José Roberto C.

    2016-02-01

    The concept of complexity has been widely discussed in the last forty years, with contributions from many areas of human knowledge, including Philosophy, Linguistics, History, Biology, Physics, Chemistry and many others, and with mathematicians trying to give it a rigorous treatment. In this sense, thermodynamics meets information theory and, by using the entropy definition, López-Ruiz, Mancini and Calbet proposed a definition of complexity that is referred to as the LMC measure. Shiner, Davison and Landsberg, by slightly changing the LMC definition, proposed the SDL measure, and both LMC and SDL measure complexity satisfactorily for many problems. Here, the SDL and LMC measures are applied to the case of a binomial probability distribution, to clarify how the length of the data set and the success probability of the repeated trials determine how complex the whole set is.
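
    A small numerical illustration of the two measures applied to binomial distributions is straightforward. The sketch below uses one common convention (normalized Shannon entropy for H, disequilibrium D as the squared distance from the uniform distribution, and the SDL measure with both exponents equal to one); the paper's exact normalization choices may differ.

```python
import numpy as np
from scipy.stats import binom

def lmc_sdl(pmf):
    """LMC and SDL complexity of a discrete distribution, one common convention:
    H = normalized Shannon entropy, D = sum_i (p_i - 1/N)^2, SDL with alpha = beta = 1."""
    p = pmf[pmf > 0]
    n_states = len(pmf)
    h = -np.sum(p * np.log(p)) / np.log(n_states)   # normalized entropy in [0, 1]
    d = np.sum((pmf - 1.0 / n_states) ** 2)          # distance from equiprobability
    return h * d, h * (1.0 - h)                      # (LMC, SDL)

for n, q in [(10, 0.5), (10, 0.1), (100, 0.5)]:
    pmf = binom.pmf(np.arange(n + 1), n, q)
    lmc, sdl = lmc_sdl(pmf)
    print(f"n = {n:3d}, success prob = {q}: LMC = {lmc:.4f}, SDL = {sdl:.4f}")
```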

  14. Extended Poisson process modelling and analysis of grouped binary data.

    PubMed

    Faddy, Malcolm J; Smith, David M

    2012-05-01

    A simple extension of the Poisson process results in binomially distributed counts of events in a time interval. A further extension generalises this to probability distributions under- or over-dispersed relative to the binomial distribution. Substantial levels of under-dispersion are possible with this modelling, but only modest levels of over-dispersion - up to Poisson-like variation. Although simple analytical expressions for the moments of these probability distributions are not available, approximate expressions for the mean and variance are derived, and used to re-parameterise the models. The modelling is applied in the analysis of two published data sets, one showing under-dispersion and the other over-dispersion. More appropriate assessment of the precision of estimated parameters and reliable model checking diagnostics follow from this more general modelling of these data sets. © 2012 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  15. Distribution pattern of phthirapterans infesting certain common Indian birds.

    PubMed

    Saxena, A K; Kumar, Sandeep; Gupta, Nidhi; Mitra, J D; Ali, S A; Srivastava, Roshni

    2007-08-01

    The prevalence and frequency distribution patterns of 10 phthirapteran species infesting house sparrows, Indian parakeets, common mynas, and white breasted kingfishers were recorded in the district of Rampur, India, during 2004-05. The sample mean abundances, mean intensities, range of infestations, variance to mean ratios, values of the exponent of the negative binomial distribution, and the indices of discrepancy were also computed. Frequency distribution patterns of all phthirapteran species were skewed, but the observed frequencies did not correspond to the negative binomial distribution. Adult-nymph ratios varied across species from 1:0.53 to 1:1.25. Sex ratios of different phthirapteran species ranged from 1:1.10 to 1:1.65 and were female-biased.

  16. Use of the binomial distribution to predict impairment: application in a nonclinical sample.

    PubMed

    Axelrod, Bradley N; Wall, Jacqueline R; Estes, Bradley W

    2008-01-01

    A mathematical model based on the binomial theory was developed to illustrate when abnormal score variations occur by chance in a multitest battery (Ingraham & Aiken, 1996). It has been successfully used as a comparison for obtained test scores in clinical samples, but not in nonclinical samples. In the current study, this model has been applied to demographically corrected scores on the Halstead-Reitan Neuropsychological Test Battery, obtained from a sample of 94 nonclinical college students. Results found that 15% of the sample had impairments suggested by the Halstead Impairment Index, using criteria established by Reitan and Wolfson (1993). In addition, one-half of the sample obtained impaired scores on one or two tests. These results were compared to that predicted by the binomial model and found to be consistent. The model therefore serves as a useful resource for clinicians considering the probability of impaired test performance.
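
    The underlying calculation is a simple binomial tail probability: with n independent tests, each with base-rate probability p of falling in the impaired range, the chance of observing k or more impaired scores by chance alone is P(X ≥ k) for X ~ Binomial(n, p). The sketch below uses hypothetical values of n and p rather than the battery-specific figures from the study.

```python
from scipy.stats import binom

n_tests = 10   # tests in the battery (illustrative)
p_abn = 0.10   # base rate of an "impaired" score on any single test (illustrative)

dist = binom(n_tests, p_abn)
for k in range(5):
    # sf(k - 1) gives P(X >= k), the chance of k or more impaired scores by chance
    print(f"P(at least {k} impaired scores by chance) = {dist.sf(k - 1):.3f}")
```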

  17. Sample size determination for a three-arm equivalence trial of Poisson and negative binomial responses.

    PubMed

    Chang, Yu-Wei; Tsong, Yi; Zhao, Zhigen

    2017-01-01

    Assessing equivalence or similarity has drawn much attention recently as many drug products have lost or will lose their patents in the next few years, especially certain best-selling biologics. To claim equivalence between the test treatment and the reference treatment when assay sensitivity is well established from historical data, one has to demonstrate both superiority of the test treatment over placebo and equivalence between the test treatment and the reference treatment. Thus, there is urgency for practitioners to derive a practical way to calculate sample size for a three-arm equivalence trial. The primary endpoints of a clinical trial may not always be continuous, but may be discrete. In this paper, the authors derive the power function and discuss the sample size requirement for a three-arm equivalence trial with Poisson and negative binomial clinical endpoints. In addition, the authors examine the effect of the dispersion parameter on the power and the sample size by varying its coefficient from small to large. In extensive numerical studies, the authors demonstrate that the required sample size depends heavily on the dispersion parameter. Therefore, misusing a Poisson model for negative binomial data can easily cost up to 20% of the power, depending on the value of the dispersion parameter.

  18. The analysis of incontinence episodes and other count data in patients with overactive bladder by Poisson and negative binomial regression.

    PubMed

    Martina, R; Kay, R; van Maanen, R; Ridder, A

    2015-01-01

    Clinical studies in overactive bladder have traditionally used analysis of covariance or nonparametric methods to analyse the number of incontinence episodes and other count data. It is known that if the underlying distributional assumptions of a particular parametric method do not hold, an alternative parametric method may be more efficient than a nonparametric one, which makes no assumptions regarding the underlying distribution of the data. Therefore, there are advantages in using methods based on the Poisson distribution or extensions of that method, which incorporate specific features that provide a modelling framework for count data. One challenge with count data is overdispersion, but methods are available that can account for this through the introduction of random effect terms in the modelling, and it is this modelling framework that leads to the negative binomial distribution. These models can also provide clinicians with a clearer and more appropriate interpretation of treatment effects in terms of rate ratios. In this paper, the previously used parametric and non-parametric approaches are contrasted with those based on Poisson regression and various extensions in trials evaluating solifenacin and mirabegron in patients with overactive bladder. In these applications, negative binomial models are seen to fit the data well. Copyright © 2014 John Wiley & Sons, Ltd.

  19. A methodology to design heuristics for model selection based on the characteristics of data: Application to investigate when the Negative Binomial Lindley (NB-L) is preferred over the Negative Binomial (NB).

    PubMed

    Shirazi, Mohammadali; Dhavala, Soma Sekhar; Lord, Dominique; Geedipally, Srinivas Reddy

    2017-10-01

    Safety analysts usually use post-modeling methods, such as the Goodness-of-Fit statistics or the Likelihood Ratio Test, to decide between two or more competitive distributions or models. Such metrics require all competitive distributions to be fitted to the data before any comparisons can be accomplished. Given the continual introduction of new statistical distributions, choosing the best one using such post-modeling methods is not a trivial task, on top of all the theoretical or numerical issues the analyst may face during the analysis. Furthermore, and most importantly, these measures or tests do not provide any intuition into why a specific distribution (or model) is preferred over another (Goodness-of-Logic). This paper addresses these issues by proposing a methodology to design heuristics for model selection based on the characteristics of data, in terms of descriptive summary statistics, before fitting the models. The proposed methodology employs two analytic tools: (1) Monte-Carlo Simulations and (2) Machine Learning Classifiers, to design easy heuristics to predict the label of the 'most-likely-true' distribution for analyzing data. The proposed methodology was applied to investigate when the recently introduced Negative Binomial Lindley (NB-L) distribution is preferred over the Negative Binomial (NB) distribution. Heuristics were designed to select the 'most-likely-true' distribution between these two distributions, given a set of prescribed summary statistics of data. The proposed heuristics were successfully compared against classical tests for several real or observed datasets. Not only are they easy to use and free of any post-modeling inputs, but they also give the analyst useful information about why the NB-L is preferred over the NB, or vice versa, when modeling data. Copyright © 2017 Elsevier Ltd. All rights reserved.
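
    The described workflow (Monte Carlo simulation of labelled samples, followed by a machine-learning classifier that turns summary statistics into human-readable selection rules) can be sketched generically. Because the NB-L distribution is not specified in this record, the example below substitutes a Poisson-versus-negative-binomial choice purely to illustrate the mechanics; the summary statistics, parameter ranges, and tree depth are all assumptions.

```python
import numpy as np
from scipy import stats
from sklearn.tree import DecisionTreeClassifier, export_text

rng = np.random.default_rng(0)

def summaries(sample):
    # descriptive summary statistics computed before any model is fitted
    return [sample.mean(), sample.var(), stats.skew(sample), np.mean(sample == 0)]

# Monte Carlo step: labelled samples from two competing count distributions
X, y = [], []
for _ in range(2000):
    mu = rng.uniform(0.5, 10.0)
    X.append(summaries(rng.poisson(mu, size=200)))
    y.append("Poisson")
    r = rng.uniform(0.5, 5.0)
    X.append(summaries(rng.negative_binomial(r, r / (r + mu), size=200)))
    y.append("NB")

# Machine-learning step: a shallow tree yields easy-to-read selection heuristics
tree = DecisionTreeClassifier(max_depth=2).fit(X, y)
print(export_text(tree, feature_names=["mean", "variance", "skewness", "prop_zeros"]))
```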

  20. Computational Aspects of N-Mixture Models

    PubMed Central

    Dennis, Emily B; Morgan, Byron JT; Ridout, Martin S

    2015-01-01

    The N-mixture model is widely used to estimate the abundance of a population in the presence of unknown detection probability from only a set of counts subject to spatial and temporal replication (Royle, 2004, Biometrics 60, 105–115). We explain and exploit the equivalence of N-mixture and multivariate Poisson and negative-binomial models, which provides powerful new approaches for fitting these models. We show that particularly when detection probability and the number of sampling occasions are small, infinite estimates of abundance can arise. We propose a sample covariance as a diagnostic for this event, and demonstrate its good performance in the Poisson case. Infinite estimates may be missed in practice, due to numerical optimization procedures terminating at arbitrarily large values. It is shown that the use of a bound, K, for an infinite summation in the N-mixture likelihood can result in underestimation of abundance, so that default values of K in computer packages should be avoided. Instead we propose a simple automatic way to choose K. The methods are illustrated by analysis of data on Hermann's tortoise Testudo hermanni. PMID:25314629
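
    The sensitivity to the summation bound K mentioned above is easy to see in a single-site version of the N-mixture likelihood, in which the latent abundance N is summed from the largest observed count up to K. The counts and parameter values below are illustrative only.

```python
import numpy as np
from scipy.stats import poisson, binom

def site_likelihood(counts, lam, p, K):
    """N-mixture likelihood for one site: sum over the latent abundance N,
    truncated at the bound K."""
    counts = np.asarray(counts)
    N = np.arange(counts.max(), K + 1)
    prior = poisson.pmf(N, lam)                                 # abundance model
    detect = np.prod(binom.pmf(counts[:, None], N[None, :], p), axis=0)  # detection model
    return np.sum(prior * detect)

counts = [3, 5, 2, 4]   # repeated counts at one site (illustrative)
for K in (10, 50, 200):
    like = site_likelihood(counts, lam=20, p=0.2, K=K)
    print(f"K = {K:3d}: likelihood at lam = 20, p = 0.2 -> {like:.6g}")
```

    With a detection probability of 0.2 the plausible latent abundances extend well past K = 10, so truncating there visibly distorts the likelihood, which is the behaviour the authors warn about.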

  1. Estimation of Multinomial Probabilities.

    DTIC Science & Technology

    1978-11-01

    1971) and Alam (1978) have shown that the maximum likelihood estimator is admissible with respect to the quadratic loss. Steinhaus (1957) and Trybula...appear). Johnson, B. Mck. (1971). On admissible estimators for certain fixed sample binomial populations. Ann. Math. Statist. 92, 1579-1587. Steinhaus , H

  2. Spatiotemporal and random parameter panel data models of traffic crash fatalities in Vietnam.

    PubMed

    Truong, Long T; Kieu, Le-Minh; Vu, Tuan A

    2016-09-01

    This paper investigates factors associated with traffic crash fatalities in 63 provinces of Vietnam during the period from 2012 to 2014. Random effect negative binomial (RENB) and random parameter negative binomial (RPNB) panel data models are adopted to consider spatial heterogeneity across provinces. In addition, a spatiotemporal model with conditional autoregressive priors (ST-CAR) is utilised to account for spatiotemporal autocorrelation in the data. The statistical comparison indicates the ST-CAR model outperforms the RENB and RPNB models. Estimation results provide several significant findings. For example, traffic crash fatalities tend to be higher in provinces with greater numbers of level crossings. Passenger distance travelled and road lengths are also positively associated with fatalities. However, hospital densities are negatively associated with fatalities. The safety impact of the national highway 1A, the main transport corridor of the country, is also highlighted. Copyright © 2016 Elsevier Ltd. All rights reserved.

  3. Improving removal-based estimates of abundance by sampling a population of spatially distinct subpopulations

    USGS Publications Warehouse

    Dorazio, R.M.; Jelks, H.L.; Jordan, F.

    2005-01-01

     A statistical modeling framework is described for estimating the abundances of spatially distinct subpopulations of animals surveyed using removal sampling. To illustrate this framework, hierarchical models are developed using the Poisson and negative-binomial distributions to model variation in abundance among subpopulations and using the beta distribution to model variation in capture probabilities. These models are fitted to the removal counts observed in a survey of a federally endangered fish species. The resulting estimates of abundance have similar or better precision than those computed using the conventional approach of analyzing the removal counts of each subpopulation separately. Extension of the hierarchical models to include spatial covariates of abundance is straightforward and may be used to identify important features of an animal's habitat or to predict the abundance of animals at unsampled locations.

  4. NEWTONP - CUMULATIVE BINOMIAL PROGRAMS

    NASA Technical Reports Server (NTRS)

    Bowerman, P. N.

    1994-01-01

    The cumulative binomial program, NEWTONP, is one of a set of three programs which calculate cumulative binomial probability distributions for arbitrary inputs. The three programs, NEWTONP, CUMBIN (NPO-17555), and CROSSER (NPO-17557), can be used independently of one another. NEWTONP can be used by statisticians and users of statistical procedures, test planners, designers, and numerical analysts. The program has been used for reliability/availability calculations. NEWTONP calculates the probability p required to yield a given system reliability V for a k-out-of-n system. It can also be used to determine the Clopper-Pearson confidence limits (either one-sided or two-sided) for the parameter p of a Bernoulli distribution. NEWTONP can determine Bayesian probability limits for a proportion (if the beta prior has positive integer parameters). It can determine the percentiles of incomplete beta distributions with positive integer parameters. It can also determine the percentiles of F distributions and the median plotting positions in probability plotting. NEWTONP is designed to work well with all integer values 0 < k <= n. To run the program, the user simply runs the executable version and inputs the information requested by the program. NEWTONP is not designed to weed out incorrect inputs, so the user must take care to make sure the inputs are correct. Once all input has been entered, the program calculates and lists the result. It also lists the number of iterations of Newton's method required to calculate the answer within the given error. The NEWTONP program is written in C. It was developed on an IBM AT with a numeric co-processor using Microsoft C 5.0. Because the source code is written using standard C structures and functions, it should compile correctly with most C compilers. The program format is interactive. It has been implemented under DOS 3.2 and has a memory requirement of 26K. NEWTONP was developed in 1988.
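
    NEWTONP itself is a C program and is not reproduced here; the sketch below only illustrates the core computation it describes, using Newton's method to find the p for which a k-out-of-n system attains a target reliability V, with the closed-form derivative of the binomial tail. The starting value, tolerance, and clipping are assumptions of this sketch, not the validated NEWTONP logic.

```python
from scipy.stats import binom
from scipy.special import comb

def solve_p(k, n, V, p0=None, tol=1e-12, max_iter=200):
    """Newton's method for the p with P(X >= k) = V, where X ~ Binomial(n, p)."""
    p = p0 if p0 is not None else V          # a reasonable starting guess
    for _ in range(max_iter):
        f = binom.sf(k - 1, n, p) - V        # P(X >= k) - V
        # closed-form derivative of the binomial tail with respect to p
        df = k * comb(n, k) * p**(k - 1) * (1 - p)**(n - k)
        p_new = min(max(p - f / df, 1e-12), 1 - 1e-12)   # keep p inside (0, 1)
        if abs(p_new - p) < tol:
            return p_new
        p = p_new
    return p

p = solve_p(k=4, n=5, V=0.90)
print(f"p = {p:.6f}, check: P(X >= 4) = {binom.sf(3, 5, p):.6f}")
```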

  5. A comparison of different ways of including baseline counts in negative binomial models for data from falls prevention trials.

    PubMed

    Zheng, Han; Kimber, Alan; Goodwin, Victoria A; Pickering, Ruth M

    2018-01-01

    A common design for a falls prevention trial is to assess falling at baseline, randomize participants into an intervention or control group, and ask them to record the number of falls they experience during a follow-up period of time. This paper addresses how best to include the baseline count in the analysis of the follow-up count of falls in negative binomial (NB) regression. We examine the performance of various approaches in simulated datasets where both counts are generated from a mixed Poisson distribution with shared random subject effect. Including the baseline count after log-transformation as a regressor in NB regression (NB-logged) or as an offset (NB-offset) resulted in greater power than including the untransformed baseline count (NB-unlogged). Cook and Wei's conditional negative binomial (CNB) model replicates the underlying process generating the data. In our motivating dataset, a statistically significant intervention effect resulted from the NB-logged, NB-offset, and CNB models, but not from NB-unlogged, and large, outlying baseline counts were overly influential in NB-unlogged but not in NB-logged. We conclude that there is little to lose by including the log-transformed baseline count in standard NB regression compared to CNB for moderate to larger sized datasets. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  6. Joint Analysis of Binomial and Continuous Traits with a Recursive Model: A Case Study Using Mortality and Litter Size of Pigs

    PubMed Central

    Varona, Luis; Sorensen, Daniel

    2014-01-01

    This work presents a model for the joint analysis of a binomial and a Gaussian trait using a recursive parametrization that leads to a computationally efficient implementation. The model is illustrated in an analysis of mortality and litter size in two breeds of Danish pigs, Landrace and Yorkshire. Available evidence suggests that mortality of piglets increased partly as a result of successful selection for total number of piglets born. In recent years there has been a need to decrease the incidence of mortality in pig-breeding programs. We report estimates of genetic variation at the level of the logit of the probability of mortality and quantify how it is affected by the size of the litter. Several models for mortality are considered and the best fits are obtained by postulating linear and cubic relationships between the logit of the probability of mortality and litter size, for Landrace and Yorkshire, respectively. An interpretation of how the presence of genetic variation affects the probability of mortality in the population is provided and we discuss and quantify the prospects of selecting for reduced mortality, without affecting litter size. PMID:24414548

  7. Introducing Perception and Modelling of Spatial Randomness in Classroom

    ERIC Educational Resources Information Center

    De Nóbrega, José Renato

    2017-01-01

    A strategy to facilitate understanding of spatial randomness is described, using student activities developed in sequence: looking at spatial patterns, simulating approximate spatial randomness using a grid of equally-likely squares, using binomial probabilities for approximations and predictions and then comparing with given Poisson…

  8. Type I error probability spending for post-market drug and vaccine safety surveillance with binomial data.

    PubMed

    Silva, Ivair R

    2018-01-15

    Type I error probability spending functions are commonly used for designing sequential analysis of binomial data in clinical trials, and their use is also quickly emerging for near-continuous sequential analysis of post-market drug and vaccine safety surveillance. It is well known that, for clinical trials, it is still important to minimize the sample size when the null hypothesis is not rejected; in post-market drug and vaccine safety surveillance, that is not important. In post-market safety surveillance, especially when the surveillance involves identification of potential signals, the meaningful statistical performance measure to be minimized is the expected sample size when the null hypothesis is rejected. The present paper shows that, instead of the convex Type I error spending shape conventionally used in clinical trials, a concave shape is more indicated for post-market drug and vaccine safety surveillance. This is shown for both continuous and group sequential analysis. Copyright © 2017 John Wiley & Sons, Ltd.

  9. Distribution of apparent activation energy counterparts during thermo - And thermo-oxidative degradation of Aronia melanocarpa (black chokeberry).

    PubMed

    Janković, Bojan; Marinović-Cincović, Milena; Janković, Marija

    2017-09-01

    Kinetics of degradation for Aronia melanocarpa fresh fruits in argon and air atmospheres were investigated. The investigation was based on probability distributions of the apparent activation energy counterparts (ε_a). Isoconversional analysis results indicated that the degradation process in an inert atmosphere was governed by decomposition reactions of esterified compounds. Also, based on the same kinetics approach, it was assumed that in an air atmosphere, the primary compound in degradation pathways could be anthocyanins, which undergo rapid chemical reactions. A new model of reactivity demonstrated that, under inert atmospheres, expectation values for ε_a occurred at levels of statistical probability. These values corresponded to decomposition processes in which polyphenolic compounds might be involved. The ε_a values followed a binomial distribution. It was established that, for thermo-oxidative degradation, the Poisson distribution represented a very good approximation for the ε_a values, where there was additional mechanistic complexity and the binomial distribution was no longer valid. Copyright © 2017 Elsevier Ltd. All rights reserved.

  10. Binomial Test Method for Determining Probability of Detection Capability for Fracture Critical Applications

    NASA Technical Reports Server (NTRS)

    Generazio, Edward R.

    2011-01-01

    The capability of an inspection system is established by applications of various methodologies to determine the probability of detection (POD). One accepted metric of an adequate inspection system is that for a minimum flaw size and all greater flaw sizes, there is 0.90 probability of detection with 95% confidence (90/95 POD). Directed design of experiments for probability of detection (DOEPOD) has been developed to provide an efficient and accurate methodology that yields estimates of POD and confidence bounds for both Hit-Miss and signal amplitude testing, where signal amplitudes are reduced to Hit-Miss by using a signal threshold. Directed DOEPOD uses a nonparametric approach for the analysis of inspection data that does not require any assumptions about the particular functional form of the POD function. The DOEPOD procedure identifies, for a given sample set, whether or not the minimum requirement of 0.90 probability of detection with 95% confidence is demonstrated for a minimum flaw size and for all greater flaw sizes (90/95 POD). The DOEPOD procedures are sequentially executed in order to minimize the number of samples needed to demonstrate that there is a 90/95 POD lower confidence bound at a given flaw size and that the POD is monotonic for flaw sizes exceeding that 90/95 POD flaw size. The conservativeness of the DOEPOD methodology results is discussed. Validated guidelines for binomial estimation of POD for fracture critical inspection are established.
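
    A standard calculation consistent with binomial estimation of POD is the one-sided Clopper-Pearson lower bound on the detection probability; it reproduces the classic result that 29 hits in 29 trials (and, with one miss, 45 of 46) just meets the 90/95 requirement. This is a generic textbook computation, not the DOEPOD procedure itself.

```python
from scipy.stats import beta

def pod_lower_bound(hits, trials, confidence=0.95):
    """One-sided Clopper-Pearson lower confidence bound on the detection probability."""
    if hits == 0:
        return 0.0
    return beta.ppf(1 - confidence, hits, trials - hits + 1)

for hits, trials in [(29, 29), (28, 29), (45, 46)]:
    lb = pod_lower_bound(hits, trials)
    verdict = "meets 90/95" if lb >= 0.90 else "does not meet 90/95"
    print(f"{hits}/{trials}: 95% lower bound on POD = {lb:.4f} ({verdict})")
```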

  11. Studying the Binomial Distribution Using LabVIEW

    ERIC Educational Resources Information Center

    George, Danielle J.; Hammer, Nathan I.

    2015-01-01

    This undergraduate physical chemistry laboratory exercise introduces students to the study of probability distributions both experimentally and using computer simulations. Students perform the classic coin toss experiment individually and then pool all of their data together to study the effect of experimental sample size on the binomial…

  12. Pricing American Asian options with higher moments in the underlying distribution

    NASA Astrophysics Data System (ADS)

    Lo, Keng-Hsin; Wang, Kehluh; Hsu, Ming-Feng

    2009-01-01

    We develop a modified Edgeworth binomial model with higher moment consideration for pricing American Asian options. With a lognormal underlying distribution for benchmark comparison, our algorithm is as precise as that of Chalasani et al. [P. Chalasani, S. Jha, F. Egriboyun, A. Varikooty, A refined binomial lattice for pricing American Asian options, Rev. Derivatives Res. 3 (1) (1999) 85-105] as the number of time steps increases. If the underlying distribution displays negative skewness and leptokurtosis, as often observed for stock index returns, our estimates can work better than those in Chalasani et al. [P. Chalasani, S. Jha, F. Egriboyun, A. Varikooty, A refined binomial lattice for pricing American Asian options, Rev. Derivatives Res. 3 (1) (1999) 85-105] and are very similar to the benchmarks in Hull and White [J. Hull, A. White, Efficient procedures for valuing European and American path-dependent options, J. Derivatives 1 (Fall) (1993) 21-31]. The numerical analysis shows that our modified Edgeworth binomial model can value American Asian options with greater accuracy and speed given higher moments in their underlying distribution.

  13. Generalization of multifractal theory within quantum calculus

    NASA Astrophysics Data System (ADS)

    Olemskoi, A.; Shuda, I.; Borisyuk, V.

    2010-03-01

    On the basis of the deformed series in quantum calculus, we generalize the partition function and the mass exponent of a multifractal, as well as the average of a random variable distributed over a self-similar set. For the partition function, such an expansion is shown to be determined by binomial-type combinations of the Tsallis entropies related to manifold deformations, while the mass exponent expansion generalizes the known relation τ_q = D_q(q − 1). We find the equation for the set of averages related to ordinary, escort, and generalized probabilities in terms of the deformed expansion as well. Multifractals related to the Cantor binomial set, exchange currency series, and porous-surface condensates are considered as examples.

  14. The Rainbow Spectrum of RNA Secondary Structures.

    PubMed

    Li, Thomas J X; Reidys, Christian M

    2018-06-01

    In this paper, we analyze the length spectrum of rainbows in RNA secondary structures. A rainbow in a secondary structure is a maximal arc with respect to the partial order induced by nesting. We show that there is a significant gap in this length spectrum. We shall prove that there asymptotically almost surely exists a unique longest rainbow of length at least [Formula: see text] and that with high probability any other rainbow has finite length. We show that the distribution of the length of the longest rainbow converges to a discrete limit law and that, for finite k, the distribution of rainbows of length k becomes for large n a negative binomial distribution. We then put the results of this paper into context, comparing the analytical results with those observed in RNA minimum free energy structures, biological RNA structures and relate our findings to the sparsification of folding algorithms.

  15. Understanding poisson regression.

    PubMed

    Hayat, Matthew J; Higgins, Melinda

    2014-04-01

    Nurse investigators often collect study data in the form of counts. Traditional methods of data analysis have historically approached analysis of count data either as if the count data were continuous and normally distributed or with dichotomization of the counts into the categories of occurred or did not occur. These outdated methods for analyzing count data have been replaced with more appropriate statistical methods that make use of the Poisson probability distribution, which is useful for analyzing count data. The purpose of this article is to provide an overview of the Poisson distribution and its use in Poisson regression. Assumption violations for the standard Poisson regression model are addressed with alternative approaches, including addition of an overdispersion parameter or negative binomial regression. An illustrative example is presented with an application from the ENSPIRE study, and regression modeling of comorbidity data is included for illustrative purposes. Copyright 2014, SLACK Incorporated.
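
    A minimal sketch of the workflow discussed, fitting a Poisson regression, checking for overdispersion, and refitting with a negative binomial family, is shown below on simulated data; the covariate, coefficients, and dispersion are invented for illustration and the ENSPIRE data are not used.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(42)
n = 500
x = rng.normal(size=n)
X = sm.add_constant(x)

# Simulate overdispersed counts: a Poisson rate multiplied by a gamma frailty
mu = np.exp(0.2 + 0.5 * x)
y = rng.poisson(mu * rng.gamma(shape=2.0, scale=0.5, size=n))

poisson_fit = sm.GLM(y, X, family=sm.families.Poisson()).fit()
nb_fit = sm.GLM(y, X, family=sm.families.NegativeBinomial(alpha=0.5)).fit()

# Deviance per degree of freedom well above 1 signals overdispersion under Poisson
print("Poisson deviance / df:", poisson_fit.deviance / poisson_fit.df_resid)
print("Rate ratio per 1-unit increase in x (NB fit):", np.exp(nb_fit.params[1]))
```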

  16. Development of enhanced pavement deterioration curves.

    DOT National Transportation Integrated Search

    2016-10-01

    This report describes the research performed by the Center for Sustainable Transportation Infrastructure (CSTI) at the Virginia Tech Transportation Institute (VTTI) to develop a pavement condition prediction model, using (negative binomial) regressio...

  17. Indicators of Terrorism Vulnerability in Africa

    DTIC Science & Technology

    2015-03-26

    the terror threat and vulnerabilities across Africa. Key words: Terrorism, Africa, Negative Binomial Regression, Classification Tree

  18. Critical Values for Lawshe's Content Validity Ratio: Revisiting the Original Methods of Calculation

    ERIC Educational Resources Information Center

    Ayre, Colin; Scally, Andrew John

    2014-01-01

    The content validity ratio originally proposed by Lawshe is widely used to quantify content validity and yet methods used to calculate the original critical values were never reported. Methods for original calculation of critical values are suggested along with tables of exact binomial probabilities.
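
    The exact-binomial logic behind such critical values can be sketched as follows: for a panel of N experts, find the smallest number of "essential" ratings whose one-tailed binomial probability under p = 0.5 is at or below a chosen alpha, then convert that count to a critical CVR. The alpha level and the at-or-below convention are assumptions of this sketch rather than a restatement of the published table.

```python
from scipy.stats import binom

def cvr_critical(n_panel, alpha=0.05):
    """Smallest number of 'essential' ratings whose one-tailed exact binomial
    probability (p = 0.5) is at or below alpha, with the corresponding critical CVR."""
    for n_essential in range(n_panel // 2, n_panel + 1):
        if binom.sf(n_essential - 1, n_panel, 0.5) <= alpha:
            return n_essential, 2.0 * n_essential / n_panel - 1.0
    return None

for panel in (5, 8, 10, 15, 20, 40):
    n_ess, cvr = cvr_critical(panel)
    print(f"N = {panel:2d}: minimum 'essential' ratings = {n_ess}, critical CVR = {cvr:.3f}")
```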

  19. An Exercise to Introduce Power

    ERIC Educational Resources Information Center

    Seier, Edith; Liu, Yali

    2013-01-01

    In introductory statistics courses, the concept of power is usually presented in the context of testing hypotheses about the population mean. We instead propose an exercise that uses a binomial probability table to introduce the idea of power in the context of testing a population proportion. (Contains 2 tables, and 2 figures.)
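
    The exercise can be reproduced with a few lines of code in place of a printed binomial table: fix the exact rejection region under the null proportion, then evaluate the same tail probability under the alternative. The sample size and the two proportions below are illustrative choices.

```python
from scipy.stats import binom

n, p0, p1, alpha = 20, 0.5, 0.75, 0.05

# One-sided test H0: p = p0 vs H1: p > p0; reject for X >= c,
# where c is the smallest cutoff with P(X >= c | p0) <= alpha
c = next(k for k in range(n + 1) if binom.sf(k - 1, n, p0) <= alpha)

size = binom.sf(c - 1, n, p0)    # actual Type I error of the exact test
power = binom.sf(c - 1, n, p1)   # probability of rejecting when p = p1

print(f"reject H0 when X >= {c}; size = {size:.4f}; power at p = {p1} is {power:.4f}")
```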

  20. Matching the Statistical Model to the Research Question for Dental Caries Indices with Many Zero Counts.

    PubMed

    Preisser, John S; Long, D Leann; Stamm, John W

    2017-01-01

    Marginalized zero-inflated count regression models have recently been introduced for the statistical analysis of dental caries indices and other zero-inflated count data as alternatives to traditional zero-inflated and hurdle models. Unlike the standard approaches, the marginalized models directly estimate overall exposure or treatment effects by relating covariates to the marginal mean count. This article discusses model interpretation and model class choice according to the research question being addressed in caries research. Two data sets, one consisting of fictional dmft counts in 2 groups and the other on DMFS among schoolchildren from a randomized clinical trial comparing 3 toothpaste formulations to prevent incident dental caries, are analyzed with negative binomial hurdle, zero-inflated negative binomial, and marginalized zero-inflated negative binomial models. In the first example, estimates of treatment effects vary according to the type of incidence rate ratio (IRR) estimated by the model. Estimates of IRRs in the analysis of the randomized clinical trial were similar despite their distinctive interpretations. The choice of statistical model class should match the study's purpose, while accounting for the broad decline in children's caries experience, such that dmft and DMFS indices more frequently generate zero counts. Marginalized (marginal mean) models for zero-inflated count data should be considered for direct assessment of exposure effects on the marginal mean dental caries count in the presence of high frequencies of zero counts. © 2017 S. Karger AG, Basel.

  1. Matching the Statistical Model to the Research Question for Dental Caries Indices with Many Zero Counts

    PubMed Central

    Preisser, John S.; Long, D. Leann; Stamm, John W.

    2017-01-01

    Marginalized zero-inflated count regression models have recently been introduced for the statistical analysis of dental caries indices and other zero-inflated count data as alternatives to traditional zero-inflated and hurdle models. Unlike the standard approaches, the marginalized models directly estimate overall exposure or treatment effects by relating covariates to the marginal mean count. This article discusses model interpretation and model class choice according to the research question being addressed in caries research. Two datasets, one consisting of fictional dmft counts in two groups and the other on DMFS among schoolchildren from a randomized clinical trial (RCT) comparing three toothpaste formulations to prevent incident dental caries, are analysed with negative binomial hurdle (NBH), zero-inflated negative binomial (ZINB), and marginalized zero-inflated negative binomial (MZINB) models. In the first example, estimates of treatment effects vary according to the type of incidence rate ratio (IRR) estimated by the model. Estimates of IRRs in the analysis of the RCT were similar despite their distinctive interpretations. Choice of statistical model class should match the study’s purpose, while accounting for the broad decline in children’s caries experience, such that dmft and DMFS indices more frequently generate zero counts. Marginalized (marginal mean) models for zero-inflated count data should be considered for direct assessment of exposure effects on the marginal mean dental caries count in the presence of high frequencies of zero counts. PMID:28291962

  2. Modelling road accident blackspots data with the discrete generalized Pareto distribution.

    PubMed

    Prieto, Faustino; Gómez-Déniz, Emilio; Sarabia, José María

    2014-10-01

    This study shows how road traffic network events, in particular road accidents on blackspots, can be modelled with simple probabilistic distributions. We considered the number of crashes and the number of fatalities on Spanish blackspots in the period 2003-2007, from the Spanish General Directorate of Traffic (DGT). We modelled those datasets, respectively, with the discrete generalized Pareto distribution (a discrete parametric model with three parameters) and with the discrete Lomax distribution (a discrete parametric model with two parameters, and a particular case of the previous model). To do so, we analyzed the basic properties of both parametric models: cumulative distribution, survival, probability mass, quantile and hazard functions, genesis and rth-order moments; applied two estimation methods for their parameters: the μ and (μ+1) frequency method and the maximum likelihood method; used two goodness-of-fit tests: the Chi-square test and the discrete Kolmogorov-Smirnov test based on bootstrap resampling; and compared them with the classical negative binomial distribution in terms of absolute probabilities and in models including covariates. We found that those probabilistic models can be useful to describe the road accident blackspot datasets analyzed. Copyright © 2014 Elsevier Ltd. All rights reserved.

  3. Confidence Intervals for True Scores Using the Skew-Normal Distribution

    ERIC Educational Resources Information Center

    Garcia-Perez, Miguel A.

    2010-01-01

    A recent comparative analysis of alternative interval estimation approaches and procedures has shown that confidence intervals (CIs) for true raw scores determined with the Score method--which uses the normal approximation to the binomial distribution--have actual coverage probabilities that are closest to their nominal level. It has also recently…
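
    The Score method mentioned here inverts the normal approximation to the binomial, which has a closed form (the Wilson interval). A small self-contained sketch with illustrative counts, not data from the cited comparison:

        import math

        def wilson_score_interval(k, n, z=1.96):
            """Score (Wilson) confidence interval for a binomial proportion,
            obtained by inverting the normal approximation to the binomial."""
            p_hat = k / n
            denom = 1 + z**2 / n
            centre = (p_hat + z**2 / (2 * n)) / denom
            half = (z / denom) * math.sqrt(p_hat * (1 - p_hat) / n + z**2 / (4 * n**2))
            return centre - half, centre + half

        # e.g. 32 correct answers out of 40 items
        print(wilson_score_interval(32, 40))   # roughly (0.65, 0.90)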

  4. Non-stochastic sampling error in quantal analyses for Campylobacter species on poultry products

    USDA-ARS?s Scientific Manuscript database

    Using primers and fluorescent probes specific for the most common foodborne Campylobacter species (C. jejuni = Cj and C. coli = Cc), we developed a multiplex, most probable number (MPN) assay using quantitative PCR (qPCR) as the determinant for binomial detection: number of p positives out of n = 6 ...

  5. Inference for binomial probability based on dependent Bernoulli random variables with applications to meta‐analysis and group level studies

    PubMed Central

    Bakbergenuly, Ilyas; Morgenthaler, Stephan

    2016-01-01

    We study bias arising as a result of nonlinear transformations of random variables in random or mixed effects models and its effect on inference in group-level studies or in meta-analysis. The findings are illustrated on the example of overdispersed binomial distributions, where we demonstrate considerable biases arising from standard log-odds and arcsine transformations of the estimated probability p̂, both for single-group studies and in combining results from several groups or studies in meta-analysis. Our simulations confirm that these biases are linear in ρ, for small values of ρ, the intracluster correlation coefficient. These biases do not depend on the sample sizes or the number of studies K in a meta-analysis and result in abysmal coverage of the combined effect for large K. We also propose bias-correction for the arcsine transformation. Our simulations demonstrate that this bias-correction works well for small values of the intraclass correlation. The methods are applied to two examples of meta-analyses of prevalence. PMID:27192062

  6. Estimating relative risks for common outcome using PROC NLP.

    PubMed

    Yu, Binbing; Wang, Zhuoqiao

    2008-05-01

    In cross-sectional or cohort studies with binary outcomes, it is biologically interpretable and of interest to estimate the relative risk or prevalence ratio, especially when the response rates are not rare. Several methods have been used to estimate the relative risk, among which the log-binomial models yield the maximum likelihood estimate (MLE) of the parameters. Because of restrictions on the parameter space, the log-binomial models often run into convergence problems. Some remedies, e.g., the Poisson and Cox regressions, have been proposed. However, these methods may give out-of-bound predicted response probabilities. In this paper, a new computation method using the SAS Nonlinear Programming (NLP) procedure is proposed to find the MLEs. The proposed NLP method was compared to the COPY method, a modified method to fit the log-binomial model. Issues in the implementation are discussed. For illustration, both methods were applied to data on the prevalence of microalbuminuria (micro-protein leakage into urine) for kidney disease patients from the Diabetes Control and Complications Trial. The sample SAS macro for calculating relative risk is provided in the appendix.
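
    A hedged sketch of the same idea in Python rather than SAS: maximize the binomial likelihood under a log link while constraining every fitted probability to stay at or below 1, which is the restriction that causes the convergence problems. The data, variable names, and optimizer are illustrative; this is not the paper's NLP macro or the COPY method.

        import numpy as np
        from scipy.optimize import minimize

        rng = np.random.default_rng(1)

        # Illustrative data: one binary exposure, common binary outcome
        x = rng.integers(0, 2, size=500)
        true_rr = 1.5
        p = 0.3 * true_rr ** x              # P(Y=1) = 0.3 unexposed, 0.45 exposed
        y = rng.random(500) < p

        X = np.column_stack([np.ones_like(x), x])   # intercept + exposure

        def neg_log_lik(beta):
            eta = X @ beta                           # log-probability scale
            prob = np.clip(np.exp(eta), 1e-12, 1 - 1e-12)
            return -np.sum(y * np.log(prob) + (~y) * np.log(1 - prob))

        # Constraint: log p <= 0 for every covariate pattern, i.e. p <= 1
        cons = {"type": "ineq", "fun": lambda beta: -(X @ beta)}

        fit = minimize(neg_log_lik, x0=np.array([-1.0, 0.0]), constraints=cons)
        print("estimated relative risk:", np.exp(fit.x[1]))   # should be near the simulated 1.5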

  7. A comparative study of count models: application to pedestrian-vehicle crashes along Malaysia federal roads.

    PubMed

    Hosseinpour, Mehdi; Pour, Mehdi Hossein; Prasetijo, Joewono; Yahaya, Ahmad Shukri; Ghadiri, Seyed Mohammad Reza

    2013-01-01

    The objective of this study was to examine the effects of various roadway characteristics on the incidence of pedestrian-vehicle crashes by developing a set of crash prediction models on 543 km of Malaysia federal roads over a 4-year time span between 2007 and 2010. Four count models including the Poisson, negative binomial (NB), hurdle Poisson (HP), and hurdle negative binomial (HNB) models were developed and compared to model the number of pedestrian crashes. The results indicated the presence of overdispersion in the pedestrian crashes (PCs) and showed that it is due to excess zero rather than variability in the crash data. To handle the issue, the hurdle Poisson model was found to be the best model among the considered models in terms of comparative measures. Moreover, the variables average daily traffic, heavy vehicle traffic, speed limit, land use, and area type were significantly associated with PCs.
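
    A compact sketch of the hurdle idea the study applies: a logit part for whether any crash occurs, plus a zero-truncated Poisson for the positive counts. The intercept-only likelihood and the counts below are illustrative, not the Malaysian crash data.

        import numpy as np
        from scipy.optimize import minimize
        from scipy.special import gammaln

        def hurdle_poisson_nll(params, y):
            """Intercept-only hurdle Poisson: a logit hurdle for y > 0,
            plus a zero-truncated Poisson for the positive counts."""
            logit_pi, log_lam = params
            pi = 1 / (1 + np.exp(-logit_pi))      # P(y > 0)
            lam = np.exp(log_lam)
            zero = (y == 0)
            # zeros contribute log(1 - pi); positives contribute
            # log(pi) plus the zero-truncated Poisson log-density
            ll_zero = np.sum(zero) * np.log(1 - pi)
            yp = y[~zero]
            ll_pos = np.sum(np.log(pi) + yp * np.log(lam) - lam
                            - gammaln(yp + 1) - np.log(1 - np.exp(-lam)))
            return -(ll_zero + ll_pos)

        # Illustrative crash-like counts with excess zeros
        y = np.array([0]*70 + [1]*12 + [2]*9 + [3]*5 + [4]*3 + [6]*1)
        fit = minimize(hurdle_poisson_nll, x0=np.array([0.0, 0.0]), args=(y,))
        print("P(any crash) =", 1 / (1 + np.exp(-fit.x[0])))
        print("rate of the positive-count process =", np.exp(fit.x[1]))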

  8. Effect of Breastfeeding Promotion on Early Childhood Caries and Breastfeeding Duration among 5 Year Old Children in Eastern Uganda: A Cluster Randomized Trial.

    PubMed

    Birungi, Nancy; Fadnes, Lars T; Okullo, Isaac; Kasangaki, Arabat; Nankabirwa, Victoria; Ndeezi, Grace; Tumwine, James K; Tylleskär, Thorkild; Lie, Stein Atle; Åstrøm, Anne Nordrehaug

    2015-01-01

    Although several studies have shown short term health benefits of exclusive breastfeeding (EBF), its long term consequences have not been studied extensively in low-income contexts. This study assessed the impact of an EBF promotion initiative for 6 months on early childhood caries (ECC) and breastfeeding duration in children aged 5 years in Mbale, Eastern Uganda. Participants were recruited from the Ugandan site of the PROMISE-EBF cluster randomised trial (ClinicalTrials.gov no: NCT00397150). A total of 765 pregnant women from 24 clusters were included in a 1:1 ratio to receive peer-counselled promotion of EBF as the intervention or standard of care. At the 5 year follow-up, ECC was recorded under field conditions using the World Health Organization's decayed missing filled tooth (dmft) index. Adjusted negative binomial and linear regression were used in the analysis. Mean breastfeeding durations in the intervention and control groups (n=417) were 21.8 (CI 20.7-22.9) and 21.3 (CI 20.7-21.9) months, respectively. The mean dmft was 1.5 (standard deviation [SD] 2.9) and 1.7 (SD 2.9) in the intervention and control groups, respectively. Corresponding prevalence estimates of ECC were 38% and 41%. Negative binomial regression analysis adjusted for cluster effects and loss-to-follow-up by inverse probability weights (IPW) showed an incidence-rate ratio (IRR) of 0.91 (95% CI 0.65-1.2). Comparing the effect of the trial arm on breastfeeding duration showed a difference in months of 0.48 (-0.72 to 1.7). The PROMISE EBF trial did not impact on early childhood caries or breastfeeding duration at 5 years of age. This study contributes to the body of evidence that promotion of exclusive breastfeeding does not raise oral health concerns. However, the high burden of caries calls for efforts to improve the oral health condition in this setting. ClinicalTrials.gov NCT00397150.

  9. Cocoa Farmers’ Compliance with Safety Precautions in Spraying Agrochemicals and Use of Personal Protective Equipment (PPE) in Cameroon

    PubMed Central

    2018-01-01

    The inability of farmers to comply with essential precautions in the course of spraying agrochemicals remains a policy dilemma, especially in developing countries. The objectives of this paper were to assess cocoa farmers’ compliance with agrochemical safety measures and to analyse the factors explaining their reuse of agrochemical containers and their wearing of personal protective equipment (PPE). Data were collected with structured questionnaires from 667 cocoa farmers from the Centre and South West regions in Cameroon. Data analyses were carried out with Probit regression and Negative Binomial regression models. The results showed that average cocoa farm sizes were 3.55 ha and 2.82 ha in South West and Centre regions, respectively, and 89.80% and 42.64% complied with manufacturers’ instructions in the use of insecticides. Eating or drinking while spraying insecticides and fungicides was reported by 4.20% and 5.10% of all farmers in the two regions, respectively. However, 37.78% and 57.57% of all farmers wore hand gloves and safety boots while spraying insecticides in the South West and Centre regions of Cameroon, respectively. In addition, 7.80% of all the farmers would wash agrochemical containers and use them at home, while 42.43% would wash and use them on their farms. Probit regression results showed that probability of reusing agrochemical containers was significantly influenced (p < 0.05) by region of residence of cocoa farmers, gender, possession of formal education and farming as primary occupation. The Negative Binomial regression results showed that the log of the number of PPE items worn was significantly influenced (p < 0.10) by region, marital status, attainment of formal education, good health, awareness of manufacturers’ instructions, land area and contact index. It was concluded, among other things, that efforts to train farmers on the need to be familiar with manufacturers’ instructions and use PPE would enhance their safety in the course of spraying agrochemicals. PMID:29438333

  10. Cocoa Farmers' Compliance with Safety Precautions in Spraying Agrochemicals and Use of Personal Protective Equipment (PPE) in Cameroon.

    PubMed

    Oyekale, Abayomi Samuel

    2018-02-13

    The inability of farmers to comply with essential precautions in the course of spraying agrochemicals remains a policy dilemma, especially in developing countries. The objectives of this paper were to assess cocoa farmers' compliance with agrochemical safety measures and to analyse the factors explaining their reuse of agrochemical containers and their wearing of personal protective equipment (PPE). Data were collected with structured questionnaires from 667 cocoa farmers from the Centre and South West regions in Cameroon. Data analyses were carried out with Probit regression and Negative Binomial regression models. The results showed that average cocoa farm sizes were 3.55 ha and 2.82 ha in South West and Centre regions, respectively, and 89.80% and 42.64% complied with manufacturers' instructions in the use of insecticides. Eating or drinking while spraying insecticides and fungicides was reported by 4.20% and 5.10% of all farmers in the two regions, respectively. However, 37.78% and 57.57% of all farmers wore hand gloves and safety boots while spraying insecticides in the South West and Centre regions of Cameroon, respectively. In addition, 7.80% of all the farmers would wash agrochemical containers and use them at home, while 42.43% would wash and use them on their farms. Probit regression results showed that probability of reusing agrochemical containers was significantly influenced (p < 0.05) by region of residence of cocoa farmers, gender, possession of formal education and farming as primary occupation. The Negative Binomial regression results showed that the log of the number of PPE items worn was significantly influenced (p < 0.10) by region, marital status, attainment of formal education, good health, awareness of manufacturers' instructions, land area and contact index. It was concluded, among other things, that efforts to train farmers on the need to be familiar with manufacturers' instructions and use PPE would enhance their safety in the course of spraying agrochemicals.

  11. Analyzing crash frequency in freeway tunnels: A correlated random parameters approach.

    PubMed

    Hou, Qinzhong; Tarko, Andrew P; Meng, Xianghai

    2018-02-01

    The majority of past road safety studies focused on open road segments while only a few focused on tunnels. Moreover, the past tunnel studies produced some inconsistent results about the safety effects of the traffic patterns, the tunnel design, and the pavement conditions. The effects of these conditions therefore remain unknown, especially for freeway tunnels in China. The study presented in this paper investigated the safety effects of these various factors utilizing a four-year period (2009-2012) of data as well as three models: 1) a random effects negative binomial model (RENB), 2) an uncorrelated random parameters negative binomial model (URPNB), and 3) a correlated random parameters negative binomial model (CRPNB). Of these three, the results showed that the CRPNB model provided better goodness-of-fit and offered more insights into the factors that contribute to tunnel safety. The CRPNB was not only able to allocate the part of the otherwise unobserved heterogeneity to the individual model parameters but also was able to estimate the cross-correlations between these parameters. Furthermore, the study results showed that traffic volume, tunnel length, proportion of heavy trucks, curvature, and pavement rutting were associated with higher frequencies of traffic crashes, while the distance to the tunnel wall, distance to the adjacent tunnel, distress ratio, International Roughness Index (IRI), and friction coefficient were associated with lower crash frequencies. In addition, the effects of the heterogeneity of the proportion of heavy trucks, the curvature, the rutting depth, and the friction coefficient were identified and their inter-correlations were analyzed. Copyright © 2017 Elsevier Ltd. All rights reserved.

  12. Estimation of the cure rate in Iranian breast cancer patients.

    PubMed

    Rahimzadeh, Mitra; Baghestani, Ahmad Reza; Gohari, Mahmood Reza; Pourhoseingholi, Mohamad Amin

    2014-01-01

    Although Cox's proportional hazards model is a popular approach for survival analysis to investigate significant risk factors of cancer patient survival, it is not appropriate in the case of long-term disease-free survival. Recently, cure rate models have been introduced to distinguish between clinical determinants of cure and variables associated with the time to event of interest. The aim of this study was to use a cure rate model to determine the clinical factors associated with cure rates of patients with breast cancer (BC). This prospective cohort study covered 305 patients with BC, admitted to Shahid Faiazbakhsh Hospital, Tehran, during 2006 to 2008 and followed until April 2012. Cases of patient death were confirmed by telephone contact. For data analysis, non-mixed cure rate models with Poisson and negative binomial distributions were employed. All analyses were carried out using a macro developed in WinBUGS. The deviance information criterion (DIC) was used to find the best model. The overall 1-year, 3-year and 5-year relative survival rates were 97%, 89% and 74%. Metastasis and stage of BC were the significant factors, but age was significant only in the negative binomial model. The DIC also showed that the negative binomial model had a better fit. This study indicated that metastasis and stage of BC were identified as the clinical criteria for cure rates. There are limited studies on BC survival which employed these cure rate models to identify the clinical factors associated with cure. These models are better than Cox, in the case of long-term survival.

  13. Estimating spatial and temporal components of variation in count data using negative binomial mixed models

    USGS Publications Warehouse

    Irwin, Brian J.; Wagner, Tyler; Bence, James R.; Kepler, Megan V.; Liu, Weihai; Hayes, Daniel B.

    2013-01-01

    Partitioning total variability into its component temporal and spatial sources is a powerful way to better understand time series and elucidate trends. The data available for such analyses of fish and other populations are usually nonnegative integer counts of the number of organisms, often dominated by many low values with few observations of relatively high abundance. These characteristics are not well approximated by the Gaussian distribution. We present a detailed description of a negative binomial mixed-model framework that can be used to model count data and quantify temporal and spatial variability. We applied these models to data from four fishery-independent surveys of Walleyes Sander vitreus across the Great Lakes basin. Specifically, we fitted models to gill-net catches from Wisconsin waters of Lake Superior; Oneida Lake, New York; Saginaw Bay in Lake Huron, Michigan; and Ohio waters of Lake Erie. These long-term monitoring surveys varied in overall sampling intensity, the total catch of Walleyes, and the proportion of zero catches. Parameter estimation included the negative binomial scaling parameter, and we quantified the random effects as the variations among gill-net sampling sites, the variations among sampled years, and site × year interactions. This framework (i.e., the application of a mixed model appropriate for count data in a variance-partitioning context) represents a flexible approach that has implications for monitoring programs (e.g., trend detection) and for examining the potential of individual variance components to serve as response metrics to large-scale anthropogenic perturbations or ecological changes.

  14. Evaluation of surrogate measures for pedestrian safety in various road and roadside environments.

    DOT National Transportation Integrated Search

    2012-10-01

    This report presents an investigation of pedestrian conflicts and crash count models to learn which exposure measures and roadway or roadside characteristics significantly influence pedestrian safety at road crossings. Negative binomial models were e...

  15. A Classroom Note on the Binomial and Poisson Distributions: Biomedical Examples for Use in Teaching Introductory Statistics

    ERIC Educational Resources Information Center

    Holland, Bart K.

    2006-01-01

    A generally-educated individual should have some insight into how decisions are made in the very wide range of fields that employ statistical and probabilistic reasoning. Also, students of introductory probability and statistics are often best motivated by specific applications rather than by theory and mathematical development, because most…

  16. Lotka's Law and Institutional Productivity.

    ERIC Educational Resources Information Center

    Kumar, Suresh; Sharma, Praveen; Garg, K. C.

    1998-01-01

    Examines the applicability of Lotka's Law, negative binomial distribution, and lognormal distribution for institutional productivity in the same way as it is to authors and their productivity. Results indicate that none of the distributions are applicable for institutional productivity in engineering sciences. (Author/LRW)

  17. Binomial probability distribution model-based protein identification algorithm for tandem mass spectrometry utilizing peak intensity information.

    PubMed

    Xiao, Chuan-Le; Chen, Xiao-Zhou; Du, Yang-Li; Sun, Xuesong; Zhang, Gong; He, Qing-Yu

    2013-01-04

    Mass spectrometry has become one of the most important technologies in proteomic analysis. Tandem mass spectrometry (LC-MS/MS) is a major tool for the analysis of peptide mixtures from protein samples. The key step of MS data processing is the identification of peptides from experimental spectra by searching public sequence databases. Although a number of algorithms to identify peptides from MS/MS data have already been proposed, e.g. Sequest, OMSSA, X!Tandem, Mascot, etc., they are mainly based on statistical models considering only peak-matches between experimental and theoretical spectra, but not peak intensity information. Moreover, different algorithms gave different results from the same MS data, implying their probable incompleteness and questionable reproducibility. We developed a novel peptide identification algorithm, ProVerB, based on a binomial probability distribution model of protein tandem mass spectrometry combined with a new scoring function, making full use of peak intensity information and, thus, enhancing the ability of identification. Compared with Mascot, Sequest, and SQID, ProVerB identified significantly more peptides from LC-MS/MS data sets than the current algorithms at 1% False Discovery Rate (FDR) and provided more confident peptide identifications. ProVerB is also compatible with various platforms and experimental data sets, showing its robustness and versatility. The open-source program ProVerB is available at http://bioinformatics.jnu.edu.cn/software/proverb/.
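
    A sketch of the general binomial scoring idea such algorithms rely on (not ProVerB's actual scoring function): the probability of matching at least the observed number of fragment peaks purely by chance, which can then be turned into an additive score.

        from scipy.stats import binom

        def binomial_match_score(n_theoretical, n_matched, p_random):
            """Tail probability for a peptide-spectrum match: the chance of matching
            at least n_matched of n_theoretical fragment peaks at random, given a
            per-peak random-match probability p_random. Smaller is better."""
            return binom.sf(n_matched - 1, n_theoretical, p_random)

        # e.g. 9 of 14 predicted fragment ions matched, ~5% chance of a random match per ion
        print(binomial_match_score(14, 9, 0.05))   # a very small tail probability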

  18. Optimizing Probability of Detection Point Estimate Demonstration

    NASA Technical Reports Server (NTRS)

    Koshti, Ajay M.

    2017-01-01

    Probability of detection (POD) analysis is used to assess the reliably detectable flaw size in nondestructive evaluation (NDE). MIL-HDBK-1823 and the associated mh1823 POD software give the most common methods of POD analysis. Real flaws such as cracks and crack-like flaws need to be detected by these NDE methods, and a reliably detectable crack size is required for safe-life analysis of fracture-critical parts. The paper discusses optimizing probability of detection (POD) demonstration experiments using the point estimate method, which NASA uses for qualifying special NDE procedures. The point estimate method uses the binomial distribution for the probability density. Normally, a set of 29 flaws of the same size (within some tolerance) is used in the demonstration. The optimization is performed to provide an acceptable value for the probability of passing the demonstration (PPD) and an acceptable value for the probability of false calls (POF) while keeping the flaw sizes in the set as small as possible.
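
    The point estimate demonstration is a direct binomial calculation; a small sketch assuming the common zero-miss design with 29 flaws (the true-POD value used below is illustrative):

        from scipy.stats import binom

        n_flaws = 29          # flaws of (nominally) one size in the demonstration set
        true_pod = 0.95       # assumed true probability of detection at that size

        # Probability of passing a zero-miss demonstration (all 29 flaws detected)
        prob_pass = binom.pmf(n_flaws, n_flaws, true_pod)
        print(f"P(pass | POD={true_pod}) = {prob_pass:.3f}")           # ~0.226

        # Lower one-sided 95%-confidence bound on POD after 29/29 successes:
        # the smallest POD for which 29 straight detections is not implausibly lucky
        lower_bound = 0.05 ** (1 / n_flaws)
        print(f"95% lower confidence bound on POD = {lower_bound:.3f}")  # ~0.902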

  19. Remote sensing of earth terrain

    NASA Technical Reports Server (NTRS)

    Kong, J. A.

    1988-01-01

    Two monographs and 85 journal and conference papers on remote sensing of earth terrain have been published, sponsored by NASA Contract NAG5-270. A multivariate K-distribution is proposed to model the statistics of fully polarimetric data from earth terrain with polarizations HH, HV, VH, and VV. In this approach, correlated polarizations of radar signals, as characterized by a covariance matrix, are treated as the sum of N n-dimensional random vectors; N obeys the negative binomial distribution with a parameter alpha and mean N̄. Subsequently, an n-dimensional K-distribution, with either zero or non-zero mean, is developed in the limit of infinite N̄ or illuminated area. The probability density function (PDF) of the K-distributed vector normalized by its Euclidean norm is independent of the parameter alpha and is the same as that derived from a zero-mean Gaussian-distributed random vector. The above model is well supported by experimental data provided by MIT Lincoln Laboratory and the Jet Propulsion Laboratory in the form of polarimetric measurements.

  20. The spatial distribution of fixed mutations within genes coding for proteins

    NASA Technical Reports Server (NTRS)

    Holmquist, R.; Goodman, M.; Conroy, T.; Czelusniak, J.

    1983-01-01

    An examination has been conducted of the extensive amino acid sequence data now available for five protein families - the alpha crystallin A chain, myoglobin, alpha and beta hemoglobin, and the cytochromes c - with the goal of estimating the true spatial distribution of base substitutions within genes that code for proteins. In every case the commonly used Poisson density failed to even approximate the experimental pattern of base substitution. For the 87 species of beta hemoglobin examined, for example, the probability that the observed results were from a Poisson process was a minuscule 10^-44. Analogous results were obtained for the other functional families. All the data were reasonably, but not perfectly, described by the negative binomial density. In particular, most of the data were described by one of the very simple limiting forms of this density, the geometric density. The implications of this for evolutionary inference are discussed. It is evident that most estimates of total base substitutions between genes are badly in need of revision.
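
    A sketch of the kind of comparison the record describes: fit a Poisson and a negative binomial to per-site substitution counts and compare log-likelihoods. The counts below are invented for illustration, not the protein-family data.

        import numpy as np
        from scipy.stats import poisson, nbinom
        from scipy.optimize import minimize_scalar

        # Illustrative per-site substitution counts: many conserved sites, a long tail
        counts = np.repeat([0, 1, 2, 3, 5, 8, 12], [60, 20, 8, 5, 3, 2, 2])

        mu = counts.mean()
        ll_poisson = poisson.logpmf(counts, mu).sum()

        def nb_nll(log_k):
            k = np.exp(log_k)                 # NB shape ("clumping") parameter
            p = k / (k + mu)                  # scipy's nbinom uses (n=k, p)
            return -nbinom.logpmf(counts, k, p).sum()

        fit = minimize_scalar(nb_nll)
        ll_negbin = -fit.fun

        print(f"Poisson  log-likelihood: {ll_poisson:.1f}")
        print(f"Neg.bin. log-likelihood: {ll_negbin:.1f}")   # substantially higher here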

  1. Using beta binomials to estimate classification uncertainty for ensemble models.

    PubMed

    Clark, Robert D; Liang, Wenkel; Lee, Adam C; Lawless, Michael S; Fraczkiewicz, Robert; Waldman, Marvin

    2014-01-01

    Quantitative structure-activity relationship (QSAR) models have enormous potential for reducing drug discovery and development costs as well as the need for animal testing. Great strides have been made in estimating their overall reliability, but to fully realize that potential, researchers and regulators need to know how confident they can be in individual predictions. Submodels in an ensemble model which have been trained on different subsets of a shared training pool represent multiple samples of the model space, and the degree of agreement among them contains information on the reliability of ensemble predictions. For artificial neural network ensembles (ANNEs) using two different methods for determining ensemble classification - one using vote tallies and the other averaging individual network outputs - we have found that the distribution of predictions across positive vote tallies can be reasonably well-modeled as a beta binomial distribution, as can the distribution of errors. Together, these two distributions can be used to estimate the probability that a given predictive classification will be in error. Large data sets comprising logP, Ames mutagenicity, and CYP2D6 inhibition data are used to illustrate and validate the method. The distributions of predictions and errors for the training pool accurately predicted the distribution of predictions and errors for large external validation sets, even when the numbers of positive and negative examples in the training pool were not balanced. Moreover, the likelihood of a given compound being prospectively misclassified as a function of the degree of consensus between networks in the ensemble could in most cases be estimated accurately from the fitted beta binomial distributions for the training pool. Confidence in an individual predictive classification by an ensemble model can be accurately assessed by examining the distributions of predictions and errors as a function of the degree of agreement among the constituent submodels. Further, ensemble uncertainty estimation can often be improved by adjusting the voting or classification threshold based on the parameters of the error distribution. Finally, the profiles for models whose predictive uncertainty estimates are not reliable provide clues to that effect without the need for comparison to an external test set.
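
    scipy exposes the beta-binomial directly; below is a hedged sketch of one way two fitted distributions (one over all predictions, one over errors) could be combined to estimate the misclassification probability at a given vote tally. All parameter values and the Bayes-style combination are assumptions for illustration, not the paper's fitted values or procedure.

        from scipy.stats import betabinom

        # Ensemble of 33 networks voting "positive"; assumed beta-binomial parameters
        # fitted separately to the vote tallies of all predictions and of errors
        n_nets = 33
        pred_dist = betabinom(n_nets, 2.0, 3.0)    # tallies over all predictions
        error_dist = betabinom(n_nets, 0.8, 6.0)   # tallies among misclassified cases

        # P(error | exactly v positive votes) via Bayes' rule, with an assumed
        # overall error rate for the classifier on the training pool
        overall_error_rate = 0.12
        for v in (5, 17, 30):
            p_err = overall_error_rate * error_dist.pmf(v) / pred_dist.pmf(v)
            print(f"votes={v:2d}  estimated P(misclassification) = {p_err:.3f}")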

  2. Effects of fish species composition on Diphyllobothrium spp. infections in brown trout - is three-spined stickleback a key species?

    PubMed

    Kuhn, J A; Frainer, A; Knudsen, R; Kristoffersen, R; Amundsen, P-A

    2016-11-01

    Subarctic populations of brown trout (Salmo trutta) are often heavily infected with cestodes of the genus Diphyllobothrium, assumedly because of their piscivorous behaviour. This study explores possible associations between availability of fish prey and Diphyllobothrium spp. infections in lacustrine trout populations. Trout in (i) allopatry (group T); (ii) sympatry with Arctic charr (Salvelinus alpinus) (group TC); and (iii) sympatry with charr and three-spined stickleback (Gasterosteus aculeatus) (group TCS) were contrasted. Mean abundance and intensity of Diphyllobothrium spp. were higher in group TCS compared to groups TC and T. Prevalence, however, was similarly higher in groups TCS and TC compared to group T. Zero-altered negative binomial modelling identified the lowest probability of infection in group T and similar probabilities of infection in groups TC and TCS, whereas the highest intensity was predicted in group TCS. The most infected trout were from the group co-occurring with stickleback (TCS), possibly due to a higher availability of fish prey. In conclusion, our study demonstrates elevated Diphyllobothrium spp. infections in lacustrine trout populations where fish prey are available and suggests that highly available and easily caught stickleback prey may play a key role in the transmission of Diphyllobothrium spp. parasite larvae. © 2016 John Wiley & Sons Ltd.

  3. Inference for binomial probability based on dependent Bernoulli random variables with applications to meta-analysis and group level studies.

    PubMed

    Bakbergenuly, Ilyas; Kulinskaya, Elena; Morgenthaler, Stephan

    2016-07-01

    We study bias arising as a result of nonlinear transformations of random variables in random or mixed effects models and its effect on inference in group-level studies or in meta-analysis. The findings are illustrated on the example of overdispersed binomial distributions, where we demonstrate considerable biases arising from standard log-odds and arcsine transformations of the estimated probability p̂, both for single-group studies and in combining results from several groups or studies in meta-analysis. Our simulations confirm that these biases are linear in ρ, for small values of ρ, the intracluster correlation coefficient. These biases do not depend on the sample sizes or the number of studies K in a meta-analysis and result in abysmal coverage of the combined effect for large K. We also propose bias-correction for the arcsine transformation. Our simulations demonstrate that this bias-correction works well for small values of the intraclass correlation. The methods are applied to two examples of meta-analyses of prevalence. © 2016 The Authors. Biometrical Journal Published by Wiley-VCH Verlag GmbH & Co. KGaA.

  4. QNB: differential RNA methylation analysis for count-based small-sample sequencing data with a quad-negative binomial model.

    PubMed

    Liu, Lian; Zhang, Shao-Wu; Huang, Yufei; Meng, Jia

    2017-08-31

    As a newly emerged research area, RNA epigenetics has drawn increasing attention recently for the participation of RNA methylation and other modifications in a number of crucial biological processes. Thanks to high-throughput sequencing techniques such as MeRIP-Seq, transcriptome-wide RNA methylation profiles are now available in the form of count-based data, with which it is often of interest to study the dynamics at the epitranscriptomic layer. However, the sample size of an RNA methylation experiment is usually very small due to its cost, and there usually exist a large number of genes whose methylation level cannot be accurately estimated because of their low expression level, making differential RNA methylation analysis a difficult task. We present QNB, a statistical approach for differential RNA methylation analysis with count-based small-sample sequencing data. Compared with previous approaches such as the DRME model, which is based on a statistical test covering the IP samples only with 2 negative binomial distributions, QNB is based on 4 independent negative binomial distributions whose variances and means are linked by local regressions, so that the input control samples are also properly taken into account. In addition, unlike the DRME approach, which relies on the input control sample alone to estimate the background, QNB uses a more robust estimator for gene expression that combines information from both input and IP samples, which could largely improve the testing performance for very lowly expressed genes. QNB showed improved performance on both simulated and real MeRIP-Seq datasets when compared with competing algorithms. The QNB model is also applicable to other datasets related to RNA modifications, including but not limited to RNA bisulfite sequencing, m1A-Seq, Par-CLIP, RIP-Seq, etc.

  5. An analytical framework for estimating aquatic species density from environmental DNA

    USGS Publications Warehouse

    Chambert, Thierry; Pilliod, David S.; Goldberg, Caren S.; Doi, Hideyuki; Takahara, Teruhiko

    2018-01-01

    Environmental DNA (eDNA) analysis of water samples is on the brink of becoming a standard monitoring method for aquatic species. This method has improved detection rates over conventional survey methods and thus has demonstrated effectiveness for estimation of site occupancy and species distribution. The frontier of eDNA applications, however, is to infer species density. Building upon previous studies, we present and assess a modeling approach that aims at inferring animal density from eDNA. The modeling combines eDNA and animal count data from a subset of sites to estimate species density (and associated uncertainties) at other sites where only eDNA data are available. As a proof of concept, we first perform a cross-validation study using experimental data on carp in mesocosms. In these data, fish densities are known without error, which allows us to test the performance of the method with known data. We then evaluate the model using field data from a study on a stream salamander species to assess the potential of this method to work in natural settings, where density can never be known with absolute certainty. Two alternative distributions (Normal and Negative Binomial) to model variability in eDNA concentration data are assessed. Assessment based on the proof of concept data (carp) revealed that the Negative Binomial model provided much more accurate estimates than the model based on a Normal distribution, likely because eDNA data tend to be overdispersed. Greater imprecision was found when we applied the method to the field data, but the Negative Binomial model still provided useful density estimates. We call for further model development in this direction, as well as further research targeted at sampling design optimization. It will be important to assess these approaches on a broad range of study systems.

  6. Factors Associated with Hospital Length of Stay among Cancer Patients with Febrile Neutropenia

    PubMed Central

    Rosa, Regis G.; Goldani, Luciano Z.

    2014-01-01

    Purpose This study sought to evaluate factors associated with hospital length of stay in cancer patients with febrile neutropenia. Methods A prospective cohort study was performed at a single tertiary referral hospital in southern Brazil from October 2009 to August 2011. All adult cancer patients with febrile neutropenia admitted to the hematology ward were evaluated. Stepwise random-effects negative binomial regression was performed to identify risk factors for prolonged length of hospital stay. Results In total, 307 cases of febrile neutropenia were evaluated. The overall median length of hospital stay was 16 days (interquartile range 18 days). According to multiple negative binomial regression analysis, hematologic neoplasms (P = 0.003), high-dose chemotherapy regimens (P<0.001), duration of neutropenia (P<0.001), and bloodstream infection involving Gram-negative multi-drug-resistant bacteria (P = 0.003) were positively associated with prolonged hospital length of stay in patients with febrile neutropenia. The condition index showed no evidence of multi-collinearity effect among the independent variables. Conclusions Hematologic neoplasms, high-dose chemotherapy regimens, prolonged periods of neutropenia, and bloodstream infection with Gram-negative multi-drug-resistant bacteria are predictors of prolonged hospital length of stay among adult cancer patients with febrile neutropenia. PMID:25285790

  7. Tackling missing radiographic progression data: multiple imputation technique compared with inverse probability weights and complete case analysis.

    PubMed

    Descalzo, Miguel Á; Garcia, Virginia Villaverde; González-Alvaro, Isidoro; Carbonell, Jordi; Balsa, Alejandro; Sanmartí, Raimon; Lisbona, Pilar; Hernandez-Barrera, Valentín; Jiménez-Garcia, Rodrigo; Carmona, Loreto

    2013-02-01

    To describe the results of different statistical ways of addressing radiographic outcome affected by missing data--multiple imputation technique, inverse probability weights and complete case analysis--using data from an observational study. A random sample of 96 RA patients was selected for a follow-up study in which radiographs of hands and feet were scored. Radiographic progression was tested by comparing the change in the total Sharp-van der Heijde radiographic score (TSS) and the joint erosion score (JES) from baseline to the end of the second year of follow-up. MI technique, inverse probability weights in weighted estimating equation (WEE) and CC analysis were used to fit a negative binomial regression. Major predictors of radiographic progression were JES and joint space narrowing (JSN) at baseline, together with baseline disease activity measured by DAS28 for TSS and MTX use for JES. Results from CC analysis show larger coefficients and s.e.s compared with MI and weighted techniques. The results from the WEE model were quite in line with those of MI. If it seems plausible that CC or MI analysis may be valid, then MI should be preferred because of its greater efficiency. CC analysis resulted in inefficient estimates or, translated into non-statistical terminology, could guide us into inaccurate results and unwise conclusions. The methods discussed here will contribute to the use of alternative approaches for tackling missing data in observational studies.

  8. Finding consistent patterns: A nonparametric approach for identifying differential expression in RNA-Seq data

    PubMed Central

    Li, Jun; Tibshirani, Robert

    2015-01-01

    We discuss the identification of features that are associated with an outcome in RNA-Sequencing (RNA-Seq) and other sequencing-based comparative genomic experiments. RNA-Seq data takes the form of counts, so models based on the normal distribution are generally unsuitable. The problem is especially challenging because different sequencing experiments may generate quite different total numbers of reads, or ‘sequencing depths’. Existing methods for this problem are based on Poisson or negative binomial models: they are useful but can be heavily influenced by ‘outliers’ in the data. We introduce a simple, nonparametric method with resampling to account for the different sequencing depths. The new method is more robust than parametric methods. It can be applied to data with quantitative, survival, two-class or multiple-class outcomes. We compare our proposed method to Poisson and negative binomial-based methods in simulated and real data sets, and find that our method discovers more consistent patterns than competing methods. PMID:22127579

  9. Levels and sociodemographic correlates of accelerometer-based physical activity in Irish children: a cross-sectional study.

    PubMed

    Li, Xia; Kearney, Patricia M; Keane, Eimear; Harrington, Janas M; Fitzgerald, Anthony P

    2017-06-01

    The aim of this study was to explore levels and sociodemographic correlates of physical activity (PA) over 1 week using accelerometer data. Accelerometer data was collected over 1 week from 1075 8-11-year-old children in the cross-sectional Cork Children's Lifestyle Study. Threshold values were used to categorise activity intensity as sedentary, light, moderate or vigorous. Questionnaires collected data on demographic factors. Smoothed curves were used to display minute by minute variations. Binomial regression was used to identify factors correlated with the probability of meeting WHO 60 min moderate to vigorous PA guidelines. Overall, 830 children (mean (SD) age: 9.9(0.7) years, 56.3% boys) were included. From the binomial multiple regression analysis, boys were found more likely to meet guidelines (probability ratio 1.17, 95% CI 1.06 to 1.28) than girls. Older children were less likely to meet guidelines than younger children (probability ratio 0.91, CI 0.87 to 0.95). Normal weight children were more likely than overweight and obese children to meet guidelines (probability ratio 1.25, CI 1.16 to 1.34). Children in urban areas were more likely to meet guidelines than those in rural areas (probability ratio 1.19, CI 1.07 to 1.33). Longer daylight length days were associated with greater probability of meeting guidelines compared to shorter daylight length days. PA levels differed by individual factors including age, gender and weight status as well as by environmental factors including residence and daylight length. Less than one-quarter of children (26.8% boys, 16.2% girls) meet guidelines. Effective intervention policies are urgently needed to increase PA. Published by the BMJ Publishing Group Limited.

  10. Demonstrating the Safety and Reliability of a New System or Spacecraft: Incorporating Analyses and Reviews of the Design and Processing in Determining the Number of Tests to be Conducted

    NASA Technical Reports Server (NTRS)

    Vesely, William E.; Colon, Alfredo E.

    2010-01-01

    Design Safety/Reliability is associated with the probability of no failure-causing faults existing in a design. Confidence in the non-existence of failure-causing faults is increased by performing tests with no failure. Reliability-Growth testing requirements are based on initial assurance and fault detection probability. Using binomial tables generally gives too many required tests compared to reliability-growth requirements. Reliability-Growth testing requirements are based on reliability principles and factors and should be used.
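
    The binomial calculation behind "too many required tests" is the classical success-run formula; a small sketch of the number of consecutive failure-free tests needed to demonstrate a reliability level at a given confidence:

        import math

        def failure_free_tests(reliability, confidence):
            """Number of consecutive successful tests needed so that, under a
            binomial model with zero allowed failures, the stated reliability is
            demonstrated at the stated confidence: smallest n with R**n <= 1 - C."""
            return math.ceil(math.log(1 - confidence) / math.log(reliability))

        print(failure_free_tests(0.95, 0.90))   # 45 tests
        print(failure_free_tests(0.99, 0.90))   # 230 tests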

  11. Yes, the GIGP Really Does Work--And Is Workable!

    ERIC Educational Resources Information Center

    Burrell, Quentin L.; Fenton, Michael R.

    1993-01-01

    Discusses the generalized inverse Gaussian-Poisson (GIGP) process for informetric modeling. Negative binomial distribution is discussed, construction of the GIGP process is explained, zero-truncated GIGP is considered, and applications of the process with journals, library circulation statistics, and database index terms are described. (50…

  12. Multiple electron processes of He and Ne by proton impact

    NASA Astrophysics Data System (ADS)

    Terekhin, Pavel Nikolaevich; Montenegro, Pablo; Quinto, Michele; Monti, Juan; Fojon, Omar; Rivarola, Roberto

    2016-05-01

    A detailed investigation of multiple electron processes (single and multiple ionization, single capture, transfer-ionization) of He and Ne is presented for proton impact at intermediate and high collision energies. Exclusive absolute cross sections for these processes have been obtained by calculation of transition probabilities in the independent electron and independent event models as a function of impact parameter in the framework of the continuum distorted wave-eikonal initial state theory. A binomial analysis is employed to calculate exclusive probabilities. The comparison with available theoretical and experimental results shows that exclusive probabilities are needed for a reliable description of the experimental data. The developed approach can be used for obtaining the input database for modeling multiple electron processes of charged particles passing through matter.
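
    In the independent-electron picture, exclusive multiple-ionization probabilities follow from a binomial combination of the single-electron transition probability; a minimal sketch with an assumed probability value (the value itself is illustrative):

        from math import comb

        def exclusive_ionization_probability(p_single, n_electrons, q):
            """Independent-electron (binomial) probability that exactly q of
            n_electrons are ionized, given a single-electron ionization
            probability p_single at a fixed impact parameter."""
            return comb(n_electrons, q) * p_single**q * (1 - p_single)**(n_electrons - q)

        # e.g. helium (2 active electrons) with an assumed single-electron probability 0.2
        for q in range(3):
            print(q, exclusive_ionization_probability(0.2, 2, q))
        # 0 -> 0.64, 1 -> 0.32, 2 -> 0.04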

  13. The coefficient of determination R2 and intra-class correlation coefficient from generalized linear mixed-effects models revisited and expanded.

    PubMed

    Nakagawa, Shinichi; Johnson, Paul C D; Schielzeth, Holger

    2017-09-01

    The coefficient of determination R2 quantifies the proportion of variance explained by a statistical model and is an important summary statistic of biological interest. However, estimating R2 for generalized linear mixed models (GLMMs) remains challenging. We have previously introduced a version of R2 that we called R2GLMM for Poisson and binomial GLMMs, but not for other distributional families. Similarly, we earlier discussed how to estimate intra-class correlation coefficients (ICCs) using Poisson and binomial GLMMs. In this paper, we generalize our methods to all other non-Gaussian distributions, in particular to negative binomial and gamma distributions that are commonly used for modelling biological data. While expanding our approach, we highlight two useful concepts for biologists, Jensen's inequality and the delta method, both of which help us in understanding the properties of GLMMs. Jensen's inequality has important implications for biologically meaningful interpretation of GLMMs, whereas the delta method allows a general derivation of variance associated with non-Gaussian distributions. We also discuss some special considerations for binomial GLMMs with binary or proportion data. We illustrate the implementation of our extension by worked examples from the field of ecology and evolution in the R environment. However, our method can be used across disciplines and regardless of statistical environments. © 2017 The Author(s).
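
    The delta method mentioned here approximates the variance of a transformed variable from the variance of the original one, Var(g(X)) ≈ g'(E[X])² Var(X); for the log link this is Var(X)/E[X]². A small sketch using the gamma distribution, where the exact answer (the trigamma function) is available for comparison; the example is illustrative and not taken from the paper.

        from scipy.special import polygamma

        # If X ~ Gamma(shape=k, scale=s): E[X] = k*s, Var(X) = k*s**2,
        # so the delta-method approximation of Var(log X) is 1/k,
        # while the exact value is trigamma(k).
        for k in (2.0, 10.0, 50.0):
            exact = float(polygamma(1, k))   # trigamma(k)
            approx = 1.0 / k                 # Var(X) / E[X]**2
            print(f"shape={k:5.1f}   delta approx={approx:.4f}   exact={exact:.4f}")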

  14. A Financial Market Model Incorporating Herd Behaviour.

    PubMed

    Wray, Christopher M; Bishop, Steven R

    2016-01-01

    Herd behaviour in financial markets is a recurring phenomenon that exacerbates asset price volatility, and is considered a possible contributor to market fragility. While numerous studies investigate herd behaviour in financial markets, it is often considered without reference to the pricing of financial instruments or other market dynamics. Here, a trader interaction model based upon informational cascades in the presence of information thresholds is used to construct a new model of asset price returns that allows for both quiescent and herd-like regimes. Agent interaction is modelled using a stochastic pulse-coupled network, parametrised by information thresholds and a network coupling probability. Agents may possess either one or two information thresholds that, in each case, determine the number of distinct states an agent may occupy before trading takes place. In the case where agents possess two thresholds (labelled as the finite state-space model, corresponding to agents' accumulating information over a bounded state-space), and where coupling strength is maximal, an asymptotic expression for the cascade-size probability is derived and shown to follow a power law when a critical value of network coupling probability is attained. For a range of model parameters, a mixture of negative binomial distributions is used to approximate the cascade-size distribution. This approximation is subsequently used to express the volatility of model price returns in terms of the model parameter which controls the network coupling probability. In the case where agents possess a single pulse-coupling threshold (labelled as the semi-infinite state-space model corresponding to agents' accumulating information over an unbounded state-space), numerical evidence is presented that demonstrates volatility clustering and long-memory patterns in the volatility of asset returns. Finally, output from the model is compared to both the distribution of historical stock returns and the market price of an equity index option.

  15. Do parent–child acculturation gaps affect early adolescent Latino alcohol use? A study of the probability and extent of use

    PubMed Central

    2013-01-01

    The literature has been mixed regarding how parent–child relationships are affected by the acculturation process and how this process relates to alcohol use among Latino youth. The mixed results may be due to, at least, two factors: First, staggered migration in which one or both parents arrive to the new country and then send for the children may lead to faster acculturation in parents than in children for some families. Second, acculturation may have different effects depending on which aspects of alcohol use are being examined. This study addresses the first factor by testing for a curvilinear trend in the acculturation-alcohol use relationship and the second by modeling past year alcohol use as a zero inflated negative binomial distribution. Additionally, this study examined the unique and mediation effects of parent–child acculturation discrepancies (gap), mother involvement in children’s schooling, father involvement in children’s schooling, and effective parenting on youth alcohol use during the last 12 months, measured as the probability of using and the extent of use. Direct paths from parent–child acculturation discrepancy to alcohol use, and mediated paths through mother involvement, father involvement, and effective parenting were also tested. Only father involvement fully mediated the path from parent–child acculturation discrepancies to the probability of alcohol use. None of the variables examined mediated the path from parent–child acculturation discrepancies to the extent of alcohol use. Effective parenting was unrelated to acculturation discrepancies; however, it maintained a significant direct effect on the probability of youth alcohol use and the extent of use after controlling for mother and father involvement. Implications for prevention strategies are discussed. PMID:23347822

  16. School Violence: The Role of Parental and Community Involvement

    ERIC Educational Resources Information Center

    Lesneskie, Eric; Block, Steven

    2017-01-01

    This study utilizes the School Survey on Crime and Safety to identify variables that predict lower levels of violence from four domains: school security, school climate, parental involvement, and community involvement. Negative binomial regression was performed and the findings indicate that statistically significant results come from all four…

  17. Predicting Children's Asthma Hospitalizations: Rural and Urban Differences in Texas

    ERIC Educational Resources Information Center

    Grineski, Sara E.

    2009-01-01

    Asthma is the number one chronic health condition facing children today; however, little is known about rural-urban inequalities in asthma. This "area effects on health" study examines rural-urban differences in childhood asthma hospitalizations within the state of Texas using negative binomial regression models. Effects associated with…

  18. An Alternate Approach to Alternating Sums: A Method to DIE for

    ERIC Educational Resources Information Center

    Benjamin, Arthur T.; Quinn, Jennifer J.

    2008-01-01

    Positive sums count. Alternating sums match. Alternating sums of binomial coefficients, Fibonacci numbers, and other combinatorial quantities are analyzed using sign-reversing involutions. In particular, we describe the quantity being considered, match positive and negative terms through an Involution, and count the Exceptions to the matching rule…

  19. Transportation safety data and analysis : Volume 2, Calibration of the highway safety manual and development of new safety performance functions.

    DOT National Transportation Integrated Search

    2011-03-01

    This report documents the calibration of the Highway Safety Manual (HSM) safety performance function (SPF) : for rural two-lane two-way roadway segments in Utah and the development of new models using negative : binomial and hierarchical Bayesian mod...

  20. Use of Internet viral marketing to promote smoke-free lifestyles among Chinese adolescents.

    PubMed

    Ip, Patrick; Lam, Tai-Hing; Chan, Sophia Siu-Chee; Ho, Frederick Ka-Wing; Lo, Lewis A; Chiu, Ivy Wing-Sze; Wong, Wilfred Hing-Sang; Chow, Chun-Bong

    2014-01-01

    Youth smoking is a global public health concern. Health educators are increasingly using Internet-based technologies, but the effectiveness of Internet viral marketing in promoting health remains uncertain. This prospective pilot study assessed the efficacy of an online game-based viral marketing campaign in promoting a smoke-free attitude among Chinese adolescents. One hundred and twenty-one Hong Kong Chinese adolescents aged 10 to 24 were invited to participate in an online multiple-choice quiz game competition designed to deliver tobacco-related health information. Participants were encouraged to refer others to join. A zero-inflated negative binomial model was used to explore the factors contributing to the referral process. Latent transition analysis utilising a pre- and post-game survey was used to detect attitudinal changes toward smoking. The number of participants increased almost eightfold from 121 to 928 (34.6% current or ex-smokers) during the 22-day campaign. Participants exhibited significant attitudinal change, with 73% holding negative attitudes toward smoking after the campaign compared to 57% before it. The transition probabilities from positive to negative and neutral to negative attitudes were 0.52 and 0.48, respectively. It was also found that every 20 quiz questions attempted was associated with a lower perceived likelihood of deciding to smoke in the future (OR = 0.95, p-value < 0.01). Our online game-based viral marketing programme was effective in reaching a large number of smoking and non-smoking participants and changing their attitudes toward smoking. It constitutes a promising practical and cost-effective model for engaging young smokers and promulgating smoking-related health information among Chinese adolescents.

  1. Spatial distribution and sequential sampling plans for Tuta absoluta (Lepidoptera: Gelechiidae) in greenhouse tomato crops.

    PubMed

    Cocco, Arturo; Serra, Giuseppe; Lentini, Andrea; Deliperi, Salvatore; Delrio, Gavino

    2015-09-01

    The within- and between-plant distribution of the tomato leafminer, Tuta absoluta (Meyrick), was investigated in order to define action thresholds based on leaf infestation and to propose enumerative and binomial sequential sampling plans for pest management applications in protected crops. The pest spatial distribution was aggregated between plants, and median leaves were the most suitable sample to evaluate the pest density. Action thresholds of 36 and 48%, 43 and 56% and 60 and 73% infested leaves, corresponding to economic thresholds of 1 and 3% damaged fruits, were defined for tomato cultivars with big, medium and small fruits respectively. Green's method was a more suitable enumerative sampling plan as it required a lower sampling effort. Binomial sampling plans needed lower average sample sizes than enumerative plans to make a treatment decision, with probabilities of error of <0.10. The enumerative sampling plan required 87 or 343 leaves to estimate the population density in extensive or intensive ecological studies respectively. Binomial plans would be more practical and efficient for control purposes, needing average sample sizes of 17, 20 and 14 leaves to take a pest management decision in order to avoid fruit damage higher than 1% in cultivars with big, medium and small fruits respectively. © 2014 Society of Chemical Industry.

  2. Mixing in High Schmidt Number Turbulent Jets

    DTIC Science & Technology

    1991-01-01

    The higher-Sc jet is less well mixed; the difference is less pronounced at higher Re. Flame length estimates imply either an increase in entrainment... [The remainder of this record is table-of-contents and nomenclature fragments: Section 8.0, Estimation of flame lengths; Lf, flame length; N, number of trials (Eq. 3.1); p, probability of a binomial event (Eq. 3.1).]

  3. A Statistical Treatment of Bioassay Pour Fractions

    NASA Technical Reports Server (NTRS)

    Barengoltz, Jack; Hughes, David W.

    2014-01-01

    The binomial probability distribution is used to treat the statistics of a microbiological sample that is split into two parts, with only one part evaluated for spore count. One wishes to estimate the total number of spores in the sample based on the counts obtained from the part that is evaluated (pour fraction). Formally, the binomial distribution is recharacterized as a function of the observed counts (successes), with the total number (trials) an unknown. The pour fraction is the probability of success per spore (trial). This distribution must be renormalized in terms of the total number. Finally, the new renormalized distribution is integrated and mathematically inverted to yield the maximum estimate of the total number as a function of a desired level of confidence ( P(
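
    A hedged sketch of one common way to invert the binomial tail for this problem (not necessarily the paper's exact renormalization): take as the conservative estimate the largest total count for which observing so few spores in the evaluated pour fraction is still plausible at the chosen confidence level.

        from scipy.stats import binom

        def upper_bound_total(observed, pour_fraction, confidence=0.95):
            """Conservative (upper-confidence) estimate of the total spore count N
            when only a fraction of the sample is evaluated and `observed` spores
            are counted: the largest N with P(X <= observed | N, f) >= 1 - confidence."""
            alpha = 1.0 - confidence
            n = max(observed, 1)
            while binom.cdf(observed, n, pour_fraction) >= alpha:
                n += 1
            return n - 1

        # e.g. 4 spores counted in a 50% pour fraction
        print(upper_bound_total(observed=4, pour_fraction=0.5, confidence=0.95))   # -> 15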

  4. Effect of Breastfeeding Promotion on Early Childhood Caries and Breastfeeding Duration among 5 Year Old Children in Eastern Uganda: A Cluster Randomized Trial

    PubMed Central

    Birungi, Nancy; Fadnes, Lars T.; Okullo, Isaac; Kasangaki, Arabat; Nankabirwa, Victoria; Ndeezi, Grace; Tumwine, James K.; Tylleskär, Thorkild; Lie, Stein Atle; Åstrøm, Anne Nordrehaug

    2015-01-01

    Background Although several studies have shown short term health benefits of exclusive breastfeeding (EBF), its long term consequences have not been studied extensively in low-income contexts. This study assessed the impact of an EBF promotion initiative for 6 months on early childhood caries (ECC) and breastfeeding duration in children aged 5 years in Mbale, Eastern Uganda. Methods Participants were recruited from the Ugandan site of the PROMISE-EBF cluster randomised trial (ClinicalTrials.gov no: NCT00397150). A total of 765 pregnant women from 24 clusters were included in a 1:1 ratio to receive peer counselled promotion of EBF as the intervention or standard of care. At the 5 year follow-up, ECC was recorded under field conditions using the World Health Organization’s decayed missing filled tooth (dmft) index. Adjusted negative binomial and linear regression were used in the analysis. Results Mean breastfeeding durations in the intervention and control groups (n=417) were 21.8 (CI 20.7–22.9) and 21.3 (CI 20.7–21.9) months, respectively. The mean dmft was 1.5 (standard deviation [SD] 2.9) and 1.7 (SD 2.9) in the intervention and control groups, respectively. Corresponding prevalence estimates of ECC were 38% and 41%. Negative binomial regression analysis adjusted for cluster effects and loss-to-follow-up by inverse probability weights (IPW) showed an incidence-rate ratio (IRR) of 0.91 (95% CI 0.65–1.2). Comparing the effect of the trial arm on breastfeeding duration showed a difference in months of 0.48 (-0.72 to 1.7). Conclusion The PROMISE EBF trial did not impact on early childhood caries or breastfeeding duration at 5 years of age. This study contributes to the body of evidence that promotion of exclusive breastfeeding does not raise oral health concerns. However, the high burden of caries calls for efforts to improve the oral health condition in this setting. Trial Registration ClinicalTrials.gov NCT00397150 PMID:25938681

  5. The influence of baseline marijuana use on treatment of cocaine dependence: application of an informative-priors bayesian approach.

    PubMed

    Green, Charles; Schmitz, Joy; Lindsay, Jan; Pedroza, Claudia; Lane, Scott; Agnelli, Rob; Kjome, Kimberley; Moeller, F Gerard

    2012-01-01

    Marijuana use is prevalent among patients with cocaine dependence and often non-exclusionary in clinical trials of potential cocaine medications. The dual-focus of this study was to (1) examine the moderating effect of baseline marijuana use on response to treatment with levodopa/carbidopa for cocaine dependence; and (2) apply an informative-priors, Bayesian approach for estimating the probability of a subgroup-by-treatment interaction effect. A secondary data analysis of two previously published, double-blind, randomized controlled trials provided complete data for the historical (Study 1: N = 64 placebo), and current (Study 2: N = 113) data sets. Negative binomial regression evaluated Treatment Effectiveness Scores (TES) as a function of medication condition (levodopa/carbidopa, placebo), baseline marijuana use (days in past 30), and their interaction. Bayesian analysis indicated that there was a 96% chance that baseline marijuana use predicts differential response to treatment with levodopa/carbidopa. Simple effects indicated that among participants receiving levodopa/carbidopa the probability that baseline marijuana confers harm in terms of reducing TES was 0.981; whereas the probability that marijuana confers harm within the placebo condition was 0.163. For every additional day of marijuana use reported at baseline, participants in the levodopa/carbidopa condition demonstrated a 5.4% decrease in TES; while participants in the placebo condition demonstrated a 4.9% increase in TES. The potential moderating effect of marijuana on cocaine treatment response should be considered in future trial designs. Applying Bayesian subgroup analysis proved informative in characterizing this patient-treatment interaction effect.
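
    As a point of reference for the model structure (medication condition, baseline marijuana use, and their interaction), a minimal frequentist sketch with simulated data is given below; it does not reproduce the informative-priors Bayesian analysis, and all variable names (tes, levodopa, mj_days) and parameter values are invented for illustration.

        import numpy as np, pandas as pd
        import statsmodels.api as sm
        import statsmodels.formula.api as smf

        rng = np.random.default_rng(0)
        n = 200
        df = pd.DataFrame({
            "levodopa": rng.integers(0, 2, n),   # 1 = levodopa/carbidopa, 0 = placebo
            "mj_days": rng.integers(0, 31, n),   # baseline marijuana use, days in past 30
        })
        mu = np.exp(1.5 + 0.05 * df["mj_days"] - 0.05 * df["levodopa"] * df["mj_days"])
        df["tes"] = rng.negative_binomial(2, 2 / (2 + mu))  # synthetic TES counts

        nb = smf.glm("tes ~ levodopa * mj_days", data=df,
                     family=sm.families.NegativeBinomial(alpha=0.5)).fit()
        print(np.exp(nb.params))  # rate ratios; the interaction term is the moderation effect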

  6. The Influence of Baseline Marijuana Use on Treatment of Cocaine Dependence: Application of an Informative-Priors Bayesian Approach

    PubMed Central

    Green, Charles; Schmitz, Joy; Lindsay, Jan; Pedroza, Claudia; Lane, Scott; Agnelli, Rob; Kjome, Kimberley; Moeller, F. Gerard

    2012-01-01

    Background: Marijuana use is prevalent among patients with cocaine dependence and often non-exclusionary in clinical trials of potential cocaine medications. The dual-focus of this study was to (1) examine the moderating effect of baseline marijuana use on response to treatment with levodopa/carbidopa for cocaine dependence; and (2) apply an informative-priors, Bayesian approach for estimating the probability of a subgroup-by-treatment interaction effect. Method: A secondary data analysis of two previously published, double-blind, randomized controlled trials provided complete data for the historical (Study 1: N = 64 placebo), and current (Study 2: N = 113) data sets. Negative binomial regression evaluated Treatment Effectiveness Scores (TES) as a function of medication condition (levodopa/carbidopa, placebo), baseline marijuana use (days in past 30), and their interaction. Results: Bayesian analysis indicated that there was a 96% chance that baseline marijuana use predicts differential response to treatment with levodopa/carbidopa. Simple effects indicated that among participants receiving levodopa/carbidopa the probability that baseline marijuana confers harm in terms of reducing TES was 0.981; whereas the probability that marijuana confers harm within the placebo condition was 0.163. For every additional day of marijuana use reported at baseline, participants in the levodopa/carbidopa condition demonstrated a 5.4% decrease in TES; while participants in the placebo condition demonstrated a 4.9% increase in TES. Conclusion: The potential moderating effect of marijuana on cocaine treatment response should be considered in future trial designs. Applying Bayesian subgroup analysis proved informative in characterizing this patient-treatment interaction effect. PMID:23115553

  7. Optimal estimation for discrete time jump processes

    NASA Technical Reports Server (NTRS)

    Vaca, M. V.; Tretter, S. A.

    1978-01-01

    Optimum estimates of nonobservable random variables or random processes which influence the rate functions of a discrete time jump process (DTJP) are derived. The approach used is based on the a posteriori probability of a nonobservable event expressed in terms of the a priori probability of that event and of the sample function probability of the DTJP. Thus a general representation is obtained for optimum estimates, and recursive equations are derived for minimum mean-squared error (MMSE) estimates. In general, MMSE estimates are nonlinear functions of the observations. The problem is considered of estimating the rate of a DTJP when the rate is a random variable with a beta probability density function and the jump amplitudes are binomially distributed. It is shown that the MMSE estimates are linear. The class of beta density functions is rather rich and explains why there are insignificant differences between optimum unconstrained and linear MMSE estimates in a variety of problems.
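
    The linearity result can be seen from standard beta-binomial conjugacy; the sketch below, in notation chosen here rather than the paper's, shows why the MMSE (posterior-mean) estimate of the rate is a linear function of the observed count.

        \text{Prior: } p \sim \mathrm{Beta}(a, b), \qquad x \mid p \sim \mathrm{Binomial}(n, p)
        \;\Longrightarrow\; p \mid x \sim \mathrm{Beta}(a + x,\; b + n - x),
        \qquad \hat{p}_{\mathrm{MMSE}} = E[p \mid x] = \frac{a + x}{a + b + n},

    which is linear in the observation x.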

  8. General Strain Theory as a Basis for the Design of School Interventions

    ERIC Educational Resources Information Center

    Moon, Byongook; Morash, Merry

    2013-01-01

    The research described in this article applies general strain theory to identify possible points of intervention for reducing delinquency of students in two middle schools. Data were collected from 296 youths, and separate negative binomial regression analyses were used to identify predictors of violent, property, and status delinquency. Emotional…

  9. A New Zero-Inflated Negative Binomial Methodology for Latent Category Identification

    ERIC Educational Resources Information Center

    Blanchard, Simon J.; DeSarbo, Wayne S.

    2013-01-01

    We introduce a new statistical procedure for the identification of unobserved categories that vary between individuals and in which objects may span multiple categories. This procedure can be used to analyze data from a proposed sorting task in which individuals may simultaneously assign objects to multiple piles. The results of a synthetic…

  10. The Effectiveness of an Electronic Security Management System in a Privately Owned Apartment Complex

    ERIC Educational Resources Information Center

    Greenberg, David F.; Roush, Jeffrey B.

    2009-01-01

    Poisson and negative binomial regression methods are used to analyze the monthly time series data to determine the effects of introducing an integrated security management system including closed-circuit television (CCTV), door alarm monitoring, proximity card access, and emergency call boxes to a large privately-owned complex of apartment…

  11. Socio-environmental predictors of Barmah forest virus transmission in coastal areas, Queensland, Australia.

    PubMed

    Naish, Suchithra; Hu, Wenbiao; Nicholls, Neville; Mackenzie, John S; Dale, Pat; McMichael, Anthony J; Tong, Shilu

    2009-02-01

    To assess the socio-environmental predictors of Barmah forest virus (BFV) transmission in coastal areas, Queensland, Australia. Data on BFV notified cases, climate, tidal levels and socioeconomic index for area (SEIFA) in six coastal cities, Queensland, for the period 1992-2001 were obtained from the relevant government agencies. Negative binomial regression models were used to assess the socio-environmental predictors of BFV transmission. The results show that maximum and minimum temperature, rainfall, relative humidity, high and low tide were statistically significantly associated with BFV incidence at lags 0-2 months. The fitted negative binomial regression models indicate a significant independent association of each of maximum temperature (beta = 0.139, P = 0.000), high tide (beta = 0.005, P = 0.000) and SEIFA index (beta = -0.010, P = 0.000) with BFV transmission after adjustment for confounding variables. The transmission of BFV disease in Queensland coastal areas seemed to be determined by a combination of local social and environmental factors. The model developed in this study may have applications in the control and prevention of BFV disease in these areas.

  12. Statistical models for RNA-seq data derived from a two-condition 48-replicate experiment.

    PubMed

    Gierliński, Marek; Cole, Christian; Schofield, Pietà; Schurch, Nicholas J; Sherstnev, Alexander; Singh, Vijender; Wrobel, Nicola; Gharbi, Karim; Simpson, Gordon; Owen-Hughes, Tom; Blaxter, Mark; Barton, Geoffrey J

    2015-11-15

    High-throughput RNA sequencing (RNA-seq) is now the standard method to determine differential gene expression. Identifying differentially expressed genes crucially depends on estimates of read-count variability. These estimates are typically based on statistical models such as the negative binomial distribution, which is employed by the tools edgeR, DESeq and cuffdiff. Until now, the validity of these models has usually been tested on either low-replicate RNA-seq data or simulations. A 48-replicate RNA-seq experiment in yeast was performed and data tested against theoretical models. The observed gene read counts were consistent with both log-normal and negative binomial distributions, while the mean-variance relation followed the line of constant dispersion parameter of ∼0.01. The high-replicate data also allowed for strict quality control and screening of 'bad' replicates, which can drastically affect the gene read-count distribution. RNA-seq data have been submitted to ENA archive with project ID PRJEB5348. g.j.barton@dundee.ac.uk. © The Author 2015. Published by Oxford University Press.
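
    For orientation, the mean-variance relation referred to here is the standard negative binomial one; in generic notation (not taken from the paper), a constant dispersion of about 0.01 means the variance exceeds the Poisson value by roughly 1% of the squared mean:

        \operatorname{Var}(y) = \mu + \phi\,\mu^{2}, \qquad \phi \approx 0.01,

    so for a gene with mean count 1000 the implied standard deviation is about \sqrt{1000 + 0.01 \times 1000^{2}} \approx 105, versus about 32 under a pure Poisson model.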

  13. Analysis of overdispersed count data: application to the Human Papillomavirus Infection in Men (HIM) Study.

    PubMed

    Lee, J-H; Han, G; Fulp, W J; Giuliano, A R

    2012-06-01

    The Poisson model can be applied to the count of events occurring within a specific time period. The main feature of the Poisson model is the assumption that the mean and variance of the count data are equal. However, this equal mean-variance relationship rarely occurs in observational data. In most cases, the observed variance is larger than the assumed variance, which is called overdispersion. Further, when the observed data involve excessive zero counts, the problem of overdispersion results in underestimating the variance of the estimated parameter, and thus produces a misleading conclusion. We illustrated the use of four models for overdispersed count data that may be attributed to excessive zeros. These are Poisson, negative binomial, zero-inflated Poisson and zero-inflated negative binomial models. The example data in this article deal with the number of incidents involving human papillomavirus infection. The four models resulted in differing statistical inferences. The Poisson model, which is widely used in epidemiology research, underestimated the standard errors and overstated the significance of some covariates.
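
    The central point, that a Poisson fit understates standard errors when the data are overdispersed, can be checked with a small simulation; this is a generic sketch (not the HIM Study data or models), and the covariate and parameter values are invented.

        import numpy as np
        import statsmodels.api as sm

        rng = np.random.default_rng(1)
        n = 500
        x = rng.normal(size=n)
        mu = np.exp(0.5 + 0.3 * x)
        y = rng.negative_binomial(1, 1 / (1 + mu))   # overdispersed counts: Var = mu + mu**2
        X = sm.add_constant(x)

        poisson_fit = sm.GLM(y, X, family=sm.families.Poisson()).fit()
        nb_fit = sm.GLM(y, X, family=sm.families.NegativeBinomial(alpha=1.0)).fit()

        # The Poisson standard errors are too small for these data; the NB ones are more honest.
        print(poisson_fit.bse, nb_fit.bse)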

  14. Statistical models for RNA-seq data derived from a two-condition 48-replicate experiment

    PubMed Central

    Cole, Christian; Schofield, Pietà; Schurch, Nicholas J.; Sherstnev, Alexander; Singh, Vijender; Wrobel, Nicola; Gharbi, Karim; Simpson, Gordon; Owen-Hughes, Tom; Blaxter, Mark; Barton, Geoffrey J.

    2015-01-01

    Motivation: High-throughput RNA sequencing (RNA-seq) is now the standard method to determine differential gene expression. Identifying differentially expressed genes crucially depends on estimates of read-count variability. These estimates are typically based on statistical models such as the negative binomial distribution, which is employed by the tools edgeR, DESeq and cuffdiff. Until now, the validity of these models has usually been tested on either low-replicate RNA-seq data or simulations. Results: A 48-replicate RNA-seq experiment in yeast was performed and data tested against theoretical models. The observed gene read counts were consistent with both log-normal and negative binomial distributions, while the mean-variance relation followed the line of constant dispersion parameter of ∼0.01. The high-replicate data also allowed for strict quality control and screening of ‘bad’ replicates, which can drastically affect the gene read-count distribution. Availability and implementation: RNA-seq data have been submitted to ENA archive with project ID PRJEB5348. Contact: g.j.barton@dundee.ac.uk PMID:26206307

  15. Variable selection for zero-inflated and overdispersed data with application to health care demand in Germany

    PubMed Central

    Wang, Zhu; Shuangge, Ma; Wang, Ching-Yun

    2017-01-01

    In health services and outcome research, count outcomes are frequently encountered and often have a large proportion of zeros. The zero-inflated negative binomial (ZINB) regression model has important applications for this type of data. With many possible candidate risk factors, this paper proposes new variable selection methods for the ZINB model. We consider the maximum likelihood function plus a penalty, including the least absolute shrinkage and selection operator (LASSO), smoothly clipped absolute deviation (SCAD) and minimax concave penalty (MCP). An EM (expectation-maximization) algorithm is proposed for estimating the model parameters and conducting variable selection simultaneously. This algorithm consists of estimating penalized weighted negative binomial models and penalized logistic models via the coordinate descent algorithm. Furthermore, statistical properties including the standard error formulae are provided. A simulation study shows that the new algorithm not only gives estimation that is more accurate, or at least comparable, but is also more robust than traditional stepwise variable selection. The proposed methods are applied to analyze health care demand in Germany using the open-source R package mpath. PMID:26059498

  16. Simplified pupal surveys of Aedes aegypti (L.) for entomologic surveillance and dengue control.

    PubMed

    Barrera, Roberto

    2009-07-01

    Pupal surveys of Aedes aegypti (L.) are useful indicators of risk for dengue transmission, although sample sizes for reliable estimations can be large. This study explores two methods for making pupal surveys more practical yet reliable and used data from 10 pupal surveys conducted in Puerto Rico during 2004-2008. The number of pupae per person for each sampling followed a negative binomial distribution, thus showing aggregation. One method found a common aggregation parameter (k) for the negative binomial distribution, a finding that enabled the application of a sequential sampling method requiring few samples to determine whether the number of pupae/person was above a vector density threshold for dengue transmission. A second approach used the finding that the mean number of pupae/person is correlated with the proportion of pupa-infested households and calculated equivalent threshold proportions of pupa-positive households. A sequential sampling program was also developed for this method to determine whether observed proportions of infested households were above threshold levels. These methods can be used to validate entomological thresholds for dengue transmission.
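
    The link between the mean number of pupae per person and the proportion of pupa-positive households follows from the zero probability of a negative binomial with a common aggregation parameter k; the identity below is the standard one (the study's fitted constants are not reproduced here):

        P(Y = 0) = \left(\frac{k}{k + m}\right)^{k}
        \quad\Longrightarrow\quad
        \text{proportion positive} = 1 - \left(\frac{k}{k + m}\right)^{k},

    so a density threshold on the mean m translates directly into an equivalent threshold proportion of infested households.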

  17. Modeling left-turn crash occurrence at signalized intersections by conflicting patterns.

    PubMed

    Wang, Xuesong; Abdel-Aty, Mohamed

    2008-01-01

    In order to better understand the underlying crash mechanisms, left-turn crashes occurring at 197 four-legged signalized intersections over 6 years were classified into nine patterns based on vehicle maneuvers and then were assigned to intersection approaches. Crash frequency of each pattern was modeled at the approach level by mainly using Generalized Estimating Equations (GEE) with the Negative Binomial as the link function to account for the correlation among the crash data. GEE with a binomial logit link function was also applied for patterns with fewer crashes. The Cumulative Residuals test shows that, for correlated left-turn crashes, GEE models usually outperformed basic Negative Binomial models. The estimation results show that there are obvious differences in the factors that cause the occurrence of different left-turn collision patterns. For example, for each pattern, the traffic flows to which the colliding vehicles belong are identified to be significant. The width of the crossing distance (represented by the number of through lanes on the opposing approach of the left-turning traffic) is associated with more left-turn traffic colliding with opposing through traffic (Pattern 5), but with less left-turning traffic colliding with near-side crossing through traffic (Pattern 8). The safety effectiveness of the left-turning signal is not consistent for different crash patterns; "protected" phasing is correlated with fewer Pattern 5 crashes, but with more Pattern 8 crashes. The study indicates that in order to develop efficient countermeasures for left-turn crashes and improve safety at signalized intersections, left-turn crashes should be considered in different patterns.

  18. Relationship between suicide rate and economic growth and stock market in the People's Republic of China: 2004-2013.

    PubMed

    Yin, Honglei; Xu, Lin; Shao, Yechang; Li, Liping; Wan, Chengsong

    2016-01-01

    The objective of this study was to estimate the features of suicide rate and its association with economic development and stock market during the past decade in the People's Republic of China. Official data were gathered and analyzed in the People's Republic of China during the period 2004-2013. Nationwide suicide rate was stratified by four year age-groups, sex, urban/rural areas, and regions (East, Central, and West). Annual economic indexes including gross domestic product (GDP) per capita and rural and urban income per capita were all adjusted for inflation. Variation coefficient of market index (VCMI) was also included as an economic index to measure the fluctuation of the stock market. Negative binomial regression was performed to examine the time trend of region-level suicide rates and effects of sex, age, urban/rural area, region, and economic index on the suicide rates. Suicide rates of each age-group, sex, urban/rural area, and region were generally decreased from 2004 to 2013, while annual GDP per capita and rural and urban income per capita were generally increased by year. VCMI fluctuated largely, which peaked around 2009 and decreased after that time. Negative binomial regression showed that the decreased suicide rate in East and Central rural areas was the main cause of the decrease in suicide rate in the People's Republic of China. Suicide rate in the People's Republic of China for the study period increased with age and was higher in rural than in urban area, higher in males than in females, and the highest in the Central region. When GDP per capita increased by 2,787 RMB, the suicide rate decreased by 0.498 times. VCMI showed no significant relationship with suicide rate in the negative binomial regression. Suicide rate decreased in 2004-2013; varied among different age-groups, sex, urban/rural areas, and regions; and was negatively associated with the economic growth in the People's Republic of China. Stock market showed no relationship with suicide rate, but this finding needs to be verified in a future study.

  19. Relationship between suicide rate and economic growth and stock market in the People’s Republic of China: 2004–2013

    PubMed Central

    Yin, Honglei; Xu, Lin; Shao, Yechang; Li, Liping; Wan, Chengsong

    2016-01-01

    Objectives The objective of this study was to estimate the features of suicide rate and its association with economic development and stock market during the past decade in the People’s Republic of China. Methods Official data were gathered and analyzed in the People’s Republic of China during the period 2004–2013. Nationwide suicide rate was stratified by four year age-groups, sex, urban/rural areas, and regions (East, Central, and West). Annual economic indexes including gross domestic product (GDP) per capita and rural and urban income per capita were all adjusted for inflation. Variation coefficient of market index (VCMI) was also included as an economic index to measure the fluctuation of the stock market. Negative binomial regression was performed to examine the time trend of region-level suicide rates and effects of sex, age, urban/rural area, region, and economic index on the suicide rates. Results Suicide rates of each age-group, sex, urban/rural area, and region were generally decreased from 2004 to 2013, while annual GDP per capita and rural and urban income per capita were generally increased by year. VCMI fluctuated largely, which peaked around 2009 and decreased after that time. Negative binomial regression showed that the decreased suicide rate in East and Central rural areas was the main cause of the decrease in suicide rate in the People’s Republic of China. Suicide rate in the People’s Republic of China for the study period increased with age and was higher in rural than in urban area, higher in males than in females, and the highest in the Central region. When GDP per capita increased by 2,787 RMB, the suicide rate decreased by 0.498 times. VCMI showed no significant relationship with suicide rate in the negative binomial regression. Conclusion Suicide rate decreased in 2004–2013; varied among different age-groups, sex, urban/rural areas, and regions; and was negatively associated with the economic growth in the People’s Republic of China. Stock market showed no relationship with suicide rate, but this finding needs to be verified in a future study. PMID:27994468

  20. Site occupancy models with heterogeneous detection probabilities

    USGS Publications Warehouse

    Royle, J. Andrew

    2006-01-01

    Models for estimating the probability of occurrence of a species in the presence of imperfect detection are important in many ecological disciplines. In these 'site occupancy' models, the possibility of heterogeneity in detection probabilities among sites must be considered because variation in abundance (and other factors) among sampled sites induces variation in detection probability (p). In this article, I develop occurrence probability models that allow for heterogeneous detection probabilities by considering several common classes of mixture distributions for p. For any mixing distribution, the likelihood has the general form of a zero-inflated binomial mixture for which inference based upon integrated likelihood is straightforward. A recent paper by Link (2003, Biometrics 59, 1123-1130) demonstrates that in closed population models used for estimating population size, different classes of mixture distributions are indistinguishable from data, yet can produce very different inferences about population size. I demonstrate that this problem can also arise in models for estimating site occupancy in the presence of heterogeneous detection probabilities. The implications of this are discussed in the context of an application to avian survey data and the development of animal monitoring programs.

  1. Photon Counting Data Analysis: Application of the Maximum Likelihood and Related Methods for the Determination of Lifetimes in Mixtures of Rose Bengal and Rhodamine B

    DOE PAGES

    Santra, Kalyan; Smith, Emily A.; Petrich, Jacob W.; ...

    2016-12-12

    It is often convenient to know the minimum amount of data needed in order to obtain a result of desired accuracy and precision. It is a necessity in the case of subdiffraction-limited microscopies, such as stimulated emission depletion (STED) microscopy, owing to the limited sample volumes and the extreme sensitivity of the samples to photobleaching and photodamage. We present a detailed comparison of probability-based techniques (the maximum likelihood method and methods based on the binomial and the Poisson distributions) with residual minimization-based techniques for retrieving the fluorescence decay parameters for various two-fluorophore mixtures, as a function of the total number of photon counts, in time-correlated, single-photon counting experiments. The probability-based techniques proved to be the most robust (insensitive to initial values) in retrieving the target parameters and, in fact, performed equivalently to 2-3 significant figures. This is to be expected, as we demonstrate that the three methods are fundamentally related. Furthermore, methods based on the Poisson and binomial distributions have the desirable feature of providing a bin-by-bin analysis of a single fluorescence decay trace, which thus permits statistics to be acquired using only the one trace for not only the mean and median values of the fluorescence decay parameters but also for the associated standard deviations. Lastly, these probability-based methods lend themselves well to the analysis of the sparse data sets that are encountered in subdiffraction-limited microscopies.

  2. Estimating safety effects of pavement management factors utilizing Bayesian random effect models.

    PubMed

    Jiang, Ximiao; Huang, Baoshan; Zaretzki, Russell L; Richards, Stephen; Yan, Xuedong

    2013-01-01

    Previous studies of pavement management factors that relate to the occurrence of traffic-related crashes are rare. Traditional research has mostly employed summary statistics of bidirectional pavement quality measurements in extended longitudinal road segments over a long time period, which may cause a loss of important information and result in biased parameter estimates. The research presented in this article focuses on crash risk of roadways with overall fair to good pavement quality. Real-time and location-specific data were employed to estimate the effects of pavement management factors on the occurrence of crashes. This research is based on the crash data and corresponding pavement quality data for the Tennessee state route highways from 2004 to 2009. The potential temporal and spatial correlations among observations caused by unobserved factors were considered. Overall 6 models were built accounting for no correlation, temporal correlation only, and both the temporal and spatial correlations. These models included Poisson, negative binomial (NB), one random effect Poisson and negative binomial (OREP, ORENB), and two random effect Poisson and negative binomial (TREP, TRENB) models. The Bayesian method was employed to construct these models. The inference is based on the posterior distribution from the Markov chain Monte Carlo (MCMC) simulation. These models were compared using the deviance information criterion. Analysis of the posterior distribution of parameter coefficients indicates that the pavement management factors indexed by Present Serviceability Index (PSI) and Pavement Distress Index (PDI) had significant impacts on the occurrence of crashes, whereas the variable rutting depth was not significant. Among other factors, lane width, median width, type of terrain, and posted speed limit were significant in affecting crash frequency. The findings of this study indicate that a reduction in pavement roughness would reduce the likelihood of traffic-related crashes. Hence, maintaining a low level of pavement roughness is strongly suggested. In addition, the results suggested that the temporal correlation among observations was significant and that the ORENB model outperformed all other models.
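
    For readers unfamiliar with the model names, a one-random-effect negative binomial (ORENB) specification has roughly the following hierarchical form; the notation is generic rather than copied from the paper, and the two-random-effect variants add an analogous temporal term.

        y_{it} \sim \mathrm{NegBin}(\mu_{it}, r), \qquad
        \log \mu_{it} = \mathbf{x}_{it}^{\top}\boldsymbol{\beta} + u_i, \qquad
        u_i \sim N(0, \sigma_u^{2}),

    with weakly informative priors on \boldsymbol{\beta}, r and \sigma_u^{2}, posterior inference by MCMC, and model comparison via the deviance information criterion (DIC).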

  3. Buffon, Georges Louis Leclerc Comte de (1707-88)

    NASA Astrophysics Data System (ADS)

    Murdin, P.

    2000-11-01

    French naturalist. Discovered the binomial theorem and worked on probability theory. In astronomy he suggested that the Earth might have been created by the collision of a comet with the Sun. Based on the cooling rate of iron, he calculated in Théorie de la Terre that the age of the Earth was 75 000 years. This estimate, so much larger than the official 6000 years, was condemned by the Catholic C...

  4. Bernoulli, Darwin, and Sagan: the probability of life on other planets

    NASA Astrophysics Data System (ADS)

    Rossmo, D. Kim

    2017-04-01

    The recent discovery that billions of planets in the Milky Way Galaxy may be in circumstellar habitable zones has renewed speculation over the possibility of extraterrestrial life. The Drake equation is a probabilistic framework for estimating the number of technologically advanced civilizations in our Galaxy; however, many of the equation's component probabilities are either unknown or have large error intervals. In this paper, a different method of examining this question is explored, one that replaces the various Drake factors with the single estimate for the probability of life existing on Earth. This relationship can be described by the binomial distribution if the presence of life on a given number of planets is equated to successes in a Bernoulli trial. The question of exoplanet life may then be reformulated as follows - given the probability of one or more independent successes for a given number of trials, what is the probability of two or more successes? Some of the implications of this approach for finding life on exoplanets are discussed.
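
    The reformulated question has a compact closed form under the binomial model; using generic symbols (n planets, per-planet probability p), one version of the calculation is:

        P(X \ge 1) = 1 - (1 - p)^{n}
        \quad\Longrightarrow\quad
        P(X \ge 2) = 1 - (1 - p)^{n} - n\,p\,(1 - p)^{n-1}.

    Given an estimate of P(X >= 1) and n, one can solve the first expression for p and substitute it into the second to obtain the probability of life on at least one planet other than Earth.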

  5. Perceived Prevalence of Teasing and Bullying Predicts High School Dropout Rates

    ERIC Educational Resources Information Center

    Cornell, Dewey; Gregory, Anne; Huang, Francis; Fan, Xitao

    2013-01-01

    This prospective study of 276 Virginia public high schools found that the prevalence of teasing and bullying (PTB) as perceived by both 9th-grade students and teachers was predictive of dropout rates for this cohort 4 years later. Negative binomial regression indicated that one standard deviation increases in student- and teacher-reported PTB were…

  6. The Influence of Television Advertisements on Promoting Calls to Telephone Quitlines

    ERIC Educational Resources Information Center

    Farrelly, Matthew; Mann, Nathan; Watson, Kimberly; Pechacek, Terry

    2013-01-01

    The aim of the study was to assess the relative effectiveness of cessation, secondhand smoke and other tobacco control television advertisements in promoting quitlines in nine states from 2002 through 2005. Quarterly, the number of individuals who used quitlines per 10 000 adult smokers in a media market are measured. Negative binomial regression…

  7. Modeling Zero-Inflated and Overdispersed Count Data: An Empirical Study of School Suspensions

    ERIC Educational Resources Information Center

    Desjardins, Christopher David

    2016-01-01

    The purpose of this article is to develop a statistical model that best explains variability in the number of school days suspended. Number of school days suspended is a count variable that may be zero-inflated and overdispersed relative to a Poisson model. Four models were examined: Poisson, negative binomial, Poisson hurdle, and negative…

  8. Logit and probit model in toll sensitivity analysis of Solo-Ngawi, Kartasura-Palang Joglo segment based on Willingness to Pay (WTP)

    NASA Astrophysics Data System (ADS)

    Handayani, Dewi; Cahyaning Putri, Hera; Mahmudah, AMH

    2017-12-01

    The Solo-Ngawi toll road project is part of the Trans Java toll road mega project initiated by the government and is still under construction. PT Solo Ngawi Jaya (SNJ), the Solo-Ngawi toll management company, needs to set a toll fare that is consistent with its business plan. Setting an appropriate toll rate affects regional economic sustainability and helps reduce traffic congestion, and such policy instruments are crucial for achieving environmentally sustainable transport. The objective of this research is therefore to determine the sensitivity of the Solo-Ngawi toll fare based on Willingness To Pay (WTP). Primary data were obtained by distributing stated preference questionnaires to four-wheeled vehicle users on the Kartasura-Palang Joglo arterial road segment; the data were then analysed with logit and probit models. The analysis shows that, under the same travel conditions, the effect of fare changes on WTP is more sensitive in the binomial logit model than in the probit model: the range of fare changes relative to WTP values in the binomial logit model is 20% greater than in the probit model. On the other hand, the predicted probabilities of the binomial logit and binary probit models differ by less than 1%, which is not a significant difference.
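
    For reference, the two acceptance-probability models being compared have the following generic forms (the coefficients a and b are placeholders, not the study's estimates); the fare sensitivity is the derivative of each curve with respect to the fare.

        P_{\mathrm{logit}} = \frac{1}{1 + e^{-(a + b\,\mathrm{fare})}}, \qquad
        P_{\mathrm{probit}} = \Phi(a + b\,\mathrm{fare}),

    with fare sensitivities dP/d(fare) = b P(1 - P) for the logit and b \phi(a + b\,\mathrm{fare}) for the probit, where \Phi and \phi are the standard normal CDF and density.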

  9. Use of Internet Viral Marketing to Promote Smoke-Free Lifestyles among Chinese Adolescents

    PubMed Central

    Ip, Patrick; Lam, Tai-Hing; Chan, Sophia Siu-Chee; Ho, Frederick Ka-Wing; Lo, Lewis A.; Chiu, Ivy Wing-Sze; Wong, Wilfred Hing-Sang; Chow, Chun-Bong

    2014-01-01

    Purpose Youth smoking is a global public health concern. Health educators are increasingly using Internet-based technologies, but the effectiveness of Internet viral marketing in promoting health remains uncertain. This prospective pilot study assessed the efficacy of an online game-based viral marketing campaign in promoting a smoke-free attitude among Chinese adolescents. Methods One hundred and twenty-one Hong Kong Chinese adolescents aged 10 to 24 were invited to participate in an online multiple-choice quiz game competition designed to deliver tobacco-related health information. Participants were encouraged to refer others to join. A zero-inflated negative binomial model was used to explore the factors contributing to the referral process. Latent transition analysis utilising a pre- and post-game survey was used to detect attitudinal changes toward smoking. Results The number of participants increased almost eightfold from 121 to 928 (34.6% current or ex-smokers) during the 22-day campaign. Participants exhibited significant attitudinal change, with 73% holding negative attitudes toward smoking after the campaign compared to 57% before it. The transition probabilities from positive to negative and neutral to negative attitudes were 0.52 and 0.48, respectively. It was also found that attempting every 20 quiz questions was associated with lower perceived smoking decision in future (OR  = 0.95, p-value <0.01). Conclusions Our online game-based viral marketing programme was effective in reaching a large number of smoking and non-smoking participants and changing their attitudes toward smoking. It constitutes a promising practical and cost-effective model for engaging young smokers and promulgating smoking-related health information among Chinese adolescents. PMID:24911010

  10. Simulation of flight maneuver-load distributions by utilizing stationary, non-Gaussian random load histories

    NASA Technical Reports Server (NTRS)

    Leybold, H. A.

    1971-01-01

    Random numbers were generated with the aid of a digital computer and transformed such that the probability density function of a discrete random load history composed of these random numbers had one of the following non-Gaussian distributions: Poisson, binomial, log-normal, Weibull, and exponential. The resulting random load histories were analyzed to determine their peak statistics and were compared with cumulative peak maneuver-load distributions for fighter and transport aircraft in flight.
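
    A present-day equivalent of that procedure is a few lines of NumPy; the sketch below draws discrete load histories from the same families of distributions named in the abstract (parameter values are arbitrary) and counts the peaks, i.e. samples larger than both neighbours.

        import numpy as np

        rng = np.random.default_rng(42)
        n = 10_000  # length of each discrete random load history

        histories = {
            "poisson":     rng.poisson(lam=5.0, size=n),
            "binomial":    rng.binomial(n=20, p=0.25, size=n),
            "log-normal":  rng.lognormal(mean=0.0, sigma=0.5, size=n),
            "weibull":     rng.weibull(a=1.5, size=n),
            "exponential": rng.exponential(scale=1.0, size=n),
        }

        for name, x in histories.items():
            # A "peak" is a sample strictly greater than both of its neighbours.
            peaks = x[1:-1][(x[1:-1] > x[:-2]) & (x[1:-1] > x[2:])]
            print(f"{name:12s} peaks: {peaks.size:5d}  mean peak: {peaks.mean():.2f}")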

  11. A Lower Bound to the Probability of Choosing the Optimal Passing Score for a Mastery Test When There is an External Criterion [and] Estimating the Parameters of the Beta-Binomial Distribution.

    ERIC Educational Resources Information Center

    Wilcox, Rand R.

    A mastery test is frequently described as follows: an examinee responds to n dichotomously scored test items. Depending upon the examinee's observed (number correct) score, a mastery decision is made and the examinee is advanced to the next level of instruction. Otherwise, a nonmastery decision is made and the examinee is given remedial work. This…

  12. Some considerations for excess zeroes in substance abuse research.

    PubMed

    Bandyopadhyay, Dipankar; DeSantis, Stacia M; Korte, Jeffrey E; Brady, Kathleen T

    2011-09-01

    Count data collected in substance abuse research often come with an excess of "zeroes," which are typically handled using zero-inflated regression models. However, there is a need to consider the design aspects of those studies before using such a statistical model to ascertain the sources of zeroes. We sought to illustrate hurdle models as alternatives to zero-inflated models to validate a two-stage decision-making process in situations of "excess zeroes." We use data from a study of 45 cocaine-dependent subjects where the primary scientific question was to evaluate whether study participation influences drug-seeking behavior. The outcome, "the frequency (count) of cocaine use days per week," is bounded (ranging from 0 to 7). We fit and compare binomial, Poisson, negative binomial, and the hurdle version of these models to study the effect of gender, age, time, and study participation on cocaine use. The hurdle binomial model provides the best fit. Gender and time are not predictive of use. Higher odds of use versus no use are associated with age; however once use is experienced, odds of further use decrease with increase in age. Participation was associated with higher odds of no-cocaine use; once there is use, participation reduced the odds of further use. Age and study participation are significantly predictive of cocaine-use behavior. The two-stage decision process as modeled by a hurdle binomial model (appropriate for bounded count data with excess zeroes) provides interesting insights into the study of covariate effects on count responses of substance use, when all enrolled subjects are believed to be "at-risk" of use.
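
    The hurdle binomial model described here can be written out explicitly; in generic notation (not the authors'), with 7 possible use-days per week, the two-stage probability mass function is:

        P(Y = 0) = \pi, \qquad
        P(Y = y) = (1 - \pi)\,
        \frac{\binom{7}{y} p^{y} (1 - p)^{7 - y}}{1 - (1 - p)^{7}}, \quad y = 1, \dots, 7,

    where \pi is modeled with a logistic regression (the "hurdle": any use versus none) and p with a truncated binomial regression for the amount of use once the hurdle is crossed; covariates such as age and study participation can enter both parts with different coefficients.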

  13. Spatial Distribution of Adult Anthonomus grandis Boheman (Coleoptera: Curculionidae) and Damage to Cotton Flower Buds Due to Feeding and Oviposition.

    PubMed

    Grigolli, J F J; Souza, L A; Fernandes, M G; Busoli, A C

    2017-08-01

    The cotton boll weevil Anthonomus grandis Boheman (Coleoptera: Curculionidae) is the main pest of cotton crops around the world, directly affecting cotton production. In order to establish a sequential sampling plan, it is crucial to understand the spatial distribution of the pest population and the damage it causes to the crop through the different developmental stages of cotton plants. Therefore, this study aimed to investigate the spatial distribution of adults in the cultivation area and their oviposition and feeding behavior throughout the development of the cotton plants. The experiment was conducted in Maracaju, Mato Grosso do Sul, Brazil, in the 2012/2013 and 2013/2014 growing seasons, in an area of 10,000 m², planted with the cotton cultivar FM 993. The experimental area was divided into 100 plots of 100 m² (10 × 10 m) each, and five plants per plot were sampled weekly throughout the crop cycle. The number of flower buds with feeding and oviposition punctures and of adult A. grandis was recorded throughout the crop cycle in five plants per plot. After determining the aggregation indices (variance/mean ratio, Morisita's index, exponent k of the negative binomial distribution, and Green's coefficient) and adjusting the frequencies observed in the field to the distribution of frequencies (Poisson, negative binomial, and positive binomial) using the chi-squared test, it was observed that flower buds with punctures derived from feeding, oviposition, and feeding + oviposition showed an aggregated distribution in the cultivation area until 85 days after emergence and a random distribution after this stage. The adults of A. grandis presented a random distribution in the cultivation area.
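
    Two of the aggregation indices mentioned, the variance-to-mean ratio and the exponent k of the negative binomial, can be computed directly from per-plot counts; the snippet below uses the usual method-of-moments estimator on made-up counts, not the study's data.

        import numpy as np

        counts = np.array([0, 2, 1, 0, 5, 3, 0, 0, 7, 1, 2, 0, 4, 0, 1])  # pest counts per plot (illustrative)
        m, s2 = counts.mean(), counts.var(ddof=1)

        vmr = s2 / m                                        # variance/mean ratio: > 1 suggests aggregation
        k_moment = m**2 / (s2 - m) if s2 > m else np.inf    # moment estimator of the NB exponent k

        print(f"variance/mean ratio = {vmr:.2f}, moment estimate of k = {k_moment:.2f}")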

  14. [Spatial epidemiological study on malaria epidemics in Hainan province].

    PubMed

    Wen, Liang; Shi, Run-He; Fang, Li-Qun; Xu, De-Zhong; Li, Cheng-Yi; Wang, Yong; Yuan, Zheng-Quan; Zhang, Hui

    2008-06-01

    To better understand the spatial distribution of malaria epidemics in Hainan province, to explore the relationship between malaria epidemics and environmental factors, and to develop a prediction model for malaria epidemics. Data on malaria and meteorological factors were collected in all 19 counties of Hainan province from May to October 2000, and the proportions of land use types in these counties during this period were extracted from a digital land use map of Hainan province. Land surface temperatures (LST) were extracted from MODIS images, and county elevations were extracted from a DEM of Hainan province. Correlation coefficients between malaria incidence and these environmental factors were then calculated with SPSS 13.0, and negative binomial regression analyses were performed using SAS 9.0. Malaria incidence showed (1) positive correlations with elevation and with the proportions of forest land and grassland area; (2) negative correlations with the proportions of cultivated area, urban and rural residential and industrial enterprise area, and with LST; and (3) no correlation with meteorological factors, proportion of water area, or unused land area. The prediction model obtained from the negative binomial regression analysis was: monthly incidence (per 1,000,000) = exp(-1.672 - 0.399 × LST). The spatial distribution of malaria epidemics was associated with several environmental factors, and a malaria prediction model could be developed using indexes extracted from satellite remote sensing images.

  15. Single-vehicle crashes along rural mountainous highways in Malaysia: An application of random parameters negative binomial model.

    PubMed

    Rusli, Rusdi; Haque, Md Mazharul; King, Mark; Voon, Wong Shaw

    2017-05-01

    Mountainous highways generally associate with complex driving environment because of constrained road geometries, limited cross-section elements, inappropriate roadside features, and adverse weather conditions. As a result, single-vehicle (SV) crashes are overrepresented along mountainous roads, particularly in developing countries, but little attention is known about the roadway geometric, traffic and weather factors contributing to these SV crashes. As such, the main objective of the present study is to investigate SV crashes using detailed data obtained from a rigorous site survey and existing databases. The final dataset included a total of 56 variables representing road geometries including horizontal and vertical alignment, traffic characteristics, real-time weather condition, cross-sectional elements, roadside features, and spatial characteristics. To account for structured heterogeneities resulting from multiple observations within a site and other unobserved heterogeneities, the study applied a random parameters negative binomial model. Results suggest that rainfall during the crash is positively associated with SV crashes, but real-time visibility is negatively associated. The presence of a road shoulder, particularly a bitumen shoulder or wider shoulders, along mountainous highways is associated with less SV crashes. While speeding along downgrade slopes increases the likelihood of SV crashes, proper delineation decreases the likelihood. Findings of this study have significant implications for designing safer highways in mountainous areas, particularly in the context of a developing country. Copyright © 2017 Elsevier Ltd. All rights reserved.

  16. Type 1 diabetes, quality of life, occupational status and education level - A comparative population-based study.

    PubMed

    Nielsen, Helena B; Ovesen, Louise L; Mortensen, Laust H; Lau, Cathrine J; Joensen, Lene E

    2016-11-01

    Type 1 diabetes requires extensive self-management to avoid complications and may have negative effects on the everyday life of people with the disease. The aim of this study was to compare adults with type 1 diabetes to the general population in terms of health-related quality of life, occupational status (level of employment, working hours and sick leave) and education level. A total of 2,415 adults (aged 18-98 years) with type 1 diabetes were compared to 48,511 adults (aged 18-103 years) from the general population. Data were obtained from two cross-sectional surveys conducted in 2010 and 2011 of adults living or treated in the Capital Region in Denmark. Differences between adults with type 1 diabetes and the general population were standardised for age and sex and analyzed using linear probability models and negative binomial regression. Differences were further analyzed in subgroups. Compared to the general population, adults with type 1 diabetes experienced lower health-related quality of life, were more frequently unemployed, had more sick leave per year and were slightly better educated. Differences in health-related quality of life and employment increased with age and were larger among women, as compared to men. No significant differences were found with regard to working hours. Our findings suggest that type 1 diabetes is associated with lower health-related quality of life, higher unemployment and additional sick leave. The negative association with type 1 diabetes is more pronounced in women and older adults. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  17. Making sense of the noise: The effect of hydrology on silver carp eDNA detection in the Chicago area waterway system.

    PubMed

    Song, Jeffery W; Small, Mitchell J; Casman, Elizabeth A

    2017-12-15

    Environmental DNA (eDNA) sampling is an emerging tool for monitoring the spread of aquatic invasive species. One confounding factor when interpreting eDNA sampling evidence is that eDNA can be present in the water in the absence of living target organisms, originating from excreta, dead tissue, boats, or sewage effluent, etc. In the Chicago Area Waterway System (CAWS), electric fish dispersal barriers were built to prevent non-native Asian carp species from invading Lake Michigan, and yet Asian carp eDNA has been detected above the barriers sporadically since 2009. In this paper the influence of stream flow characteristics in the CAWS on the probability of invasive Asian carp eDNA detection in the CAWS from 2009 to 2012 was examined. In the CAWS, the direction of stream flow is mostly away from Lake Michigan, though there are infrequent reversals in flow direction towards Lake Michigan during dry spells. We find that the flow reversal volume into the Lake has a statistically significant positive relationship with eDNA detection probability, while other covariates, like gage height, precipitation, season, water temperature, dissolved oxygen concentration, pH and chlorophyll concentration do not. This suggests that stream flow direction is highly influential on eDNA detection in the CAWS and should be considered when interpreting eDNA evidence. We also find that the beta-binomial regression model provides a stronger fit for eDNA detection probability compared to a binomial regression model. This paper provides a statistical modeling framework for interpreting eDNA sampling evidence and for evaluating covariates influencing eDNA detection. Copyright © 2017 Elsevier B.V. All rights reserved.
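
    The advantage of the beta-binomial over the plain binomial for these detection data is that it allows extra-binomial variation in the per-sample detection probability; in generic notation (not the paper's parameterization):

        y_i \mid \theta_i \sim \mathrm{Binomial}(n_i, \theta_i), \qquad
        \theta_i \sim \mathrm{Beta}(\alpha_i, \beta_i), \qquad
        \operatorname{logit}\!\left(\tfrac{\alpha_i}{\alpha_i + \beta_i}\right)
        = \gamma_0 + \gamma_1 \times \text{(flow reversal volume)}_i,

    so the mean detection probability depends on the hydrologic covariate while the beta mixing absorbs sample-to-sample variability that a plain binomial regression would otherwise misattribute to the covariates.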

  18. Sleep Disruption Medical Intervention Forecasting (SDMIF) Module for the Integrated Medical Model

    NASA Technical Reports Server (NTRS)

    Lewandowski, Beth; Brooker, John; Mallis, Melissa; Hursh, Steve; Caldwell, Lynn; Myers, Jerry

    2011-01-01

    The NASA Integrated Medical Model (IMM) assesses the risk, including likelihood and impact of occurrence, of all credible in-flight medical conditions. Fatigue due to sleep disruption is a condition that could lead to operational errors, potentially resulting in loss of mission or crew. Pharmacological consumables are mitigation strategies used to manage the risks associated with sleep deficits. The likelihood of medical intervention due to sleep disruption was estimated with a well validated sleep model and a Monte Carlo computer simulation in an effort to optimize the quantity of consumables. METHODS: The key components of the model are the mission parameter program, the calculation of sleep intensity and the diagnosis and decision module. The mission parameter program was used to create simulated daily sleep/wake schedules for an ISS increment. The hypothetical schedules included critical events such as dockings and extravehicular activities and included actual sleep time and sleep quality. The schedules were used as inputs to the Sleep, Activity, Fatigue and Task Effectiveness (SAFTE) Model (IBR Inc., Baltimore MD), which calculated sleep intensity. Sleep data from an ISS study was used to relate calculated sleep intensity to the probability of sleep medication use, using a generalized linear model for binomial regression. A human yes/no decision process using a binomial random number was also factored into sleep medication use probability. RESULTS: These probability calculations were repeated 5000 times resulting in an estimate of the most likely amount of sleep aids used during an ISS mission and a 95% confidence interval. CONCLUSIONS: These results were transferred to the parent IMM for further weighting and integration with other medical conditions, to help inform operational decisions. This model is a potential planning tool for ensuring adequate sleep during sleep disrupted periods of a mission.
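
    The probability-and-decision step lends itself to a compact Monte Carlo sketch; the coefficients, the sleep-intensity series, and the logistic link below are placeholders standing in for the SAFTE-derived inputs and the fitted binomial regression, not the IMM's actual values.

        import numpy as np

        rng = np.random.default_rng(7)
        n_nights, n_sims = 180, 5000                        # one ISS increment, 5000 simulated missions
        sleep_intensity = rng.uniform(0.4, 1.0, n_nights)   # placeholder nightly sleep-intensity series

        # Placeholder binomial-regression link: lower sleep intensity -> higher chance of taking a sleep aid.
        p_use = 1.0 / (1.0 + np.exp(-(2.0 - 6.0 * sleep_intensity)))

        # Each simulated mission makes an independent yes/no decision every night.
        uses_per_mission = rng.binomial(1, p_use, size=(n_sims, n_nights)).sum(axis=1)

        print("most likely consumable count:", int(np.median(uses_per_mission)))
        print("95% interval:", np.percentile(uses_per_mission, [2.5, 97.5]))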

  19. A prospective study of mental health care for comorbid depressed mood in older adults with painful osteoarthritis

    PubMed Central

    2011-01-01

    Background Comorbid depression is common among adults with painful osteoarthritis (OA). We evaluated the relationship between depressed mood and receipt of mental health (MH) care services. Methods In a cohort with OA, annual interviews assessed comorbidity, arthritis severity, and MH (SF-36 mental health score). Surveys were linked to administrative health databases to identify mental health-related visits to physicians in the two years following the baseline interview (1996-98). Prescriptions for anti-depressants were ascertained for participants aged 65+ years (eligible for drug benefits). The relationship between MH scores and MH-related physician visits was assessed using zero-inflated negative binomial regression, adjusting for confounders. For those aged 65+ years, logistic regression examined the probability of receiving any MH-related care (physician visit or anti-depressant prescription). Results Analyses were based on 2,005 (90.1%) individuals (mean age 70.8 years). Of 576 (28.7%) with probable depression (MH score < 60/100), 42.5% experienced one or more MH-related physician visits during follow-up. The likelihood of a physician visit was associated with sex (adjusted OR women vs. men = 5.87, p = 0.005) and MH score (adjusted OR per 10-point decrease in MH score = 1.63, p = 0.003). Among those aged 65+, 56.7% with probable depression received any MH care. The likelihood of receiving any MH care exhibited a significant interaction between MH score and self-reported health status (p = 0.0009); with good general health, worsening MH was associated with increased likelihood of MH care; as general health declined, this effect was attenuated. Conclusions Among older adults with painful OA, more than one-quarter had depressed mood, but almost half received no mental health care, suggesting a care gap. PMID:21910895

  20. Behavioral Analysis of Visitors to a Medical Institution's Website Using Markov Chain Monte Carlo Methods.

    PubMed

    Suzuki, Teppei; Tani, Yuji; Ogasawara, Katsuhiko

    2016-07-25

    Consistent with the "attention, interest, desire, memory, action" (AIDMA) model of consumer behavior, patients collect information about available medical institutions using the Internet to select information for their particular needs. Studies of consumer behavior may be found in areas other than medical institution websites. Such research uses Web access logs for visitor search behavior. At this time, research applying the patient searching behavior model to medical institution website visitors is lacking. We have developed a hospital website search behavior model using a Bayesian approach to clarify the behavior of medical institution website visitors and determine the probability of their visits, classified by search keyword. We used the website data access log of a clinic of internal medicine and gastroenterology in the Sapporo suburbs, collecting data from January 1 through June 31, 2011. The contents of the 6 website pages included the following: home, news, content introduction for medical examinations, mammography screening, holiday person-on-duty information, and other. The search keywords we identified as best expressing website visitor needs were listed as the top 4 headings from the access log: clinic name, clinic name + regional name, clinic name + medical examination, and mammography screening. Using the search keywords as the explaining variable, we built a binomial probit model that allows inspection of the contents of each purpose variable. Using this model, we determined a beta value and generated a posterior distribution. We performed the simulation using Markov Chain Monte Carlo methods with a noninformation prior distribution for this model and determined the visit probability classified by keyword for each category. In the case of the keyword "clinic name," the visit probability to the website, repeated visit to the website, and contents page for medical examination was positive. In the case of the keyword "clinic name and regional name," the probability for a repeated visit to the website and the mammography screening page was negative. In the case of the keyword "clinic name + medical examination," the visit probability to the website was positive, and the visit probability to the information page was negative. When visitors referred to the keywords "mammography screening," the visit probability to the mammography screening page was positive (95% highest posterior density interval = 3.38-26.66). Further analysis for not only the clinic website but also various other medical institution websites is necessary to build a general inspection model for medical institution websites; we want to consider this in future research. Additionally, we hope to use the results obtained in this study as a prior distribution for future work to conduct higher-precision analysis.

  1. Behavioral Analysis of Visitors to a Medical Institution’s Website Using Markov Chain Monte Carlo Methods

    PubMed Central

    Tani, Yuji

    2016-01-01

    Background Consistent with the “attention, interest, desire, memory, action” (AIDMA) model of consumer behavior, patients collect information about available medical institutions using the Internet to select information for their particular needs. Studies of consumer behavior may be found in areas other than medical institution websites. Such research uses Web access logs for visitor search behavior. At this time, research applying the patient searching behavior model to medical institution website visitors is lacking. Objective We have developed a hospital website search behavior model using a Bayesian approach to clarify the behavior of medical institution website visitors and determine the probability of their visits, classified by search keyword. Methods We used the website data access log of a clinic of internal medicine and gastroenterology in the Sapporo suburbs, collecting data from January 1 through June 31, 2011. The contents of the 6 website pages included the following: home, news, content introduction for medical examinations, mammography screening, holiday person-on-duty information, and other. The search keywords we identified as best expressing website visitor needs were listed as the top 4 headings from the access log: clinic name, clinic name + regional name, clinic name + medical examination, and mammography screening. Using the search keywords as the explaining variable, we built a binomial probit model that allows inspection of the contents of each purpose variable. Using this model, we determined a beta value and generated a posterior distribution. We performed the simulation using Markov Chain Monte Carlo methods with a noninformation prior distribution for this model and determined the visit probability classified by keyword for each category. Results In the case of the keyword “clinic name,” the visit probability to the website, repeated visit to the website, and contents page for medical examination was positive. In the case of the keyword “clinic name and regional name,” the probability for a repeated visit to the website and the mammography screening page was negative. In the case of the keyword “clinic name + medical examination,” the visit probability to the website was positive, and the visit probability to the information page was negative. When visitors referred to the keywords “mammography screening,” the visit probability to the mammography screening page was positive (95% highest posterior density interval = 3.38-26.66). Conclusions Further analysis for not only the clinic website but also various other medical institution websites is necessary to build a general inspection model for medical institution websites; we want to consider this in future research. Additionally, we hope to use the results obtained in this study as a prior distribution for future work to conduct higher-precision analysis. PMID:27457537

  2. A Model Comparison for Count Data with a Positively Skewed Distribution with an Application to the Number of University Mathematics Courses Completed

    ERIC Educational Resources Information Center

    Liou, Pey-Yan

    2009-01-01

    The current study examines three regression models: OLS (ordinary least square) linear regression, Poisson regression, and negative binomial regression for analyzing count data. Simulation results show that the OLS regression model performed better than the others, since it did not produce more false statistically significant relationships than…
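
    A small illustration of the comparison described above, assuming simulated overdispersed counts and the statsmodels package; the covariate and coefficients are made up.

      # Sketch: compare OLS, Poisson, and negative binomial fits to simulated
      # overdispersed count data (illustrative; not the study's data).
      import numpy as np
      import statsmodels.api as sm

      rng = np.random.default_rng(1)
      n = 400
      x = rng.normal(size=n)
      X = sm.add_constant(x)
      mu = np.exp(0.5 + 0.7 * x)
      alpha = 1.0                                    # NB2 dispersion
      y = rng.negative_binomial(n=1 / alpha, p=1 / (1 + alpha * mu))

      ols = sm.OLS(y, X).fit()
      pois = sm.GLM(y, X, family=sm.families.Poisson()).fit()
      nb = sm.NegativeBinomial(y, X).fit(disp=0)     # estimates alpha by ML

      # The OLS AIC uses a Gaussian likelihood, so it is only a rough reference
      print("OLS AIC:    ", round(ols.aic, 1))
      print("Poisson AIC:", round(pois.aic, 1))
      print("NB AIC:     ", round(nb.aic, 1))
      print("estimated NB dispersion alpha:", round(nb.params[-1], 2))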

  3. A Negative Binomial Regression Model for Accuracy Tests

    ERIC Educational Resources Information Center

    Hung, Lai-Fa

    2012-01-01

    Rasch used a Poisson model to analyze errors and speed in reading tests. An important property of the Poisson distribution is that the mean and variance are equal. However, in social science research, it is very common for the variance to be greater than the mean (i.e., the data are overdispersed). This study embeds the Rasch model within an…

  4. Which Types of Televised Anti-Tobacco Campaigns Prompt More Quitline Calls from Disadvantaged Groups?

    ERIC Educational Resources Information Center

    Durkin, Sarah J.; Wakefield, Melanie A.; Spittal, Matthew J.

    2011-01-01

    To examine the efficacy of different types of mass media ads in driving smokers of lower socio-economic status (SES) to utilize quitlines. This study collected all 33 719 calls to the Victorian quitline in Australia over a 2-year period. Negative binomial regressions examined the relationship between weekly levels of exposure to different types of…


  5. Togetherness among Plasmodium falciparum gametocytes: interpretation through simulation and consequences for malaria transmission.

    PubMed

    Gaillard, F O; Boudin, C; Chau, N P; Robert, V; Pichon, G

    2003-11-01

    Previous experimental gametocyte infections of Anopheles arabiensis on 3 volunteers naturally infected with Plasmodium falciparum were conducted in Senegal. They showed that gametocyte counts in the mosquitoes are, like macroparasite intakes, heterogeneous (overdispersed). They followed a negative binomial distribution, the overdispersion coefficient seeming constant (k = 3.1). To try to explain this heterogeneity, we used an individual-based model (IBM), simulating the behaviour of gametocytes in the human blood circulation and their ingestion by mosquitoes. The hypothesis was that there exists a clustering of the gametocytes in the capillaries. From a series of simulations, in the case of clustering the following results were obtained: (i) the distribution of the gametocytes ingested by the mosquitoes followed a negative binomial, (ii) the k coefficient significantly increased with the density of circulating gametocytes. To validate this model result, 2 more experiments were conducted in Cameroon. Pooled experiments showed a distinct density dependency of the k-values. The simulation results and the experimental results were thus in agreement and suggested that an aggregation process at the microscopic level might produce the density-dependent overdispersion at the macroscopic level. Simulations also suggested that the clustering of gametocytes might facilitate fertilization of gametes.
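
    The clustering hypothesis can be imitated with a toy simulation (not the authors' individual-based model): each mosquito ingests a Poisson number of clusters whose sizes follow a logarithmic-series distribution, and the method-of-moments estimate of the negative binomial k then increases with mean density, as in the pooled experiments. The cluster-size parameter and densities below are arbitrary.

      # Toy simulation of clustered gametocyte ingestion: Poisson number of
      # clusters, logarithmic-series cluster sizes; the compound count is
      # negative binomial and the moment estimate of k grows with density.
      import numpy as np
      from scipy.stats import logser

      rng = np.random.default_rng(2)
      p_cluster = 0.7                    # logarithmic-series parameter (assumed)

      def simulate_counts(mean_clusters, n_mosq=5000):
          n_clusters = rng.poisson(mean_clusters, size=n_mosq)
          return np.array([logser.rvs(p_cluster, size=k, random_state=rng).sum()
                           if k else 0 for k in n_clusters])

      for lam in (1.0, 3.0, 9.0):        # increasing gametocyte density
          y = simulate_counts(lam)
          m, v = y.mean(), y.var(ddof=1)
          k_hat = m**2 / (v - m)         # method-of-moments k of the NB
          print(f"mean clusters {lam}: mean count {m:.2f}, k-hat {k_hat:.2f}")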

  6. Effectiveness on Early Childhood Caries of an Oral Health Promotion Program for Medical Providers

    PubMed Central

    Widmer-Racich, Katina; Sevick, Carter; Starzyk, Erin J.; Mauritson, Katya; Hambidge, Simon J.

    2017-01-01

    Objectives. To assess the impact on early childhood caries (ECC) of an oral health promotion (OHP) intervention for medical providers. Methods. We implemented a quasiexperimental OHP intervention in 8 federally qualified health centers that trained medical providers on ECC risk assessment, oral examination and instruction, dental referral, and fluoride varnish applications (FVAs). We measured OHP delivery by FVA count at medical visits. We measured the intervention’s impact on ECC in 3 unique cohorts of children aged 3 to 4 years in 2009 (preintervention; n = 202), 2011 (midintervention; n = 420), and 2015 (≥ 4 FVAs; n = 153). We compared numbers of decayed, missing, and filled tooth surfaces using adjusted zero-inflated negative binomial models. Results. Across 3 unique cohorts, the FVA mean (range) count was 0.0 (0), 1.1 (0–7), and 4.5 (4–7) in 2009, 2011, and 2015, respectively. In adjusted zero-inflated negative binomial model analyses, children in the 2015 cohort had significantly fewer decayed, missing, and filled tooth surfaces than did children in previous cohorts. Conclusions. An OHP intervention targeting medical providers reduced ECC when children received 4 or more FVAs at medical visits by age 3 years. PMID:28661802

  7. Co-Infestation and Spatial Distribution of Bactrocera carambolae and Anastrepha spp. (Diptera: Tephritidae) in Common Guava in the Eastern Amazon

    PubMed Central

    Deus, E. G.; Godoy, W. A. C.; Sousa, M. S. M.; Lopes, G. N.; Jesus-Barros, C. R.; Silva, J. G.; Adaime, R.

    2016-01-01

    Field infestation and spatial distribution of introduced Bactrocera carambolae Drew and Hancock and native species of Anastrepha in common guavas [Psidium guajava (L.)] were investigated in the eastern Amazon. Fruit sampling was carried out in the municipalities of Calçoene and Oiapoque in the state of Amapá, Brazil. The frequency distribution of larvae in fruit was fitted to the negative binomial distribution. Anastrepha striata was more abundant in both sampled areas in comparison to Anastrepha fraterculus (Wiedemann) and B. carambolae. The frequency distribution analysis of adults revealed an aggregated pattern for B. carambolae as well as for A. fraterculus and Anastrepha striata Schiner, described by the negative binomial distribution. Although the populations of Anastrepha spp. may have suffered some impact due to the presence of B. carambolae, the results are still not robust enough to indicate effective reduction in the abundance of Anastrepha spp. caused by B. carambolae in a general sense. The high degree of aggregation observed for both species suggests interspecific co-occurrence with the simultaneous presence of both species in the analysed fruit. Moreover, a significant fraction of uninfested guavas also indicated absence of competitive displacement. PMID:27638949

  8. The effect of a major cigarette price change on smoking behavior in california: a zero-inflated negative binomial model.

    PubMed

    Sheu, Mei-Ling; Hu, Teh-Wei; Keeler, Theodore E; Ong, Michael; Sung, Hai-Yen

    2004-08-01

    The objective of this paper is to determine the price sensitivity of smokers in their consumption of cigarettes, using evidence from a major increase in California cigarette prices due to Proposition 10 and the Tobacco Settlement. The study sample consists of individual survey data from Behavioral Risk Factor Survey (BRFS) and price data from the Bureau of Labor Statistics between 1996 and 1999. A zero-inflated negative binomial (ZINB) regression model was applied for the statistical analysis. The statistical model showed that price did not have an effect on reducing the estimated prevalence of smoking. However, it indicated that among smokers the price elasticity was at the level of -0.46 and statistically significant. Since smoking prevalence is significantly lower than it was a decade ago, price increases are becoming less effective as an inducement for hard-core smokers to quit, although they may respond by decreasing consumption. For those who only smoke occasionally (many of them being young adults) price increases alone may not be an effective inducement to quit smoking. Additional underlying behavioral factors need to be identified so that more effective anti-smoking strategies can be developed.
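
    For readers unfamiliar with the ZINB specification, the sketch below fits one with statsmodels on simulated data; the price covariate, coefficients, and sample are illustrative and are not the BRFS data.

      # Sketch of a zero-inflated negative binomial (ZINB) fit with statsmodels
      # on simulated data; "price" is an invented covariate, not BRFS data.
      import numpy as np
      import pandas as pd
      import statsmodels.api as sm
      from statsmodels.discrete.count_model import ZeroInflatedNegativeBinomialP

      rng = np.random.default_rng(3)
      n = 1000
      price = rng.uniform(2.0, 5.0, size=n)
      X = pd.DataFrame({"const": 1.0, "price": price})

      # Simulate: structural zeros (never-smokers) plus NB counts among smokers
      p_zero = 1 / (1 + np.exp(-(1.5 - 0.1 * price)))   # inflation probability
      mu = np.exp(3.0 - 0.4 * price)                     # cigarettes per day
      alpha = 0.8
      nb_counts = rng.negative_binomial(1 / alpha, 1 / (1 + alpha * mu))
      y = np.where(rng.uniform(size=n) < p_zero, 0, nb_counts)

      zinb = ZeroInflatedNegativeBinomialP(y, X, exog_infl=X, p=2)
      res = zinb.fit(method="bfgs", maxiter=500, disp=0)
      print(res.summary())
      # With a log link, (price coefficient) * (price level) approximates the
      # price elasticity of consumption among smokers at that price.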

  9. A Time Series Analysis: Weather Factors, Human Migration and Malaria Cases in Endemic Area of Purworejo, Indonesia, 2005–2014

    PubMed Central

    REJEKI, Dwi Sarwani Sri; NURHAYATI, Nunung; AJI, Budi; MURHANDARWATI, E. Elsa Herdiana; KUSNANTO, Hari

    2018-01-01

    Background: Climatic and weather factors are important determinants of the transmission of vector-borne diseases such as malaria. This study aimed to establish relationships between weather factors and malaria cases in endemic areas of Purworejo during 2005–2014, taking human migration and previous case findings into account. Methods: This study employed ecological time series analysis by using monthly data. The independent variables were the maximum temperature, minimum temperature, maximum humidity, minimum humidity, precipitation, human migration, and previous malaria cases, while the dependent variable was positive malaria cases. Three count data regression models, i.e. the Poisson model, quasi-Poisson model, and negative binomial model, were applied to measure the relationship. The lowest Akaike Information Criterion (AIC) value was used to select the best model. Negative binomial regression was selected as the best model. Results: The model showed that humidity (lag 2), precipitation (lag 3), precipitation (lag 12), migration (lag 1) and previous malaria cases (lag 12) had a significant relationship with malaria cases. Conclusion: Weather, migration, and previous malaria cases need to be considered as prominent indicators when projecting increases in malaria cases. PMID:29900134

  10. Population heterogeneity in the salience of multiple risk factors for adolescent delinquency.

    PubMed

    Lanza, Stephanie T; Cooper, Brittany R; Bray, Bethany C

    2014-03-01

    To present mixture regression analysis as an alternative to more standard regression analysis for predicting adolescent delinquency. We demonstrate how mixture regression analysis allows for the identification of population subgroups defined by the salience of multiple risk factors. We identified population subgroups (i.e., latent classes) of individuals based on their coefficients in a regression model predicting adolescent delinquency from eight previously established risk indices drawn from the community, school, family, peer, and individual levels. The study included N = 37,763 10th-grade adolescents who participated in the Communities That Care Youth Survey. Standard, zero-inflated, and mixture Poisson and negative binomial regression models were considered. Standard and mixture negative binomial regression models were selected as optimal. The five-class regression model was interpreted based on the class-specific regression coefficients, indicating that risk factors had varying salience across classes of adolescents. Standard regression showed that all risk factors were significantly associated with delinquency. Mixture regression provided more nuanced information, suggesting a unique set of risk factors that were salient for different subgroups of adolescents. Implications for the design of subgroup-specific interventions are discussed. Copyright © 2014 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.

  11. Variable selection for zero-inflated and overdispersed data with application to health care demand in Germany.

    PubMed

    Wang, Zhu; Ma, Shuangge; Wang, Ching-Yun

    2015-09-01

    In health services and outcome research, count outcomes are frequently encountered and often have a large proportion of zeros. The zero-inflated negative binomial (ZINB) regression model has important applications for this type of data. With many possible candidate risk factors, this paper proposes new variable selection methods for the ZINB model. We consider the maximum likelihood function plus a penalty including the least absolute shrinkage and selection operator (LASSO), smoothly clipped absolute deviation (SCAD), and minimax concave penalty (MCP). An EM (expectation-maximization) algorithm is proposed for estimating the model parameters and conducting variable selection simultaneously. This algorithm consists of estimating penalized weighted negative binomial models and penalized logistic models via the coordinate descent algorithm. Furthermore, statistical properties including the standard error formulae are provided. A simulation study shows that the new algorithm not only has more accurate or at least comparable estimation, but also is more robust than the traditional stepwise variable selection. The proposed methods are applied to analyze the health care demand in Germany using the open-source R package mpath. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  12. Not seeking yet trying long-acting reversible contraception: a 24-month randomized trial on continuation, unintended pregnancy and satisfaction.

    PubMed

    Hubacher, David; Spector, Hannah; Monteith, Charles; Chen, Pai-Lien

    2018-06-01

    To measure the 24-month impact on continuation, unintended pregnancy and satisfaction of trying long-acting reversible contraception (LARC) in a population seeking short-acting reversible contraception (SARC). We enrolled 916 women aged 18-29 who were seeking pills or injectables in a partially randomized patient preference trial. Women with strong preferences for pills or injectables started on those products, while others opted for randomization to LARC or SARC and received their methods gratis. We estimated continuation and unintended pregnancy rates through 24 months. Intent-to-treat principles were applied after method initiation for comparing incidence of unintended pregnancy. We also examined how satisfaction levels varied by cohort and how baseline negative LARC attitudes were associated with satisfaction over time. Forty-three percent chose randomization, and 57% chose the preference option. Complete loss to follow-up was <2%. The 24-month LARC continuation probability was 64.3% [95% confidence interval (CI): 56.6-70.9], statistically higher than SARC groups [25.5% (randomized) and 40.0% (preference)]. The 24-month cumulative unintended pregnancy probabilities were 9.9% (95% CI: 7.2-12.6) (preference-SARC), 6.9% (95% CI: 3.3-10.6) (randomized-SARC) and 3.6% (95% CI: 1.8-6.4) (randomized-LARC). Statistical tests for comparing randomized groups on unintended pregnancy were mixed: binomial at 24-month time point (p=.02) and log-rank survival probabilities (p=.14 for first pregnancies and p=.07 when including second pregnancies). LARC satisfaction was high (80% happy/neutral, 73% would use LARC again, 81% would recommend to a friend). Baseline negative attitudes toward LARC (27%) were not clearly associated with satisfaction or early discontinuation. The decision to try LARC resulted in high continuation rates and substantial protection from unintended pregnancy over 24 months. Despite participants' initial desires to begin short-acting regimens, they had high satisfaction with LARC. Voluntary decisions to try LARC will benefit large proportions of typical SARC users. Even women who do not necessarily view LARC as a first choice may have a highly satisfying experience and avoid unintended pregnancy if they try it. Copyright © 2018 Elsevier Inc. All rights reserved.

  13. Out-of-pocket expenditures for pharmaceuticals: lessons from the Austrian household budget survey.

    PubMed

    Sanwald, Alice; Theurl, Engelbert

    2017-05-01

    Paying pharmaceuticals out of pocket is an important source of financing pharmaceutical consumption. Only limited empirical knowledge is available on the determinants of these expenditures. In this article we analyze which characteristics of private households influence out-of-pocket pharmaceutical expenditure (OOPPE) in Austria. We use cross-sectional information on OOPPE and household characteristics provided by the Austrian household budget survey 2009/10. We split pharmaceutical expenditures into the two components prescription fees and over-the-counter (OTC) expenditures. To adjust for the specific characteristics of the data, we compare different econometric approaches: a two-part model, hurdle model, generalized linear model and zero-inflated negative binomial regression model. The finally selected econometric approaches give a quite consistent picture. The probability of expenditures of both types is strongly influenced by the household structure. It increases with age, doctoral visits and the presence of a female householder. The education level and income only increase the probability of OTC pharmaceuticals. The level of OTC expenditures remains widely unexplained while the household structure and age influence the expenditures for prescription fees. Insurance characteristics of private households, either private or public, play a minor role in explaining the expenditure levels in all specifications. This refers to a homogeneous and comprehensive provision of pharmaceuticals in the public part of the Austrian health care system. The article gives useful insights into the determinants of pharmaceutical expenditures of private households and supplements the previous research that focuses on the individual level.
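
    A minimal sketch of one of the compared specifications, the two-part model, on simulated household data (covariates and coefficients are invented): a logit for any spending followed by a regression for the positive amounts. A gamma GLM with a log link is a common alternative for the second part.

      # Sketch of a two-part model for out-of-pocket expenditures:
      # part 1 = logit for any spending, part 2 = OLS on log(positive amounts).
      import numpy as np
      import statsmodels.api as sm

      rng = np.random.default_rng(4)
      n = 2000
      age = rng.uniform(20, 80, size=n)
      female_head = rng.integers(0, 2, size=n)
      X = sm.add_constant(np.column_stack([age, female_head]))

      # Simulate spending: probability of any OTC purchase rises with age
      p_any = 1 / (1 + np.exp(-(-2.0 + 0.03 * age + 0.3 * female_head)))
      any_spend = rng.binomial(1, p_any)
      amount = np.where(any_spend == 1,
                        rng.gamma(shape=2.0, scale=np.exp(1.0 + 0.01 * age) / 2.0),
                        0.0)

      part1 = sm.Logit(any_spend, X).fit(disp=0)
      pos = amount > 0
      part2 = sm.OLS(np.log(amount[pos]), X[pos]).fit()

      # Expected spending = P(any) * E(amount | any); the naive exp()
      # retransformation below ignores the usual smearing correction.
      expected = part1.predict(X) * np.exp(part2.predict(X))
      print("part 1 (any spending) coefficients:", part1.params.round(3))
      print("part 2 (log amount) coefficients:  ", part2.params.round(3))
      print("mean predicted expenditure:", expected.mean().round(2))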

  14. Impact of early childhood caries on oral health-related quality of life of preschool children.

    PubMed

    Li, M Y; Zhi, Q H; Zhou, Y; Qiu, R M; Lin, H C

    2015-03-01

    Child oral health-related quality of life (COHRQoL) has been assessed in developed areas; however, it remains unstudied in mainland China. Studies on COHRQoL would benefit a large number of children in China suffering from oral health problems such as dental caries. This study explored the relationship between COHRQoL and early childhood caries, adjusted by socioeconomic factors, in 3- to 4-year-old children in a region of southern China. In this study, 1062 children aged 3-4 years were recruited by cluster sampling and their oral health statuses were examined by a trained dentist. The Chinese version of the Early Childhood Oral Health Impact Scale (ECOHIS) and questions about the children's socioeconomic conditions were completed by the children's parents. A negative binomial regression analysis was used to assess the prevalence of early childhood caries among the children and its influence on COHRQoL. The total ECOHIS scores of the returned scale sets ranged from 0 to 31, and the average score was 3.1±5.1. The negative binomial analysis showed that the dmfs indices were significantly associated with the ECOHIS score and subscale scores (P<0.05). The multivariate adjusted model showed that a higher dmft index was associated with greater negative impact on COHRQoL (RR = 1.10; 95% CI = 1.07, 1.13; P < 0.05). However, demographic and socioeconomic factors were not associated with COHRQoL (P>0.05). The severity of early childhood caries has a negative impact on the oral health-related quality of life of preschool children and their parents.

  15. Time analysis of volcanic activity on Io by means of plasma observations

    NASA Technical Reports Server (NTRS)

    Mekler, Y.; Eviatar, A.

    1980-01-01

    A model of Io volcanism in which the probability of activity obeys a binomial distribution is presented. Observed values of the electron density obtained over a 3-year period by ground-based spectroscopy are fitted to such a distribution. The best fit is found for a total number of 15 volcanoes with a probability of individual activity at any time of 0.143. The Pioneer 10 ultraviolet observations are reinterpreted as emissions of sulfur and oxygen ions and are found to be consistent with a plasma much less dense than that observed by the Voyager spacecraft. Late 1978 and the first half of 1979 are shown to be periods of anomalous volcanicity. Rapid variations in electron density are related to enhanced radial diffusion.

  16. A Financial Market Model Incorporating Herd Behaviour

    PubMed Central

    2016-01-01

    Herd behaviour in financial markets is a recurring phenomenon that exacerbates asset price volatility, and is considered a possible contributor to market fragility. While numerous studies investigate herd behaviour in financial markets, it is often considered without reference to the pricing of financial instruments or other market dynamics. Here, a trader interaction model based upon informational cascades in the presence of information thresholds is used to construct a new model of asset price returns that allows for both quiescent and herd-like regimes. Agent interaction is modelled using a stochastic pulse-coupled network, parametrised by information thresholds and a network coupling probability. Agents may possess either one or two information thresholds that, in each case, determine the number of distinct states an agent may occupy before trading takes place. In the case where agents possess two thresholds (labelled as the finite state-space model, corresponding to agents’ accumulating information over a bounded state-space), and where coupling strength is maximal, an asymptotic expression for the cascade-size probability is derived and shown to follow a power law when a critical value of network coupling probability is attained. For a range of model parameters, a mixture of negative binomial distributions is used to approximate the cascade-size distribution. This approximation is subsequently used to express the volatility of model price returns in terms of the model parameter which controls the network coupling probability. In the case where agents possess a single pulse-coupling threshold (labelled as the semi-infinite state-space model corresponding to agents’ accumulating information over an unbounded state-space), numerical evidence is presented that demonstrates volatility clustering and long-memory patterns in the volatility of asset returns. Finally, output from the model is compared to both the distribution of historical stock returns and the market price of an equity index option. PMID:27007236

  17. Estimation of aquifer scale proportion using equal area grids: assessment of regional scale groundwater quality

    USGS Publications Warehouse

    Belitz, Kenneth; Jurgens, Bryant C.; Landon, Matthew K.; Fram, Miranda S.; Johnson, Tyler D.

    2010-01-01

    The proportion of an aquifer with constituent concentrations above a specified threshold (high concentrations) is taken as a nondimensional measure of regional scale water quality. If computed on the basis of area, it can be referred to as the aquifer scale proportion. A spatially unbiased estimate of aquifer scale proportion and a confidence interval for that estimate are obtained through the use of equal area grids and the binomial distribution. Traditionally, the confidence interval for a binomial proportion is computed using either the standard interval or the exact interval. Research from the statistics literature has shown that the standard interval should not be used and that the exact interval is overly conservative. On the basis of coverage probability and interval width, the Jeffreys interval is preferred. If more than one sample per cell is available, cell declustering is used to estimate the aquifer scale proportion, and Kish's design effect may be useful for estimating an effective number of samples. The binomial distribution is also used to quantify the adequacy of a grid with a given number of cells for identifying a small target, defined as a constituent that is present at high concentrations in a small proportion of the aquifer. Case studies illustrate a consistency between approaches that use one well per grid cell and many wells per cell. The methods presented in this paper provide a quantitative basis for designing a sampling program and for utilizing existing data.
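
    A short sketch of the recommended interval, assuming a grid with one sampled well per cell and made-up counts: the Jeffreys interval is the central 95% region of a Beta(x + 1/2, n - x + 1/2) distribution, shown next to the standard (Wald) interval for contrast.

      # Sketch: Jeffreys interval for an aquifer-scale proportion, i.e. the
      # fraction of equal-area grid cells with a "high" concentration.
      import numpy as np
      from scipy.stats import beta, norm

      n_cells = 60          # grid cells, one sampled well each (assumed)
      n_high = 7            # cells whose well exceeds the threshold

      p_hat = n_high / n_cells

      # Jeffreys interval: quantiles of Beta(x + 1/2, n - x + 1/2)
      jeff_lo = beta.ppf(0.025, n_high + 0.5, n_cells - n_high + 0.5)
      jeff_hi = beta.ppf(0.975, n_high + 0.5, n_cells - n_high + 0.5)

      # Standard (Wald) interval, shown only for contrast; it is not recommended
      z = norm.ppf(0.975)
      se = np.sqrt(p_hat * (1 - p_hat) / n_cells)
      wald = (p_hat - z * se, p_hat + z * se)

      print(f"estimate {p_hat:.3f}")
      print(f"Jeffreys 95% CI ({jeff_lo:.3f}, {jeff_hi:.3f})")
      print(f"Wald     95% CI ({wald[0]:.3f}, {wald[1]:.3f})")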

  18. CROSSER - CUMULATIVE BINOMIAL PROGRAMS

    NASA Technical Reports Server (NTRS)

    Bowerman, P. N.

    1994-01-01

    The cumulative binomial program, CROSSER, is one of a set of three programs which calculate cumulative binomial probability distributions for arbitrary inputs. The three programs, CROSSER, CUMBIN (NPO-17555), and NEWTONP (NPO-17556), can be used independently of one another. CROSSER can be used by statisticians and users of statistical procedures, test planners, designers, and numerical analysts. The program has been used for reliability/availability calculations. CROSSER calculates the point at which the reliability of a k-out-of-n system equals the common reliability of the n components. It is designed to work well with all integer values 0 < k <= n. To run the program, the user simply runs the executable version and inputs the information requested by the program. The program is not designed to weed out incorrect inputs, so the user must take care to make sure the inputs are correct. Once all input has been entered, the program calculates and lists the result. It also lists the number of iterations of Newton's method required to calculate the answer within the given error. The CROSSER program is written in C. It was developed on an IBM AT with a numeric co-processor using Microsoft C 5.0. Because the source code is written using standard C structures and functions, it should compile correctly with most C compilers. The program format is interactive. It has been implemented under DOS 3.2 and has a memory requirement of 26K. CROSSER was developed in 1988.
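
    The quantity CROSSER computes can be sketched as follows (in Python rather than C, and with a bracketing root-finder in place of Newton's method): the component reliability p at which a k-out-of-n system is exactly as reliable as a single component, for 2 <= k <= n - 1.

      # Sketch of the crossover calculation: solve P(X >= k) = p for
      # X ~ Binomial(n, p), i.e. system reliability equals component reliability.
      from scipy.stats import binom
      from scipy.optimize import brentq

      def crossover(k, n):
          """Interior root of binom.sf(k-1, n, p) - p, valid for 2 <= k <= n - 1."""
          f = lambda p: binom.sf(k - 1, n, p) - p
          return brentq(f, 1e-9, 1 - 1e-9)

      print(crossover(3, 5))   # majority voting: crossover at 0.5
      print(crossover(3, 8))   # above this p, the redundant system beats one component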

  19. Do Barriers to Crime Prevention Moderate the Effects of Situational Crime Prevention Policies on Violent Crime in High Schools?

    ERIC Educational Resources Information Center

    Sevigny, Eric L.; Zhang, Gary

    2018-01-01

    This study investigates how barriers to school-based crime prevention programming moderate the effects of situational crime prevention (SCP) policies on levels of violent crime in U.S. public high schools. Using data from the 2008 School Survey on Crime and Safety, we estimate a series of negative binomial regression models with interactions to…

  20. Evaluation of the Johne's disease risk assessment and management plan on dairy farms in Ontario, Canada.

    PubMed

    Pieper, Laura; Sorge, Ulrike S; DeVries, Trevor J; Godkin, Ann; Lissemore, Kerry; Kelton, David F

    2015-10-01

    Johne's disease (JD) is a production-limiting gastrointestinal disease in cattle. To minimize the effects of JD, the Ontario dairy industry launched the Ontario Johne's Education and Management Assistance Program in 2010. As part of the program, trained veterinarians conducted a risk assessment and management plan (RAMP), an on-farm questionnaire where high RAMP scores are associated with high risk of JD transmission. Subsequently, veterinarians recommended farm-specific management practices for JD prevention. Milk or serum ELISA results from the milking herd were used to determine the herd ELISA status (HES) and within-herd prevalence. After 3.5 yr of implementation of the program, the aim of this study was to evaluate the associations among RAMP scores, HES, and recommendations. Data from 2,103 herds were available for the analyses. A zero-inflated negative binomial model for the prediction of the number of ELISA-positive animals per farm was built. The model included individual RAMP questions about purchasing animals in the logistic portion, indicating risks for between-herd transmission, and purchasing bulls, birth of calves outside the designated calving area, colostrum and milk feeding management, and adult cow environmental hygiene in the negative binomial portion, indicating risk factors for within-herd transmission. However, farms which fed low-risk milk compared with milk replacer had fewer seropositive animals. The model additionally included the JD herd history in the negative binomial and the logistic portion, indicating that herds with a JD herd history were more likely to have at least 1 positive animal and to have a higher number of positive animals. Generally, a positive association was noted between RAMP scores and the odds of receiving a recommendation for the respective risk area; however, the relationship was not always linear. For general JD risk and calving area risk, seropositive herds had higher odds of receiving recommendations compared with seronegative herds if the section scores were low. This study suggests that the RAMP is a valuable tool to assess the risk for JD transmission within and between herds and to determine farm-specific recommendations for JD prevention. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  1. Predictive accuracy of particle filtering in dynamic models supporting outbreak projections.

    PubMed

    Safarishahrbijari, Anahita; Teyhouee, Aydin; Waldner, Cheryl; Liu, Juxin; Osgood, Nathaniel D

    2017-09-26

    While a new generation of computational statistics algorithms and availability of data streams raises the potential for recurrently regrounding dynamic models with incoming observations, the effectiveness of such arrangements can be highly subject to specifics of the configuration (e.g., frequency of sampling and representation of behaviour change), and there has been little attempt to identify effective configurations. Combining dynamic models with particle filtering, we explored a solution focusing on creating quickly formulated models regrounded automatically and recurrently as new data becomes available. Given a latent underlying case count, we assumed that observed incident case counts followed a negative binomial distribution. In accordance with the condensation algorithm, each such observation led to updating of particle weights. We evaluated the effectiveness of various particle filtering configurations against each other and against an approach without particle filtering according to the accuracy of the model in predicting future prevalence, given data to a certain point and a norm-based discrepancy metric. We examined the effectiveness of particle filtering under varying times between observations, negative binomial dispersion parameters, and rates with which the contact rate could evolve. We observed that more frequent observations of empirical data yielded super-linearly improved accuracy in model predictions. We further found that for the data studied here, the most favourable assumptions to make regarding the parameters associated with the negative binomial distribution and changes in contact rate were robust across observation frequency and the observation point in the outbreak. Combining dynamic models with particle filtering can perform well in projecting future evolution of an outbreak. Most importantly, the remarkable improvements in predictive accuracy resulting from more frequent sampling suggest that investments to achieve efficient reporting mechanisms may be more than paid back by improved planning capacity. The robustness of the results on particle filter configuration in this case study suggests that it may be possible to formulate effective standard guidelines and regularized approaches for such techniques in particular epidemiological contexts. Most importantly, the work tentatively suggests potential for health decision makers to secure strong guidance when anticipating outbreak evolution for emerging infectious diseases by combining even very rough models with particle filtering method.
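
    A minimal bootstrap particle filter with a negative binomial observation model, much simpler than the models in the paper: the latent case count grows stochastically, weekly counts are NB-distributed around it, and particles are weighted and resampled at each observation. The growth rates, dispersion, and counts are invented.

      # Minimal bootstrap particle filter sketch with NB observations.
      import numpy as np
      from scipy.stats import nbinom

      rng = np.random.default_rng(5)
      r = 10.0                                   # NB dispersion (assumed)

      def nb_pmf(y, mean):
          p = r / (r + mean)                     # NB parameterised by mean and r
          return nbinom.pmf(y, r, p)

      # Simulate a "true" outbreak and weekly observed counts
      T, true = 20, [50.0]
      for _ in range(T - 1):
          true.append(max(1.0, true[-1] * np.exp(rng.normal(0.1, 0.15))))
      obs = [rng.negative_binomial(r, r / (r + m)) for m in true]

      # Propagate particles, weight by the NB likelihood, resample
      n_part = 2000
      particles = np.full(n_part, 50.0)
      estimates = []
      for y in obs:
          particles = np.maximum(1.0, particles * np.exp(rng.normal(0.1, 0.15, n_part)))
          w = nb_pmf(y, particles)
          w = w / w.sum()
          idx = rng.choice(n_part, size=n_part, p=w)   # multinomial resampling
          particles = particles[idx]
          estimates.append(particles.mean())

      print(np.round(true, 1))
      print(np.round(estimates, 1))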

  2. Seasonal changes in spatial patterns of two annual plants in the Chihuahuan Desert, USA

    USGS Publications Warehouse

    Yin, Z.-Y.; Guo, Q.; Ren, H.; Peng, S.-L.

    2005-01-01

    Spatial pattern of a biotic population may change over time as its component individuals grow or die out, but whether this is the case for desert annual plants is largely unknown. Here we examined seasonal changes in spatial patterns of two annuals, Eriogonum abertianum and Haplopappus gracilis, in initial (winter) and final (summer) densities. The density was measured as the number of individuals from 384 permanent quadrats (each 0.5 m × 0.5 m) in the Chihuahuan Desert near Portal, Arizona, USA. We used three probability distributions (binomial, Poisson, and negative binomial or NB) that represent three basic spatial patterns (regular, random, and clumped) to fit the observed frequency distributions of densities of the two annuals. Both species showed clear clumped patterns as characterized by the NB and had similar inverse J-shaped frequency distribution curves in two density categories. Also, both species displayed a reduced degree of aggregation from winter to summer after the spring drought (massive die-off), as indicated by the increased k-parameter of the NB and decreased values of another NB parameter p, variance/mean ratio, Lloyd’s Index of Patchiness, and David and Moore’s Index of Clumping. Further, we hypothesized that while the NB (i.e., Poisson-logarithmic) well fits the distribution of individuals per quadrat, its components, the Poisson and logarithmic, may describe the distributions of clumps per quadrat and of individuals per clump, respectively. We thus obtained the means and variances for (1) individuals per quadrat, (2) clumps per quadrat, and (3) individuals per clump. The results showed that the decrease of the density from winter to summer for each plant resulted from the decrease of individuals per clump, rather than from the decrease of clumps per quadrat. The great similarities between the two annuals indicate that our observed temporal changes in spatial patterns may be common among desert annual plants.
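
    The aggregation statistics named above can be computed from a vector of per-quadrat counts as sketched below (simulated counts; the indices are written as they are usually defined, which may differ slightly from the authors' conventions).

      # Sketch: aggregation statistics for per-quadrat counts, including the
      # method-of-moments estimate of the negative binomial k.
      import numpy as np

      rng = np.random.default_rng(6)
      # Simulated clumped counts for 384 quadrats (stand-in for field data)
      counts = rng.negative_binomial(n=0.8, p=0.8 / (0.8 + 2.0), size=384)

      m = counts.mean()
      v = counts.var(ddof=1)

      var_mean_ratio = v / m                     # 1 under a random (Poisson) pattern
      k_mom = m**2 / (v - m)                     # NB k by the method of moments
      clumping = v / m - 1                       # David and Moore's index of clumping
      patchiness = 1 + (v - m) / m**2            # Lloyd's index of patchiness

      print(f"mean {m:.2f}, variance {v:.2f}")
      print(f"variance/mean {var_mean_ratio:.2f}, k {k_mom:.2f}")
      print(f"clumping {clumping:.2f}, patchiness {patchiness:.2f}")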

  3. Collective Human Mobility Pattern from Taxi Trips in Urban Area

    PubMed Central

    Peng, Chengbin; Jin, Xiaogang; Wong, Ka-Chun; Shi, Meixia; Liò, Pietro

    2012-01-01

    We analyze the passengers' traffic pattern for 1.58 million taxi trips of Shanghai, China. By employing the non-negative matrix factorization and optimization methods, we find that, people travel on workdays mainly for three purposes: commuting between home and workplace, traveling from workplace to workplace, and others such as leisure activities. Therefore, traffic flow in one area or between any pair of locations can be approximated by a linear combination of three basis flows, corresponding to the three purposes respectively. We name the coefficients in the linear combination as traffic powers, each of which indicates the strength of each basis flow. The traffic powers on different days are typically different even for the same location, due to the uncertainty of the human motion. Therefore, we provide a probability distribution function for the relative deviation of the traffic power. This distribution function is in terms of a series of functions for normalized binomial distributions. It can be well explained by statistical theories and is verified by empirical data. These findings are applicable in predicting the road traffic, tracing the traffic pattern and diagnosing the traffic related abnormal events. These results can also be used to infer land uses of urban area quite parsimoniously. PMID:22529917

  4. [Sequential sampling plans to Orthezia praelonga Douglas (Hemiptera: Sternorrhyncha, Ortheziidae) in citrus].

    PubMed

    Costa, Marilia G; Barbosa, José C; Yamamoto, Pedro T

    2007-01-01

    The sequential sampling is characterized by using samples of variable sizes, and has the advantage of reducing sampling time and costs if compared to fixed-size sampling. To introduce an adequate management for orthezia, sequential sampling plans were developed for orchards under low and high infestation. Data were collected in Matão, SP, in commercial stands of the orange variety 'Pêra Rio', at five, nine and 15 years of age. Twenty samplings were performed in the whole area of each stand by observing the presence or absence of scales on plants, being plots comprised of ten plants. After observing that in all of the three stands the scale population was distributed according to the contagious model, fitting the Negative Binomial Distribution in most samplings, two sequential sampling plans were constructed according to the Sequential Likelihood Ratio Test (SLRT). To construct these plans an economic threshold of 2% was adopted and the type I and II error probabilities were fixed in alpha = beta = 0.10. Results showed that the maximum numbers of samples expected to determine control need were 172 and 76 samples for stands with low and high infestation, respectively.

  5. Equilibrium problems for Raney densities

    NASA Astrophysics Data System (ADS)

    Forrester, Peter J.; Liu, Dang-Zheng; Zinn-Justin, Paul

    2015-07-01

    The Raney numbers are a class of combinatorial numbers generalising the Fuss-Catalan numbers. They are indexed by a pair of positive real numbers (p, r) with p > 1 and 0 < r ⩽ p, and form the moments of a probability density function. For certain (p, r) the latter has the interpretation as the density of squared singular values for certain random matrix ensembles, and in this context equilibrium problems characterising the Raney densities for (p, r) = (θ + 1, 1) and (θ/2 + 1, 1/2) have recently been proposed. Using two different techniques—one based on the Wiener-Hopf method for the solution of integral equations and the other on an analysis of the algebraic equation satisfied by the Green's function—we establish the validity of the equilibrium problems for general θ > 0 and similarly use both methods to identify the equilibrium problem for (p, r) = (θ/q + 1, 1/q), θ > 0 and q a positive integer. The Wiener-Hopf method is used to extend the latter to parameters (p, r) = (θ/q + 1, m + 1/q) for m a non-negative integer, and also to identify the equilibrium problem for a family of densities with moments given by certain binomial coefficients.
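
    For orientation, the Raney numbers referred to here are commonly written in the two-parameter Fuss-Catalan form (conventions vary between references, so the paper's normalisation should be checked):

      R_n(p, r) = \frac{r}{np + r} \binom{np + r}{n}

    With r = 1 and p = 2 this reduces to the Catalan numbers, and r = 1 with general p gives the Fuss-Catalan numbers.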

  6. [Monitoring microbiological safety of small systems of water distribution. Comparison of two sampling programs in a town in central Italy].

    PubMed

    Papini, Paolo; Faustini, Annunziata; Manganello, Rosa; Borzacchi, Giancarlo; Spera, Domenico; Perucci, Carlo A

    2005-01-01

    To determine the frequency of sampling in small water distribution systems (<5,000 inhabitants) and compare the results according to different hypotheses about the bacterial distribution. We carried out two sampling programs to monitor the water distribution system in a town in Central Italy between July and September 1992; the Poisson distribution assumption implied 4 water samples, the assumption of negative binomial distribution implied 21 samples. Coliform organisms were used as indicators of water safety. The network consisted of two pipe rings and two wells fed by the same water source. The number of summer customers varied considerably from 3,000 to 20,000. The mean density was 2.33 coliforms/100 ml (sd = 5.29) for 21 samples and 3 coliforms/100 ml (sd = 6) for four samples. However, the hypothesis of homogeneity was rejected (p-value <0.001) and the probability of a type II error with the assumption of heterogeneity was higher with 4 samples (beta = 0.24) than with 21 (beta = 0.05). For this small network, determining the sample size according to the heterogeneity hypothesis strengthens the statement that the water is drinkable compared with the homogeneity assumption.

  7. Implementing reduced-risk integrated pest management in fresh-market cabbage: influence of sampling parameters, and validation of binomial sequential sampling plans for the cabbage looper (Lepidoptera: Noctuidae).

    PubMed

    Burkness, Eric C; Hutchison, W D

    2009-10-01

    Populations of cabbage looper, Trichoplusia ni (Lepidoptera: Noctuidae), were sampled in experimental plots and commercial fields of cabbage (Brassica spp.) in Minnesota during 1998-1999 as part of a larger effort to implement an integrated pest management program. Using a resampling approach and Wald's sequential probability ratio test, sampling plans with different sampling parameters were evaluated using independent presence/absence and enumerative data. Evaluations and comparisons of the different sampling plans were made based on the operating characteristic and average sample number functions generated for each plan and through the use of a decision probability matrix. Values for upper and lower decision boundaries, sequential error rates (alpha, beta), and tally threshold were modified to determine parameter influence on the operating characteristic and average sample number functions. The following parameters resulted in the most desirable operating characteristic and average sample number functions: action threshold of 0.1 proportion of plants infested, tally threshold of 1, alpha = beta = 0.1, upper boundary of 0.15, lower boundary of 0.05, and resampling with replacement. We found that sampling parameters can be modified and evaluated using resampling software to achieve desirable operating characteristic and average sample number functions. Moreover, management of T. ni by using binomial sequential sampling should provide a good balance between cost and reliability by minimizing sample size and maintaining a high level of correct decisions (>95%) to treat or not treat.
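
    The decision rule underlying such plans is Wald's sequential probability ratio test; the sketch below uses the parameters reported above (p0 = 0.05, p1 = 0.15, alpha = beta = 0.1) on simulated presence/absence data and is an illustration, not the authors' resampling software.

      # Sketch of Wald's SPRT for binomial presence/absence sampling.
      import numpy as np

      p0, p1 = 0.05, 0.15            # lower and upper decision boundaries
      alpha = beta = 0.10
      A, B = np.log((1 - beta) / alpha), np.log(beta / (1 - alpha))

      def sprt(plants):
          """Return decision and sample size for a sequence of 0/1 infestation calls."""
          llr = 0.0
          for i, infested in enumerate(plants, start=1):
              if infested:
                  llr += np.log(p1 / p0)
              else:
                  llr += np.log((1 - p1) / (1 - p0))
              if llr >= A:
                  return "treat (infestation >= p1)", i
              if llr <= B:
                  return "do not treat (infestation <= p0)", i
          return "no decision", len(plants)

      rng = np.random.default_rng(7)
      for true_p in (0.03, 0.10, 0.20):
          sample = rng.binomial(1, true_p, size=500)
          print(true_p, sprt(sample))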

  8. Not in My Back Yard: A Comparative Analysis of Crime Around Publicly Funded Drug Treatment Centers, Liquor Stores, Convenience Stores, and Corner Stores in One Mid-Atlantic City.

    PubMed

    Furr-Holden, C Debra M; Milam, Adam J; Nesoff, Elizabeth D; Johnson, Renee M; Fakunle, David O; Jennings, Jacky M; Thorpe, Roland J

    2016-01-01

    This research examined whether publicly funded drug treatment centers (DTCs) were associated with violent crime in excess of the violence happening around other commercial businesses. Violent crime data and locations of community entities were geocoded and mapped. DTCs and other retail outlets were matched based on a Neighborhood Disadvantage score at the census tract level. Street network buffers ranging from 100 to 1,400 feet were placed around each location. Negative binomial regression models were used to estimate the relationship between the count of violent crimes and the distance from each business type. Compared with the mean count of violent crime around drug treatment centers (M = 2.87), the mean count of violent crime was significantly higher around liquor stores (M = 3.98; t test; p < .01) and corner stores (M = 3.78; t test; p < .01), and there was no statistically significant difference between the count around convenience stores (M = 2.65; t test; p = .32). In the adjusted negative binomial regression models, there was a negative and significant relationship between the count of violent crime and the distance from drug treatment centers (β = -.069, p < .01), liquor stores (β = -.081, p < .01), corner stores (β = -.116, p < .01), and convenience stores (β = -.154, p < .01). Violent crime associated with drug treatment centers is similar to that associated with liquor stores and is less frequent than that associated with convenience stores and corner stores.

  9. Detection rates of geckos in visual surveys: Turning confounding variables into useful knowledge

    USGS Publications Warehouse

    Lardner, Bjorn; Rodda, Gordon H.; Yackel Adams, Amy A.; Savidge, Julie A.; Reed, Robert N.

    2016-01-01

    Transect surveys without some means of estimating detection probabilities generate population size indices prone to bias because survey conditions differ in time and space. Knowing what causes such bias can help guide the collection of relevant survey covariates, correct the survey data, anticipate situations where bias might be unacceptably large, and elucidate the ecology of target species. We used negative binomial regression to evaluate confounding variables for gecko (primarily Hemidactylus frenatus and Lepidodactylus lugubris) counts on 220-m-long transects surveyed at night, primarily for snakes, on 9,475 occasions. Searchers differed in gecko detection rates by up to a factor of six. The worst and best headlamps differed by a factor of at least two. Strong winds had a negative effect potentially as large as those of searchers or headlamps. More geckos were seen during wet weather conditions, but the effect size was small. Compared with a detection nadir during waxing gibbous (nearly full) moons above the horizon, we saw 28% more geckos during waning crescent moons below the horizon. A sine function suggested that we saw 24% more geckos at the end of the wet season than at the end of the dry season. Fluctuations on a longer timescale also were verified. Disturbingly, corrected data exhibited strong short-term fluctuations that covariates apparently failed to capture. Although some biases can be addressed with measured covariates, others will be difficult to eliminate as a significant source of error in longterm monitoring programs.

  10. Mixture models for estimating the size of a closed population when capture rates vary among individuals

    USGS Publications Warehouse

    Dorazio, R.M.; Royle, J. Andrew

    2003-01-01

    We develop a parameterization of the beta-binomial mixture that provides sensible inferences about the size of a closed population when probabilities of capture or detection vary among individuals. Three classes of mixture models (beta-binomial, logistic-normal, and latent-class) are fitted to recaptures of snowshoe hares for estimating abundance and to counts of bird species for estimating species richness. In both sets of data, rates of detection appear to vary more among individuals (animals or species) than among sampling occasions or locations. The estimates of population size and species richness are sensitive to model-specific assumptions about the latent distribution of individual rates of detection. We demonstrate using simulation experiments that conventional diagnostics for assessing model adequacy, such as deviance, cannot be relied on for selecting classes of mixture models that produce valid inferences about population size. Prior knowledge about sources of individual heterogeneity in detection rates, if available, should be used to help select among classes of mixture models that are to be used for inference.

  11. Sequential sampling of ribes populations in the control of white pine blister rust (Cronartium ribicola Fischer) in California

    Treesearch

    Harold R. Offord

    1966-01-01

    Sequential sampling based on a negative binomial distribution of ribes populations required less than half the time taken by regular systematic line transect sampling in a comparison test. It gave the same control decision as the regular method in 9 of 13 field trials. A computer program that permits sequential plans to be built readily for other white pine regions is...

  12. Logistic quantile regression provides improved estimates for bounded avian counts: a case study of California Spotted Owl fledgling production

    Treesearch

    Brian S. Cade; Barry R. Noon; Rick D. Scherer; John J. Keane

    2017-01-01

    Counts of avian fledglings, nestlings, or clutch size that are bounded below by zero and above by some small integer form a discrete random variable distribution that is not approximated well by conventional parametric count distributions such as the Poisson or negative binomial. We developed a logistic quantile regression model to provide estimates of the empirical...

  13. Oral health of schoolchildren in Western Australia.

    PubMed

    Arrow, P

    2016-09-01

    The West Australian School Dental Service (SDS) provides free, statewide, primary dental care to schoolchildren aged 5-17 years. This study reports on an evaluation of the oral health of children examined during the 2014 calendar year. Children were sampled, based on their date of birth, and SDS clinicians collected the clinical information. Weighted mean values of caries experience were presented. Negative binomial regression modelling was undertaken to test for factors of significance in the rate of caries occurrence. Data from children aged 5-15 years were used (girls = 4616, boys = 4900). Mean dmft (5-10-year-olds), 1.42 SE 0.03; mean DMFT (6-15-year-olds), 0.51 SE 0.01. Negative binomial regression model of permanent tooth caries found higher rates of caries in children who were from non-fluoridated areas (RR 2.1); Aboriginal (RR 2.4); had gingival inflammation (RR 1.5); lower ICSEA level (RR 1.4); and recalled at more than 24-month interval (RR 1.8). The study highlighted poor dental health associated with living in non-fluoridated areas, Aboriginal identity, poor oral hygiene, lower socioeconomic level and having extended intervals between dental checkups. Timely assessments and preventive measures targeted at groups, including extending community water fluoridation, may assist in further improving the oral health of children in Western Australia. © 2015 Australian Dental Association.

  14. Characterizing environmental risk factors for West Nile virus in Quebec, Canada, using clinical data in humans and serology in pet dogs.

    PubMed

    Rocheleau, J P; Michel, P; Lindsay, L R; Drebot, M; Dibernardo, A; Ogden, N H; Fortin, A; Arsenault, J

    2017-10-01

    The identification of specific environments sustaining emerging arbovirus amplification and transmission to humans is a key component of public health intervention planning. This study aimed at identifying environmental factors associated with West Nile virus (WNV) infections in southern Quebec, Canada, by modelling and jointly interpreting aggregated clinical data in humans and serological data in pet dogs. Environmental risk factors were estimated in humans by negative binomial regression based on a dataset of 191 human WNV clinical cases reported in the study area between 2011 and 2014. Risk factors for infection in dogs were evaluated by logistic and negative binomial models based on a dataset including WNV serological results from 1442 dogs sampled from the same geographical area in 2013. Forested lands were identified as low-risk environments in humans. Agricultural lands represented higher risk environments for dogs. Environments identified as impacting risk in the current study were somewhat different from those identified in other studies conducted in north-eastern USA, which reported higher risk in suburban environments. In the context of the current study, combining human and animal data allowed a more comprehensive and possibly a more accurate view of environmental WNV risk factors to be obtained than by studying aggregated human data alone.

  15. Community covariates of malnutrition based mortality among older adults.

    PubMed

    Lee, Matthew R; Berthelot, Emily R

    2010-05-01

    The purpose of this study was to identify community level covariates of malnutrition-based mortality among older adults. A community level framework was delineated which explains rates of malnutrition-related mortality among older adults as a function of community levels of socioeconomic disadvantage, disability, and social isolation among members of this group. County level data on malnutrition mortality of people 65 years of age and older for the period 2000-2003 were drawn from the CDC WONDER system databases. County level measures of older adult socioeconomic disadvantage, disability, and social isolation were derived from the 2000 US Census of Population and Housing. Negative binomial regression models adjusting for the size of the population at risk, racial composition, urbanism, and region were estimated to assess the relationships among these indicators. Results from negative binomial regression analysis yielded the following: a standard deviation increase in socioeconomic/physical disadvantage was associated with a 12% increase in the rate of malnutrition mortality among older adults (p < 0.001), whereas a standard deviation increase in social isolation was associated with a 5% increase in malnutrition mortality among older adults (p < 0.05). Community patterns of malnutrition based mortality among older adults are partly a function of levels of socioeconomic and physical disadvantage and social isolation among older adults. 2010 Elsevier Inc. All rights reserved.

  16. Goodness-of-fit tests and model diagnostics for negative binomial regression of RNA sequencing data.

    PubMed

    Mi, Gu; Di, Yanming; Schafer, Daniel W

    2015-01-01

    This work is about assessing model adequacy for negative binomial (NB) regression, particularly (1) assessing the adequacy of the NB assumption, and (2) assessing the appropriateness of models for NB dispersion parameters. Tools for the first are appropriate for NB regression generally; those for the second are primarily intended for RNA sequencing (RNA-Seq) data analysis. The typically small number of biological samples and large number of genes in RNA-Seq analysis motivate us to address the trade-offs between robustness and statistical power using NB regression models. One widely-used power-saving strategy, for example, is to assume some commonalities of NB dispersion parameters across genes via simple models relating them to mean expression rates, and many such models have been proposed. As RNA-Seq analysis is becoming ever more popular, it is appropriate to make more thorough investigations into power and robustness of the resulting methods, and into practical tools for model assessment. In this article, we propose simulation-based statistical tests and diagnostic graphics to address model adequacy. We provide simulated and real data examples to illustrate that our proposed methods are effective for detecting the misspecification of the NB mean-variance relationship as well as judging the adequacy of fit of several NB dispersion models.
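
    One simulation-based check in this spirit is a parametric bootstrap of a Pearson-type statistic: fit the NB regression, simulate new responses from the fitted model, refit, and compare. The sketch below uses simulated data and a plain NB2 model, not the RNA-Seq dispersion models discussed in the paper.

      # Sketch: parametric bootstrap goodness-of-fit check for NB regression.
      import numpy as np
      import statsmodels.api as sm

      rng = np.random.default_rng(8)
      n = 250
      x = rng.normal(size=n)
      X = sm.add_constant(x)
      mu = np.exp(1.0 + 0.5 * x)
      alpha = 0.6
      y = rng.negative_binomial(1 / alpha, 1 / (1 + alpha * mu))

      def pearson_stat(y, fit, a):
          mu_hat = fit.predict()
          var_hat = mu_hat + a * mu_hat**2             # NB2 variance
          return np.sum((y - mu_hat) ** 2 / var_hat)

      # Fit NB regression (alpha estimated by ML) and compute the observed statistic
      fit = sm.NegativeBinomial(y, X).fit(disp=0)
      a_hat = fit.params[-1]
      obs = pearson_stat(y, fit, a_hat)

      # Parametric bootstrap: simulate from the fitted model, refit, recompute
      mu_hat = fit.predict()
      sims = []
      for _ in range(100):
          y_sim = rng.negative_binomial(1 / a_hat, 1 / (1 + a_hat * mu_hat))
          f = sm.NegativeBinomial(y_sim, X).fit(disp=0)
          sims.append(pearson_stat(y_sim, f, f.params[-1]))

      p_value = np.mean(np.array(sims) >= obs)
      print(f"observed Pearson statistic {obs:.1f}, bootstrap p-value {p_value:.2f}")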

  17. Forecasting asthma-related hospital admissions in London using negative binomial models.

    PubMed

    Soyiri, Ireneous N; Reidpath, Daniel D; Sarran, Christophe

    2013-05-01

    Health forecasting can improve health service provision and individual patient outcomes. Environmental factors are known to impact chronic respiratory conditions such as asthma, but little is known about the extent to which these factors can be used for forecasting. Using weather, air quality and hospital asthma admissions in London (2005-2006), two related negative binomial models were developed and compared with a naive seasonal model. In the first approach, predictive forecasting models were fitted with 7-day averages of each potential predictor, and then a subsequent multivariable model was constructed. In the second strategy, an exhaustive search of the best fitting models between possible combinations of lags (0-14 days) of all the environmental effects on asthma admission was conducted. Three models were considered: a base model (seasonal effects), contrasted with a 7-day average model and a selected lags model (weather and air quality effects). Season is the best predictor of asthma admissions. The 7-day average and seasonal models were trivial to implement. The selected lags model was computationally intensive, but of no real value over much more easily implemented models. Seasonal factors can predict daily hospital asthma admissions in London, and there is little evidence that additional weather and air quality information would add to forecast accuracy.
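
    A sketch of the kind of seasonal baseline that proves hard to beat: a negative binomial GLM for daily admissions with annual harmonic terms, fitted to one simulated year and used to forecast the next. The dispersion is fixed here for simplicity (it would normally be estimated), and the data are not the London series.

      # Sketch: seasonal negative binomial GLM as a forecasting baseline.
      import numpy as np
      import statsmodels.api as sm

      rng = np.random.default_rng(9)
      days = np.arange(730)                               # two years of daily data
      season = (1.2 + 0.4 * np.sin(2 * np.pi * days / 365.25)
                    + 0.2 * np.cos(2 * np.pi * days / 365.25))
      mu = np.exp(season)
      alpha = 0.3
      y = rng.negative_binomial(1 / alpha, 1 / (1 + alpha * mu))

      X = sm.add_constant(np.column_stack([np.sin(2 * np.pi * days / 365.25),
                                           np.cos(2 * np.pi * days / 365.25)]))
      train, test = slice(0, 365), slice(365, 730)

      model = sm.GLM(y[train], X[train],
                     family=sm.families.NegativeBinomial(alpha=alpha)).fit()
      forecast = model.predict(X[test])
      mae = np.mean(np.abs(y[test] - forecast))
      print("mean absolute error of the seasonal NB forecast:", round(mae, 2))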

  18. Association between adherence to physical activity guidelines and health-related quality of life among individuals with physician-diagnosed arthritis.

    PubMed

    Austin, Shamly; Qu, Haiyan; Shewchuk, Richard M

    2012-10-01

    To examine the association between adherence to physical activity guidelines and health-related quality of life (HRQOL) among individuals with arthritis. A cross-sectional sample with 33,071 US adults, 45 years or older with physician-diagnosed arthritis was obtained from 2007 Behavioral Risk Factor Surveillance System survey. We conducted negative binomial regression analysis to examine HRQOL as a function of adherence to physical activity guidelines controlling for physicians' recommendations for physical activity, age, sex, race, education, marital status, employment, annual income, health insurance, personal physician, emotional support, body mass index, activity limitations, health status, and co-morbidities based on Behavioral Model of Health Services Utilization. Descriptive statistics showed that 60% adults with arthritis did not adhere to physical activity guidelines, mean physically and mentally unhealthy days were 7.7 and 4.4 days, respectively. Results from negative binomial regression indicated that individuals who did not adhere to physical activity guidelines had 1.14 days more physically unhealthy days and 1.12 days more mentally unhealthy days than those who adhered controlling for covariates. Adherence to physical activity is important to improve HRQOL for individuals with arthritis. However, adherence is low among this population. Interventions are required to engage individuals with arthritis in physical activity.

  19. Technical and biological variance structure in mRNA-Seq data: life in the real world

    PubMed Central

    2012-01-01

    Background mRNA expression data from next-generation sequencing platforms are obtained in the form of counts per gene or exon. Counts have classically been assumed to follow a Poisson distribution in which the variance is equal to the mean. The Negative Binomial distribution, which allows for over-dispersion, i.e., for the variance to be greater than the mean, is commonly used to model count data as well. Results In mRNA-Seq data from 25 subjects, we found technical variation to generally follow a Poisson distribution, as has been reported previously, while biological variability was over-dispersed relative to the Poisson model. The mean-variance relationship across all genes was quadratic, in keeping with a Negative Binomial (NB) distribution. Over-dispersed Poisson and NB distributional assumptions demonstrated marked improvements in goodness-of-fit (GOF) over the standard Poisson model assumptions, but with evidence of over-fitting in some genes. Modeling of experimental effects improved GOF for high variance genes but increased the over-fitting problem. Conclusions These conclusions will guide development of analytical strategies for accurate modeling of variance structure in these data and sample size determination, which in turn will aid in the identification of true biological signals that inform our understanding of biological systems. PMID:22769017

  20. Design and analysis of three-arm trials with negative binomially distributed endpoints.

    PubMed

    Mütze, Tobias; Munk, Axel; Friede, Tim

    2016-02-20

    A three-arm clinical trial design with an experimental treatment, an active control, and a placebo control, commonly referred to as the gold standard design, enables testing of non-inferiority or superiority of the experimental treatment compared with the active control. In this paper, we propose methods for designing and analyzing three-arm trials with negative binomially distributed endpoints. In particular, we develop a Wald-type test with a restricted maximum-likelihood variance estimator for testing non-inferiority or superiority. For this test, sample size and power formulas as well as optimal sample size allocations will be derived. The performance of the proposed test will be assessed in an extensive simulation study with regard to type I error rate, power, sample size, and sample size allocation. For the purpose of comparison, Wald-type statistics with a sample variance estimator and an unrestricted maximum-likelihood estimator are included in the simulation study. We found that the proposed Wald-type test with a restricted variance estimator performed well across the considered scenarios and is therefore recommended for application in clinical trials. The methods proposed are motivated and illustrated by a recent clinical trial in multiple sclerosis. The R package ThreeArmedTrials, which implements the methods discussed in this paper, is available on CRAN. Copyright © 2015 John Wiley & Sons, Ltd.

  1. Ecology of nonnative Siberian prawn (Palaemon modestus) in the lower Snake River, Washington, USA

    USGS Publications Warehouse

    Erhardt, John M.; Tiffan, Kenneth F.

    2016-01-01

    We assessed the abundance, distribution, and ecology of the nonnative Siberian prawn Palaemon modestus in the lower Snake River, Washington, USA. Analysis of prawn passage abundance at three Snake River dams showed that populations are growing at exponential rates, especially at Little Goose Dam where over 464,000 prawns were collected in 2015. Monthly beam trawling during 2011–2013 provided information on prawn abundance and distribution in Lower Granite and Little Goose Reservoirs. Zero-inflated regression predicted that the probability of prawn presence increased with decreasing water velocity and increasing depth. Negative binomial models predicted higher catch rates of prawns in deeper water and in closer proximity to dams. Temporally, prawn densities decreased slightly in the summer, likely due to the mortality of older individuals, and then increased in autumn and winter with the emergence and recruitment of young of the year. Seasonal length frequencies showed that distinct juvenile and adult size classes exist throughout the year, suggesting prawns live from 1 to 2 years and may be able to reproduce multiple times during their life. Most juvenile prawns become reproductive adults in 1 year, and peak reproduction occurs from late July through October. Mean fecundity (189 eggs) and reproductive output (11.9 %) are similar to that in their native range. The current use of deep habitats by prawns likely makes them unavailable to most predators in the reservoirs. The distribution and role of Siberian prawns in the lower Snake River food web will probably continue to change as the population grows and warrants continued monitoring and investigation.
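
    The two-part structure described here (a presence/absence component plus a negative binomial count component) can be sketched as follows. This is a hurdle-style simplification rather than the authors' zero-inflated specification, and the data file and column names (count, depth, velocity, dam_dist) are hypothetical.

    ```python
    import pandas as pd
    import statsmodels.api as sm
    import statsmodels.formula.api as smf

    # Hypothetical trawl data: one row per tow with columns 'count' (prawns caught),
    # 'depth' (m), 'velocity' (m/s), and 'dam_dist' (km to the nearest dam).
    tows = pd.read_csv("prawn_trawls.csv")  # hypothetical file

    # Occurrence part: probability that any prawns are present in a tow.
    tows["present"] = (tows["count"] > 0).astype(int)
    occ = smf.logit("present ~ depth + velocity", data=tows).fit()

    # Abundance part: negative binomial model for catch, here on positive tows only
    # (a hurdle-style simplification of the zero-inflated approach in the abstract).
    pos = tows[tows["count"] > 0]
    abun = smf.negativebinomial("count ~ depth + dam_dist", data=pos).fit()

    print(occ.params)
    print(abun.params)
    ```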

  2. A Note on a Family of Alternating Sums of Products of Binomial Numbers

    ERIC Educational Resources Information Center

    Gauthier, N.

    2013-01-01

    We study the following family of integral-valued alternating sums, where -∞ ≤ m ≤ ∞ and n ≥ 0 are integers [equation omitted]. We first consider h_m(n) for m and n non-negative integers and show that it is of the form 2^(n+2m) - P_m(n),…

  3. Investigation of shipping accident injury severity and mortality.

    PubMed

    Weng, Jinxian; Yang, Dong

    2015-03-01

    Shipping movements take place in a complex and high-risk environment, and fatal shipping accidents are the nightmare of seafarers. Using ten years of worldwide ship accident data, this study develops a binary logistic regression model and a zero-truncated binomial regression model to predict the probability of fatal shipping accidents and the corresponding mortalities. The model results show that both the probability of fatal accidents and the number of mortalities are greater for collision, fire/explosion, contact, grounding and sinking accidents occurring in adverse weather and darkness conditions. Sinking has the largest effect on the increase in fatal accident probability and mortalities. The results also show that larger numbers of mortalities are associated with shipping accidents occurring far from the coastal area/harbor/port. In addition, cruise ships are found to have more mortalities than non-cruise ships. The results of this study are beneficial for policy-makers in proposing efficient strategies to prevent fatal shipping accidents. Copyright © 2015 Elsevier Ltd. All rights reserved.

  4. Evaluating Perceived Probability of Threat-Relevant Outcomes and Temporal Orientation in Flying Phobia.

    PubMed

    Mavromoustakos, Elena; Clark, Gavin I; Rock, Adam J

    2016-01-01

    Probability bias regarding threat-relevant outcomes has been demonstrated across anxiety disorders but has not been investigated in flying phobia. Individual temporal orientation (time perspective) may be hypothesised to influence estimates of negative outcomes occurring. The present study investigated whether probability bias could be demonstrated in flying phobia and whether probability estimates of negative flying events were predicted by time perspective. Sixty flying phobic and fifty-five non-flying-phobic adults were recruited to complete an online questionnaire. Participants completed the Flight Anxiety Scale, the Probability Scale (measuring perceived probability of flying-negative, general-negative and general-positive events) and the Past-Negative, Future and Present-Hedonistic subscales of the Zimbardo Time Perspective Inventory (variables argued to predict mental travel forward and backward in time). The flying phobic group estimated the probability of flying-negative and general-negative events occurring as significantly higher than the non-flying-phobic group did. Past-Negative scores (positively) and Present-Hedonistic scores (negatively) predicted probability estimates of flying-negative events. The Future Orientation subscale did not significantly predict probability estimates. This study is the first to demonstrate probability bias for threat-relevant outcomes in flying phobia. Results suggest that time perspective may influence perceived probability of threat-relevant outcomes but the nature of this relationship remains to be determined.

  5. Evaluating Perceived Probability of Threat-Relevant Outcomes and Temporal Orientation in Flying Phobia

    PubMed Central

    Mavromoustakos, Elena; Clark, Gavin I.; Rock, Adam J.

    2016-01-01

    Probability bias regarding threat-relevant outcomes has been demonstrated across anxiety disorders but has not been investigated in flying phobia. Individual temporal orientation (time perspective) may be hypothesised to influence estimates of negative outcomes occurring. The present study investigated whether probability bias could be demonstrated in flying phobia and whether probability estimates of negative flying events were predicted by time perspective. Sixty flying phobic and fifty-five non-flying-phobic adults were recruited to complete an online questionnaire. Participants completed the Flight Anxiety Scale, the Probability Scale (measuring perceived probability of flying-negative, general-negative and general-positive events) and the Past-Negative, Future and Present-Hedonistic subscales of the Zimbardo Time Perspective Inventory (variables argued to predict mental travel forward and backward in time). The flying phobic group estimated the probability of flying-negative and general-negative events occurring as significantly higher than the non-flying-phobic group did. Past-Negative scores (positively) and Present-Hedonistic scores (negatively) predicted probability estimates of flying-negative events. The Future Orientation subscale did not significantly predict probability estimates. This study is the first to demonstrate probability bias for threat-relevant outcomes in flying phobia. Results suggest that time perspective may influence perceived probability of threat-relevant outcomes but the nature of this relationship remains to be determined. PMID:27557054

  6. [Evaluation of estimation of prevalence ratio using bayesian log-binomial regression model].

    PubMed

    Gao, W L; Lin, H; Liu, X N; Ren, X W; Li, J S; Shen, X P; Zhu, S L

    2017-03-10

    To evaluate the estimation of the prevalence ratio (PR) using a Bayesian log-binomial regression model and its application, we estimated the PR of medical care-seeking prevalence in relation to caregivers' recognition of risk signs of diarrhea in their infants using a Bayesian log-binomial regression model in OpenBUGS software. The results showed that caregivers' recognition of infants' risk signs of diarrhea was significantly associated with a 13% increase in medical care-seeking. We then compared the point and interval estimates of the PR, and the convergence of three models (model 1: not adjusting for covariates; model 2: adjusting for duration of caregivers' education; model 3: adjusting for distance between village and township and child age in months, based on model 2), between the Bayesian log-binomial regression model and the conventional log-binomial regression model. All three Bayesian log-binomial regression models converged, and the estimated PRs were 1.130 (95% CI: 1.005-1.265), 1.128 (95% CI: 1.001-1.264) and 1.132 (95% CI: 1.004-1.267), respectively. Conventional log-binomial regression models 1 and 2 converged, with PRs of 1.130 (95% CI: 1.055-1.206) and 1.126 (95% CI: 1.051-1.203), respectively, but model 3 failed to converge, so the COPY method was used to estimate the PR, which was 1.125 (95% CI: 1.051-1.200). The point and interval estimates of the PRs from the three Bayesian log-binomial regression models differed only slightly from those of the conventional log-binomial regression models, showing good consistency in estimating the PR. Therefore, the Bayesian log-binomial regression model can effectively estimate the PR with fewer convergence problems and offers advantages in application compared with the conventional log-binomial regression model.

  7. Drivers of multidimensional eco-innovation: empirical evidence from the Brazilian industry.

    PubMed

    da Silva Rabêlo, Olivan; de Azevedo Melo, Andrea Sales Soares

    2018-03-08

    The study analyses the relationships between the main drivers of eco-innovation introduced by innovative industries, focusing on cooperation strategy. Eco-innovation is analysed by means of a multidimensional identification strategy, showing the relationships between the independent variables and the variable of interest. The literature discussing environmental innovation is different from the one discussing other types of innovation inasmuch as it seeks to grasp its determinants and mostly highlights the relevance of environmental regulation. The key feature of this paper is that it ascribes special relevance to cooperation strategy with external partners and to the propensity of innovative industries to introduce eco-innovation. A sample of 35,060 Brazilian industries was analysed, covering 2003 to 2011, by means of binomial, multinomial and ordinal logistic regressions with microdata collected from the research and innovation survey (PINTEC) of the Brazilian Institute of Geography and Statistics (Instituto Brasileiro de Geografia e Estatística). The econometric results estimated by the multinomial logit method suggest that the cooperation with external partners practiced by innovative industries facilitates the adoption of eco-innovation in dimension 01 with a probability of 64.59%, in dimension 02 with 57.63% and in dimension 03 with 81.02%. The data reveal that the higher the degree of eco-innovation complexity, the more strongly industries seek cooperation with external partners. Under the ordinal and binomial logit models, cooperation increases the probability that the industry is eco-innovative by 65.09% and 89.34%, respectively. Environmental regulation and innovation in product and information management were also positively correlated as drivers of eco-innovation.

  8. Maximizing Statistical Power When Verifying Probabilistic Forecasts of Hydrometeorological Events

    NASA Astrophysics Data System (ADS)

    DeChant, C. M.; Moradkhani, H.

    2014-12-01

    Hydrometeorological events (i.e. floods, droughts, precipitation) are increasingly being forecasted probabilistically, owing to the uncertainties in the underlying causes of the phenomenon. In these forecasts, the probability of the event, over some lead time, is estimated based on some model simulations or predictive indicators. By issuing probabilistic forecasts, agencies may communicate the uncertainty in the event occurring. Assuming that the assigned probability of the event is correct, which is referred to as a reliable forecast, the end user may perform some risk management based on the potential damages resulting from the event. Alternatively, an unreliable forecast may give false impressions of the actual risk, leading to improper decision making when protecting resources from extreme events. Due to this requisite for reliable forecasts to perform effective risk management, this study takes a renewed look at reliability assessment in event forecasts. Illustrative experiments will be presented, showing deficiencies in the commonly available approaches (Brier Score, Reliability Diagram). Overall, it is shown that the conventional reliability assessment techniques do not maximize the ability to distinguish between a reliable and unreliable forecast. In this regard, a theoretical formulation of the probabilistic event forecast verification framework will be presented. From this analysis, hypothesis testing with the Poisson-Binomial distribution is the most exact model available for the verification framework, and therefore maximizes one's ability to distinguish between a reliable and unreliable forecast. Application of this verification system was also examined within a real forecasting case study, highlighting the additional statistical power provided with the use of the Poisson-Binomial distribution.
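
    The exact Poisson-Binomial calculation advocated above can be implemented directly by convolving the per-forecast Bernoulli distributions. The sketch below is a minimal illustration of that idea, not the authors' verification framework; the toy forecast probabilities are invented.

    ```python
    import numpy as np

    def poisson_binomial_pmf(probs):
        """Exact pmf of the number of successes among independent Bernoulli trials
        with success probabilities `probs`, built up by dynamic programming."""
        pmf = np.array([1.0])
        for p in probs:
            pmf = np.convolve(pmf, [1.0 - p, p])
        return pmf

    def reliability_pvalue(forecast_probs, events_observed):
        """p-value (probability-ordering, two-sided) for H0: the issued event
        probabilities are reliable, i.e. the observed number of events is a
        Poisson-Binomial draw with those probabilities."""
        pmf = poisson_binomial_pmf(forecast_probs)
        # sum the probability of all outcomes no more likely than the observed one
        return pmf[pmf <= pmf[events_observed]].sum()

    # toy example: 30 forecasts with assorted event probabilities, 12 events observed
    rng = np.random.default_rng(0)
    probs = rng.uniform(0.05, 0.6, size=30)
    print(reliability_pvalue(probs, events_observed=12))
    ```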

  9. Not in My Back Yard: A Comparative Analysis of Crime Around Publicly Funded Drug Treatment Centers, Liquor Stores, Convenience Stores, and Corner Stores in One Mid-Atlantic City

    PubMed Central

    Furr-Holden, C. Debra M.; Milam, Adam J.; Nesoff, Elizabeth D.; Johnson, Renee M.; Fakunle, David O.; Jennings, Jacky M.; Thorpe, Roland J.

    2016-01-01

    Objective: This research examined whether publicly funded drug treatment centers (DTCs) were associated with violent crime in excess of the violence occurring around other commercial businesses. Method: Violent crime data and locations of community entities were geocoded and mapped. DTCs and other retail outlets were matched based on a Neighborhood Disadvantage score at the census tract level. Street network buffers ranging from 100 to 1,400 feet were placed around each location. Negative binomial regression models were used to estimate the relationship between the count of violent crimes and the distance from each business type. Results: Compared with the mean count of violent crime around drug treatment centers (M = 2.87), the mean count was significantly higher around liquor stores (M = 3.98; t test; p < .01) and corner stores (M = 3.78; t test; p < .01), and did not differ significantly around convenience stores (M = 2.65; t test; p = .32). In the adjusted negative binomial regression models, there was a negative and significant relationship between the count of violent crime and the distance from drug treatment centers (β = -.069, p < .01), liquor stores (β = -.081, p < .01), corner stores (β = -.116, p < .01), and convenience stores (β = -.154, p < .01). Conclusions: Violent crime associated with drug treatment centers is similar to that associated with liquor stores and is less frequent than that associated with convenience stores and corner stores. PMID:26751351

  10. On the robustness of the q-Gaussian family

    NASA Astrophysics Data System (ADS)

    Sicuro, Gabriele; Tempesta, Piergiulio; Rodríguez, Antonio; Tsallis, Constantino

    2015-12-01

    We introduce three deformations, called α-, β- and γ-deformation respectively, of a N-body probabilistic model, first proposed by Rodríguez et al. (2008), having q-Gaussians as N → ∞ limiting probability distributions. The proposed α- and β-deformations are asymptotically scale-invariant, whereas the γ-deformation is not. We prove that, for both α- and β-deformations, the resulting deformed triangles still have q-Gaussians as limiting distributions, with a value of q independent (dependent) on the deformation parameter in the α-case (β-case). In contrast, the γ-case, where we have used the celebrated Q-numbers and the Gauss binomial coefficients, yields other limiting probability distribution functions, outside the q-Gaussian family. These results suggest that scale-invariance might play an important role regarding the robustness of the q-Gaussian family.

  11. Assessment of NDE Reliability Data

    NASA Technical Reports Server (NTRS)

    Yee, B. G. W.; Chang, F. H.; Couchman, J. C.; Lemon, G. H.; Packman, P. F.

    1976-01-01

    Twenty sets of relevant Nondestructive Evaluation (NDE) reliability data have been identified, collected, compiled, and categorized. A criterion for the selection of data for statistical analysis considerations has been formulated. A model to grade the quality and validity of the data sets has been developed. Data input formats, which record the pertinent parameters of the defect/specimen and inspection procedures, have been formulated for each NDE method. A comprehensive computer program has been written to calculate the probability of flaw detection at several confidence levels by the binomial distribution. This program also selects the desired data sets for pooling and tests the statistical pooling criteria before calculating the composite detection reliability. Probability of detection curves at 95 and 50 percent confidence levels have been plotted for individual sets of relevant data as well as for several sets of merged data with common sets of NDE parameters.
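
    The binomial confidence calculation described here can be sketched with an exact (Clopper-Pearson) one-sided lower bound on the probability of detection; the hit/miss numbers below are illustrative only and the function name is ours.

    ```python
    from scipy.stats import beta

    def pod_lower_bound(detections, trials, confidence=0.95):
        """One-sided lower confidence bound on the probability of detection (POD)
        from binomial hit/miss data (exact Clopper-Pearson bound)."""
        if detections == 0:
            return 0.0
        return beta.ppf(1.0 - confidence, detections, trials - detections + 1)

    # e.g. 28 detections in 29 inspections of flaws of a given size
    for conf in (0.50, 0.95):
        print(conf, round(pod_lower_bound(28, 29, conf), 3))
    ```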

  12. Modeling avian abundance from replicated counts using binomial mixture models

    USGS Publications Warehouse

    Kery, Marc; Royle, J. Andrew; Schmid, Hans

    2005-01-01

    Abundance estimation in ecology is usually accomplished by capture–recapture, removal, or distance sampling methods. These may be hard to implement at large spatial scales. In contrast, binomial mixture models enable abundance estimation without individual identification, based simply on temporally and spatially replicated counts. Here, we evaluate mixture models using data from the national breeding bird monitoring program in Switzerland, where some 250 1-km2 quadrats are surveyed using the territory mapping method three times during each breeding season. We chose eight species with contrasting distribution (wide–narrow), abundance (high–low), and detectability (easy–difficult). Abundance was modeled as a random effect with a Poisson or negative binomial distribution, with mean affected by forest cover, elevation, and route length. Detectability was a logit-linear function of survey date, survey date-by-elevation, and sampling effort (time per transect unit). Resulting covariate effects and parameter estimates were consistent with expectations. Detectability per territory (for three surveys) ranged from 0.66 to 0.94 (mean 0.84) for easy species, and from 0.16 to 0.83 (mean 0.53) for difficult species, depended on survey effort for two easy and all four difficult species, and changed seasonally for three easy and three difficult species. Abundance was positively related to route length in three high-abundance and one low-abundance (one easy and three difficult) species, and increased with forest cover in five forest species, decreased for two nonforest species, and was unaffected for a generalist species. Abundance estimates under the most parsimonious mixture models were between 1.1 and 8.9 (median 1.8) times greater than estimates based on territory mapping; hence, three surveys were insufficient to detect all territories for each species. We conclude that binomial mixture models are an important new approach for estimating abundance corrected for detectability when only repeated-count data are available. Future developments envisioned include estimation of trend, occupancy, and total regional abundance.
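
    A minimal single-site version of the binomial mixture likelihood (Poisson abundance, binomial detection) looks like the sketch below; real analyses, including the one described here, fit many sites with covariates on abundance and detectability, so this is only meant to show how the latent abundance N is summed out. The toy counts are invented.

    ```python
    import numpy as np
    from scipy.optimize import minimize
    from scipy.stats import binom, poisson

    def nmix_negloglik(params, counts, n_max=200):
        """Negative log-likelihood of a binomial mixture (N-mixture) model for one
        site: N ~ Poisson(lam), each replicated count ~ Binomial(N, p)."""
        lam = np.exp(params[0])
        p = 1.0 / (1.0 + np.exp(-params[1]))
        n_grid = np.arange(counts.max(), n_max + 1)
        lik_n = poisson.pmf(n_grid, lam)          # prior weight of each latent N
        for y in counts:
            lik_n = lik_n * binom.pmf(y, n_grid, p)
        return -np.log(lik_n.sum() + 1e-300)      # sum over the latent abundance

    counts = np.array([3, 5, 2])                  # three surveys of one quadrat
    fit = minimize(nmix_negloglik, x0=[np.log(5.0), 0.0], args=(counts,))
    lam_hat = np.exp(fit.x[0]); p_hat = 1 / (1 + np.exp(-fit.x[1]))
    print(f"estimated abundance {lam_hat:.1f}, detection probability {p_hat:.2f}")
    ```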

  13. Measurement error and outcome distributions: Methodological issues in regression analyses of behavioral coding data.

    PubMed

    Holsclaw, Tracy; Hallgren, Kevin A; Steyvers, Mark; Smyth, Padhraic; Atkins, David C

    2015-12-01

    Behavioral coding is increasingly used for studying mechanisms of change in psychosocial treatments for substance use disorders (SUDs). However, behavioral coding data typically include features that can be problematic in regression analyses, including measurement error in independent variables, non-normal distributions of count outcome variables, and conflation of predictor and outcome variables with third variables, such as session length. Methodological research in econometrics has shown that these issues can lead to biased parameter estimates, inaccurate standard errors, and increased Type I and Type II error rates, yet these statistical issues are not widely known within SUD treatment research, or more generally, within psychotherapy coding research. Using minimally technical language intended for a broad audience of SUD treatment researchers, the present paper illustrates the ways in which these data issues are problematic. We draw on real-world data and simulation-based examples to illustrate how these data features can bias estimation of parameters and interpretation of models. A weighted negative binomial regression is introduced as an alternative to ordinary linear regression that appropriately addresses the data characteristics common to SUD treatment behavioral coding data. We conclude by demonstrating how to use and interpret these models with data from a study of motivational interviewing. SPSS and R syntax for weighted negative binomial regression models is included in online supplemental materials. (c) 2016 APA, all rights reserved.
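
    One simple way to keep session length from conflating predictor and outcome, in the spirit of the approach described here (though not the authors' weighted estimator), is to fit a negative binomial GLM with session length as an exposure offset; the data frame and column names below are hypothetical.

    ```python
    import numpy as np
    import pandas as pd
    import statsmodels.api as sm
    import statsmodels.formula.api as smf

    # Hypothetical session-level coding data: 'reflections' is a behaviour count,
    # 'empathy' a coded predictor, 'minutes' the session length.
    sessions = pd.DataFrame({
        "reflections": [4, 12, 7, 0, 9, 15, 3, 6],
        "empathy":     [2.0, 4.5, 3.0, 1.5, 3.5, 5.0, 2.5, 3.0],
        "minutes":     [30, 60, 45, 20, 50, 60, 25, 40],
    })

    # Treating session length as an exposure (log offset) keeps the predictor and
    # outcome from being conflated with how long the session happened to run.
    model = smf.glm("reflections ~ empathy", data=sessions,
                    family=sm.families.NegativeBinomial(),
                    exposure=sessions["minutes"]).fit()
    print(np.exp(model.params))   # rate ratios per unit of predictor
    ```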

  14. Measurement error and outcome distributions: Methodological issues in regression analyses of behavioral coding data

    PubMed Central

    Holsclaw, Tracy; Hallgren, Kevin A.; Steyvers, Mark; Smyth, Padhraic; Atkins, David C.

    2015-01-01

    Behavioral coding is increasingly used for studying mechanisms of change in psychosocial treatments for substance use disorders (SUDs). However, behavioral coding data typically include features that can be problematic in regression analyses, including measurement error in independent variables, non-normal distributions of count outcome variables, and conflation of predictor and outcome variables with third variables, such as session length. Methodological research in econometrics has shown that these issues can lead to biased parameter estimates, inaccurate standard errors, and increased type-I and type-II error rates, yet these statistical issues are not widely known within SUD treatment research, or more generally, within psychotherapy coding research. Using minimally-technical language intended for a broad audience of SUD treatment researchers, the present paper illustrates the nature in which these data issues are problematic. We draw on real-world data and simulation-based examples to illustrate how these data features can bias estimation of parameters and interpretation of models. A weighted negative binomial regression is introduced as an alternative to ordinary linear regression that appropriately addresses the data characteristics common to SUD treatment behavioral coding data. We conclude by demonstrating how to use and interpret these models with data from a study of motivational interviewing. SPSS and R syntax for weighted negative binomial regression models is included in supplementary materials. PMID:26098126

  15. Magnitude of virologic blips is associated with a higher risk for virologic rebound in HIV-infected individuals: a recurrent events analysis.

    PubMed

    Grennan, J Troy; Loutfy, Mona R; Su, DeSheng; Harrigan, P Richard; Cooper, Curtis; Klein, Marina; Machouf, Nima; Montaner, Julio S G; Rourke, Sean; Tsoukas, Christos; Hogg, Bob; Raboud, Janet

    2012-04-15

    The importance of human immunodeficiency virus (HIV) blip magnitude on virologic rebound has been raised in clinical guidelines relating to viral load assays. Antiretroviral-naive individuals initiating combination antiretroviral therapy (cART) after 1 January 2000 and achieving virologic suppression were studied. Negative binomial models were used to identify blip correlates. Recurrent event models were used to determine the association between blips and rebound by incorporating multiple periods of virologic suppression per individual. 3550 participants (82% male; median age, 40 years) were included. In a multivariable negative binomial regression model, the Amplicor assay was associated with a lower blip rate than branched DNA (rate ratio, 0.69; P < .01), controlling for age, sex, region, baseline HIV-1 RNA and CD4 count, AIDS-defining illnesses, year of cART initiation, cART type, and HIV-1 RNA testing frequency. In a multivariable recurrent event model controlling for age, sex, intravenous drug use, cART start year, cART type, assay type, and HIV-1 RNA testing frequency, blips of 500-999 copies/mL were associated with virologic rebound (hazard ratio, 2.70; P = .002), whereas blips of 50-499 were not. HIV-1 RNA assay was an important determinant of blip rates and should be considered in clinical guidelines. Blips ≥500 copies/mL were associated with increased rebound risk.

  16. Multi-level Bayesian safety analysis with unprocessed Automatic Vehicle Identification data for an urban expressway.

    PubMed

    Shi, Qi; Abdel-Aty, Mohamed; Yu, Rongjie

    2016-03-01

    In traffic safety studies, crash frequency modeling of total crashes is the cornerstone before proceeding to more detailed safety evaluation. The relationship between crash occurrence and factors such as traffic flow and roadway geometric characteristics has been extensively explored for a better understanding of crash mechanisms. In this study, a multi-level Bayesian framework has been developed in an effort to identify the crash contributing factors on an urban expressway in the Central Florida area. Two types of traffic data from the Automatic Vehicle Identification system, which are the processed data capped at speed limit and the unprocessed data retaining the original speed were incorporated in the analysis along with road geometric information. The model framework was proposed to account for the hierarchical data structure and the heterogeneity among the traffic and roadway geometric data. Multi-level and random parameters models were constructed and compared with the Negative Binomial model under the Bayesian inference framework. Results showed that the unprocessed traffic data was superior. Both multi-level models and random parameters models outperformed the Negative Binomial model and the models with random parameters achieved the best model fitting. The contributing factors identified imply that on the urban expressway lower speed and higher speed variation could significantly increase the crash likelihood. Other geometric factors were significant including auxiliary lanes and horizontal curvature. Copyright © 2015 Elsevier Ltd. All rights reserved.

  17. A statistical model to estimate the impact of a hepatitis A vaccination programme.

    PubMed

    Oviedo, Manuel; Pilar Muñoz, M; Domínguez, Angela; Borras, Eva; Carmona, Gloria

    2008-11-11

    A program of routine hepatitis A+B vaccination in preadolescents was introduced in 1998 in Catalonia, a region situated in the northeast of Spain. The objective of this study was to quantify the reduction in the incidence of hepatitis A, in order to differentiate the natural reduction in incidence from that produced by the vaccination programme, and to predict the evolution of the disease in forthcoming years. A generalized linear model (GLM) using negative binomial regression was used to estimate the incidence rates of hepatitis A in Catalonia by year, age group and vaccination. Introduction of the vaccine reduced cases by 5.5 per year (p-value<0.001), but there was a significant interaction between the year of report and vaccination that smoothed this reduction (p-value<0.001). The reduction was not equal in all age groups, being greater in the 12-18 years age group, which fell from a mean rate of 8.15 per 100,000 person-years in the pre-vaccination period (1992-1998) to 1.4 in the vaccination period (1999-2005). The model predicts the evolution accurately for the group of vaccinated subjects. Negative binomial regression is more appropriate than Poisson regression when the observed variance exceeds the observed mean (overdispersed count data); ignoring this overdispersion can make a variable appear to contribute more to the model than it really does.

  18. Finite mixture modeling approach for developing crash modification factors in highway safety analysis.

    PubMed

    Park, Byung-Jung; Lord, Dominique; Wu, Lingtao

    2016-10-28

    This study aimed to investigate the relative performance of two models (the negative binomial (NB) model and the two-component finite mixture of negative binomial models (FMNB-2)) in terms of developing crash modification factors (CMFs). Crash data on rural multilane divided highways in California and Texas were modeled with the two models, and crash modification functions (CMFunctions) were derived. The resultant CMFunction estimated from the FMNB-2 model showed several good properties over that from the NB model. First, the safety effect of a covariate was better reflected by the CMFunction developed using the FMNB-2 model, since the model takes into account the differential responsiveness of crash frequency to the covariate. Second, the CMFunction derived from the FMNB-2 model is able to capture nonlinear relationships between the covariate and safety. Finally, following the same concept as for NB models, the combined CMFs of multiple treatments were estimated using the FMNB-2 model. The results indicated that they are not simply the product of the single-treatment CMFs (i.e., their safety effects are not independent under FMNB-2 models). Adjustment Factors (AFs) were then developed. It was revealed that the current Highway Safety Manual method could over- or under-estimate the combined CMFs under particular combinations of covariates. Safety analysts are encouraged to consider using the FMNB-2 models for developing CMFs and AFs. Copyright © 2016 Elsevier Ltd. All rights reserved.

  19. nbCNV: a multi-constrained optimization model for discovering copy number variants in single-cell sequencing data.

    PubMed

    Zhang, Changsheng; Cai, Hongmin; Huang, Jingying; Song, Yan

    2016-09-17

    Variations in DNA copy number make an important contribution to the development of several diseases, including autism, schizophrenia and cancer. Single-cell sequencing technology allows the dissection of genomic heterogeneity at the single-cell level, thereby providing important evolutionary information about cancer cells. In contrast to traditional bulk sequencing, single-cell sequencing requires the amplification of the whole genome of a single cell to accumulate enough material for sequencing. However, the amplification process inevitably introduces amplification bias, resulting in an over-dispersed portion of the sequencing data. A recent study has shown that the over-dispersed portion of the single-cell sequencing data can be well modelled by negative binomial distributions. We developed a read-depth-based method, nbCNV, to detect copy number variants (CNVs). The nbCNV method uses two constraints, sparsity and smoothness, to fit the CNV patterns under the assumption that the read signals are negatively binomially distributed. The problem of CNV detection was formulated as a quadratic optimization problem and solved by an efficient numerical procedure based on the classical alternating direction minimization method. Extensive experiments comparing nbCNV with existing benchmark models were conducted on both simulated data and empirical single-cell sequencing data. The results of those experiments demonstrate that nbCNV achieves superior performance and high robustness for the detection of CNVs in single-cell sequencing data.

  20. A comparison of observation-level random effect and Beta-Binomial models for modelling overdispersion in Binomial data in ecology & evolution.

    PubMed

    Harrison, Xavier A

    2015-01-01

    Overdispersion is a common feature of models of biological data, but researchers often fail to model the excess variation driving the overdispersion, resulting in biased parameter estimates and standard errors. Quantifying and modeling overdispersion when it is present is therefore critical for robust biological inference. One means to account for overdispersion is to add an observation-level random effect (OLRE) to a model, where each data point receives a unique level of a random effect that can absorb the extra-parametric variation in the data. Although some studies have investigated the utility of OLRE to model overdispersion in Poisson count data, studies doing so for Binomial proportion data are scarce. Here I use a simulation approach to investigate the ability of both OLRE models and Beta-Binomial models to recover unbiased parameter estimates in mixed effects models of Binomial data under various degrees of overdispersion. In addition, as ecologists often fit random intercept terms to models when the random effect sample size is low (<5 levels), I investigate the performance of both model types under a range of random effect sample sizes when overdispersion is present. Simulation results revealed that the efficacy of OLRE depends on the process that generated the overdispersion; OLRE failed to cope with overdispersion generated from a Beta-Binomial mixture model, leading to biased slope and intercept estimates, but performed well for overdispersion generated by adding random noise to the linear predictor. Comparison of parameter estimates from an OLRE model with those from its corresponding Beta-Binomial model readily identified when OLRE were performing poorly due to disagreement between effect sizes, and this strategy should be employed whenever OLRE are used for Binomial data to assess their reliability. Beta-Binomial models performed well across all contexts, but showed a tendency to underestimate effect sizes when modelling non-Beta-Binomial data. Finally, both OLRE and Beta-Binomial models performed poorly when models contained <5 levels of the random intercept term, especially for estimating variance components, and this effect appeared independent of total sample size. These results suggest that OLRE are a useful tool for modelling overdispersion in Binomial data, but that they do not perform well in all circumstances and researchers should take care to verify the robustness of parameter estimates of OLRE models.
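
    A small simulation along these lines, assuming a Beta-Binomial generating process and fitting the Beta-Binomial by maximum likelihood with scipy, might look like the following; it is a sketch of the comparison idea, not the paper's simulation design.

    ```python
    import numpy as np
    from scipy.optimize import minimize
    from scipy.stats import betabinom, binom

    rng = np.random.default_rng(3)

    # Simulate overdispersed proportion data: per-group success probabilities are
    # themselves Beta-distributed (a Beta-Binomial mixture generating process).
    n_trials = np.full(40, 20)
    p_i = rng.beta(2.0, 3.0, size=40)
    successes = rng.binomial(n_trials, p_i)

    def bb_negloglik(params):
        a, b = np.exp(params)          # keep both shape parameters positive
        return -betabinom.logpmf(successes, n_trials, a, b).sum()

    fit = minimize(bb_negloglik, x0=[0.0, 0.0])
    a_hat, b_hat = np.exp(fit.x)
    print(f"beta-binomial shapes: a={a_hat:.2f}, b={b_hat:.2f}")

    # For contrast, the ordinary binomial fit ignores the extra variation entirely.
    p_hat = successes.sum() / n_trials.sum()
    print("binomial log-lik:", binom.logpmf(successes, n_trials, p_hat).sum())
    print("beta-binomial log-lik:", -fit.fun)
    ```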

  1. Landau-Zener extension of the Tavis-Cummings model: Structure of the solution

    DOE PAGES

    Sun, Chen; Sinitsyn, Nikolai A.

    2016-09-07

    We explore the recently discovered solution of the driven Tavis-Cummings model (DTCM). It describes the interaction of an arbitrary number of two-level systems with a bosonic mode that has a linearly time-dependent frequency. We derive compact and tractable expressions for transition probabilities in terms of well-known special functions. In this form, our formulas are suitable for fast numerical calculations and analytical approximations. As an application, we obtain the semiclassical limit of the exact solution and compare it to prior approximations. Furthermore, we reveal a connection between the DTCM and q-deformed binomial statistics.

  2. Sampling Error in Relation to Cyst Nematode Population Density Estimation in Small Field Plots.

    PubMed

    Župunski, Vesna; Jevtić, Radivoje; Jokić, Vesna Spasić; Župunski, Ljubica; Lalošević, Mirjana; Ćirić, Mihajlo; Ćurčić, Živko

    2017-06-01

    Cyst nematodes are serious plant-parasitic pests that can cause severe yield losses and extensive damage. Since there is still very little information about the error of population density estimation in small field plots, this study contributes to the broad issue of population density assessment. It was shown that there was no significant difference between cyst counts from five or seven bulk samples taken per 1-m2 plot, if the average cyst count per examined plot exceeds 75 cysts per 100 g of soil. Goodness of fit of the data, tested with a χ2 test, confirmed a negative binomial distribution of cyst counts for 21 out of 23 plots. The recommended sampling precision of 17%, expressed through the coefficient of variation (cv), was achieved if 1-m2 plots contaminated with more than 90 cysts per 100 g of soil were sampled with 10-core bulk samples taken in five repetitions. If plots were contaminated with less than 75 cysts per 100 g of soil, 10-core bulk samples taken in seven repetitions gave a cv higher than 23%. This study indicates that more attention should be paid to the estimation of sampling error in experimental field plots to ensure more reliable estimation of the population density of cyst nematodes.
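
    For illustration, fitting a negative binomial to plot-level cyst counts by the method of moments and checking the fit with a χ2 test might look like the sketch below; the counts, class boundaries and sample size are invented, and the original study's estimation details may differ.

    ```python
    import numpy as np
    from scipy import stats

    # hypothetical cyst counts (per 100 g of soil) from repeated bulk samples of one plot
    counts = np.array([62, 88, 71, 95, 80, 104, 69, 77, 91, 85, 99, 73, 86, 92, 68])

    # Moment estimates of the negative binomial: var = mu + mu^2 / k
    mu, var = counts.mean(), counts.var(ddof=1)
    k = mu**2 / (var - mu) if var > mu else np.inf
    p = k / (k + mu)

    # Chi-square goodness of fit over a few integer count classes (lo, hi]
    cuts = [-1, 70, 80, 90, 10**6]
    obs = np.array([((counts > lo) & (counts <= hi)).sum()
                    for lo, hi in zip(cuts[:-1], cuts[1:])])
    exp = np.diff(stats.nbinom.cdf(cuts, k, p)) * counts.size
    chi2 = ((obs - exp) ** 2 / exp).sum()
    pval = stats.chi2.sf(chi2, df=len(obs) - 1 - 2)   # two parameters estimated
    print(f"mu={mu:.1f}, k={k:.1f}, chi2={chi2:.2f}, p={pval:.3f}")

    # Sampling precision of the plot mean, expressed as a coefficient of variation
    cv = 100 * np.sqrt(var / counts.size) / mu
    print(f"cv of the estimated mean: {cv:.0f}%")
    ```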

  3. Broad distribution spectrum from Gaussian to power law appears in stochastic variations in RNA-seq data.

    PubMed

    Awazu, Akinori; Tanabe, Takahiro; Kamitani, Mari; Tezuka, Ayumi; Nagano, Atsushi J

    2018-05-29

    Gene expression levels exhibit stochastic variations among genetically identical organisms under the same environmental conditions. In many recent transcriptome analyses based on RNA sequencing (RNA-seq), variations in gene expression levels among replicates were assumed to follow a negative binomial distribution, although the physiological basis of this assumption remains unclear. In this study, RNA-seq data were obtained from Arabidopsis thaliana under eight conditions (21-27 replicates), and the characteristics of gene-dependent empirical probability density function (ePDF) profiles of gene expression levels were analyzed. For A. thaliana and Saccharomyces cerevisiae, various types of ePDFs of gene expression levels were obtained and classified as Gaussian, power-law-like with a long tail, or intermediate between the two. These ePDF profiles were well fitted with a Gauss-power mixing distribution function derived from a simple model of a stochastic transcriptional network containing a feedback loop. The fitting function suggested that gene expression levels with long-tailed ePDFs would be strongly influenced by feedback regulation. Furthermore, the features of gene expression levels are correlated with their functions, with the levels of essential genes tending to follow a Gaussian-like ePDF while those of genes encoding nucleic acid-binding proteins and transcription factors exhibit long-tailed ePDFs.

  4. M-Bonomial Coefficients and Their Identities

    ERIC Educational Resources Information Center

    Asiru, Muniru A.

    2010-01-01

    In this note, we introduce M-bonomial coefficients (or M-bonacci binomial coefficients). These are similar to the binomial and Fibonomial (or Fibonacci-binomial) coefficients and can be displayed in a triangle similar to Pascal's triangle, from which some identities become obvious.

  5. Some characteristics of repeated sickness absence

    PubMed Central

    Ferguson, David

    1972-01-01

    Ferguson, D. (1972).Brit. J. industr. Med.,29, 420-431. Some characteristics of repeated sickness absence. Several studies have shown that frequency of absence attributed to sickness is not distributed randomly but tends to follow the negative binomial distribution, and this has been taken to support the concept of `proneness' to such absence. Thus, the distribution of sickness absence resembles that of minor injury at work demonstrated over 50 years ago. Because the investigation of proneness to absence does not appear to have been reported by others in Australia, the opportunity was taken, during a wider study of health among telegraphists in a large communications undertaking, to analyse some characteristics of repeated sickness absence. The records of medically certified and uncertified sickness absence of all 769 telegraphists continuously employed in all State capitals over a two-and-a-half-year period were compared with those of 411 clerks and 415 mechanics and, in Sydney, 380 mail sorters and 80 of their supervisors. All telegraphists in Sydney, Melbourne, and Brisbane, and all mail sorters in Sydney, who were available and willing were later medically examined. From their absence pattern repeaters (employees who had had eight or more certified absences in two and a half years) were separated into three types based on a presumptive origin in chance, recurrent disease and symptomatic non-specific disorder. The observed distribution of individual frequency of certified absence over the full two-and-a-half-year period of study followed that expected from the univariate negative binomial, using maximum likelihood estimators, rather than the poisson distribution, in three of the four occupational groups in Sydney. Limited correlational and bivariate analysis supported the interpretation of proneness ascribed to the univariate fit. In the two groups studied, frequency of uncertified absence could not be fitted by the negative binomial, although the numbers of such absences in individuals in successive years were relatively highly correlated. All types of repeater were commoner in Sydney than in the other capital city offices, which differed little from each other. Repeaters were more common among those whose absence was attributed to neurosis, alimentary and upper respiratory tract disorder, and injury. Out of more than 90 health, personal, social, and industrial attributes determined at examination, only two (ethanol habit and adverse attitude to pay) showed any statistically significant association when telegraphist repeaters in Sydney were compared with employees who were rarely absent. Though repeating tended to be associated with chronic or recurrent ill health revealed at examination, one quarter of repeaters had little such ill health and one quarter of rarely absent employees had much. It was concluded that, in the population studied, the fitting of the negative binomial to frequency of certified sickness absence could, in the circumstances of the study, reasonably be given an interpretation of proneness. In that population also repeating varies geographically and occupationally, and is poorly associated with disease and other attributes uncovered at examination, with the exception of the ethanol habit. Repeaters are more often neurotic than employees who are rarely absent but also are more often stable double jobbers. 
The repeater should be identified for what help may be given him, if needed, otherwise it would seem more profitable to attack those features in work design and organization which influence motivation to come to work. Social factors which predispose to repeated absence are less amenable to modification. PMID:4636662

  6. Traumatic Brain Injury among US Active Duty Military Personnel and Negative Drinking-Related Consequences

    PubMed Central

    Adams, Rachel Sayko; Larson, Mary Jo; Corrigan, John D.; Ritter, Grant A.; Williams, Thomas V.

    2013-01-01

    This study used the 2008 Department of Defense Survey of Health Related Behaviors among Active Duty Military Personnel to determine whether traumatic brain injury (TBI) is associated with past year drinking-related consequences. The study sample included currently-drinking personnel who had a combat deployment in the past year and were home for ≥6 months (N = 3,350). Negative binomial regression models were used to assess the incidence rate ratios of consequences, by TBI-level. Experiencing a TBI with a loss of consciousness >20 minutes was significantly associated with consequences independent of demographics, combat exposure, posttraumatic stress disorder, and binge drinking. The study’s limitations are noted. PMID:23869456

  7. Performance and structure of single-mode bosonic codes

    NASA Astrophysics Data System (ADS)

    Albert, Victor V.; Noh, Kyungjoo; Duivenvoorden, Kasper; Young, Dylan J.; Brierley, R. T.; Reinhold, Philip; Vuillot, Christophe; Li, Linshu; Shen, Chao; Girvin, S. M.; Terhal, Barbara M.; Jiang, Liang

    2018-03-01

    The early Gottesman, Kitaev, and Preskill (GKP) proposal for encoding a qubit in an oscillator has recently been followed by cat- and binomial-code proposals. Numerically optimized codes have also been proposed, and we introduce codes of this type here. These codes have yet to be compared using the same error model; we provide such a comparison by determining the entanglement fidelity of all codes with respect to the bosonic pure-loss channel (i.e., photon loss) after the optimal recovery operation. We then compare achievable communication rates of the combined encoding-error-recovery channel by calculating the channel's hashing bound for each code. Cat and binomial codes perform similarly, with binomial codes outperforming cat codes at small loss rates. Despite not being designed to protect against the pure-loss channel, GKP codes significantly outperform all other codes for most values of the loss rate. We show that the performance of GKP and some binomial codes increases monotonically with increasing average photon number of the codes. In order to corroborate our numerical evidence of the cat-binomial-GKP order of performance occurring at small loss rates, we analytically evaluate the quantum error-correction conditions of those codes. For GKP codes, we find an essential singularity in the entanglement fidelity in the limit of vanishing loss rate. In addition to comparing the codes, we draw parallels between binomial codes and discrete-variable systems. First, we characterize one- and two-mode binomial as well as multiqubit permutation-invariant codes in terms of spin-coherent states. Such a characterization allows us to introduce check operators and error-correction procedures for binomial codes. Second, we introduce a generalization of spin-coherent states, extending our characterization to qudit binomial codes and yielding a multiqudit code.

  8. Associations among habitat characteristics and meningeal worm prevalence in eastern South Dakota, USA

    USGS Publications Warehouse

    Jacques, Christopher N.; Jenks, Jonathan A.; Klaver, Robert W.; Dubay, Shelli A.

    2017-01-01

    Few studies have evaluated how wetland and forest characteristics influence the prevalence of meningeal worm (Parelaphostrongylus tenuis) infection of deer throughout the grassland biome of central North America. We used previously collected, county-level prevalence data to evaluate associations between habitat characteristics and probability of meningeal worm infection in white-tailed deer (Odocoileus virginianus) across eastern South Dakota, US. The highest-ranked binomial regression model for detecting probability of meningeal worm infection was spring temperature + summer precipitation + percent wetland; the weight of evidence (wi = 0.71) favored this model over alternative models, though predictive capability was low (receiver operating characteristic = 0.62). Probability of meningeal worm infection increased by 1.3- and 1.6-fold for each 1-cm and 1 °C increase in summer precipitation and spring temperature, respectively. Similarly, probability of infection increased 1.2-fold for each 1% increase in wetland habitat. Our findings highlight the importance of wetland habitat in predicting meningeal worm infection across eastern South Dakota. Future research is warranted to evaluate the relationships between climatic conditions (e.g., drought, wet cycles) and deer habitat selection in maintaining P. tenuis along the western boundary of the parasite's range.

  9. Resources predicting positive and negative affect during the experience of stress: a study of older Asian Indian immigrants in the United States.

    PubMed

    Diwan, Sadhna; Jonnalagadda, Satya S; Balaswamy, Shantha

    2004-10-01

    Using the life stress model of psychological well-being, in this study we examined risks and resources predicting the occurrence of both positive and negative affect among older Asian Indian immigrants who experienced stressful life events. We collected data through a telephone survey of 226 respondents (aged 50 years and older) in the Southeastern United States. We used hierarchical, negative binomial regression analyses to examine correlates of positive and negative affect. Different coping resources influenced positive and negative affect when stressful life events were controlled for. Being female was a common risk factor for poorer positive and increased negative affect. Satisfaction with friendships and a cultural or ethnic identity that is either bicultural or more American were predictive of greater positive affect. Greater religiosity and increased mastery were resources predicting less negative affect. Cognitive and structural interventions that increase opportunities for social integration, increasing mastery, and addressing spiritual concerns are discussed as ways of coping with stress to improve the well-being of individuals in this immigrant community.

  10. Negative values of quasidistributions and quantum wave and number statistics

    NASA Astrophysics Data System (ADS)

    Peřina, J.; Křepelka, J.

    2018-04-01

    We consider nonclassical wave and number quantum statistics, and perform a decomposition of quasidistributions for nonlinear optical down-conversion processes using Bessel functions. We show that negative values of the quasidistribution do not directly represent probabilities; however, they directly influence measurable number statistics. Negative terms in the decomposition related to the nonclassical behavior with negative amplitudes of probability can be interpreted as positive amplitudes of probability in the negative orthogonal Bessel basis, whereas positive amplitudes of probability in the positive basis describe classical cases. However, probabilities are positive in all cases, including negative values of quasidistributions. Negative and positive contributions of decompositions to quasidistributions are estimated. The approach can be adapted to quantum coherence functions.

  11. Distribution of chewing lice upon the polygynous peacock Pavo cristatus.

    PubMed

    Stewart, I R; Clark, F; Petrie, M

    1996-04-01

    An opportunistic survey of louse distribution upon the peacock Pavo cristatus was undertaken following a cull of 23 birds from an English zoo. After complete skin and feather dissolution, 2 species of lice were retrieved, Goniodes pavonis and Amyrsidea minuta. The distribution of both louse species could be described by a negative binomial model. The significance of this is discussed in relation to transmission dynamics of lice in the atypical avian mating system found in the peacock, which involves no male parental care.

  12. Ecological and pest-management implications of sex differences in scarab landing patterns on grape vines

    PubMed Central

    Boyer, Stéphane; Lefort, Marie-Caroline; Nboyine, Jerry; Wratten, Steve D.

    2017-01-01

    Background Melolonthinae beetles, comprising different white grub species, are a globally-distributed pest group. Their larvae feed on roots of several crop and forestry species, and adults can cause severe defoliation. In New Zealand, the endemic scarab pest Costelytra zealandica (White) causes severe defoliation on different horticultural crops, including grape vines (Vitis vinifera). Understanding flight and landing behaviours of this pest can help inform pest management decisions. Methods Adult beetles were counted and then removed from 96 grape vine plants from 21:30 until 23:00 h, every day from October 26 until December 2, during 2014 and 2015. Also, adults were removed from the grape vine foliage at dusk 5, 10, 15, 20 and 25 min after flight started on 2015. Statistical analyses were performed using generalised linear models with a beta-binomial distribution to analyse proportions and with a negative binomial distribution for beetle abundance. Results By analysing C. zealandica sex ratios during its entire flight season, it is clear that the proportion of males is higher at the beginning of the season, gradually declining towards its end. When adults were successively removed from the grape vines at 5-min intervals after flight activity begun, the mean proportion of males ranged from 6–28%. The male proportion suggests males were attracted to females that had already landed on grape vines, probably through pheromone release. Discussion The seasonal and daily changes in adult C. zealandica sex ratio throughout its flight season are presented for the first time. Although seasonal changes in sex ratio have been reported for other melolonthines, changes during their daily flight activity have not been analysed so far. Sex-ratio changes can have important consequences for the management of this pest species, and possibly for other melolonthines, as it has been previously suggested that C. zealandica females land on plants that produce a silhouette against the sky. Therefore, long-term management might evaluate the effect of different plant heights and architecture on female melolonthine landing patterns, with consequences for male distribution, and subsequently overall damage within horticultural areas. PMID:28462026

  13. Ecological and pest-management implications of sex differences in scarab landing patterns on grape vines.

    PubMed

    González-Chang, Mauricio; Boyer, Stéphane; Lefort, Marie-Caroline; Nboyine, Jerry; Wratten, Steve D

    2017-01-01

    Melolonthinae beetles, comprising different white grub species, are a globally-distributed pest group. Their larvae feed on roots of several crop and forestry species, and adults can cause severe defoliation. In New Zealand, the endemic scarab pest Costelytra zealandica (White) causes severe defoliation on different horticultural crops, including grape vines (Vitis vinifera). Understanding flight and landing behaviours of this pest can help inform pest management decisions. Adult beetles were counted and then removed from 96 grape vine plants from 21:30 until 23:00 h, every day from October 26 until December 2, during 2014 and 2015. Also, adults were removed from the grape vine foliage at dusk 5, 10, 15, 20 and 25 min after flight started in 2015. Statistical analyses were performed using generalised linear models with a beta-binomial distribution to analyse proportions and with a negative binomial distribution for beetle abundance. By analysing C. zealandica sex ratios during its entire flight season, it is clear that the proportion of males is higher at the beginning of the season, gradually declining towards its end. When adults were successively removed from the grape vines at 5-min intervals after flight activity began, the mean proportion of males ranged from 6-28%. The male proportion suggests males were attracted to females that had already landed on grape vines, probably through pheromone release. The seasonal and daily changes in adult C. zealandica sex ratio throughout its flight season are presented for the first time. Although seasonal changes in sex ratio have been reported for other melolonthines, changes during their daily flight activity have not been analysed so far. Sex-ratio changes can have important consequences for the management of this pest species, and possibly for other melolonthines, as it has been previously suggested that C. zealandica females land on plants that produce a silhouette against the sky. Therefore, long-term management might evaluate the effect of different plant heights and architecture on female melolonthine landing patterns, with consequences for male distribution, and subsequently overall damage within horticultural areas.

  14. Developing safety performance functions incorporating reliability-based risk measures.

    PubMed

    Ibrahim, Shewkar El-Bassiouni; Sayed, Tarek

    2011-11-01

    Current geometric design guides provide deterministic standards where the safety margin of the design output is generally unknown and there is little knowledge of the safety implications of deviating from these standards. Several studies have advocated probabilistic geometric design where reliability analysis can be used to account for the uncertainty in the design parameters and to provide a risk measure of the implication of deviation from design standards. However, there is currently no link between measures of design reliability and the quantification of safety using collision frequency. The analysis presented in this paper attempts to bridge this gap by incorporating a reliability-based quantitative risk measure such as the probability of non-compliance (P(nc)) in safety performance functions (SPFs). Establishing this link will allow admitting reliability-based design into traditional benefit-cost analysis and should lead to a wider application of the reliability technique in road design. The present application is concerned with the design of horizontal curves, where the limit state function is defined in terms of the available (supply) and stopping (demand) sight distances. A comprehensive collision and geometric design database of two-lane rural highways is used to investigate the effect of the probability of non-compliance on safety. The reliability analysis was carried out using the First Order Reliability Method (FORM). Two Negative Binomial (NB) SPFs were developed to compare models with and without the reliability-based risk measures. It was found that models incorporating the P(nc) provided a better fit to the data set than the traditional (without risk) NB SPFs for total, injury and fatality (I+F) and property damage only (PDO) collisions. Copyright © 2011 Elsevier Ltd. All rights reserved.

  15. Evaluating the drivers of and obstacles to the willingness to use cognitive enhancement drugs: the influence of drug characteristics, social environment, and personal characteristics.

    PubMed

    Sattler, Sebastian; Mehlkop, Guido; Graeff, Peter; Sauer, Carsten

    2014-02-01

    The use of cognitive enhancement (CE) by means of pharmaceutical agents has been the subject of intense debate both among scientists and in the media. This study investigates several drivers of and obstacles to the willingness to use prescription drugs non-medically for augmenting brain capacity. We conducted a web-based study among 2,877 students from randomly selected disciplines at German universities. Using a factorial survey, respondents expressed their willingness to take various hypothetical CE-drugs; the drugs were described by five experimentally varied characteristics and the social environment by three varied characteristics. Personal characteristics and demographic controls were also measured. We found that 65.3% of the respondents staunchly refused to use CE-drugs. The results of a multivariate negative binomial regression indicated that respondents' willingness to use CE-drugs increased if the potential drugs promised a significant augmentation of mental capacity and a high probability of achieving this augmentation. Willingness decreased when there was a high probability of side effects and a high price. Prevalent CE-drug use among peers increased willingness, whereas a social environment that strongly disapproved of these drugs decreased it. Regarding the respondents' characteristics, pronounced academic procrastination, high cognitive test anxiety, low intrinsic motivation, low internalization of social norms against CE-drug use, and past experiences with CE-drugs increased willingness. The potential severity of side effects, social recommendations about using CE-drugs, risk preferences, and competencies had no measured effects upon willingness. These findings contribute to understanding factors that influence the willingness to use CE-drugs. They support the assumption of instrumental drug use and may contribute to the development of prevention, policy, and educational strategies.
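
    A multivariate negative binomial regression of the kind reported here can be fitted with standard software. The sketch below uses a synthetic data set and hypothetical predictor names (effect_size, side_effect_prob, peer_use) standing in for the vignette dimensions; it is not the study's data or model specification.

```python
# Minimal sketch of a negative binomial regression on synthetic data
# (hypothetical variable names; not the study's data set).
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(0)
n = 500
df = pd.DataFrame({
    "effect_size": rng.normal(size=n),        # promised cognitive gain
    "side_effect_prob": rng.uniform(size=n),  # probability of side effects
    "peer_use": rng.integers(0, 2, size=n),   # peers use CE drugs (0/1)
})
mu = np.exp(0.3 + 0.5 * df.effect_size - 0.8 * df.side_effect_prob + 0.4 * df.peer_use)
df["willingness"] = rng.negative_binomial(n=2.0, p=2.0 / (2.0 + mu))

model = smf.negativebinomial(
    "willingness ~ effect_size + side_effect_prob + peer_use", data=df
).fit(disp=0)
print(model.summary())
print(np.exp(model.params))  # exponentiated coefficients (rate ratios); the last row is the dispersion term alpha
```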

  16. Evaluating the drivers of and obstacles to the willingness to use cognitive enhancement drugs: the influence of drug characteristics, social environment, and personal characteristics

    PubMed Central

    2014-01-01

    Background The use of cognitive enhancement (CE) by means of pharmaceutical agents has been the subject of intense debate both among scientists and in the media. This study investigates several drivers of and obstacles to the willingness to use prescription drugs non-medically for augmenting brain capacity. Methods We conducted a web-based study among 2,877 students from randomly selected disciplines at German universities. Using a factorial survey, respondents expressed their willingness to take various hypothetical CE-drugs; the drugs were described by five experimentally varied characteristics and the social environment by three varied characteristics. Personal characteristics and demographic controls were also measured. Results We found that 65.3% of the respondents staunchly refused to use CE-drugs. The results of a multivariate negative binomial regression indicated that respondents’ willingness to use CE-drugs increased if the potential drugs promised a significant augmentation of mental capacity and a high probability of achieving this augmentation. Willingness decreased when there was a high probability of side effects and a high price. Prevalent CE-drug use among peers increased willingness, whereas a social environment that strongly disapproved of these drugs decreased it. Regarding the respondents’ characteristics, pronounced academic procrastination, high cognitive test anxiety, low intrinsic motivation, low internalization of social norms against CE-drug use, and past experiences with CE-drugs increased willingness. The potential severity of side effects, social recommendations about using CE-drugs, risk preferences, and competencies had no measured effects upon willingness. Conclusions These findings contribute to understanding factors that influence the willingness to use CE-drugs. They support the assumption of instrumental drug use and may contribute to the development of prevention, policy, and educational strategies. PMID:24484640

  17. The effect of albendazole treatment on seizure outcomes in patients with symptomatic neurocysticercosis.

    PubMed

    Romo, Matthew L; Wyka, Katarzyna; Carpio, Arturo; Leslie, Denise; Andrews, Howard; Bagiella, Emilia; Hauser, W Allen; Kelvin, Elizabeth A

    2015-11-01

    Randomized controlled trials have found an inconsistent effect of anthelmintic treatment on long-term seizure outcomes in neurocysticercosis. The objective of this study was to further explore the effect of albendazole treatment on long-term seizure outcomes and to determine if there is evidence for a differential effect by seizure type. In this trial, 178 patients with active or transitional neurocysticercosis cysts and new-onset symptoms were randomized to 8 days of treatment with albendazole (n=88) or placebo (n=90), both with prednisone, and followed for 24 months. We used negative binomial regression and logistic regression models to determine the effect of albendazole on the number of seizures and probability of recurrent or new-onset seizures, respectively, over follow-up. Treatment with albendazole was associated with a reduction in the number of seizures during 24 months of follow-up, but this was only significant for generalized seizures during months 1-12 (unadjusted rate ratio [RR] 0.19; 95% CI: 0.04-0.91) and months 1-24 (unadjusted RR 0.06; 95% CI: 0.01-0.57). We did not detect a significant effect of albendazole on reducing the number of focal seizures or on the probability of having a seizure, regardless of seizure type or time period. Albendazole treatment may be associated with some symptomatic improvement; however, this association seems to be specific to generalized seizures. Future research is needed to identify strategies to better reduce long-term seizure burden in patients with neurocysticercosis. © The Author 2015. Published by Oxford University Press on behalf of Royal Society of Tropical Medicine and Hygiene. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  18. Problems on Divisibility of Binomial Coefficients

    ERIC Educational Resources Information Center

    Osler, Thomas J.; Smoak, James

    2004-01-01

    Twelve unusual problems involving divisibility of the binomial coefficients are presented in this article. The problems are listed in "The Problems" section. All twelve problems have short solutions, which are listed in "The Solutions" section. These problems could be assigned to students in any course in which the binomial theorem and Pascal's…

  19. Application of binomial-edited CPMG to shale characterization

    USGS Publications Warehouse

    Washburn, Kathryn E.; Birdwell, Justin E.

    2014-01-01

    Unconventional shale resources may contain a significant amount of hydrogen in organic solids such as kerogen, but it is not possible to directly detect these solids with many NMR systems. Binomial-edited pulse sequences capitalize on magnetization transfer between solids, semi-solids, and liquids to provide an indirect method of detecting solid organic materials in shales. When the organic solids can be directly measured, binomial-editing helps distinguish between different phases. We applied a binomial-edited CPMG pulse sequence to a range of natural and experimentally-altered shale samples. The most substantial signal loss is seen in shales rich in organic solids while fluids associated with inorganic pores seem essentially unaffected. This suggests that binomial-editing is a potential method for determining fluid locations, solid organic content, and kerogen–bitumen discrimination.

  20. A Three-Parameter Generalisation of the Beta-Binomial Distribution with Applications

    DTIC Science & Technology

    1987-07-01

    York. Rust, R.T. and Klompmaker, J.E. (1981). Improving the estimation procedure for the beta binomial t.v. exposure model. Journal of Marketing Research, 18, 442-448. Sabavala, D.J. and Morrison, D.G. (1977). Television show loyalty: a beta-binomial model using recall data. Journal of Advertising

  1. Can Bayesian models play a role in dental caries epidemiology? Evidence from an application to the BELCAP data set.

    PubMed

    Matranga, Domenica; Firenze, Alberto; Vullo, Angela

    2013-10-01

    The aim of this study was to show the potential of Bayesian analysis in statistical modelling of dental caries data. Because of the bounded nature of the dmft (DMFT) index, zero-inflated binomial (ZIB) and beta-binomial (ZIBB) models were considered. The effects of incorporating prior information available about the parameters of the models were also shown. The data set used in this study was the Belo Horizonte Caries Prevention (BELCAP) study (Böhning et al. (1999)), consisting of five variables collected among 797 Brazilian school children and designed to evaluate four programmes for reducing caries. Only the eight primary molar teeth were considered in the data set. A data augmentation algorithm was used for estimation. Firstly, noninformative priors were used to express our lack of knowledge about the regression parameters. Secondly, prior information about the probability of being a structural zero dmft and the probability of being caries affected in the subpopulation of susceptible children was incorporated. With noninformative priors, the best fitting model was the ZIBB. Education (OR = 0.76, 95% CrI: 0.59, 0.99), all interventions (OR = 0.46, 95% CrI: 0.35, 0.62), rinsing (OR = 0.61, 95% CrI: 0.47, 0.80) and hygiene (OR = 0.65, 95% CrI: 0.49, 0.86) were demonstrated to be factors protecting children from being caries affected. Being male increased the probability of being caries affected (OR = 1.19, 95% CrI: 1.01, 1.42). However, after incorporating informative priors, the ZIB models' estimates were not influenced, while the ZIBB models reduced deviance and confirmed the association with all interventions and rinsing only. In our application, Bayesian estimates showed accuracy and precision similar to likelihood-based estimates, although they offered many computational advantages and the possibility of expressing all forms of uncertainty in terms of probability. The overdispersion parameter may explain why the introduction of prior information had significant effects on the parameters of the ZIBB model, while ZIB estimates remained unchanged. Finally, the ZIBB model outperformed the ZIB model in capturing the overdispersion in the data. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  2. Jump-and-return sandwiches: A new family of binomial-like selective inversion sequences with improved performance

    NASA Astrophysics Data System (ADS)

    Brenner, Tom; Chen, Johnny; Stait-Gardner, Tim; Zheng, Gang; Matsukawa, Shingo; Price, William S.

    2018-03-01

    A new family of binomial-like inversion sequences, named jump-and-return sandwiches (JRS), has been developed by inserting a binomial-like sequence into a standard jump-and-return sequence, discovered through use of a stochastic Genetic Algorithm optimisation. Compared to currently used binomial-like inversion sequences (e.g., 3-9-19 and W5), the new sequences afford wider inversion bands and narrower non-inversion bands with an equal number of pulses. As an example, two jump-and-return sandwich 10-pulse sequences achieved 95% inversion at offsets corresponding to 9.4% and 10.3% of the non-inversion band spacing, compared to 14.7% for the binomial-like W5 inversion sequence, i.e., they afforded non-inversion bands about two thirds the width of the W5 non-inversion band.

  3. Relating river discharge and water temperature to the recruitment of age‐0 White Sturgeon (Acipenser transmontanus Richardson, 1836) in the Columbia River using over‐dispersed catch data

    USGS Publications Warehouse

    Counihan, Timothy D.; Chapman, Colin G.

    2018-01-01

    The goals were to (i) determine if river discharge and water temperature during various early life history stages were predictors of age‐0 White Sturgeon, Acipenser transmontanus, recruitment, and (ii) provide an example of how over‐dispersed catch data, including data with many zero observations, can be used to better understand the effects of regulated rivers on the productivity of depressed sturgeon populations. An information theoretic approach was used to develop and select negative binomial and zero‐inflated negative binomial models that model the relation of age‐0 White Sturgeon survey data from three contiguous Columbia River reservoirs to river discharge and water temperature during spawning, egg incubation, larval, and post‐larval phases. Age‐0 White Sturgeon were collected with small mesh gill nets in The Dalles and John Day reservoirs from 1997 to 2014 and a bottom trawl in Bonneville Reservoir from 1989 to 2006. Results suggest that seasonal river discharge was positively correlated with age‐0 recruitment; notably that discharge, 16 June–31 July was positively correlated to age‐0 recruitment in all three reservoirs. The best approximating models for two of the three reservoirs also suggest that seasonal water temperature may be a determinant of age‐0 recruitment. Our research demonstrates how over‐dispersed catch data can be used to better understand the effects of environmental conditions on sturgeon populations caused by the construction and operation of dams.
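
    The information-theoretic comparison of negative binomial and zero-inflated negative binomial models described here can be sketched as follows; the catch counts and the single covariate (discharge) are synthetic placeholders rather than the survey data, and the inflation part is intercept-only for brevity.

```python
# Minimal sketch: compare NB and zero-inflated NB fits to zero-heavy catch data by AIC
# (synthetic data; "discharge" is a hypothetical stand-in covariate).
import numpy as np
import statsmodels.api as sm
from statsmodels.discrete.count_model import ZeroInflatedNegativeBinomialP

rng = np.random.default_rng(1)
n = 400
discharge = rng.normal(size=n)
mu = np.exp(0.2 + 0.6 * discharge)
counts = rng.negative_binomial(n=1.5, p=1.5 / (1.5 + mu))
counts[rng.uniform(size=n) < 0.35] = 0   # add structural zeros (e.g. sets that could not catch fish)

X = sm.add_constant(discharge)
nb = sm.NegativeBinomial(counts, X).fit(method="bfgs", maxiter=500, disp=0)
zinb = ZeroInflatedNegativeBinomialP(counts, X, exog_infl=np.ones((n, 1)), p=2).fit(
    method="bfgs", maxiter=500, disp=0
)
print(f"NB AIC = {nb.aic:.1f}, ZINB AIC = {zinb.aic:.1f}")  # lower AIC indicates better support
```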

  4. [Epidemiology of scrub typhus and influencing factors in Yunnan province, 2006-2013].

    PubMed

    Sun, Y; Shi, C; Li, X L; Fang, L Q; Cao, W C

    2018-01-10

    Objective: To understand the epidemiological characteristics of scrub typhus and influencing factors in Yunnan province, and provide further information for the prevention and control of scrub typhus. Methods: Based on the incidence data of scrub typhus reported in Yunnan from 2006 to 2013, the epidemiological characteristics of scrub typhus were analyzed and related environmental factors were identified with a panel negative binomial regression model. Results: A total of 8 980 scrub typhus cases were reported during 2006-2013 in Yunnan. The average annual incidence was 2.46/100 000, with an uptrend observed. Natural focus expansion was found, affecting 71.3% of the counties in 2013. The epidemic mainly occurred in summer and autumn with the incidence peak during July-October. The annual incidence was higher in females than in males. More cases occurred in children and farmers, and the proportions of cases in farmers and pre-school aged children showed an obvious increase. The panel negative binomial regression model indicated that the transmission risk of scrub typhus was positively associated with monthly temperature and monthly relative humidity. Furthermore, a "U" pattern between the risk and the increased coverage of cropland and grassland as well as an "inverted-U" pattern between the risk and increased coverage of shrub were observed. Conclusion: It is necessary to strengthen the scrub typhus surveillance in warm and moist areas as well as the areas with high coverage of cropland and grassland in Yunnan, and the health education in children and farmers who are at high risk.

  5. Use of clinical practice guidelines by dentists: findings from the Japanese dental practice-based research network.

    PubMed

    Kakudate, Naoki; Yokoyama, Yoko; Sumida, Futoshi; Matsumoto, Yuki; Gordan, Valeria V; Gilbert, Gregg H

    2017-02-01

    The objectives of this study were to: (1) examine differences in the use of dental clinical practice guidelines among Japanese dentists, and (2) identify characteristics associated with the number of guidelines used by participating dentists. We conducted a cross-sectional study consisting of a questionnaire survey in Japan between July 2014 and May 2015. The study queried dentists working in outpatient dental practices who are affiliated with the Dental Practice-Based Research Network Japan (n = 148). They were asked whether they have used each of 15 Japanese dental clinical guidelines. Associations between the number of guidelines used by participants and specific characteristics were analysed via negative binomial regression analysis. The mean number of guidelines used by participating dentists was 2.5 ± 2.9 [standard deviation (SD)]. Rate of use of guidelines showed substantial variation, from 5% to 34% among dentists. The proportion of dentists that used guidelines was the highest among oral medicine specialists, who had the highest proportion for 10 of 15 guidelines. Negative binomial regression analysis identified three factors significantly associated with the number of guidelines used: 'years since graduation from dental school', 'specialty practice' and 'practice busyness'. These results suggest that the use of clinical practice guidelines by Japanese dentists may still be inadequate. Training in the use of the guidelines could be given to dental students as undergraduate education and to young clinicians as continuing education. © 2016 John Wiley & Sons, Ltd.

  6. A preliminary investigation of the relationships between historical crash and naturalistic driving.

    PubMed

    Pande, Anurag; Chand, Sai; Saxena, Neeraj; Dixit, Vinayak; Loy, James; Wolshon, Brian; Kent, Joshua D

    2017-04-01

    This paper describes a project that was undertaken using naturalistic driving data collected via Global Positioning System (GPS) devices to demonstrate a proof-of-concept for proactive safety assessments of crash-prone locations. The main hypothesis for the study is that the segments where drivers have to apply hard braking (higher jerks) more frequently might be the "unsafe" segments with more crashes over the long term. The linear referencing methodology in ArcMap was used to link the GPS data with roadway characteristic data of US Highway 101 northbound (NB) and southbound (SB) in San Luis Obispo, California. The process used to merge GPS data with quarter-mile freeway segments for traditional crash frequency analysis is also discussed in the paper. Negative binomial regression analyses showed that the proportion of high-magnitude jerks while decelerating on freeway segments (from the driving data) was significantly related to the long-term crash frequency of those segments. A random parameter negative binomial model with a uniformly distributed parameter for ADT and a fixed parameter for jerk provided a statistically significant estimate for quarter-mile segments. The results also indicated that roadway curvature and the presence of an auxiliary lane are not significantly related to crash frequency for the highway segments under consideration. The results from this exploration are promising since the data used to derive the explanatory variable(s) can be collected using most off-the-shelf GPS devices, including many smartphones. Copyright © 2017 Elsevier Ltd. All rights reserved.

  7. GENE-Counter: A Computational Pipeline for the Analysis of RNA-Seq Data for Gene Expression Differences

    PubMed Central

    Di, Yanming; Schafer, Daniel W.; Wilhelm, Larry J.; Fox, Samuel E.; Sullivan, Christopher M.; Curzon, Aron D.; Carrington, James C.; Mockler, Todd C.; Chang, Jeff H.

    2011-01-01

    GENE-counter is a complete Perl-based computational pipeline for analyzing RNA-Sequencing (RNA-Seq) data for differential gene expression. In addition to its use in studying transcriptomes of eukaryotic model organisms, GENE-counter is applicable for prokaryotes and non-model organisms without an available genome reference sequence. For alignments, GENE-counter is configured for CASHX, Bowtie, and BWA, but an end user can use any Sequence Alignment/Map (SAM)-compliant program of preference. To analyze data for differential gene expression, GENE-counter can be run with any one of three statistics packages that are based on variations of the negative binomial distribution. The default method is a new and simple statistical test we developed based on an over-parameterized version of the negative binomial distribution. GENE-counter also includes three different methods for assessing differentially expressed features for enriched gene ontology (GO) terms. Results are transparent and data are systematically stored in a MySQL relational database to facilitate additional analyses as well as quality assessment. We used next generation sequencing to generate a small-scale RNA-Seq dataset derived from the heavily studied defense response of Arabidopsis thaliana and used GENE-counter to process the data. Collectively, the support from analysis of microarrays as well as the observed and substantial overlap in results from each of the three statistics packages demonstrates that GENE-counter is well suited for handling the unique characteristics of small sample sizes and high variability in gene counts. PMID:21998647

  8. Optimizing probability of detection point estimate demonstration

    NASA Astrophysics Data System (ADS)

    Koshti, Ajay M.

    2017-04-01

    The paper discusses optimizing probability of detection (POD) demonstration experiments using the point estimate method. The optimization is performed to provide an acceptable value for the probability of passing the demonstration (PPD) and an acceptable value for the probability of false (POF) calls while keeping the flaw sizes in the set as small as possible. The POD point estimate method is used by NASA for qualifying special NDE procedures. The point estimate method uses the binomial distribution for the probability density. Normally, a set of 29 flaws of the same size within some tolerance is used in the demonstration. Traditionally, the largest flaw size in the set is considered to be a conservative estimate of the flaw size with minimum 90% probability and 95% confidence. The flaw size is denoted as α90/95PE. The paper investigates how the range of flaw sizes relates to α90, i.e. the 90% probability flaw size, so as to provide a desired PPD. The range of flaw sizes is expressed as a proportion of the standard deviation of the probability density distribution. The difference between the median or average of the 29 flaws and α90 is also expressed as a proportion of the standard deviation of the probability density distribution. In general, it is concluded that, if the probability of detection increases with flaw size, the average of the 29 flaw sizes will always be larger than or equal to α90 and is an acceptable measure of α90/95PE. If the NDE technique has sufficient sensitivity and signal-to-noise ratio, then the 29-flaw set can be optimized to meet requirements of minimum required PPD, maximum allowable POF, requirements on flaw size tolerance about the mean flaw size and flaw size detectability requirements. The paper provides a procedure for optimizing flaw sizes in the point estimate demonstration flaw set.
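
    The binomial arithmetic behind the 29-flaw point-estimate demonstration can be checked directly: if the true POD at the demonstrated size were only 0.90, the chance of detecting all 29 flaws is just under 5%, which is what gives the 90/95 interpretation. The numbers below are a worked illustration of that logic, not a reproduction of the paper's optimization.

```python
# Worked check of the 29-of-29 point-estimate logic using the binomial distribution.
from scipy.stats import binom

n_flaws = 29
p_true = 0.90                                      # hypothesised true probability of detection
p_all_hits = binom.pmf(n_flaws, n_flaws, p_true)   # = 0.90**29
print(f"P(29/29 detections | POD = 0.90) = {p_all_hits:.4f}")  # ~0.047 < 0.05

# Probability of passing the demonstration (PPD) for techniques with higher true POD:
for pod in (0.95, 0.98, 0.995):
    print(f"POD = {pod:.3f}: PPD = {binom.pmf(n_flaws, n_flaws, pod):.3f}")
```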

  9. Revealing Word Order: Using Serial Position in Binomials to Predict Properties of the Speaker

    ERIC Educational Resources Information Center

    Iliev, Rumen; Smirnova, Anastasia

    2016-01-01

    Three studies test the link between word order in binomials and psychological and demographic characteristics of a speaker. While linguists have already suggested that psychological, cultural and societal factors are important in choosing word order in binomials, the vast majority of relevant research was focused on general factors and on broadly…

  10. Negative impact of oral health conditions on oral health related quality of life of community dwelling elders in Mexico city, a population based study.

    PubMed

    Castrejón-Pérez, Roberto Carlos; Borges-Yáñez, S Aída; Irigoyen-Camacho, Ma Esther; Cruz-Hervert, Luis Pablo

    2017-05-01

    Oral health in old persons is frequently poor; non-functional prostheses are common and negatively affect quality of life. The objective of this study was to estimate the impact of oral health problems on oral health related quality of life in a sample of home dwelling Mexican elders. Household survey in 655 persons 70 years old and over residing in one county in Mexico City. Oral Health Related Quality of Life (Short version of the Oral Health Impact Profile validated in Mexico-OHIP-14-sp), self-perception of general and oral health, xerostomia, utilization of dental services, utilization and functionality of removable dental prostheses, dental and periodontal conditions, age, gender, marital status, schooling, depression, cognitive impairment and independence in activities of daily living (ADL). A negative binomial regression model was fitted. Mean age was 79.2 ± 7.1 years; 54.2% were women. Mean OHIP-14-Sp score was 6.8 ± 8.7, median was 4. The final model showed that men (RR = 1.30); persons with xerostomia (RR = 1.41); no utilization of removable prostheses (RR = 1.55); utilization of non-functional removable prostheses (RR = 1.69); fair self-perception of general health (RR = 1.34); equal (RR = 1.43) or worse (RR = 2.32) self-perception of oral health compared with persons of the same age; and being dependent for at least one ADL (RR = 1.71) increased the probability of higher scores of the OHIP-14-sp. Age, schooling, depression, cognitive impairment and periodontal conditions showed no association. Oral rehabilitation can improve quality of life, health education and health promotion for the elder and their caregivers may reduce the risk of dental problems. Geriatr Gerontol Int 2017; 17: 744-752. © 2016 Japan Geriatrics Society.

  11. Jump-and-return sandwiches: A new family of binomial-like selective inversion sequences with improved performance.

    PubMed

    Brenner, Tom; Chen, Johnny; Stait-Gardner, Tim; Zheng, Gang; Matsukawa, Shingo; Price, William S

    2018-03-01

    A new family of binomial-like inversion sequences, named jump-and-return sandwiches (JRS), has been developed by inserting a binomial-like sequence into a standard jump-and-return sequence, discovered through use of a stochastic Genetic Algorithm optimisation. Compared to currently used binomial-like inversion sequences (e.g., 3-9-19 and W5), the new sequences afford wider inversion bands and narrower non-inversion bands with an equal number of pulses. As an example, two jump-and-return sandwich 10-pulse sequences achieved 95% inversion at offsets corresponding to 9.4% and 10.3% of the non-inversion band spacing, compared to 14.7% for the binomial-like W5 inversion sequence, i.e., they afforded non-inversion bands about two thirds the width of the W5 non-inversion band. Copyright © 2018 Elsevier Inc. All rights reserved.

  12. Negative Urgency, Distress Tolerance, and Substance Abuse Among College Students

    PubMed Central

    Kaiser, Alison J.; Milich, Richard; Lynam, Donald R.; Charnigo, Richard J.

    2012-01-01

    Objective Negative affect has been consistently linked with substance use/problems in prior research. The present study sought to build upon these findings by exploring how an individual’s characteristic responding to negative affect impacts substance abuse risk. Trait negative affect was examined in relation to substance abuse outcomes along with two variables tapping into response to negative affect: Distress Tolerance, an individual’s perceived ability to tolerate negative affect, and Negative Urgency, the tendency to act rashly while experiencing distress. Method Participants were 525 first-year college students (48.1% male, 81.1% Caucasian), who completed self-report measures assessing personality traits and alcohol-related problems, and a structured interview assessing past and current substance use. Relations were tested using Zero-Inflated Negative Binomial regression models, and each of the personality variables was tested in a model on its own, and in a model where all three traits were accounted for. Results Negative Urgency emerged as the best predictor, relating to every one of the substance use outcome variables even when trait negative affect and Distress Tolerance were accounted for. Conclusions These findings suggest that Negative Urgency is an important factor to consider in developing prevention and intervention efforts aimed at reducing substance use and problems. PMID:22698894

  13. Sensitivity and specificity of dopamine transporter imaging with 123I-FP-CIT SPECT in dementia with Lewy bodies: a phase III, multicentre study.

    PubMed

    McKeith, Ian; O'Brien, John; Walker, Zuzana; Tatsch, Klaus; Booij, Jan; Darcourt, Jacques; Padovani, Alessandro; Giubbini, Raffaele; Bonuccelli, Ubaldo; Volterrani, Duccio; Holmes, Clive; Kemp, Paul; Tabet, Naji; Meyer, Ines; Reininger, Cornelia

    2007-04-01

    Dementia with Lewy bodies (DLB) needs to be distinguished from other types of dementia because of important differences in patient management and outcome. Current clinically based diagnostic criteria for DLB have limited accuracy. Severe nigrostriatal dopaminergic degeneration occurs in DLB, but not in Alzheimer's disease or most other dementia subtypes, offering a potential system for a biological diagnostic marker. The primary aim of this study was to investigate the sensitivity and specificity, in the ante-mortem differentiation of probable DLB from other causes of dementia, of single photon emission computed tomography (SPECT) brain imaging with the ligand (123)I-2beta-carbometoxy-3beta-(4-iodophenyl)-N-(3-fluoropropyl) nortropane ((123)I-FP-CIT), which binds to the dopamine transporter (DAT) reuptake site. Diagnostic accuracy, positive and negative predictive values, and inter-reader agreement were the secondary endpoints and a subgroup of possible DLB patients was also included. We did a phase III study in which we used a (123)I-FP-CIT SPECT scan to assess 326 patients with clinical diagnoses of probable (n=94) or possible (n=57) DLB or non-DLB dementia (n=147) established by a consensus panel (in 28 patients no diagnosis could be made). Three readers, unaware of the clinical diagnosis, classified the images as normal or abnormal by visual inspection. The study had 90% power to detect the differences between our anticipated sensitivity (0.80) and specificity (0.85) targets and prespecified lower thresholds (sensitivity 0.65, specificity 0.73) using one-sided binomial tests with a significance level of alpha=0.025. Abnormal scans had a mean sensitivity of 77.7% for detecting clinical probable DLB, with specificity of 90.4% for excluding non-DLB dementia, which was predominantly due to Alzheimer's disease. A mean value of 85.7% was achieved for overall diagnostic accuracy, 82.4% for positive predictive value, and 87.5% for negative predictive value. Inter-reader agreement for rating scans as normal or abnormal was high (Cohen's kappa=0.87). The procedure was well tolerated with few adverse events. A revision of the International Consensus Criteria for DLB has recommended that low DAT uptake in the basal ganglia, as shown by SPECT or PET imaging, be a suggestive feature for diagnosis. Our findings confirm the high correlation between abnormal (low binding) DAT activity measured with (123)I-FP-CIT SPECT and a clinical diagnosis of probable DLB. The diagnostic accuracy is sufficiently high for this technique to be clinically useful in distinguishing DLB from Alzheimer's disease.
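
    The prespecified one-sided binomial tests against the lower thresholds (sensitivity 0.65, specificity 0.73) can be reproduced in form as shown below. The counts used are hypothetical values roughly consistent with the reported mean sensitivity and specificity, since per-reader counts are not given in this record.

```python
# One-sided exact binomial tests against prespecified lower thresholds
# (hypothetical counts approximating the reported means; not the trial's data).
from scipy.stats import binomtest

# Sensitivity: abnormal reads among probable-DLB patients, H0: p <= 0.65
sens = binomtest(k=73, n=94, p=0.65, alternative="greater")
# Specificity: normal reads among non-DLB dementia patients, H0: p <= 0.73
spec = binomtest(k=133, n=147, p=0.73, alternative="greater")
print(f"sensitivity one-sided p-value = {sens.pvalue:.4f}")
print(f"specificity one-sided p-value = {spec.pvalue:.5f}")
```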

  14. Cyclical absenteeism among private sector, public sector and self-employed workers.

    PubMed

    Pfeifer, Christian

    2013-03-01

    This research note analyzes differences in the number of absent working days and doctor visits and in their cyclicality between private sector, public sector and self-employed workers. For this purpose, I used large-scale German survey data for the years 1995 to 2007 to estimate random effects negative binomial (count data) models. The main findings are as follows. (i) Public sector workers have on average more absent working days than private sector and self-employed workers. Self-employed workers have fewer absent working days and doctor visits than dependent employed workers. (ii) The regional unemployment rate is on average negatively correlated with the number of absent working days among private and public sector workers as well as among self-employed men. The correlations between regional unemployment rate and doctor visits are only significantly negative among private sector workers. Copyright © 2012 John Wiley & Sons, Ltd.

  15. Protection from annual flooding is correlated with increased cholera prevalence in Bangladesh: a zero-inflated regression analysis.

    PubMed

    Carrel, Margaret; Voss, Paul; Streatfield, Peter K; Yunus, Mohammad; Emch, Michael

    2010-03-22

    Alteration of natural or historical aquatic flows can have unintended consequences for regions where waterborne diseases are endemic and where the epidemiologic implications of such change are poorly understood. The implementation of flood protection measures for a portion of an intensely monitored population in Matlab, Bangladesh, allows us to examine whether cholera outcomes respond positively or negatively to measures designed to control river flooding. Using a zero inflated negative binomial model, we examine how selected covariates can simultaneously account for household clusters reporting no cholera from those with positive counts as well as distinguishing residential areas with low counts from areas with high cholera counts. Our goal is to examine how residence within or outside a flood protected area interacts with the probability of cholera presence and the effect of flood protection on the magnitude of cholera prevalence. In Matlab, living in a household that is protected from annual monsoon flooding appears to have no significant effect on whether the household experiences cholera, net of other covariates. However, counter-intuitively, among households where cholera is reported, living within the flood protected region significantly increases the number of cholera cases. The construction of dams or other water impoundment strategies for economic or social motives can have profound and unanticipated consequences for waterborne disease. Our results indicate that the construction of a flood control structure in rural Bangladesh is correlated with an increase in cholera cases for residents protected from annual monsoon flooding. Such a finding requires attention from both the health community and from governments and non-governmental organizations involved in ongoing water management schemes.

  16. Adjusted Wald Confidence Interval for a Difference of Binomial Proportions Based on Paired Data

    ERIC Educational Resources Information Center

    Bonett, Douglas G.; Price, Robert M.

    2012-01-01

    Adjusted Wald intervals for binomial proportions in one-sample and two-sample designs have been shown to perform about as well as the best available methods. The adjusted Wald intervals are easy to compute and have been incorporated into introductory statistics courses. An adjusted Wald interval for paired binomial proportions is proposed here and…
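
    For the one-sample case mentioned here, the adjusted Wald (Agresti-Coull) interval simply recentres the estimate before applying the usual Wald formula; a short sketch follows. The paired-proportions extension proposed in the article is not reproduced, only the basic adjustment it builds on.

```python
# Adjusted Wald (Agresti-Coull) interval for a single binomial proportion.
import numpy as np
from scipy.stats import norm

def adjusted_wald(successes, n, conf=0.95):
    z = norm.ppf(1 - (1 - conf) / 2)
    n_adj = n + z**2
    p_adj = (successes + z**2 / 2) / n_adj          # recentred point estimate
    half = z * np.sqrt(p_adj * (1 - p_adj) / n_adj)
    return max(0.0, p_adj - half), min(1.0, p_adj + half)

print(adjusted_wald(18, 20))   # e.g. 18 successes out of 20 trials
```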

  17. On the p, q-binomial distribution and the Ising model

    NASA Astrophysics Data System (ADS)

    Lundow, P. H.; Rosengren, A.

    2010-08-01

    We employ p, q-binomial coefficients, a generalisation of the binomial coefficients, to describe the magnetisation distributions of the Ising model. For the complete graph this distribution corresponds exactly to the limit case p = q. We apply our investigation to the simple d-dimensional lattices for d = 1, 2, 3, 4, 5 and fit p, q-binomial distributions to our data, some of which are exact but most are sampled. For d = 1 and d = 5, the magnetisation distributions are remarkably well-fitted by p,q-binomial distributions. For d = 4 we are only slightly less successful, while for d = 2, 3 we see some deviations (with exceptions!) between the p, q-binomial and the Ising distribution. However, at certain temperatures near T c the statistical moments of the fitted distribution agree with the moments of the sampled data within the precision of sampling. We begin the paper by giving results of the behaviour of the p, q-distribution and its moment growth exponents given a certain parameterisation of p, q. Since the moment exponents are known for the Ising model (or at least approximately for d = 3) we can predict how p, q should behave and compare this to our measured p, q. The results speak in favour of the p, q-binomial distribution's correctness regarding its general behaviour in comparison to the Ising model. The full extent to which they correctly model the Ising distribution, however, is not settled.

  18. [Hepatitis B case grouping serological study among six chinese families in Almeria, Spain].

    PubMed

    Barroso García, Pilar; Lucerna Méndez, M Angeles; Adrián Monforte, Estrella; Parrón Carreño, Tesifón

    2004-01-01

    Following the detection of two members of 6 Chinese families who had tested positive for the hepatitis B virus, a study of those living in these families was begun to determine the spread of the infection within the family environment of the cases detected. Descriptive study. Population under study: 24 members of six Chinese families. Age, sex, serological diagnosis, risk factors, healthcare-related attitude. Clinical records, serological data, epidemiological survey and immunization cards. A family-centred approach was employed, using the genogram. A binomial distribution was used to calculate the probability of occurrence of the process under study. A total of 14 males (58.3%) and 10 females (41.7%) ranging from 1 to 54 years of age were studied. The age group having the largest number of subjects studied was the age 21-30 group (37.5%). Twelve chronic hepatitis B infections were recorded (50%). No relationship was found to exist with the risk factors studied in the epidemiological survey conducted. The probability of this number of chronic hepatitis cases occurring was 0.066 x 10(-6). It was concluded that the prevalence of infection found was probably due to intra-family transmission. Given the low probability of occurrence of a process of this type, the case grouping found is considered to be high.
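
    The very small probability quoted for the observed case grouping follows from a binomial tail calculation: given the number of household members and some assumed background probability of chronic infection, one computes the chance of seeing 12 or more carriers among 24. The background prevalence used below is a hypothetical placeholder, since the value assumed by the authors is not given in this record.

```python
# Binomial tail probability for the observed cluster of chronic infections
# (background prevalence p0 is a hypothetical placeholder, not the paper's value).
from scipy.stats import binom

n_members = 24
n_chronic = 12
p0 = 0.08                                            # assumed background probability of chronic HBV infection
p_cluster = binom.sf(n_chronic - 1, n_members, p0)   # P(X >= 12)
print(f"P(at least {n_chronic} chronic infections among {n_members}) = {p_cluster:.2e}")
```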

  19. Tobit analysis of vehicle accident rates on interstate highways.

    PubMed

    Anastasopoulos, Panagiotis Ch; Tarko, Andrew P; Mannering, Fred L

    2008-03-01

    There has been an abundance of research that has used Poisson models and its variants (negative binomial and zero-inflated models) to improve our understanding of the factors that affect accident frequencies on roadway segments. This study explores the application of an alternate method, tobit regression, by viewing vehicle accident rates directly (instead of frequencies) as a continuous variable that is left-censored at zero. Using data from vehicle accidents on Indiana interstates, the estimation results show that many factors relating to pavement condition, roadway geometrics and traffic characteristics significantly affect vehicle accident rates.
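
    A tobit model for accident rates left-censored at zero has a simple likelihood: censored observations contribute the probability mass at or below zero, while uncensored ones contribute the normal density of the observed rate. The sketch below codes that likelihood directly with a single hypothetical covariate; it is not the paper's model specification.

```python
# Minimal sketch: maximum-likelihood tobit (left-censored at zero) regression
# with one hypothetical covariate; not the paper's model specification.
import numpy as np
from scipy import stats, optimize

rng = np.random.default_rng(2)
n = 300
x = rng.normal(size=n)
latent = 0.5 + 0.8 * x + rng.normal(scale=1.0, size=n)
y = np.maximum(latent, 0.0)            # observed accident rate, censored at zero

def neg_loglik(theta):
    b0, b1, log_sigma = theta
    sigma = np.exp(log_sigma)
    xb = b0 + b1 * x
    censored = y <= 0.0
    ll = np.where(
        censored,
        stats.norm.logcdf(-xb / sigma),                  # P(latent <= 0)
        stats.norm.logpdf((y - xb) / sigma) - log_sigma  # density of the observed rate
    )
    return -np.sum(ll)

res = optimize.minimize(neg_loglik, x0=[0.0, 0.0, 0.0], method="BFGS")
print("beta0, beta1, sigma =", res.x[0], res.x[1], np.exp(res.x[2]))
```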

  20. Population Characteristics and the Nature of Egg Shells of two Phthirapteran Species Parasitizing Indian Cattle Egrets

    PubMed Central

    Ahmad, Aftab; Khan, Vikram; Badola, Smita; Arya, Gaurav; Bansal, Nayanci; Saxena, A. K.

    2010-01-01

    The prevalence, intensities of infestation, range of infestation and population composition of two phthirapteran species, Ardeicola expallidus Blagoveshtchensky (Phthiraptera: Philopteridae) and Ciconiphilus decimfasciatus Boisduval and Lacordaire (Menoponidae) on seventy cattle egrets were recorded during August 2004 to March 2005, in India. The frequency distribution patterns of both the species were skewed but did not correspond to the negative binomial model. The oviposition sites, egg laying patterns and the nature of the eggs of the two species were markedly different. PMID:21067416

  1. The surprising benefit of passive-aggressive behaviour at Christmas parties: being crowned king of the crackers.

    PubMed

    Huang, B Emma; Clifford, David; Lê Cao, Kim-Anh

    2014-12-11

    To test the effects of technique and attitude in pulling Christmas crackers. A binomial trial conducted at a Christmas-in-July dinner party involving five anonymous dinner guests, including two of the authors. Number of wins achieved by different strategies, with a win defined as securing the larger portion of the cracker. The previously "guaranteed" strategy for victory, employing a downwards angle towards the puller, failed to differentiate itself from random chance (win rate, 6/15; probability of winning, 0.40; 95% CI, 0.15-0.65). A novel passive-aggressive strategy, in which one individual just holds on without pulling, provided a significant advantage (win rate, 11/12; probability of winning, 0.92; 95% CI, 0.76-1.00). The passive-aggressive strategy of failing to pull has a high rate of success at winning Christmas crackers; however, excessive adoption of this approach will result in a complete failure, with no winners at all.
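
    Win rates such as 11/12 and their binomial confidence intervals can be computed directly; the sketch below shows an exact (Clopper-Pearson) and a Wilson interval, so the numbers will not necessarily match the interval method chosen by the authors.

```python
# Confidence intervals for the passive-aggressive cracker strategy's win rate
# (illustrative; the paper's interval method may differ).
from scipy.stats import binomtest

wins, pulls = 11, 12
result = binomtest(wins, pulls)
print(f"win rate = {wins / pulls:.2f}")
print(result.proportion_ci(confidence_level=0.95, method="exact"))   # Clopper-Pearson
print(result.proportion_ci(confidence_level=0.95, method="wilson"))
```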

  2. Derivation of the expressions for γ50 and D50 for different individual TCP and NTCP models

    NASA Astrophysics Data System (ADS)

    Stavreva, N.; Stavrev, P.; Warkentin, B.; Fallone, B. G.

    2002-10-01

    This paper presents a complete set of formulae for the position (D50) and the normalized slope (γ50) of the dose-response relationship based on the most commonly used radiobiological models for tumours as well as for normal tissues. The functional subunit response models (critical element and critical volume) are used in the derivation of the formulae for the normal tissue. Binomial statistics are used to describe the tumour control probability, the functional subunit response as well as the normal tissue complication probability. The formulae are derived for the single hit and linear quadratic models of cell kill in terms of the number of fractions and dose per fraction. It is shown that the functional subunit models predict very steep, almost step-like, normal tissue individual dose-response relationships. Furthermore, the formulae for the normalized gradient depend on the cellular parameters α and β when written in terms of number of fractions, but not when written in terms of dose per fraction.

  3. Using exceedance probabilities to detect anomalies in routinely recorded animal health data, with particular reference to foot-and-mouth disease in Viet Nam.

    PubMed

    Richards, K K; Hazelton, M L; Stevenson, M A; Lockhart, C Y; Pinto, J; Nguyen, L

    2014-10-01

    The widespread availability of computer hardware and software for recording and storing disease event information means that, in theory, we have the necessary information to carry out detailed analyses of factors influencing the spatial distribution of disease in animal populations. However, the reliability of such analyses depends on data quality, with anomalous records having the potential to introduce significant bias and lead to inappropriate decision making. In this paper we promote the use of exceedance probabilities as a tool for detecting anomalies when applying hierarchical spatio-temporal models to animal health data. We illustrate this methodology through a case study of data on outbreaks of foot-and-mouth disease (FMD) in Viet Nam for the period 2006-2008. A flexible binomial logistic regression was employed to model the number of FMD-infected communes within each province of the country. Standard analyses of the residuals from this model failed to identify problems, but exceedance probabilities identified provinces in which the number of reported FMD outbreaks was unexpectedly low. This finding is interesting given that these provinces are on major cattle movement pathways through Viet Nam. Copyright © 2014 Elsevier Ltd. All rights reserved.
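
    Exceedance probabilities of this kind are typically computed from posterior or posterior predictive samples: for each province one asks how often the fitted model would produce a count at least as extreme as the one reported. The sketch below shows that generic calculation from an array of posterior predictive draws; the hierarchical spatio-temporal model fitting itself is assumed to have been done elsewhere, and the numbers are placeholders.

```python
# Generic exceedance-probability check given posterior predictive samples
# (placeholder numbers; the hierarchical model fitting is assumed done elsewhere).
import numpy as np

rng = np.random.default_rng(3)
# posterior predictive draws of infected-commune counts for one province (hypothetical)
y_rep = rng.poisson(lam=9.0, size=5000)
y_obs = 2                                     # reported count for that province

p_low = np.mean(y_rep <= y_obs)               # probability of a count this low or lower
print(f"Pr(Y_rep <= {y_obs}) = {p_low:.3f}")  # a very small value flags an unusually low report
```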

  4. Ageing under unequal circumstances: a cross-sectional analysis of the gender and socioeconomic patterning of functional limitations among the Southern European elderly.

    PubMed

    Serrano-Alarcón, Manuel; Perelman, Julian

    2017-10-03

    In a context of population ageing, it is a priority for planning and prevention to understand the socioeconomic (SE) patterning of functional limitations and its consequences on healthcare needs. This paper aims at measuring the gender and SE inequalities in functional limitations and their age of onset among the Southern European elderly; then, we evaluate how functional status is linked to formal and informal care use. We used Portuguese, Italian and Spanish data from the Survey of Health, Ageing and Retirement in Europe (SHARE) of 2011 (n = 9233). We constructed a summary functional limitation score as the sum of two variables: i) Activities of Daily Living (ADL) and ii) Instrumental Activities of Daily Living (IADL). We modelled the functional limitation as a function of age, gender, education, subjective poverty, employment and marital status using multinomial logit models. We then estimated how functional limitation affected informal and formal care demand using negative binomial and logistic models. Women were 2.3 percentage points (pp) more likely to experience severe functional limitation than men, and overcame a 10% probability threshold of suffering from severe limitation around 5 years earlier. Subjective poverty was associated with a 3.1 pp. higher probability of severe functional limitation. Having a university degree reduced the probability of severe functional limitation by 3.5 pp. as compared to none educational level. Discrepancies were wider for the oldest old: women aged 65-79 years old were 3.3 pp. more likely to suffer severe limitations, the excess risk increasing to 15.5 pp. among those older than 80. Similarly, educational inequalities in functional limitation were wider at older ages. Being severely limited was related with a 32.1 pp. higher probability of receiving any informal care, as compared to those moderately limited. Finally, those severely limited had on average 3.2 hospitalization days and 4.6 doctor consultations more, per year, than those without limitations. Functional limitations are unequally distributed, hitting women and the worse-off earlier and more severely, with consequences on care needs. Considering the burden on healthcare systems and families, public health policies should seek to reduce current inequalities in functional limitations.

  5. Statistical inference for time course RNA-Seq data using a negative binomial mixed-effect model.

    PubMed

    Sun, Xiaoxiao; Dalpiaz, David; Wu, Di; S Liu, Jun; Zhong, Wenxuan; Ma, Ping

    2016-08-26

    Accurate identification of differentially expressed (DE) genes in time course RNA-Seq data is crucial for understanding the dynamics of transcriptional regulatory network. However, most of the available methods treat gene expressions at different time points as replicates and test the significance of the mean expression difference between treatments or conditions irrespective of time. They thus fail to identify many DE genes with different profiles across time. In this article, we propose a negative binomial mixed-effect model (NBMM) to identify DE genes in time course RNA-Seq data. In the NBMM, mean gene expression is characterized by a fixed effect, and time dependency is described by random effects. The NBMM is very flexible and can be fitted to both unreplicated and replicated time course RNA-Seq data via a penalized likelihood method. By comparing gene expression profiles over time, we further classify the DE genes into two subtypes to enhance the understanding of expression dynamics. A significance test for detecting DE genes is derived using a Kullback-Leibler distance ratio. Additionally, a significance test for gene sets is developed using a gene set score. Simulation analysis shows that the NBMM outperforms currently available methods for detecting DE genes and gene sets. Moreover, our real data analysis of fruit fly developmental time course RNA-Seq data demonstrates the NBMM identifies biologically relevant genes which are well justified by gene ontology analysis. The proposed method is powerful and efficient to detect biologically relevant DE genes and gene sets in time course RNA-Seq data.

  6. Robust inference in the negative binomial regression model with an application to falls data.

    PubMed

    Aeberhard, William H; Cantoni, Eva; Heritier, Stephane

    2014-12-01

    A popular way to model overdispersed count data, such as the number of falls reported during intervention studies, is by means of the negative binomial (NB) distribution. Classical estimating methods are well known to be sensitive to model misspecifications, which in such intervention studies take the form of patients falling much more often than expected when the NB regression model is used. We extend in this article two approaches for building robust M-estimators of the regression parameters in the class of generalized linear models to the NB distribution. The first approach achieves robustness in the response by applying a bounded function on the Pearson residuals arising in the maximum likelihood estimating equations, while the second approach achieves robustness by bounding the unscaled deviance components. For both approaches, we explore different choices for the bounding functions. Through a unified notation, we show how close these approaches may actually be as long as the bounding functions are chosen and tuned appropriately, and provide the asymptotic distributions of the resulting estimators. Moreover, we introduce a robust weighted maximum likelihood estimator for the overdispersion parameter, specific to the NB distribution. Simulations under various settings show that redescending bounding functions yield estimates with smaller biases under contamination while keeping high efficiency at the assumed model, for both approaches. We present an application to a recent randomized controlled trial measuring the effectiveness of an exercise program at reducing the number of falls among people suffering from Parkinson's disease to illustrate the diagnostic use of such robust procedures and the need for reliable inference. © 2014, The International Biometric Society.

  7. Association between month of birth and melanoma risk: fact or fiction?

    PubMed

    Fiessler, Cornelia; Pfahlberg, Annette B; Keller, Andrea K; Radespiel-Tröger, Martin; Uter, Wolfgang; Gefeller, Olaf

    2017-04-01

    Evidence on the effect of ultraviolet radiation (UVR) exposure in infancy on melanoma risk in later life is scarce. Three recent studies suggest that people born in spring carry a higher melanoma risk. Our study aimed at verifying whether such a seasonal pattern of melanoma risk actually exists. Data from the population-based Cancer Registry Bavaria (CRB) on the birth months of 28 374 incident melanoma cases between 2002 and 2012 were analysed and compared with data from the Bavarian State Office for Statistics and Data Processing on the birth month distribution in the Bavarian population. Crude and adjusted analyses using negative binomial regression models were performed in the total study group and supplemented by several subgroup analyses. In the crude analysis, the birth months March-May were over-represented among melanoma cases. Negative binomial regression models adjusted only for sex and birth year revealed a seasonal association between melanoma risk and birth month with 13-21% higher relative incidence rates for March, April and May compared with the reference December. However, after additionally adjusting for the birth month distribution of the Bavarian population, these risk estimates decreased markedly and no association with the birth month was observed any more. Similar results emerged in all subgroup analyses. Our large registry-based study provides no evidence that people born in spring carry a higher risk for developing melanoma in later life and thus lends no support to the hypothesis of higher UVR susceptibility during the first months of life. © The Author 2016; all rights reserved. Published by Oxford University Press on behalf of the International Epidemiological Association

  8. Dental Caries and Enamel Defects in Very Low Birth Weight Adolescents

    PubMed Central

    Nelson, S.; Albert, J.M.; Lombardi, G.; Wishnek, S.; Asaad, G.; Kirchner, H.L.; Singer, L.T.

    2011-01-01

    Objectives The purpose of this study was to examine developmental enamel defects and dental caries in very low birth weight adolescents with high risk (HR-VLBW) and low risk (LR-VLBW) compared to full-term (term) adolescents. Methods The sample consisted of 224 subjects (80 HR-VLBW, 59 LR-VLBW, 85 term adolescents) recruited from an ongoing longitudinal study. Sociodemographic and medical information was available from birth. Dental examination of the adolescent at the 14-year visit included: enamel defects (opacity and hypoplasia); decayed, missing, filled teeth of incisors and molars (DMFT-IM) and of overall permanent teeth (DMFT); Simplified Oral Hygiene Index for debris/calculus on teeth, and sealant presence. A caregiver questionnaire completed simultaneously assessed dental behavior, access, insurance status and prevention factors. Hierarchical analysis utilized the zero-inflated negative binomial model and zero-inflated Poisson model. Results The zero-inflated negative binomial model controlling for sociodemographic variables indicated that the LR-VLBW group had an estimated 75% increase (p < 0.05) in number of demarcated opacities in the incisors and first molar teeth compared to the term group. Hierarchical modeling indicated that demarcated opacities were a significant predictor of DMFT-IM after control for relevant covariates. The term adolescents had significantly increased DMFT-IM and DMFT scores compared to the LR-VLBW adolescents. Conclusion LR-VLBW was a significant risk factor for increased enamel defects in the permanent incisors and first molars. Term children had increased caries compared to the LR-VLBW group. The effect of birth group and enamel defects on caries has to be investigated longitudinally from birth. PMID:20975268

  9. Hopelessness as a Predictor of Suicide Ideation in Depressed Male and Female Adolescent Youth.

    PubMed

    Wolfe, Kristin L; Nakonezny, Paul A; Owen, Victoria J; Rial, Katherine V; Moorehead, Alexandra P; Kennard, Beth D; Emslie, Graham J

    2017-12-21

    We examined hopelessness as a predictor of suicide ideation in depressed youth after acute medication treatment. A total of 158 depressed adolescents were administered the Children's Depression Rating Scale-Revised (CDRS-R) and Columbia Suicide Severity Rating Scale (C-SSRS) as part of a larger battery at baseline and at weekly visits across 6 weeks of acute fluoxetine treatment. The Beck Hopelessness Scale (BHS) was administered at baseline and week 6. A negative binomial regression model via a generalized estimating equation analysis of repeated measures was used to estimate suicide ideation over the 6 weeks of acute treatment from baseline measure of hopelessness. Depression severity and gender were included as covariates in the model. The negative binomial analysis was also conducted separately for the sample of males and females (in a gender-stratified analysis). Mean CDRS-R total scores were 60.30 ± 8.93 at baseline and 34.65 ± 10.41 at week 6. Mean baseline and week 6 BHS scores were 9.57 ± 5.51 and 5.59 ± 5.38, respectively. Per the C-SSRS, 43.04% and 83.54% reported having no suicide ideation at baseline and at week 6, respectively. The analyses revealed that baseline hopelessness was positively related to suicide ideation over treatment (p = .0027), independent of changes in depression severity. This significant finding persisted only for females (p = .0024). These results indicate the importance of early identification of hopelessness. © 2017 The American Association of Suicidology.

  10. Prediction of vehicle crashes by drivers' characteristics and past traffic violations in Korea using a zero-inflated negative binomial model.

    PubMed

    Kim, Dae-Hwan; Ramjan, Lucie M; Mak, Kwok-Kei

    2016-01-01

    Traffic safety is a significant public health challenge, and vehicle crashes account for the majority of injuries. This study aims to identify whether drivers' characteristics and past traffic violations may predict vehicle crashes in Korea. A total of 500,000 drivers were randomly selected from the 11.6 million driver records of the Ministry of Land, Transport and Maritime Affairs in Korea. Records of traffic crashes were obtained from the archives of the Korea Insurance Development Institute. After matching the past violation history for the period 2004-2005 with the number of crashes in year 2006, a total of 488,139 observations were used for the analysis. Zero-inflated negative binomial model was used to determine the incident risk ratio (IRR) of vehicle crashes by past violations of individual drivers. The included covariates were driver's age, gender, district of residence, vehicle choice, and driving experience. Drivers violating (1) a hit-and-run or drunk driving regulation at least once and (2) a signal, central line, or speed regulation more than once had a higher risk of a vehicle crash with respective IRRs of 1.06 and 1.15. Furthermore, female gender, a younger age, fewer years of driving experience, and middle-sized vehicles were all significantly associated with a higher likelihood of vehicle crashes. Drivers' demographic characteristics and past traffic violations could predict vehicle crashes in Korea. Greater resources should be assigned to the provision of traffic safety education programs for the high-risk driver groups.
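
    For readers unfamiliar with how such a model is typically fitted, the fragment below sketches a zero-inflated negative binomial fit and the conversion of count-model coefficients to incident risk ratios using statsmodels. It is purely schematic: the column names, simulated values and coefficients are assumptions for illustration, not the study's registry variables.

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.discrete.count_model import ZeroInflatedNegativeBinomialP

rng = np.random.default_rng(42)
n = 5000
df = pd.DataFrame({
    "hit_and_run":   rng.binomial(1, 0.05, n),   # >=1 hit-and-run / drunk-driving violation
    "repeat_signal": rng.binomial(1, 0.20, n),   # >1 signal / centre-line / speed violation
    "age":           rng.integers(18, 80, n),
})
mu = np.exp(-1.5 + 0.06 * df["hit_and_run"] + 0.14 * df["repeat_signal"])
# overdispersed counts with some structural zeros mixed in
df["crashes"] = rng.negative_binomial(2.0, 2.0 / (2.0 + mu)) * rng.binomial(1, 0.8, n)

X = sm.add_constant(df[["hit_and_run", "repeat_signal", "age"]])
model = ZeroInflatedNegativeBinomialP(df["crashes"], X, p=2)   # constant-only inflation part
res = model.fit(maxiter=500, disp=False)

# exponentiated count-part coefficients are read as incident risk ratios (IRR)
count_terms = [name for name in res.params.index
               if not name.startswith("inflate_") and name != "alpha"]
print(np.exp(res.params[count_terms]))
```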

  11. The Effect of Sharrows, Painted Bicycle Lanes and Physically Protected Paths on the Severity of Bicycle Injuries Caused by Motor Vehicles.

    PubMed

    Wall, Stephen P; Lee, David C; Frangos, Spiros G; Sethi, Monica; Heyer, Jessica H; Ayoung-Chee, Patricia; DiMaggio, Charles J

    2016-01-01

    We conducted individual and ecologic analyses of prospectively collected data from 839 injured bicyclists who collided with motorized vehicles and presented to Bellevue Hospital, an urban Level-1 trauma center in New York City, from December 2008 to August 2014. Variables included demographics, scene information, rider behaviors, bicycle route availability, and whether the collision occurred before the road segment was converted to a bicycle route. We used negative binomial modeling to assess the risk of injury occurrence following bicycle path or lane implementation. We dichotomized U.S. National Trauma Data Bank Injury Severity Scores (ISS) into none/mild (0-8) versus moderate, severe, or critical (>8) and used adjusted multivariable logistic regression to model the association of ISS with collision proximity to sharrows (i.e., bicycle lanes designated for sharing with cars), painted bicycle lanes, or physically protected paths. Negative binomial modeling of monthly counts, while adjusting for pedestrian activity, revealed that physically protected paths were associated with 23% fewer injuries. Painted bicycle lanes reduced injury risk by nearly 90% (IDR 0.09, 95% CI 0.02-0.33). Holding all else equal, compared to no bicycle route, a bicycle injury near sharrows was nearly twice as likely to be moderate, severe, or critical (adjusted odds ratio 1.94; 95% confidence interval (CI) 0.91-4.15). Painted bicycle lanes and physically protected paths were 1.52 (95% CI 0.85-2.71) and 1.66 (95% CI 0.85-3.22) times as likely to be associated with more than mild injury, respectively.

  12. Generalized seasonal autoregressive integrated moving average models for count data with application to malaria time series with low case numbers.

    PubMed

    Briët, Olivier J T; Amerasinghe, Priyanie H; Vounatsou, Penelope

    2013-01-01

    With the renewed drive towards malaria elimination, there is a need for improved surveillance tools. While time series analysis is an important tool for surveillance, prediction and for measuring interventions' impact, approximations by commonly used Gaussian methods are prone to inaccuracies when case counts are low. Therefore, statistical methods appropriate for count data are required, especially during "consolidation" and "pre-elimination" phases. Generalized autoregressive moving average (GARMA) models were extended to generalized seasonal autoregressive integrated moving average (GSARIMA) models for parsimonious observation-driven modelling of non Gaussian, non stationary and/or seasonal time series of count data. The models were applied to monthly malaria case time series in a district in Sri Lanka, where malaria has decreased dramatically in recent years. The malaria series showed long-term changes in the mean, unstable variance and seasonality. After fitting negative-binomial Bayesian models, both a GSARIMA and a GARIMA deterministic seasonality model were selected based on different criteria. Posterior predictive distributions indicated that negative-binomial models provided better predictions than Gaussian models, especially when counts were low. The G(S)ARIMA models were able to capture the autocorrelation in the series. G(S)ARIMA models may be particularly useful in the drive towards malaria elimination, since episode count series are often seasonal and non-stationary, especially when control is increased. Although building and fitting GSARIMA models is laborious, they may provide more realistic prediction distributions than do Gaussian methods and may be more suitable when counts are low.

  13. Dispersion models and sampling of cacao mirid bug Sahlbergella singularis (Hemiptera: Miridae) on Theobroma Cacao in southern Cameroon.

    PubMed

    Bisseleua, D H B; Vidal, Stefan

    2011-02-01

    The spatio-temporal distribution of Sahlbergella singularis Haglung, a major pest of cacao trees (Theobroma cacao) (Malvaceae), was studied for 2 yr in traditional cacao forest gardens in the humid forest area of southern Cameroon. The first objective was to analyze the dispersion of this insect on cacao trees. The second objective was to develop sampling plans based on fixed levels of precision for estimating S. singularis populations. The following models were used to analyze the data: Taylor's power law, Iwao's patchiness regression, the Nachman model, and the negative binomial distribution. Our results document that Taylor's power law was a better fit for the data than the Iwao and Nachman models. Taylor's b and Iwao's β were both significantly >1, indicating that S. singularis aggregated on specific trees. This result was further supported by the calculated common k of 1.75444. Iwao's α was significantly <0, indicating that the basic distribution component of S. singularis was the individual insect. Comparison of negative binomial (NBD) and Nachman models indicated that the NBD model was appropriate for studying S. singularis distribution. Optimal sample sizes for fixed precision levels of 0.10, 0.15, and 0.25 were estimated with Taylor's regression coefficients. Required sample sizes increased dramatically with increasing levels of precision. This is the first study on S. singularis dispersion in cacao plantations. Sampling plans, presented here, should be a tool for research on population dynamics and pest management decisions of mirid bugs on cacao. © 2011 Entomological Society of America

  14. On performance of parametric and distribution-free models for zero-inflated and over-dispersed count responses.

    PubMed

    Tang, Wan; Lu, Naiji; Chen, Tian; Wang, Wenjuan; Gunzler, Douglas David; Han, Yu; Tu, Xin M

    2015-10-30

    Zero-inflated Poisson (ZIP) and negative binomial (ZINB) models are widely used to model zero-inflated count responses. These models extend the Poisson and negative binomial (NB) to address excessive zeros in the count response. By adding a degenerate distribution centered at 0 and interpreting it as describing a non-risk group in the population, the ZIP (ZINB) models a two-component population mixture. As in applications of Poisson and NB, the key difference between ZIP and ZINB is the allowance for overdispersion by the ZINB in its NB component in modeling the count response for the at-risk group. Overdispersion arising in practice too often does not follow the NB, and applications of ZINB to such data yield invalid inference. If sources of overdispersion are known, other parametric models may be used to directly model the overdispersion. Such models too are subject to assumed distributions. Further, this approach may not be applicable if information about the sources of overdispersion is unavailable. In this paper, we propose a distribution-free alternative and compare its performance with these popular parametric models as well as a moment-based approach proposed by Yu et al. [Statistics in Medicine 2013; 32: 2390-2405]. Like the generalized estimating equations, the proposed approach requires no elaborate distribution assumptions. Compared with the approach of Yu et al., it is more robust to overdispersed zero-inflated responses. We illustrate our approach with both simulated and real study data. Copyright © 2015 John Wiley & Sons, Ltd.

  15. Generalized Seasonal Autoregressive Integrated Moving Average Models for Count Data with Application to Malaria Time Series with Low Case Numbers

    PubMed Central

    Briët, Olivier J. T.; Amerasinghe, Priyanie H.; Vounatsou, Penelope

    2013-01-01

    Introduction With the renewed drive towards malaria elimination, there is a need for improved surveillance tools. While time series analysis is an important tool for surveillance, prediction and for measuring interventions’ impact, approximations by commonly used Gaussian methods are prone to inaccuracies when case counts are low. Therefore, statistical methods appropriate for count data are required, especially during “consolidation” and “pre-elimination” phases. Methods Generalized autoregressive moving average (GARMA) models were extended to generalized seasonal autoregressive integrated moving average (GSARIMA) models for parsimonious observation-driven modelling of non Gaussian, non stationary and/or seasonal time series of count data. The models were applied to monthly malaria case time series in a district in Sri Lanka, where malaria has decreased dramatically in recent years. Results The malaria series showed long-term changes in the mean, unstable variance and seasonality. After fitting negative-binomial Bayesian models, both a GSARIMA and a GARIMA deterministic seasonality model were selected based on different criteria. Posterior predictive distributions indicated that negative-binomial models provided better predictions than Gaussian models, especially when counts were low. The G(S)ARIMA models were able to capture the autocorrelation in the series. Conclusions G(S)ARIMA models may be particularly useful in the drive towards malaria elimination, since episode count series are often seasonal and non-stationary, especially when control is increased. Although building and fitting GSARIMA models is laborious, they may provide more realistic prediction distributions than do Gaussian methods and may be more suitable when counts are low. PMID:23785448

  16. Individual-level exposure to disaster, neighborhood environmental characteristics, and their independent and combined associations with depressive symptoms in women.

    PubMed

    Gaston, Symielle A; Volaufova, Julia; Peters, Edward S; Ferguson, Tekeda F; Robinson, William T; Nugent, Nicole; Trapido, Edward J; Rung, Ariane L

    2017-09-01

    The severity of the stress response to experiencing disaster depends on individual exposure and background stress prior to the event. To date, there is limited research on the interaction between neighborhood environmental stress and experiencing an oil spill, and their effects on depression. The objective of the current study was to assess if the association between exposure to the Deepwater Horizon Oil Spill (DHOS) and depressive symptoms varied by neighborhood characteristics. US Census data (2010) and longitudinal data collected in two waves (2012-2014 and 2014-2016) from female residents [N = 889 (Wave I), 737 (Wave II)] of an area highly affected by the DHOS were analyzed. Multilevel and individual-level negative binomial regressions were performed to estimate associations with depressive symptoms in both waves. An interaction term was included to estimate effect modification of the association between DHOS exposure and depressive symptoms by neighborhood characteristics. Generalized estimating equations were applied to the negative binomial regression testing longitudinal associations. Census tract-level neighborhood characteristics were not associated with depressive symptoms. Exposure to the DHOS and neighborhood physical disorder were associated with depressive symptoms cross-sectionally. There was no evidence of effect modification; however, physical/environmental exposure to the DHOS was associated with increased depressive symptoms only among women living in areas with physical disorder. Exposure to the DHOS remained associated with depressive symptoms over time. Findings support the enduring consequences of disaster exposure on depressive symptoms in women and identify potential targets for post-disaster intervention based on residential characteristics.

  17. Linking parasite populations in hosts to parasite populations in space through Taylor's law and the negative binomial distribution

    PubMed Central

    Poulin, Robert; Lagrue, Clément

    2017-01-01

    The spatial distribution of individuals of any species is a basic concern of ecology. The spatial distribution of parasites matters to control and conservation of parasites that affect human and nonhuman populations. This paper develops a quantitative theory to predict the spatial distribution of parasites based on the distribution of parasites in hosts and the spatial distribution of hosts. Four models are tested against observations of metazoan hosts and their parasites in littoral zones of four lakes in Otago, New Zealand. These models differ in two dichotomous assumptions, constituting a 2 × 2 theoretical design. One assumption specifies whether the variance function of the number of parasites per host individual is described by Taylor's law (TL) or the negative binomial distribution (NBD). The other assumption specifies whether the numbers of parasite individuals within each host in a square meter of habitat are independent or perfectly correlated among host individuals. We find empirically that the variance–mean relationship of the numbers of parasites per square meter is very well described by TL but is not well described by NBD. Two models that posit perfect correlation of the parasite loads of hosts in a square meter of habitat approximate observations much better than two models that posit independence of parasite loads of hosts in a square meter, regardless of whether the variance–mean relationship of parasites per host individual obeys TL or NBD. We infer that high local interhost correlations in parasite load strongly influence the spatial distribution of parasites. Local hotspots could influence control and conservation of parasites. PMID:27994156
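
    As a pointer to what fitting Taylor's law (TL) involves in practice, the short sketch below simulates negative-binomial parasite loads for a set of hypothetical quadrats and recovers the TL exponent by regressing log variance on log mean. The quadrat counts, dispersion parameter and group sizes are invented for illustration and are not the Otago lake data.

```python
import numpy as np

# Taylor's law: var = a * mean**b across groups, i.e. a straight line on log-log
# axes; the NBD instead implies var = mean + mean**2 / k with a common k.
rng = np.random.default_rng(1)
group_means, group_vars = [], []
for _ in range(40):                       # 40 hypothetical one-square-metre quadrats
    m = rng.uniform(0.5, 50)              # quadrat-specific mean parasite load
    counts = rng.negative_binomial(2.0, 2.0 / (2.0 + m), size=30)   # 30 hosts per quadrat
    group_means.append(counts.mean())
    group_vars.append(counts.var(ddof=1))

log_m, log_v = np.log(group_means), np.log(group_vars)
b, log_a = np.polyfit(log_m, log_v, 1)    # slope = TL exponent b, intercept = log a
print(f"Taylor's law fit: a = {np.exp(log_a):.2f}, b = {b:.2f}")
```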

  18. Sickness absence and psychosocial job quality: an analysis from a longitudinal survey of working Australians, 2005-2012.

    PubMed

    Milner, Allison; Butterworth, Peter; Bentley, Rebecca; Kavanagh, Anne M; LaMontagne, Anthony D

    2015-05-15

    Sickness absence is associated with adverse health, organizational, and societal outcomes. Using data from a longitudinal cohort study of working Australians (the Household, Income and Labour Dynamics in Australia (HILDA) Survey), we examined the relationship between changes in individuals' overall psychosocial job quality and variation in sickness absence. The outcome variables were paid sickness absence (yes/no) and number of days of paid sickness absence in the past year (2005-2012). The main exposure variable was psychosocial job quality, measured using a psychosocial job quality index (levels of job control, demands and complexity, insecurity, and perceptions of unfair pay). Analysis was conducted using longitudinal fixed-effects logistic regression models and negative binomial regression models. There was a dose-response relationship between the number of psychosocial job stressors reported by an individual and the odds of paid sickness absence (1 adversity: odds ratio (OR) = 1.26, 95% confidence interval (CI): 1.09, 1.45 (P = 0.002); 2 adversities: OR = 1.28, 95% CI: 1.09, 1.51 (P = 0.002); ≥3 adversities: OR = 1.58, 95% CI: 1.29, 1.94 (P < 0.001)). The negative binomial regression models also indicated that respondents reported a greater number of days of sickness absence in response to worsening psychosocial job quality. These results suggest that workplace interventions aiming to improve the quality of work could help reduce sickness absence. © The Author 2015. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved.

  19. Binomial Baseball.

    ERIC Educational Resources Information Center

    Levin, Eugene M.

    1981-01-01

    Student access to programmable calculators and computer terminals, coupled with a familiarity with baseball, provides opportunities to enhance their understanding of the binomial distribution and other aspects of analysis. (MP)

  20. Injuries in children with extra physical education in primary schools.

    PubMed

    Rexen, Christina Trifonov; Andersen, Lars Bo; Ersbøll, Annette Kjær; Jespersen, Eva; Franz, Claudia; Wedderkopp, Niels

    2014-04-01

    (1) Examine the influence of extra physical education (EPE) on the number of musculoskeletal injuries in public schools accounting for organized sports participation (OSP) outside school. (2) Examine the major injury subgroup: growth-related overuse (GRO) through the overuse-related injury group. A longitudinal controlled school-based study among Danish public schools. At baseline, 1216 children aged 6.2-12.4 yr participated. Six schools (701 children) with EPE and four control schools (515 children) were followed up with weekly automated mobile phone text messages for information on musculoskeletal problems and OSP. Health care personnel diagnosed the children according to the World Health Organization's International Classification of Diseases, Tenth Revision. Data were analyzed using a two-part zero-inflated negative binomial (ZINB) regression model. School type had no influence on the odds of sustaining an injury but increased the probability of sustaining a higher injury count for children with injuries: total injuries by a factor of 1.29 (95% confidence interval [CI] = 1.07-1.56), overuse by a factor of 1.29 (95% CI = 1.06-1.55), and GRO by a factor of 1.38 (95% CI = 1.02-1.80). Weekly mean OSP decreased the odds of belonging to the group of children with no injuries, by a factor of 0.29 (95% CI = 0.14-0.58), 0.26 (95% CI = 0.14-0.48), and 0.17 (95% CI = 0.06-0.52) for total, overuse, and GRO, respectively. OSP also increased the probability of sustaining a higher injury count for children with injuries by a factor of 1.11 (95% CI = 1.02-1.22), 1.10 (95% CI = 1.00-1.22), and 1.14 (95% CI = 1.00-1.30), respectively. Children enrolled in EPE schools with high OSP have the highest odds of injury and a high probability of sustaining a higher injury count compared to their peers at schools with normal PE. Special attention should be paid to these children during compulsory PE.

  1. Modeling factors influencing the demand for emergency department services in Ontario: a comparison of methods.

    PubMed

    Moineddin, Rahim; Meaney, Christopher; Agha, Mohammad; Zagorski, Brandon; Glazier, Richard Henry

    2011-08-19

    Emergency departments are medical treatment facilities, designed to provide episodic care to patients suffering from acute injuries and illnesses as well as patients who are experiencing sporadic flare-ups of underlying chronic medical conditions which require immediate attention. Supply and demand for emergency department services varies across geographic regions and time. Some persons do not rely on the service at all whereas; others use the service on repeated occasions. Issues regarding increased wait times for services and crowding illustrate the need to investigate which factors are associated with increased frequency of emergency department utilization. The evidence from this study can help inform policy makers on the appropriate mix of supply and demand targeted health care policies necessary to ensure that patients receive appropriate health care delivery in an efficient and cost-effective manner. The purpose of this report is to assess those factors resulting in increased demand for emergency department services in Ontario. We assess how utilization rates vary according to the severity of patient presentation in the emergency department. We are specifically interested in the impact that access to primary care physicians has on the demand for emergency department services. Additionally, we wish to investigate these trends using a series of novel regression models for count outcomes which have yet to be employed in the domain of emergency medical research. Data regarding the frequency of emergency department visits for the respondents of Canadian Community Health Survey (CCHS) during our study interval (2003-2005) are obtained from the National Ambulatory Care Reporting System (NACRS). Patients' emergency department utilizations were linked with information from the Canadian Community Health Survey (CCHS) which provides individual level medical, socio-demographic, psychological and behavioral information for investigating predictors of increased emergency department utilization. Six different multiple regression models for count data were fitted to assess the influence of predictors on demand for emergency department services, including: Poisson, Negative Binomial, Zero-Inflated Poisson, Zero-Inflated Negative Binomial, Hurdle Poisson, and Hurdle Negative Binomial. Comparison of competing models was assessed by the Vuong test statistic. The CCHS cycle 2.1 respondents were a roughly equal mix of males (50.4%) and females (49.6%). The majority (86.2%) were young-middle aged adults between the ages of 20-64, living in predominantly urban environments (85.9%), with mid-high household incomes (92.2%) and well-educated, receiving at least a high-school diploma (84.1%). Many participants reported no chronic disease (51.9%), fell into a small number (0-5) of ambulatory diagnostic groups (62.3%), and perceived their health status as good/excellent (88.1%); however, were projected to have high Resource Utilization Band levels of health resource utilization (68.2%). These factors were largely stable for CCHS cycle 3.1 respondents. Factors influencing demand for emergency department services varied according to the severity of triage scores at initial presentation. 
For example, although a non-significant predictor of the odds of emergency department utilization in high severity cases, access to a primary care physician was a statistically significant predictor of the likelihood of emergency department utilization (OR: 0.69; 95% CI OR: 0.63-0.75) and the rate of emergency department utilization (RR: 0.57; 95% CI RR: 0.50-0.66) in low severity cases. Using a theoretically appropriate hurdle negative binomial regression model this unique study illustrates that access to a primary care physician is an important predictor of both the odds and rate of emergency department utilization in Ontario. Restructuring primary care services, with aims of increasing access to undersupplied populations may result in decreased emergency department utilization rates by approximately 43% for low severity triage level cases.
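
    The Vuong statistic mentioned above compares non-nested count models through their per-observation log-likelihoods. The sketch below shows the basic (uncorrected) version for Poisson versus negative binomial on simulated data using statsmodels; the data, the model pair and the omission of the AIC/BIC correction are all assumptions for illustration, not the paper's analysis of the CCHS/NACRS data.

```python
import numpy as np
import statsmodels.api as sm
from statsmodels.discrete.discrete_model import Poisson, NegativeBinomial

rng = np.random.default_rng(0)
n = 2000
X = sm.add_constant(rng.normal(size=(n, 2)))
mu = np.exp(X @ np.array([0.2, 0.5, -0.3]))
y = rng.negative_binomial(1.0, 1.0 / (1.0 + mu))      # overdispersed visit counts

pois_res = Poisson(y, X).fit(disp=False)
nb_res = NegativeBinomial(y, X).fit(disp=False)

# per-observation log-likelihood contributions under each fitted model
ll_pois = pois_res.model.loglikeobs(pois_res.params)
ll_nb = nb_res.model.loglikeobs(nb_res.params)
m = ll_nb - ll_pois
vuong_z = np.sqrt(n) * m.mean() / m.std(ddof=1)
print(f"Vuong z = {vuong_z:.2f}  (large positive values favour the NB model)")
```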

  2. New Class of Quantum Error-Correcting Codes for a Bosonic Mode

    NASA Astrophysics Data System (ADS)

    Michael, Marios H.; Silveri, Matti; Brierley, R. T.; Albert, Victor V.; Salmilehto, Juha; Jiang, Liang; Girvin, S. M.

    2016-07-01

    We construct a new class of quantum error-correcting codes for a bosonic mode, which are advantageous for applications in quantum memories, communication, and scalable computation. These "binomial quantum codes" are formed from a finite superposition of Fock states weighted with binomial coefficients. The binomial codes can exactly correct errors that are polynomial up to a specific degree in bosonic creation and annihilation operators, including amplitude damping and displacement noise as well as boson addition and dephasing errors. For realistic continuous-time dissipative evolution, the codes can perform approximate quantum error correction to any given order in the time step between error detection measurements. We present an explicit approximate quantum error recovery operation based on projective measurements and unitary operations. The binomial codes are tailored for detecting boson loss and gain errors by means of measurements of the generalized number parity. We discuss optimization of the binomial codes and demonstrate that by relaxing the parity structure, codes with even lower unrecoverable error rates can be achieved. The binomial codes are related to existing two-mode bosonic codes, but offer the advantage of requiring only a single bosonic mode to correct amplitude damping as well as the ability to correct other errors. Our codes are similar in spirit to "cat codes" based on superpositions of the coherent states but offer several advantages such as smaller mean boson number, exact rather than approximate orthonormality of the code words, and an explicit unitary operation for repumping energy into the bosonic mode. The binomial quantum codes are realizable with current superconducting circuit technology, and they should prove useful in other quantum technologies, including bosonic quantum memories, photonic quantum communication, and optical-to-microwave up- and down-conversion.
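
    As a concrete reference point (a standard lowest-order member of this code family, quoted from general knowledge rather than from the abstract), the simplest binomial code protecting against a single boson-loss error uses the code words

```latex
|0_L\rangle = \frac{|0\rangle + |4\rangle}{\sqrt{2}}, \qquad |1_L\rangle = |2\rangle ,
```

    both of which have even photon-number parity and the same mean photon number (two), so a single loss event flips the parity and is flagged by the generalized number-parity measurement mentioned above.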

  3. Quantifying the safety effects of horizontal curves on two-way, two-lane rural roads.

    PubMed

    Gooch, Jeffrey P; Gayah, Vikash V; Donnell, Eric T

    2016-07-01

    The objective of this study is to quantify the safety performance of horizontal curves on two-way, two-lane rural roads relative to tangent segments. Past research is limited by small sample sizes, outdated statistical evaluation methods, and unreported standard errors. This study overcomes these drawbacks by using the propensity scores-potential outcomes framework. The impact of adjacent curves on horizontal curve safety is also explored using a cross-sectional regression model of only horizontal curves. The models estimated in the present study used eight years of crash data (2005-2012) obtained from over 10,000 miles of state-owned two-lane rural roads in Pennsylvania. These data included information on roadway geometry (e.g., horizontal curvature, lane width, and shoulder width), traffic volume, roadside hazard rating, and the presence of various low-cost safety countermeasures (e.g., centerline and shoulder rumble strips, curve and intersection warning pavement markings, and aggressive driving pavement dots). Crash prediction is performed by means of mixed effects negative binomial regression using the explanatory variables noted previously, as well as attributes of adjacent horizontal curves. The results indicate that both the presence of a horizontal curve and its degree of curvature must be considered when predicting the frequency of total crashes on horizontal curves. Both are associated with an increase in crash frequency, which is consistent with previous findings in the literature. Mixed effects negative binomial regression models for total crash frequency on horizontal curves indicate that the distance to adjacent curves is not statistically significant. However, the degree of curvature of adjacent curves in close proximity (within 0.75 miles) was found to be statistically significant and negatively correlated with crash frequency on the subject curve. This is logical, as drivers exiting a sharp curve are likely to be driving slower and with more awareness as they approach the next horizontal curve. Copyright © 2016 Elsevier Ltd. All rights reserved.

  4. Comparison of robustness to outliers between robust poisson models and log-binomial models when estimating relative risks for common binary outcomes: a simulation study.

    PubMed

    Chen, Wansu; Shi, Jiaxiao; Qian, Lei; Azen, Stanley P

    2014-06-26

    To estimate relative risks or risk ratios for common binary outcomes, the most popular model-based methods are the robust (also known as modified) Poisson and the log-binomial regression. Of the two methods, it is believed that the log-binomial regression yields more efficient estimators because it is maximum likelihood based, while the robust Poisson model may be less affected by outliers. Evidence to support the robustness of robust Poisson models in comparison with log-binomial models is very limited. In this study a simulation was conducted to evaluate the performance of the two methods in several scenarios where outliers existed. The findings indicate that for data coming from a population where the relationship between the outcome and the covariate was in a simple form (e.g. log-linear), the two models yielded comparable biases and mean square errors. However, if the true relationship contained a higher order term, the robust Poisson models consistently outperformed the log-binomial models even when the level of contamination is low. The robust Poisson models are more robust (or less sensitive) to outliers compared to the log-binomial models when estimating relative risks or risk ratios for common binary outcomes. Users should be aware of the limitations when choosing appropriate models to estimate relative risks or risk ratios.
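
    To make the comparison concrete, the sketch below fits both estimators to one simulated data set with statsmodels: a Poisson GLM with a robust (sandwich) covariance, and a binomial GLM with a log link. The data, the coefficients and even the link-class spelling (`sm.families.links.Log()` in recent statsmodels releases) are assumptions for illustration, not the simulation design of the paper.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(2)
n = 1000
x = rng.normal(size=n)
X = sm.add_constant(x)
p = np.minimum(0.9, np.exp(-1.5 + 0.3 * x))   # log-linear risk model for a common outcome
y = rng.binomial(1, p)

# robust ("modified") Poisson: Poisson GLM with a sandwich covariance estimator
rob_pois = sm.GLM(y, X, family=sm.families.Poisson()).fit(cov_type="HC0")

# log-binomial: binomial GLM with log link (can fail to converge when risks approach 1)
log_binom = sm.GLM(y, X, family=sm.families.Binomial(link=sm.families.links.Log())).fit()

print("RR per unit x, robust Poisson:", np.exp(rob_pois.params[1]))
print("RR per unit x, log-binomial:  ", np.exp(log_binom.params[1]))
```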

  5. The Context Matters: Outcome Probability and Expectation Mismatch Modulate the Feedback Negativity When Self-Evaluation of Response Correctness Is Possible

    PubMed Central

    Leue, Anja; Cano Rodilla, Carmen; Beauducel, André

    2015-01-01

    Individuals typically evaluate whether their performance and the obtained feedback match. Previous research has shown that feedback negativity (FN) depends on outcome probability and feedback valence. It is, however, less clear to what extent previous effects of outcome probability on FN depend on self-evaluations of response correctness. Therefore, we investigated the effects of outcome probability on FN amplitude in a simple go/no-go task that allowed for the self-evaluation of response correctness. We also investigated effects of performance incompatibility and feedback valence. In a sample of N = 22 participants, outcome probability was manipulated by means of precues, feedback valence by means of monetary feedback, and performance incompatibility by means of feedback that induced a match versus mismatch with individuals' performance. We found that the 100% outcome probability condition induced a more negative FN following no-loss than the 50% outcome probability condition. The FN following loss was more negative in the 50% compared to the 100% outcome probability condition. Performance-incompatible loss resulted in a more negative FN than performance-compatible loss. Our results indicate that the self-evaluation of the correctness of responses should be taken into account when the effects of outcome probability and expectation mismatch on FN are investigated. PMID:26783525

  6. The Context Matters: Outcome Probability and Expectation Mismatch Modulate the Feedback Negativity When Self-Evaluation of Response Correctness Is Possible.

    PubMed

    Leue, Anja; Cano Rodilla, Carmen; Beauducel, André

    2015-01-01

    Individuals typically evaluate whether their performance and the obtained feedback match. Previous research has shown that feedback negativity (FN) depends on outcome probability and feedback valence. It is, however, less clear to what extent previous effects of outcome probability on FN depend on self-evaluations of response correctness. Therefore, we investigated the effects of outcome probability on FN amplitude in a simple go/no-go task that allowed for the self-evaluation of response correctness. We also investigated effects of performance incompatibility and feedback valence. In a sample of N = 22 participants, outcome probability was manipulated by means of precues, feedback valence by means of monetary feedback, and performance incompatibility by means of feedback that induced a match versus mismatch with individuals' performance. We found that the 100% outcome probability condition induced a more negative FN following no-loss than the 50% outcome probability condition. The FN following loss was more negative in the 50% compared to the 100% outcome probability condition. Performance-incompatible loss resulted in a more negative FN than performance-compatible loss. Our results indicate that the self-evaluation of the correctness of responses should be taken into account when the effects of outcome probability and expectation mismatch on FN are investigated.

  7. Speech-discrimination scores modeled as a binomial variable.

    PubMed

    Thornton, A R; Raffin, M J

    1978-09-01

    Many studies have reported variability data for tests of speech discrimination, and the disparate results of these studies have not been given a simple explanation. Arguments over the relative merits of 25- vs 50-word tests have ignored the basic mathematical properties inherent in the use of percentage scores. The present study models performance on clinical tests of speech discrimination as a binomial variable. A binomial model was developed, and some of its characteristics were tested against data from 4120 scores obtained on the CID Auditory Test W-22. A table for determining significant deviations between scores was generated and compared to observed differences in half-list scores for the W-22 tests. Good agreement was found between predicted and observed values. Implications of the binomial characteristics of speech-discrimination scores are discussed.
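
    A rough sense of the binomial variability at issue can be had from a back-of-the-envelope check like the one below, which asks whether two percent-correct scores from word lists of a given length are farther apart than binomial sampling error would readily allow. This is a normal-approximation sketch with made-up scores, not the published critical-difference table.

```python
import math

def scores_plausibly_same(score1_pct, score2_pct, n_words=50, z_crit=1.96):
    """Two-proportion z-test treating each percent score as x/n_words items correct."""
    p1, p2 = score1_pct / 100.0, score2_pct / 100.0
    p_pool = (p1 + p2) / 2.0                       # equal list lengths assumed
    se = math.sqrt(2.0 * p_pool * (1.0 - p_pool) / n_words)
    return abs(p1 - p2) <= z_crit * se

print(scores_plausibly_same(80, 64, n_words=50))   # wide binomial spread at 50 items
print(scores_plausibly_same(80, 64, n_words=200))  # the same gap is significant at 200 items
```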

  8. Possibility and Challenges of Conversion of Current Virus Species Names to Linnaean Binomials

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Postler, Thomas S.; Clawson, Anna N.; Amarasinghe, Gaya K.

    Botanical, mycological, zoological, and prokaryotic species names follow the Linnaean format, consisting of an italicized Latinized binomen with a capitalized genus name and a lower case species epithet (e.g., Homo sapiens). Virus species names, however, do not follow a uniform format, and, even when binomial, are not Linnaean in style. In this thought exercise, we attempted to convert all currently official names of species included in the virus family Arenaviridae and the virus order Mononegavirales to Linnaean binomials, and to identify and address associated challenges and concerns. Surprisingly, this endeavor was not as complicated or time-consuming as even the authors of this article expected when conceiving the experiment. [Arenaviridae; binomials; ICTV; International Committee on Taxonomy of Viruses; Mononegavirales; virus nomenclature; virus taxonomy.]

  9. Mutational Analysis of the Adaptor Protein 2 Sigma Subunit (AP2S1) Gene: Search for Autosomal Dominant Hypocalcemia Type 3 (ADH3)

    PubMed Central

    Rogers, Angela; Nesbit, M. Andrew; Hannan, Fadil M.; Howles, Sarah A.; Gorvin, Caroline M.; Cranston, Treena; Allgrove, Jeremy; Bevan, John S.; Bano, Gul; Brain, Caroline; Datta, Vipan; Grossman, Ashley B.; Hodgson, Shirley V.; Izatt, Louise; Millar-Jones, Lynne; Pearce, Simon H.; Robertson, Lisa; Selby, Peter L.; Shine, Brian; Snape, Katie; Warner, Justin

    2014-01-01

    Context: Autosomal dominant hypocalcemia (ADH) types 1 and 2 are due to calcium-sensing receptor (CASR) and G-protein subunit-α11 (GNA11) gain-of-function mutations, respectively, whereas CASR and GNA11 loss-of-function mutations result in familial hypocalciuric hypercalcemia (FHH) types 1 and 2, respectively. Loss-of-function mutations of adaptor protein-2 sigma subunit (AP2σ2), encoded by AP2S1, cause FHH3, and we therefore sought gain-of-function AP2S1 mutations that may cause an additional form of ADH, which we designated ADH3. Objective: The objective of the study was to investigate the hypothesis that gain-of-function AP2S1 mutations may cause ADH3. Design: The sample size required for the detection of at least one mutation with a greater than 95% likelihood was determined by binomial probability analysis. Nineteen patients (including six familial cases) with hypocalcemia in association with low or normal serum PTH concentrations, consistent with ADH, but who did not have CASR or GNA11 mutations, were ascertained. Leukocyte DNA was used for sequence and copy number variation analysis of AP2S1. Results: Binomial probability analysis, using the assumption that AP2S1 mutations would occur in hypocalcemic patients at a prevalence of 20%, which is observed in FHH patients without CASR or GNA11 mutations, indicated that the likelihood of detecting at least one AP2S1 mutation was greater than 95% and greater than 98% in sample sizes of 14 and 19 hypocalcemic patients, respectively. AP2S1 mutations and copy number variations were not detected in the 19 hypocalcemic patients. Conclusion: The absence of AP2S1 abnormalities in hypocalcemic patients suggests that ADH3 may not occur or otherwise represents a rare hypocalcemic disorder. PMID:24708097
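
    The sample-size reasoning in this abstract is simply the binomial probability of observing at least one mutation when each patient independently carries one with probability 0.20:

```latex
P(\text{at least one mutation}) = 1 - (1 - 0.20)^{n}, \qquad
1 - 0.8^{14} \approx 0.956, \qquad 1 - 0.8^{19} \approx 0.986 ,
```

    which matches the quoted detection likelihoods of greater than 95% and greater than 98% for samples of 14 and 19 patients.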

  10. The role of vaspin as a predictor of coronary angiography result in SCAD (stable coronary artery disease) patients.

    PubMed

    Stančík, Matej; Ságová, Ivana; Kantorová, Ema; Mokáň, Marián

    2017-05-08

    The role of vaspin in the pathogenesis of stable coronary artery disease (SCAD) has been repeatedly addressed in clinical studies. However, from the point of view of clinical practice, the results of earlier studies are still inconclusive. The data of 106 SCAD patients who received coronary angiography and 85 coronary artery disease-free controls were collected and analysed. The patients were divided into subgroups according to their pre-test probability (PTP) and according to the result of coronary angiography. Fasting vaspin concentrations were compared between subgroups of SCAD patients and between the target group and controls. The effect of age and smoking on the result of coronary angiography was compared to the effect of vaspin using binomial regression. We did not find a significant difference in vaspin levels between the target group and controls. When the pre-test probability was not taken into account, we found no difference in vaspin in the target group between patients with and without significant coronary stenosis. In the subgroup of SCAD patients with PTP between 15% and 65%, those with significant coronary stenoses had a higher mean vaspin concentration (0.579 ± 0.898 ng/ml) than patients without significant stenoses (0.379 ± 0.732 ng/ml) (t = -2.595; p = 0.012; d = 0.658; 1-β = 0.850). Age, smoking status and vaspin significantly contributed to the HSCS prediction in the binomial regression model in patients with low PTP (OR: 1.1, 4.9, 8.7, respectively). According to our results, vaspin cannot be used as an independent marker for the presence of CAD in the general population. However, our results indicate that measuring vaspin in SCAD patients might be clinically useful in patients with PTP below 66%.

  11. Charged particle multiplicities in deep inelastic scattering at HERA

    NASA Astrophysics Data System (ADS)

    Aid, S.; Anderson, M.; Andreev, V.; Andrieu, B.; Appuhn, R.-D.; Babaev, A.; Bähr, J.; Bán, J.; Ban, Y.; Baranov, P.; Barrelet, E.; Barschke, R.; Bartel, W.; Barth, M.; Bassler, U.; Beck, H. P.; Behrend, H.-J.; Belousov, A.; Berger, Ch.; Bernardi, G.; Bertrand-Coremans, G.; Besançon, M.; Beyer, R.; Biddulph, P.; Bispham, P.; Bizot, J. C.; Blobel, V.; Borras, K.; Botterweck, F.; Boudry, V.; Braemer, A.; Braunschweig, W.; Brisson, V.; Bruel, P.; Bruncko, D.; Brune, C.; Buchholz, R.; Büngener, L.; Bürger, J.; Büsser, F. W.; Buniatian, A.; Burke, S.; Burton, M. J.; Calvet, D.; Campbell, A. J.; Carli, T.; Charlet, M.; Clarke, D.; Clegg, A. B.; Clerbaux, B.; Cocks, S.; Contreras, J. G.; Cormack, C.; Coughlan, J. A.; Courau, A.; Cousinou, M.-C.; Cozzika, G.; Criegee, L.; Cussans, D. G.; Cvach, J.; Dagoret, S.; Dainton, J. B.; Dau, W. D.; Daum, K.; David, M.; Davis, C. L.; Delcourt, B.; de Roeck, A.; de Wolf, E. A.; Dirkmann, M.; Dixon, P.; di Nezza, P.; Dlugosz, W.; Dollfus, C.; Dowell, J. D.; Dreis, H. B.; Droutskoi, A.; Dünger, O.; Duhm, H.; Ebert, J.; Ebert, T. R.; Eckerlin, G.; Efremenko, V.; Egli, S.; Eichler, R.; Eisele, F.; Eisenhandler, E.; Elsen, E.; Erdmann, M.; Erdmann, W.; Evrard, E.; Fahr, A. B.; Favart, L.; Fedotov, A.; Feeken, D.; Felst, R.; Feltesse, J.; Ferencei, J.; Ferrarotto, F.; Flamm, K.; Fleischer, M.; Flieser, M.; Flügge, G.; Fomenko, A.; Fominykh, B.; Formánek, J.; Foster, J. M.; Franke, G.; Fretwurst, E.; Gabathuler, E.; Gabathuler, K.; Gaede, F.; Garvey, J.; Gayler, J.; Gebauer, M.; Genzel, H.; Gerhards, R.; Glazov, A.; Goerlach, U.; Goerlich, L.; Gogitidze, N.; Goldberg, M.; Goldner, D.; Golec-Biernat, K.; Gonzalez-Pineiro, B.; Gorelov, I.; Grab, C.; Grässler, H.; Greenshaw, T.; Griffiths, R. K.; Grindhammer, G.; Gruber, A.; Gruber, C.; Haack, J.; Hadig, T.; Haidt, D.; Hajduk, L.; Hampel, M.; Haynes, W. J.; Heinzelmann, G.; Henderson, R. C. W.; Henschel, H.; Herynek, I.; Hess, M. F.; Hewitt, K.; Hildesheim, W.; Hiller, K. H.; Hilton, C. D.; Hladký, J.; Hoeger, K. C.; Höppner, M.; Hoffmann, D.; Holtom, T.; Horisberger, R.; Hudgson, V. L.; Hütte, M.; Ibbotson, M.; Itterbeck, H.; Jacholkowska, A.; Jacobsson, C.; Jaffre, M.; Janoth, J.; Jansen, T.; Jönsson, L.; Johnson, D. P.; Jung, H.; Kalmus, P. I. P.; Kander, M.; Kant, D.; Kaschowitz, R.; Kathage, U.; Katzy, J.; Kaufmann, H. H.; Kaufmann, O.; Kazarian, S.; Kenyon, I. R.; Kermiche, S.; Keuker, C.; Kiesling, C.; Klein, M.; Kleinwort, C.; Knies, G.; Köhler, T.; Köhne, J. H.; Kolanoski, H.; Kole, F.; Kolya, S. D.; Korbel, V.; Korn, M.; Kostka, P.; Kotelnikov, S. K.; Krämerkämper, T.; Krasny, M. W.; Krehbiel, H.; Krücker, D.; Küster, H.; Kuhlen, M.; Kurča, T.; Kurzhöfer, J.; Lacour, D.; Laforge, B.; Lander, R.; Landon, M. P. J.; Lange, W.; Langenegger, U.; Laporte, J.-F.; Lebedev, A.; Lehner, F.; Levonian, S.; Lindström, G.; Lindstroem, M.; Link, J.; Linsel, F.; Lipinski, J.; List, B.; Lobo, G.; Lomas, J. W.; Lopez, G. C.; Lubimov, V.; Lüke, D.; Magnussen, N.; Malinovski, E.; Mani, S.; Maraček, R.; Marage, P.; Marks, J.; Marshall, R.; Martens, J.; Martin, G.; Martin, R.; Martyn, H.-U.; Martyniak, J.; Mavroidis, T.; Maxfield, S. J.; McMahon, S. J.; Mehta, A.; Meier, K.; Meyer, A.; Meyer, A.; Meyer, H.; Meyer, J.; Meyer, P.-O.; Migliori, A.; Mikocki, S.; Milstead, D.; Moeck, J.; Moreau, F.; Morris, J. V.; Mroczko, E.; Müller, D.; Müller, G.; Müller, K.; Müller, M.; Murín, P.; Nagovizin, V.; Nahnhauer, R.; Naroska, B.; Naumann, Th.; Négri, I.; Newman, P. R.; Newton, D.; Nguyen, H. K.; Nicholls, T. 
C.; Niebergall, F.; Niebuhr, C.; Niedzballa, Ch.; Niggli, H.; Nisius, R.; Nowak, G.; Noyes, G. W.; Nyberg-Werther, M.; Oakden, M.; Oberlack, H.; Olsson, J. E.; Ozerov, D.; Palmen, P.; Panaro, E.; Panitch, A.; Pascaud, C.; Patel, G. D.; Pawletta, H.; Peppel, E.; Perez, E.; Phillips, J. P.; Pieuchot, A.; Pitzl, D.; Pope, G.; Prell, S.; Rabbertz, K.; Rädel, G.; Reimer, P.; Reinshagen, S.; Rick, H.; Riech, V.; Riedlberger, J.; Riepenhausen, F.; Riess, S.; Rizvi, E.; Robertson, S. M.; Robmann, P.; Roloff, H. E.; Roosen, R.; Rosenbauer, K.; Rostovtsev, A.; Rouse, F.; Royon, C.; Rüter, K.; Rusakov, S.; Rybicki, K.; Sankey, D. P. C.; Schacht, P.; Schiek, S.; Schleif, S.; Schleper, P.; von Schlippe, W.; Schmidt, D.; Schmidt, G.; Schöning, A.; Schröder, V.; Schuhmann, E.; Schwab, B.; Sefkow, F.; Seidel, M.; Sell, R.; Semenov, A.; Shekelyan, V.; Sheviakov, I.; Shtarkov, L. N.; Siegmon, G.; Siewert, U.; Sirois, Y.; Skillicorn, I. O.; Smirnov, P.; Smith, J. R.; Solochenko, V.; Soloviev, Y.; Specka, A.; Spiekermann, J.; Spielman, S.; Spitzer, H.; Squinabol, F.; Steenbock, M.; Steffen, P.; Steinberg, R.; Steiner, H.; Steinhart, J.; Stella, B.; Stellberger, A.; Stier, J.; Stiewe, J.; Stößlein, U.; Stolze, K.; Straumann, U.; Struczinski, W.; Sutton, J. P.; Tapprogge, S.; Taševský, M.; Tchernyshov, V.; Tchetchelnitski, S.; Theissen, J.; Thiebaux, C.; Thompson, G.; Truöl, P.; Tsipolitis, G.; Turnau, J.; Tutas, J.; Uelkes, P.; Usik, A.; Valkár, S.; Valkárová, A.; Vallée, C.; Vandenplas, D.; van Esch, P.; van Mechelen, P.; Vazdik, Y.; Verrecchia, P.; Villet, G.; Wacker, K.; Wagener, A.; Wagener, M.; Walther, A.; Waugh, B.; Weber, G.; Weber, M.; Wegener, D.; Wegner, A.; Wengler, T.; Werner, M.; West, L. R.; Wilksen, T.; Willard, S.; Winde, M.; Winter, G.-G.; Wittek, C.; Wobisch, M.; Wünsch, E.; Žáček, J.; Zarbock, D.; Zhang, Z.; Zhokin, A.; Zini, P.; Zomer, F.; Zsembery, J.; Zuber, K.; Zurnedden, M.

    1996-12-01

    Using the H1 detector at HERA, charged particle multiplicity distributions in deep inelastic e⁺p scattering have been measured over a large kinematical region. The evolution with W and Q² of the multiplicity distribution and of the multiplicity moments in pseudorapidity domains of varying size is studied in the current fragmentation region of the hadronic centre-of-mass frame. The results are compared with data from fixed-target lepton-nucleon interactions, e⁺e⁻ annihilations and hadron-hadron collisions as well as with expectations from QCD-based parton models. Fits to the Negative Binomial and Lognormal distributions are presented.

  12. The influence of neighborhood characteristics on the relationship between discrimination and increased drug-using social ties among illicit drug users.

    PubMed

    Crawford, Natalie D; Borrell, Luisa N; Galea, Sandro; Ford, Chandra; Latkin, Carl; Fuller, Crystal M

    2013-04-01

    Social discrimination may isolate drug users into higher risk relationships, particularly in disadvantaged neighborhood environments where drug trade occurs. We used negative binomial regression accounting for clustering of individuals within their recruitment neighborhood to investigate the relationship between high-risk drug ties with various forms of social discrimination, neighborhood minority composition, poverty and education. Results show that experiencing discrimination due to drug use is significantly associated with more drug ties in neighborhoods with fewer blacks. Future social network and discrimination research should assess the role of neighborhood social cohesion.

  13. Neighborhood characteristics contribute to urban alcohol availability: Accounting for race/ethnicity and social disorganization.

    PubMed

    Snowden, Aleksandra J

    2016-01-01

    This study examined the role that race/ethnicity and social disorganization play in alcohol availability in Milwaukee, Wisconsin, census block groups. This study estimated negative binomial regression models to examine separately the relationship between neighborhood racial/ethnic composition and social disorganization levels for (1) total, (2) on-premise, and (3) off-premise alcohol outlets. Results of this study suggest that proportion Hispanic was positively associated with total and with off-premise alcohol outlets. Second, proportion African American was negatively associated with on-premise alcohol outlets and positively associated with off-premise alcohol outlets. Proportion Asian was not associated with total, on-premise, or off-premise alcohol outlets. However, the effects of race/ethnicity on alcohol availability were either unrelated or negatively related to alcohol outlet availability once neighborhood social disorganization levels were taken into account, and social disorganization was positively and significantly associated with all alcohol outlet types. Neighborhood characteristics contribute to alcohol availability and must be considered in any efforts aimed toward prevention of alcohol-related negative health and social outcomes.

  14. The five-factor model of impulsivity-like traits and emotional lability in aggressive behavior.

    PubMed

    Dvorak, Robert D; Pearson, Matthew R; Kuvaas, Nicholas J

    2013-01-01

    Factors that increase automatic psychological processes may result in impulsive action and, consequently, aggressive behavior. The current cross-sectional study examined the association between the five-factor model of impulsivity-like traits (negative urgency, positive urgency, premeditation, perseverance, and sensation seeking), emotional lability, and physically aggressive behaviors among college students (n = 481) in a negative binomial hurdle model. In the logistic portion of the model, emotional lability was related to a higher likelihood of engaging in aggressive acts in the past 6 months. The association between emotional lability and the likelihood of aggressive behavior was moderated by two impulsivity-like traits: negative urgency and positive urgency. Specifically, emotional lability was related to engaging in aggressive acts among those with high negative urgency, and among those with low positive urgency. In the count portion of the model, emotional lability was uniquely related to the number of aggressive acts in the past 6 months. Our results indicate that emotional lability and facets of impulsivity interactively relate to engagement in aggressive behavior, suggesting that these variables be integrated into models of aggression. © 2013 Wiley Periodicals, Inc.

  15. A Monte Carlo Risk Analysis of Life Cycle Cost Prediction.

    DTIC Science & Technology

    1975-09-01

    process which occurs with each FLU failure. With this in mind there is no alternative other than the binomial distribution. With all of...Weibull distribution of failures as selected by user. For each failure of the ith FLU, the model then samples from the binomial distribution to determine...which is sampled from the binomial. Neither of the two conditions for normality is met, i.e., that RTS be close to .5 and the number of samples close

  16. Statistical methods for the beta-binomial model in teratology.

    PubMed Central

    Yamamoto, E; Yanagimoto, T

    1994-01-01

    The beta-binomial model is widely used for analyzing teratological data involving littermates. Recent developments in statistical analyses of teratological data are briefly reviewed with emphasis on the model. For statistical inference of the parameters in the beta-binomial distribution, separation of the likelihood yields improved likelihood inference. This leads to reducing biases of estimators and also to improving the accuracy of empirical significance levels of tests. Separate inference of the parameters can be conducted in a unified way. PMID:8187716

  17. Assessment of NDE reliability data

    NASA Technical Reports Server (NTRS)

    Yee, B. G. W.; Couchman, J. C.; Chang, F. H.; Packman, D. F.

    1975-01-01

    Twenty sets of relevant nondestructive test (NDT) reliability data were identified, collected, compiled, and categorized. A criterion for the selection of data for statistical analysis considerations was formulated, and a model to grade the quality and validity of the data sets was developed. Data input formats, which record the pertinent parameters of the defect/specimen and inspection procedures, were formulated for each NDE method. A comprehensive computer program was written and debugged to calculate the probability of flaw detection at several confidence limits by the binomial distribution. This program also selects the desired data sets for pooling and tests the statistical pooling criteria before calculating the composite detection reliability. An example of the calculated reliability of crack detection in bolt holes by an automatic eddy current method is presented.
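
    One common way such a binomial detection-reliability calculation is set up is as a Clopper-Pearson lower confidence bound on the probability of detection (POD); the snippet below is an illustrative stand-in, not the comprehensive program described in the report.

```python
from scipy.stats import beta

def pod_lower_bound(detections, trials, confidence=0.95):
    """Clopper-Pearson lower confidence bound on the probability of flaw detection."""
    if detections == 0:
        return 0.0
    return beta.ppf(1.0 - confidence, detections, trials - detections + 1)

# the classic "29 of 29" demonstration: roughly 90% POD at 95% confidence
print(round(pod_lower_bound(29, 29), 3))   # -> 0.902
```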

  18. Conditional modeling of antibody titers using a zero-inflated poisson random effects model: application to Fabrazyme.

    PubMed

    Bonate, Peter L; Sung, Crystal; Welch, Karen; Richards, Susan

    2009-10-01

    Patients that are exposed to biotechnology-derived therapeutics often develop antibodies to the therapeutic, the magnitude of which is assessed by measuring antibody titers. A statistical approach for analyzing antibody titer data conditional on seroconversion is presented. The proposed method is to first transform the antibody titer data based on a geometric series using a common ratio of 2 and a scale factor of 50 and then analyze the exponent using a zero-inflated or hurdle model assuming a Poisson or negative binomial distribution with random effects to account for patient heterogeneity. Patient specific covariates can be used to model the probability of developing an antibody response, i.e., seroconversion, as well as the magnitude of the antibody titer itself. The method was illustrated using antibody titer data from 87 male seroconverted Fabry patients receiving Fabrazyme. Titers from five clinical trials were collected over 276 weeks of therapy with anti-Fabrazyme IgG titers ranging from 100 to 409,600 after exclusion of seronegative patients. The best model to explain seroconversion was a zero-inflated Poisson (ZIP) model where cumulative dose (under a constant dose regimen of dosing every 2 weeks) influenced the probability of seroconversion. There was an 80% chance of seroconversion when the cumulative dose reached 210 mg (90% confidence interval: 194-226 mg). No difference in antibody titers was noted between Japanese or Western patients. Once seroconverted, antibody titers did not remain constant but decreased in an exponential manner from an initial magnitude to a new lower steady-state value. The expected titer after the new steady-state titer had been achieved was 870 (90% CI: 630-1109). The half-life to the new steady-state value after seroconversion was 44 weeks (90% CI: 17-70 weeks). Time to seroconversion did not appear to be correlated with titer at the time of seroconversion. The method can be adequately used to model antibody titer data.
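
    The titer recoding described here amounts to reading off the exponent k in titer = 50 · 2^k, which is then modelled as the count response; a short illustration with made-up titer values:

```python
import numpy as np

titers = np.array([100, 1600, 409_600])   # doubling-dilution titers starting at 50
k = np.log2(titers / 50).astype(int)      # exponent analysed as the count response
print(k)                                  # [ 1  5 13]  since 409,600 = 50 * 2**13
```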

  19. The Binomial Distribution in Shooting

    ERIC Educational Resources Information Center

    Chalikias, Miltiadis S.

    2009-01-01

    The binomial distribution is used to predict the winner of the 49th International Shooting Sport Federation World Championship in double trap shooting held in 2006 in Zagreb, Croatia. The outcome of the competition was definitely unexpected.
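
    A hedged sketch of the kind of binomial calculation involved: the probability that one shooter outscores another over a fixed number of targets. The hit probabilities and target count are invented, not the championship data.

```python
from scipy.stats import binom

# Hypothetical finalists: per-target hit probabilities over 50 targets
p_a, p_b, n = 0.92, 0.88, 50

# P(A outscores B) = sum_k P(A = k) * P(B <= k - 1), ignoring ties and shoot-offs
p_a_wins = sum(binom.pmf(k, n, p_a) * binom.cdf(k - 1, n, p_b) for k in range(n + 1))
print(round(p_a_wins, 3))
```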

  20. Trends in incidence of occupational asthma, contact dermatitis, noise-induced hearing loss, carpal tunnel syndrome and upper limb musculoskeletal disorders in European countries from 2000 to 2012.

    PubMed

    Stocks, S Jill; McNamee, Roseanne; van der Molen, Henk F; Paris, Christophe; Urban, Pavel; Campo, Giuseppe; Sauni, Riitta; Martínez Jarreta, Begoña; Valenty, Madeleine; Godderis, Lode; Miedinger, David; Jacquetin, Pascal; Gravseth, Hans M; Bonneterre, Vincent; Telle-Lamberton, Maylis; Bensefa-Colas, Lynda; Faye, Serge; Mylle, Godewina; Wannag, Axel; Samant, Yogindra; Pal, Teake; Scholz-Odermatt, Stefan; Papale, Adriano; Schouteden, Martijn; Colosio, Claudio; Mattioli, Stefano; Agius, Raymond

    2015-04-01

    The European Union (EU) strategy for health and safety at work underlines the need to reduce the incidence of occupational diseases (OD), but European statistics to evaluate this common goal are scarce. We aim to estimate and compare changes in incidence over time for occupational asthma, contact dermatitis, noise-induced hearing loss (NIHL), carpal tunnel syndrome (CTS) and upper limb musculoskeletal disorders across 10 European countries. OD surveillance systems that potentially reflected nationally representative trends in incidence within Belgium, the Czech Republic, Finland, France, Italy, the Netherlands, Norway, Spain, Switzerland and the UK provided data. Case counts were analysed using a negative binomial regression model with year as the main covariate. Many systems collected data from networks of 'centres', requiring the use of a multilevel negative binomial model. Some models made allowance for changes in compensation or reporting rules. Reports of contact dermatitis and asthma, conditions with shorter time between exposure to causal substances and OD, were consistently declining with only a few exceptions. For OD with physical causal exposures there was more variation between countries. Reported NIHL was increasing in Belgium, Spain, Switzerland and the Netherlands and decreasing elsewhere. Trends in CTS and upper limb musculoskeletal disorders varied widely within and between countries. This is the first direct comparison of trends in OD within Europe and is consistent with a positive impact of European initiatives addressing exposures relevant to asthma and contact dermatitis. Taking a more flexible approach allowed comparisons of surveillance data between and within countries without harmonisation of data collection methods. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
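
    A minimal single-series sketch of a negative binomial trend model with year as the main covariate, in the spirit of the analysis above; it is not the multilevel model used in the paper, the annual counts are fabricated, and the dispersion parameter is fixed rather than estimated.

```python
import numpy as np
import statsmodels.api as sm

# Fabricated annual case counts for 2000-2012
years = np.arange(2000, 2013)
counts = np.array([120, 115, 118, 110, 104, 101, 99, 95, 90, 92, 85, 83, 80])

X = sm.add_constant(years - years.min())   # year as the main covariate
fit = sm.GLM(counts, X, family=sm.families.NegativeBinomial(alpha=1.0)).fit()

annual_change = np.exp(fit.params[1]) - 1  # multiplicative change in incidence per year
print(f"estimated change per year: {annual_change:.1%}")
```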

  1. Risk factors related to Toxoplasma gondii seroprevalence in indoor-housed Dutch dairy goats.

    PubMed

    Deng, Huifang; Dam-Deisz, Cecile; Luttikholt, Saskia; Maas, Miriam; Nielen, Mirjam; Swart, Arno; Vellema, Piet; van der Giessen, Joke; Opsteegh, Marieke

    2016-02-01

    Toxoplasma gondii can cause disease in goats, but also has impact on human health through food-borne transmission. Our aims were to determine the seroprevalence of T. gondii infection in indoor-housed Dutch dairy goats and to identify the risk factors related to T. gondii seroprevalence. Fifty-two out of ninety approached farmers with indoor-kept goats (58%) participated by answering a standardized questionnaire and contributing 32 goat blood samples each. Serum samples were tested for T. gondii SAG1 antibodies by ELISA and results showed that the frequency distribution of the log10-transformed OD-values fitted well with a binary mixture of a shifted gamma and a shifted reflected gamma distribution. The overall animal seroprevalence was 13.3% (95% CI: 11.7–14.9%), and at least one seropositive animal was found on 61.5% (95% CI: 48.3–74.7%) of the farms. To evaluate potential risk factors on herd level, three modeling strategies (Poisson, negative binomial and zero-inflated) were compared. The negative binomial model fitted the data best with the number of cats (1–4 cats: IR: 2.6, 95% CI: 1.1–6.5; ≥5 cats: IR: 14.2, 95% CI: 3.9–51.1) and mean animal age (IR: 1.5, 95% CI: 1.1–2.1) related to herd positivity. In conclusion, the ELISA test was 100% sensitive and specific based on binary mixture analysis. T. gondii infection is prevalent in indoor housed Dutch dairy goats but at a lower overall animal level seroprevalence than outdoor farmed goats in other European countries, and cat exposure is an important risk factor.
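
    A sketch of the kind of model comparison described above (Poisson versus negative binomial for overdispersed herd counts), run on simulated data; the number of farms, the cat covariate, and the true parameters are all invented for illustration.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
n_farms = 52                                  # hypothetical number of herds
cats = rng.poisson(2, size=n_farms)           # hypothetical cats per farm
mu = np.exp(0.5 + 0.3 * cats)                 # true mean seropositive count
y = rng.negative_binomial(2, 2 / (2 + mu))    # overdispersed counts with mean mu

X = sm.add_constant(cats)
poisson_fit = sm.Poisson(y, X).fit(disp=False)
negbin_fit = sm.NegativeBinomial(y, X).fit(disp=False)
print("Poisson AIC:", poisson_fit.aic, "  NB AIC:", negbin_fit.aic)
print("incidence ratio per extra cat (NB):", np.exp(negbin_fit.params[1]))
```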

  2. Dental enamel defects, caries experience and oral health-related quality of life: a cohort study.

    PubMed

    Arrow, P

    2017-06-01

    The impact of enamel defects of the first permanent molars on caries experience and child oral health-related quality of life was evaluated in a cohort study. Children who participated in a study of enamel defects of the first permanent molars 8 years earlier were invited for a follow-up assessment. Consenting children completed the Child Perception Questionnaire and the faces Modified Child Dental Anxiety Scale, and were examined by two calibrated examiners. ANOVA, Kruskal-Wallis, negative binomial and logistic regression were used for data analyses. One hundred and eleven children returned a completed questionnaire and 91 were clinically examined. Negative binomial regression found that oral health impacts were associated with gender (boys, risk ratio (RR) = 0.73, P = 0.03) and decayed, missing or filled permanent teeth (DMFT) (RR = 1.1, P = 0.04). The mean DMFT was 0.9 (standard deviation (SD) = 1.4) in children with sound enamel, 0.8 (SD = 1.7) with diffuse defects, 1.5 (SD = 1.4) with demarcated defects and 1.3 (SD = 2.3) with pit defects (Kruskal-Wallis, P = 0.05). Logistic regression of first permanent molar caries found higher odds of caries experience with baseline primary tooth caries experience (odds ratio (OR) = 1.5, P = 0.01), the number of teeth affected by enamel defects (OR = 1.9, P = 0.05) and lower odds with the presence of diffuse enamel defects (OR = 0.1, P = 0.04). The presence of diffuse enamel defects was associated with lower odds of caries experience. © 2016 Australian Dental Association.

  3. EM Adaptive LASSO—A Multilocus Modeling Strategy for Detecting SNPs Associated with Zero-inflated Count Phenotypes

    PubMed Central

    Mallick, Himel; Tiwari, Hemant K.

    2016-01-01

    Count data are increasingly ubiquitous in genetic association studies, where it is possible to observe excess zero counts as compared to what is expected based on standard assumptions. For instance, in rheumatology, data are usually collected in multiple joints within a person or multiple sub-regions of a joint, and it is not uncommon that the phenotypes contain enormous number of zeroes due to the presence of excessive zero counts in majority of patients. Most existing statistical methods assume that the count phenotypes follow one of these four distributions with appropriate dispersion-handling mechanisms: Poisson, Zero-inflated Poisson (ZIP), Negative Binomial, and Zero-inflated Negative Binomial (ZINB). However, little is known about their implications in genetic association studies. Also, there is a relative paucity of literature on their usefulness with respect to model misspecification and variable selection. In this article, we have investigated the performance of several state-of-the-art approaches for handling zero-inflated count data along with a novel penalized regression approach with an adaptive LASSO penalty, by simulating data under a variety of disease models and linkage disequilibrium patterns. By taking into account data-adaptive weights in the estimation procedure, the proposed method provides greater flexibility in multi-SNP modeling of zero-inflated count phenotypes. A fast coordinate descent algorithm nested within an EM (expectation-maximization) algorithm is implemented for estimating the model parameters and conducting variable selection simultaneously. Results show that the proposed method has optimal performance in the presence of multicollinearity, as measured by both prediction accuracy and empirical power, which is especially apparent as the sample size increases. Moreover, the Type I error rates become more or less uncontrollable for the competing methods when a model is misspecified, a phenomenon routinely encountered in practice. PMID:27066062
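
    For reference, a minimal sketch of the zero-inflated Poisson probability mass that underlies the ZIP models discussed above; the parameter values in the usage line are arbitrary.

```python
import numpy as np
from scipy.stats import poisson

def zip_pmf(y, lam, pi):
    """Zero-inflated Poisson: a structural zero with probability pi,
    otherwise a Poisson(lam) draw."""
    y = np.asarray(y)
    return pi * (y == 0) + (1 - pi) * poisson.pmf(y, lam)

print(zip_pmf([0, 1, 2], lam=1.5, pi=0.3))
```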

  4. EM Adaptive LASSO-A Multilocus Modeling Strategy for Detecting SNPs Associated with Zero-inflated Count Phenotypes.

    PubMed

    Mallick, Himel; Tiwari, Hemant K

    2016-01-01

    Count data are increasingly ubiquitous in genetic association studies, where it is possible to observe excess zero counts as compared to what is expected based on standard assumptions. For instance, in rheumatology, data are usually collected in multiple joints within a person or multiple sub-regions of a joint, and it is not uncommon that the phenotypes contain enormous number of zeroes due to the presence of excessive zero counts in majority of patients. Most existing statistical methods assume that the count phenotypes follow one of these four distributions with appropriate dispersion-handling mechanisms: Poisson, Zero-inflated Poisson (ZIP), Negative Binomial, and Zero-inflated Negative Binomial (ZINB). However, little is known about their implications in genetic association studies. Also, there is a relative paucity of literature on their usefulness with respect to model misspecification and variable selection. In this article, we have investigated the performance of several state-of-the-art approaches for handling zero-inflated count data along with a novel penalized regression approach with an adaptive LASSO penalty, by simulating data under a variety of disease models and linkage disequilibrium patterns. By taking into account data-adaptive weights in the estimation procedure, the proposed method provides greater flexibility in multi-SNP modeling of zero-inflated count phenotypes. A fast coordinate descent algorithm nested within an EM (expectation-maximization) algorithm is implemented for estimating the model parameters and conducting variable selection simultaneously. Results show that the proposed method has optimal performance in the presence of multicollinearity, as measured by both prediction accuracy and empirical power, which is especially apparent as the sample size increases. Moreover, the Type I error rates become more or less uncontrollable for the competing methods when a model is misspecified, a phenomenon routinely encountered in practice.

  5. Dental plaque, preventive care, and tooth brushing associated with dental caries in primary teeth in schoolchildren ages 6–9 years of Leon, Nicaragua

    PubMed Central

    del Socorro Herrera, Miriam; Medina-Solis, Carlo Eduardo; Minaya-Sánchez, Mirna; Pontigo-Loyola, América Patricia; Villalobos-Rodelo, Juan José; Islas-Granillo, Horacio; de la Rosa-Santillana, Rubén; Maupomé, Gerardo

    2013-01-01

    Background Our study aimed to evaluate the effect of various risk indicators for dental caries on primary teeth of Nicaraguan children (from Leon, Nicaragua) ages 6 to 9, using the negative binomial regression model. Material/Methods A cross-sectional study was carried out to collect clinical, demographic, socioeconomic, and behavioral data from 794 schoolchildren ages 6 to 9 years, randomly selected from 25 schools in the city of León, Nicaragua. Clinical examinations for dental caries (dmft index) were performed by 2 trained and standardized examiners. Socio-demographic, socioeconomic, and behavioral data were self-reported using questionnaires. Multivariate negative binomial regression (NBR) analysis was used. Results Mean age was 7.49±1.12 years. Boys accounted for 50.1% of the sample. Mean dmft was 3.54±3.13 and caries prevalence (dmft >0) was 77.6%. In the NBR multivariate model (p<0.05), for each year of age, the expected mean dmft decreased by 7.5%. Brushing teeth at least once a day and having received preventive dental care in the last year before data collection were associated with declines in the expected mean dmft by 19.5% and 69.6%, respectively. Presence of dental plaque increased the expected mean dmft by 395.5%. Conclusions The proportion of students with caries in this sample was high. We found associations between dental caries in the primary dentition and dental plaque, brushing teeth at least once a day, and having received preventive dental care. To improve oral health, school programs and/or age-appropriate interventions need to be developed based on the specific profile of caries experience and the associated risk indicators. PMID:24247119
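
    A small worked sketch of how negative binomial regression coefficients map to the percent changes quoted above (the 7.5% decline per year of age and the 395.5% increase with plaque); the back-calculated coefficients are purely illustrative.

```python
import numpy as np

def pct_change(beta):
    """Percent change in the expected count per one-unit covariate increase."""
    return (np.exp(beta) - 1) * 100

beta_age = np.log(1 - 0.075)      # -7.5% per year of age  ->  beta ~ -0.078
beta_plaque = np.log(1 + 3.955)   # +395.5% with plaque    ->  beta ~  1.60
print(pct_change(beta_age), pct_change(beta_plaque))
```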

  6. Frequency distribution of Echinococcus multilocularis and other helminths of foxes in Kyrgyzstan

    PubMed Central

    Ziadinov, I.; Deplazes, P.; Mathis, A.; Mutunova, B.; Abdykerimov, K.; Nurgaziev, R.; Torgerson, P. R.

    2010-01-01

    Echinococcosis is a major emerging zoonosis in central Asia. A study of the helminth fauna of foxes from Naryn Oblast in central Kyrgyzstan was undertaken to investigate the abundance of Echinococcus multilocularis in a district where a high prevalence of this parasite had previously been detected in dogs. A total of 151 foxes (Vulpes vulpes) were investigated in a necropsy study. Of these 96 (64%) were infected with E. multilocularis with a mean abundance of 8669 parasites per fox. This indicates that red foxes are a major definitive host of E. multilocularis in this country. This also demonstrates that the abundance and prevalence of E. multilocularis in the natural definitive host are likely to be high in geographical regions where there is a concomitant high prevalence in alternative definitive hosts such as dogs. In addition, Mesocestoides spp., Dipylidium caninum, Taenia spp., Toxocara canis, Toxascaris leonina, Capillaria and Acanthocephala spp. were found in 99 (66%), 50 (33%), 48 (32%), 46 (30%), 9 (6%), 34 (23%) and 2 (1%) of foxes, respectively. The prevalence but not the abundance of E. multilocularis decreased with age. The abundance of Dipylidium caninum also decreased with age. The frequency distribution of E. multilocularis and Mesocestoides spp. followed a zero inflated negative binomial distribution, whilst all other helminths had a negative binomial distribution. This demonstrates that the frequency distribution of positive counts and not just the frequency of zeros in the data set can determine if a zero inflated or non-zero inflated model is more appropriate. This is because the prevalences of E. multilocularis and Mesocestoides spp. were the highest (and hence had fewest zero counts) yet the parasite distribution nevertheless gave a better fit to the zero inflated models. PMID:20434845

  7. Mycophenolic Acid Pharmacokinetics and Relapse in Children with Steroid–Dependent Idiopathic Nephrotic Syndrome

    PubMed Central

    Tellier, Stéphanie; Dallocchio, Aymeric; Guigonis, Vincent; Saint-Marcoux, Frank; Llanas, Brigitte; Ichay, Lydia; Bandin, Flavio; Godron, Astrid; Morin, Denis; Brochard, Karine; Gandia, Peggy; Bouchet, Stéphane; Marquet, Pierre; Decramer, Stéphane

    2016-01-01

    Background and objectives Therapeutic drug monitoring of mycophenolic acid can improve clinical outcome in organ transplantation and lupus, but data are scarce in idiopathic nephrotic syndrome. The aim of our study was to investigate whether mycophenolic acid pharmacokinetics are associated with disease control in children receiving mycophenolate mofetil for the treatment of steroid–dependent nephrotic syndrome. Design, setting, participants, & measurements This was a retrospective multicenter study including 95 children with steroid–dependent nephrotic syndrome treated with mycophenolate mofetil with or without steroids. Area under the concentration-time curve of mycophenolic acid was determined in all children on the basis of sampling times at 20, 60, and 180 minutes postdose, using Bayesian estimation. The association between a threshold value of the area under the concentration-time curve of mycophenolic acid and the relapse rate was assessed using a negative binomial model. Results In total, 140 areas under the concentration-time curve of mycophenolic acid were analyzed. The findings indicate individual dose adaptation in 53 patients (38%) to achieve an area under the concentration-time curve target of 30–60 mg·h/L. In a multivariable negative binomial model including sex, age at disease onset, time to start of mycophenolate mofetil, previous immunomodulatory treatment, and concomitant prednisone dose, a level of area under the concentration-time curve of mycophenolic acid >45 mg·h/L was significantly associated with a lower relapse rate (rate ratio, 0.65; 95% confidence interval, 0.46 to 0.89; P=0.01). Conclusions Therapeutic drug monitoring leading to individualized dosing may improve the efficacy of mycophenolate mofetil in steroid–dependent nephrotic syndrome. Additional prospective studies are warranted to determine the optimal target for area under the concentration-time curve of mycophenolic acid in this population. PMID:27445161

  8. Mycophenolic Acid Pharmacokinetics and Relapse in Children with Steroid-Dependent Idiopathic Nephrotic Syndrome.

    PubMed

    Tellier, Stéphanie; Dallocchio, Aymeric; Guigonis, Vincent; Saint-Marcoux, Frank; Llanas, Brigitte; Ichay, Lydia; Bandin, Flavio; Godron, Astrid; Morin, Denis; Brochard, Karine; Gandia, Peggy; Bouchet, Stéphane; Marquet, Pierre; Decramer, Stéphane; Harambat, Jérôme

    2016-10-07

    Therapeutic drug monitoring of mycophenolic acid can improve clinical outcome in organ transplantation and lupus, but data are scarce in idiopathic nephrotic syndrome. The aim of our study was to investigate whether mycophenolic acid pharmacokinetics are associated with disease control in children receiving mycophenolate mofetil for the treatment of steroid-dependent nephrotic syndrome. This was a retrospective multicenter study including 95 children with steroid-dependent nephrotic syndrome treated with mycophenolate mofetil with or without steroids. Area under the concentration-time curve of mycophenolic acid was determined in all children on the basis of sampling times at 20, 60, and 180 minutes postdose, using Bayesian estimation. The association between a threshold value of the area under the concentration-time curve of mycophenolic acid and the relapse rate was assessed using a negative binomial model. In total, 140 areas under the concentration-time curve of mycophenolic acid were analyzed. The findings indicate individual dose adaptation in 53 patients (38%) to achieve an area under the concentration-time curve target of 30-60 mg·h/L. In a multivariable negative binomial model including sex, age at disease onset, time to start of mycophenolate mofetil, previous immunomodulatory treatment, and concomitant prednisone dose, a level of area under the concentration-time curve of mycophenolic acid >45 mg·h/L was significantly associated with a lower relapse rate (rate ratio, 0.65; 95% confidence interval, 0.46 to 0.89; P =0.01). Therapeutic drug monitoring leading to individualized dosing may improve the efficacy of mycophenolate mofetil in steroid-dependent nephrotic syndrome. Additional prospective studies are warranted to determine the optimal target for area under the concentration-time curve of mycophenolic acid in this population. Copyright © 2016 by the American Society of Nephrology.

  9. Multivariate random-parameters zero-inflated negative binomial regression model: an application to estimate crash frequencies at intersections.

    PubMed

    Dong, Chunjiao; Clarke, David B; Yan, Xuedong; Khattak, Asad; Huang, Baoshan

    2014-09-01

    Crash data are collected through police reports and integrated with road inventory data for further analysis. Integrated police reports and inventory data yield correlated multivariate data for roadway entities (e.g., segments or intersections). Analysis of such data reveals important relationships that can help focus on high-risk situations and inform safety countermeasures. To understand relationships between crash frequencies and associated variables, while taking full advantage of the available data, multivariate random-parameters models are appropriate since they can simultaneously consider the correlation among the specific crash types and account for unobserved heterogeneity. However, a key issue that arises with correlated multivariate data is that the number of crash-free samples increases as crash counts are split into many categories. In this paper, we describe a multivariate random-parameters zero-inflated negative binomial (MRZINB) regression model for jointly modeling crash counts. The full Bayesian method is employed to estimate the model parameters. Crash frequencies at urban signalized intersections in Tennessee are analyzed. The paper investigates the performance of MZINB and MRZINB regression models in establishing the relationship between crash frequencies, pavement conditions, traffic factors, and geometric design features of roadway intersections. Compared to the MZINB model, the MRZINB model identifies additional statistically significant factors and provides better goodness of fit in developing the relationships. The empirical results show that the MRZINB model possesses most of the desirable statistical properties in terms of its ability to accommodate unobserved heterogeneity and excess zero counts in correlated data. Notably, in the random-parameters MZINB model, the estimated parameters vary significantly across intersections for different crash types. Copyright © 2014 Elsevier Ltd. All rights reserved.

  10. Gingival recession and associated factors in a homogeneous Mexican adult male population: A cross-sectional study

    PubMed Central

    Minaya-Sánchez, Mirna; Medina-Solís, Carlo E.; Vallejos-Sánchez, Ana A.; Marquez-Corona, Maria L.; Pontigo-Loyola, América P.; Islas-Granillo, Horacio; Maupomé, Gerardo

    2012-01-01

    Background: Diverse variables are implicated in the pathogenesis of gingival recession; more detailed knowledge about the relationship between the clinical presentation of gingival recession and assorted risk indicators may lead to improved patient monitoring, early intervention, and subsequent prevention. The objective was to evaluate clinically gingival recession in a homogeneous Mexican adult male population and to determine the strength of association with related factors. Method: A cross-sectional study was carried out in a largely homogeneous group in terms of ethnic background, socioeconomic status, gender, occupation, and medical/dental insurance, in Campeche, Mexico. Periodontal examinations were undertaken to determine diverse clinical dental variables. All periodontal clinical examinations were assessed using the Florida Probe System, a dental chair and one examiner. Questionnaires were used to collect diverse risk indicators. Statistical analyses were undertaken with negative binomial regression models. Results: The mean number of sites with gingival recession per subject was 6.73±5.81; the prevalence was 87.6%. In the negative binomial regression model we observed that for (i) each year of age, and (ii) each percentage unit of increase in sites with plaque, and (iii) with suppuration, mean sites with gingival recession increased 2.9%, 1.0% and 13.0%, respectively. Having a spouse was associated with gingival recession. Conclusions: We observed association between gingival recession, and sociodemographic and clinical parameters. Patients need to be educated about risk indicators for gingival recession as well as the preventive maneuvers that may be implemented to minimize its occurrence. The potential of improved oral self-care to prevent a largely benign condition such as gingival recession is important, given the associated disorders that may ensue root exposure, such as root caries and root hypersensitivity. Key words:Oral health, periodontal health, gingival recession, adults, Mexico. PMID:22549678

  11. Recurrent suicide attempts in patients with depressive and anxiety disorders: the role of borderline personality traits.

    PubMed

    Stringer, Barbara; van Meijel, Berno; Eikelenboom, Merijn; Koekkoek, Bauke; Licht, Carmilla M M; Kerkhof, Ad J F M; Penninx, Brenda W J H; Beekman, Aartjan T F

    2013-10-01

    The presence of a comorbid borderline personality disorder (BPD) may be associated with an increase of suicidal behaviors in patients with depressive and anxiety disorders. The aim of this study is to examine the role of borderline personality traits on recurrent suicide attempts. The Netherlands Study on Depression and Anxiety included 1838 respondents with lifetime depressive and/or anxiety disorders, of whom 309 reported at least one previous suicide attempt. A univariable negative binomial regression analysis was performed to examine the association between comorbid borderline personality traits and suicide attempts. Univariable and multivariable negative binomial regression analyses were performed to identify risk factors for the number of recurrent suicide attempts in four clusters (type and severity of axis-I disorders, BPD traits, determinants of suicide attempts and socio-demographics). In the total sample the suicide attempt rate ratio increased with 33% for every unit increase in BPD traits. A lifetime diagnosis of dysthymia and comorbid BPD traits, especially the symptoms anger and fights, were independently and significantly associated with recurrent suicide attempts in the final model (n=309). The screening of personality disorders was added to the NESDA assessments at the 4-year follow-up for the first time. Therefore we were not able to examine the influence of comorbid BPD traits on suicide attempts over time. Persons with a lifetime diagnosis of dysthymia combined with borderline personality traits especially difficulties in coping with anger seemed to be at high risk for recurrent suicide attempts. For clinical practice, it is recommended to screen for comorbid borderline personality traits and to strengthen the patient's coping skills with regard to anger. © 2013 Elsevier B.V. All rights reserved.

  12. Analysing recurrent hospitalizations in heart failure: a review of statistical methodology, with application to CHARM-Preserved.

    PubMed

    Rogers, Jennifer K; Pocock, Stuart J; McMurray, John J V; Granger, Christopher B; Michelson, Eric L; Östergren, Jan; Pfeffer, Marc A; Solomon, Scott D; Swedberg, Karl; Yusuf, Salim

    2014-01-01

    Heart failure is characterized by recurrent hospitalizations, but often only the first event is considered in clinical trial reports. In chronic diseases, such as heart failure, analysing all events gives a more complete picture of treatment benefit. We describe methods of analysing repeat hospitalizations, and illustrate their value in one major trial. The Candesartan in Heart failure Assessment of Reduction in Mortality and morbidity (CHARM)-Preserved study compared candesartan with placebo in 3023 patients with heart failure and preserved systolic function. The heart failure hospitalization rates were 12.5 and 8.9 per 100 patient-years in the placebo and candesartan groups, respectively. The repeat hospitalizations were analysed using the Andersen-Gill, Poisson, and negative binomial methods. Death was incorporated into analyses by treating it as an additional event. The win ratio method and a method that jointly models hospitalizations and mortality were also considered. Using repeat events gave larger treatment benefits than time to first event analysis. The negative binomial method for the composite of recurrent heart failure hospitalizations and cardiovascular death gave a rate ratio of 0.75 [95% confidence interval (CI) 0.62-0.91, P = 0.003], whereas the hazard ratio for time to first heart failure hospitalization or cardiovascular death was 0.86 (95% CI 0.74-1.00, P = 0.050). In patients with preserved EF, candesartan reduces the rate of admissions for worsening heart failure, to a greater extent than apparent from analysing only first hospitalizations. Recurrent events should be routinely incorporated into the analysis of future clinical trials in heart failure. © 2013 The Authors. European Journal of Heart Failure © 2013 European Society of Cardiology.
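
    A hedged sketch of a negative binomial analysis of recurrent hospitalization counts with follow-up time as exposure, run on simulated data; the event rates, gamma frailty, and fixed dispersion are assumptions for illustration, not the CHARM-Preserved analysis itself.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(1)
n = 300
treat = rng.integers(0, 2, size=n)                 # 0 = placebo, 1 = active arm
followup = rng.uniform(1.0, 3.0, size=n)           # years of follow-up
frailty = rng.gamma(2.0, 0.5, size=n)              # patient heterogeneity (mean 1)
rate = 0.125 * np.where(treat == 1, 0.75, 1.0)     # hypothetical events per patient-year
events = rng.poisson(rate * followup * frailty)    # overdispersed recurrent event counts

X = sm.add_constant(treat)
fit = sm.GLM(events, X,
             family=sm.families.NegativeBinomial(alpha=0.5),
             exposure=followup).fit()
print("rate ratio (active vs placebo):", np.exp(fit.params[1]))
```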

  13. Prediction of different ovarian responses using anti-Müllerian hormone following a long agonist treatment protocol for IVF.

    PubMed

    Heidar, Z; Bakhtiyari, M; Mirzamoradi, M; Zadehmodarres, S; Sarfjoo, F S; Mansournia, M A

    2015-09-01

    The purpose of this study was to predict poor and excessive ovarian response using anti-Müllerian hormone (AMH) levels following a long agonist protocol in IVF candidates. Through a prospective cohort study, the type of relationship and appropriate scale for AMH were determined using fractional polynomial regression. To determine the effect of AMH on the outcomes of ovarian stimulation and different ovarian responses, multinomial and negative binomial regression models were fitted using a backward stepwise method. The ovarian response of study subjects who entered a standard long-term treatment cycle with a GnRH agonist was evaluated using the prediction models, separately and in combination, with receiver operating characteristic (ROC) curves. The use of standard long-term treatments with GnRH agonist led to positive pregnancy test results in 30% of treated patients. With each unit increase in the log of AMH, the odds ratio of having a poor response compared to a normal response decreases by 64% (OR 0.36, 95% CI 0.19-0.68). Also, the results of the negative binomial regression model indicated that for one unit increase in the log of AMH blood levels, the odds of releasing an oocyte increased 24% (OR 1.24, 95% CI 1.14-1.35). The optimal cut-off points of AMH for predicting excessive and poor ovarian responses were 3.4 and 1.2 ng/ml, respectively, with areas under the curve of 0.69 (0.60-0.77) and 0.76 (0.66-0.86), respectively. By considering the age of the patient undergoing infertility treatment as a variable affecting ovulation, AMH levels proved to be a good test to discriminate between different ovarian responses.

  14. Clinical and MRI activity as determinants of sample size for pediatric multiple sclerosis trials

    PubMed Central

    Verhey, Leonard H.; Signori, Alessio; Arnold, Douglas L.; Bar-Or, Amit; Sadovnick, A. Dessa; Marrie, Ruth Ann; Banwell, Brenda

    2013-01-01

    Objective: To estimate sample sizes for pediatric multiple sclerosis (MS) trials using new T2 lesion count, annualized relapse rate (ARR), and time to first relapse (TTFR) endpoints. Methods: Poisson and negative binomial models were fit to new T2 lesion and relapse count data, and negative binomial time-to-event and exponential models were fit to TTFR data of 42 children with MS enrolled in a national prospective cohort study. Simulations were performed by resampling from the best-fitting model of new T2 lesion count, number of relapses, or TTFR, under various assumptions of the effect size, trial duration, and model parameters. Results: Assuming a 50% reduction in new T2 lesions over 6 months, 90 patients/arm are required, whereas 165 patients/arm are required for a 40% treatment effect. Sample sizes for 2-year trials using relapse-related endpoints are lower than that for 1-year trials. For 2-year trials and a conservative assumption of overdispersion (ϑ), sample sizes range from 70 patients/arm (using ARR) to 105 patients/arm (TTFR) for a 50% reduction in relapses, and 230 patients/arm (ARR) to 365 patients/arm (TTFR) for a 30% relapse reduction. Assuming a less conservative ϑ, 2-year trials using ARR require 45 patients/arm (60 patients/arm for TTFR) for a 50% reduction in relapses and 145 patients/arm (200 patients/arm for TTFR) for a 30% reduction. Conclusion: Six-month phase II trials using new T2 lesion count as an endpoint are feasible in the pediatric MS population; however, trials powered on ARR or TTFR will need to be 2 years in duration and will require multicentered collaboration. PMID:23966255
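
    A rough sketch of the simulation-based sample size logic described above: resample counts from an assumed negative binomial model and estimate power. As a simplification, a nonparametric test stands in for refitting the negative binomial model in each simulation, and all parameter values are invented.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)

def simulated_power(n_per_arm, control_mean, reduction, theta, n_sim=1000, alpha=0.05):
    """Power for detecting a relative reduction in new-T2-lesion counts,
    assuming negative binomial counts with dispersion parameter theta."""
    p_control = theta / (theta + control_mean)
    p_treated = theta / (theta + control_mean * (1 - reduction))
    hits = 0
    for _ in range(n_sim):
        y0 = rng.negative_binomial(theta, p_control, size=n_per_arm)
        y1 = rng.negative_binomial(theta, p_treated, size=n_per_arm)
        # stand-in test; a real analysis would refit the negative binomial model
        _, pval = stats.mannwhitneyu(y0, y1, alternative="two-sided")
        hits += pval < alpha
    return hits / n_sim

print(simulated_power(90, control_mean=2.0, reduction=0.5, theta=1.0))
```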

  15. Evaluation of logistic regression models and effect of covariates for case-control study in RNA-Seq analysis.

    PubMed

    Choi, Seung Hoan; Labadorf, Adam T; Myers, Richard H; Lunetta, Kathryn L; Dupuis, Josée; DeStefano, Anita L

    2017-02-06

    Next generation sequencing provides a count of RNA molecules in the form of short reads, yielding discrete, often highly non-normally distributed gene expression measurements. Although Negative Binomial (NB) regression has been generally accepted in the analysis of RNA sequencing (RNA-Seq) data, its appropriateness has not been exhaustively evaluated. We explore logistic regression as an alternative method for RNA-Seq studies designed to compare cases and controls, where disease status is modeled as a function of RNA-Seq reads using simulated and Huntington disease data. We evaluate the effect of adjusting for covariates that have an unknown relationship with gene expression. Finally, we incorporate the data adaptive method in order to compare false positive rates. When the sample size is small or the expression levels of a gene are highly dispersed, the NB regression shows inflated Type-I error rates but the Classical logistic and Bayes logistic (BL) regressions are conservative. Firth's logistic (FL) regression performs well or is slightly conservative. Large sample size and low dispersion generally make Type-I error rates of all methods close to nominal alpha levels of 0.05 and 0.01. However, Type-I error rates are controlled after applying the data adaptive method. The NB, BL, and FL regressions gain increased power with large sample size, large log2 fold-change, and low dispersion. The FL regression has comparable power to NB regression. We conclude that implementing the data adaptive method appropriately controls Type-I error rates in RNA-Seq analysis. Firth's logistic regression provides a concise statistical inference process and reduces spurious associations from inaccurately estimated dispersion parameters in the negative binomial framework.

  16. Assessing historical rate changes in global tsunami occurrence

    USGS Publications Warehouse

    Geist, E.L.; Parsons, T.

    2011-01-01

    The global catalogue of tsunami events is examined to determine if transient variations in tsunami rates are consistent with a Poisson process commonly assumed for tsunami hazard assessments. The primary data analyzed are tsunamis with maximum sizes >1m. The record of these tsunamis appears to be complete since approximately 1890. A secondary data set of tsunamis >0.1m is also analyzed that appears to be complete since approximately 1960. Various kernel density estimates used to determine the rate distribution with time indicate a prominent rate change in global tsunamis during the mid-1990s. Less prominent rate changes occur in the early- and mid-20th century. To determine whether these rate fluctuations are anomalous, the distribution of annual event numbers for the tsunami catalogue is compared to Poisson and negative binomial distributions, the latter of which includes the effects of temporal clustering. Compared to a Poisson distribution, the negative binomial distribution model provides a consistent fit to tsunami event numbers for the >1m data set, but the Poisson null hypothesis cannot be falsified for the shorter duration >0.1m data set. Temporal clustering of tsunami sources is also indicated by the distribution of interevent times for both data sets. Tsunami event clusters consist only of two to four events, in contrast to protracted sequences of earthquakes that make up foreshock-main shock-aftershock sequences. From past studies of seismicity, it is likely that there is a physical triggering mechanism responsible for events within the tsunami source 'mini-clusters'. In conclusion, prominent transient rate increases in the occurrence of global tsunamis appear to be caused by temporal grouping of geographically distinct mini-clusters, in addition to the random preferential location of global M >7 earthquakes along offshore fault zones.
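
    A small sketch of the comparison described (Poisson versus negative binomial fits to annual tsunami counts), using method-of-moments estimates on fabricated annual counts rather than the catalogue data.

```python
import numpy as np
from scipy import stats

# Fabricated annual counts of tsunamis with maximum size > 1 m
counts = np.array([1, 2, 0, 3, 1, 4, 2, 2, 5, 1, 0, 3, 2, 6, 1, 2, 3, 0, 4, 2])

mean, var = counts.mean(), counts.var(ddof=1)
print("mean:", mean, "variance:", var)        # variance > mean suggests clustering

# Method-of-moments negative binomial fit: var = mean + mean**2 / k
k = mean**2 / (var - mean)
p = k / (k + mean)

print("Poisson log-likelihood:", stats.poisson.logpmf(counts, mean).sum())
print("neg. binomial log-likelihood:", stats.nbinom.logpmf(counts, k, p).sum())
```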

  17. Quality of maternity care and its determinants along the continuum in Kenya: A structural equation modeling analysis

    PubMed Central

    Mendez, Bomar Rojas

    2017-01-01

    Background Improving access to delivery services does not guarantee access to quality obstetric care and better survival, and therefore, concerns for quality of maternal and newborn care in low- and middle-income countries have been raised. Our study explored characteristics associated with the quality of initial assessment, intrapartum, and immediate postpartum and newborn care, and further assessed the relationships along the continuum of care. Methods The 2010 Service Provision Assessment data of Kenya for 627 routine deliveries of women aged 15–49 were used. Quality of care measures were assessed using recently validated quality of care measures during initial assessment, intrapartum, and postpartum periods. Data were analyzed with negative binomial regression and structural equation modeling technique. Results The negative binomial regression results identified a number of determinants of quality, such as the level of health facilities, managing authority, presence of delivery fee, central electricity supply and clinical guideline for maternal and neonatal care. Our structural equation modeling (SEM) further demonstrated that facility characteristics were important determinants of quality for initial assessment and postpartum care, while characteristics at the provider level became more important in shaping the quality of intrapartum care. Furthermore we also noted that quality of initial assessment had a positive association with quality of intrapartum care (β = 0.71, p < 0.001), which in turn was positively associated with the quality of newborn and immediate postpartum care (β = 1.29, p = 0.004). Conclusions A continued focus on quality of care along the continuum of maternity care is important not only to mothers but also their newborns. Policymakers should therefore ensure that required resources, as well as adequate supervision and emphasis on the quality of obstetric care, are available. PMID:28520771

  18. Factors Associated with Dental Caries in a Group of American Indian Children at age 36 Months

    PubMed Central

    Warren, John J.; Blanchette, Derek; Dawson, Deborah V.; Marshall, Teresa A.; Phipps, Kathy R.; Starr, Delores; Drake, David R.

    2015-01-01

    Objectives Early childhood caries (ECC) is rampant among American Indian children, but there has been relatively little study of this problem. This paper reports on risk factors for caries for a group of American Indian children at age 36 months as part of a longitudinal study. Methods Pregnant women from a Northern Plains Tribal community were recruited to participate in a longitudinal study of caries and caries risk factors. Standardized dental examinations were completed on children and questionnaires were completed by mothers at baseline and when children were 4, 8, 12, 16, 22, 28 and 36 months of age. Examinations were surface-specific for dental caries, and the questionnaires collected data on demographic, dietary and behavioral factors. Non-parametric bivariate tests and logistic regression models were used to identify risk factors for caries at 36 months, and negative binomial regression was used to identify factors related to caries severity (dmf counts). Results Among the 232 children, caries prevalence for cavitated lesions was 80%, with an additional 15% having only non-cavitated lesions. The mean dmfs was 9.6, and of the total dmfs, nearly 62% of affected surfaces were decayed, 31% were missing, and 7% were filled. Logistic regression identified higher added sugar beverage consumption, younger maternal age at baseline, higher maternal DMFS at baseline, and greater number of people in the household as significant (p<0.05) risk factors. Negative binomial regression found that only maternal DMFS was associated with child dmf counts. Conclusions By the age of 36 months, dental caries is nearly universal in this population of American Indian children. Caries risk factors included sugared beverage consumption, greater household size and maternal factors, but further analyses are needed to better understand caries in this population. PMID:26544674

  19. Household expenditure on leprosy outpatient services in the Indian health system: A comparative study

    PubMed Central

    Suryawanshi, Pramilesh; Raikwar, Akash; Arif, Mohammad; Richardus, Jan Hendrik

    2018-01-01

    Background Leprosy is a major public health problem in many low and middle income countries, especially in India, and contributes considerably to the global burden of the disease. Leprosy and poverty are closely associated, and therefore the economic burden of leprosy is a concern. However, evidence on patient’s expenditure is scarce. In this study, we estimate the expenditure in primary care (outpatient) by leprosy households in two different public health settings. Methodology/Principal findings We performed a cross-sectional study, comparing the Union Territory of Dadra and Nagar Haveli with the Umbergaon block of Valsad, Gujrat, India. A household (HH) survey was conducted between May and October, 2016. We calculated direct and indirect expenditure by zero inflated negative binomial and negative binomial regression. The sampled households were comparable on socioeconomic indicators. The mean direct expenditure was USD 6.5 (95% CI: 2.4–17.9) in Dadra and Nagar Haveli and USD 5.4 (95% CI: 3.8–7.9) per visit in Umbergaon. The mean indirect expenditure was USD 8.7 (95% CI: 7.2–10.6) in Dadra and Nagar Haveli and USD 12.4 (95% CI: 7.0–21.9) in Umbergaon. The age of the leprosy patients and type of health facilities were the major predictors of total expenditure on leprosy primary care. The higher the age, the higher the expenditure at both sites. The private facilities are more expensive than the government facilities at both sites. If the public health system is enhanced, government facilities are the first preference for patients. Conclusions/Significance An enhanced public health system reduces the patient’s expenditure and improves the health seeking behaviour. We recommend investing in health system strengthening to reduce the economic burden of leprosy. PMID:29300747

  20. Homicide mortality rates in Canada, 2000-2009: Youth at increased risk.

    PubMed

    Basham, C Andrew; Snider, Carolyn

    2016-10-20

    To estimate and compare Canadian homicide mortality rates (HMRs) and trends in HMRs across age groups, with a focus on trends for youth. Data for the period of 2000 to 2009 were collected from Statistics Canada's CANSIM (Canadian Statistical Information Management) Table 102-0540 with the following ICD-10-CA coded external causes of death: X85 to Y09 (assault) and Y87.1 (sequelae of assault). Annual population counts from 2000 to 2009 were obtained from Statistics Canada's CANSIM Table 051-0001. Both death and population counts were organized into five-year age groups. A random effects negative binomial regression analysis was conducted to estimate age group-specific rates, rate ratios, and trends in homicide mortality. There were 9,878 homicide deaths in Canada during the study period. The increase in the overall homicide mortality rate (HMR) of 0.3% per year was not statistically significant (95% CI: -1.1% to +1.8%). Canadians aged 15-19 years and 20-24 years had the highest HMRs during the study period, and experienced statistically significant annual increases in their HMRs of 3% and 4% respectively (p < 0.05). A general, though not statistically significant, decrease in the HMR was observed for all age groups 50+ years. A fixed effects negative binomial regression model showed that the HMR for males was higher than for females over the study period [RR(female/male) = 0.473 (95% CI: 0.361, 0.621)], but no significant difference in sex-specific trends in the HMR was found. An increasing risk of homicide mortality was identified among Canadian youth, ages 15-24, over the 10-year study period. Research that seeks to understand the reasons for the increased homicide risk facing Canada's youth, and public policy responses to reduce this risk, are warranted.

  1. Predicting length of stay from an electronic patient record system: a primary total knee replacement example.

    PubMed

    Carter, Evelene M; Potts, Henry W W

    2014-04-04

    To investigate whether factors can be identified that significantly affect hospital length of stay from those available in an electronic patient record system, using primary total knee replacements as an example. To investigate whether a model can be produced to predict the length of stay based on these factors to help resource planning and patient expectations on their length of stay. Data were extracted from the electronic patient record system for discharges from primary total knee operations from January 2007 to December 2011 (n=2,130) at one UK hospital and analysed for their effect on length of stay using Mann-Whitney and Kruskal-Wallis tests for discrete data and Spearman's correlation coefficient for continuous data. Models for predicting length of stay for primary total knee replacements were tested using the Poisson regression and the negative binomial modelling techniques. Factors found to have a significant effect on length of stay were age, gender, consultant, discharge destination, deprivation and ethnicity. Applying a negative binomial model to these variables was successful. The model predicted the length of stay of those patients who stayed 4-6 days (~50% of admissions) with 75% accuracy within 2 days (model data). Overall, the model predicted the total days stayed over 5 years to be only 88 days more than actual, a 6.9% uplift (test data). Valuable information can be found about length of stay from the analysis of variables easily extracted from an electronic patient record system. Models can be successfully created to help improve resource planning and from which a simple decision support system can be produced to help patient expectation on their length of stay.

  2. Dental plaque, preventive care, and tooth brushing associated with dental caries in primary teeth in schoolchildren ages 6-9 years of Leon, Nicaragua.

    PubMed

    Herrera, Miriam del Socorro; Medina-Solís, Carlo Eduardo; Minaya-Sánchez, Mirna; Pontigo-Loyola, América Patricia; Villalobos-Rodelo, Juan José; Islas-Granillo, Horacio; de la Rosa-Santillana, Rubén; Maupomé, Gerardo

    2013-11-19

    Our study aimed to evaluate the effect of various risk indicators for dental caries on primary teeth of Nicaraguan children (from Leon, Nicaragua) ages 6 to 9, using the negative binomial regression model. A cross-sectional study was carried out to collect clinical, demographic, socioeconomic, and behavioral data from 794 schoolchildren ages 6 to 9 years, randomly selected from 25 schools in the city of León, Nicaragua. Clinical examinations for dental caries (dmft index) were performed by 2 trained and standardized examiners. Socio-demographic, socioeconomic, and behavioral data were self-reported using questionnaires. Multivariate negative binomial regression (NBR) analysis was used. Mean age was 7.49 ± 1.12 years. Boys accounted for 50.1% of the sample. Mean dmft was 3.54 ± 3.13 and caries prevalence (dmft >0) was 77.6%. In the NBR multivariate model (p<0.05), for each year of age, the expected mean dmft decreased by 7.5%. Brushing teeth at least once a day and having received preventive dental care in the last year before data collection were associated with declines in the expected mean dmft by 19.5% and 69.6%, respectively. Presence of dental plaque increased the expected mean dmft by 395.5%. The proportion of students with caries in this sample was high. We found associations between dental caries in the primary dentition and dental plaque, brushing teeth at least once a day, and having received preventive dental care. To improve oral health, school programs and/or age-appropriate interventions need to be developed based on the specific profile of caries experience and the associated risk indicators.

  3. Gambling frequency and symptoms of attention-deficit hyperactivity disorder in relation to problem gambling among Swedish adolescents: a population-based study.

    PubMed

    Hellström, Charlotta; Wagner, Philippe; Nilsson, Kent W; Leppert, Jerzy; Åslund, Cecilia

    2017-06-01

    To investigate the associations between gambling frequency, attention-deficit hyperactivity disorder (ADHD) symptoms, and problem gambling among adolescent boys and girls. One hypothesis was that adolescents with increased ADHD symptoms have a higher frequency of gambling compared to adolescents with fewer ADHD symptoms. A population-based sample of adolescents (aged 15-18 years) completed a questionnaire on demographics, gambling habits, ADHD symptoms, and problematic gambling; 1412 adolescents (from 4440 sampled) with gambling experience were included in the final sample. A zero-inflated negative binomial regression analysis revealed that increased ADHD symptoms, higher gambling frequency, and higher age were associated with lower odds for being non-susceptible to gambling problems. Moreover, gambling frequency interacted with ADHD symptoms in predicting probability of being non-susceptible to gambling problems. However, when analysing those already susceptible to problem gambling, ADHD symptoms did not modify the effect of gambling frequency on the expected magnitude of gambling problems. In susceptible individuals, problem gambling increased with both increased ADHD symptoms and increased gambling frequency, but the level of problems due to gambling frequency did not change depending on the ADHD symptom level. There was an interaction effect between sex and gambling frequency in relation to gambling problems. Adolescents with ADHD symptoms seem to be more sensitive to gambling, in terms of being susceptible to developing gambling problems. However, once susceptible, adolescents with ADHD symptoms are affected by gambling frequency similarly to other susceptible participants.

  4. Fitting Cure Rate Model to Breast Cancer Data of Cancer Research Center.

    PubMed

    Baghestani, Ahmad Reza; Zayeri, Farid; Akbari, Mohammad Esmaeil; Shojaee, Leyla; Khadembashi, Naghmeh; Shahmirzalou, Parviz

    2015-01-01

    The Cox proportional hazards (PH) model is one of the most widely used statistical models for studying patient survival, but in the case of patients with long-term survival it may not be the most appropriate. In such cases, a cure rate model seems more suitable. The purpose of this study was to determine clinical factors associated with the cure rate of patients with breast cancer. In order to find factors affecting the cure rate (response), a non-mixed cure rate model with a negative binomial distribution for the latent variable was used. Variables selected were cancer recurrence, HER2 status, estrogen receptor (ER) and progesterone receptor (PR) status, size of tumor, grade of cancer, stage of cancer, type of surgery, age at the time of diagnosis and number of removed positive lymph nodes. All analyses were performed using PROC MCMC in the SAS 9.2 program. The mean (SD) age of patients was 48.9 (11.1) months. For these patients, 1, 5 and 10-year survival rates were 95, 79 and 50 percent respectively. All of the mentioned variables were associated with the cure fraction. The Kaplan-Meier curve supported the suitability of a cure model. Unlike the other variables, ER and PR positivity increased the probability of cure. In the present study, the Weibull distribution was used for analysing survival times. Assessing model fit with other distributions, such as the log-normal and log-logistic, and with other distributions for the latent variable is recommended.

  5. Estimating relative risks in multicenter studies with a small number of centers - which methods to use? A simulation study.

    PubMed

    Pedroza, Claudia; Truong, Van Thi Thanh

    2017-11-02

    Analyses of multicenter studies often need to account for center clustering to ensure valid inference. For binary outcomes, it is particularly challenging to properly adjust for center when the number of centers or total sample size is small, or when there are few events per center. Our objective was to evaluate the performance of generalized estimating equation (GEE) log-binomial and Poisson models, generalized linear mixed models (GLMMs) assuming binomial and Poisson distributions, and a Bayesian binomial GLMM to account for center effect in these scenarios. We conducted a simulation study with few centers (≤30) and 50 or fewer subjects per center, using both a randomized controlled trial and an observational study design to estimate relative risk. We compared the GEE and GLMM models with a log-binomial model without adjustment for clustering in terms of bias, root mean square error (RMSE), and coverage. For the Bayesian GLMM, we used informative neutral priors that are skeptical of large treatment effects that are almost never observed in studies of medical interventions. All frequentist methods exhibited little bias, and the RMSE was very similar across the models. The binomial GLMM had poor convergence rates, ranging from 27% to 85%, but performed well otherwise. The results show that both GEE models need to use small sample corrections for robust SEs to achieve proper coverage of 95% CIs. The Bayesian GLMM had similar convergence rates but resulted in slightly more biased estimates for the smallest sample sizes. However, it had the smallest RMSE and good coverage across all scenarios. These results were very similar for both study designs. For the analyses of multicenter studies with a binary outcome and few centers, we recommend adjustment for center with either a GEE log-binomial or Poisson model with appropriate small sample corrections or a Bayesian binomial GLMM with informative priors.
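
    A minimal sketch of one of the approaches compared above, a GEE Poisson ("modified Poisson") model with exchangeable correlation for estimating a relative risk while clustering by center; the simulated data, center effects, and true risk ratio are assumptions, and no small-sample correction is applied here.

```python
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(3)
n_centers, per_center = 20, 30
center = np.repeat(np.arange(n_centers), per_center)
treat = rng.integers(0, 2, size=n_centers * per_center)
center_effect = rng.normal(0.0, 0.2, size=n_centers)[center]
risk = np.exp(-1.5 + np.log(0.8) * treat + center_effect)   # true RR = 0.8
y = rng.binomial(1, risk)

X = sm.add_constant(treat)
gee = sm.GEE(y, X, groups=center,
             family=sm.families.Poisson(),
             cov_struct=sm.cov_struct.Exchangeable())
res = gee.fit()
print("estimated relative risk:", np.exp(res.params[1]))
```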

  6. Probability of a false-negative HIV antibody test result during the window period: a tool for pre- and post-test counselling.

    PubMed

    Taylor, Darlene; Durigon, Monica; Davis, Heather; Archibald, Chris; Konrad, Bernhard; Coombs, Daniel; Gilbert, Mark; Cook, Darrel; Krajden, Mel; Wong, Tom; Ogilvie, Gina

    2015-03-01

    Failure to understand the risk of false-negative HIV test results during the window period results in anxiety. Patients typically want accurate test results as soon as possible while clinicians prefer to wait until the probability of a false-negative is virtually nil. This review summarizes the median window periods for third-generation antibody and fourth-generation HIV tests and provides the probability of a false-negative result for various days post-exposure. Data were extracted from published seroconversion panels. A 10-day eclipse period was used to estimate days from infection to first detection of HIV RNA. Median (interquartile range) days to seroconversion were calculated and probabilities of a false-negative result at various time periods post-exposure are reported. The median (interquartile range) window period for third-generation tests was 22 days (19-25) and 18 days (16-24) for fourth-generation tests. The probability of a false-negative result is 0.01 at 80 days post-exposure for third-generation tests and at 42 days for fourth-generation tests. The table of probabilities of false-negative HIV test results may be useful during pre- and post-test HIV counselling to inform co-decision making regarding the ideal time to test for HIV. © The Author(s) 2014.

  7. The magnetisation distribution of the Ising model - a new approach

    NASA Astrophysics Data System (ADS)

    Hakan Lundow, Per; Rosengren, Anders

    2010-03-01

    A completely new approach to the Ising model in 1 to 5 dimensions is developed. We employ a generalisation of the binomial coefficients to describe the magnetisation distributions of the Ising model. For the complete graph this distribution is exact. For simple lattices of dimensions d=1 and d=5 the magnetisation distributions are remarkably well-fitted by the generalized binomial distributions. For d=4 we are only slightly less successful, while for d=2,3 we see some deviations (with exceptions!) between the generalized binomial and the Ising distribution. The results speak in favour of the correctness of the generalized binomial distributions as regards their general behaviour in comparison to the Ising model. A theoretical analysis of the distribution's moments also lends support to their being correct asymptotically, including the logarithmic corrections in d=4. The full extent to which they correctly model the Ising distribution, and for which graph families, is not settled though.

  8. Phase transition and information cascade in a voting model

    NASA Astrophysics Data System (ADS)

    Hisakado, M.; Mori, S.

    2010-08-01

    In this paper, we introduce a voting model that is similar to a Keynesian beauty contest and analyse it from a mathematical point of view. There are two types of voters—copycat and independent—and two candidates. Our voting model is a binomial distribution (independent voters) doped in a beta binomial distribution (copycat voters). We find that the phase transition in this system is at the upper limit of t, where t is the time (or the number of the votes). Our model contains three phases. If copycats constitute a majority or even half of the total voters, the voting rate converges more slowly than it would in a binomial distribution. If independents constitute the majority of voters, the voting rate converges at the same rate as it would in a binomial distribution. We also study why it is difficult to estimate the conclusion of a Keynesian beauty contest when there is an information cascade.
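
    A small simulation sketch of the voting model as described: independent voters choose candidate A with a fixed probability (the binomial component), while copycats choose A with probability equal to the current share of A votes (a Polya-urn scheme whose marginal is beta-binomial). The parameter values are assumptions for illustration, not those of the paper.

      # Simulation sketch under assumed parameters; not the authors' implementation.
      import numpy as np

      rng = np.random.default_rng(0)

      def simulate_votes(t=10_000, frac_copycat=0.5, p_independent=0.5):
          votes_a, votes_total = 1, 2          # symmetric prior: one prior vote per candidate
          share_history = []
          for _ in range(t):
              if rng.random() < frac_copycat:                 # copycat voter follows the current share
                  vote_a = rng.random() < votes_a / votes_total
              else:                                           # independent voter
                  vote_a = rng.random() < p_independent
              votes_a += vote_a
              votes_total += 1
              share_history.append(votes_a / votes_total)
          return np.array(share_history)

      print(simulate_votes()[-1])   # final vote share of candidate A in one run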

  9. Abstract knowledge versus direct experience in processing of binomial expressions

    PubMed Central

    Morgan, Emily; Levy, Roger

    2016-01-01

    We ask whether word order preferences for binomial expressions of the form A and B (e.g. bread and butter) are driven by abstract linguistic knowledge of ordering constraints referencing the semantic, phonological, and lexical properties of the constituent words, or by prior direct experience with the specific items in question. Using forced-choice and self-paced reading tasks, we demonstrate that online processing of never-before-seen binomials is influenced by abstract knowledge of ordering constraints, which we estimate with a probabilistic model. In contrast, online processing of highly frequent binomials is primarily driven by direct experience, which we estimate from corpus frequency counts. We propose a trade-off wherein processing of novel expressions relies upon abstract knowledge, while reliance upon direct experience increases with increased exposure to an expression. Our findings support theories of language processing in which both compositional generation and direct, holistic reuse of multi-word expressions play crucial roles. PMID:27776281

  10. Funnel plot control limits to identify poorly performing healthcare providers when there is uncertainty in the value of the benchmark.

    PubMed

    Manktelow, Bradley N; Seaton, Sarah E; Evans, T Alun

    2016-12-01

    There is an increasing use of statistical methods, such as funnel plots, to identify poorly performing healthcare providers. Funnel plots comprise the construction of control limits around a benchmark and providers with outcomes falling outside the limits are investigated as potential outliers. The benchmark is usually estimated from observed data but uncertainty in this estimate is usually ignored when constructing control limits. In this paper, the use of funnel plots in the presence of uncertainty in the value of the benchmark is reviewed for outcomes from a Binomial distribution. Two methods to derive the control limits are shown: (i) prediction intervals; (ii) tolerance intervals. Tolerance intervals formally include the uncertainty in the value of the benchmark while prediction intervals do not. The probability properties of 95% control limits derived using each method were investigated through hypothesised scenarios. Neither prediction intervals nor tolerance intervals produce funnel plot control limits that satisfy the nominal probability characteristics when there is uncertainty in the value of the benchmark. This is not necessarily to say that funnel plots have no role to play in healthcare, but that without the development of intervals satisfying the nominal probability characteristics they must be interpreted with care. © The Author(s) 2014.
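
    For concreteness, the sketch below computes simple binomial funnel-plot control limits around a fixed benchmark proportion, the setting in which uncertainty in the benchmark is ignored; the benchmark value and denominators are assumptions for illustration.

      # Sketch with assumed values, not the paper's code.
      import numpy as np
      from scipy.stats import binom

      p0 = 0.10                                  # assumed benchmark event proportion
      n = np.arange(20, 501)                     # provider denominators
      lower = binom.ppf(0.025, n, p0) / n        # approximate 95% lower control limit
      upper = binom.ppf(0.975, n, p0) / n        # approximate 95% upper control limit
      # Providers whose observed proportion falls outside (lower, upper) at their n
      # would be flagged for investigation.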

  11. Recall of anti-tobacco advertisements and effects on quitting behavior: results from the California smokers cohort.

    PubMed

    Leas, Eric C; Myers, Mark G; Strong, David R; Hofstetter, C Richard; Al-Delaimy, Wael K

    2015-02-01

    We assessed whether an anti-tobacco television advertisement called "Stages," which depicted a woman giving a brief emotional narrative of her experiences with tobacco use, would be recalled more often and have a greater effect on smoking cessation than 3 other advertisements with different intended themes. Our data were derived from a sample of 2596 California adult smokers. We used multivariable log-binomial and modified Poisson regression models to calculate respondents' probability of quitting as a result of advertisement recall. More respondents recalled the "Stages" ad (58.5%) than the 3 other ads (23.1%, 23.4%, and 25.6%; P<.001). Respondents who recalled "Stages" at baseline had a higher probability than those who did not recall the ad of making a quit attempt between baseline and follow-up (adjusted risk ratio [RR]=1.18; 95% confidence interval [CI]=1.03, 1.34) and a higher probability of being in a period of smoking abstinence for at least a month at follow-up (adjusted RR=1.55; 95% CI=1.02, 2.37). Anti-tobacco television advertisements that depict visceral and personal messages may be recalled by a larger percentage of smokers and may have a greater impact on smoking cessation than other types of advertisements.

  12. A probability metric for identifying high-performing facilities: an application for pay-for-performance programs.

    PubMed

    Shwartz, Michael; Peköz, Erol A; Burgess, James F; Christiansen, Cindy L; Rosen, Amy K; Berlowitz, Dan

    2014-12-01

    Two approaches are commonly used for identifying high-performing facilities on a performance measure: one, that the facility is in a top quantile (eg, quintile or quartile); and two, that a confidence interval is below (or above) the average of the measure for all facilities. This type of yes/no designation often does not do well in distinguishing high-performing from average-performing facilities. To illustrate an alternative continuous-valued metric for profiling facilities--the probability a facility is in a top quantile--and show the implications of using this metric for profiling and pay-for-performance. We created a composite measure of quality from fiscal year 2007 data based on 28 quality indicators from 112 Veterans Health Administration nursing homes. A Bayesian hierarchical multivariate normal-binomial model was used to estimate shrunken rates of the 28 quality indicators, which were combined into a composite measure using opportunity-based weights. Rates were estimated using Markov Chain Monte Carlo methods as implemented in WinBUGS. The probability metric was calculated from the simulation replications. Our probability metric allowed better discrimination of high performers than the point or interval estimate of the composite score. In a pay-for-performance program, a smaller top quantile (eg, a quintile) resulted in more resources being allocated to the highest performers, whereas a larger top quantile (eg, being above the median) distinguished less among high performers and allocated more resources to average performers. The probability metric has potential but needs to be evaluated by stakeholders in different types of delivery systems.
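
    A sketch of how the probability metric can be computed from generic posterior simulation output: for each MCMC replication, rank the facilities and record whether each falls in the top quintile, then average over replications. The draws below are random stand-ins, not the study's posterior.

      # Sketch using placeholder posterior draws; not the authors' WinBUGS output.
      import numpy as np

      rng = np.random.default_rng(1)
      n_draws, n_facilities = 4000, 112
      draws = rng.normal(size=(n_draws, n_facilities))        # placeholder posterior draws of composite scores

      cutoff = int(np.ceil(0.2 * n_facilities))               # size of the top quintile
      ranks = (-draws).argsort(axis=1).argsort(axis=1)        # rank 0 = best facility within a draw
      p_top_quintile = (ranks < cutoff).mean(axis=0)          # probability metric for each facility
      print(p_top_quintile[:5])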

  13. Automated segmentation of linear time-frequency representations of marine-mammal sounds.

    PubMed

    Dadouchi, Florian; Gervaise, Cedric; Ioana, Cornel; Huillery, Julien; Mars, Jérôme I

    2013-09-01

    Many marine mammals produce highly nonlinear frequency modulations. Determining the time-frequency support of these sounds offers various applications, which include recognition, localization, and density estimation. This study introduces a low parameterized automated spectrogram segmentation method that is based on a theoretical probabilistic framework. In the first step, the background noise in the spectrogram is fitted with a Chi-squared distribution and thresholded using a Neyman-Pearson approach. In the second step, the number of false detections in time-frequency regions is modeled as a binomial distribution, and then through a Neyman-Pearson strategy, the time-frequency bins are gathered into regions of interest. The proposed method is validated on real data of large sequences of whistles from common dolphins, collected in the Bay of Biscay (France). The proposed method is also compared with two alternative approaches: the first is smoothing and thresholding of the spectrogram; the second is thresholding of the spectrogram followed by the use of morphological operators to gather the time-frequency bins and to remove false positives. This method is shown to increase the probability of detection for the same probability of false alarms.
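
    A conceptual sketch of the two stages described above, with assumed (not the paper's) parameter values: a chi-squared Neyman-Pearson threshold for individual spectrogram bins, followed by a binomial test on the number of detections within a time-frequency region.

      # Conceptual sketch only; thresholds and noise level are assumptions.
      import numpy as np
      from scipy.stats import chi2, binom

      pfa_bin = 1e-3                      # per-bin false-alarm probability (assumed)
      dof = 2                             # spectrogram power under noise ~ chi-squared with 2 dof
      noise_power = 1.0                   # assumed; estimated from the data in practice
      bin_threshold = noise_power / dof * chi2.ppf(1 - pfa_bin, dof)   # Neyman-Pearson bin threshold

      def region_detected(n_bins_in_region, n_detections, pfa_region=1e-3):
          # Under noise alone, the detection count in a region is Binomial(N, pfa_bin);
          # flag the region when the observed count is improbable under that null.
          p_value = binom.sf(n_detections - 1, n_bins_in_region, pfa_bin)
          return p_value < pfa_region

      print(bin_threshold, region_detected(200, 5))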

  14. Factors that influence the interests of farmer in shallots farming at Cinta Dame village of Simanindo sub district of Samosir district

    NASA Astrophysics Data System (ADS)

    Siregar, A. F.; Supriana, T.

    2018-02-01

    Shallots contain many ingredients useful for human life and are widely used by Indonesians to flavor dishes. The need for shallots increases as the population grows, and the increased demand has pushed prices up because production in North Sumatera is low. The objective of this study is to analyze farmers' interest in shallot farming, the factors that affect this interest, and the response of this interest to each factor. The sample consisted of 85 shallot farmers. A binomial logit model was used for data analysis. The results showed that the factors influencing farmers' interest in shallot farming are land area, experience, income, support, and trauma. The probability of a farmer being interested in shallot farming increases by 22% if the land area increases by one acre. The probability is 0.3% higher with support than without it, and 0.014% higher without trauma than with it.

  15. Referent group proximity, social norms, and context: alcohol use in a low-use environment.

    PubMed

    Cox, Jared M; Bates, Scott C

    2011-01-01

    The purpose of this study was to investigate the relationship between perceived normative use of alcohol and reported consumption in an environment where relatively little alcohol use occurs. A total of 585 undergraduate students completed an online survey on alcohol use in March 2006. Participants reported personal alcohol use and perceptions of use by "friends," "the average student," and "the average student who drinks." Due to the large number of students reporting zero alcohol use, zero-inflated negative binomial regression was used to analyze the data. Results showed that perceptions of use and beliefs about the acceptability of use by proximal groups were strongly and positively correlated with personal alcohol use. Perceptions of distal groups were either not correlated or were correlated negatively with personal use. These findings suggest that the use of distal referent groups for a social norms campaign in a low-use environment may have paradoxical effects.
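
    A hedged sketch of a zero-inflated negative binomial regression using statsmodels; the outcome and predictor here are simulated placeholders rather than the survey variables.

      # Sketch under simulated placeholder data; not the study's analysis code.
      import numpy as np
      import statsmodels.api as sm
      from statsmodels.discrete.count_model import ZeroInflatedNegativeBinomialP

      rng = np.random.default_rng(2)
      n = 585
      norm_perception = rng.normal(size=n)                     # hypothetical proximal-norm score
      X = sm.add_constant(norm_perception)
      mu = np.exp(1.0 + 0.5 * norm_perception)                 # count-part mean
      counts = rng.poisson(mu * rng.gamma(1.0, 1.0, size=n))   # overdispersed (negative binomial-like) counts
      y = counts * rng.binomial(1, 0.6, size=n)                # extra zeros from the inflation part

      model = ZeroInflatedNegativeBinomialP(y, X, exog_infl=X, inflation="logit")
      result = model.fit(method="bfgs", maxiter=500, disp=False)
      print(result.summary())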

  16. Narrow log-periodic modulations in non-Markovian random walks

    NASA Astrophysics Data System (ADS)

    Diniz, R. M. B.; Cressoni, J. C.; da Silva, M. A. A.; Mariz, A. M.; de Araújo, J. M.

    2017-12-01

    What are the necessary ingredients for log-periodicity to appear in the dynamics of a random walk model? Can they be subtle enough to be overlooked? Previous studies suggest that long-range damaged memory and negative feedback together are necessary conditions for the emergence of log-periodic oscillations. The role of negative feedback would then be crucial, forcing the system to change direction. In this paper we show that small-amplitude log-periodic oscillations can emerge when the system is driven by positive feedback. Due to their very small amplitude, these oscillations can easily be mistaken for numerical finite-size effects. The models we use consist of discrete-time random walks with strong memory correlations where the decision process is taken from memory profiles based either on a binomial distribution or on a delta distribution. Anomalous superdiffusive behavior and log-periodic modulations are shown to arise in the large time limit for convenient choices of the models' parameters.

  17. Identity Conflict and Sexual Risk for Black and Latino YMSM.

    PubMed

    Corsbie-Massay, Charisse L'Pree; Miller, Lynn C; Christensen, John L; Appleby, Paul R; Godoy, Carlos; Read, Stephen J

    2017-06-01

    Young (aged 18-30) Black and Latino men who have sex with men are at a higher risk of contracting HIV than their White counterparts. In order to better understand the unique nature of sexual risk-taking, we examined the extent to which ethnic group, ethnic identity, and sexual pride predicted condomless anal sex with casual partners among 161 young men who have sex with men (YMSM) who identify as Black or Latino. Negative binomial regressions were conducted using a cross-sectional design. Sexual pride was a negative predictor of condomless anal sex across all participants, but this effect was moderated by ethnic exploration and ethnic group; the relationship between sexual pride and condomless anal sex was strengthened by greater ethnic exploration among Latino YMSM, and weakened by greater ethnic exploration among Black YMSM. Implications for intersectional identity, identity conflict, and HIV prevention among young gay men of color are discussed.

  18. Using linked household-level data sets to explain consumer response to bovine spongiform encephalopathy (BSE) in Canada.

    PubMed

    Wang, Xin; Maynard, Leigh J; Butler, J S; Goddard, Ellen W

    2011-01-01

    Household-level Canadian meat purchases from 2002 to 2008 and a Food Opinions Survey conducted in 2008 were used to explore consumer responses to bovine spongiform encephalopathy (BSE) at the national level in Canada. Consumption in terms of the number of unit purchases was analyzed with a random-effects negative binomial model. In this study, household heterogeneity in meat purchases was partially explained using data from a self-reported food opinions survey. Of special interest was the hypothesis that consumers responded consistently to BSE in a one-time survey and in actual meat purchase behavior spanning years. Regional differences appeared, with consumers in eastern Canada reacting most negatively to BSE. Consumers responded more to the perception that food decision makers are honest about food safety than to the perception that they are knowledgeable, in maintaining beef purchases during BSE events.

  19. Factors related to the number of fast food meals obtained by college meal plan students.

    PubMed

    Dingman, Deirdre A; Schulz, Mark R; Wyrick, David L; Bibeau, Daniel L; Gupta, Sat N

    2014-01-01

    This study tested whether days on campus, financial access through a meal plan, and health consciousness were associated with number of meals that college students obtained from fast food restaurants. In April 2013, all students currently enrolled in a meal plan were invited to participate in an online survey (N = 1,246). Students were asked to report the total number of meals eaten in the past week and where they obtained them. Negative binomial regression was used, and it was found that the number of meals obtained from fast food restaurants was positively associated with financial access and negatively associated with health consciousness. An association between days on campus and the number of meals obtained from fast food restaurants was not found. Increasing levels of health consciousness and reducing access to fast food restaurants through flex plans may reduce college students' consumption of fast food.

  20. LS-CAP: an algorithm for identifying cytogenetic aberrations in hepatocellular carcinoma using microarray data.

    PubMed

    He, Xianmin; Wei, Qing; Sun, Meiqian; Fu, Xuping; Fan, Sichang; Li, Yao

    2006-05-01

    Biological techniques such as array comparative genomic hybridization (CGH), fluorescent in situ hybridization (FISH) and Affymetrix single nucleotide polymorphism (SNP) arrays have been used to detect cytogenetic aberrations. However, on a genomic scale, these techniques are labor intensive and time consuming. Comparative genomic microarray analysis (CGMA) has been used to identify cytogenetic changes in hepatocellular carcinoma (HCC) using gene expression microarray data. However, the CGMA algorithm cannot give precise localization of aberrations, fails to identify small cytogenetic changes, and exhibits false negatives and positives. Locally un-weighted smoothing cytogenetic aberrations prediction (LS-CAP), based on local smoothing and the binomial distribution, can be expected to address these problems. The LS-CAP algorithm was built and used on HCC microarray profiles. Eighteen cytogenetic abnormalities were identified, among which 5 had been reported previously and 12 were proven by CGH studies. LS-CAP effectively reduced the false negatives and positives, and precisely located small fragments with cytogenetic aberrations.

  1. [Nursing: the meaning of this profession to nurses. A first approach].

    PubMed

    Luchesi, Luciana Barizon; Santos, Claudia Benedita dos

    2005-01-01

    In an attempt to understand, tell and, why not, participate a little in the history of Nursing, we proposed to study the prejudices and negative stereotypes that have permeated this profession over time. This is a before-after experimental type of study in a population of adolescents regularly enrolled in the eleventh grade of a Brazilian public school. The intervention took the form of a lecture about the profession and a questionnaire with closed questions which was applied before and after the lecture. Conclusions were based on the results of binomial and McNemar's non-parametric tests for the significance of changes. Although no statistically significant presence of prejudice and negative stereotypes was found, the results of the intervention were in line with expectations, since the changes (or tendency towards changes) took place exactly in those subgroups that showed a greater frequency of stereotypes.

  2. A new zero-inflated negative binomial methodology for latent category identification.

    PubMed

    Blanchard, Simon J; DeSarbo, Wayne S

    2013-04-01

    We introduce a new statistical procedure for the identification of unobserved categories that vary between individuals and in which objects may span multiple categories. This procedure can be used to analyze data from a proposed sorting task in which individuals may simultaneously assign objects to multiple piles. The results of a synthetic example and a consumer psychology study involving categories of restaurant brands illustrate how the application of the proposed methodology to the new sorting task can account for a variety of categorization phenomena including multiple category memberships and for heterogeneity through individual differences in the saliency of latent category structures.

  3. Probing the statistics of primordial fluctuations and their evolution

    NASA Technical Reports Server (NTRS)

    Gaztanaga, Enrique; Yokoyama, Jun'ichi

    1993-01-01

    The statistical distribution of fluctuations on various scales is analyzed in terms of the counts in cells of smoothed density fields, using volume-limited samples of galaxy redshift catalogs. It is shown that the distribution on large scales, with volume average of the two-point correlation function of the smoothed field less than about 0.05, is consistent with Gaussian. Statistics are shown to agree remarkably well with the negative binomial distribution, which has hierarchical correlations and a Gaussian behavior at large scales. If these observed properties correspond to the matter distribution, they suggest that our universe started with Gaussian fluctuations and evolved keeping hierarchical form.

  4. Negative Binomial Fits to Multiplicity Distributions from Central Collisions of (16)O+Cu at 14.6A GeV/c and Intermittency

    NASA Technical Reports Server (NTRS)

    Tannenbaum, M. J.

    1994-01-01

    The concept of "Intermittency" was introduced by Bialas and Peschanski to try to explain the "large" fluctuations of multiplicity in restricted intervals of rapidity or pseudorapidity. A formalism was proposed to to study non-statistical (more precisely, non-Poisson) fluctuations as a function of the size of rapidity interval, and it was further suggested that the "spikes" in the rapidity fluctuations were evidence of fractal or intermittent behavior, in analogy to turbulence in fluid dynamics which is characterized by self-similar fluctuations at all scales-the absence of well defined scale of length.

  5. Definite Integrals, Some Involving Residue Theory Evaluated by Maple Code

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bowman, Kimiko o

    2010-01-01

    The calculus of residue is applied to evaluate certain integrals in the range (-∞, ∞) using the Maple symbolic code. These integrals are of the form ∫_{-∞}^{∞} cos(x)/[(x^2 + a^2)(x^2 + b^2)(x^2 + c^2)] dx and similar extensions. The Maple code is also applied to expressions in maximum likelihood estimator moments when sampling from the negative binomial distribution. In general the Maple code approach to the integrals gives correct answers to specified decimal places, but the symbolic result may be extremely long and complex.

  6. On extinction time of a generalized endemic chain-binomial model.

    PubMed

    Aydogmus, Ozgur

    2016-09-01

    We considered a chain-binomial epidemic model not conferring immunity after infection. Mean field dynamics of the model has been analyzed and conditions for the existence of a stable endemic equilibrium are determined. The behavior of the chain-binomial process is probabilistically linked to the mean field equation. As a result of this link, we were able to show that the mean extinction time of the epidemic increases at least exponentially as the population size grows. We also present simulation results for the process to validate our analytical findings. Copyright © 2016 Elsevier Inc. All rights reserved.
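
    A simulation sketch of a chain-binomial (Reed-Frost-type) process in which infected individuals return to the susceptible pool each generation, so no immunity is conferred; extinction time is the first generation with zero infections. Parameter values are assumptions chosen only to illustrate the mechanics, not the paper's model settings.

      # Illustrative simulation under assumed parameters; not the paper's model code.
      import numpy as np

      rng = np.random.default_rng(3)

      def extinction_time(pop_size=100, p_contact=0.02, i0=1, max_steps=100_000):
          infected = i0
          for t in range(1, max_steps + 1):
              susceptible = pop_size - infected
              p_inf = 1.0 - (1.0 - p_contact) ** infected    # per-susceptible infection probability
              infected = rng.binomial(susceptible, p_inf)    # next generation of infected
              if infected == 0:
                  return t
          return max_steps                                   # censored: no extinction observed

      times = [extinction_time() for _ in range(200)]
      print(np.mean(times))   # mean extinction time grows rapidly with population size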

  7. Solar San Diego: The Impact of Binomial Rate Structures on Real PV Systems; Preprint

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    VanGeet, O.; Brown, E.; Blair, T.

    2008-05-01

    There is confusion in the marketplace regarding the impact of solar photovoltaics (PV) on the user's actual electricity bill under California Net Energy Metering, particularly with binomial tariffs (those that include both demand and energy charges) and time-of-use (TOU) rate structures. The City of San Diego has extensive real-time electrical metering on most of its buildings and PV systems, with interval data for overall consumption and PV electrical production available for multiple years. This paper uses 2007 PV-system data from two city facilities to illustrate the impacts of binomial rate designs. The analysis will determine the energy and demand savings that the PV systems are achieving relative to the absence of systems. A financial analysis of PV-system performance under various rate structures is presented. The data revealed that actual demand and energy use benefits of binomial tariffs increase in summer months, when solar resources allow for maximized electricity production. In a binomial tariff system, varying on- and semi-peak times can result in approximately $1,100 change in demand charges per month over not having a PV system in place, an approximate 30% cost savings. The PV systems are also shown to have a 30%-50% reduction in facility energy charges in 2007.

  8. Topics in Bayesian Hierarchical Modeling and its Monte Carlo Computations

    NASA Astrophysics Data System (ADS)

    Tak, Hyung Suk

    The first chapter addresses a Beta-Binomial-Logit model that is a Beta-Binomial conjugate hierarchical model with covariate information incorporated via a logistic regression. Various researchers in the literature have unknowingly used improper posterior distributions or have given incorrect statements about posterior propriety because checking posterior propriety can be challenging due to the complicated functional form of a Beta-Binomial-Logit model. We derive data-dependent necessary and sufficient conditions for posterior propriety within a class of hyper-prior distributions that encompass those used in previous studies. Frequency coverage properties of several hyper-prior distributions are also investigated to see when and whether Bayesian interval estimates of random effects meet their nominal confidence levels. The second chapter deals with a time delay estimation problem in astrophysics. When the gravitational field of an intervening galaxy between a quasar and the Earth is strong enough to split light into two or more images, the time delay is defined as the difference between their travel times. The time delay can be used to constrain cosmological parameters and can be inferred from the time series of brightness data of each image. To estimate the time delay, we construct a Gaussian hierarchical model based on a state-space representation for irregularly observed time series generated by a latent continuous-time Ornstein-Uhlenbeck process. Our Bayesian approach jointly infers model parameters via a Gibbs sampler. We also introduce a profile likelihood of the time delay as an approximation of its marginal posterior distribution. The last chapter specifies a repelling-attracting Metropolis algorithm, a new Markov chain Monte Carlo method to explore multi-modal distributions in a simple and fast manner. This algorithm is essentially a Metropolis-Hastings algorithm with a proposal that consists of a downhill move in density that aims to make local modes repelling, followed by an uphill move in density that aims to make local modes attracting. The downhill move is achieved via a reciprocal Metropolis ratio so that the algorithm prefers downward movement. The uphill move does the opposite using the standard Metropolis ratio which prefers upward movement. This down-up movement in density increases the probability of a proposed move to a different mode.

  9. Cost-effective binomial sequential sampling of western bean cutworm, Striacosta albicosta (Lepidoptera: Noctuidae), egg masses in corn.

    PubMed

    Paula-Moraes, S; Burkness, E C; Hunt, T E; Wright, R J; Hein, G L; Hutchison, W D

    2011-12-01

    Striacosta albicosta (Smith) (Lepidoptera: Noctuidae), is a native pest of dry beans (Phaseolus vulgaris L.) and corn (Zea mays L.). As a result of larval feeding damage on corn ears, S. albicosta has a narrow treatment window; thus, early detection of the pest in the field is essential, and egg mass sampling has become a popular monitoring tool. Three action thresholds for field and sweet corn currently are used by crop consultants, including 4% of plants infested with egg masses on sweet corn in the silking-tasseling stage, 8% of plants infested with egg masses on field corn with approximately 95% tasseled, and 20% of plants infested with egg masses on field corn during mid-milk-stage corn. The current monitoring recommendation is to sample 20 plants at each of five locations per field (100 plants total). In an effort to develop a more cost-effective sampling plan for S. albicosta egg masses, several alternative binomial sampling plans were developed using Wald's sequential probability ratio test, and validated using Resampling for Validation of Sampling Plans (RVSP) software. The benefit-cost ratio also was calculated and used to determine the final selection of sampling plans. Based on final sampling plans selected for each action threshold, the average sample number required to reach a treat or no-treat decision ranged from 38 to 41 plants per field. This represents a significant savings in sampling cost over the current recommendation of 100 plants.
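
    A generic sketch of Wald's sequential probability ratio test for binomial (infested / not infested) plant counts, accumulating the log-likelihood ratio until a treat or no-treat boundary is crossed. The thresholds and error rates below are illustrative assumptions, not the validated values from the paper.

      # Generic SPRT sketch; p0, p1, alpha, beta are assumed for illustration.
      import math

      def sprt_decision(observations, p0=0.04, p1=0.12, alpha=0.1, beta=0.1):
          # Return 'treat', 'no-treat', or 'continue' after the given 0/1 observations.
          log_a = math.log((1 - beta) / alpha)        # upper stopping bound
          log_b = math.log(beta / (1 - alpha))        # lower stopping bound
          llr = 0.0
          for infested in observations:
              if infested:
                  llr += math.log(p1 / p0)
              else:
                  llr += math.log((1 - p1) / (1 - p0))
              if llr >= log_a:
                  return "treat"
              if llr <= log_b:
                  return "no-treat"
          return "continue"

      print(sprt_decision([0, 0, 1, 0, 0, 0, 1, 0, 0, 0]))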

  10. Comparison and Field Validation of Binomial Sampling Plans for Oligonychus perseae (Acari: Tetranychidae) on Hass Avocado in Southern California.

    PubMed

    Lara, Jesus R; Hoddle, Mark S

    2015-08-01

    Oligonychus perseae Tuttle, Baker, & Abatiello is a foliar pest of 'Hass' avocados [Persea americana Miller (Lauraceae)]. The recommended action threshold is 50-100 motile mites per leaf, but this count range and other ecological factors associated with O. perseae infestations limit the application of enumerative sampling plans in the field. Consequently, a comprehensive modeling approach was implemented to compare the practical application of various binomial sampling models for decision-making of O. perseae in California. An initial set of sequential binomial sampling models were developed using three mean-proportion modeling techniques (i.e., Taylor's power law, maximum likelihood, and an empirical model) in combination with two-leaf infestation tally thresholds of either one or two mites. Model performance was evaluated using a robust mite count database consisting of >20,000 Hass avocado leaves infested with varying densities of O. perseae and collected from multiple locations. Operating characteristic and average sample number results for sequential binomial models were used as the basis to develop and validate a standardized fixed-size binomial sampling model with guidelines on sample tree and leaf selection within blocks of avocado trees. This final validated model requires a leaf sampling cost of 30 leaves and takes into account the spatial dynamics of O. perseae to make reliable mite density classifications for a 50-mite action threshold. Recommendations for implementing this fixed-size binomial sampling plan to assess densities of O. perseae in commercial California avocado orchards are discussed. © The Authors 2015. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  11. Depressive Symptoms, Religious Coping, and Cigarette Smoking Among Post-secondary Vocational Students

    PubMed Central

    Horton, Karissa D.; Loukas, Alexandra

    2014-01-01

    Depressive symptoms are associated with increased levels of cigarette smoking, yet not every individual experiencing depressive symptoms smokes. This study examined whether religious coping moderated the impact of depressive symptoms on past 30-day cigarette use among a racially/ethnically diverse sample of 963 post-secondary vocational students (46.8% women; mean age = 25 years). Results from negative binomial regression analyses indicated that depressive symptoms increased the likelihood of cigarette smoking (quantity-frequency measure of cigarette use) for female students, whereas positive religious coping decreased the likelihood of smoking for female students. Consistent with religious coping theory and as expected, negative religious coping moderated the depressive symptoms-smoking relationship such that negative religious coping exacerbated the impact of depressive symptoms on cigarette smoking among females. Positive religious coping also moderated the depressive symptoms-cigarette smoking relationship for females. However, contrary to expectations, high levels of positive religious coping exacerbated the likelihood of cigarette smoking among females with high levels of depressive symptoms. Surprisingly, neither depressive symptoms nor positive or negative religious coping contributed to the likelihood of males’ smoking. Study limitations and suggestions for directions in future research are discussed. PMID:23276324

  12. Targeting the probability versus cost of feared outcomes in public speaking anxiety.

    PubMed

    Nelson, Elizabeth A; Deacon, Brett J; Lickel, James J; Sy, Jennifer T

    2010-04-01

    Cognitive-behavioral theory suggests that social phobia is maintained, in part, by overestimates of the probability and cost of negative social events. Indeed, empirically supported cognitive-behavioral treatments directly target these cognitive biases through the use of in vivo exposure or behavioral experiments. While cognitive-behavioral theories and treatment protocols emphasize the importance of targeting probability and cost biases in the reduction of social anxiety, few studies have examined specific techniques for reducing probability and cost bias, and thus the relative efficacy of exposure to the probability versus cost of negative social events is unknown. In the present study, 37 undergraduates with high public speaking anxiety were randomly assigned to a single-session intervention designed to reduce either the perceived probability or the perceived cost of negative outcomes associated with public speaking. Compared to participants in the probability treatment condition, those in the cost treatment condition demonstrated significantly greater improvement on measures of public speaking anxiety and cost estimates for negative social events. The superior efficacy of the cost treatment condition was mediated by greater treatment-related changes in social cost estimates. The clinical implications of these findings are discussed. Published by Elsevier Ltd.

  13. Replacement/Refurbishment of JSC/NASA POD Specimens

    NASA Technical Reports Server (NTRS)

    Castner, Willard L.

    2010-01-01

    The NASA Special NDE certification process requires demonstration of NDE capability by test per NASA-STD-5009. This test is performed with fatigue cracked specimens containing very small cracks. The certification test results are usually based on binomial statistics and must meet a 90/95 Probability of Detection (POD). The assumption is that fatigue cracks are tightly closed, difficult to detect, and inspectors and processes passing such a test are well qualified for inspecting NASA fracture critical hardware. The JSC NDE laboratory has what may be the largest inventory that exists of such fatigue cracked NDE demonstration specimens. These specimens were produced by the hundreds in the late 1980s and early 1990s. None have been produced since that time and the condition and usability of the specimens are questionable.
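
    A sketch of the standard binomial point-estimate calculation behind a 90/95 POD demonstration: find the smallest number of flawed specimens that must all be detected so that an inspection with a true POD below 0.90 would pass with probability less than 5%.

      # Sketch of the familiar binomial 90/95 calculation (commonly quoted as 29-of-29);
      # this illustrates the arithmetic, not the specifics of NASA-STD-5009.
      n = 1
      while 0.90 ** n > 0.05:      # P(detect all n flaws | true POD = 0.90)
          n += 1
      print(n)                     # 29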

  14. Using the Binomial Series to Prove the Arithmetic Mean-Geometric Mean Inequality

    ERIC Educational Resources Information Center

    Persky, Ronald L.

    2003-01-01

    In 1968, Leon Gerber compared (1 + x)^a to its kth partial sum as a binomial series. His result is stated and, as an application of this result, a proof of the arithmetic mean-geometric mean inequality is presented.

  15. Four Bootstrap Confidence Intervals for the Binomial-Error Model.

    ERIC Educational Resources Information Center

    Lin, Miao-Hsiang; Hsiung, Chao A.

    1992-01-01

    Four bootstrap methods are identified for constructing confidence intervals for the binomial-error model. The extent to which similar results are obtained and the theoretical foundation of each method and its relevance and ranges of modeling the true score uncertainty are discussed. (SLD)

  16. Possibility and Challenges of Conversion of Current Virus Species Names to Linnaean Binomials.

    PubMed

    Postler, Thomas S; Clawson, Anna N; Amarasinghe, Gaya K; Basler, Christopher F; Bavari, Sbina; Benko, Mária; Blasdell, Kim R; Briese, Thomas; Buchmeier, Michael J; Bukreyev, Alexander; Calisher, Charles H; Chandran, Kartik; Charrel, Rémi; Clegg, Christopher S; Collins, Peter L; Juan Carlos, De La Torre; Derisi, Joseph L; Dietzgen, Ralf G; Dolnik, Olga; Dürrwald, Ralf; Dye, John M; Easton, Andrew J; Emonet, Sébastian; Formenty, Pierre; Fouchier, Ron A M; Ghedin, Elodie; Gonzalez, Jean-Paul; Harrach, Balázs; Hewson, Roger; Horie, Masayuki; Jiang, Dàohóng; Kobinger, Gary; Kondo, Hideki; Kropinski, Andrew M; Krupovic, Mart; Kurath, Gael; Lamb, Robert A; Leroy, Eric M; Lukashevich, Igor S; Maisner, Andrea; Mushegian, Arcady R; Netesov, Sergey V; Nowotny, Norbert; Patterson, Jean L; Payne, Susan L; PaWeska, Janusz T; Peters, Clarence J; Radoshitzky, Sheli R; Rima, Bertus K; Romanowski, Victor; Rubbenstroth, Dennis; Sabanadzovic, Sead; Sanfaçon, Hélène; Salvato, Maria S; Schwemmle, Martin; Smither, Sophie J; Stenglein, Mark D; Stone, David M; Takada, Ayato; Tesh, Robert B; Tomonaga, Keizo; Tordo, Noël; Towner, Jonathan S; Vasilakis, Nikos; Volchkov, Viktor E; Wahl-Jensen, Victoria; Walker, Peter J; Wang, Lin-Fa; Varsani, Arvind; Whitfield, Anna E; Zerbini, F Murilo; Kuhn, Jens H

    2017-05-01

    Botanical, mycological, zoological, and prokaryotic species names follow the Linnaean format, consisting of an italicized Latinized binomen with a capitalized genus name and a lower case species epithet (e.g., Homo sapiens). Virus species names, however, do not follow a uniform format, and, even when binomial, are not Linnaean in style. In this thought exercise, we attempted to convert all currently official names of species included in the virus family Arenaviridae and the virus order Mononegavirales to Linnaean binomials, and to identify and address associated challenges and concerns. Surprisingly, this endeavor was not as complicated or time-consuming as even the authors of this article expected when conceiving the experiment. [Arenaviridae; binomials; ICTV; International Committee on Taxonomy of Viruses; Mononegavirales; virus nomenclature; virus taxonomy.]. Published by Oxford University Press on behalf of Society of Systematic Biologists 2016. This work is written by a US Government employee and is in the public domain in the US.

  17. The arcsine is asinine: the analysis of proportions in ecology.

    PubMed

    Warton, David I; Hui, Francis K C

    2011-01-01

    The arcsine square root transformation has long been standard procedure when analyzing proportional data in ecology, with applications in data sets containing binomial and non-binomial response variables. Here, we argue that the arcsine transform should not be used in either circumstance. For binomial data, logistic regression has greater interpretability and higher power than analyses of transformed data. However, it is important to check the data for additional unexplained variation, i.e., overdispersion, and to account for it via the inclusion of random effects in the model if found. For non-binomial data, the arcsine transform is undesirable on the grounds of interpretability, and because it can produce nonsensical predictions. The logit transformation is proposed as an alternative approach to address these issues. Examples are presented in both cases to illustrate these advantages, comparing various methods of analyzing proportions including untransformed, arcsine- and logit-transformed linear models and logistic regression (with or without random effects). Simulations demonstrate that logistic regression usually provides a gain in power over other methods.
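
    A small simulation sketch contrasting the two analyses discussed above on the same binomial data: an ordinary linear model on arcsine-square-root-transformed proportions versus a binomial GLM (logistic regression). The settings are arbitrary assumptions.

      # Illustrative comparison under simulated data; not the paper's simulation code.
      import numpy as np
      import statsmodels.api as sm

      rng = np.random.default_rng(4)
      n_sites, trials = 60, 20
      x = rng.normal(size=n_sites)
      p_true = 1 / (1 + np.exp(-(-1.0 + 0.8 * x)))
      successes = rng.binomial(trials, p_true)

      X = sm.add_constant(x)

      # Arcsine square root transform followed by a linear model
      y_arcsine = np.arcsin(np.sqrt(successes / trials))
      ols_fit = sm.OLS(y_arcsine, X).fit()

      # Logistic regression (binomial GLM) on the same data
      glm_fit = sm.GLM(np.column_stack([successes, trials - successes]), X,
                       family=sm.families.Binomial()).fit()

      print(ols_fit.params, glm_fit.params)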

  18. [The reentrant binomial model of nuclear anomalies growth in rhabdomyosarcoma RA-23 cell populations under increasing doze of rare ionizing radiation].

    PubMed

    Alekseeva, N P; Alekseev, A O; Vakhtin, Iu B; Kravtsov, V Iu; Kuzovatov, S N; Skorikova, T I

    2008-01-01

    Distributions of nuclear morphology anomalies in transplantable rhabdomyosarcoma RA-23 cell populations were investigated under ionizing radiation at doses from 0 to 45 Gy. Internuclear bridges, nuclear protrusions and dumbbell-shaped nuclei were counted as morphological anomalies. Empirical distributions of the number of anomalies per 100 nuclei were used. A reentrant binomial distribution was found to be an adequate model; this is the distribution of a sum of binomial random variables in which the number of summands is itself binomially distributed. The means of these random variables were named, accordingly, the internal and external average reentrant components, and their maximum likelihood estimates were obtained. The statistical properties of these estimates were investigated by simulation. Although the radiation dose correlates equally significantly with the mean number of nuclear anomalies in both settings, in cell populations examined two to three cell cycles after irradiation in vivo the dose correlates significantly with the internal average reentrant component, whereas in remote descendants of cell transplants irradiated in vitro it correlates with the external one.
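
    A simulation sketch of the reentrant binomial construction described above, in which a binomially distributed number of summands are themselves binomial random variables; the parameter values are illustrative assumptions.

      # Simulation sketch of the reentrant binomial construction; parameters are assumptions.
      import numpy as np

      rng = np.random.default_rng(5)

      def reentrant_binomial(n_outer, p_outer, n_inner, p_inner, size=100_000):
          k = rng.binomial(n_outer, p_outer, size=size)      # binomial number of summands
          return rng.binomial(k * n_inner, p_inner)          # sum of k iid Binomial(n_inner, p_inner) variables

      sample = reentrant_binomial(100, 0.3, 5, 0.1)
      print(sample.mean(), sample.var())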

  19. Spiritual and ceremonial plants in North America: an assessment of Moerman's ethnobotanical database comparing Residual, Binomial, Bayesian and Imprecise Dirichlet Model (IDM) analysis.

    PubMed

    Turi, Christina E; Murch, Susan J

    2013-07-09

    Ethnobotanical research and the study of plants used for rituals, ceremonies and to connect with the spirit world have led to the discovery of many novel psychoactive compounds such as nicotine, caffeine, and cocaine. In North America, spiritual and ceremonial uses of plants are well documented and can be accessed online via the University of Michigan's Native American Ethnobotany Database. The objective of the study was to compare Residual, Bayesian, Binomial and Imprecise Dirichlet Model (IDM) analyses of ritual, ceremonial and spiritual plants in Moerman's ethnobotanical database and to identify genera that may be good candidates for the discovery of novel psychoactive compounds. The database was queried with the following format "Family Name AND Ceremonial OR Spiritual" for 263 North American botanical families. Spiritual and ceremonial flora consisted of 86 families with 517 species belonging to 292 genera. Spiritual taxa were then grouped further into ceremonial medicines and items categories. Residual, Bayesian, Binomial and IDM analyses were performed to identify over- and under-utilized families. The 4 statistical approaches were in good agreement when identifying under-utilized families, but large families (>393 species) were underemphasized by the Binomial, Bayesian and IDM approaches for over-utilization. Residual, Binomial, and IDM analyses identified similar families as over-utilized in the medium (92-392 species) and small (<92 species) classes. The families Apiaceae, Asteraceae, Ericaceae, Pinaceae and Salicaceae were identified as significantly over-utilized as ceremonial medicines in medium and large-sized families. Analysis of genera within the Apiaceae and Asteraceae suggests that the genera Ligusticum and Artemisia are good candidates for facilitating the discovery of novel psychoactive compounds. The 4 statistical approaches were not consistent in the selection of over-utilization of flora. Residual analysis revealed overall trends that were supported by Binomial analysis when separated into small, medium and large families. The Bayesian, Binomial and IDM approaches identified different genera as potentially important. Species belonging to the genera Artemisia and Ligusticum were most consistently identified and may be valuable in future ethnopharmacological studies. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  20. Detection of Clostridium difficile infection clusters, using the temporal scan statistic, in a community hospital in southern Ontario, Canada, 2006-2011.

    PubMed

    Faires, Meredith C; Pearl, David L; Ciccotelli, William A; Berke, Olaf; Reid-Smith, Richard J; Weese, J Scott

    2014-05-12

    In hospitals, Clostridium difficile infection (CDI) surveillance relies on unvalidated guidelines or threshold criteria to identify outbreaks. This can result in false-positive and -negative cluster alarms. The application of statistical methods to identify and understand CDI clusters may be a useful alternative or complement to standard surveillance techniques. The objectives of this study were to investigate the utility of the temporal scan statistic for detecting CDI clusters and determine if there are significant differences in the rate of CDI cases by month, season, and year in a community hospital. Bacteriology reports of patients identified with a CDI from August 2006 to February 2011 were collected. For patients detected with CDI from March 2010 to February 2011, stool specimens were obtained. Clostridium difficile isolates were characterized by ribotyping and investigated for the presence of toxin genes by PCR. CDI clusters were investigated using a retrospective temporal scan test statistic. Statistically significant clusters were compared to known CDI outbreaks within the hospital. A negative binomial regression model was used to identify associations between year, season, month and the rate of CDI cases. Overall, 86 CDI cases were identified. Eighteen specimens were analyzed and nine ribotypes were classified with ribotype 027 (n = 6) the most prevalent. The temporal scan statistic identified significant CDI clusters at the hospital (n = 5), service (n = 6), and ward (n = 4) levels (P ≤ 0.05). Three clusters were concordant with the one C. difficile outbreak identified by hospital personnel. Two clusters were identified as potential outbreaks. The negative binomial model indicated years 2007-2010 (P ≤ 0.05) had decreased CDI rates compared to 2006 and spring had an increased CDI rate compared to the fall (P = 0.023). Application of the temporal scan statistic identified several clusters, including potential outbreaks not detected by hospital personnel. The identification of time periods with decreased or increased CDI rates may have been a result of specific hospital events. Understanding the clustering of CDIs can aid in the interpretation of surveillance data and lead to the development of better early detection systems.
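
    A sketch of the kind of negative binomial regression of monthly case counts on year and season described above; the data file and column names are hypothetical placeholders, and the dispersion parameter is fixed here rather than estimated.

      # Hedged sketch; file name, columns, and fixed dispersion are assumptions.
      import pandas as pd
      import statsmodels.api as sm
      import statsmodels.formula.api as smf

      df = pd.read_csv("monthly_cdi_counts.csv")      # hypothetical: columns cases, year, season
      model = smf.glm("cases ~ C(year) + C(season)", data=df,
                      family=sm.families.NegativeBinomial(alpha=1.0))
      print(model.fit().summary())                    # exp(coef) gives rate ratios by year and season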

  1. Selecting Tools to Model Integer and Binomial Multiplication

    ERIC Educational Resources Information Center

    Pratt, Sarah Smitherman; Eddy, Colleen M.

    2017-01-01

    Mathematics teachers frequently provide concrete manipulatives to students during instruction; however, the rationale for using certain manipulatives in conjunction with concepts may not be explored. This article focuses on area models that are currently used in classrooms to provide concrete examples of integer and binomial multiplication. The…

  2. Enumerative and binomial sampling plans for citrus mealybug (Homoptera: pseudococcidae) in citrus groves.

    PubMed

    Martínez-Ferrer, María Teresa; Ripollés, José Luís; Garcia-Marí, Ferran

    2006-06-01

    The spatial distribution of the citrus mealybug, Planococcus citri (Risso) (Homoptera: Pseudococcidae), was studied in citrus groves in northeastern Spain. Constant precision sampling plans were designed for all developmental stages of citrus mealybug under the fruit calyx, for late stages on fruit, and for females on trunks and main branches; more than 66, 286, and 101 data sets, respectively, were collected from nine commercial fields during 1992-1998. Dispersion parameters were determined using Taylor's power law, giving aggregated spatial patterns for citrus mealybug populations in the three locations of the tree sampled. A significant relationship between the number of insects per organ and the percentage of occupied organs was established using either Wilson and Room's binomial model or Kono and Sugino's empirical formula. Constant precision (E = 0.25) sampling plans (i.e., enumerative plans) for estimating mean densities were developed using Green's equation and the two binomial models. For making management decisions, enumerative counts may be less labor-intensive than binomial sampling. Therefore, we recommend enumerative sampling plans for use in an integrated pest management program in citrus. Required sample sizes for the range of population densities near current management thresholds in the three plant locations (calyx, fruit, and trunk) were 50, 110-330, and 30, respectively. Binomial sampling, especially the empirical model, required a higher sample size to achieve equivalent levels of precision.
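
    A sketch of a Taylor's power law fit, the relationship (variance = a x mean^b) used above to characterize dispersion: regress log sample variance on log sample mean across fields. The per-field counts below are simulated placeholders.

      # Illustrative Taylor's power law fit on simulated counts; not the study's data.
      import numpy as np

      rng = np.random.default_rng(6)
      field_counts = [rng.negative_binomial(2, 0.3, size=50) for _ in range(20)]  # hypothetical samples

      means = np.array([c.mean() for c in field_counts])
      variances = np.array([c.var(ddof=1) for c in field_counts])

      b, log_a = np.polyfit(np.log(means), np.log(variances), 1)   # slope b, intercept log(a)
      print("a =", np.exp(log_a), "b =", b)                        # b > 1 indicates aggregation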

  3. Patterns of medicinal plant use: an examination of the Ecuadorian Shuar medicinal flora using contingency table and binomial analyses.

    PubMed

    Bennett, Bradley C; Husby, Chad E

    2008-03-28

    Botanical pharmacopoeias are non-random subsets of floras, with some taxonomic groups over- or under-represented. Moerman [Moerman, D.E., 1979. Symbols and selectivity: a statistical analysis of Native American medical ethnobotany, Journal of Ethnopharmacology 1, 111-119] introduced linear regression/residual analysis to examine these patterns. However, regression, the commonly-employed analysis, suffers from several statistical flaws. We use contingency table and binomial analyses to examine patterns of Shuar medicinal plant use (from Amazonian Ecuador). We first analyzed the Shuar data using Moerman's approach, modified to better meet requirements of linear regression analysis. Second, we assessed the exact randomization contingency table test for goodness of fit. Third, we developed a binomial model to test for non-random selection of plants in individual families. Modified regression models (which accommodated assumptions of linear regression) reduced R^2 from 0.59 to 0.38, but did not eliminate all problems associated with regression analyses. Contingency table analyses revealed that the entire flora departs from the null model of equal proportions of medicinal plants in all families. In the binomial analysis, only 10 angiosperm families (of 115) differed significantly from the null model. These 10 families are largely responsible for patterns seen at higher taxonomic levels. Contingency table and binomial analyses offer an easy and statistically valid alternative to the regression approach.
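
    A sketch of the family-level binomial test described above: for each family, compare its number of medicinal species with the flora-wide proportion. The counts are invented placeholders, not the Shuar data.

      # Illustrative binomial test per family; all counts below are made up.
      from scipy.stats import binomtest

      flora_species = 16500        # hypothetical total species in the flora
      medicinal_species = 730      # hypothetical total medicinal species
      p_overall = medicinal_species / flora_species

      families = {"Asteraceae": (1200, 90), "Orchidaceae": (1500, 10)}  # family: (species, medicinal)
      for name, (n_species, n_medicinal) in families.items():
          result = binomtest(n_medicinal, n_species, p_overall)
          print(name, round(result.pvalue, 4))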

  4. Estimating the Parameters of the Beta-Binomial Distribution.

    ERIC Educational Resources Information Center

    Wilcox, Rand R.

    1979-01-01

    For some situations the beta-binomial distribution might be used to describe the marginal distribution of test scores for a particular population of examinees. Several different methods of approximating the maximum likelihood estimate were investigated, and it was found that the Newton-Raphson method should be used when it yields admissible…
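
    For comparison with the approximation methods discussed, the sketch below maximizes the beta-binomial log-likelihood directly by numerical optimization; the test-score data are simulated placeholders.

      # Direct numerical MLE sketch for the beta-binomial; data are simulated.
      import numpy as np
      from scipy.optimize import minimize
      from scipy.special import betaln, gammaln

      rng = np.random.default_rng(7)
      n_items = 40
      scores = rng.binomial(n_items, rng.beta(2.0, 5.0, size=500))   # toy beta-binomial sample

      def neg_log_lik(params):
          a, b = np.exp(params)                    # optimize on the log scale to keep a, b > 0
          ll = (gammaln(n_items + 1) - gammaln(scores + 1) - gammaln(n_items - scores + 1)
                + betaln(scores + a, n_items - scores + b) - betaln(a, b))
          return -ll.sum()

      fit = minimize(neg_log_lik, x0=np.log([1.0, 1.0]), method="Nelder-Mead")
      print(np.exp(fit.x))     # estimated (alpha, beta)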

  5. Multilevel Models for Binary Data

    ERIC Educational Resources Information Center

    Powers, Daniel A.

    2012-01-01

    The methods and models for categorical data analysis cover considerable ground, ranging from regression-type models for binary and binomial data, count data, to ordered and unordered polytomous variables, as well as regression models that mix qualitative and continuous data. This article focuses on methods for binary or binomial data, which are…

  6. Establishing endangered species recovery criteria using predictive simulation modeling

    USGS Publications Warehouse

    McGowan, Conor P.; Catlin, Daniel H.; Shaffer, Terry L.; Gratto-Trevor, Cheri L.; Aron, Carol

    2014-01-01

    Listing a species under the Endangered Species Act (ESA) and developing a recovery plan requires U.S. Fish and Wildlife Service to establish specific and measurable criteria for delisting. Generally, species are listed because they face (or are perceived to face) elevated risk of extinction due to issues such as habitat loss, invasive species, or other factors. Recovery plans identify recovery criteria that reduce extinction risk to an acceptable level. It logically follows that the recovery criteria, the defined conditions for removing a species from ESA protections, need to be closely related to extinction risk. Extinction probability is a population parameter estimated with a model that uses current demographic information to project the population into the future over a number of replicates, calculating the proportion of replicated populations that go extinct. We simulated extinction probabilities of piping plovers in the Great Plains and estimated the relationship between extinction probability and various demographic parameters. We tested the fit of regression models linking initial abundance, productivity, or population growth rate to extinction risk, and then, using the regression parameter estimates, determined the conditions required to reduce extinction probability to some pre-defined acceptable threshold. Binomial regression models with mean population growth rate and the natural log of initial abundance were the best predictors of extinction probability 50 years into the future. For example, based on our regression models, an initial abundance of approximately 2400 females with an expected mean population growth rate of 1.0 will limit extinction risk for piping plovers in the Great Plains to less than 0.048. Our method provides a straightforward way of developing specific and measurable recovery criteria linked directly to the core issue of extinction risk. Published by Elsevier Ltd.
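
    A sketch of the final step described above under simulated placeholder inputs: fit a binomial (logistic) regression of extinction outcomes on the natural log of initial abundance, then invert the fitted relationship to find the abundance at which extinction risk equals a chosen threshold.

      # Illustrative sketch; the simulated inputs stand in for population-model output
      # and are not the piping plover results.
      import numpy as np
      import statsmodels.api as sm

      rng = np.random.default_rng(8)
      n_sims = 2000
      init_abundance = rng.uniform(200, 5000, size=n_sims)
      p_ext_true = 1 / (1 + np.exp(-(6.0 - 1.0 * np.log(init_abundance))))   # placeholder risk curve
      extinct = rng.binomial(1, p_ext_true)

      X = sm.add_constant(np.log(init_abundance))
      fit = sm.GLM(extinct, X, family=sm.families.Binomial()).fit()
      b0, b1 = fit.params

      risk_threshold = 0.05
      # Invert logit(risk) = b0 + b1 * log(N) to get the abundance meeting the threshold.
      log_n_required = (np.log(risk_threshold / (1 - risk_threshold)) - b0) / b1
      print(np.exp(log_n_required))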

  7. The gap between suicide characteristics in the print media and in the population.

    PubMed

    Niederkrotenthaler, Thomas; Till, Benedikt; Herberth, Arno; Voracek, Martin; Kapusta, Nestor D; Etzersdorfer, Elmar; Strauss, Markus; Sonneck, Gernot

    2009-08-01

    Programmes to educate media professionals about suicide are increasingly established, but information about which suicide cases are most likely to be reported in the mass media is sparse. We applied binomial tests to compare frequencies of social characteristics of all domestic suicides in the 13 largest Austrian print media in 2005 with frequencies of suicide characteristics in the population. Additionally, each reported suicide case was linked to its respective entry in the suicide database. We performed a logistic regression analysis, with presence of an article as outcome, and sex of the suicide case, age, religious affiliation, family status, conduction of an autopsy and location of the suicide as explaining variables. Time of the year and federal state where the suicide happened was controlled for. Binomial tests showed that suicides involving murder or murder attempt were over-represented in the media. Reporting on mental disorders was under-represented. In the regression analysis, the likelihood of a report was negatively associated with the age of suicide cases. Foreign citizenship was a further predictor of a suicide report. The methods of drowning, jumping, shooting and rare methods were more likely to be reported than hanging, which is the most frequent suicide method in Austria. Suicide characteristics in the media are not representative of the population. The identified discrepancies provide a basis for tailor-made education of mass media professionals.

  8. A robust design mark-resight abundance estimator allowing heterogeneity in resighting probabilities

    USGS Publications Warehouse

    McClintock, B.T.; White, Gary C.; Burnham, K.P.

    2006-01-01

    This article introduces the beta-binomial estimator (BBE), a closed-population abundance mark-resight model combining the favorable qualities of maximum likelihood theory and the allowance of individual heterogeneity in sighting probability (p). The model may be parameterized for a robust sampling design consisting of multiple primary sampling occasions where closure need not be met between primary occasions. We applied the model to brown bear data from three study areas in Alaska and compared its performance to the joint hypergeometric estimator (JHE) and Bowden's estimator (BOWE). BBE estimates suggest heterogeneity levels were non-negligible and discourage the use of JHE for these data. Compared to JHE and BOWE, confidence intervals were considerably shorter for the AICc model-averaged BBE. To evaluate the properties of BBE relative to JHE and BOWE when sample sizes are small, simulations were performed with data from three primary occasions generated under both individual heterogeneity and temporal variation in p. All models remained consistent regardless of levels of variation in p. In terms of precision, the AICc model-averaged BBE showed advantages over JHE and BOWE when heterogeneity was present and mean sighting probabilities were similar between primary occasions. Based on the conditions examined, BBE is a reliable alternative to JHE or BOWE and provides a framework for further advances in mark-resight abundance estimation. © 2006 American Statistical Association and the International Biometric Society.

  9. Recall of Anti-Tobacco Advertisements and Effects on Quitting Behavior: Results From the California Smokers Cohort

    PubMed Central

    Leas, Eric C.; Myers, Mark G.; Strong, David R.; Hofstetter, C. Richard

    2015-01-01

    Objectives. We assessed whether an anti-tobacco television advertisement called “Stages,” which depicted a woman giving a brief emotional narrative of her experiences with tobacco use, would be recalled more often and have a greater effect on smoking cessation than 3 other advertisements with different intended themes. Methods. Our data were derived from a sample of 2596 California adult smokers. We used multivariable log-binomial and modified Poisson regression models to calculate respondents’ probability of quitting as a result of advertisement recall. Results. More respondents recalled the “Stages” ad (58.5%) than the 3 other ads (23.1%, 23.4%, and 25.6%; P < .001). Respondents who recalled “Stages” at baseline had a higher probability than those who did not recall the ad of making a quit attempt between baseline and follow-up (adjusted risk ratio [RR] = 1.18; 95% confidence interval [CI] = 1.03, 1.34) and a higher probability of being in a period of smoking abstinence for at least a month at follow-up (adjusted RR = 1.55; 95% CI = 1.02, 2.37). Conclusions. Anti-tobacco television advertisements that depict visceral and personal messages may be recalled by a larger percentage of smokers and may have a greater impact on smoking cessation than other types of advertisements. PMID:25521871
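
    The modified Poisson approach mentioned above estimates adjusted risk ratios for a binary outcome by fitting a Poisson GLM with robust (sandwich) standard errors. A hedged sketch on simulated data; the exposure and outcome names are placeholders, not the survey's variables:

    ```python
    import numpy as np
    import statsmodels.api as sm

    rng = np.random.default_rng(0)
    n = 2000
    recalled_ad = rng.binomial(1, 0.5, n)            # hypothetical exposure
    p_quit = 0.20 + 0.04 * recalled_ad               # true risk ratio = 1.2
    quit_attempt = rng.binomial(1, p_quit)           # binary outcome

    X = sm.add_constant(recalled_ad)
    fit = sm.GLM(quit_attempt, X, family=sm.families.Poisson()).fit(cov_type="HC0")
    rr = np.exp(fit.params[1])
    ci = np.exp(fit.conf_int()[1])
    print(f"adjusted RR = {rr:.2f}, 95% CI {ci[0]:.2f} to {ci[1]:.2f}")
    ```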

  10. Possibility and Challenges of Conversion of Current Virus Species Names to Linnaean Binomials

    PubMed Central

    Postler, Thomas S.; Clawson, Anna N.; Amarasinghe, Gaya K.; Basler, Christopher F.; Bavari, Sbina; Benkő, Mária; Blasdell, Kim R.; Briese, Thomas; Buchmeier, Michael J.; Bukreyev, Alexander; Calisher, Charles H.; Chandran, Kartik; Charrel, Rémi; Clegg, Christopher S.; Collins, Peter L.; Juan Carlos, De La Torre; Derisi, Joseph L.; Dietzgen, Ralf G.; Dolnik, Olga; Dürrwald, Ralf; Dye, John M.; Easton, Andrew J.; Emonet, Sébastian; Formenty, Pierre; Fouchier, Ron A. M.; Ghedin, Elodie; Gonzalez, Jean-Paul; Harrach, Balázs; Hewson, Roger; Horie, Masayuki; Jiāng, Dàohóng; Kobinger, Gary; Kondo, Hideki; Kropinski, Andrew M.; Krupovic, Mart; Kurath, Gael; Lamb, Robert A.; Leroy, Eric M.; Lukashevich, Igor S.; Maisner, Andrea; Mushegian, Arcady R.; Netesov, Sergey V.; Nowotny, Norbert; Patterson, Jean L.; Payne, Susan L.; PaWeska, Janusz T.; Peters, Clarence J.; Radoshitzky, Sheli R.; Rima, Bertus K.; Romanowski, Victor; Rubbenstroth, Dennis; Sabanadzovic, Sead; Sanfaçon, Hélène; Salvato, Maria S.; Schwemmle, Martin; Smither, Sophie J.; Stenglein, Mark D.; Stone, David M.; Takada, Ayato; Tesh, Robert B.; Tomonaga, Keizo; Tordo, Noël; Towner, Jonathan S.; Vasilakis, Nikos; Volchkov, Viktor E.; Wahl-Jensen, Victoria; Walker, Peter J.; Wang, Lin-Fa; Varsani, Arvind; Whitfield, Anna E.; Zerbini, F. Murilo; Kuhn, Jens H.

    2017-01-01

    Abstract Botanical, mycological, zoological, and prokaryotic species names follow the Linnaean format, consisting of an italicized Latinized binomen with a capitalized genus name and a lower case species epithet (e.g., Homo sapiens). Virus species names, however, do not follow a uniform format, and, even when binomial, are not Linnaean in style. In this thought exercise, we attempted to convert all currently official names of species included in the virus family Arenaviridae and the virus order Mononegavirales to Linnaean binomials, and to identify and address associated challenges and concerns. Surprisingly, this endeavor was not as complicated or time-consuming as even the authors of this article expected when conceiving the experiment. PMID:27798405

  11. Analysis of overdispersed count data by mixtures of Poisson variables and Poisson processes.

    PubMed

    Hougaard, P; Lee, M L; Whitmore, G A

    1997-12-01

    Count data often show overdispersion compared to the Poisson distribution. Overdispersion is typically modeled by a random effect for the mean, based on the gamma distribution, leading to the negative binomial distribution for the count. This paper considers a larger family of mixture distributions, including the inverse Gaussian mixture distribution. It is demonstrated that it gives a significantly better fit for a data set on the frequency of epileptic seizures. The same approach can be used to generate counting processes from Poisson processes, where the rate or the time is random. A random rate corresponds to variation between patients, whereas a random time corresponds to variation within patients.
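
    The gamma-mixing construction described above can be verified numerically: Poisson counts whose means are drawn from a gamma distribution follow a negative binomial. A short sketch with arbitrary parameter values:

    ```python
    import numpy as np
    from scipy.stats import nbinom

    rng = np.random.default_rng(1)
    mu, k = 4.0, 1.5                      # mean and gamma shape (dispersion)
    lam = rng.gamma(shape=k, scale=mu / k, size=200_000)
    counts = rng.poisson(lam)

    # A negative binomial with size k and p = k / (k + mu) has the same mean mu
    # and variance mu + mu**2 / k as the gamma-mixed Poisson above.
    p = k / (k + mu)
    for x in range(5):
        print(x, round(np.mean(counts == x), 4), round(nbinom.pmf(x, k, p), 4))
    ```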

  12. The coverage of a random sample from a biological community.

    PubMed

    Engen, S

    1975-03-01

    A taxonomic group will frequently have a large number of species with small abundances. When a sample is drawn at random from this group, one is therefore faced with the problem that a large proportion of the species will not be discovered. A general definition of quantitative measures of "sample coverage" is proposed, and the problem of statistical inference is considered for two special cases, (1) the actual total relative abundance of those species that are represented in the sample, and (2) their relative contribution to the information index of diversity. The analysis is based on an extended version of the negative binomial species frequency model. The results are tabulated.

  13. Utilization of accident databases and fuzzy sets to estimate frequency of HazMat transport accidents.

    PubMed

    Qiao, Yuanhua; Keren, Nir; Mannan, M Sam

    2009-08-15

    Risk assessment and management of transportation of hazardous materials (HazMat) require the estimation of accident frequency. This paper presents a methodology to estimate hazardous materials transportation accident frequency by utilizing publicly available databases and expert knowledge. The estimation process addresses route-dependent and route-independent variables. Negative binomial regression is applied to an analysis of the Department of Public Safety (DPS) accident database to derive basic accident frequency as a function of route-dependent variables, while the effects of route-independent variables are modeled by fuzzy logic. The integrated methodology provides the basis for an overall transportation risk analysis, which can be used later to develop a decision support system.

  14. Observation weights unlock bulk RNA-seq tools for zero inflation and single-cell applications.

    PubMed

    Van den Berge, Koen; Perraudeau, Fanny; Soneson, Charlotte; Love, Michael I; Risso, Davide; Vert, Jean-Philippe; Robinson, Mark D; Dudoit, Sandrine; Clement, Lieven

    2018-02-26

    Dropout events in single-cell RNA sequencing (scRNA-seq) cause many transcripts to go undetected and induce an excess of zero read counts, leading to power issues in differential expression (DE) analysis. This has triggered the development of bespoke scRNA-seq DE methods to cope with zero inflation. Recent evaluations, however, have shown that dedicated scRNA-seq tools provide no advantage compared to traditional bulk RNA-seq tools. We introduce a weighting strategy, based on a zero-inflated negative binomial model, that identifies excess zero counts and generates gene- and cell-specific weights to unlock bulk RNA-seq DE pipelines for zero-inflated data, boosting performance for scRNA-seq.
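
    One way to read the weighting strategy is as the posterior probability that an observed zero was generated by the negative binomial count component rather than by excess zero inflation; nonzero counts receive weight one. The sketch below uses placeholder parameters rather than the gene- and cell-specific estimates the method actually fits:

    ```python
    import numpy as np
    from scipy.stats import nbinom

    def zinb_weights(counts, pi0, mu, theta):
        """Posterior weight that each count came from the NB count component.

        pi0   : zero-inflation probability (probability of an excess zero)
        mu    : NB mean; theta: NB dispersion (size) parameter
        """
        p = theta / (theta + mu)
        nb_zero = nbinom.pmf(0, theta, p)
        w_zero = (1 - pi0) * nb_zero / (pi0 + (1 - pi0) * nb_zero)
        counts = np.asarray(counts)
        return np.where(counts == 0, w_zero, 1.0)

    print(zinb_weights([0, 0, 3, 7], pi0=0.4, mu=5.0, theta=0.8))
    ```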

  15. Statistical procedures for analyzing mental health services data.

    PubMed

    Elhai, Jon D; Calhoun, Patrick S; Ford, Julian D

    2008-08-15

    In mental health services research, analyzing service utilization data often poses serious problems, given the presence of substantially skewed data distributions. This article presents a non-technical introduction to statistical methods specifically designed to handle the complexly distributed datasets that represent mental health service use, including Poisson, negative binomial, zero-inflated, and zero-truncated regression models. A flowchart is provided to assist the investigator in selecting the most appropriate method. Finally, a dataset of mental health service use reported by medical patients is described, and a comparison of results across several different statistical methods is presented. Implications of matching data analytic techniques appropriately with the often complexly distributed datasets of mental health services utilization variables are discussed.
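
    A common first step with such skewed count outcomes is to fit the candidate models and compare information criteria. A minimal sketch contrasting Poisson and negative binomial fits on synthetic overdispersed counts (statsmodels; not the article's dataset):

    ```python
    import numpy as np
    import statsmodels.api as sm

    rng = np.random.default_rng(2)
    n = 1000
    x = rng.normal(size=n)
    mu = np.exp(0.3 + 0.5 * x)
    visits = rng.negative_binomial(n=1.0, p=1.0 / (1.0 + mu))   # overdispersed counts

    X = sm.add_constant(x)
    poisson_fit = sm.Poisson(visits, X).fit(disp=0)
    negbin_fit = sm.NegativeBinomial(visits, X).fit(disp=0)
    print("Poisson AIC:", round(poisson_fit.aic, 1))
    print("Negative binomial AIC:", round(negbin_fit.aic, 1))
    ```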

  16. Inference of R 0 and Transmission Heterogeneity from the Size Distribution of Stuttering Chains

    PubMed Central

    Blumberg, Seth; Lloyd-Smith, James O.

    2013-01-01

    For many infectious disease processes such as emerging zoonoses and vaccine-preventable diseases, R0 < 1 and infections occur as self-limited stuttering transmission chains. A mechanistic understanding of transmission is essential for characterizing the risk of emerging diseases and monitoring spatio-temporal dynamics. Thus methods for inferring R0 and the degree of heterogeneity in transmission from stuttering chain data have important applications in disease surveillance and management. Previous researchers have used chain size distributions to infer R0, but estimation of the degree of individual-level variation in infectiousness (as quantified by the dispersion parameter, k) has typically required contact tracing data. Utilizing branching process theory along with a negative binomial offspring distribution, we demonstrate how maximum likelihood estimation can be applied to chain size data to infer both R0 and the dispersion parameter that characterizes heterogeneity. While the maximum likelihood value for R0 is a simple function of the average chain size, the associated confidence intervals are dependent on the inferred degree of transmission heterogeneity. As demonstrated for monkeypox data from the Democratic Republic of Congo, this impacts when a statistically significant change in R0 is detectable. In addition, by allowing for superspreading events, inference of k shifts the threshold above which a transmission chain should be considered anomalously large for a given value of R0 (thus reducing the probability of false alarms about pathogen adaptation). Our analysis of monkeypox also clarifies the various ways that imperfect observation can impact inference of transmission parameters, and highlights the need to quantitatively evaluate whether observation is likely to significantly bias results. PMID:23658504
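
    Assuming the standard branching-process result for a negative binomial offspring distribution (mean R0, dispersion k), the chain size distribution and the simple R0 estimator mentioned above can be sketched as follows; the chain sizes are invented and the formula is quoted from the general literature rather than from the paper itself:

    ```python
    import numpy as np
    from scipy.special import gammaln

    def chain_size_pmf(j, r0, k):
        """P(total chain size = j) for a subcritical branching process with a
        negative binomial offspring distribution (mean r0, dispersion k)."""
        j = np.asarray(j, dtype=float)
        log_p = (gammaln(k * j + j - 1) - gammaln(k * j) - gammaln(j + 1)
                 + (j - 1) * np.log(r0 / k)
                 - (k * j + j - 1) * np.log(1 + r0 / k))
        return np.exp(log_p)

    chain_sizes = np.array([1, 1, 2, 1, 5, 1, 3, 1, 1, 2])   # hypothetical data
    r0_mle = 1 - 1 / chain_sizes.mean()                       # MLE from mean chain size
    print("R0 MLE:", round(r0_mle, 3))
    print("P(size = 1..4):", chain_size_pmf([1, 2, 3, 4], r0_mle, k=0.5).round(3))
    ```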

  17. QMRA for Drinking Water: 2. The Effect of Pathogen Clustering in Single-Hit Dose-Response Models.

    PubMed

    Nilsen, Vegard; Wyller, John

    2016-01-01

    Spatial and/or temporal clustering of pathogens will invalidate the commonly used assumption of Poisson-distributed pathogen counts (doses) in quantitative microbial risk assessment. In this work, the theoretically predicted effect of spatial clustering in conventional "single-hit" dose-response models is investigated by employing the stuttering Poisson distribution, a very general family of count distributions that naturally models pathogen clustering and contains the Poisson and negative binomial distributions as special cases. The analysis is facilitated by formulating the dose-response models in terms of probability generating functions. It is shown formally that the theoretical single-hit risk obtained with a stuttering Poisson distribution is lower than that obtained with a Poisson distribution, assuming identical mean doses. A similar result holds for mixed Poisson distributions. Numerical examples indicate that the theoretical single-hit risk is fairly insensitive to moderate clustering, though the effect tends to be more pronounced for low mean doses. Furthermore, using Jensen's inequality, an upper bound on risk is derived that tends to better approximate the exact theoretical single-hit risk for highly overdispersed dose distributions. The bound holds with any dose distribution (characterized by its mean and zero inflation index) and any conditional dose-response model that is concave in the dose variable. Its application is exemplified with published data from Norovirus feeding trials, for which some of the administered doses were prepared from an inoculum of aggregated viruses. The potential implications of clustering for dose-response assessment as well as practical risk characterization are discussed. © 2016 Society for Risk Analysis.
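
    Writing the single-hit model through the probability generating function makes the comparison concrete: with per-organism infection probability r, risk = 1 - G(1 - r), where G is the PGF of the dose distribution. A sketch for Poisson versus negative binomial (clustered) doses with the same mean, using illustrative values:

    ```python
    import numpy as np

    def single_hit_risk_poisson(mean_dose, r):
        # Poisson PGF: G(s) = exp(mean_dose * (s - 1))
        return 1 - np.exp(-mean_dose * r)

    def single_hit_risk_negbin(mean_dose, k, r):
        # Negative binomial PGF (mean mu, dispersion k): G(s) = (1 + mu*(1-s)/k)**(-k)
        return 1 - (1 + mean_dose * r / k) ** (-k)

    mu, r = 2.0, 0.1
    for k in (10.0, 1.0, 0.1):          # smaller k means stronger clustering
        print("k =", k, "risk =", round(single_hit_risk_negbin(mu, k, r), 4))
    print("Poisson risk =", round(single_hit_risk_poisson(mu, r), 4))
    ```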

  18. Determinants of the geographic distribution of Puumala virus and Lyme borreliosis infections in Belgium.

    PubMed

    Linard, Catherine; Lamarque, Pénélope; Heyman, Paul; Ducoffre, Geneviève; Luyasu, Victor; Tersago, Katrien; Vanwambeke, Sophie O; Lambin, Eric F

    2007-05-02

    Vector-borne and zoonotic diseases generally display clear spatial patterns due to different space-dependent factors. Land cover and land use influence disease transmission by controlling both the spatial distribution of vectors or hosts, and the probability of contact with susceptible human populations. The objective of this study was to combine environmental and socio-economic factors to explain the spatial distribution of two emerging human diseases in Belgium, Puumala virus (PUUV) and Lyme borreliosis. Municipalities were taken as units of analysis. Negative binomial regressions including a correction for spatial endogeneity show that the spatial distribution of PUUV and Lyme borreliosis infections are associated with a combination of factors linked to the vector and host populations, to human behaviours, and to landscape attributes. Both diseases are associated with the presence of forests, which are the preferred habitat for vector or host populations. The PUUV infection risk is higher in remote forest areas, where the level of urbanisation is low, and among low-income populations. The Lyme borreliosis transmission risk is higher in mixed landscapes with forests and spatially dispersed houses, mostly in wealthy peri-urban areas. The spatial dependence resulting from a combination of endogenous and exogenous processes could be accounted for in the model on PUUV but not for Lyme borreliosis. A large part of the spatial variation in disease risk can be explained by environmental and socio-economic factors. The two diseases not only are most prevalent in different regions but also affect different groups of people. Combining these two criteria may increase the efficiency of information campaigns through appropriate targeting.

  19. Challenging the assumption of habitat limitation: An example from centrarchid fishes over an intermediate spatial scale

    USGS Publications Warehouse

    Gutreuter, S.

    2004-01-01

    Habitat rehabilitation efforts are predicated on the frequently untested assumption that habitat is limiting to populations. These efforts are typically costly and will be ineffective if habitat is not limiting. Therefore it is important to assess, rather than assume, habitat limitation wherever habitat rehabilitation projects are considered. Catch-count data from a standardized probability-based stratified-random monitoring programme were examined for indirect evidence of backwater habitat limitation by centrarchid fishes in the Upper Mississippi River System. The monitoring design enabled fitting statistical models of the association between mean catch at the spatial scale of tens of river kilometres and the percentage of contiguous aquatic area in backwater at least 1 m deep by maximizing a stratum-area weighted negative binomial log-likelihood function. Statistical models containing effects for backwater limitation failed to account for substantial variation in the data. However, 95% confidence intervals on the backwater parameter estimates excluded zero, indicating that population abundance may be limited by backwater prevalence where backwaters are extremely scarce. The combined results indicate, at most, a weak signal of backwater limitation where backwaters are extremely scarce in the lower reaches, but not elsewhere in the Upper Mississippi River System. This suggests that habitat restoration projects designed to increase the area of backwaters suitable for winter survival of centrarchids are unlikely to produce measurable benefits over intermediate spatial scales in much of the Upper Mississippi River System, and indicates the importance of correct identification of limiting processes. Published in 2004 by John Wiley and Sons, Ltd.

  20. Animal Ownership and Touching Enrich the Context of Social Contacts Relevant to the Spread of Human Infectious Diseases.

    PubMed

    Kifle, Yimer Wasihun; Goeyvaerts, Nele; Van Kerckhove, Kim; Willem, Lander; Kucharski, Adam; Faes, Christel; Leirs, Herwig; Hens, Niel; Beutels, Philippe

    2015-01-01

    Many human infectious diseases originate from animals or are transmitted through animal vectors. We aimed to identify factors that are predictive of ownership and touching of animals, assess whether animal ownership influences social contact behavior, and estimate the probability of a major zoonotic outbreak should a transmissible influenza-like pathogen be present in animals, all in the setting of a densely populated European country. A diary-based social contact survey (n = 1768) was conducted in Flanders, Belgium, from September 2010 until February 2011. Many participants touched pets (46%), poultry (2%) or livestock (2%) on a randomly assigned day, and a large proportion of participants owned such animals (51%, 15% and 5%, respectively). Logistic regression models indicated that larger households are more likely to own an animal and, unsurprisingly, that animal owners are more likely to touch animals. We observed a significant effect of age on animal ownership and touching. The total number of social contacts during a randomly assigned day was modeled using weighted-negative binomial regression. Apart from age, household size and day type (weekend versus weekday and regular versus holiday period), animal ownership was positively associated with the total number of social contacts during the weekend. Assuming that animal ownership and/or touching are at-risk events, we demonstrate a method to estimate the outbreak potential of zoonoses. We show that in Belgium animal-human interactions involving young children (0-9 years) and adults (25-54 years) have the highest potential to cause a major zoonotic outbreak.

  1. Animal Ownership and Touching Enrich the Context of Social Contacts Relevant to the Spread of Human Infectious Diseases

    PubMed Central

    Kifle, Yimer Wasihun; Goeyvaerts, Nele; Van Kerckhove, Kim; Willem, Lander; Faes, Christel; Leirs, Herwig; Hens, Niel; Beutels, Philippe

    2015-01-01

    Many human infectious diseases originate from animals or are transmitted through animal vectors. We aimed to identify factors that are predictive of ownership and touching of animals, assess whether animal ownership influences social contact behavior, and estimate the probability of a major zoonotic outbreak should a transmissible influenza-like pathogen be present in animals, all in the setting of a densely populated European country. A diary-based social contact survey (n = 1768) was conducted in Flanders, Belgium, from September 2010 until February 2011. Many participants touched pets (46%), poultry (2%) or livestock (2%) on a randomly assigned day, and a large proportion of participants owned such animals (51%, 15% and 5%, respectively). Logistic regression models indicated that larger households are more likely to own an animal and, unsurprisingly, that animal owners are more likely to touch animals. We observed a significant effect of age on animal ownership and touching. The total number of social contacts during a randomly assigned day was modeled using weighted-negative binomial regression. Apart from age, household size and day type (weekend versus weekday and regular versus holiday period), animal ownership was positively associated with the total number of social contacts during the weekend. Assuming that animal ownership and/or touching are at-risk events, we demonstrate a method to estimate the outbreak potential of zoonoses. We show that in Belgium animal-human interactions involving young children (0–9 years) and adults (25–54 years) have the highest potential to cause a major zoonotic outbreak. PMID:26193480

  2. Determination of a Testing Threshold for Lumbar Puncture in the Diagnosis of Subarachnoid Hemorrhage after a Negative Head Computed Tomography: A Decision Analysis.

    PubMed

    Taylor, Richard Andrew; Singh Gill, Harman; Marcolini, Evie G; Meyers, H Pendell; Faust, Jeremy Samuel; Newman, David H

    2016-10-01

    The objective was to determine the testing threshold for lumbar puncture (LP) in the evaluation of aneurysmal subarachnoid hemorrhage (SAH) after a negative head computed tomography (CT). As a secondary aim we sought to identify clinical variables that have the greatest impact on this threshold. A decision analytic model was developed to estimate the testing threshold for patients with normal neurologic findings, being evaluated for SAH, after a negative CT of the head. The testing threshold was calculated as the pretest probability of disease where the two strategies (LP or no LP) are balanced in terms of quality-adjusted life-years. Two-way and probabilistic sensitivity analyses (PSAs) were performed. For the base-case scenario the testing threshold for performing an LP after negative head CT was 4.3%. Results for the two-way sensitivity analyses demonstrated that the test threshold ranged from 1.9% to 15.6%, dominated by the uncertainty in the probability of death from initial missed SAH. In the PSA the mean testing threshold was 4.3% (95% confidence interval = 1.4% to 9.3%). Other significant variables in the model included probability of aneurysmal versus nonaneurysmal SAH after negative head CT, probability of long-term morbidity from initial missed SAH, and probability of renal failure from contrast-induced nephropathy. Our decision analysis results suggest a testing threshold for LP after negative CT to be approximately 4.3%, with a range of 1.4% to 9.3% on robust PSA. In light of these data, and considering the low probability of aneurysmal SAH after a negative CT, classical teaching and current guidelines addressing testing for SAH should be revisited. © 2016 by the Society for Academic Emergency Medicine.

  3. Emotional Support, Negative Interaction and DSM IV Lifetime Disorders among Older African Americans: Findings from the National Survey of American Life (NSAL)

    PubMed Central

    Lincoln, Karen D.; Taylor, Robert Joseph; Bullard, Kai McKeever; Chatters, Linda M.; Himle, Joseph A.; Woodward, Amanda Toler; Jackson, James S.

    2010-01-01

    Objectives Both emotional support and negative interaction with family members have been linked to mental health. However, few studies have examined the associations between emotional support and negative interaction and psychiatric disorders in late life. This study investigated the relationship between emotional support and negative interaction on lifetime prevalence of mood and anxiety disorders among older African Americans. Design The analyses utilized the National Survey of American Life. Methods Logistic regression and negative binomial regression analyses were used to examine the effect of emotional support and negative interaction with family members on the prevalence of lifetime DSM-IV mood and anxiety disorders. Participants Data from 786 African Americans aged 55 years and older were used. Measurement The DSM-IV World Mental Health Composite International Diagnostic Interview (WMH-CIDI) was used to assess mental disorders. Three dependent variables were investigated: the prevalence of lifetime mood disorders, the prevalence of lifetime anxiety disorders, and the total number of lifetime mood and anxiety disorders. Results Multivariate analysis found that emotional support was not associated with any of the three dependent variables. Negative interaction was significantly and positively associated with the odds of having a lifetime mood disorder, a lifetime anxiety disorder and the number of lifetime mood and anxiety disorders. Conclusions This is the first study to investigate the relationship between emotional support, negative interaction with family members and psychiatric disorders among older African Americans. Negative interaction was a risk factor for mood and anxiety disorders among older African Americans, whereas emotional support was not significant. PMID:20157904

  4. Binomial Coefficients Modulo a Prime--A Visualization Approach to Undergraduate Research

    ERIC Educational Resources Information Center

    Bardzell, Michael; Poimenidou, Eirini

    2011-01-01

    In this article we present, as a case study, results of undergraduate research involving binomial coefficients modulo a prime "p." We will discuss how undergraduates were involved in the project, even with a minimal mathematical background beforehand. There are two main avenues of exploration described to discover these binomial…
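
    One standard computational entry point to binomial coefficients modulo a prime, not necessarily the route taken in the project above, is Lucas' theorem, which reduces C(n, m) mod p to digit-wise binomial coefficients in base p:

    ```python
    from math import comb

    def binom_mod_p(n, m, p):
        """C(n, m) mod p for prime p, via Lucas' theorem."""
        result = 1
        while n or m:
            n, n_digit = divmod(n, p)
            m, m_digit = divmod(m, p)
            if m_digit > n_digit:
                return 0          # a base-p digit of m exceeds that of n
            result = result * comb(n_digit, m_digit) % p
        return result

    print(binom_mod_p(1000, 300, 7), comb(1000, 300) % 7)   # the two values agree
    ```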

  5. Using the β-binomial distribution to characterize forest health

    Treesearch

    S.J. Zarnoch; R.L. Anderson; R.M. Sheffield

    1995-01-01

    The β-binomial distribution is suggested as a model for describing and analyzing the dichotomous data obtained from programs monitoring the health of forests in the United States. Maximum likelihood estimation of the parameters is given as well as asymptotic likelihood ratio tests. The procedure is illustrated with data on dogwood anthracnose infection (caused...
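
    A minimal sketch of maximum likelihood fitting for the beta-binomial, using simulated plot data rather than the dogwood anthracnose observations (SciPy 1.4 or later provides betabinom):

    ```python
    import numpy as np
    from scipy.stats import betabinom
    from scipy.optimize import minimize

    n_trees = 10                                   # trees examined per plot
    infected = betabinom.rvs(n_trees, 2.0, 5.0, size=300, random_state=3)

    def neg_log_lik(log_params):
        a, b = np.exp(log_params)                  # keep parameters positive
        return -betabinom.logpmf(infected, n_trees, a, b).sum()

    fit = minimize(neg_log_lik, x0=np.log([1.0, 1.0]), method="Nelder-Mead")
    print("MLE (a, b):", np.exp(fit.x).round(2))
    ```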

  6. Integer Solutions of Binomial Coefficients

    ERIC Educational Resources Information Center

    Gilbertson, Nicholas J.

    2016-01-01

    A good formula is like a good story, rich in description, powerful in communication, and eye-opening to readers. The formula presented in this article for determining the coefficients of the binomial expansion of (x + y)n is one such "good read." The beauty of this formula is in its simplicity--both describing a quantitative situation…

  7. Confidence Intervals for Weighted Composite Scores under the Compound Binomial Error Model

    ERIC Educational Resources Information Center

    Kim, Kyung Yong; Lee, Won-Chan

    2018-01-01

    Reporting confidence intervals with test scores helps test users make important decisions about examinees by providing information about the precision of test scores. Although a variety of estimation procedures based on the binomial error model are available for computing intervals for test scores, these procedures assume that items are randomly…

  8. An empirical probability model of detecting species at low densities.

    PubMed

    Delaney, David G; Leung, Brian

    2010-06-01

    False negatives, not detecting things that are actually present, are an important but understudied problem. False negatives are the result of our inability to perfectly detect species, especially those at low density such as endangered species or newly arriving introduced species. They reduce our ability to interpret presence-absence survey data and make sound management decisions (e.g., rapid response). To reduce the probability of false negatives, we need to compare the efficacy and sensitivity of different sampling approaches and quantify an unbiased estimate of the probability of detection. We conducted field experiments in the intertidal zone of New England and New York to test the sensitivity of two sampling approaches (quadrat vs. total area search, TAS), given different target characteristics (mobile vs. sessile). Using logistic regression we built detection curves for each sampling approach that related the sampling intensity and the density of targets to the probability of detection. The TAS approach reduced the probability of false negatives and detected targets faster than the quadrat approach. Mobility of targets increased the time to detection but did not affect detection success. Finally, we interpreted two years of presence-absence data on the distribution of the Asian shore crab (Hemigrapsus sanguineus) in New England and New York, using our probability model for false negatives. The type of experimental approach in this paper can help to reduce false negatives and increase our ability to detect species at low densities by refining sampling approaches, which can guide conservation strategies and management decisions in various areas of ecology such as conservation biology and invasion ecology.

  9. Evaluation of an operational malaria outbreak identification and response system in Mpumalanga Province, South Africa.

    PubMed

    Coleman, Marlize; Coleman, Michael; Mabuza, Aaron M; Kok, Gerdalize; Coetzee, Maureen; Durrheim, David N

    2008-04-27

    To evaluate the performance of a novel malaria outbreak identification system in the epidemic prone rural area of Mpumalanga Province, South Africa, for timely identification of malaria outbreaks and guiding integrated public health responses. Using five years of historical notification data, two binomial thresholds were determined for each primary health care facility in the highest malaria risk area of Mpumalanga province. Whenever the thresholds were exceeded at health facility level (tier 1), primary health care staff notified the malaria control programme, which then confirmed adequate stocks of malaria treatment to manage potential increased cases. The cases were followed up at household level to verify the likely source of infection. The binomial thresholds were reviewed at village/town level (tier 2) to determine whether additional response measures were required. In addition, an automated electronic outbreak identification system at town/village level (tier 2) was integrated into the case notification database (tier 3) to ensure that unexpected increases in case notification were not missed. The performance of these binomial outbreak thresholds was evaluated against other currently recommended thresholds using retrospective data. The acceptability of the system at primary health care level was evaluated through structured interviews with health facility staff. Eighty four percent of health facilities reported outbreaks within 24 hours (n = 95), 92% (n = 104) within 48 hours and 100% (n = 113) within 72 hours. Appropriate response to all malaria outbreaks (n = 113, tier 1, n = 46, tier 2) was achieved within 24 hours. The system was positively viewed by all health facility staff. When compared to other epidemiological systems for a specified 12 month outbreak season (June 2003 to July 2004) the binomial exact thresholds produced one false weekly outbreak, the C-sum 12 weekly outbreaks and the mean + 2 SD nine false weekly outbreaks. Exceeding the binomial level 1 threshold triggered an alert four weeks prior to an outbreak, but exceeding the binomial level 2 threshold identified an outbreak as it occurred. The malaria outbreak surveillance system using binomial thresholds achieved its primary goal of identifying outbreaks early, facilitating appropriate local public health responses aimed at averting a possible large-scale epidemic in a low, and unstable, malaria transmission setting.
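
    The exact construction of the facility-level thresholds is not given in the abstract, but a generic exact binomial alert threshold can be sketched as the smallest case count whose exceedance probability under the historical (baseline) rate falls below a chosen alpha; the numbers below are hypothetical:

    ```python
    from scipy.stats import binom

    def outbreak_threshold(n_tested, baseline_rate, alpha=0.05):
        """Smallest case count c with P(X >= c) < alpha under Binomial(n, baseline)."""
        c = 0
        while binom.sf(c - 1, n_tested, baseline_rate) >= alpha:
            c += 1
        return c

    # Hypothetical facility: 120 suspected cases tested this week; the historical
    # proportion positive for this week of the year is 4%.
    print(outbreak_threshold(n_tested=120, baseline_rate=0.04, alpha=0.05))
    ```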

  10. A binomial modeling approach for upscaling colloid transport under unfavorable conditions: Emergent prediction of extended tailing

    NASA Astrophysics Data System (ADS)

    Hilpert, Markus; Rasmuson, Anna; Johnson, William P.

    2017-07-01

    Colloid transport in saturated porous media is significantly influenced by colloidal interactions with grain surfaces. Near-surface fluid domain colloids experience relatively low fluid drag and relatively strong colloidal forces that slow their downgradient translation relative to colloids in bulk fluid. Near-surface fluid domain colloids may reenter into the bulk fluid via diffusion (nanoparticles) or expulsion at rear flow stagnation zones, they may immobilize (attach) via primary minimum interactions, or they may move along a grain-to-grain contact to the near-surface fluid domain of an adjacent grain. We introduce a simple model that accounts for all possible permutations of mass transfer within a dual pore and grain network. The primary phenomena thereby represented in the model are mass transfer of colloids between the bulk and near-surface fluid domains and immobilization. Colloid movement is described by a Markov chain, i.e., a sequence of trials in a 1-D network of unit cells, which contain a pore and a grain. Using combinatorial analysis, which utilizes the binomial coefficient, we derive the residence time distribution, i.e., an inventory of the discrete colloid travel times through the network and of their probabilities to occur. To parameterize the network model, we performed mechanistic pore-scale simulations in a single unit cell that determined the likelihoods and timescales associated with the above colloid mass transfer processes. We found that intergrain transport of colloids in the near-surface fluid domain can cause extended tailing, which has traditionally been attributed to hydrodynamic dispersion emanating from flow tortuosity of solute trajectories.

  11. Linear approximations of global behaviors in nonlinear systems with moderate or strong noise

    NASA Astrophysics Data System (ADS)

    Liang, Junhao; Din, Anwarud; Zhou, Tianshou

    2018-03-01

    While many physical or chemical systems can be modeled by nonlinear Langevin equations (LEs), dynamical analysis of these systems is challenging in the cases of moderate and strong noise. Here we develop a linear approximation scheme, which can transform an often intractable LE into a linear set of binomial moment equations (BMEs). This scheme provides a feasible way to capture nonlinear behaviors in the sense of probability distribution and is effective even when the noise is moderate or strong. Based on BMEs, we further develop a noise reduction technique, which can effectively handle tough cases where traditional small-noise theories are inapplicable. The overall method not only provides an approximation-based paradigm for the analysis of the local and global behaviors of nonlinear noisy systems but also has a wide range of applications.

  12. A Bayesian method for inferring transmission chains in a partially observed epidemic.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Marzouk, Youssef M.; Ray, Jaideep

    2008-10-01

    We present a Bayesian approach for estimating transmission chains and rates in the Abakaliki smallpox epidemic of 1967. The epidemic affected 30 individuals in a community of 74; only the dates of appearance of symptoms were recorded. Our model assumes stochastic transmission of the infections over a social network. Distinct binomial random graphs model intra- and inter-compound social connections, while disease transmission over each link is treated as a Poisson process. Link probabilities and rate parameters are objects of inference. Dates of infection and recovery comprise the remaining unknowns. Distributions for smallpox incubation and recovery periods are obtained from historicalmore » data. Using Markov chain Monte Carlo, we explore the joint posterior distribution of the scalar parameters and provide an expected connectivity pattern for the social graph and infection pathway.« less

  13. Landau-Zener extension of the Tavis-Cummings model: structure of the solution

    NASA Astrophysics Data System (ADS)

    Sun, Chen; Sinitsyn, Nikolai

    We explore the recently discovered solution of the driven Tavis-Cummings model (DTCM). It describes the interaction of an arbitrary number of two-level systems with a bosonic mode that has a linearly time-dependent frequency. We derive compact and tractable expressions for transition probabilities in terms of the well known special functions. In the new form, our formulas are suitable for fast numerical calculations and analytical approximations. As an application, we obtain the semiclassical limit of the exact solution and compare it to prior approximations. We also reveal a connection between the DTCM and q-deformed binomial statistics. Under the auspices of the National Nuclear Security Administration of the U.S. Department of Energy at Los Alamos National Laboratory under Contract No. DE-AC52-06NA25396. Authors also thank the support from the LDRD program at LANL.

  14. Shot-noise evidence of fractional quasiparticle creation in a local fractional quantum Hall state.

    PubMed

    Hashisaka, Masayuki; Ota, Tomoaki; Muraki, Koji; Fujisawa, Toshimasa

    2015-02-06

    We experimentally identify fractional quasiparticle creation in a tunneling process through a local fractional quantum Hall (FQH) state. The local FQH state is prepared in a low-density region near a quantum point contact in an integer quantum Hall (IQH) system. Shot-noise measurements reveal a clear transition from elementary-charge tunneling at low bias to fractional-charge tunneling at high bias. The fractional shot noise is proportional to T1(1 - T1) over a wide range of T1, where T1 is the transmission probability of the IQH edge channel. This binomial distribution indicates that fractional quasiparticles emerge from the IQH state to be transmitted through the local FQH state. The study of this tunneling process enables us to elucidate the dynamics of Laughlin quasiparticles in FQH systems.

  15. Estimating a Logistic Discrimination Functions When One of the Training Samples Is Subject to Misclassification: A Maximum Likelihood Approach.

    PubMed

    Nagelkerke, Nico; Fidler, Vaclav

    2015-01-01

    The problem of discrimination and classification is central to much of epidemiology. Here we consider the estimation of a logistic regression/discrimination function from training samples, when one of the training samples is subject to misclassification or mislabeling, e.g. diseased individuals are incorrectly classified/labeled as healthy controls. We show that this leads to zero-inflated binomial model with a defective logistic regression or discrimination function, whose parameters can be estimated using standard statistical methods such as maximum likelihood. These parameters can be used to estimate the probability of true group membership among those, possibly erroneously, classified as controls. Two examples are analyzed and discussed. A simulation study explores properties of the maximum likelihood parameter estimates and the estimates of the number of mislabeled observations.

  16. An evaluation of the NASA/GSFC Barnes field spectral reflectometer model 14-758, using signal/noise as a measure of utility

    NASA Astrophysics Data System (ADS)

    Bell, R.; Labovitz, M. L.

    1982-07-01

    A Barnes field spectral reflectometer which collected information in 373 channels covering the region from 0.4 to 2.5 micrometers was assessed for signal utility. A band was judged unsatisfactory if the probability was 0.1 or greater that its signal to noise ratio was less than eight to one. For each of the bands the probability of a noisy observation was estimated under a binomial assumption from a set of field crop spectra covering an entire growing season. A 95% confidence interval was calculated about each estimate and bands whose lower confidence limits were greater than 0.1 were judged unacceptable. As a result, 283 channels were deemed statistically satisfactory. Excluded channels correspond to portions of the electromagnetic spectrum (EMS) where high atmospheric absorption and filter wheel overlap occur. In addition, the analyses uncovered intervals of unsatisfactory detection capability within the blue, red and far infrared regions of vegetation spectra. From the results of the analysis it was recommended that 90 channels monitored by the instrument under consideration be eliminated from future studies. These channels are tabulated and discussed.
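
    The screening rule can be sketched as an exact (Clopper-Pearson) confidence interval for the probability of a noisy observation in each band, flagging the band when the lower 95% limit exceeds 0.1. The counts are hypothetical, and the interval method is assumed rather than stated in the abstract:

    ```python
    from statsmodels.stats.proportion import proportion_confint

    def band_unacceptable(n_noisy, n_obs, cutoff=0.1, alpha=0.05):
        # "beta" gives the exact Clopper-Pearson interval
        lower, upper = proportion_confint(n_noisy, n_obs, alpha=alpha, method="beta")
        return lower > cutoff, (round(lower, 3), round(upper, 3))

    # Hypothetical band: 19 of 80 spectra had a signal/noise ratio below 8:1.
    print(band_unacceptable(19, 80))
    ```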

  17. Impact of a letter-grade program on restaurant sanitary conditions and diner behavior in New York City.

    PubMed

    Wong, Melissa R; McKelvey, Wendy; Ito, Kazuhiko; Schiff, Corinne; Jacobson, J Bryan; Kass, Daniel

    2015-03-01

    We evaluated the impact of the New York City restaurant letter-grading program on restaurant hygiene, food safety practices, and public awareness. We analyzed data from 43,448 restaurants inspected between 2007 and 2013 to measure changes in inspection score and violation citations since program launch in July 2010. We used binomial regression to assess probability of scoring 0 to 13 points (A-range score). Two population-based random-digit-dial telephone surveys assessed public perceptions of the program. After we controlled for repeated restaurant observations, season of inspection, and chain restaurant status, the probability of scoring 0 to 13 points on an unannounced inspection increased 35% (95% confidence interval [CI]=31%, 40%) 3 years after compared with 3 years before grading. There were notable improvements in compliance with some specific requirements, including having a certified kitchen manager on site and being pest-free. More than 91% (95% CI=88%, 94%) of New Yorkers approved of the program and 88% (95% CI=85%, 92%) considered grades in dining decisions in 2012. Restaurant letter grading in New York City has resulted in improved sanitary conditions on unannounced inspection, suggesting that the program is an effective regulatory tool.

  18. An evaluation of the NASA/GSFC Barnes field spectral reflectometer model 14-758, using signal/noise as a measure of utility

    NASA Technical Reports Server (NTRS)

    Bell, R.; Labovitz, M. L.

    1982-01-01

    A Barnes field spectral reflectometer which collected information in 373 channels covering the region from 0.4 to 2.5 micrometers was assessed for signal utility. A band was judged unsatisfactory if the probability was 0.1 or greater that its signal to noise ratio was less than eight to one. For each of the bands the probability of a noisy observation was estimated under a binomial assumption from a set of field crop spectra covering an entire growing season. A 95% confidence interval was calculated about each estimate and bands whose lower confidence limits were greater than 0.1 were judged unacceptable. As a result, 283 channels were deemed statistically satisfactory. Excluded channels correspond to portions of the electromagnetic spectrum (EMS) where high atmospheric absorption and filter wheel overlap occur. In addition, the analyses uncovered intervals of unsatisfactory detection capability within the blue, red and far infrared regions of vegetation spectra. From the results of the analysis it was recommended that 90 channels monitored by the instrument under consideration be eliminated from future studies. These channels are tabulated and discussed.

  19. Possibility and challenges of conversion of current virus species names to Linnaean binomials

    USGS Publications Warehouse

    Thomas, Postler; Clawson, Anna N.; Amarasinghe, Gaya K.; Basler, Christopher F.; Bavari, Sina; Benko, Maria; Blasdell, Kim R.; Briese, Thomas; Buchmeier, Michael J.; Bukreyev, Alexander; Calisher, Charles H.; Chandran, Kartik; Charrel, Remi; Clegg, Christopher S.; Collins, Peter L.; De la Torre, Juan Carlos; DeRisi, Joseph L.; Dietzgen, Ralf G.; Dolnik, Olga; Durrwald, Ralf; Dye, John M.; Easton, Andrew J.; Emonet, Sebastian; Formenty, Pierre; Fouchier, Ron A. M.; Ghedin, Elodie; Gonzalez, Jean-Paul; Harrach, Balazs; Hewson, Roger; Horie, Masayuki; Jiang, Daohong; Kobinger, Gary P.; Kondo, Hideki; Kropinski, Andrew; Krupovic, Mart; Kurath, Gael; Lamb, Robert A.; Leroy, Eric M.; Lukashevich, Igor S.; Maisner, Andrea; Mushegian, Arcady; Netesov, Sergey V.; Nowotny, Norbert; Patterson, Jean L.; Payne, Susan L.; Paweska, Janusz T.; Peters, C.J.; Radoshitzky, Sheli; Rima, Bertus K.; Romanowski, Victor; Rubbenstroth, Dennis; Sabanadzovic, Sead; Sanfacon, Helene; Salvato , Maria; Schwemmle, Martin; Smither, Sophie J.; Stenglein, Mark; Stone, D.M.; Takada , Ayato; Tesh, Robert B.; Tomonaga, Keizo; Tordo, N.; Towner, Jonathan S.; Vasilakis, Nikos; Volchkov, Victor E.; Jensen, Victoria; Walker, Peter J.; Wang, Lin-Fa; Varsani, Arvind; Whitfield , Anna E.; Zerbini, Francisco Murilo; Kuhn, Jens H.

    2017-01-01

    Botanical, mycological, zoological, and prokaryotic species names follow the Linnaean format, consisting of an italicized Latinized binomen with a capitalized genus name and a lower case species epithet (e.g., Homo sapiens). Virus species names, however, do not follow a uniform format, and, even when binomial, are not Linnaean in style. In this thought exercise, we attempted to convert all currently official names of species included in the virus family Arenaviridae and the virus order Mononegavirales to Linnaean binomials, and to identify and address associated challenges and concerns. Surprisingly, this endeavor was not as complicated or time-consuming as even the authors of this article expected when conceiving the experiment.

  20. Predictors and outcomes of non-adherence in patients receiving maintenance hemodialysis.

    PubMed

    Tohme, Fadi; Mor, Maria K; Pena-Polanco, Julio; Green, Jamie A; Fine, Michael J; Palevsky, Paul M; Weisbord, Steven D

    2017-08-01

    Predictors of and outcomes associated with non-adherent behavior among patients on chronic hemodialysis (HD) have been incompletely elucidated. We conducted a post hoc analysis of data from the SMILE trial to identify patient factors associated with non-adherence to dialysis-related treatments and the associations of non-adherence with clinical outcomes. We defined non-adherence as missed HD and abbreviated HD. We used negative binomial regression to model the associations of demographic and clinical factors with measures of non-adherence, and negative binomial and Cox regression to analyze the associations of non-adherence with hospitalizations and mortality, respectively. We followed 286 patients for up to 24 months. Factors independently associated with missing HD included Tuesday/Thursday/Saturday HD schedule [incident rate ratio (IRR) 1.85, p < 0.01], current smoking (IRR 2.22, p < 0.01), higher pain score (IRR 1.04, p < 0.01), lower healthy literacy (IRR 3.01, p < 0.01), lower baseline quality of life (IRR 0.89, p = 0.01), and younger age (IRR 1.35, p < 0.01). Factors independently associated with abbreviating HD included dialysis vintage (IRR 1.07, p < 0.01), higher pain score (IRR 1.02, p < 0.01), current non-smoking (IRR 1.32, p = 0.03), and younger age (IRR 1.22, p < 0.01). Abbreviating HD was independently associated with an increased number of total (IRR 1.70, p < 0.01) and ESRD-related (IRR 1.66, p < 0.01) hospitalizations, while missing HD was independently associated with mortality (HR 2.36, p = 0.04). We identified several previously described and novel factors independently associated with non-adherence to HD-related treatments, and independent associations of non-adherence with hospitalization and mortality. These findings should inform the development and implementation of interventions to improve adherence and reduce health resource utilization.

  1. Inhomogeneity of the density of Parascaris spp. eggs in faeces of individual foals and the use of hypothesis testing for treatment decision making.

    PubMed

    Wilkes, E J A; Cowling, A; Woodgate, R G; Hughes, K J

    2016-10-15

    Faecal egg counts (FEC) are used widely for monitoring of parasite infection in animals, treatment decision-making and estimation of anthelmintic efficacy. When a single count or sample mean is used as a point estimate of the expectation of the egg distribution over some time interval, the variability in the egg density is not accounted for. Although variability, including quantifying sources, of egg count data has been described, the spatiotemporal distribution of nematode eggs in faeces is not well understood. We believe that statistical inference about the mean egg count for treatment decision-making has not been used previously. The aim of this study was to examine the density of Parascaris eggs in solution and faeces and to describe the use of hypothesis testing for decision-making. Faeces from two foals with Parascaris burdens were mixed with magnesium sulphate solution and 30 McMaster chambers were examined to determine the egg distribution in a well-mixed solution. To examine the distribution of eggs in faeces from an individual animal, three faecal piles from a foal with a known Parascaris burden were obtained, from which 81 counts were performed. A single faecal sample was also collected daily from 20 foals on three consecutive days and a FEC was performed on three separate portions of each sample. As appropriate, Poisson or negative binomial confidence intervals for the distribution mean were calculated. Parascaris eggs in a well-mixed solution conformed to a homogeneous Poisson process, while the egg density in faeces was not homogeneous, but aggregated. This study provides an extension from homogeneous to inhomogeneous Poisson processes, leading to an understanding of why Poisson and negative binomial distributions correspondingly provide a good fit for egg count data. The application of one-sided hypothesis tests for decision-making is presented. Copyright © 2016 Elsevier B.V. All rights reserved.
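
    For the homogeneous (well-mixed) case, where chamber counts follow a Poisson process, a one-sided test of whether the mean egg density exceeds a treatment threshold can be sketched as follows; the counts and threshold are invented for illustration:

    ```python
    from scipy.stats import poisson

    def exceeds_threshold(total_eggs, n_chambers, threshold_mean, alpha=0.05):
        """One-sided test of H0: mean eggs per chamber <= threshold_mean."""
        # Under H0 the total over n_chambers is Poisson(n_chambers * threshold_mean)
        p_value = poisson.sf(total_eggs - 1, n_chambers * threshold_mean)  # P(X >= total)
        return p_value < alpha, p_value

    # Hypothetical: 30 McMaster chambers, 52 eggs counted, threshold of 1.2 eggs/chamber.
    print(exceeds_threshold(total_eggs=52, n_chambers=30, threshold_mean=1.2))
    ```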

  2. Estimating influenza and respiratory syncytial virus-associated mortality in Western Kenya using health and demographic surveillance system data, 2007-2013.

    PubMed

    Emukule, Gideon O; Spreeuwenberg, Peter; Chaves, Sandra S; Mott, Joshua A; Tempia, Stefano; Bigogo, Godfrey; Nyawanda, Bryan; Nyaguara, Amek; Widdowson, Marc-Alain; van der Velden, Koos; Paget, John W

    2017-01-01

    Influenza and respiratory syncytial virus (RSV) associated mortality has not been well-established in tropical Africa. We used the negative binomial regression method and the rate-difference method (i.e. deaths during low and high influenza/RSV activity months), to estimate excess mortality attributable to influenza and RSV using verbal autopsy data collected through a health and demographic surveillance system in Western Kenya, 2007-2013. Excess mortality rates were calculated for a) all-cause mortality, b) respiratory deaths (including pneumonia), c) HIV-related deaths, and d) pulmonary tuberculosis (TB) related deaths. Using the negative binomial regression method, the mean annual all-cause excess mortality rate associated with influenza and RSV was 14.1 (95% confidence interval [CI] 0.0-93.3) and 17.1 (95% CI 0.0-111.5) per 100,000 person-years (PY) respectively; and 10.5 (95% CI 0.0-28.5) and 7.3 (95% CI 0.0-27.3) per 100,000 PY for respiratory deaths, respectively. Highest mortality rates associated with influenza were among ≥50 years, particularly among persons with TB (41.6[95% CI 0.0-122.7]); and with RSV were among <5 years. Using the rate-difference method, the excess mortality rate for influenza and RSV was 44.8 (95% CI 36.8-54.4) and 19.7 (95% CI 14.7-26.5) per 100,000 PY, respectively, for all-cause deaths; and 9.6 (95% CI 6.3-14.7) and 6.6 (95% CI 3.9-11.0) per 100,000 PY, respectively, for respiratory deaths. Our study shows a substantial excess mortality associated with influenza and RSV in Western Kenya, especially among children <5 years and older persons with TB, supporting recommendations for influenza vaccination and efforts to develop RSV vaccines.

  3. Effects of Cognition, Function, and Behavioral and Psychological Symptoms on Medicare Expenditures and Health Care Utilization for Persons With Dementia.

    PubMed

    Jutkowitz, Eric; Kane, Robert L; Dowd, Bryan; Gaugler, Joseph E; MacLehose, Richard F; Kuntz, Karen M

    2017-06-01

    Clinical features of dementia (cognition, function, and behavioral/psychological symptoms [BPSD]) may differentially affect Medicare expenditures/health care utilization. We linked cross-sectional data from the Aging, Demographics, and Memory Study to Medicare data to evaluate the association between dementia clinical features among those with dementia and Medicare expenditures/health care utilization (n = 234). Cognition was evaluated using the Mini-Mental State Examination (MMSE). Function was evaluated as the number of functional limitations (0-10). BPSD was evaluated as the number of symptoms (0-12). Expenditures were estimated with a generalized linear model (log-link and gamma distribution). Number of hospitalizations, institutional outpatient visits, and physician visits were estimated with a negative binomial regression. Medicare covered skilled nursing days were estimated with a zero-inflated negative binomial model. Cognition and BPSD were not associated with expenditures. Among individuals with less than seven functional limitations, one additional limitation was associated with $123 (95% confidence interval: $19-$227) additional monthly Medicare spending. Better cognition and poorer function were associated with more hospitalizations among those with an MMSE less than three and less than six functional limitations, respectively. BPSD had no effect on hospitalizations. Poorer function and fewer BPSD were associated with more skilled nursing among individuals with one to seven functional limitations and more than four symptoms, respectively. Cognition had no effect on skilled nursing care. No clinical feature was associated with institutional outpatient care. Of individuals with an MMSE less than 15, poorer cognition was associated with fewer physician visits. Among those with more than six functional limitations, poorer function was associated with fewer physician visits. Poorer function, not cognition or BPSD, was associated with higher Medicare expenditures. © The Author 2017. Published by Oxford University Press on behalf of The Gerontological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  4. Factors associated with dental caries in a group of American Indian children at age 36 months.

    PubMed

    Warren, John J; Blanchette, Derek; Dawson, Deborah V; Marshall, Teresa A; Phipps, Kathy R; Starr, Delores; Drake, David R

    2016-04-01

    Early childhood caries (ECC) is rampant among American Indian children, but there has been relatively little study of this problem. This article reports on risk factors for caries for a group of American Indian children at age 36 months as part of a longitudinal study. Pregnant women from a Northern Plains Tribal community were recruited to participate in a longitudinal study of caries and caries risk factors. Standardized dental examinations were completed on children, and questionnaires were completed by mothers at baseline and when children were 4, 8, 12, 16, 22, 28, and 36 months of age. Examinations were surface-specific for dental caries, and the questionnaires collected data on demographic, dietary, and behavioral factors. Nonparametric bivariate tests and logistic regression models were used to identify risk factors for caries at 36 months, and negative binomial regression was used to identify factors related to caries severity (dmf counts). Among the 232 children, caries prevalence for cavitated lesions was 80%, with an additional 15% having only noncavitated lesions. The mean dmfs was 9.6, and of the total dmfs, nearly 62% of affected surfaces were decayed, 31% were missing, and 7% were filled. Logistic regression identified higher added-sugar beverage consumption, younger maternal age at baseline, higher maternal DMFS at baseline, and greater number of people in the household as significant (P < 0.05) risk factors. Negative binomial regression found that only maternal DMFS was associated with child dmf counts. By the age of 36 months, dental caries is nearly universal in this population of American Indian children. Caries risk factors included sugared beverage consumption, greater household size, and maternal factors, but further analyses are needed to better understand caries in this population. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  5. Factors associated with the frequency of monitoring of liver enzymes, renal function and lipid laboratory markers among individuals initiating combination antiretroviral therapy: a cohort study.

    PubMed

    Gillis, Jennifer; Bayoumi, Ahmed M; Burchell, Ann N; Cooper, Curtis; Klein, Marina B; Loutfy, Mona; Machouf, Nima; Montaner, Julio Sg; Tsoukas, Chris; Hogg, Robert S; Raboud, Janet

    2015-10-26

    As the average age of the HIV-positive population increases, there is increasing need to monitor patients for the development of comorbidities as well as for drug toxicities. We examined factors associated with the frequency of measurement of liver enzymes, renal function tests, and lipid levels among participants of the Canadian Observational Cohort (CANOC) collaboration which follows people who initiated HIV antiretroviral therapy in 2000 or later. We used zero-inflated negative binomial regression models to examine the associations of demographic and clinical characteristics with the rates of measurement during follow-up. Generalized estimating equations with a logit link were used to examine factors associated with gaps of 12 months or more between measurements. Electronic laboratory data were available for 3940 of 7718 CANOC participants. The median duration of electronic follow-up was 3.5 years. The median (interquartile) rates of tests per year were 2.76 (1.60, 3.73), 2.55 (1.44, 3.38) and 1.42 (0.50, 2.52) for liver, renal and lipid parameters, respectively. In multivariable zero-inflated negative binomial regression models, individuals infected through injection drug use (IDU) were significantly less likely to have any measurements. Among participants with at least one measurement, rates of measurement of liver, renal and lipid tests were significantly lower for younger individuals and Aboriginal Peoples. Hepatitis C co-infected individuals with a history of IDU had lower rates of measurement and were at greater risk of having 12 month gaps between measurements. Hepatitis C co-infected participants infected through IDU were at increased risk of gaps in testing, despite publicly funded health care and increased risk of comorbid conditions. This should be taken into consideration in analyses examining factors associated with outcomes based on laboratory parameters.

  6. Estimating the prevalence and intensity of Schistosoma mansoni infection among rural communities in Western Tanzania: The influence of sampling strategy and statistical approach

    PubMed Central

    Bakuza, Jared S.; Denwood, Matthew J.; Nkwengulila, Gamba

    2017-01-01

    Background Schistosoma mansoni is a parasite of major public health importance in developing countries, where it causes a neglected tropical disease known as intestinal schistosomiasis. However, the distribution of the parasite within many endemic regions is currently unknown, which hinders effective control. The purpose of this study was to characterize the prevalence and intensity of infection of S. mansoni in a remote area of western Tanzania. Methodology/Principal findings Stool samples were collected from 192 children and 147 adults residing in Gombe National Park and four nearby villages. Children were actively sampled in local schools, and adults were sampled passively by voluntary presentation at the local health clinics. The two datasets were therefore analysed separately. Faecal worm egg count (FWEC) data were analysed using negative binomial and zero-inflated negative binomial (ZINB) models with explanatory variables of site, sex, and age. The ZINB models indicated that a substantial proportion of the observed zero FWEC reflected a failure to detect eggs in truly infected individuals, meaning that the estimated true prevalence was much higher than the apparent prevalence as calculated based on the simple proportion of non-zero FWEC. For the passively sampled data from adults, the data were consistent with close to 100% true prevalence of infection. Both the prevalence and intensity of infection differed significantly between sites, but there were no significant associations with sex or age. Conclusions/Significance Overall, our data suggest a more widespread distribution of S. mansoni in this part of Tanzania than was previously thought. The apparent prevalence estimates substantially under-estimated the true prevalence as determined by the ZINB models, and the two types of sampling strategies also resulted in differing conclusions regarding prevalence of infection. We therefore recommend that future surveillance programmes designed to assess risk factors should use active sampling whenever possible, in order to avoid the self-selection bias associated with passive sampling. PMID:28934206
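
    Where ZINB estimates like these are reported, the zero-inflation (logit) component carries the "true prevalence" interpretation: it models the probability that a zero egg count comes from a genuinely uninfected person rather than from a failed detection. The sketch below is not the authors' code and assumes hypothetical file and column names (fwec, site, sex, age); it only illustrates how such a model can be fitted in Python with statsmodels.

        # Minimal ZINB sketch for faecal worm egg counts (hypothetical data).
        import pandas as pd
        import statsmodels.api as sm
        from statsmodels.discrete.count_model import ZeroInflatedNegativeBinomialP

        df = pd.read_csv("fwec_children.csv")          # hypothetical input file
        X = sm.add_constant(pd.get_dummies(df[["site", "sex"]], drop_first=True).astype(float))
        X["age"] = df["age"].astype(float)

        # p=2 selects the NB2 parameterisation; the same design matrix is reused
        # for the count part and for the logit zero-inflation part (exog_infl).
        zinb = ZeroInflatedNegativeBinomialP(df["fwec"], X, exog_infl=X, p=2)
        res = zinb.fit(method="bfgs", maxiter=500)
        print(res.summary())

        # The inflate_* coefficients describe the probability of a structural zero
        # (a truly uninfected individual); one minus that probability is the model's
        # estimate of true prevalence, which is why it can exceed the simple share
        # of non-zero egg counts.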

  7. Random parameter models of interstate crash frequencies by severity, number of vehicles involved, collision and location type.

    PubMed

    Venkataraman, Narayan; Ulfarsson, Gudmundur F; Shankar, Venky N

    2013-10-01

    A nine-year (1999-2007) continuous panel of crash histories on interstates in Washington State, USA, was used to estimate random parameter negative binomial (RPNB) models for various aggregations of crashes. A total of 21 different models were assessed in terms of four ways to aggregate crashes, by: (a) severity, (b) number of vehicles involved, (c) crash type, and (d) location characteristics. The models within these aggregations include specifications for all severities (property damage only, possible injury, evident injury, disabling injury, and fatality), number of vehicles involved (one-vehicle to five-or-more-vehicle), crash type (sideswipe, same direction, overturn, head-on, fixed object, rear-end, and other), and location types (urban interchange, rural interchange, urban non-interchange, rural non-interchange). A total of 1153 directional road segments comprising the seven Washington State interstates were analyzed, yielding statistical models of crash frequency based on 10,377 observations. These results suggest that in general there was a significant improvement in log-likelihood when using RPNB compared to a fixed parameter negative binomial baseline model. Heterogeneity effects are most noticeable for lighting type, road curvature, and traffic volume (ADT). Median lighting or right-side lighting are linked to increased crash frequencies in many models for more than half of the road segments compared to both-sides lighting. Both-sides lighting thereby appears to generally lead to a safety improvement. Traffic volume has a random parameter but the effect is always toward increasing crash frequencies as expected. However, the fact that the effect is random shows that the effect of traffic volume on crash frequency is complex and varies by road segment. The number of lanes has a random parameter effect only in the interchange type models. The results show that road segment-specific insights into crash frequency occurrence can lead to improved design policy and project prioritization. Copyright © 2013 Elsevier Ltd. All rights reserved.

  8. Analyzing hospitalization data: potential limitations of Poisson regression.

    PubMed

    Weaver, Colin G; Ravani, Pietro; Oliver, Matthew J; Austin, Peter C; Quinn, Robert R

    2015-08-01

    Poisson regression is commonly used to analyze hospitalization data when outcomes are expressed as counts (e.g. number of days in hospital). However, data often violate the assumptions on which Poisson regression is based. More appropriate extensions of this model, while available, are rarely used. We compared hospitalization data between 206 patients treated with hemodialysis (HD) and 107 treated with peritoneal dialysis (PD) using Poisson regression and compared results from standard Poisson regression with those obtained using three other approaches for modeling count data: negative binomial (NB) regression, zero-inflated Poisson (ZIP) regression and zero-inflated negative binomial (ZINB) regression. We examined the appropriateness of each model and compared the results obtained with each approach. During a mean 1.9 years of follow-up, 183 of 313 patients (58%) were never hospitalized (indicating an excess of 'zeros'). The data also displayed overdispersion (variance greater than mean), violating another assumption of the Poisson model. Using four criteria, we determined that the NB and ZINB models performed best. According to these two models, patients treated with HD experienced similar hospitalization rates as those receiving PD {NB rate ratio (RR): 1.04 [bootstrapped 95% confidence interval (CI): 0.49-2.20]; ZINB summary RR: 1.21 (bootstrapped 95% CI 0.60-2.46)}. Poisson and ZIP models fit the data poorly and had much larger point estimates than the NB and ZINB models [Poisson RR: 1.93 (bootstrapped 95% CI 0.88-4.23); ZIP summary RR: 1.84 (bootstrapped 95% CI 0.88-3.84)]. We found substantially different results when modeling hospitalization data, depending on the approach used. Our results argue strongly for a sound model selection process and improved reporting around statistical methods used for modeling count data. © The Author 2015. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved.
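
    The comparison described here maps directly onto the four count models available in statsmodels. The sketch below is illustrative only (hypothetical file and column names: days for the count outcome, modality for HD versus PD) and simply lines the models up by log-likelihood and AIC, the kind of criteria used alongside checks for overdispersion and excess zeros.

        # Fit Poisson, NB, ZIP and ZINB to a count outcome and compare fit criteria.
        import pandas as pd
        import statsmodels.api as sm
        from statsmodels.discrete.count_model import (
            ZeroInflatedPoisson,
            ZeroInflatedNegativeBinomialP,
        )

        df = pd.read_csv("hospital_days.csv")                 # hypothetical data set
        y = df["days"]
        X = sm.add_constant((df["modality"] == "HD").astype(float))

        fits = {
            "Poisson": sm.Poisson(y, X).fit(disp=0),
            "NB":      sm.NegativeBinomial(y, X).fit(disp=0),
            "ZIP":     ZeroInflatedPoisson(y, X, exog_infl=X).fit(disp=0),
            "ZINB":    ZeroInflatedNegativeBinomialP(y, X, exog_infl=X, p=2).fit(disp=0),
        }
        for name, res in fits.items():
            # Lower AIC is better; overdispersion and excess zeros typically
            # favour the NB and ZINB variants, as in the study above.
            print(f"{name:8s}  loglik = {res.llf:10.1f}  AIC = {res.aic:9.1f}")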

  9. Density of wild prey modulates lynx kill rates on free-ranging domestic sheep.

    PubMed

    Odden, John; Nilsen, Erlend B; Linnell, John D C

    2013-01-01

    Understanding the factors shaping the dynamics of carnivore-livestock conflicts is vital to facilitate large carnivore conservation in multi-use landscapes. We investigated how the density of their main wild prey, roe deer Capreolus capreolus, modulates individual Eurasian lynx Lynx lynx kill rates on free-ranging domestic sheep Ovis aries across a range of sheep and roe deer densities. Lynx kill rates on free-ranging domestic sheep were collected in south-eastern Norway from 1995 to 2011 along a gradient of different livestock and wild prey densities using VHF and GPS telemetry. We used zero-inflated negative binomial (ZINB) models including lynx sex, sheep density and an index of roe deer density as explanatory variables to model observed kill rates on sheep, and ranked the models based on their AICc values. The model including the effects of lynx sex and sheep density in the zero-inflation model and the effect of lynx sex and roe deer density in the negative binomial part received most support. Irrespective of sheep density and sex, we found the lowest sheep kill rates in areas with high densities of roe deer. As roe deer density decreased, males killed sheep at higher rates, and this pattern held for both high and low sheep densities. Similarly, females killed sheep at higher rates in areas with high densities of sheep and low densities of roe deer. However, when sheep densities were low females rarely killed sheep irrespective of roe deer density. Our quantification of depredation rates can be the first step towards establishing fairer compensation systems based on more accurate and area specific estimation of losses. This study demonstrates how we can use ecological theory to predict where losses of sheep will be greatest, and can be used to identify areas where mitigation measures are most likely to be needed.

  10. Density of Wild Prey Modulates Lynx Kill Rates on Free-Ranging Domestic Sheep

    PubMed Central

    Odden, John; Nilsen, Erlend B.; Linnell, John D. C.

    2013-01-01

    Understanding the factors shaping the dynamics of carnivore–livestock conflicts is vital to facilitate large carnivore conservation in multi-use landscapes. We investigated how the density of their main wild prey, roe deer Capreolus capreolus, modulates individual Eurasian lynx Lynx lynx kill rates on free-ranging domestic sheep Ovis aries across a range of sheep and roe deer densities. Lynx kill rates on free-ranging domestic sheep were collected in south-eastern Norway from 1995 to 2011 along a gradient of different livestock and wild prey densities using VHF and GPS telemetry. We used zero-inflated negative binomial (ZINB) models including lynx sex, sheep density and an index of roe deer density as explanatory variables to model observed kill rates on sheep, and ranked the models based on their AICc values. The model including the effects of lynx sex and sheep density in the zero-inflation model and the effect of lynx sex and roe deer density in the negative binomial part received most support. Irrespective of sheep density and sex, we found the lowest sheep kill rates in areas with high densities of roe deer. As roe deer density decreased, males killed sheep at higher rates, and this pattern held for both high and low sheep densities. Similarly, females killed sheep at higher rates in areas with high densities of sheep and low densities of roe deer. However, when sheep densities were low females rarely killed sheep irrespective of roe deer density. Our quantification of depredation rates can be the first step towards establishing fairer compensation systems based on more accurate and area specific estimation of losses. This study demonstrates how we can use ecological theory to predict where losses of sheep will be greatest, and can be used to identify areas where mitigation measures are most likely to be needed. PMID:24278123

  11. Differential Associations of UPPS-P Impulsivity Traits With Alcohol Problems.

    PubMed

    McCarty, Kayleigh N; Morris, David H; Hatz, Laura E; McCarthy, Denis M

    2017-07-01

    The UPPS-P model posits that impulsivity comprises five factors: positive urgency, negative urgency, lack of planning, lack of perseverance, and sensation seeking. Negative and positive urgency are the traits most consistently associated with alcohol problems. However, previous work has examined alcohol problems either individually or in the aggregate, rather than examining multiple problem domains simultaneously. Recent work has also questioned the utility of distinguishing between positive and negative urgency, as this distinction did not meaningfully differ in predicting domains of psychopathology. The aims of this study were to address these issues by (a) testing unique associations of UPPS-P with specific domains of alcohol problems and (b) determining the utility of distinguishing between positive and negative urgency as risk factors for specific alcohol problems. Associations between UPPS-P traits and alcohol problem domains were examined in two cross-sectional data sets using negative binomial regression models. In both samples, negative urgency was associated with social/interpersonal, self-perception, risky behaviors, and blackout drinking problems. Positive urgency was associated with academic/occupational and physiological dependence problems. Both urgency traits were associated with impaired control and self-care problems. Associations for other UPPS-P traits did not replicate across samples. Results indicate that negative and positive urgency have differential associations with alcohol problem domains. Results also suggest a distinction between the type of alcohol problems associated with these traits: negative urgency was associated with problems experienced during a drinking episode, whereas positive urgency was associated with alcohol problems that result from longer-term drinking trends.

  12. Influx of multidrug-resistant organisms by country-to-country transfer of patients.

    PubMed

    Mutters, Nico T; Günther, Frank; Sander, Anja; Mischnik, Alexander; Frank, Uwe

    2015-10-28

    Multidrug-resistant organisms (MDRO) are a worldwide problem. International migration and travel facilitate the spread of MDRO. Therefore the goal of our study was to assess the risk of influx of MDRO from patients transferred to one of Central Europe's largest hospitals from abroad. A mono-centre study was conducted. All patients transferred from other countries were screened; additional data was collected on comorbidities, etc. Presence of carbapenemases of multidrug-resistant Gram-negatives was confirmed by PCR. The association between length of stay, being colonized and/or infected by a MDRO, country of origin, diagnosis and other factors was assessed by binomial regression analyses. From 2012 to 2013, one fifth of all patients were colonized with MDRO (Methicillin-resistant Staphylococcus aureus [4.1 %], Vancomycin-resistant Enterococci [2.9 %], multidrug-resistant Gram-negatives [12.8 %] and extensively drug-resistant Gram-negatives [3.4 %]). The Gram-negatives carried a variety of carbapenemases including OXA, VIM, KPC and NDM. The length of stay was significantly prolonged by 77.2 % in patients colonized with a MDRO, compared to those not colonized (p<0.0001). Country-to-Country transfer of patients to European hospitals represents a high risk of introduction of MDRO and infection control specialists should endorse containment and screening measures.

  13. How does the emotive content of televised anti-smoking mass media campaigns influence monthly calls to the NHS Stop Smoking helpline in England?

    PubMed

    Richardson, Sol; Langley, Tessa; Szatkowski, Lisa; Sims, Michelle; Gilmore, Anna; McNeill, Ann; Lewis, Sarah

    2014-12-01

    To investigate the effects of different types of televised mass media campaign content on calls to the English NHS Stop Smoking helpline. We used UK government-funded televised tobacco control campaigns from April 2005 to April 2010, categorised as either "positive" (eliciting happiness, satisfaction or hope) or "negative" (eliciting fear, guilt or disgust). We built negative binomial generalised additive models (GAMs) with linear and smooth terms for monthly per capita exposure to each campaign type (expressed as Gross Ratings Points, or GRPs) to determine their effect on calls in the same month. We adjusted for seasonal trends, inflation-adjusted weighted average cigarette prices and other tobacco control policies. We found non-linear associations between exposure to positive and negative emotive campaigns and quitline calls. The rate of calls increased more than 50% as exposure to positive campaigns increased from 0 to 400 GRPs (rate ratio: 1.58, 95% CI: 1.25-2.01). An increase in calls in response to negative emotive campaigns was only apparent after monthly exposure exceeded 400 GRPs. While positive campaigns were most effective at increasing quitline calls, those with negative emotive content were also found to impact on call rates but only at higher levels of exposure. Copyright © 2014. Published by Elsevier Inc.
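
    A negative binomial GAM of this kind has no exact drop-in equivalent in Python's statsmodels, but a rough approximation (not the authors' model) replaces the smooth terms with B-spline bases inside an NB GLM. The sketch assumes hypothetical column names (calls, pos_grp, neg_grp, price, month) and holds the NB dispersion parameter fixed, which a full GAM fit would estimate.

        # Approximate NB "GAM": spline bases for campaign exposure inside an NB GLM.
        import pandas as pd
        import statsmodels.api as sm
        import statsmodels.formula.api as smf

        df = pd.read_csv("quitline_calls.csv")        # hypothetical monthly series

        model = smf.glm(
            "calls ~ bs(pos_grp, df=4) + bs(neg_grp, df=4) + price + C(month)",
            data=df,
            family=sm.families.NegativeBinomial(alpha=1.0),  # alpha fixed, not estimated
        )
        res = model.fit()
        print(res.summary())

        # Predicting over a grid of pos_grp values traces the non-linear dose-response
        # shape (a steep rise up to roughly 400 GRPs) described in the abstract.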

  14. Hospital support services and the impacts of outsourcing on occupational health and safety.

    PubMed

    Siganporia, Pearl; Astrakianakis, George; Alamgir, Hasanat; Ostry, Aleck; Nicol, Anne-Marie; Koehoorn, Mieke

    2016-10-01

    Outsourcing labor is linked to negative impacts on occupational health and safety (OHS). In British Columbia, Canada, provincial health care service providers outsource support services such as cleaners and food service workers (CFSWs) to external contractors. This study investigates the impact of outsourcing on the occupational health safety of hospital CFSWs through a mixed methods approach. Worker's compensation data for hospital CFSWs were analyzed by negative binomial and multiple linear regressions supplemented by iterative thematic analysis of telephone interviews of the same job groups. Non-significant decreases in injury rates and days lost per injury were observed in outsourced CFSWs post outsourcing. Significant decreases (P < 0.05) were observed in average costs per injury for cleaners post outsourcing. Outsourced workers interviewed implied instances of underreporting workplace injuries. This mixed methods study describes the impact of outsourcing on OHS of healthcare workers in British Columbia. Results will be helpful for policy-makers and workplace regulators to assess program effectiveness for outsourced workers.

  15. Hospital support services and the impacts of outsourcing on occupational health and safety

    PubMed Central

    Alamgir, Hasanat; Ostry, Aleck; Nicol, Anne-Marie; Koehoorn, Mieke

    2016-01-01

    Background Outsourcing labor is linked to negative impacts on occupational health and safety (OHS). In British Columbia, Canada, provincial health care service providers outsource support services such as cleaners and food service workers (CFSWs) to external contractors. Objectives This study investigates the impact of outsourcing on the occupational health safety of hospital CFSWs through a mixed methods approach. Methods Worker’s compensation data for hospital CFSWs were analyzed by negative binomial and multiple linear regressions supplemented by iterative thematic analysis of telephone interviews of the same job groups. Results Non-significant decreases in injury rates and days lost per injury were observed in outsourced CFSWs post outsourcing. Significant decreases (P < 0.05) were observed in average costs per injury for cleaners post outsourcing. Outsourced workers interviewed implied instances of underreporting workplace injuries. Conclusions This mixed methods study describes the impact of outsourcing on OHS of healthcare workers in British Columbia. Results will be helpful for policy-makers and workplace regulators to assess program effectiveness for outsourced workers. PMID:27696988

  16. Supply-side response to declining heroin purity: fentanyl overdose episode in New Jersey.

    PubMed

    Hempstead, Katherine; Yildirim, Emel O

    2014-06-01

    The inelastic price demand observations characteristic of illegal drug markets have led to the conclusion that the burden of a negative supply shock would be completely reflected to consumers. This paper argues that the increasing availability of prescription opioids may threaten heroin sellers' profit margin and force them to find alternative methods to compensate buyers in the event of a supply shock. We investigate the 2006 fentanyl overdose episode in New Jersey and argue that the introduction of non-pharmaceutical fentanyl, its spatial distribution, and the timing of overdose deaths may have been related to trends in heroin purity. Using medical examiner data, as well as data from the Drug Enforcement Administration, Office of Diversion Control on retail sales of prescription opioids in a negative binomial specification, we show that month-to-month fluctuations in heroin purity have a significant effect on fentanyl-related overdoses, particularly in those areas where prescription opioids are highly available. Copyright © 2013 John Wiley & Sons, Ltd.

  17. Factors affecting the probability of first year medical student dropout in the UK: a logistic analysis for the intake cohorts of 1980-92.

    PubMed

    Arulampalam, Wiji; Naylor, Robin; Smith, Jeremy

    2004-05-01

    In the context of the 1997 Report of the Medical Workforce Standing Advisory Committee, it is important that we develop an understanding of the factors influencing medical school retention rates. To analyse the determinants of the probability that an individual medical student will drop out of medical school during their first year of study. Binomial and multinomial logistic regression analysis of individual-level administrative data on 51 810 students in 21 medical schools in the UK for the intake cohorts of 1980-92 was performed. The overall average first year dropout rate over the period 1980-92 was calculated to be 3.8%. We found that the probability that a student would drop out of medical school during their first year of study was influenced significantly by both the subjects studied at A-level and by the scores achieved. For example, achieving 1 grade higher in biology, chemistry or physics reduced the dropout probability by 0.38% points, equivalent to a fall of 10%. We also found that males were about 8% more likely to drop out than females. The medical school attended also had a significant effect on the estimated dropout probability. Indicators of both the social class and the previous school background of the student were largely insignificant. Policies aimed at increasing the size of the medical student intake in the UK and of widening access to students from non-traditional backgrounds should be informed by evidence that student dropout probabilities are sensitive to measures of A-level attainment, such as subject studied and scores achieved. If traditional entry requirements or standards are relaxed, then this is likely to have detrimental effects on medical schools' retention rates unless accompanied by appropriate measures such as focussed student support.

  18. Alcohol expectancies and alcohol outcomes: effects of the use of protective behavioral strategies.

    PubMed

    Grazioli, Véronique S; Lewis, Melissa A; Garberson, Lisa A; Fossos-Wong, Nicole; Lee, Christine M; Larimer, Mary E

    2015-05-01

    Alcohol expectancies (AEs) are positively associated with drinking behaviors, whereas the use of protective behavioural strategies (PBS) is negatively related to alcohol outcomes among young adults. PBS have been shown to weaken relationships between some alcohol risk factors and alcohol outcomes. This study aimed to examine longitudinally the moderating effect of PBS on the relationships between AEs and alcohol outcomes among young adults. Participants (N = 188; 61.7% female) were U.S. young adults participating in a larger longitudinal study. Measures of PBS, AEs, alcohol use, and related consequences were used from the baseline and 12-month follow-up assessments. Negative binomial hurdle models found that PBS (total score) significantly moderated the relationship between positive AEs and consequences, such that among high school seniors endorsing higher positive AEs, those using more PBS in high school reported fewer negative consequences 1 year later. PBS (Manner of Drinking) also moderated the relationship between negative AEs and alcohol use, revealing the use of PBS in high school as having a protective function against later drinking among participants with high positive AEs. Last, PBS (Serious Harm Reduction) significantly moderated the associations between positive AEs and alcohol use and between negative AEs and consequences, such that participants with higher AEs and higher PBS use in high school were at greatest risk for drinking and experiencing negative consequences later. Overall, these findings suggest that PBS use may be protective by weakening relationships between positive AEs and alcohol outcomes. Limitations and future directions are discussed.
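
    A negative binomial hurdle model has two parts: a logit for whether any negative consequences occurred, and a count model for how many, fitted only to those reporting at least one. The simplified sketch below (hypothetical column names; not the authors' code) uses an ordinary NB for the second part, whereas a full hurdle specification would use a zero-truncated NB.

        # Simplified two-part (hurdle-style) sketch for consequence counts.
        import pandas as pd
        import statsmodels.api as sm

        df = pd.read_csv("alcohol_followup.csv")            # hypothetical data set
        X = sm.add_constant(df[["pos_expectancies", "pbs_total"]].astype(float))
        X["pbs_x_posae"] = df["pbs_total"] * df["pos_expectancies"]   # moderation term

        # Part 1: any consequences vs none.
        any_part = sm.Logit((df["consequences"] > 0).astype(int), X).fit(disp=0)

        # Part 2: number of consequences among those with at least one.
        pos = df["consequences"] > 0
        count_part = sm.NegativeBinomial(df.loc[pos, "consequences"], X.loc[pos]).fit(disp=0)

        # A negative interaction coefficient mirrors the moderation reported above:
        # more PBS use weakens the link between positive expectancies and consequences.
        print(any_part.summary())
        print(count_part.summary())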

  19. Factors Associated With Bites to a Child From a Dog Living in the Same Home: A Bi-National Comparison.

    PubMed

    Messam, Locksley L McV; Kass, Philip H; Chomel, Bruno B; Hart, Lynette A

    2018-01-01

    We conducted a veterinary clinic-based retrospective cohort study aimed at identifying child-, dog-, and home-environment factors associated with dog bites to children aged 5-15 years old living in the same home as a dog in Kingston, Jamaica (236) and San Francisco, USA (61). Secondarily, we wished to compare these factors to risk factors for dog bites to the general public. Participant information was collected via interviewer-administered questionnaire using proxy respondents. Data were analyzed using log-binomial regression to estimate relative risks and associated 95% confidence intervals (CIs) for each exposure-dog bite relationship. Exploiting the correspondence between X% confidence intervals and X% Bayesian probability intervals obtained using a uniform prior distribution, for each exposure, we calculated probabilities of the true (population) RRs ≥ 1.25 or ≤0.8, for positive or negative associations, respectively. Boys and younger children were at higher risk for bites, than girls and older children, respectively. Dogs living in a home with no yard space were at an elevated risk (RR = 2.97; 95% CI: 1.06-8.33) of biting a child living in the same home, compared to dogs that had yard space. Dogs routinely allowed inside for some portion of the day (RR = 3.00; 95% CI: 0.94-9.62) and dogs routinely allowed to sleep in a family member's bedroom (RR = 2.82; 95% CI: 1.17-6.81) were also more likely to bite a child living in the home than those that were not. In San Francisco, but less so in Kingston, bites were inversely associated with the number of children in the home. While in Kingston, but not in San Francisco, smaller breeds and dogs obtained for companionship were at higher risk for biting than larger breeds and dogs obtained for protection, respectively. Overall, for most exposures, the observed associations were consistent with population RRs of practical importance (i.e., RRs ≥ 1.25 or ≤0.8). Finally, we found substantial consistency between risk factors for bites to children and previously reported risk factors for general bites.
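
    Log-binomial regression is a GLM with a binomial family and a log link, so exponentiated coefficients are relative risks rather than odds ratios, and the CI/posterior correspondence the authors exploit can be read off the log-RR scale. The sketch below is not the study's code: column names are hypothetical, the link class name follows recent statsmodels versions, and the flat-prior probability calculation is only the normal approximation implied by that correspondence.

        # Log-binomial RR sketch with a flat-prior probability that RR >= 1.25.
        import numpy as np
        import pandas as pd
        import statsmodels.api as sm
        from scipy.stats import norm

        df = pd.read_csv("dog_bite_cohort.csv")             # hypothetical data set
        X = sm.add_constant(df[["no_yard", "sleeps_in_bedroom", "child_age", "child_male"]].astype(float))

        # Binomial family + log link => exp(coefficient) is a relative risk.
        res = sm.GLM(
            df["bitten"], X,
            family=sm.families.Binomial(link=sm.families.links.Log()),
        ).fit()
        print(np.exp(res.params))

        # Normal approximation on the log-RR scale, read with a uniform prior,
        # gives the probability that the population RR is at least 1.25.
        b, se = res.params["no_yard"], res.bse["no_yard"]
        print("P(RR >= 1.25) ~", 1 - norm.cdf((np.log(1.25) - b) / se))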

  20. What Is Threatening the Effectiveness of Insecticide-Treated Bednets? A Case-Control Study of Environmental, Behavioral, and Physical Factors Associated with Prevention Failure.

    PubMed

    Obala, Andrew A; Mangeni, Judith Nekesa; Platt, Alyssa; Aswa, Daniel; Abel, Lucy; Namae, Jane; Prudhomme O'Meara, Wendy

    2015-01-01

    Insecticide-treated nets are the cornerstone of global malaria control and have been shown to reduce malaria morbidity by 50-60%. However, some areas are experiencing a resurgence in malaria following successful control. We describe an efficacy decay framework to understand why high malaria burden persists even under high ITN coverage in a community in western Kenya. We enrolled 442 children hospitalized with malaria and paired them with age, time, village and gender-matched controls. We completed comprehensive household and neighborhood assessments including entomological surveillance. The indicators are grouped into five domains in an efficacy decay framework: ITN ownership, compliance, physical integrity, vector susceptibility and facilitating factors. After variable selection, case-control data were analyzed using conditional logistic regression models and mosquito data were analyzed using negative binomial regression. Predictive margins were calculated from logistic regression models. Measures of ITN coverage and physical integrity were not correlated with hospitalized malaria in our study. However, consistent ITN use (Adjusted Odds Ratio (AOR) = 0.23, 95%CI: 0.12-0.43), presence of nearby larval sites (AOR = 1.137, 95%CI: 1.02-1.27), and specific types of crops (AOR (grains) = 0.446, 95%CI: 0.24-0.82) were significantly correlated with malaria amongst children who owned an ITN. The odds of hospitalization for febrile malaria nearly tripled when one other household member had symptomatic malaria infection (AOR = 2.76, 95%CI: 1.83-4.18). Overall, perfect household adherence could reduce the probability of hospitalization for malaria to less than 30% (95%CI:0.12-0.46) and adjusting environmental factors such as elimination of larval sites and growing grains nearby could reduce the probability of hospitalization for malaria to less than 20% (95%CI:0.04-0.31). Availability of ITNs is not the bottleneck for malaria prevention in this community. Behavior change interventions to improve compliance and environmental management of mosquito breeding habitats may greatly enhance ITN efficacy. A better understanding of the relationship between agriculture and mosquito survival and feeding success is needed.

  1. The variation in the health status of immigrants and Italians during the global crisis and the role of socioeconomic factors.

    PubMed

    Petrelli, Alessio; Di Napoli, Anteo; Rossi, Alessandra; Costanzo, Gianfranco; Mirisola, Concetta; Gargiulo, Lidia

    2017-06-12

    The recent global economic and financial crisis especially affected the most vulnerable social groups. The objective of this study was to investigate variation in the self-perceived health status of Italians and immigrants during the global economic crisis, focusing on demographic and socioeconomic factors. Through a cross-sectional design we analyzed the national sample of the multipurpose surveys "Health conditions and use of health services" (2005 and 2013) conducted by the Italian National Institute of Statistics (ISTAT). Physical Component Summary (PCS) and Mental Component Summary (MCS) scores, derived from the SF-12 questionnaire, were taken as the study outcomes, dichotomizing each variable's distribution at the 1st quartile. Prevalence rate ratios (PRR) were estimated through log-binomial regression models, stratified by citizenship and gender, evaluating the association of PCS and MCS with survey year, adjusting for age, educational level, employment status, self-perceived economic resources, smoking habits, and body mass index. From 2005 to 2013 the proportion of people not employed or reporting scarce/insufficient economic resources increased, especially among men, in particular immigrants. Compared with 2005, in 2013 we observed among Italians a significantly lower probability of worse PCS (PRR = 0.96 both for males and females), while no differences were observed among immigrants; a higher probability of worse MCS was observed, particularly among men (Italians: PRR = 1.26; 95%CI: 1.22-1.29; immigrants: PRR = 1.19; 95%CI: 1.03-1.38). Self-perceived scarce/insufficient economic resources were strongly and significantly associated with worse PCS and MCS for all subgroups. Lower educational level was strongly associated with worse PCS in Italians and slightly associated with worse MCS for all subgroups. Not being employed was associated with worse health status, especially mental health among men. Our findings support the hypothesis that the global economic crisis could have negatively affected the health status, particularly the mental health, of Italians and immigrants. Furthermore, the results suggest that socioeconomic inequalities increased in the dimension of economic resource availability. In a context of public health resources limited by the financial crisis, policy decision makers and health service managers must face the challenge of equity in health.

  2. Raw and Central Moments of Binomial Random Variables via Stirling Numbers

    ERIC Educational Resources Information Center

    Griffiths, Martin

    2013-01-01

    We consider here the problem of calculating the moments of binomial random variables. It is shown how formulae for both the raw and the central moments of such random variables may be obtained in a recursive manner utilizing Stirling numbers of the first kind. Suggestions are also provided as to how students might be encouraged to explore this…
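
    The abstract's own recursion is not reproduced here, but a closely related classical identity expresses the raw moments of a binomial random variable through Stirling numbers of the second kind and falling factorials: E[X^k] = sum over j of S2(k, j) * n(n-1)...(n-j+1) * p^j. The short script below (illustrative only) checks that identity numerically against moments computed directly from the probability mass function.

        # Numerical check of E[X^k] = sum_j S2(k, j) * falling(n, j) * p**j for X ~ Bin(n, p).
        from math import comb

        def stirling2(k, j):
            """Stirling number of the second kind via the standard recurrence."""
            if k == j:
                return 1
            if j == 0 or j > k:
                return 0
            return j * stirling2(k - 1, j) + stirling2(k - 1, j - 1)

        def falling(n, j):
            out = 1
            for i in range(j):
                out *= n - i
            return out

        def raw_moment_identity(n, p, k):
            return sum(stirling2(k, j) * falling(n, j) * p**j for j in range(k + 1))

        def raw_moment_direct(n, p, k):
            return sum(x**k * comb(n, x) * p**x * (1 - p)**(n - x) for x in range(n + 1))

        n, p = 10, 0.3
        for k in range(1, 5):
            print(k, raw_moment_identity(n, p, k), raw_moment_direct(n, p, k))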

  3. Performance of the modified Poisson regression approach for estimating relative risks from clustered prospective data.

    PubMed

    Yelland, Lisa N; Salter, Amy B; Ryan, Philip

    2011-10-15

    Modified Poisson regression, which combines a log Poisson regression model with robust variance estimation, is a useful alternative to log binomial regression for estimating relative risks. Previous studies have shown both analytically and by simulation that modified Poisson regression is appropriate for independent prospective data. This method is often applied to clustered prospective data, despite a lack of evidence to support its use in this setting. The purpose of this article is to evaluate the performance of the modified Poisson regression approach for estimating relative risks from clustered prospective data, by using generalized estimating equations to account for clustering. A simulation study is conducted to compare log binomial regression and modified Poisson regression for analyzing clustered data from intervention and observational studies. Both methods generally perform well in terms of bias, type I error, and coverage. Unlike log binomial regression, modified Poisson regression is not prone to convergence problems. The methods are contrasted by using example data sets from 2 large studies. The results presented in this article support the use of modified Poisson regression as an alternative to log binomial regression for analyzing clustered prospective data when clustering is taken into account by using generalized estimating equations.
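
    In practice, "modified Poisson regression for clustered data" means a GEE with a Poisson family, log link, an exchangeable working correlation and robust (sandwich) standard errors, so that exp(coefficient) estimates a relative risk for a binary outcome. The sketch below assumes hypothetical column names and is not tied to the simulation design described above.

        # Modified Poisson regression for a clustered binary outcome via GEE.
        import numpy as np
        import pandas as pd
        import statsmodels.api as sm

        df = pd.read_csv("clustered_trial.csv")       # hypothetical clustered data
        X = sm.add_constant(df[["treated"]].astype(float))

        gee = sm.GEE(
            df["outcome"],                            # binary 0/1 outcome
            X,
            groups=df["cluster"],
            family=sm.families.Poisson(),             # log link by default
            cov_struct=sm.cov_struct.Exchangeable(),
        )
        res = gee.fit()                               # GEE standard errors are robust
        print("relative risk:", np.exp(res.params["treated"]))
        print(res.summary())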

  4. Spatial distribution of single-nucleotide polymorphisms related to fungicide resistance and implications for sampling.

    PubMed

    Van der Heyden, H; Dutilleul, P; Brodeur, L; Carisse, O

    2014-06-01

    Spatial distribution of single-nucleotide polymorphisms (SNPs) related to fungicide resistance was studied for Botrytis cinerea populations in vineyards and for B. squamosa populations in onion fields. Heterogeneity in this distribution was characterized by performing geostatistical analyses based on semivariograms and through the fitting of discrete probability distributions. Two SNPs known to be responsible for boscalid resistance (H272R and H272Y), both located on the B subunit of the succinate dehydrogenase gene, and one SNP known to be responsible for dicarboximide resistance (I365S) were chosen for B. cinerea in grape. For B. squamosa in onion, one SNP responsible for dicarboximide resistance (I365S homologous) was chosen. One onion field was sampled in 2009 and another one was sampled in 2010 for B. squamosa, and two vineyards were sampled in 2011 for B. cinerea, for a total of four sampled sites. Cluster sampling was carried on a 10-by-10 grid, each of the 100 nodes being the center of a 10-by-10-m quadrat. In each quadrat, 10 samples were collected and analyzed by restriction fragment length polymorphism polymerase chain reaction (PCR) or allele specific PCR. Mean SNP incidence varied from 16 to 68%, with an overall mean incidence of 43%. In the geostatistical analyses, omnidirectional variograms showed spatial autocorrelation characterized by ranges of 21 to 1 m. Various levels of anisotropy were detected, however, with variograms computed in four directions (at 0°, 45°, 90°, and 135° from the within-row direction used as reference), indicating that spatial autocorrelation was prevalent or characterized by a longer range in one direction. For all eight data sets, the β-binomial distribution was found to fit the data better than the binomial distribution. This indicates local aggregation of fungicide resistance among sampling units, as supported by estimates of the parameter θ of the β-binomial distribution of 0.09 to 0.23 (overall median value = 0.20). On the basis of the observed spatial distribution patterns of SNP incidence, sampling curves were computed for different levels of reliability, emphasizing the importance of sample size for the detection of mutation incidence below the risk threshold for control failure.
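
    The binomial versus beta-binomial comparison used here can be reproduced in outline by maximum likelihood: the beta-binomial's extra parameter absorbs the within-quadrat aggregation that the plain binomial cannot. The sketch below uses made-up counts of resistant isolates (k out of m = 10 per quadrat) and one common aggregation parameterisation, theta = 1/(a + b); it is not the paper's analysis.

        # Compare binomial and beta-binomial fits to per-quadrat resistance counts.
        import numpy as np
        from scipy.stats import binom, betabinom
        from scipy.optimize import minimize

        m = 10
        k = np.array([0, 1, 2, 5, 7, 3, 0, 9, 4, 6, 2, 8])   # hypothetical counts

        # Binomial fit: a single probability p.
        p_hat = k.sum() / (m * len(k))
        ll_binom = binom.logpmf(k, m, p_hat).sum()

        # Beta-binomial fit: maximise the log-likelihood over (a, b) on the log scale.
        def nll(params):
            a, b = np.exp(params)
            return -betabinom.logpmf(k, m, a, b).sum()

        opt = minimize(nll, x0=[0.0, 0.0], method="Nelder-Mead")
        a_hat, b_hat = np.exp(opt.x)

        print("binomial loglik:", ll_binom, " beta-binomial loglik:", -opt.fun)
        print("aggregation theta = 1/(a+b):", 1.0 / (a_hat + b_hat))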

  5. A crash-prediction model for multilane roads.

    PubMed

    Caliendo, Ciro; Guida, Maurizio; Parisi, Alessandra

    2007-07-01

    Considerable research has been carried out in recent years to establish relationships between crashes and traffic flow, geometric infrastructure characteristics and environmental factors for two-lane rural roads. Crash-prediction models focused on multilane rural roads, however, have rarely been investigated. In addition, most research has paid but little attention to the safety effects of variables such as stopping sight distance and pavement surface characteristics. Moreover, the statistical approaches have generally included Poisson and Negative Binomial regression models, whilst Negative Multinomial regression model has been used to a lesser extent. Finally, as far as the authors are aware, prediction models involving all the above-mentioned factors have still not been developed in Italy for multilane roads, such as motorways. Thus, in this paper crash-prediction models for a four-lane median-divided Italian motorway were set up on the basis of accident data observed during a 5-year monitoring period extending between 1999 and 2003. The Poisson, Negative Binomial and Negative Multinomial regression models, applied separately to tangents and curves, were used to model the frequency of accident occurrence. Model parameters were estimated by the Maximum Likelihood Method, and the Generalized Likelihood Ratio Test was applied to detect the significant variables to be included in the model equation. Goodness-of-fit was measured by means of both the explained fraction of total variation and the explained fraction of systematic variation. The Cumulative Residuals Method was also used to test the adequacy of a regression model throughout the range of each variable. The candidate set of explanatory variables was: length (L), curvature (1/R), annual average daily traffic (AADT), sight distance (SD), side friction coefficient (SFC), longitudinal slope (LS) and the presence of a junction (J). Separate prediction models for total crashes and for fatal and injury crashes only were considered. For curves it is shown that significant variables are L, 1/R and AADT, whereas for tangents they are L, AADT and junctions. The effect of rain precipitation was analysed on the basis of hourly rainfall data and assumptions about drying time. It is shown that a wet pavement significantly increases the number of crashes. The models developed in this paper for Italian motorways appear to be useful for many applications such as the detection of critical factors, the estimation of accident reduction due to infrastructure and pavement improvement, and the predictions of accidents counts when comparing different design options. Thus this research may represent a point of reference for engineers in adjusting or designing multilane roads.

  6. DEsingle for detecting three types of differential expression in single-cell RNA-seq data.

    PubMed

    Miao, Zhun; Deng, Ke; Wang, Xiaowo; Zhang, Xuegong

    2018-04-24

    The excess zeros in single-cell RNA-seq data include "real" zeros due to the on-off nature of gene transcription in single cells and "dropout" zeros due to technical reasons. Existing differential expression (DE) analysis methods cannot distinguish these two types of zeros. We developed an R package, DEsingle, which employs a zero-inflated negative binomial model to estimate the proportion of real and dropout zeros and to define and detect three types of DE genes in single-cell RNA-seq data with higher accuracy. The R package DEsingle is freely available at https://github.com/miaozhun/DEsingle and is currently under consideration at Bioconductor. zhangxg@tsinghua.edu.cn. Supplementary data are available at Bioinformatics online.

  7. AMP: Assembly Matching Pursuit.

    PubMed

    Biswas, S; Jojic, V

    2013-01-01

    Metagenomics, the study of the total genetic material isolated from a biological host, promises to reveal host-microbe or microbe-microbe interactions that may help to personalize medicine or improve agronomic practice. We introduce a method that discovers metagenomic units (MGUs) relevant for phenotype prediction through sequence-based dictionary learning. The method aggregates patient-specific dictionaries and estimates MGU abundances in order to summarize a whole population and yield universally predictive biomarkers. We analyze the impact of Gaussian, Poisson, and Negative Binomial read count models in guiding dictionary construction by examining classification efficiency on a number of synthetic datasets and a real dataset from Ref. 1. Each outperforms standard methods of dictionary composition, such as random projection and orthogonal matching pursuit. Additionally, the predictive MGUs they recover are biologically relevant.

  8. A review on models for count data with extra zeros

    NASA Astrophysics Data System (ADS)

    Zamri, Nik Sarah Nik; Zamzuri, Zamira Hasanah

    2017-04-01

    Zero-inflated models are typically used to model count data with excess zeros. The extra zeros may be structural or may occur at random. These types of data are commonly found in various disciplines such as finance, insurance, biomedicine, econometrics, ecology, and the health sciences. As found in the literature, the most popular zero-inflated models are the zero-inflated Poisson and the zero-inflated negative binomial. Recently, more complex models have been developed to account for overdispersion and unobserved heterogeneity. In addition, more extended distributions have also been considered for modelling data with this feature. In this paper, we review the related literature and summarize recent developments in models for count data with extra zeros.

  9. The Association between Romantic Relationships and Delinquency in Adolescence and Young Adulthood

    PubMed Central

    Cui, Ming; Ueno, Koji; Fincham, Frank D.; Donnellan, M. Brent; Wickrama, K. A. S.

    2011-01-01

    This study examined the association between romantic relationships and delinquency in adolescence and young adulthood. Using a large, longitudinal, and nationally representative sample, results from negative binomial regressions showed a positive association between romantic involvement and delinquency in adolescence. Further, the cumulative number of romantic relationships from adolescence to young adulthood was positively related to delinquency in young adulthood even controlling for earlier delinquency in adolescence. These analyses also controlled for the effects of participant gender, age at initial assessment, puberty, race/ethnicity, and other demographic characteristics (e.g., family structure and parents’ education). Findings are discussed in terms of their implications for understanding the role of romantic relationships in the development of young people and for stimulating future research questions. PMID:22984343

  10. Factors affecting road mortality of white-tailed deer in eastern South Dakota

    USGS Publications Warehouse

    Grovenburg, Troy W.; Jenks, Jonathan A.; Klaver, Robert W.; Monteith, Kevin L.; Galster, Dwight H.; Schauer, Ron J.; Morlock, Wilbert W.; Delger, Joshua A.

    2008-01-01

    White-tailed deer (Odocoileus virginianus) mortalities (n = 4,433) caused by collisions with automobiles during 2003 were modeled in 35 counties in eastern South Dakota. Seventeen independent variables and 5 independent variable interactions were evaluated to explain deer mortalities. A negative binomial regression model (Ln Y = 1.25 – 0.12 [percentage tree coverage] + 0.0002 [county area] + 5.39 [county hunter success rate] + 0.0023 [vehicle proxy 96–104 km/hr roads], model deviance = 33.43, χ2 = 27.53, df = 27) was chosen using a combination of a priori model selection and AICc. Management options include use of the model to predict road mortalities and to increase the number of hunting licenses, which could result in fewer DVCs.

  11. Demonstration of fundamental statistics by studying timing of electronics signals in a physics-based laboratory

    NASA Astrophysics Data System (ADS)

    Beach, Shaun E.; Semkow, Thomas M.; Remling, David J.; Bradt, Clayton J.

    2017-07-01

    We have developed accessible methods to demonstrate fundamental statistics in several phenomena, in the context of teaching electronic signal processing in a physics-based college-level curriculum. A relationship between the exponential time-interval distribution and Poisson counting distribution for a Markov process with constant rate is derived in a novel way and demonstrated using nuclear counting. Negative binomial statistics is demonstrated as a model for overdispersion and justified by the effect of electronic noise in nuclear counting. The statistics of digital packets on a computer network are shown to be compatible with the fractal-point stochastic process leading to a power-law as well as generalized inverse Gaussian density distributions of time intervals between packets.
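
    The exponential/Poisson relationship invoked here is easy to demonstrate by simulation: if waiting times between events are exponential with a constant rate, counts in fixed windows are Poisson with mean and variance equal to that rate. The short sketch below (arbitrary rate, not the laboratory data) also hints at why overdispersed counts call for the negative binomial instead.

        # Exponential inter-event times imply Poisson counts per fixed window.
        import numpy as np

        rng = np.random.default_rng(0)
        rate, t_total = 5.0, 10_000.0                 # hypothetical events/second, run length

        gaps = rng.exponential(1.0 / rate, size=int(rate * t_total * 2))
        arrivals = np.cumsum(gaps)
        arrivals = arrivals[arrivals < t_total]

        counts = np.bincount(arrivals.astype(int), minlength=int(t_total))  # per 1-s bin
        print("mean:", counts.mean(), " variance:", counts.var())
        # For a Poisson process these agree (both ~ rate); variance exceeding the
        # mean is the overdispersion that the negative binomial model accommodates.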

  12. Under the hood of statistical learning: A statistical MMN reflects the magnitude of transitional probabilities in auditory sequences.

    PubMed

    Koelsch, Stefan; Busch, Tobias; Jentschke, Sebastian; Rohrmeier, Martin

    2016-02-02

    Within the framework of statistical learning, many behavioural studies investigated the processing of unpredicted events. However, surprisingly few neurophysiological studies are available on this topic, and no statistical learning experiment has investigated electroencephalographic (EEG) correlates of processing events with different transition probabilities. We carried out an EEG study with a novel variant of the established statistical learning paradigm. Timbres were presented in isochronous sequences of triplets. The first two sounds of all triplets were equiprobable, while the third sound occurred with either low (10%), intermediate (30%), or high (60%) probability. Thus, the occurrence probability of the third item of each triplet (given the first two items) was varied. Compared to high-probability triplet endings, endings with low and intermediate probability elicited an early anterior negativity that had an onset around 100 ms and was maximal at around 180 ms. This effect was larger for events with low than for events with intermediate probability. Our results reveal that, when predictions are based on statistical learning, events that do not match a prediction evoke an early anterior negativity, with the amplitude of this mismatch response being inversely related to the probability of such events. Thus, we report a statistical mismatch negativity (sMMN) that reflects statistical learning of transitional probability distributions that go beyond auditory sensory memory capabilities.

  13. Non-normal Distributions Commonly Used in Health, Education, and Social Sciences: A Systematic Review

    PubMed Central

    Bono, Roser; Blanca, María J.; Arnau, Jaume; Gómez-Benito, Juana

    2017-01-01

    Statistical analysis is crucial for research and the choice of analytical technique should take into account the specific distribution of data. Although the data obtained from health, educational, and social sciences research are often not normally distributed, there are very few studies detailing which distributions are most likely to represent data in these disciplines. The aim of this systematic review was to determine the frequency of appearance of the most common non-normal distributions in the health, educational, and social sciences. The search was carried out in the Web of Science database, from which we retrieved the abstracts of papers published between 2010 and 2015. The selection was made on the basis of the title and the abstract, and was performed independently by two reviewers. The inter-rater reliability for article selection was high (Cohen’s kappa = 0.84), and agreement regarding the type of distribution reached 96.5%. A total of 262 abstracts were included in the final review. The distribution of the response variable was reported in 231 of these abstracts, while in the remaining 31 it was merely stated that the distribution was non-normal. In terms of their frequency of appearance, the most-common non-normal distributions can be ranked in descending order as follows: gamma, negative binomial, multinomial, binomial, lognormal, and exponential. In addition to identifying the distributions most commonly used in empirical studies these results will help researchers to decide which distributions should be included in simulation studies examining statistical procedures. PMID:28959227

  14. Generalized empirical Bayesian methods for discovery of differential data in high-throughput biology.

    PubMed

    Hardcastle, Thomas J

    2016-01-15

    High-throughput data are now commonplace in biological research. Rapidly changing technologies and application mean that novel methods for detecting differential behaviour that account for a 'large P, small n' setting are required at an increasing rate. The development of such methods is, in general, being done on an ad hoc basis, requiring further development cycles and a lack of standardization between analyses. We present here a generalized method for identifying differential behaviour within high-throughput biological data through empirical Bayesian methods. This approach is based on our baySeq algorithm for identification of differential expression in RNA-seq data based on a negative binomial distribution, and in paired data based on a beta-binomial distribution. Here we show how the same empirical Bayesian approach can be applied to any parametric distribution, removing the need for lengthy development of novel methods for differently distributed data. Comparisons with existing methods developed to address specific problems in high-throughput biological data show that these generic methods can achieve equivalent or better performance. A number of enhancements to the basic algorithm are also presented to increase flexibility and reduce computational costs. The methods are implemented in the R baySeq (v2) package, available on Bioconductor http://www.bioconductor.org/packages/release/bioc/html/baySeq.html. tjh48@cam.ac.uk Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  15. Quantification of integrated HIV DNA by repetitive-sampling Alu-HIV PCR on the basis of poisson statistics.

    PubMed

    De Spiegelaere, Ward; Malatinkova, Eva; Lynch, Lindsay; Van Nieuwerburgh, Filip; Messiaen, Peter; O'Doherty, Una; Vandekerckhove, Linos

    2014-06-01

    Quantification of integrated proviral HIV DNA by repetitive-sampling Alu-HIV PCR is a candidate virological tool to monitor the HIV reservoir in patients. However, the experimental procedures and data analysis of the assay are complex and hinder its widespread use. Here, we provide an improved and simplified data analysis method by adopting binomial and Poisson statistics. A modified analysis method on the basis of Poisson statistics was used to analyze the binomial data of positive and negative reactions from a 42-replicate Alu-HIV PCR by use of dilutions of an integration standard and on samples of 57 HIV-infected patients. Results were compared with the quantitative output of the previously described Alu-HIV PCR method. Poisson-based quantification of the Alu-HIV PCR was linearly correlated with the standard dilution series, indicating that absolute quantification with the Poisson method is a valid alternative for data analysis of repetitive-sampling Alu-HIV PCR data. Quantitative outputs of patient samples assessed by the Poisson method correlated with the previously described Alu-HIV PCR analysis, indicating that this method is a valid alternative for quantifying integrated HIV DNA. Poisson-based analysis of the Alu-HIV PCR data enables absolute quantification without the need of a standard dilution curve. Implementation of the CI estimation permits improved qualitative analysis of the data and provides a statistical basis for the required minimal number of technical replicates. © 2014 The American Association for Clinical Chemistry.
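
    The Poisson logic behind such repetitive-sampling assays is the single-hit calculation used in limiting dilution: if a fraction of replicate reactions stays negative, the mean number of templates per reaction is minus the natural log of that fraction. The sketch below illustrates only that core step, with made-up replicate counts and a Clopper-Pearson interval; the paper's full analysis additionally handles the Alu-PCR background and replicate design.

        # Single-hit Poisson estimate from the fraction of negative replicates.
        import numpy as np
        from scipy.stats import beta

        n, r = 42, 17                          # hypothetical: 42 replicates, 17 negative

        lam = -np.log(r / n)                   # mean integration events per reaction
        print("lambda per reaction:", lam)

        # Clopper-Pearson 95% CI for the negative fraction, mapped through -ln(.)
        lo = beta.ppf(0.025, r, n - r + 1)
        hi = beta.ppf(0.975, r + 1, n - r)
        print("95% CI for lambda:", -np.log(hi), "to", -np.log(lo))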

  16. True detection limits in an experimental linearly heteroscedastic system. Part 1

    NASA Astrophysics Data System (ADS)

    Voigtman, Edward; Abraham, Kevin T.

    2011-11-01

    Using a lab-constructed laser-excited filter fluorimeter deliberately designed to exhibit linearly heteroscedastic, additive Gaussian noise, it has been shown that accurate estimates may be made of the true theoretical Currie decision levels (YC and XC) and true Currie detection limits (YD and XD) for the detection of rhodamine 6G tetrafluoroborate in ethanol. The obtained experimental values, for 5% probability of false positives and 5% probability of false negatives, were YC = 56.1 mV, YD = 125. mV, XC = 0.132 μg/mL and XD = 0.294 μg/mL. For 5% probability of false positives and 1% probability of false negatives, the obtained detection limits were YD = 158. mV and XD = 0.372 μg/mL. These decision levels and corresponding detection limits were shown to pass the ultimate test: they resulted in observed probabilities of false positives and false negatives that were statistically equivalent to the a priori specified values.

  17. The influence of environmental variables on the presence of white sharks, Carcharodon carcharias at two popular Cape Town bathing beaches: a generalized additive mixed model.

    PubMed

    Weltz, Kay; Kock, Alison A; Winker, Henning; Attwood, Colin; Sikweyiya, Monwabisi

    2013-01-01

    Shark attacks on humans are high profile events which can significantly influence policies related to the coastal zone. A shark warning system in South Africa, Shark Spotters, recorded 378 white shark (Carcharodon carcharias) sightings at two popular beaches, Fish Hoek and Muizenberg, during 3690 six-hour long spotting shifts, during the months September to May 2006 to 2011. The probabilities of shark sightings were related to environmental variables using Binomial Generalized Additive Mixed Models (GAMMs). Sea surface temperature was significant, with the probability of shark sightings increasing rapidly as SST exceeded 14 °C and approached a maximum at 18 °C, whereafter it remains high. An 8 times (Muizenberg) and 5 times (Fish Hoek) greater likelihood of sighting a shark was predicted at 18 °C than at 14 °C. Lunar phase was also significant with a prediction of 1.5 times (Muizenberg) and 4 times (Fish Hoek) greater likelihood of a shark sighting at new moon than at full moon. At Fish Hoek, the probability of sighting a shark was 1.6 times higher during the afternoon shift compared to the morning shift, but no diel effect was found at Muizenberg. A significant increase in the number of shark sightings was identified over the last three years, highlighting the need for ongoing research into shark attack mitigation. These patterns will be incorporated into shark awareness and bather safety campaigns in Cape Town.

  18. Utilization of infertility services: how much does money matter?

    PubMed

    Farley Ordovensky Staniec, J; Webb, Natalie J

    2007-06-01

    To estimate the effects of financial access and other individual characteristics on the likelihood that a woman pursues infertility treatment and the choice of treatment type. The 1995 National Survey of Family Growth. We use a binomial logit model to estimate the effects of financial access and individual characteristics on the likelihood that a woman pursues infertility treatment. We then use a multinomial logit model to estimate the differential effects of these variables across treatment types. This study analyzes the subset of 1,210 women who meet the definition of infertile or subfecund from the 1995 National Survey of Family Growth. We find that income, insurance coverage, age, and parity (number of previous births) all significantly affect the probability of seeking infertility treatment; however, the effect of these variables on choice of treatment type varies significantly. Neither income nor insurance influences the probability of seeking advice, a relatively low-cost, low-yield treatment. At the other end of the spectrum, the choice to pursue assisted reproductive technologies (ARTs), a much more expensive but potentially more productive option, is highly influenced by income, but merely having private insurance has no significant effect. In the middle of the spectrum are treatment options such as testing, surgery, and medications, for which "financial access" increases the probability of selection. Our results illustrate that for the sample of infertile or subfecund women of childbearing age studied, and considering their options, financial access to infertility treatment does matter.
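
    The two-stage structure described here (a binomial logit for seeking any treatment, then a multinomial logit over treatment type among seekers) can be sketched as follows; the file, column and category codings are hypothetical, not the NSFG variable names.

        # Two-stage sketch: binary logit for any treatment, multinomial logit for type.
        import pandas as pd
        import statsmodels.api as sm

        df = pd.read_csv("nsfg_subset.csv")                  # hypothetical extract
        X = sm.add_constant(df[["income", "insured", "age", "parity"]].astype(float))

        any_tx = sm.Logit(df["sought_treatment"], X).fit(disp=0)

        seekers = df["sought_treatment"] == 1
        # treatment_type coded 0 = advice, 1 = testing/surgery/medication, 2 = ART;
        # MNLogit reports each category's coefficients relative to the baseline.
        mnl = sm.MNLogit(df.loc[seekers, "treatment_type"], X.loc[seekers]).fit(disp=0)

        print(any_tx.summary())
        print(mnl.summary())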

  19. The Influence of Environmental Variables on the Presence of White Sharks, Carcharodon carcharias at Two Popular Cape Town Bathing Beaches: A Generalized Additive Mixed Model

    PubMed Central

    Weltz, Kay; Kock, Alison A.; Winker, Henning; Attwood, Colin; Sikweyiya, Monwabisi

    2013-01-01

    Shark attacks on humans are high-profile events which can significantly influence policies related to the coastal zone. A shark warning system in South Africa, Shark Spotters, recorded 378 white shark (Carcharodon carcharias) sightings at two popular beaches, Fish Hoek and Muizenberg, during 3690 six-hour spotting shifts in the months September to May, 2006 to 2011. The probabilities of shark sightings were related to environmental variables using Binomial Generalized Additive Mixed Models (GAMMs). Sea surface temperature was significant, with the probability of shark sightings increasing rapidly as SST exceeded 14°C and approached a maximum at 18°C, whereafter it remained high. An 8 times (Muizenberg) and 5 times (Fish Hoek) greater likelihood of sighting a shark was predicted at 18°C than at 14°C. Lunar phase was also significant with a prediction of 1.5 times (Muizenberg) and 4 times (Fish Hoek) greater likelihood of a shark sighting at new moon than at full moon. At Fish Hoek, the probability of sighting a shark was 1.6 times higher during the afternoon shift compared to the morning shift, but no diel effect was found at Muizenberg. A significant increase in the number of shark sightings was identified over the last three years, highlighting the need for ongoing research into shark attack mitigation. These patterns will be incorporated into shark awareness and bather safety campaigns in Cape Town. PMID:23874668

  20. Perceptions of Unprofessional Social Media Behavior Among Emergency Medicine Physicians.

    PubMed

    Soares, William; Shenvi, Christina; Waller, Nikki; Johnson, Reuben; Hodgson, Carol S

    2017-02-01

    Use of social media (SM) by physicians has exposed issues of privacy and professionalism. While guidelines have been created for SM use, details regarding specific SM behaviors that could lead to disciplinary action presently do not exist. To compare State Medical Board (SMB) directors' perceptions of investigation for specific SM behaviors with those of emergency medicine (EM) physicians. A multicenter anonymous survey was administered to physicians at 3 academic EM residency programs. Surveys consisted of case vignettes, asking, "If the SMB were informed of the content, how likely would they be to initiate an investigation, possibly leading to disciplinary action?" (1, very unlikely, to 4, very likely). Results were compared to published probabilities using exact binomial testing. Of 205 eligible physicians, 119 (58%) completed the survey. Compared to SMB directors, EM physicians indicated similar probabilities of investigation for themes involving identifying patient images, inappropriate communication, and discriminatory speech. Participants indicated lower probabilities of investigation for themes including derogatory speech (32%, 95% confidence interval [CI] 24-41 versus 46%, P  < .05); alcohol intoxication (41%, 95% CI 32-51 versus 73%, P  < .05); and holding alcohol without intoxication (7%, 95% CI 3-13 versus 40%, P  < .05). There were no significant associations with position, hospital site, years since medical school, or prior SM professionalism training. Physicians reported a lower likelihood of investigation for themes that intersect with social identity, compared to SMB directors, particularly for images of alcohol and derogatory speech.
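
    The comparison against published probabilities can be illustrated with an exact binomial test, as sketched below; the counts are hypothetical, not the study's raw responses.

    ```python
    from scipy.stats import binomtest

    n_respondents = 119           # survey respondents
    n_likely = 49                 # hypothetical count answering "likely"/"very likely"
    published_probability = 0.73  # e.g. the SMB directors' value for one vignette

    result = binomtest(n_likely, n_respondents, p=published_probability)
    print(n_likely / n_respondents, result.pvalue)
    # A small two-sided p-value indicates the physicians' proportion differs from the published value.
    ```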

  1. 78 FR 33010 - Airworthiness Directives; Saab AB, Saab Aerosystems Airplanes

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-06-03

    ... the probability of a negative effect on the handling quality during stall, which could result in... the probability of a negative effect on the handling quality during stall. For the reasons described above, this [EASA] AD requires a one-time inspection of the stick pusher rigging and, depending on...

  2. Inferring patterns in mitochondrial DNA sequences through hypercube independent spanning trees.

    PubMed

    Silva, Eduardo Sant Ana da; Pedrini, Helio

    2016-03-01

    Given a graph G, a set of spanning trees rooted at a vertex r of G is said to be vertex/edge independent if, for each vertex v of G, v≠r, the paths from r to v in any pair of trees are vertex/edge disjoint. Independent spanning trees (ISTs) provide a number of advantages in data broadcasting due to their fault-tolerant properties. For this reason, some studies have addressed the issue by providing mechanisms for constructing independent spanning trees efficiently. In this work, we investigate how to construct independent spanning trees on hypercubes, generated based upon spanning binomial trees, and how to use them to predict mitochondrial DNA sequence parts through paths on the hypercube. The prediction works both for inferring mitochondrial DNA sequences composed of six bases and for inferring anomalies that probably do not belong to the mitochondrial DNA standard. Copyright © 2016 Elsevier Ltd. All rights reserved.
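
    The spanning binomial trees mentioned above have a simple constructive description on the hypercube: attach each nonzero vertex to the vertex obtained by clearing its most significant set bit. The sketch below shows only this basic building block; the independent spanning trees used in the paper are obtained from further transformations of such trees, which are not reproduced here.

    ```python
    def binomial_spanning_tree(n):
        """Return {child: parent} for the hypercube Q_n (vertices 0 .. 2**n - 1), rooted at 0."""
        parent = {}
        for v in range(1, 2 ** n):
            msb = v.bit_length() - 1       # position of the highest set bit
            parent[v] = v & ~(1 << msb)    # clearing it gives a neighbour: a valid hypercube edge
        return parent

    def root_path(v, parent):
        """Path from vertex v back to the root 0 along tree edges."""
        path = [v]
        while v != 0:
            v = parent[v]
            path.append(v)
        return path

    tree = binomial_spanning_tree(3)
    print(tree)                    # e.g. 0b110 -> 0b010, 0b111 -> 0b011
    print(root_path(0b111, tree))  # [7, 3, 1, 0]
    ```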

  3. Comparison of clinical probability-adjusted D-dimer and age-adjusted D-dimer interpretation to exclude venous thromboembolism.

    PubMed

    Takach Lapner, Sarah; Julian, Jim A; Linkins, Lori-Ann; Bates, Shannon; Kearon, Clive

    2017-10-05

    Two new strategies for interpreting D-dimer results have been proposed: i) using a progressively higher D-dimer threshold with increasing age (age-adjusted strategy) and ii) using a D-dimer threshold in patients with low clinical probability that is twice the threshold used in patients with moderate clinical probability (clinical probability-adjusted strategy). Our objective was to compare the diagnostic accuracy of age-adjusted and clinical probability-adjusted D-dimer interpretation in patients with a low or moderate clinical probability of venous thromboembolism (VTE). We performed a retrospective analysis of clinical data and blood samples from two prospective studies. We compared the negative predictive value (NPV) for VTE, and the proportion of patients with a negative D-dimer result, using two D-dimer interpretation strategies: the age-adjusted strategy, which uses a progressively higher D-dimer threshold with increasing age over 50 years (age in years × 10 µg/L FEU); and the clinical probability-adjusted strategy, which uses a D-dimer threshold of 1000 µg/L FEU in patients with low clinical probability and 500 µg/L FEU in patients with moderate clinical probability. A total of 1649 outpatients with low or moderate clinical probability for a first suspected deep vein thrombosis or pulmonary embolism were included. The NPVs of the clinical probability-adjusted strategy (99.7%) and the age-adjusted strategy (99.6%) were similar. However, the proportion of patients with a negative result was greater with the clinical probability-adjusted strategy (56.1% vs. 50.9%; difference 5.2%; 95% CI 3.5% to 6.8%). These findings suggest that clinical probability-adjusted D-dimer interpretation is a better way of interpreting D-dimer results compared to age-adjusted interpretation.
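
    The two interpretation rules compared above reduce to simple threshold formulas, sketched below; this encodes only the thresholds stated in the abstract and is illustrative, not clinical guidance.

    ```python
    def age_adjusted_threshold(age_years):
        """Conventional 500 µg/L FEU up to age 50, then age × 10 µg/L FEU."""
        return age_years * 10 if age_years > 50 else 500

    def probability_adjusted_threshold(clinical_probability):
        """1000 µg/L FEU for low clinical probability, 500 µg/L FEU for moderate."""
        return 1000 if clinical_probability == "low" else 500

    def d_dimer_negative(d_dimer, threshold):
        return d_dimer < threshold

    # Example: a 72-year-old with low clinical probability and a D-dimer of 680 µg/L FEU.
    print(d_dimer_negative(680, age_adjusted_threshold(72)))             # True (threshold 720)
    print(d_dimer_negative(680, probability_adjusted_threshold("low")))  # True (threshold 1000)
    ```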

  4. Children at risk: A comparison of child pedestrian traffic collisions in Santiago, Chile, and Seoul, South Korea.

    PubMed

    Blazquez, Carola; Lee, Jae Seung; Zegras, Christopher

    2016-01-01

    We examine and compare pedestrian-vehicle collisions and injury outcomes involving school-age children between 5 and 18 years of age in the capital cities of Santiago, Chile, and Seoul, South Korea. We conduct descriptive analysis of the child pedestrian-vehicle collision (P-VC) data (904 collisions for Santiago and 3,505 for Seoul) reported by the police between 2010 and 2011. We also statistically analyze factors associated with child P-VCs, by both incident severity and age group, using 3 regression models: negative binomial, probit, and spatial lag models. Descriptive statistics suggest that child pedestrians in Seoul have a higher risk of being involved in traffic crashes than their counterparts in Santiago. However, in Seoul a greater proportion of children are unharmed as a result of these incidents, whereas more child pedestrians are killed in Santiago. Younger children in Seoul suffer more injuries from P-VCs than in Santiago. The majority of P-VCs in both cities tend to occur in the afternoon and evening, at intersections in Santiago and at midblock locations in Seoul. Our model results suggest that the resident population of children is positively associated with P-VCs in both cities, and school concentrations apparently increase P-VC risk among older children in Santiago. Bus stops are associated with higher P-VCs in Seoul, and subway stations relate to higher P-VCs among older children in Santiago. Zone-level land use mix was negatively related to child P-VCs in Seoul but not in Santiago. Arterial roads are associated with fewer P-VCs, especially for younger children in both cities. A share of collector roads is associated with increased P-VCs in Seoul but fewer P-VCs in Santiago. Hilliness is related to fewer P-VCs in both cities. Differences in these model results for Santiago and Seoul warrant additional analysis, as do the differences in results across model type (negative binomial versus spatial lag models). To reduce child P-VCs, this study suggests the need to assess subway station and bus stop area conditions in Santiago and Seoul, respectively; areas with high density of schools in Santiago; areas with greater concentrations of children in both cities; and collector roads in Seoul.
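
    A minimal sketch of a zone-level negative binomial count model of the kind used above is shown below, with an offset for the child population at risk; the file name, column names, and fixed dispersion are illustrative assumptions rather than the study's specification.

    ```python
    import numpy as np
    import pandas as pd
    import statsmodels.api as sm
    import statsmodels.formula.api as smf

    df = pd.read_csv("zones.csv")  # hypothetical zone-level file

    model = smf.glm(
        "collisions ~ school_density + bus_stops + subway_stations + land_use_mix + arterial_share",
        data=df,
        family=sm.families.NegativeBinomial(alpha=1.0),  # dispersion fixed here for simplicity
        offset=np.log(df["child_population"]),           # exposure: resident child population
    ).fit()

    # exp(coefficient) is the incidence rate ratio per one-unit covariate increase.
    print(np.exp(model.params))
    ```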

  5. Patterns and determinants of use of pharmacological therapies for intermittent claudication in PAD outpatients: results of the IDOMENEO study.

    PubMed

    Cimminiello, Claudio; Polo Friz, Hernan; Marano, Giuseppe; Arpaia, Guido; Boracchi, Patrizia; Spezzigu, Gabriella; Visonà, Adriana

    2017-06-01

    Peripheral arterial disease (PAD) usually presents with intermittent claudication (IC). The aim of the present study was to assess, in clinical practice, the pattern of use of pharmacological therapies for IC in stable PAD outpatients. A propensity analysis was performed using data from the IDOMENEO study, an observational prospective multicenter cohort study. The association between any pharmacological symptomatic IC therapy and different variables was investigated using generalized linear mixed models with pharmacological therapy as the response variable and binomial error. The study population comprised 213 patients (147 male, 69.0%; mean age 70.0±8.6 years). Only 36.6% were under pharmacological treatment for IC, with cilostazol the most frequently used medication (21.6%). Univariate analysis showed a probability of a patient being assigned any pharmacological symptomatic IC therapy of 67.0% when the Ankle-Brachial Index (ABI) was <0.6 and 29.8% when the ABI was >0.6 (P=0.0048), and a propensity to avoid pharmacological treatment for patients with a high number of drugs to treat cardiovascular risk factors (probability of 55.2% for <4 drugs and 19.6% for >4 drugs, P=0.0317). Multivariate analysis confirmed a higher probability of assigning treatment for ABI <0.6 (P=0.0274), and a trend toward a lower probability in patients under polypharmacy (>4 drugs: OR=0.13, P=0.0546). In clinical practice, only one third of stable outpatients with IC used symptomatic pharmacological therapy for IC. We found a propensity of clinicians to assign symptomatic pharmacological IC therapy to patients with lower values of ABI and a propensity to avoid this kind of treatment in patients under polypharmacy.

  6. Accounting for Selection Bias in Studies of Acute Cardiac Events.

    PubMed

    Banack, Hailey R; Harper, Sam; Kaufman, Jay S

    2018-06-01

    In cardiovascular research, pre-hospital mortality represents an important potential source of selection bias. Inverse probability of censoring weights are a method to account for this source of bias. The objective of this article is to examine and correct for the influence of selection bias due to pre-hospital mortality on the relationship between cardiovascular risk factors and all-cause mortality after an acute cardiac event. The relationship between the number of cardiovascular disease (CVD) risk factors (0-5; smoking status, diabetes, hypertension, dyslipidemia, and obesity) and all-cause mortality was examined using data from the Atherosclerosis Risk in Communities (ARIC) study. To illustrate the magnitude of selection bias, estimates from an unweighted generalized linear model with a log link and binomial distribution were compared with estimates from an inverse probability of censoring weighted model. In unweighted multivariable analyses the estimated risk ratio for mortality ranged from 1.09 (95% confidence interval [CI], 0.98-1.21) for 1 CVD risk factor to 1.95 (95% CI, 1.41-2.68) for 5 CVD risk factors. In the inverse probability of censoring weighted analyses, the risk ratios ranged from 1.14 (95% CI, 0.94-1.39) to 4.23 (95% CI, 2.69-6.66). Estimates from the inverse probability of censoring weighted model were substantially greater than unweighted, adjusted estimates across all risk factor categories. This illustrates the magnitude of selection bias due to pre-hospital mortality and its effect on estimates of the effect of CVD risk factors on mortality. Moreover, the results highlight the utility of using this method to address a common form of bias in cardiovascular research. Copyright © 2018 Canadian Cardiovascular Society. Published by Elsevier Inc. All rights reserved.
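
    The weighting approach described above can be sketched in two steps: model the probability of remaining uncensored (surviving to hospital), then fit the outcome model weighted by the inverse of that probability. The code below is a simplified illustration; variable names and the use of frequency weights (rather than a robust-variance weighted estimator) are assumptions.

    ```python
    import pandas as pd
    import statsmodels.api as sm
    import statsmodels.formula.api as smf

    df = pd.read_csv("cohort.csv")  # hypothetical analysis file

    # Step 1: model the probability of being uncensored (reaching hospital alive).
    cens = smf.logit("reached_hospital ~ age + sex + n_risk_factors", data=df).fit()
    df["w"] = 1.0 / cens.predict(df)

    # Step 2: weighted log-binomial model among the observed, so exp(coef) is a risk ratio.
    observed = df[df["reached_hospital"] == 1]
    rr_model = smf.glm(
        "died ~ C(n_risk_factors)",
        data=observed,
        family=sm.families.Binomial(link=sm.families.links.Log()),  # log link can be slow to converge
        freq_weights=observed["w"],
    ).fit()
    print(rr_model.summary())
    ```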

  7. The diagnostic performance of coronary artery angiography with 64-MSCT and post 64-MSCT: systematic review and meta-analysis.

    PubMed

    Li, Min; Du, Xiang-Min; Jin, Zhi-Tao; Peng, Zhao-Hui; Ding, Juan; Li, Li

    2014-01-01

    To comprehensively investigate the diagnostic performance of coronary artery angiography with 64-MDCT and post 64-MDCT. PubMed was searched for all published studies that evaluated coronary arteries with 64-MDCT and post 64-MDCT. The clinical diagnostic role was evaluated by applying the likelihood ratios (LRs) to calculate the post-test probability based on Bayes' theorem. 91 studies that met our inclusion criteria were ultimately included in the analysis. The pooled positive and negative LRs at patient level were 8.91 (95% CI, 7.53, 10.54) and 0.02 (CI, 0.01, 0.03), respectively. For studies that did not claim that non-evaluable segments were included, the pooled positive and negative LRs were 11.16 (CI, 8.90, 14.00) and 0.01 (CI, 0.01, 0.03), respectively. For studies including uninterpretable results, the diagnostic performance decreased, with the pooled positive LR 7.40 (CI, 6.00, 9.13) and negative LR 0.02 (CI, 0.01, 0.03). The areas under the summary ROC curve were 0.98 (CI, 0.97 to 0.99) for 64-MDCT and 0.96 (CI, 0.94 to 0.98) for post 64-MDCT, respectively. For references explicitly stating that the non-assessable segments were included during analysis, a post-test probability of negative results >95% and a positive post-test probability <95% could be obtained for patients with a pre-test probability of <73% for coronary artery disease (CAD). On the other hand, when the pre-test probability of CAD was >73%, the diagnostic role was reversed, with a positive post-test probability of CAD >95% and a negative post-test probability of CAD <95%. The diagnostic performance of post 64-MDCT does not increase as compared with 64-MDCT. CTA, overall, is a test of exclusion for patients with a pre-test probability of CAD <73%, while for patients with a pre-test probability of CAD >73%, CTA is a test used to confirm the presence of CAD.
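
    The post-test probability calculation referred to above is a direct application of Bayes' theorem on the odds scale; the short sketch below reproduces it using the pooled patient-level likelihood ratios quoted in the abstract.

    ```python
    def post_test_probability(pre_test_p, likelihood_ratio):
        pre_odds = pre_test_p / (1 - pre_test_p)
        post_odds = pre_odds * likelihood_ratio
        return post_odds / (1 + post_odds)

    pre = 0.73  # the threshold pre-test probability discussed above
    print(post_test_probability(pre, 8.91))      # post-test probability of CAD after a positive result (~0.96)
    print(1 - post_test_probability(pre, 0.02))  # probability CAD is absent after a negative result (~0.95)
    ```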

  8. Probability of lek collapse is lower inside sage-grouse Core Areas: Effectiveness of conservation policy for a landscape species.

    PubMed

    Spence, Emma Suzuki; Beck, Jeffrey L; Gregory, Andrew J

    2017-01-01

    Greater sage-grouse (Centrocercus urophasianus) occupy sagebrush (Artemisia spp.) habitats in 11 western states and 2 Canadian provinces. In September 2015, the U.S. Fish and Wildlife Service announced the listing status for sage-grouse had changed from "warranted but precluded" to "not warranted". The primary reason cited for this change of status was that the enactment of new regulatory mechanisms was sufficient to protect sage-grouse populations. One such plan is the 2008 Wyoming Sage Grouse Executive Order (SGEO), enacted by Governor Freudenthal. The SGEO identifies "Core Areas" that are to be protected by keeping them relatively free from further energy development and limiting other forms of anthropogenic disturbances near active sage-grouse leks. Using the Wyoming Game and Fish Department's sage-grouse lek count database and the Wyoming Oil and Gas Conservation Commission database of oil and gas well locations, we investigated the effectiveness of Wyoming's Core Areas, specifically: 1) how well Core Areas encompass the distribution of sage-grouse in Wyoming, 2) whether Core Area leks have a reduced probability of lek collapse, and 3) what edge effects, if any, intensification of oil and gas development adjacent to Core Areas may be having on Core Area populations. Core Areas contained 77% of male sage-grouse attending leks and 64% of active leks. Using Bayesian binomial probability analysis, we found an average 10.9% probability of lek collapse in Core Areas and an average 20.4% probability of lek collapse outside Core Areas. Using linear regression, we found development density outside Core Areas was related to the probability of lek collapse inside Core Areas. Specifically, probability of collapse among leks >4.83 km from inside Core Area boundaries was significantly related to well density within 1.61 km (1-mi) and 4.83 km (3-mi) outside of Core Area boundaries. Collectively, these data suggest that the Wyoming Core Area Strategy has benefited sage-grouse and sage-grouse habitat conservation; however, additional guidelines limiting development densities adjacent to Core Areas may be necessary to effectively protect Core Area populations.
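
    A minimal version of a Bayesian binomial estimate of collapse probability is the conjugate Beta-Binomial update sketched below; the prior choice and the lek counts are illustrative assumptions, not the Wyoming data or the authors' exact model.

    ```python
    from scipy.stats import beta

    def posterior_collapse(k_collapsed, n_leks, a=1.0, b=1.0):
        """Posterior for the collapse probability under a Beta(a, b) prior and binomial data."""
        post = beta(a + k_collapsed, b + n_leks - k_collapsed)
        return post.mean(), post.interval(0.95)  # posterior mean and 95% credible interval

    print(posterior_collapse(22, 200))  # e.g. leks inside Core Areas (counts are made up)
    print(posterior_collapse(41, 200))  # e.g. leks outside Core Areas (counts are made up)
    ```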

  9. Relation between social information processing and intimate partner violence in dating couples.

    PubMed

    Setchell, Sarah; Fritz, Patti Timmons; Glasgow, Jillian

    2017-07-01

    We used couple-level data to predict physical acts of intimate partner violence (IPV) from self-reported negative emotions and social information-processing (SIP) abilities among 100 dating couples (n = 200; mean age = 21.45 years). Participants read a series of hypothetical conflict situation vignettes and responded to questionnaires to assess negative emotions and various facets of SIP including attributions for partner behavior, generation of response alternatives, and response selection. We conducted a series of negative binomial mixed-model regressions based on the actor-partner interdependence model (APIM; Kenny, Kashy, & Cook, 2006, Dyadic data analysis. New York, NY: Guilford Press). There were significant results for the response generation and negative emotion models. Participants who generated fewer coping response alternatives were at greater risk of victimization (actor effect). Women were at greater risk of victimization if they had partners who generated fewer coping response alternatives (sex by partner interaction effect). Generation of less competent coping response alternatives predicted greater risk of perpetration among men, whereas generation of more competent coping response alternatives predicted greater risk of victimization among women (sex by actor interaction effects). Two significant actor by partner interaction effects were found for the negative emotion models. Participants who reported discrepant levels of negative emotions from their partners were at greatest risk of perpetration. Participants who reported high levels of negative emotions were at greatest risk of victimization if they had partners who reported low levels of negative emotions. This research has implications for researchers and clinicians interested in addressing the problem of IPV. Aggr. Behav. 43:329-341, 2017. © 2016 Wiley Periodicals, Inc.

  10. [Distribution of individuals by spontaneous frequencies of lymphocytes with micronuclei. Particularity and consequences].

    PubMed

    Serebrianyĭ, A M; Akleev, A V; Aleshchenko, A V; Antoshchina, M M; Kudriashova, O V; Riabchenko, N I; Semenova, L P; Pelevina, I I

    2011-01-01

    Using the cytochalasin B cytokinesis-block micronucleus (MN) assay, the mean frequency of blood lymphocytes with MN was determined in 76 Moscow inhabitants, 35 people from Obninsk and 122 from the Chelyabinsk region. In contrast to the distribution of individuals by the spontaneous frequency of cells with aberrations, which was shown to be binomial (Kusnetzov et al., 1980), the distribution of individuals by the spontaneous frequency of cells with MN in all three samples can be regarded as log-normal (chi2 test). The distribution of individuals in the pooled sample of Moscow and Obninsk inhabitants, and in the combined sample of all subjects examined, can be regarded as log-normal with high reliability (0.70 and 0.86, respectively), but it cannot be regarded as Poisson, binomial or normal. Taking into account that a log-normal distribution of children by the spontaneous frequency of lymphocytes with MN was also observed in a survey of 473 children from different kindergartens in Moscow, we conclude that log-normality is a regularity inherent in this type of damage to the lymphocyte genome. By contrast, the distribution of individuals by the frequency of lymphocytes with MN induced by in vitro irradiation must in most cases be regarded as normal. This distribution pattern suggests that the appearance of damage (genomic instability) in a single lymphocyte increases the probability of damage appearing in other lymphocytes of the same individual. We propose that damaged lymphocyte progenitor (stem) cells exchange information with undamaged cells, a process of the bystander-effect type. It can also be supposed that transmission of damage to daughter cells occurs at the time of stem cell division.

  11. Determinants of the geographic distribution of Puumala virus and Lyme borreliosis infections in Belgium

    PubMed Central

    Linard, Catherine; Lamarque, Pénélope; Heyman, Paul; Ducoffre, Geneviève; Luyasu, Victor; Tersago, Katrien; Vanwambeke, Sophie O; Lambin, Eric F

    2007-01-01

    Background Vector-borne and zoonotic diseases generally display clear spatial patterns due to different space-dependent factors. Land cover and land use influence disease transmission by controlling both the spatial distribution of vectors or hosts, and the probability of contact with susceptible human populations. The objective of this study was to combine environmental and socio-economic factors to explain the spatial distribution of two emerging human diseases in Belgium, Puumala virus (PUUV) and Lyme borreliosis. Municipalities were taken as units of analysis. Results Negative binomial regressions including a correction for spatial endogeneity show that the spatial distribution of PUUV and Lyme borreliosis infections are associated with a combination of factors linked to the vector and host populations, to human behaviours, and to landscape attributes. Both diseases are associated with the presence of forests, which are the preferred habitat for vector or host populations. The PUUV infection risk is higher in remote forest areas, where the level of urbanisation is low, and among low-income populations. The Lyme borreliosis transmission risk is higher in mixed landscapes with forests and spatially dispersed houses, mostly in wealthy peri-urban areas. The spatial dependence resulting from a combination of endogenous and exogenous processes could be accounted for in the model on PUUV but not for Lyme borreliosis. Conclusion A large part of the spatial variation in disease risk can be explained by environmental and socio-economic factors. The two diseases not only are most prevalent in different regions but also affect different groups of people. Combining these two criteria may increase the efficiency of information campaigns through appropriate targeting. PMID:17474974

  12. Racial and Ethnic Differences in Problem Gambling among College Students.

    PubMed

    Rinker, Dipali Venkataraman; Rodriguez, Lindsey M; Krieger, Heather; Tackett, Jennifer L; Neighbors, Clayton

    2016-06-01

    The college years are a formative period where the risk for development of problematic gambling is high. Research examining racial and ethnic differences in gambling behaviors has been limited and inconsistent. The aims of this study were to examine racial and ethnic differences in problem gambling among a large sample of college students. Undergraduates (N = 3058) from a large southern university completed an online screening questionnaire which included demographics, gambling frequency, gambling expenditure (i.e. money lost) in the previous 6 months, and the South Oaks Gambling Screen (SOGS). Negative binomial regression results indicated that Asian participants gambled less frequently than participants who were Caucasian or Hispanic/Latino(a), but spent more money than participants who were African-American (AA)/Black or Hispanic/Latino(a). A significantly larger proportion of Asian students met probable pathological gambling criteria (SOGS 5+; 7.8%) and at-risk gambling criteria (SOGS 3+; 16.3%) than Caucasian (5.2%; 10.1%), AA/Black (3.9%; 10.2%), or Hispanic/Latino(a) (3.6%; 9.4%) students. Additionally, a significantly larger proportion of Asian students endorsed problematic gambling indicators such as lying about losses, feeling guilty about gambling, feeling like they had a gambling problem, being criticized for their gambling, feeling like they couldn't stop gambling, losing time from school or work due to gambling, having a family history of problem gambling, and arguing with close others about their gambling than Caucasian, AA/Black or Hispanic/Latino(a) students. Results suggest that Asian students may be a high-risk sub-group of college gamblers, and that there is a critical need for targeted interventions for this population.

  13. Factors associated with falling in early, treated Parkinson's disease: The NET-PD LS1 cohort.

    PubMed

    Chou, Kelvin L; Elm, Jordan J; Wielinski, Catherine L; Simon, David K; Aminoff, Michael J; Christine, Chadwick W; Liang, Grace S; Hauser, Robert A; Sudarsky, Lewis; Umeh, Chizoba C; Voss, Tiffini; Juncos, Jorge; Fang, John Y; Boyd, James T; Bodis-Wollner, Ivan; Mari, Zoltan; Morgan, John C; Wills, Anne-Marie; Lee, Stephen L; Parashos, Sotirios A

    2017-06-15

    Recognizing the factors associated with falling in Parkinson's disease (PD) would improve identification of at-risk individuals. To examine frequency of falling and baseline characteristics associated with falling in PD using the National Institute of Neurological Disorders and Stroke (NINDS) Exploratory Trials in PD Long-term Study-1 (NET-PD LS-1) dataset. The LS-1 database included 1741 early treated PD subjects (median 4-year follow-up). Baseline characteristics were tested for a univariate association with post-baseline falling during the trial. Significant variables were included in a multivariable logistic regression model. A separate analysis using a negative binomial model investigated the effect of baseline factors on fall rate. 728 subjects (42%) fell during the trial, including at baseline. A baseline history of falls was the factor most associated with post-baseline falling. Men had lower odds of post-baseline falling compared to women, but for men, the probability of a post-baseline fall increased with age such that after age 70, men and women had similar odds of falling. Other baseline factors associated with a post-baseline fall and increased fall rate included the Unified PD Rating Scale (UPDRS) Activities of Daily Living (ADL) score, total functional capacity (TFC), baseline ambulatory capacity score and dopamine agonist monotherapy. Falls are common in early treated PD. The biggest risk factor for falls in PD remains a history of falling. Measures of functional ability (UPDRS ADL, TFC) and ambulatory capacity are novel clinical risk factors needing further study. A significant age by sex interaction may help to explain why age has been an inconsistent risk factor for falls in PD. Copyright © 2017 Elsevier B.V. All rights reserved.

  14. Offer of primary care services and detection of tuberculosis incidence in Brazil

    PubMed Central

    Pelissari, Daniele Maria; Bartholomay, Patricia; Jacobs, Marina Gasino; Arakaki-Sanchez, Denise; dos Anjos, Davllyn Santos Oliveira; Costa, Mara Lucia dos Santos; Cavalcanti, Pauline Cristine da Silva; Diaz-Quijano, Fredi Alexander

    2018-01-01

    OBJECTIVE To evaluate the association between the health services offered by primary care teams and the detection of new tuberculosis cases in Brazil. METHODS This was an ecological study covering all Brazilian municipalities that registered at least one new tuberculosis case (diagnosed from 2012 to 2014 and notified in the Information System of Notifiable Diseases) and with at least one primary care team evaluated by the second cycle of the National Program for Improving Access and Quality of Primary Care (PMAQ-AB). The variables of the PMAQ-AB were classified as proximal or distal, according to their relation with the tuberculosis diagnosis. Then, they were tested hierarchically in multiple models (adjusted by States) using negative binomial regression. RESULTS An increase of 10% in the primary health care coverage was associated with a decrease of 2.24% in the tuberculosis detection rate (95%CI -3.35– -1.11). Regarding the proximal variables in relation to diagnosis, in the multiple model, the detection of tuberculosis was associated with the proportion of teams that conduct contact investigation (increase in Incidence Rate Ratio [IRR] = 2.97%, 95%CI 2.41–3.53), carry out tuberculosis active case finding (increase in IRR = 2.17%, 95%CI 1.48–2.87), and request culture for mycobacteria (increase in IRR = 1.87%, 95%CI 0.98–2.76). CONCLUSIONS The variables related to the search actions were positively associated with the detection of new tuberculosis cases, which suggests a significant contribution to the strengthening of the sensitivity of the surveillance system. On the other hand, primary care coverage was inversely associated with the tuberculosis detection rate, which could represent the overall effect of the primary care on transmission control, probably from the identification and early treatment of cases. PMID:29791528

  15. Exploring lifetime occupational exposure and SLE flare: a patient-focussed pilot study

    PubMed Central

    Squance, Marline L; Guest, Maya; Reeves, Glenn; Attia, John; Bridgman, Howard

    2014-01-01

    Introduction Environmental effectors, such as ultraviolet radiation exposure, infection and stress, have been established as having a role in exacerbating lupus symptoms. However, unpredictable patterns of flare events still remain a mystery. Occupational effectors have also been suggested as having a contributing role; however, they are not widely researched. In this paper we report a pilot study designed to generate focus areas for future research regarding occupational exposures and systemic lupus erythematosus (SLE). Methods The study explored potential links between exposures and the occurrence of patient-reported flare events in 80 Australian women with SLE (classified by American College of Rheumatology (ACR) criteria). Specifically, the study assessed the hypothesis that occupational exposure is associated with significant changes in the likelihood of lupus flares. Lifetime employment history was analysed with the Finnish Job Exposure Matrix (FINJEM); 40 different semiquantified exposure class estimates for a wide range of occupations, based on probability of exposure (p≥5% = exposed), were analysed by constructing negative binomial regression models to test relationships between occupational agents and flare days. A backward stepwise elimination was used to generate a parsimonious model. Results Significant associations were noted for the exposure classes of manual handling burden (p=0.02, incidence rate ratio (IRR) 1.01), iron (p=0.00, IRR 1.37), wood dust (p=0.00, IRR 3.34) and asbestos (p=0.03, IRR 2.48). Conclusion Exposure assessment results indicated that occupations with a high manual handling burden, such as nursing, posed increased risk to patients with SLE; however, the greatest risk was associated with wood dust and iron exposure among teachers and specialist labourers. PMID:25379190

  16. Antibiotics taken for other illnesses and spontaneous clearance of Helicobacter pylori infection in children.

    PubMed

    Broussard, Cheryl S; Goodman, Karen J; Phillips, Carl V; Smith, Mary Ann; Fischbach, Lori A; Day, R Sue; Aragaki, Corinne C

    2009-08-01

    Factors that determine persistence of untreated Helicobacter pylori (H. pylori) infection in childhood are not well understood. We estimated risk differences for the effect of incidental antibiotic exposure on the probability of a detected clearance at the next test after an initial detected H. pylori infection. The Pasitos Cohort Study (1998-2005) investigated predictors of H. pylori infection in children from El Paso, Texas, and Juarez, Mexico. Children were screened for infection at 6-month target intervals from 6 to 84 months of age, using the 13C-urea breath test corrected for body-size-dependent variation in CO2 production. Exposure was defined as courses of any systemic antibiotic (systemic) or those with anti-H. pylori action (HP-effective) reported for the interval between initial detected infection and next test. Binomial regression models included country of residence, mother's education, adequacy of prenatal care, age at infection, and interval between tests. Of 205 children with a test result and antibiotic data following a detected infection, the number of children who took ≥1 course in the interval between tests was 74 for systemic and 33 for HP-effective. The proportion testing negative at the next test was 66% for 0 courses, 72% for ≥1 systemic course, and 79% for ≥1 HP-effective course. Adjusted risk differences (95%CI) for apparent clearance, comparing ≥1 to 0 courses, were 10% (1-20%) for systemic and 11% (0-21%) for HP-effective. Incidental antibiotic exposure appears to influence the duration of childhood H. pylori infection but seems to explain only a small portion of spontaneous clearance. Copyright 2009 John Wiley & Sons, Ltd.
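
    One simple way to obtain adjusted risk differences from binomial regression, in the spirit of the analysis above, is a binomial GLM with an identity link, as sketched below; the file and variable names are assumptions, and identity-link binomial models are only one of several estimation options.

    ```python
    import pandas as pd
    import statsmodels.api as sm
    import statsmodels.formula.api as smf

    df = pd.read_csv("intervals.csv")  # hypothetical file: one row per child test interval

    rd_model = smf.glm(
        "cleared ~ any_antibiotic + country + mother_education + prenatal_care"
        " + age_at_infection + interval_months",
        data=df,
        family=sm.families.Binomial(link=sm.families.links.Identity()),  # identity link: coefficients are risk differences
    ).fit()

    # e.g. a coefficient of about 0.10 on any_antibiotic would correspond to the 10% reported above.
    print(rd_model.params["any_antibiotic"])
    ```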

  17. Applications of asymptotic confidence intervals with continuity corrections for asymmetric comparisons in noninferiority trials.

    PubMed

    Soulakova, Julia N; Bright, Brianna C

    2013-01-01

    A large-sample problem of illustrating noninferiority of an experimental treatment relative to a referent treatment for binary outcomes is considered. The methods of illustrating noninferiority involve constructing the lower two-sided confidence bound for the difference between binomial proportions corresponding to the experimental and referent treatments and comparing it with the negative value of the noninferiority margin. The three considered methods, Anbar, Falk-Koch, and Reduced Falk-Koch, handle the comparison in an asymmetric way, that is, only the referent proportion out of the two, experimental and referent, is directly involved in the expression for the variance of the difference between two sample proportions. Five continuity corrections (including zero) are considered with respect to each approach. The key properties of the corresponding methods are evaluated via simulations. First, the uncorrected two-sided confidence intervals can, potentially, have smaller coverage probability than the nominal level even for moderately large sample sizes, for example, 150 per group. Next, the 15 testing methods are discussed in terms of their Type I error rate and power. In the settings with a relatively small referent proportion (about 0.4 or smaller), the Anbar approach with Yates' continuity correction is recommended for balanced designs and the Falk-Koch method with Yates' correction is recommended for unbalanced designs. For relatively moderate (about 0.6) and large (about 0.8 or greater) referent proportion, the uncorrected Reduced Falk-Koch method is recommended, although in this case, all methods tend to be over-conservative. These results are expected to be used in the design stage of a noninferiority study when asymmetric comparisons are envisioned. Copyright © 2013 John Wiley & Sons, Ltd.
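
    The noninferiority comparison described above can be illustrated with a generic Wald-type lower confidence bound for the difference in proportions, with an optional Yates-style continuity correction; the sketch below deliberately uses the ordinary two-sample variance, not the asymmetric Anbar or Falk-Koch variance forms, and the counts and margin are illustrative.

    ```python
    import math
    from scipy.stats import norm

    def noninferiority_lower_bound(x_exp, n_exp, x_ref, n_ref, level=0.95, cc=0.0):
        """Lower limit of the two-sided Wald CI for p_exp - p_ref, with continuity correction cc."""
        p_e, p_r = x_exp / n_exp, x_ref / n_ref
        se = math.sqrt(p_e * (1 - p_e) / n_exp + p_r * (1 - p_r) / n_ref)
        z = norm.ppf(0.5 + level / 2)
        return (p_e - p_r) - z * se - cc * (1 / n_exp + 1 / n_ref) / 2

    margin = 0.10  # illustrative noninferiority margin
    lb = noninferiority_lower_bound(130, 150, 132, 150, cc=1.0)  # cc=1.0 gives a Yates-style correction
    print(lb, lb > -margin)  # noninferiority is shown if the lower bound exceeds -margin
    ```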

  18. Substance Use, Depression and Sociodemographic Determinants of HIV Sexual Risk Behavior in Outpatient Substance Abuse Treatment Patients.

    PubMed

    Tross, Susan; Feaster, Daniel J; Thorens, Gabriel; Duan, Rui; Gomez, Zoilyn; Pavlicova, Martina; Hu, Mei Chen; Kyle, Tiffany; Erickson, Sarah; Spector, Anya; Haynes, Louise; Metsch, Lisa R

    2015-01-01

    The NIDA Clinical Trials Network trial of rapid HIV testing/counseling in 1281 patients was a unique opportunity to examine relationships among substance use, depressive symptoms, and sex risk behavior. Past 6-month substance use; substance use severity (Drug Abuse Screening Test - 10); depressive symptoms (Quick Inventory of Depressive Symptomatology); and three types of sex risk behavior (unprotected sex occasions [USOs] with primary partners; USOs with nonprimary partners; and USOs while high/drunk) were assessed. Zero-inflated negative binomial analyses provided both the probability of engaging in sex risk behavior and the rate of sex risk behavior (within the subsample reporting risk behavior). Levels of sexual risk behavior were high, though they varied across the three types of sex risk behaviors. Among the patients, 50.4% had engaged in USOs with primary partners, 42% in sex while drunk or high, and 23.8% in USOs with nonprimary partners. Similar factors were significantly associated with all three types of sex risk behaviors. For all types, problem drinking, cocaine use, and substance use severity had an exacerbating effect. Older age was associated with lower risk behavior; other relationship categories (eg, married, separated/divorced, cohabitating) were associated with greater risk behavior than was single status. Depressive symptoms were associated with decreased likelihood of USOs with a primary partner. Sexual risk behavior is common among individuals in outpatient substance abuse treatment. Results highlight the roles of problem drinking (eg, up to a three-fold increase) and cocaine use (eg, up to a two-fold increase) in increasing sex risk behavior. They demonstrate the utility of distinguishing between partner types and presence/absence of alcohol/drugs during sex. Findings argue for the need to integrate sex risk reduction into drug treatment.
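
    A minimal sketch of a zero-inflated negative binomial model of the kind described above follows; the inflation (logit) part models the probability of reporting no risk behavior, and the count part models the rate among those at risk. The file and column names are assumptions.

    ```python
    import pandas as pd
    import statsmodels.api as sm

    df = pd.read_csv("baseline.csv")  # hypothetical analysis file

    y = df["uso_count"]  # count of unprotected sex occasions
    X = sm.add_constant(df[["problem_drinking", "cocaine_use", "dast10", "qids", "age"]])

    zinb = sm.ZeroInflatedNegativeBinomialP(y, X, exog_infl=X, inflation="logit").fit(maxiter=200)
    # "inflate_" rows model the probability of a structural zero (no risk behavior);
    # the remaining rows model the count part, where exp(coefficient) is a rate ratio.
    print(zinb.summary())
    ```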

  19. Epidemiology of enteric disease in C-EnterNet’s pilot site – Waterloo region, Ontario, 1990 to 2004

    PubMed Central

    Keegan, Victoria A; Majowicz, Shannon E; Pearl, David L; Marshall, Barbara J; Sittler, Nancy; Knowles, Lewinda; Wilson, Jeffery B

    2009-01-01

    OBJECTIVE: The objective of the present study was to describe the epidemiology of reportable enteric illness in Ontario’s Waterloo region, including comparing calculated incidence rates with published rates, and adjusting for under-reporting to determine the number of community cases, where published data were available. METHODS: Descriptive analyses were performed on reportable disease data for 13 enteric diseases collected in the Waterloo region from 1990 to 2004. Poisson and negative binomial regression analyses were used to investigate differences in incidence rates among age, sex, the 15 years of data and seasons. Disease-specific incidence rates were calculated and compared with the literature, where possible. Under-reporting ratios from the literature were applied to estimate the number of cases of campylobacteriosis, nontyphoidal salmonellosis and verotoxigenic Escherichia coli infection at the community level. RESULTS: Over the study period, the average annual age- and sex-adjusted incidence rates per 100,000 population were highest for campylobacteriosis (49.69 cases), followed by giardiasis (31.87 cases) and nontyphoidal salmonellosis (25.97 cases). The incidence of most enteric illnesses peaked in the summer. The highest incidence occurred in young children, followed by adults in their 20s. Diarrhea (85.4%) was the most frequently reported symptom, food (57.4%) was the most commonly reported probable source, and home (41.7%) and travel (37.0%) were the two most frequently reported risk settings. CONCLUSIONS: Enteric illness was a significant health burden in the Waterloo region from 1990 to 2004. Because reportable disease data are subject to under-reporting, it is likely that the true burden is greater than estimated in the present study. PMID:20808465

  20. Factors associated with falls among older adults living in institutions

    PubMed Central

    2013-01-01

    Background Falls have an enormous impact on older adults. Yet, there is insufficient evidence regarding the effectiveness of preventive interventions in this setting. The objectives were to measure the frequency of falls and associated factors among older people living in institutions. Methods Data were obtained from a survey on a probabilistic sample of residents aged ≥65 years, drawn in 1998-99 from institutions of Madrid (Spain). Residents, their caregivers, and facility physicians were interviewed. Fall rates were computed based on the number of physician-reported falls in the preceding 30 days. Adjusted rate ratios were computed using negative binomial regression models, including age, sex, cognitive status, functional dependence, number of diseases, and polypharmacy. Results The final sample comprised 733 residents. The fall rate was 2.4 falls per person-year (95% confidence interval [CI], 2.04-2.82). The strongest risk factor was number of diseases, with an adjusted rate ratio (RR) of 1.32 (95% CI, 1.17-1.50) for each additional diagnosis. Other variables associated with falls were: urinary incontinence (RR = 2.56 [95% CI, 1.32-4.94]); antidepressant use (RR = 2.32 [95% CI, 1.22-4.40]); arrhythmias (RR = 2.00 [95% CI, 1.05-3.81]); and polypharmacy (RR = 1.07 [95% CI, 0.95-1.21], for each additional medication). The attributable fraction for number of diseases (with reference to those with ≤ 1 condition) was 84% (95% CI, 45-95%). Conclusions Number of diseases was the main risk factor for falls in this population of institutionalized older adults. Other variables associated with falls, probably more amenable to preventive action, were urinary incontinence, antidepressants, arrhythmias, and polypharmacy. Virtual slides The virtual slide(s) for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/3916151157277337 PMID:23320746
