ERIC Educational Resources Information Center
Livingston, Samuel A.; Chen, Haiwen H.
2015-01-01
Quantitative information about test score reliability can be presented in terms of the distribution of equated scores on an alternate form of the test for test takers with a given score on the form taken. In this paper, we describe a procedure for estimating that distribution, for any specified score on the test form taken, by estimating the joint…
ERIC Educational Resources Information Center
Reardon, Sean F.; Shear, Benjamin R.; Castellano, Katherine E.; Ho, Andrew D.
2017-01-01
Test score distributions of schools or demographic groups are often summarized by frequencies of students scoring in a small number of ordered proficiency categories. We show that heteroskedastic ordered probit (HETOP) models can be used to estimate means and standard deviations of multiple groups' test score distributions from such data. Because…
ERIC Educational Resources Information Center
Kim, Seonghoon
2013-01-01
With known item response theory (IRT) item parameters, Lord and Wingersky provided a recursive algorithm for computing the conditional frequency distribution of number-correct test scores, given proficiency. This article presents a generalized algorithm for computing the conditional distribution of summed test scores involving real-number item…
Ho, Andrew D; Yu, Carol C
2015-06-01
Many statistical analyses benefit from the assumption that unconditional or conditional distributions are continuous and normal. More than 50 years ago in this journal, Lord and Cook chronicled departures from normality in educational tests, and Micerri similarly showed that the normality assumption is met rarely in educational and psychological practice. In this article, the authors extend these previous analyses to state-level educational test score distributions that are an increasingly common target of high-stakes analysis and interpretation. Among 504 scale-score and raw-score distributions from state testing programs from recent years, nonnormal distributions are common and are often associated with particular state programs. The authors explain how scaling procedures from item response theory lead to nonnormal distributions as well as unusual patterns of discreteness. The authors recommend that distributional descriptive statistics be calculated routinely to inform model selection for large-scale test score data, and they illustrate consequences of nonnormality using sensitivity studies that compare baseline results to those from normalized score scales.
The Dynamics of the Evolution of the Black-White Test Score Gap
ERIC Educational Resources Information Center
Sohn, Kitae
2012-01-01
We apply a quantile version of the Oaxaca-Blinder decomposition to estimate the counterfactual distribution of the test scores of Black students. In the Early Childhood Longitudinal Study, Kindergarten Class of 1998-1999 (ECLS-K), we find that the gap initially appears only at the top of the distribution of test scores. As children age, however,…
The Influence of an NCLB Accountability Plan on the Distribution of Student Test Score Gains
ERIC Educational Resources Information Center
Springer, Matthew G.
2008-01-01
Previous research on the effect of accountability programs on the distribution of student test score gains is decidedly mixed. This study examines the issue by estimating an educational production function in which test score gains are a function of the incentives schools have to focus instruction on below-proficient students. NCLB's threat of…
Univariate and Bivariate Loglinear Models for Discrete Test Score Distributions.
ERIC Educational Resources Information Center
Holland, Paul W.; Thayer, Dorothy T.
2000-01-01
Applied the theory of exponential families of distributions to the problem of fitting the univariate histograms and discrete bivariate frequency distributions that often arise in the analysis of test scores. Considers efficient computation of the maximum likelihood estimates of the parameters using Newton's Method and computationally efficient…
Derivation and Applicability of Asymptotic Results for Multiple Subtests Person-Fit Statistics
Albers, Casper J.; Meijer, Rob R.; Tendeiro, Jorge N.
2016-01-01
In high-stakes testing, it is important to check the validity of individual test scores. Although a test may, in general, result in valid test scores for most test takers, for some test takers, test scores may not provide a good description of a test taker’s proficiency level. Person-fit statistics have been proposed to check the validity of individual test scores. In this study, the theoretical asymptotic sampling distribution of two person-fit statistics that can be used for tests that consist of multiple subtests is first discussed. Second, simulation study was conducted to investigate the applicability of this asymptotic theory for tests of finite length, in which the correlation between subtests and number of items in the subtests was varied. The authors showed that these distributions provide reasonable approximations, even for tests consisting of subtests of only 10 items each. These results have practical value because researchers do not have to rely on extensive simulation studies to simulate sampling distributions. PMID:29881053
Brown, Zachary M; Gibbs, Jenna C; Adachi, Jonathan D; Ashe, Maureen C; Hill, Keith D; Kendler, David L; Khan, Aliya; Papaioannou, Alexandra; Prasad, Sadhana; Wark, John D; Giangregorio, Lora M
2017-11-28
We sought to evaluate the Balance Outcome Measure for Elder Rehabilitation (BOOMER) in community-dwelling women 65 years and older with vertebral fracture and to describe score distributions and potential ceiling and floor effects. This was a secondary data analysis of baseline data from the Build Better Bones with Exercise randomized controlled trial using the BOOMER. A total of 141 women with osteoporosis and radiographically confirmed vertebral fracture were included. Concurrent validity and internal consistency were assessed in comparison to the Short Physical Performance Battery (SPPB). Normality and ceiling/floor effects of total BOOMER scores and component test items were also assessed. Exploratory analyses of assistive aid use and falls history were performed. Tests for concurrent validity demonstrated moderate correlation between total BOOMER and SPPB scores. The BOOMER component tests showed modest internal consistency. Substantial ceiling effect and nonnormal score distributions were present among overall sample and those not using assistive aids for total BOOMER scores, although scores were normally distributed for those using assistive aids. The static standing with eyes closed test demonstrated the greatest ceiling effects of the component tests, with 92% of participants achieving a maximal score. While the BOOMER compares well with the SPPB in community-dwelling women with vertebral fractures, researchers or clinicians considering using the BOOMER in similar or higher-functioning populations should be aware of the potential for ceiling effects.
Comparison of Program Effects: The Use of Mastery Scores.
ERIC Educational Resources Information Center
Yeh, Jennie P.; Moy, Raymond
The setting of a cut-off score on a mastery test usually involves a consideration of one or more of the following elements: (1) the distribution of observed test scores; (2) the type of mastery criterion used; (3) the level of acceptable risks of mis-classification; (4) the loss of functions of mis-classifications; and (5) the distribution of true…
Standard Errors and Confidence Intervals of Norm Statistics for Educational and Psychological Tests.
Oosterhuis, Hannah E M; van der Ark, L Andries; Sijtsma, Klaas
2016-11-14
Norm statistics allow for the interpretation of scores on psychological and educational tests, by relating the test score of an individual test taker to the test scores of individuals belonging to the same gender, age, or education groups, et cetera. Given the uncertainty due to sampling error, one would expect researchers to report standard errors for norm statistics. In practice, standard errors are seldom reported; they are either unavailable or derived under strong distributional assumptions that may not be realistic for test scores. We derived standard errors for four norm statistics (standard deviation, percentile ranks, stanine boundaries and Z-scores) under the mild assumption that the test scores are multinomially distributed. A simulation study showed that the standard errors were unbiased and that corresponding Wald-based confidence intervals had good coverage. Finally, we discuss the possibilities for applying the standard errors in practical test use in education and psychology. The procedure is provided via the R function check.norms, which is available in the mokken package.
Exact calculation of distributions on integers, with application to sequence alignment.
Newberg, Lee A; Lawrence, Charles E
2009-01-01
Computational biology is replete with high-dimensional discrete prediction and inference problems. Dynamic programming recursions can be applied to several of the most important of these, including sequence alignment, RNA secondary-structure prediction, phylogenetic inference, and motif finding. In these problems, attention is frequently focused on some scalar quantity of interest, a score, such as an alignment score or the free energy of an RNA secondary structure. In many cases, score is naturally defined on integers, such as a count of the number of pairing differences between two sequence alignments, or else an integer score has been adopted for computational reasons, such as in the test of significance of motif scores. The probability distribution of the score under an appropriate probabilistic model is of interest, such as in tests of significance of motif scores, or in calculation of Bayesian confidence limits around an alignment. Here we present three algorithms for calculating the exact distribution of a score of this type; then, in the context of pairwise local sequence alignments, we apply the approach so as to find the alignment score distribution and Bayesian confidence limits.
Li, Leah
2012-01-01
Summary Studies of cognitive development in children are often based on tests designed for specific ages. Examination of the changes of these scores over time may not be meaningful. This paper investigates the influence of early life factors on cognitive development using maths and reading test scores at ages 7, 11, and 16 years in a British birth cohort born in 1958. The distributions of these test scores differ between ages, for example, 20% participants scored the top mark in the reading test at 7 and the distribution of reading score at 16 is heavily skewed. In this paper, we group participants into 5 ordered categories, approximately 20% in each category according to their test scores at each age. Multilevel models for a repeated ordinal outcome are applied to relate the ordinal scale of maths and reading ability to early life factors. PMID:22661923
The Probability of Obtaining Two Statistically Different Test Scores as a Test Index
ERIC Educational Resources Information Center
Muller, Jorg M.
2006-01-01
A new test index is defined as the probability of obtaining two randomly selected test scores (PDTS) as statistically different. After giving a concept definition of the test index, two simulation studies are presented. The first analyzes the influence of the distribution of test scores, test reliability, and sample size on PDTS within classical…
ERIC Educational Resources Information Center
Meijer, Rob R.
2004-01-01
Two new methods have been proposed to determine unexpected sum scores on sub-tests (testlets) both for paper-and-pencil tests and computer adaptive tests. A method based on a conservative bound using the hypergeometric distribution, denoted p, was compared with a method where the probability for each score combination was calculated using a…
Real Time Cockpit Resource Management (CRM) Training
2010-10-01
to post-test. Table 4 Learning Scores for the Five Spiral 1 Classes Spiral 1 Class Pilots Sensors Pretest Posttest Difference Pretest Posttest ...results from the five Spiral 1 classes. Table 6 Pretest / Posttest Gain Scores Associated with Each Learning Test Item Test Item Class Item...SMALL BUSINESS INNOVATION RESEARCH (SBIR) PHASE II REPORT. Distribution A: Approved for public release; distribution unlimited. (Approval given
A Nonparametric Framework for Comparing Trends and Gaps across Tests
ERIC Educational Resources Information Center
Ho, Andrew Dean
2009-01-01
Problems of scale typically arise when comparing test score trends, gaps, and gap trends across different tests. To overcome some of these difficulties, test score distributions on the same score scale can be represented by nonparametric graphs or statistics that are invariant under monotone scale transformations. This article motivates and then…
Kernel Equating Under the Non-Equivalent Groups With Covariates Design
Bränberg, Kenny
2015-01-01
When equating two tests, the traditional approach is to use common test takers and/or common items. Here, the idea is to use variables correlated with the test scores (e.g., school grades and other test scores) as a substitute for common items in a non-equivalent groups with covariates (NEC) design. This is performed in the framework of kernel equating and with an extension of the method developed for post-stratification equating in the non-equivalent groups with anchor test design. Real data from a college admissions test were used to illustrate the use of the design. The equated scores from the NEC design were compared with equated scores from the equivalent group (EG) design, that is, equating with no covariates as well as with equated scores when a constructed anchor test was used. The results indicate that the NEC design can produce lower standard errors compared with an EG design. When covariates were used together with an anchor test, the smallest standard errors were obtained over a large range of test scores. The results obtained, that an EG design equating can be improved by adjusting for differences in test score distributions caused by differences in the distribution of covariates, are useful in practice because not all standardized tests have anchor tests. PMID:29881012
Kernel Equating Under the Non-Equivalent Groups With Covariates Design.
Wiberg, Marie; Bränberg, Kenny
2015-07-01
When equating two tests, the traditional approach is to use common test takers and/or common items. Here, the idea is to use variables correlated with the test scores (e.g., school grades and other test scores) as a substitute for common items in a non-equivalent groups with covariates (NEC) design. This is performed in the framework of kernel equating and with an extension of the method developed for post-stratification equating in the non-equivalent groups with anchor test design. Real data from a college admissions test were used to illustrate the use of the design. The equated scores from the NEC design were compared with equated scores from the equivalent group (EG) design, that is, equating with no covariates as well as with equated scores when a constructed anchor test was used. The results indicate that the NEC design can produce lower standard errors compared with an EG design. When covariates were used together with an anchor test, the smallest standard errors were obtained over a large range of test scores. The results obtained, that an EG design equating can be improved by adjusting for differences in test score distributions caused by differences in the distribution of covariates, are useful in practice because not all standardized tests have anchor tests.
ERIC Educational Resources Information Center
Moses, Tim; Oh, Hyeonjoo J.
2009-01-01
Pseudo Bayes probability estimates are weighted averages of raw and modeled probabilities; these estimates have been studied primarily in nonpsychometric contexts. The purpose of this study was to evaluate pseudo Bayes probability estimates as applied to the estimation of psychometric test score distributions and chained equipercentile equating…
Is the NIHSS Certification Process Too Lenient?
Hills, Nancy K.; Josephson, S. Andrew; Lyden, Patrick D.; Johnston, S. Claiborne
2009-01-01
Background and Purpose The National Institutes of Health Stroke Scale (NIHSS) is a widely used measure of neurological function in clinical trials and patient assessment; inter-rater scoring variability could impact communications and trial power. The manner in which the rater certification test is scored yields multiple correct answers that have changed over time. We examined the range of possible total NIHSS scores from answers given in certification tests by over 7,000 individual raters who were certified. Methods We analyzed the results of all raters who completed one of two standard multiple-patient videotaped certification examinations between 1998 and 2004. The range for the correct score, calculated using NIHSS ‘correct answers’, was determined for each patient. The distribution of scores derived from those who passed the certification test then was examined. Results A total of 6,268 raters scored 5 patients on Test 1; 1,240 scored 6 patients on Test 2. Using a National Stroke Association (NSA) answer key, we found that correct total scores ranged from 2 correct scores to as many as 12 different correct total scores. Among raters who achieved a passing score and were therefore qualified to administer the NIHSS, score distributions were even wider, with 1 certification patient receiving 18 different correct total scores. Conclusions Allowing multiple acceptable answers for questions on the NIHSS certification test introduces scoring variability. It seems reasonable to assume that the wider the range of acceptable answers in the certification test, the greater the variability in the performance of the test in trials and clinical practice by certified examiners. Greater consistency may be achieved by deriving a set of ‘best’ answers through expert consensus on all questions where this is possible, then teaching raters how to derive these answers using a required interactive training module. PMID:19295205
ERIC Educational Resources Information Center
Puhan, Gautam; vonDavier, Alina; Gupta, Shaloo
2008-01-01
Equating under the external anchor design is frequently conducted using scaled scores on the anchor test. However, scaled scores often lead to the unique problem of creating zero frequencies in the score distribution because there may not always be a one-to-one correspondence between raw and scaled scores. For example, raw scores of 17 and 18 may…
Asymptotic Standard Errors of Observed-Score Equating with Polytomous IRT Models
ERIC Educational Resources Information Center
Andersson, Björn
2016-01-01
In observed-score equipercentile equating, the goal is to make scores on two scales or tests measuring the same construct comparable by matching the percentiles of the respective score distributions. If the tests consist of different items with multiple categories for each item, a suitable model for the responses is a polytomous item response…
Small-Sample Equating with Prior Information. Research Report. ETS RR-09-25
ERIC Educational Resources Information Center
Livingston, Samuel A.; Lewis, Charles
2009-01-01
This report proposes an empirical Bayes approach to the problem of equating scores on test forms taken by very small numbers of test takers. The equated score is estimated separately at each score point, making it unnecessary to model either the score distribution or the equating transformation. Prior information comes from equatings of other…
ERIC Educational Resources Information Center
Kang, Che Chang
2014-01-01
The study aimed at investigating TOEIC score distribution patterns and learner satisfaction in an intensive TOEIC course and drew implications for pedagogical practice. A one-group pre-test post-test experiment and a survey on learner satisfaction were conducted on Taiwanese college EFL students (n = 50) in a case study. Results showed that the…
Integral criteria for large-scale multiple fingerprint solutions
NASA Astrophysics Data System (ADS)
Ushmaev, Oleg S.; Novikov, Sergey O.
2004-08-01
We propose the definition and analysis of the optimal integral similarity score criterion for large scale multmodal civil ID systems. Firstly, the general properties of score distributions for genuine and impostor matches for different systems and input devices are investigated. The empirical statistics was taken from the real biometric tests. Then we carry out the analysis of simultaneous score distributions for a number of combined biometric tests and primary for ultiple fingerprint solutions. The explicit and approximate relations for optimal integral score, which provides the least value of the FRR while the FAR is predefined, have been obtained. The results of real multiple fingerprint test show good correspondence with the theoretical results in the wide range of the False Acceptance and the False Rejection Rates.
Disaggregated Effects of Device on Score Comparability
ERIC Educational Resources Information Center
Davis, Laurie; Morrison, Kristin; Kong, Xiaojing; McBride, Yuanyuan
2017-01-01
The use of tablets for large-scale testing programs has transitioned from concept to reality for many state testing programs. This study extended previous research on score comparability between tablets and computers with high school students to compare score distributions across devices for reading, math, and science and to evaluate device…
ERIC Educational Resources Information Center
Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D.
2017-01-01
There is no comprehensive database of U.S. district-level test scores that is comparable across states. We describe and evaluate a method for constructing such a database. First, we estimate linear, reliability-adjusted linking transformations from state test score scales to the scale of the National Assessment of Educational Progress (NAEP). We…
ERIC Educational Resources Information Center
Moses, Tim; Liu, Jinghua
2011-01-01
In equating research and practice, equating functions that are smooth are typically assumed to be more accurate than equating functions with irregularities. This assumption presumes that population test score distributions are relatively smooth. In this study, two examples were used to reconsider common beliefs about smoothing and equating. The…
RAId_DbS: Peptide Identification using Database Searches with Realistic Statistics
Alves, Gelio; Ogurtsov, Aleksey Y; Yu, Yi-Kuo
2007-01-01
Background The key to mass-spectrometry-based proteomics is peptide identification. A major challenge in peptide identification is to obtain realistic E-values when assigning statistical significance to candidate peptides. Results Using a simple scoring scheme, we propose a database search method with theoretically characterized statistics. Taking into account possible skewness in the random variable distribution and the effect of finite sampling, we provide a theoretical derivation for the tail of the score distribution. For every experimental spectrum examined, we collect the scores of peptides in the database, and find good agreement between the collected score statistics and our theoretical distribution. Using Student's t-tests, we quantify the degree of agreement between the theoretical distribution and the score statistics collected. The T-tests may be used to measure the reliability of reported statistics. When combined with reported P-value for a peptide hit using a score distribution model, this new measure prevents exaggerated statistics. Another feature of RAId_DbS is its capability of detecting multiple co-eluted peptides. The peptide identification performance and statistical accuracy of RAId_DbS are assessed and compared with several other search tools. The executables and data related to RAId_DbS are freely available upon request. PMID:17961253
ERIC Educational Resources Information Center
Haertel, Edward
2013-01-01
In validating uses of testing, it is helpful to distinguish those that rely directly on the information provided by scores or score distributions ("direct" uses and consequences) versus those that instead capitalize on the motivational effects of testing, or use testing and test reporting to shape public opinion ("indirect" uses and consequences).…
Distribution of Model-based Multipoint Heterogeneity Lod Scores
Xing, Chao; Morris, Nathan; Xing, Guan
2011-01-01
The distribution of two-point heterogeneity lod scores (HLOD) has been intensively investigated because the conventional χ2 approximation to the likelihood ratio test is not directly applicable. However, there was no study investigating the distribution of the multipoint HLOD despite its wide application. Here we want to point out that, compared with the two-point HLOD, the multipoint HLOD essentially tests for homogeneity given linkage and follows a relatively simple limiting distribution 12χ02+12χ12, which can be obtained by established statistical theory. We further examine the theoretical result by simulation studies. PMID:21104892
An Empirical Comparison of Two-Stage and Pyramidal Adaptive Ability Testing.
ERIC Educational Resources Information Center
Larkin, Kevin C.; Weiss, David J.
A 15-stage pyramidal test and a 40-item two-stage test were constructed and administered by computer to 111 college undergraduates. The two-stage test was found to utilize a smaller proportion of its potential score range than the pyramidal test. Score distributions for both tests were positively skewed but not significantly different from the…
Changes in Texas Universities’ Applicant Pools after the Hopwood Decision
Long, Mark C.; Tienda, Marta
2012-01-01
This paper evaluates how the distribution of applicant and enrollee attributes at seven Texas universities changed after the Hopwood decision and the implementation of a policy guaranteeing admission to students with high class ranks. We analyze changes in the distributions of test scores and high school class ranks for underrepresented minority groups as well as white and Asian American applicants across institutions and between admission regimes. We show that these admissions policy changes, which have direct effects on only the most selective institutions, have substantial indirect effects at other institutions. Average test scores of applicants to less selective institutions rose following the change in admission criteria, as students with high test scores who did not qualify for the admission guarantee applied to a broader set of institutions. Furthermore, as the share of high rank applicants at UT-Austin rose, the pre-Hopwood assent in the test scores of their applicants stagnated. PMID:23335823
Multiple-Choice Test Bias Due to Answering Strategy Variation.
ERIC Educational Resources Information Center
Frary, Robert B.; Giles, Mary B.
This paper describes the development and investigation of a new approach to determining the existence of bias in multiple-choice test scores. Previous work in this area has concentrated almost exclusively on bias attributable to specific test items or to differences in test score distributions across racial or ethnic groups. In contrast, the…
Observed-Score Equating as a Test Assembly Problem.
ERIC Educational Resources Information Center
van der Linden, Wim J.; Luecht, Richard M.
1998-01-01
Derives a set of linear conditions of item-response functions that guarantees identical observed-score distributions on two test forms. The conditions can be added as constraints to a linear programming model for test assembly. An example illustrates the use of the model for an item pool from the Law School Admissions Test (LSAT). (SLD)
Federal Register 2010, 2011, 2012, 2013, 2014
2011-09-27
... with FINRA's practice of including ``pre-test'' questions on certain qualification examinations, which... scoring purposes, each examination includes 10 additional, unidentified pre-test questions that do not... of which are scored. The 10 pre-test questions are randomly distributed throughout the examination...
A Bayesian Method for Evaluating Passing Scores: The PPoP Curve
ERIC Educational Resources Information Center
Wainer, Howard; Wang, X. A.; Skorupski, William P.; Bradlow, Eric T.
2005-01-01
In this note, we demonstrate an interesting use of the posterior distributions (and corresponding posterior samples of proficiency) that are yielded by fitting a fully Bayesian test scoring model to a complex assessment. Specifically, we examine the efficacy of the test in combination with the specific passing score that was chosen through expert…
Scoring in genetically modified organism proficiency tests based on log-transformed results.
Thompson, Michael; Ellison, Stephen L R; Owen, Linda; Mathieson, Kenneth; Powell, Joanne; Key, Pauline; Wood, Roger; Damant, Andrew P
2006-01-01
The study considers data from 2 UK-based proficiency schemes and includes data from a total of 29 rounds and 43 test materials over a period of 3 years. The results from the 2 schemes are similar and reinforce each other. The amplification process used in quantitative polymerase chain reaction determinations predicts a mixture of normal, binomial, and lognormal distributions dominated by the latter 2. As predicted, the study results consistently follow a positively skewed distribution. Log-transformation prior to calculating z-scores is effective in establishing near-symmetric distributions that are sufficiently close to normal to justify interpretation on the basis of the normal distribution.
Distribution of model-based multipoint heterogeneity lod scores.
Xing, Chao; Morris, Nathan; Xing, Guan
2010-12-01
The distribution of two-point heterogeneity lod scores (HLOD) has been intensively investigated because the conventional χ(2) approximation to the likelihood ratio test is not directly applicable. However, there was no study investigating th e distribution of the multipoint HLOD despite its wide application. Here we want to point out that, compared with the two-point HLOD, the multipoint HLOD essentially tests for homogeneity given linkage and follows a relatively simple limiting distribution ½χ²₀+ ½χ²₁, which can be obtained by established statistical theory. We further examine the theoretical result by simulation studies. © 2010 Wiley-Liss, Inc.
Utility of TICS-M for the assessment of cognitive function in older adults.
de Jager, Celeste A; Budge, Marc M; Clarke, Robert
2003-04-01
Routine screening of high-risk elderly people for early cognitive impairment is constrained by the limitations of currently available cognitive function tests. The Telephone Interview of Cognitive Status is a novel instrument for assessment of cognitive function that can be administered in person or by telephone. To evaluate the determinants and utility of TICS-M (13-item modified version) for assessment of cognitive function in healthy elderly people. The utility of TICS-M was compared with more widely used MMSE and CAMCOG in a cross-sectional survey of 120 older (62 to 89 years) UK adults. The TICS-M cognitive test scores (27.97, SD 4.15) were normally distributed in contrast with those for MMSE and CAMCOG that had a negatively skewed distribution. TICS-M scores were inversely correlated with age (r = -0.21) and with the NART fullscale IQ (r = -0.35), but were independent of years of education in this cohort. TICS-M was highly correlated with MMSE (r = 0.57) and with CAMCOG (r = 0.62) scores. The time required to complete the test is comparable to MMSE and substantially less than CAMCOG. The normal distribution of TICS-M test scores suggest that this test is less constrained by the ceiling effect which limits the utility of MMSE and CAMCOG test scores in detecting early cognitive impairment. TICS-M is an appropriate instrument to assess cognitive function in both research and in clinical practice. Copyright 2003 John Wiley & Sons, Ltd.
A Note on the Use of the Hiskey-Nebraska Test of Learning Aptitude with Deaf Children.
ERIC Educational Resources Information Center
Watson, Betty U.; Goldgar, David E.
1985-01-01
Comparing distribution of scores on the Hiskey-Nebraska Test of Learning Aptitude (H-NTLA) with those from the Wechsler Performance Scales for 71 hearing impaired Ss revealed a correlation of .85. However, the H-NTLA yielded more Ss with extreme scores. Findings stress the need for caution in interpreting extreme H-NTLA scores. (CL)
Alternative Statistical Frameworks for Student Growth Percentile Estimation
ERIC Educational Resources Information Center
Lockwood, J. R.; Castellano, Katherine E.
2015-01-01
This article suggests two alternative statistical approaches for estimating student growth percentiles (SGP). The first is to estimate percentile ranks of current test scores conditional on past test scores directly, by modeling the conditional cumulative distribution functions, rather than indirectly through quantile regressions. This would…
A Brief Report on How Impossible Scores Affect Smoothing and Equating
ERIC Educational Resources Information Center
Puhan, Gautam; von Davier, Alina A.; Gupta, Shaloo
2010-01-01
Equating under the external anchor design is frequently conducted using scaled scores on the anchor test. However, scaled scores often lead to the unique problem of creating zero frequencies in the score distribution because there may not always be a one-to-one correspondence between raw and scaled scores. For example, raw scores of 17 and 18 may…
ERIC Educational Resources Information Center
Wilcox, Rand R.
A mastery test is frequently described as follows: an examinee responds to n dichotomously scored test items. Depending upon the examinee's observed (number correct) score, a mastery decision is made and the examinee is advanced to the next level of instruction. Otherwise, a nonmastery decision is made and the examinee is given remedial work. This…
Müller, Matthias Johannes; Cabanel, Nicole; Olschinski, Christiane; Jochim, Dorothee; Kundermann, Bernd
2015-01-01
The individual's chronotype is regarded as rather stable trait with substantial heritability and normal distribution of the "morningness-eveningness" dimension in the general population. Eveningness has been related to the risk of developing affective, particularly depressive, disorders. However, age and other sociobiological factors may influence chronotypes. The present study investigated the distribution, stability, and clinical correlates of chronotype and morningness-eveningness in hospitalized patients with affective disorder. Chronotype was assessed with the morningness-eveningness questionnaire (MEQ) in 93 patients with nonseasonal depressive syndrome (85% major depression; 15% depressive adjustment disorder) after admission, and in 19 patients again before discharge. Distribution, stability and correlations of MEQ scores with clinical variables were calculated. Additionally, a literature analysis of chronotype distributions in samples of nondepressed persons and patients with nonseasonal depression was carried out. MEQ scores (mean 49 ± 11, range 23-75, higher scores indicate morningness) in 93 acutely depressed inpatients (age 41 ± 14 years, range 18-75 years; 63% women; hospitalization 48 ± 22 days; BDI-II 32 ± 11) were normally distributed (Shapiro-Wilk test; W = 0.993, p = 0.920) with 59.1% intermediate types, 19.4% evening types, and 21.5% morning types. MEQ change scores from admission to discharge were nonsignificant (-1.3 ± 5.0; paired t-test, t18 = -1.09; p = 0.29) despite significantly improved depression scores (-19.4 ± 7.6; paired t-test, t18 = 11.2, p < 0.001). Age (r = 0.24), and depression scores (r = -0.21) correlated significantly (p < 0.05) with MEQ scores; associations with sex and hospitalization duration were nonsignificant. The present study and literature findings revealed that the frequency of evening types is not clearly elevated in depression, but morning types are less frequent compared to healthy samples (p < 0.001). Morningness-eveningness scores were normally distributed and stable in depressive inpatients. In line with previous findings, but contrary to theoretical assumptions, evening types were not overrepresented in depressed patients. Additionally, relatively less morning types and more intermediate types were found in depressed patients. Future studies should focus on transitions from morning to intermediate types as a tentative risk or correlate of emerging depression.
Cid, Jaime A; von Davier, Alina A
2015-05-01
Test equating is a method of making the test scores from different test forms of the same assessment comparable. In the equating process, an important step involves continuizing the discrete score distributions. In traditional observed-score equating, this step is achieved using linear interpolation (or an unscaled uniform kernel). In the kernel equating (KE) process, this continuization process involves Gaussian kernel smoothing. It has been suggested that the choice of bandwidth in kernel smoothing controls the trade-off between variance and bias. In the literature on estimating density functions using kernels, it has also been suggested that the weight of the kernel depends on the sample size, and therefore, the resulting continuous distribution exhibits bias at the endpoints, where the samples are usually smaller. The purpose of this article is (a) to explore the potential effects of atypical scores (spikes) at the extreme ends (high and low) on the KE method in distributions with different degrees of asymmetry using the randomly equivalent groups equating design (Study I), and (b) to introduce the Epanechnikov and adaptive kernels as potential alternative approaches to reducing boundary bias in smoothing (Study II). The beta-binomial model is used to simulate observed scores reflecting a range of different skewed shapes.
ERIC Educational Resources Information Center
Boudreaux, Wilbert
2011-01-01
Educational stakeholders are aware that school administration has become an incredibly intricate dynamic that is too complex for principals to handle alone. Test-driven accountability has made the already daunting task of school administration even more challenging. Distributed leadership presents an opportunity to explore increased leadership…
A Bayesian Nonparametric Approach to Test Equating
ERIC Educational Resources Information Center
Karabatsos, George; Walker, Stephen G.
2009-01-01
A Bayesian nonparametric model is introduced for score equating. It is applicable to all major equating designs, and has advantages over previous equating models. Unlike the previous models, the Bayesian model accounts for positive dependence between distributions of scores from two tests. The Bayesian model and the previous equating models are…
ERIC Educational Resources Information Center
Zu, Jiyun; Yuan, Ke-Hai
2012-01-01
In the nonequivalent groups with anchor test (NEAT) design, the standard error of linear observed-score equating is commonly estimated by an estimator derived assuming multivariate normality. However, real data are seldom normally distributed, causing this normal estimator to be inconsistent. A general estimator, which does not rely on the…
Marchick, Michael R; Setteducato, Michael L; Revenis, Jesse J; Robinson, Matthew A; Weeks, Emily C; Payton, Thomas F; Winchester, David E; Allen, Brandon R
2017-09-01
The History, Electrocardiography, Age, Risk factors, Troponin (HEART) score enables rapid risk stratification of emergency department patients presenting with chest pain. However, the subjectivity in scoring introduced by the history component has been criticized by some clinicians. We examined the association of 3 objective scoring models with the results of noninvasive cardiac testing. Medical records for all patients evaluated in the chest pain center of an academic medical center during a 1-year period were reviewed retrospectively. Each patient's history component score was calculated using 3 models developed by the authors. Differences in the distribution of HEART scores for each model, as well as their degree of agreement with one another, as well as the results of cardiac testing were analyzed. Seven hundred forty nine patients were studied, 58 of which had an abnormal stress test or computed tomography coronary angiography. The mean HEART scores for models 1, 2, and 3 were 2.97 (SD 1.17), 2.57 (SD 1.25), and 3.30 (SD 1.35), respectively, and were significantly different (P < 0.001). However, for each model, the likelihood of an abnormal cardiovascular test did not correlate with higher scores on the symptom component of the HEART score (P = 0.09, 0.41, and 0.86, respectively). While the objective scoring models produced different distributions of HEART scores, no model performed well with regards to identifying patients with abnormal advanced cardiac studies in this relatively low-risk cohort. Further studies in a broader cohort of patients, as well as comparison with the performance of subjective history scoring, is warranted before adoption of any of these objective models.
Is Coefficient Alpha Robust to Non-Normal Data?
Sheng, Yanyan; Sheng, Zhaohui
2011-01-01
Coefficient alpha has been a widely used measure by which internal consistency reliability is assessed. In addition to essential tau-equivalence and uncorrelated errors, normality has been noted as another important assumption for alpha. Earlier work on evaluating this assumption considered either exclusively non-normal error score distributions, or limited conditions. In view of this and the availability of advanced methods for generating univariate non-normal data, Monte Carlo simulations were conducted to show that non-normal distributions for true or error scores do create problems for using alpha to estimate the internal consistency reliability. The sample coefficient alpha is affected by leptokurtic true score distributions, or skewed and/or kurtotic error score distributions. Increased sample sizes, not test lengths, help improve the accuracy, bias, or precision of using it with non-normal data. PMID:22363306
NASA Astrophysics Data System (ADS)
Kartono; Suryadi, D.; Herman, T.
2018-01-01
This study aimed to analyze the enhancement of non-linear learning (NLL) in the online tutorial (OT) content to students’ knowledge of normal distribution application (KONDA). KONDA is a competence expected to be achieved after students studied the topic of normal distribution application in the course named Education Statistics. The analysis was performed by quasi-experiment study design. The subject of the study was divided into an experimental class that was given OT content in NLL model and a control class which was given OT content in conventional learning (CL) model. Data used in this study were the results of online objective tests to measure students’ statistical prior knowledge (SPK) and students’ pre- and post-test of KONDA. The statistical analysis test of a gain score of KONDA of students who had low and moderate SPK’s scores showed students’ KONDA who learn OT content with NLL model was better than students’ KONDA who learn OT content with CL model. Meanwhile, for students who had high SPK’s scores, the gain score of students who learn OT content with NLL model had relatively similar with the gain score of students who learn OT content with CL model. Based on those findings it could be concluded that the NLL model applied to OT content could enhance KONDA of students in low and moderate SPK’s levels. Extra and more challenging didactical situation was needed for students in high SPK’s level to achieve the significant gain score.
ERIC Educational Resources Information Center
Blagov, Pavel S.; Bi, Wu; Shedler, Jonathan; Westen, Drew
2012-01-01
The Shedler-Westen Assessment Procedure (SWAP) is a personality assessment instrument designed for use by expert clinical assessors. Critics have raised questions about its psychometrics, most notably its validity across observers and situations, the impact of its fixed score distribution on research findings, and its test-retest reliability. We…
Student Neighborhoods, Schools, and Test Score Growth: Evidence from Milwaukee, Wisconsin
ERIC Educational Resources Information Center
Carlson, Deven; Cowen, Joshua M.
2015-01-01
Schools and neighborhoods are thought to be two of the most important contextual influences on student academic outcomes. Drawing on a unique data set that permits simultaneous estimation of neighborhood and school contributions to student test score gains, we analyze the distributions of these contributions to consider the relative importance of…
Comparing Standard Deviation Effects across Contexts
ERIC Educational Resources Information Center
Ost, Ben; Gangopadhyaya, Anuj; Schiman, Jeffrey C.
2017-01-01
Studies using tests scores as the dependent variable often report point estimates in student standard deviation units. We note that a standard deviation is not a standard unit of measurement since the distribution of test scores can vary across contexts. As such, researchers should be cautious when interpreting differences in the numerical size of…
Cognitive and Noncognitive Improvements Among ChalleNGe Cadets: A Survey of Seven Sites
2016-06-01
Distribution unlimited Cognitive and Noncognitive Improvements Among ChalleNGe Cadets: A Survey of Seven Sites Lauren D. Malone and Jennifer R...completion and test score improvement. Using data on cadets’ scores on the Test of Adult Basic Education (TABE) and cadets’ responses to survey questions...total score. The sites also provided information on which cadets completed the program and cadets’ ages. In addition, we use data from a survey that
Trzepacz, Paula T; Hochstetler, Helen; Wang, Shufang; Walker, Brett; Saykin, Andrew J
2015-09-07
The Montreal Cognitive Assessment (MoCA) was developed to enable earlier detection of mild cognitive impairment (MCI) relative to familiar multi-domain tests like the Mini-Mental State Exam (MMSE). Clinicians need to better understand the relationship between MoCA and MMSE scores. For this cross-sectional study, we analyzed 219 healthy control (HC), 299 MCI, and 100 Alzheimer's disease (AD) dementia cases from the Alzheimer's Disease Neuroimaging Initiative (ADNI)-GO/2 database to evaluate MMSE and MoCA score distributions and select MoCA values to capture early and late MCI cases. Stepwise variable selection in logistic regression evaluated relative value of four test domains for separating MCI from HC. Functional Activities Questionnaire (FAQ) was evaluated as a strategy to separate dementia from MCI. Equi-percentile equating produced a translation grid for MoCA against MMSE scores. Receiver Operating Characteristic (ROC) analyses evaluated lower cutoff scores for capturing the most MCI cases. Most dementia cases scored abnormally, while MCI and HC score distributions overlapped on each test. Most MCI cases scored ≥ 17 on MoCA (96.3%) and ≥ 24 on MMSE (98.3%). The ceiling effect (28-30 points) for MCI and HC was less using MoCA (18.1%) versus MMSE (71.4%). MoCA and MMSE scores correlated most for dementia (r = 0.86; versus MCI r = 0.60; HC r = 0.43). Equi-percentile equating showed a MoCA score of 18 was equivalent to MMSE of 24. ROC analysis found MoCA ≥ 17 as the cutoff between MCI and dementia that emphasized high sensitivity (92.3%) to capture MCI cases. The core and orientation domains in both tests best distinguished HC from MCI groups, whereas comprehension/executive function and attention/calculation were not helpful. Mean FAQ scores were significantly higher and a greater proportion had abnormal FAQ scores in dementia than MCI and HC. MoCA and MMSE were more similar for dementia cases, but MoCA distributes MCI cases across a broader score range with less ceiling effect. A cutoff of ≥ 17 on the MoCA may help capture early and late MCI cases; depending on the level of sensitivity desired, ≥ 18 or 19 could be used. Functional assessment can help exclude dementia cases. MoCA scores are translatable to the MMSE to facilitate comparison.
Rank score and permutation testing alternatives for regression quantile estimates
Cade, B.S.; Richards, J.D.; Mielke, P.W.
2006-01-01
Performance of quantile rank score tests used for hypothesis testing and constructing confidence intervals for linear quantile regression estimates (0 ≤ τ ≤ 1) were evaluated by simulation for models with p = 2 and 6 predictors, moderate collinearity among predictors, homogeneous and hetero-geneous errors, small to moderate samples (n = 20–300), and central to upper quantiles (0.50–0.99). Test statistics evaluated were the conventional quantile rank score T statistic distributed as χ2 random variable with q degrees of freedom (where q parameters are constrained by H 0:) and an F statistic with its sampling distribution approximated by permutation. The permutation F-test maintained better Type I errors than the T-test for homogeneous error models with smaller n and more extreme quantiles τ. An F distributional approximation of the F statistic provided some improvements in Type I errors over the T-test for models with > 2 parameters, smaller n, and more extreme quantiles but not as much improvement as the permutation approximation. Both rank score tests required weighting to maintain correct Type I errors when heterogeneity under the alternative model increased to 5 standard deviations across the domain of X. A double permutation procedure was developed to provide valid Type I errors for the permutation F-test when null models were forced through the origin. Power was similar for conditions where both T- and F-tests maintained correct Type I errors but the F-test provided some power at smaller n and extreme quantiles when the T-test had no power because of excessively conservative Type I errors. When the double permutation scheme was required for the permutation F-test to maintain valid Type I errors, power was less than for the T-test with decreasing sample size and increasing quantiles. Confidence intervals on parameters and tolerance intervals for future predictions were constructed based on test inversion for an example application relating trout densities to stream channel width:depth.
An Analysis of Test Equating Models for the Alabama High School Graduation Examination.
ERIC Educational Resources Information Center
Glowacki, Margaret L.
The purpose of this study was to determine which equating models are appropriate for the Alabama High School Graduation Examination (AHSGE) by equating two previously administered fall forms for each subject area of the AHSGE and determining whether differences exist in the test score distributions or passing scores resulting from the equating…
ERIC Educational Resources Information Center
Haile, Getinet Astatike; Nguyen, Anh Ngoc
2008-01-01
We investigate the determinants of high school students' academic attainment in mathematics, reading and science in the United States; focusing particularly on possible differential impacts of ethnicity and family background across the distribution of test scores. Using data from the NELS2000 and employing quantile regression, we find two…
Robust Confidence Interval for a Ratio of Standard Deviations
ERIC Educational Resources Information Center
Bonett, Douglas G.
2006-01-01
Comparing variability of test scores across alternate forms, test conditions, or subpopulations is a fundamental problem in psychometrics. A confidence interval for a ratio of standard deviations is proposed that performs as well as the classic method with normal distributions and performs dramatically better with nonnormal distributions. A simple…
Impaired consciousness in partial seizures is bimodally distributed
Cunningham, Courtney; Chen, William C.; Shorten, Andrew; McClurkin, Michael; Choezom, Tenzin; Schmidt, Christian P.; Chu, Victoria; Bozik, Anne; Best, Cameron; Chapman, Melissa; Furman, Moran; Detyniecki, Kamil; Giacino, Joseph T.
2014-01-01
Objective: To investigate whether impaired consciousness in partial seizures can usually be attributed to specific deficits in the content of consciousness or to a more general decrease in the overall level of consciousness. Methods: Prospective testing during partial seizures was performed in patients with epilepsy using the Responsiveness in Epilepsy Scale (n = 83 partial seizures, 30 patients). Results were compared with responsiveness scores in a cohort of patients with severe traumatic brain injury evaluated with the JFK Coma Recovery Scale–Revised (n = 552 test administrations, 184 patients). Results: Standardized testing during partial seizures reveals a bimodal scoring distribution, such that most patients were either fully impaired or relatively spared in their ability to respond on multiple cognitive tests. Seizures with impaired performance on initial test items remained consistently impaired on subsequent items, while other seizures showed spared performance throughout. In the comparison group, we found that scores of patients with brain injury were more evenly distributed across the full range in severity of impairment. Conclusions: Partial seizures can often be cleanly separated into those with vs without overall impaired responsiveness. Results from similar testing in a comparison group of patients with brain injury suggest that the bimodal nature of Responsiveness in Epilepsy Scale scores is not a result of scale bias but may be a finding unique to partial seizures. These findings support a model in which seizures either propagate or do not propagate to key structures that regulate overall arousal and thalamocortical function. Future investigations are needed to relate these behavioral findings to the physiology underlying impaired consciousness in partial seizures. PMID:24727311
Impaired consciousness in partial seizures is bimodally distributed.
Cunningham, Courtney; Chen, William C; Shorten, Andrew; McClurkin, Michael; Choezom, Tenzin; Schmidt, Christian P; Chu, Victoria; Bozik, Anne; Best, Cameron; Chapman, Melissa; Furman, Moran; Detyniecki, Kamil; Giacino, Joseph T; Blumenfeld, Hal
2014-05-13
To investigate whether impaired consciousness in partial seizures can usually be attributed to specific deficits in the content of consciousness or to a more general decrease in the overall level of consciousness. Prospective testing during partial seizures was performed in patients with epilepsy using the Responsiveness in Epilepsy Scale (n = 83 partial seizures, 30 patients). Results were compared with responsiveness scores in a cohort of patients with severe traumatic brain injury evaluated with the JFK Coma Recovery Scale-Revised (n = 552 test administrations, 184 patients). Standardized testing during partial seizures reveals a bimodal scoring distribution, such that most patients were either fully impaired or relatively spared in their ability to respond on multiple cognitive tests. Seizures with impaired performance on initial test items remained consistently impaired on subsequent items, while other seizures showed spared performance throughout. In the comparison group, we found that scores of patients with brain injury were more evenly distributed across the full range in severity of impairment. Partial seizures can often be cleanly separated into those with vs without overall impaired responsiveness. Results from similar testing in a comparison group of patients with brain injury suggest that the bimodal nature of Responsiveness in Epilepsy Scale scores is not a result of scale bias but may be a finding unique to partial seizures. These findings support a model in which seizures either propagate or do not propagate to key structures that regulate overall arousal and thalamocortical function. Future investigations are needed to relate these behavioral findings to the physiology underlying impaired consciousness in partial seizures.
NASA Astrophysics Data System (ADS)
da Silva, Roberto; Lamb, Luis C.; Barbosa, Marcia C.
2016-09-01
We analyze the scores obtained by students who have taken the ENEM examination, The Brazilian High School National Examination which is used in the admission process at Brazilian universities. The average high schools scores from different disciplines are compared through the Pearson correlation coefficient. The results show a very large correlation between the performance in the different school subjects. Even though the students' scores in the ENEM form a Gaussian due to the standardization, we show that the high schools' scores form a bimodal distribution that cannot be used to evaluate and compare students performance over time. We also show that this high schools distribution reflects the correlation between school performance and the economic level (based on the average family income) of the students. The ENEM scores are compared with a Brazilian non standardized exam, the entrance examination from the Universidade Federal do Rio Grande do Sul. The analysis of the performance of the same individuals in both tests shows that the two tests not only select different abilities, but also lead to the admission of different sets of individuals. Our results indicate that standardized tests might be an interesting tool to compare performance of individuals over the years, but not of institutions.
Comparing Latent Distributions.
ERIC Educational Resources Information Center
Andersen, Erling B.
1980-01-01
The problem of comparing the latent abilities of groups of individuals (as opposed to their observable test scores) is considered. Tests of equality of means, variances, and longitudinal applications are discussed. (JKS)
Continuous equilibrium scores: factoring in the time before a fall.
Wood, Scott J; Reschke, Millard F; Owen Black, F
2012-07-01
The equilibrium (EQ) score commonly used in computerized dynamic posturography is normalized between 0 and 100, with falls assigned a score of 0. The resulting mixed discrete-continuous distribution limits certain statistical analyses and treats all trials with falls equally. We propose a simple modification of the formula in which peak-to-peak sway data from trials with falls is scaled according the percent of the trial completed to derive a continuous equilibrium (cEQ) score. The cEQ scores for trials without falls remain unchanged from the original methodology. The cEQ factors in the time before a fall and results in a continuous variable retaining the central tendencies of the original EQ distribution. A random set of 5315 Sensory Organization Test trials were pooled that included 81 falls. A comparison of the original and cEQ distributions and their rank ordering demonstrated that trials with falls continue to constitute the lower range of scores with the cEQ methodology. The area under the receiver operating characteristic curve (0.997) demonstrates that the cEQ retained near-perfect discrimination between trials with and without falls. We conclude that the cEQ score provides the ability to discriminate between ballistic falls from falls that occur later in the trial. This approach of incorporating time and sway magnitude can be easily extended to enhance other balance tests that include fall data or incomplete trials. Copyright © 2012 Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Ho, Andrew D.; Yu, Carol C.
2015-01-01
Many statistical analyses benefit from the assumption that unconditional or conditional distributions are continuous and normal. More than 50 years ago in this journal, Lord and Cook chronicled departures from normality in educational tests, and Micerri similarly showed that the normality assumption is met rarely in educational and psychological…
An Investigation of the Sampling Distributions of Equating Coefficients.
ERIC Educational Resources Information Center
Baker, Frank B.
1996-01-01
Using the characteristic curve method for dichotomously scored test items, the sampling distributions of equating coefficients were examined. Simulations indicate that for the equating conditions studied, the sampling distributions of the equating coefficients appear to have acceptable characteristics, suggesting confidence in the values obtained…
Ebrahimi-Madiseh, Azadeh; Eikelboom, Robert H; Jayakody, Dona Mp; Atlas, Marcus D
2016-01-01
To evaluate the clinical utility of the City University of New York sentence test in a cohort of post-lingually deafened cochlear implants recipients over time. 117 post-lingually deafened, Australian English-speaking CI recipients aged between 23 and 98 years (M = 66 years; SD = 15.09) were recruited. CUNY sentence test scores in quiet were collated and analysed at two cut-offs, 95% and 100%, as ceiling scores. CUNY sentence scores ranged from 4% to 100% (M = 86.75; SD = 20.65), with 38.8% of participants scoring 95% and 16.5% of participants reaching the 100% scores. The percentage of participants reaching the 95% and 100% ceiling scores increased over time (6 and 12 months post-implantation). The distribution of all post-operative CUNY test scores skewed to the right with 82% of test scores reaching above 90%. This study demonstrates that the CUNY test cannot be used as a valid tool to measure the speech perception skills of post-lingually deafened CI recipients over time. This may be overcome by using adaptive test protocols or linguistically, cognitively or contextually demanding test materials. The high percentage of CI recipients achieving ceiling scores for the CUNY sentence test in quiet at 3 months post-implantation, questions the validity of using CUNY in CI assessment test battery and limits its application for use in longitudinal studies evaluating CI outcomes. Further studies are required to examine different methods to overcome this problem.
ERIC Educational Resources Information Center
Culpepper, Steven Andrew
2013-01-01
A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…
Curricular Policy as a Collective Effects Problem: A Distributional Approach
Penner, Andrew M.; Domina, Thurston; Penner, Emily K.; Conley, AnneMarie
2015-01-01
Current educational policies in the United States attempt to boost student achievement and promote equality by intensifying the curriculum and exposing students to more advanced coursework. This paper investigates the relationship between one such effort -- California's push to enroll all 8th grade students in Algebra -- and the distribution of student achievement. We suggest that this effort is an instance of a “collective effects” problem, where the population-level effects of a policy are different from its effects at the individual level. In such contexts, we argue that it is important to consider broader population effects as well as the difference between “treated” and “untreated” individuals. To do so, we present differences in inverse propensity score weighted distributions to investigate how this curricular policy changed the distribution of student achievement more broadly. We find that California's attempt to intensify the curriculum did not raise test scores at the bottom of the distribution, but did lower scores at the top of the distribution. These results highlight the efficacy of inverse propensity score weighting approaches for estimating collective effects, and provide a cautionary tale for curricular intensification efforts and other policies with collective effects. PMID:26004485
The Impact of Measurement Error on the Accuracy of Individual and Aggregate SGP
ERIC Educational Resources Information Center
McCaffrey, Daniel F.; Castellano, Katherine E.; Lockwood, J. R.
2015-01-01
Student growth percentiles (SGPs) express students' current observed scores as percentile ranks in the distribution of scores among students with the same prior-year scores. A common concern about SGPs at the student level, and mean or median SGPs (MGPs) at the aggregate level, is potential bias due to test measurement error (ME). Shang,…
The Many Null Distributions of Person Fit Indices.
ERIC Educational Resources Information Center
Molenaar, Ivo W.; Hoijtink, Herbert
1990-01-01
Statistical properties of person fit indices are reviewed as indicators of the extent to which a person's score pattern is in agreement with a measurement model. Distribution of a fit index and ability-free fit evaluation are discussed. The null distribution was simulated for a test of 20 items. (SLD)
Estimating the Parameters of the Beta-Binomial Distribution.
ERIC Educational Resources Information Center
Wilcox, Rand R.
1979-01-01
For some situations the beta-binomial distribution might be used to describe the marginal distribution of test scores for a particular population of examinees. Several different methods of approximating the maximum likelihood estimate were investigated, and it was found that the Newton-Raphson method should be used when it yields admissable…
Sargin, Mehmet Akif; Yassa, Murat; Taymur, Bilge Dogan; Taymur, Bulent; Akca, Gizem; Tug, Niyazi
2017-04-01
To compare the status of female sexual dysfunction (FSD) between women with a history of previous gestational diabetes mellitus (GDM) and those with follow-up of a healthy pregnancy, using the female sexual function index (FSFI) questionnaire. Cross-sectional study. Department of Obstetrics and Gynecology, Fatih Sultan Mehmet Training and Research Hospital, Istanbul, Turkey, from September to December 2015. Healthy sexually active adult parous females were included. Participants were asked to complete the validated Turkish versions of the FSFI and Hospital Anxiety and Depression Scale (HADS) questionnaires. Student's t-test was used for two-group comparisons of normally distributed variables and quantitative data. Mann-Whitney U-test was used for two-group comparisons of non-normally distributed variables. Pearson's chi-squared test, the Fisher-FreemanHalton test, Fisher's exact test, and Yates' continuity correction test were used for comparison of qualitative data. The mean FSFI scores of the 179 participants was 23.50 ±3.94. FSFI scores and scores of desire, arousal, lubrication, orgasm, satisfaction, and pain were not statistically significantly different (p>0.05), according to a history of GDM and types of FSD (none, mild, severe). HADS scores and anxiety and depression types did not statistically significantly differ according to the history of GDM (p>0.05). An association could not be found in FSFI scores between participants with both the history of previous GDM and with healthy pregnancy; subclinical sexual dysfunction may be observed in the late postpartum period among women with a history of previous GDM. This may adversely affect their sexual health.
Fukui, Yuriko; Noda, Saeko; Okada, Midori; Mihara, Nakako; Kawakami, Yoriko; Bore, Miles; Munro, Don; Powis, David
2014-01-01
The Personal Qualities Assessment (PQA), developed by the University of Newcastle, Australia to assess the aptitude of future medical professionals, has been used in Western countries. The objective was to investigate whether the PQA is appropriate for Japanese medical school applicants. Two of the PQA tests, Libertarian-Dual-Communitarian moral orientations (Mojac) and Narcissism, Aloofness, Confidence, and Empathy (NACE), were translated into Japanese, and administered at the Tokyo Women's Medical University entrance examinations from 2007 to 2009. The distributions of the applicants' Mojac and NACE scores were close to the normal distribution, and the mean scores did not exhibit a large difference from those in Western countries. The only significant difference was that the mean score of the NACE test was slightly lower than the Western norm. The translated PQA tests may be appropriate for use with Japanese applicants, though further research considering cultural differences is required.
1985-04-01
EM 32 12 MICROCOP REOUTO TETCHR NTOA B URA FSA4ARS16- AFHRL-TR-84-64 9 AIR FORCE 6 __ H EQUIPERCENTILE TEST EQUATING: THE EFFECTS OF PRESMOOTHING AND...combined or compound presmoother and a presmoothing method based on a particular model of test scores. Of the seven methods of presmoothing the score...unsmoothed distributions, the smoothing of that sequence of differences by the same compound method, and, finally, adding the smoothed differences back
Guattery, Jason M; Dardas, Agnes Z; Kelly, Michael; Chamberlain, Aaron; McAndrew, Christopher; Calfee, Ryan P
2018-04-01
The Patient Reported Outcomes Measurement Information System (PROMIS) was developed to provide valid, reliable, and standardized measures to gather patient-reported outcomes for many health domains, including depression, independent of patient condition. Most studies confirming the performance of these measures were conducted with a consented, volunteer study population for testing. Using a study population that has undergone the process of informed consent may be differentiated from the validation group because they are educated specifically as to the purpose of the questions and they will not have answers recorded in their permanent health record. (1) When given as part of routine practice to an orthopaedic population, do PROMIS Physical Function and Depression item banks produce score distributions different than those produced by the populations used to calibrate and validate the item banks? (2) Does the presence of a nonnormal distribution in the PROMIS Depression scores in a clinical population reflect a deliberately hasty answering of questions by patients? (3) Are patients who are reporting minimal depressive symptoms by scoring the minimum score on the PROMIS Depression Computer Adaptive Testing (CAT) distinct from other patients according to demographic data or their scores on other PROMIS assessments? Univariate descriptive statistics and graphic histograms were used to describe the frequency distribution of scores for the Physical Function and Depression item banks for all orthopaedic patients 18 years or older who had an outpatient visit between June 2015 and December 2016. The study population was then broken into two groups based on whether they indicated a lack of depressive symptoms and scored the minimum score (34.2) on the Depression CAT assessment (Floor Group) or not (Standard Group). The distribution of Physical Function CAT scores was compared between the two groups. Finally, a time-per-question value was calculated for both the Physical Function and Depression CATs and was compared between assessments within each group as well as between the two groups. Bivariate statistics compared the demographic data between the two groups. Physical Function CAT scores in musculoskeletal patients were normally distributed like the distribution calibration population; however, the score distribution of the Depression CAT in musculoskeletal patients was nonnormal with a spike in the floor score. After excluding the floor spike, the distribution of the Depression CAT scores was not different from the population control group. Patients who scored the floor score on the Depression CAT took slightly less time per question for Physical Function CAT when compared with other musculoskeletal patients (floor patients: 11 ± 9 seconds; normally distributed patients: 12 ± 10 seconds; mean difference: 1 second [0.8-1.1]; p < 0.001 but not clinically relevant). They spent a substantially shorter amount of time per question on the Depression CAT (Floor Group: 4 ± 3 seconds; Standard Group: 7 ± 7 seconds; mean difference: 3 [2.9-3.2]; p < 0.001). Patients who scored the minimum score on the PROMIS Depression CAT were younger than other patients (Floor Group: 50 ± 18 SD; Standard Group: 55 ± 16 SD; mean difference: 4.5 [4.2-4.7]; p < 0.001) with a larger percentage of men (Floor Group: 48.8%; Standard Group 40.0%; odds ratio 0.6 [0.6-0.7]; p < 0.001) and minor differences in racial breakdown (Floor Group: white 85.2%, black 11.9%, other 0.03%; Standard Group: white 83.9%, black 13.7%, other 0.02%). In an orthopaedic surgery population that is given PROMIS CAT as part of routine practice, the Physical Function item bank had a normal performance, but there is a group of patients who hastily complete Depression questions producing a strong floor effect and calling into question the validity of those floor scores that indicate minimal depression. Level II, diagnostic study.
Robust LOD scores for variance component-based linkage analysis.
Blangero, J; Williams, J T; Almasy, L
2000-01-01
The variance component method is now widely used for linkage analysis of quantitative traits. Although this approach offers many advantages, the importance of the underlying assumption of multivariate normality of the trait distribution within pedigrees has not been studied extensively. Simulation studies have shown that traits with leptokurtic distributions yield linkage test statistics that exhibit excessive Type I error when analyzed naively. We derive analytical formulae relating the deviation from the expected asymptotic distribution of the lod score to the kurtosis and total heritability of the quantitative trait. A simple correction constant yields a robust lod score for any deviation from normality and for any pedigree structure, and effectively eliminates the problem of inflated Type I error due to misspecification of the underlying probability model in variance component-based linkage analysis.
Music therapy career aptitude test.
Lim, Hayoung A
2011-01-01
The purpose of the Music Therapy Career Aptitude Test (MTCAT) was to measure the affective domain of music therapy students including their self-awareness as it relates to the music therapy career, value in human development, interest in general therapy, and aptitude for being a professional music therapist. The MTCAT was administered to 113 music therapy students who are currently freshman or sophomores in an undergraduate music therapy program or in the first year of a music therapy master's equivalency program. The results of analysis indicated that the MTCAT is normally distributed and that all 20 questions are significantly correlated with the total test score of the MTCAT. The reliability of the MTCAT was considerably high (Cronbach's Coefficient Alpha=0.8). The criterion-related validity was examined by comparing the MTCAT scores of music therapy students with the scores of 43 professional music therapists. The correlation between the scores of students and professionals was found to be statistically significant. The results suggests that normal distribution, internal consistency, homogeneity of construct, item discrimination, correlation analysis, content validity, and criterion-related validity in the MTCAT may be helpful in predicting music therapy career aptitude and may aid in the career decision making process of college music therapy students.
A sup-score test for the cure fraction in mixture models for long-term survivors.
Hsu, Wei-Wen; Todem, David; Kim, KyungMann
2016-12-01
The evaluation of cure fractions in oncology research under the well known cure rate model has attracted considerable attention in the literature, but most of the existing testing procedures have relied on restrictive assumptions. A common assumption has been to restrict the cure fraction to a constant under alternatives to homogeneity, thereby neglecting any information from covariates. This article extends the literature by developing a score-based statistic that incorporates covariate information to detect cure fractions, with the existing testing procedure serving as a special case. A complication of this extension, however, is that the implied hypotheses are not typical and standard regularity conditions to conduct the test may not even hold. Using empirical processes arguments, we construct a sup-score test statistic for cure fractions and establish its limiting null distribution as a functional of mixtures of chi-square processes. In practice, we suggest a simple resampling procedure to approximate this limiting distribution. Our simulation results show that the proposed test can greatly improve efficiency over tests that neglect the heterogeneity of the cure fraction under the alternative. The practical utility of the methodology is illustrated using ovarian cancer survival data with long-term follow-up from the surveillance, epidemiology, and end results registry. © 2016, The International Biometric Society.
ERIC Educational Resources Information Center
Jensen, Arthur R.
The first eight chapters of this book introduce the topic of test bias. The basic issues involved in criticisms of mental tests and arguments about test bias include: (1) variety of tests and test items; (2) scaling of scores and the form of the distribution of abilities in the population; (3) quantification of subpopulation differences; (4)…
Standardized UXO Technology Demonstration Site Scoring Record No. 945
2017-07-01
DISTRIBUTION LIST ATEC Project No. 2011-DT-ATC-DODSP-F0292 Note: A copy of this test report has been posted to the Versatile Information Systems...Directorate July 2017 Report Produced by: U.S. Army Aberdeen Test Center Aberdeen Proving Ground, MD 21005-5059 Report Produced for: Strategic...U.S. Army Test and Evaluation Command Aberdeen Proving Ground, MD 21005-5001 Distribution Unlimited, July 2017. The use of a trade name or the
ERIC Educational Resources Information Center
Baker, Frank B.
1997-01-01
Examined the sampling distributions of equating coefficients produced by the characteristic curve method for tests using graded and nominal response scoring using simulated data. For both models and across all three equating situations, the sampling distributions were generally bell-shaped and peaked, and occasionally had a small degree of…
Contrasting OLS and Quantile Regression Approaches to Student "Growth" Percentiles
ERIC Educational Resources Information Center
Castellano, Katherine Elizabeth; Ho, Andrew Dean
2013-01-01
Regression methods can locate student test scores in a conditional distribution, given past scores. This article contrasts and clarifies two approaches to describing these locations in terms of readily interpretable percentile ranks or "conditional status percentile ranks." The first is Betebenner's quantile regression approach that results in…
Refinement of Scoring Procedures for the Basic Attributes Test (BAT) Battery
1993-03-01
see Carretta, 1991). Research on the BAT summary scores has shown that some of them (a) are significantly positively skewed and platykurtic , (b) contain...for positively skewed and platykurtic data distributions, and those that were applied here to the BAT data, are the square-root and natural logarithm
Middleware Trade Study for NASA Domain
NASA Technical Reports Server (NTRS)
Bowman, Dan
2007-01-01
This presentation presents preliminary results of a trade study designed to assess three distributed simulation middleware technologies for support of the NASA Constellation Distributed Space Exploration Simulation (DSES) project and Test and Verification Distributed System Integration Laboratory (DSIL). The technologies are: the High Level Architecture (HLA), the Test and Training Enabling Architecture (TENA), and an XML-based variant of Distributed Interactive Simulation (DIS-XML) coupled with the Extensible Messaging and Presence Protocol (XMPP). According to the criteria and weights determined in this study, HLA scores better than the other two for DSES as well as the DSIL
NASA Constellation Distributed Simulation Middleware Trade Study
NASA Technical Reports Server (NTRS)
Hasan, David; Bowman, James D.; Fisher, Nancy; Cutts, Dannie; Cures, Edwin Z.
2008-01-01
This paper presents the results of a trade study designed to assess three distributed simulation middleware technologies for support of the NASA Constellation Distributed Space Exploration Simulation (DSES) project and Test and Verification Distributed System Integration Laboratory (DSIL). The technologies are the High Level Architecture (HLA), the Test and Training Enabling Architecture (TENA), and an XML-based variant of Distributed Interactive Simulation (DIS-XML) coupled with the Extensible Messaging and Presence Protocol (XMPP). According to the criteria and weights determined in this study, HLA scores better than the other two for DSES as well as the DSIL.
Evaluation and validity of a LORETA normative EEG database.
Thatcher, R W; North, D; Biver, C
2005-04-01
To evaluate the reliability and validity of a Z-score normative EEG database for Low Resolution Electromagnetic Tomography (LORETA), EEG digital samples (2 second intervals sampled 128 Hz, 1 to 2 minutes eyes closed) were acquired from 106 normal subjects, and the cross-spectrum was computed and multiplied by the Key Institute's LORETA 2,394 gray matter pixel T Matrix. After a log10 transform or a Box-Cox transform the mean and standard deviation of the *.lor files were computed for each of the 2394 gray matter pixels, from 1 to 30 Hz, for each of the subjects. Tests of Gaussianity were computed in order to best approximate a normal distribution for each frequency and gray matter pixel. The relative sensitivity of a Z-score database was computed by measuring the approximation to a Gaussian distribution. The validity of the LORETA normative database was evaluated by the degree to which confirmed brain pathologies were localized using the LORETA normative database. Log10 and Box-Cox transforms approximated Gaussian distribution in the range of 95.64% to 99.75% accuracy. The percentage of normative Z-score values at 2 standard deviations ranged from 1.21% to 3.54%, and the percentage of Z-scores at 3 standard deviations ranged from 0% to 0.83%. Left temporal lobe epilepsy, right sensory motor hematoma and a right hemisphere stroke exhibited maximum Z-score deviations in the same locations as the pathologies. We conclude: (1) Adequate approximation to a Gaussian distribution can be achieved using LORETA by using a log10 transform or a Box-Cox transform and parametric statistics, (2) a Z-Score normative database is valid with adequate sensitivity when using LORETA, and (3) the Z-score LORETA normative database also consistently localized known pathologies to the expected Brodmann areas as an hypothesis test based on the surface EEG before computing LORETA.
Use of the binomial distribution to predict impairment: application in a nonclinical sample.
Axelrod, Bradley N; Wall, Jacqueline R; Estes, Bradley W
2008-01-01
A mathematical model based on the binomial theory was developed to illustrate when abnormal score variations occur by chance in a multitest battery (Ingraham & Aiken, 1996). It has been successfully used as a comparison for obtained test scores in clinical samples, but not in nonclinical samples. In the current study, this model has been applied to demographically corrected scores on the Halstead-Reitan Neuropsychological Test Battery, obtained from a sample of 94 nonclinical college students. Results found that 15% of the sample had impairments suggested by the Halstead Impairment Index, using criteria established by Reitan and Wolfson (1993). In addition, one-half of the sample obtained impaired scores on one or two tests. These results were compared to that predicted by the binomial model and found to be consistent. The model therefore serves as a useful resource for clinicians considering the probability of impaired test performance.
LD Score Regression Distinguishes Confounding from Polygenicity in Genome-Wide Association Studies
Bulik-Sullivan, Brendan K.; Loh, Po-Ru; Finucane, Hilary; Ripke, Stephan; Yang, Jian; Patterson, Nick; Daly, Mark J.; Price, Alkes L.; Neale, Benjamin M.
2015-01-01
Both polygenicity (i.e., many small genetic effects) and confounding biases, such as cryptic relatedness and population stratification, can yield an inflated distribution of test statistics in genome-wide association studies (GWAS). However, current methods cannot distinguish between inflation from true polygenic signal and bias. We have developed an approach, LD Score regression, that quantifies the contribution of each by examining the relationship between test statistics and linkage disequilibrium (LD). The LD Score regression intercept can be used to estimate a more powerful and accurate correction factor than genomic control. We find strong evidence that polygenicity accounts for the majority of test statistic inflation in many GWAS of large sample size. PMID:25642630
Vista, Alvin; Care, Esther
2011-06-01
Research on gender differences in intelligence has focused mostly on samples from Western countries and empirical evidence on gender differences from Southeast Asia is relatively sparse. This article presents results on gender differences in variance and means on a non-verbal intelligence test using a national sample of public school students from the Philippines. More than 2,700 sixth graders from public schools across the country were tested with the Naglieri Non-verbal Ability Test (NNAT). Variance ratios (VRs) and log-transformed VRs were computed. Proportion ratios for each of the ability levels were also calculated and a chi-square goodness-of-fit test was performed. An analysis of variance was performed to determine the overall gender difference in mean scores as well as within each of three age subgroups. Our data show non-existent or trivial gender difference in mean scores. However, the tails of the distributions show differences between the males and females, with greater variability among males in the upper half of the distribution and greater variability among females in the lower half of the distribution. Descriptions of the results and their implications are discussed. Results on mean score differences support the hypothesis that there are no significant gender differences in cognitive ability. The unusual results regarding differences in variance and the male-female proportion in the tails require more complex investigations. ©2010 The British Psychological Society.
Percentiles of the null distribution of 2 maximum lod score tests.
Ulgen, Ayse; Yoo, Yun Joo; Gordon, Derek; Finch, Stephen J; Mendell, Nancy R
2004-01-01
We here consider the null distribution of the maximum lod score (LOD-M) obtained upon maximizing over transmission model parameters (penetrance values, dominance, and allele frequency) as well as the recombination fraction. Also considered is the lod score maximized over a fixed choice of genetic model parameters and recombination-fraction values set prior to the analysis (MMLS) as proposed by Hodge et al. The objective is to fit parametric distributions to MMLS and LOD-M. Our results are based on 3,600 simulations of samples of n = 100 nuclear families ascertained for having one affected member and at least one other sibling available for linkage analysis. Each null distribution is approximately a mixture p(2)(0) + (1 - p)(2)(v). The values of MMLS appear to fit the mixture 0.20(2)(0) + 0.80chi(2)(1.6). The mixture distribution 0.13(2)(0) + 0.87chi(2)(2.8). appears to describe the null distribution of LOD-M. From these results we derive a simple method for obtaining critical values of LOD-M and MMLS. Copyright 2004 S. Karger AG, Basel
Chronic Stress and Neuropathology: Neurochemical, Molecular, and Genetic Factors
2005-08-01
as anxiety, overweight, or alcohol use disorders. The lines also may be useful for studying potential prophylactic or therapeutic treatments for such...scores were calculated for each rat according to the position of its ACTH response within the distribution of its gender and generation. Z- scores...the cage washed between tests. A reliable, treatment -naïve rater scored the emission (bouts) and duration (time) of the following behaviors from
Naro, Daniel; Rummel, Christian; Schindler, Kaspar; Andrzejak, Ralph G
2014-09-01
The rank-based nonlinear predictability score was recently introduced as a test for determinism in point processes. We here adapt this measure to time series sampled from time-continuous flows. We use noisy Lorenz signals to compare this approach against a classical amplitude-based nonlinear prediction error. Both measures show an almost identical robustness against Gaussian white noise. In contrast, when the amplitude distribution of the noise has a narrower central peak and heavier tails than the normal distribution, the rank-based nonlinear predictability score outperforms the amplitude-based nonlinear prediction error. For this type of noise, the nonlinear predictability score has a higher sensitivity for deterministic structure in noisy signals. It also yields a higher statistical power in a surrogate test of the null hypothesis of linear stochastic correlated signals. We show the high relevance of this improved performance in an application to electroencephalographic (EEG) recordings from epilepsy patients. Here the nonlinear predictability score again appears of higher sensitivity to nonrandomness. Importantly, it yields an improved contrast between signals recorded from brain areas where the first ictal EEG signal changes were detected (focal EEG signals) versus signals recorded from brain areas that were not involved at seizure onset (nonfocal EEG signals).
NASA Astrophysics Data System (ADS)
Naro, Daniel; Rummel, Christian; Schindler, Kaspar; Andrzejak, Ralph G.
2014-09-01
The rank-based nonlinear predictability score was recently introduced as a test for determinism in point processes. We here adapt this measure to time series sampled from time-continuous flows. We use noisy Lorenz signals to compare this approach against a classical amplitude-based nonlinear prediction error. Both measures show an almost identical robustness against Gaussian white noise. In contrast, when the amplitude distribution of the noise has a narrower central peak and heavier tails than the normal distribution, the rank-based nonlinear predictability score outperforms the amplitude-based nonlinear prediction error. For this type of noise, the nonlinear predictability score has a higher sensitivity for deterministic structure in noisy signals. It also yields a higher statistical power in a surrogate test of the null hypothesis of linear stochastic correlated signals. We show the high relevance of this improved performance in an application to electroencephalographic (EEG) recordings from epilepsy patients. Here the nonlinear predictability score again appears of higher sensitivity to nonrandomness. Importantly, it yields an improved contrast between signals recorded from brain areas where the first ictal EEG signal changes were detected (focal EEG signals) versus signals recorded from brain areas that were not involved at seizure onset (nonfocal EEG signals).
Basagni, Benedetta; Luzzatti, Claudio; Navarrete, Eduardo; Caputo, Marina; Scrocco, Gessica; Damora, Alessio; Giunchi, Laura; Gemignani, Paola; Caiazzo, Annarita; Gambini, Maria Grazia; Avesani, Renato; Mancuso, Mauro; Trojano, Luigi; De Tanti, Antonio
2017-04-01
Verbal reasoning is a complex, multicomponent function, which involves activation of functional processes and neural circuits distributed in both brain hemispheres. Thus, this ability is often impaired after brain injury. The aim of the present study is to describe the construction of a new verbal reasoning test (VRT) for patients with brain injury and to provide normative values in a sample of healthy Italian participants. Three hundred and eighty healthy Italian subjects (193 women and 187 men) of different ages (range 16-75 years) and educational level (primary school to postgraduate degree) underwent the VRT. VRT is composed of seven subtests, investigating seven different domains. Multiple linear regression analysis revealed a significant effect of age and education on the participants' performance in terms of both VRT total score and all seven subtest scores. No gender effect was found. A correction grid for raw scores was built from the linear equation derived from the scores. Inferential cut-off scores were estimated using a non-parametric technique, and equivalent scores were computed. We also provided a grid for the correction of results by z scores.
A weighted generalized score statistic for comparison of predictive values of diagnostic tests
Kosinski, Andrzej S.
2013-01-01
Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations which are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we present, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic which incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, it always reduces to the score statistic in the independent samples situation, and it preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the weighted generalized score test statistic in a general GEE setting. PMID:22912343
Lachin, John M
2011-11-10
The power of a chi-square test, and thus the required sample size, are a function of the noncentrality parameter that can be obtained as the limiting expectation of the test statistic under an alternative hypothesis specification. Herein, we apply this principle to derive simple expressions for two tests that are commonly applied to discrete ordinal data. The Wilcoxon rank sum test for the equality of distributions in two groups is algebraically equivalent to the Mann-Whitney test. The Kruskal-Wallis test applies to multiple groups. These tests are equivalent to a Cochran-Mantel-Haenszel mean score test using rank scores for a set of C-discrete categories. Although various authors have assessed the power function of the Wilcoxon and Mann-Whitney tests, herein it is shown that the power of these tests with discrete observations, that is, with tied ranks, is readily provided by the power function of the corresponding Cochran-Mantel-Haenszel mean scores test for two and R > 2 groups. These expressions yield results virtually identical to those derived previously for rank scores and also apply to other score functions. The Cochran-Armitage test for trend assesses whether there is an monotonically increasing or decreasing trend in the proportions with a positive outcome or response over the C-ordered categories of an ordinal independent variable, for example, dose. Herein, it is shown that the power of the test is a function of the slope of the response probabilities over the ordinal scores assigned to the groups that yields simple expressions for the power of the test. Copyright © 2011 John Wiley & Sons, Ltd.
Pohl, Steffi; Südkamp, Anna; Hardt, Katinka; Carstensen, Claus H.; Weinert, Sabine
2016-01-01
Assessing competencies of students with special educational needs in learning (SEN-L) poses a challenge for large-scale assessments (LSAs). For students with SEN-L, the available competence tests may fail to yield test scores of high psychometric quality, which are—at the same time—measurement invariant to test scores of general education students. We investigated whether we can identify a subgroup of students with SEN-L, for which measurement invariant competence measures of adequate psychometric quality may be obtained with tests available in LSAs. We furthermore investigated whether differences in test-taking behavior may explain dissatisfying psychometric properties and measurement non-invariance of test scores within LSAs. We relied on person fit indices and mixture distribution models to identify students with SEN-L for whom test scores with satisfactory psychometric properties and measurement invariance may be obtained. We also captured differences in test-taking behavior related to guessing and missing responses. As a result we identified a subgroup of students with SEN-L for whom competence scores of adequate psychometric quality that are measurement invariant to those of general education students were obtained. Concerning test taking behavior, there was a small number of students who unsystematically picked response options. Removing these students from the sample slightly improved item fit. Furthermore, two different patterns of missing responses were identified that explain to some extent problems in the assessments of students with SEN-L. PMID:26941665
An Analytical Evaluation of Two Common-Odds Ratios as Population Indicators of DIF.
ERIC Educational Resources Information Center
Pommerich, Mary; And Others
The Mantel-Haenszel (MH) statistic for identifying differential item functioning (DIF) commonly conditions on the observed test score as a surrogate for conditioning on latent ability. When the comparison group distributions are not completely overlapping (i.e., are incongruent), the observed score represents different levels of latent ability…
Reliability of provocative tests of motion sickness susceptibility
NASA Technical Reports Server (NTRS)
Calkins, D. S.; Reschke, M. F.; Kennedy, R. S.; Dunlop, W. P.
1987-01-01
Test-retest reliability values were derived from motion sickness susceptibility scores obtained from two successive exposures to each of three tests: (1) Coriolis sickness sensitivity test; (2) staircase velocity movement test; and (3) parabolic flight static chair test. The reliability of the three tests ranged from 0.70 to 0.88. Normalizing values from predictors with skewed distributions improved the reliability.
ERIC Educational Resources Information Center
Resing, Wilma C. M.; Tunteler, Erika
2007-01-01
In this article, time effects on intelligence test scores have been investigated. In particular, we examined whether the "Flynn effect" is manifest in children from the middle and higher IQ distribution range, measured with a child intelligence test based on information processing principles--the Leiden Diagnostic Test. The test was administered…
Fossati, Andrea; Somma, Antonella; Pincus, Aaron; Borroni, Serena; Dowgwillo, Emily A
2017-06-01
The Italian translations of the Pathological Narcissism Inventory (PNI) and Triarchic Psychopathy Measure (TriPM) were administered to 609 community dwelling adults. Participants who scored in the upper 10% of the distribution of the PNI total score were assigned to the group of participants at risk for pathological narcissism, whereas participants who scored in the upper 10% of the distribution of the TriPM total score were assigned to the group of participants at risk for psychopathy. The final sample included 126 participants who were administered the Reading the Mind in the Eyes Test (RMET) and emotion-eliciting movie clips. Participants at risk for pathological narcissism scored significantly lower on the RMET total score than participants who were not at risk for pathological narcissism. Participants at risk for psychopathy showed a significant reduction in the subjective experience of disgust, fear, sadness, and tenderness compared to participants who were not at risk for psychopathy.
Datta, Somnath; Nevalainen, Jaakko; Oja, Hannu
2012-01-01
SUMMARY Rank based tests are alternatives to likelihood based tests popularized by their relative robustness and underlying elegant mathematical theory. There has been a serge in research activities in this area in recent years since a number of researchers are working to develop and extend rank based procedures to clustered dependent data which include situations with known correlation structures (e.g., as in mixed effects models) as well as more general form of dependence. The purpose of this paper is to test the symmetry of a marginal distribution under clustered data. However, unlike most other papers in the area, we consider the possibility that the cluster size is a random variable whose distribution is dependent on the distribution of the variable of interest within a cluster. This situation typically arises when the clusters are defined in a natural way (e.g., not controlled by the experimenter or statistician) and in which the size of the cluster may carry information about the distribution of data values within a cluster. Under the scenario of an informative cluster size, attempts to use some form of variance adjusted sign or signed rank tests would fail since they would not maintain the correct size under the distribution of marginal symmetry. To overcome this difficulty Datta and Satten (2008; Biometrics, 64, 501–507) proposed a Wilcoxon type signed rank test based on the principle of within cluster resampling. In this paper we study this problem in more generality by introducing a class of valid tests employing a general score function. Asymptotic null distribution of these tests is obtained. A simulation study shows that a more general choice of the score function can sometimes result in greater power than the Datta and Satten test; furthermore, this development offers the user a wider choice. We illustrate our tests using a real data example on spinal cord injury patients. PMID:23074359
Datta, Somnath; Nevalainen, Jaakko; Oja, Hannu
2012-09-01
Rank based tests are alternatives to likelihood based tests popularized by their relative robustness and underlying elegant mathematical theory. There has been a serge in research activities in this area in recent years since a number of researchers are working to develop and extend rank based procedures to clustered dependent data which include situations with known correlation structures (e.g., as in mixed effects models) as well as more general form of dependence.The purpose of this paper is to test the symmetry of a marginal distribution under clustered data. However, unlike most other papers in the area, we consider the possibility that the cluster size is a random variable whose distribution is dependent on the distribution of the variable of interest within a cluster. This situation typically arises when the clusters are defined in a natural way (e.g., not controlled by the experimenter or statistician) and in which the size of the cluster may carry information about the distribution of data values within a cluster.Under the scenario of an informative cluster size, attempts to use some form of variance adjusted sign or signed rank tests would fail since they would not maintain the correct size under the distribution of marginal symmetry. To overcome this difficulty Datta and Satten (2008; Biometrics, 64, 501-507) proposed a Wilcoxon type signed rank test based on the principle of within cluster resampling. In this paper we study this problem in more generality by introducing a class of valid tests employing a general score function. Asymptotic null distribution of these tests is obtained. A simulation study shows that a more general choice of the score function can sometimes result in greater power than the Datta and Satten test; furthermore, this development offers the user a wider choice. We illustrate our tests using a real data example on spinal cord injury patients.
Federal Register 2010, 2011, 2012, 2013, 2014
2013-07-16
... practice of including ``pre-test'' questions on certain qualification examinations, which is designed to..., the examination includes 10 additional, unidentified pre-test questions that do not contribute towards... scored. The 10 pre-test questions are randomly distributed throughout the examination. Availability of...
Federal Register 2010, 2011, 2012, 2013, 2014
2011-09-07
..., each examination includes 10 additional, unidentified ``pre-test'' questions that do not contribute towards the candidate's score. The 10 pre-test questions are randomly distributed throughout the... customers, the integrity of the marketplace or the public. The examination will test applicants on general...
Filipiak, Katarzyna; Klein, Daniel; Roy, Anuradha
2017-01-01
The problem of testing the separability of a covariance matrix against an unstructured variance-covariance matrix is studied in the context of multivariate repeated measures data using Rao's score test (RST). The RST statistic is developed with the first component of the separable structure as a first-order autoregressive (AR(1)) correlation matrix or an unstructured (UN) covariance matrix under the assumption of multivariate normality. It is shown that the distribution of the RST statistic under the null hypothesis of any separability does not depend on the true values of the mean or the unstructured components of the separable structure. A significant advantage of the RST is that it can be performed for small samples, even smaller than the dimension of the data, where the likelihood ratio test (LRT) cannot be used, and it outperforms the standard LRT in a number of contexts. Monte Carlo simulations are then used to study the comparative behavior of the null distribution of the RST statistic, as well as that of the LRT statistic, in terms of sample size considerations, and for the estimation of the empirical percentiles. Our findings are compared with existing results where the first component of the separable structure is a compound symmetry (CS) correlation matrix. It is also shown by simulations that the empirical null distribution of the RST statistic converges faster than the empirical null distribution of the LRT statistic to the limiting χ 2 distribution. The tests are implemented on a real dataset from medical studies. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Effects of Inequality, Family and School on Mathematics Achievement: Country and Student Differences
ERIC Educational Resources Information Center
Chiu, Ming Ming
2010-01-01
Inequality, family and school characteristics were linked to student achievement as shown by multi-level analyses of 107,975 15 year olds' mathematics tests and questionnaires in 41 countries. Equal distribution of country and school resources were linked to higher mathematics scores. Students scored higher in families or schools with more…
An Evaluation of a New Method of IRT Scaling
ERIC Educational Resources Information Center
Ragland, Shelley
2010-01-01
In order to be able to fairly compare scores derived from different forms of the same test within the Item Response Theory framework, all individual item parameters must be on the same scale. A new approach, the RPA method, which is based on transformations of predicted score distributions was evaluated here and was shown to produce results…
A weighted generalized score statistic for comparison of predictive values of diagnostic tests.
Kosinski, Andrzej S
2013-03-15
Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations that are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we presented, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic that incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, always reduces to the score statistic in the independent samples situation, and preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe that the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the WGS test statistic in a general GEE setting. Copyright © 2012 John Wiley & Sons, Ltd.
Estimation of Occupational Test Norms from Job Analysis Data.
ERIC Educational Resources Information Center
Mecham, Robert C.
Occupational norms exist for some tests, and differences in the distributions of test scores by occupation are evident. Sampling error (SE), situationally specific factors (SSFs), and differences in job content (DIJCs) were explored as possible reasons for the observed differences. SE was explored by analyzing 742 validity studies performed by the…
The Spelling Project. Technical Report 1992-2.
ERIC Educational Resources Information Center
Green, Kathy E.; Schroeder, David H.
Results of an analysis of a newly developed spelling test and several related measures are reported. Information about the reliability of a newly developed spelling test; its distribution of scores; its relationship with the standard battery of aptitude tests of the Johnson O'Connor Research Foundation; and its relationships with sex, age,…
ERIC Educational Resources Information Center
Moses, Tim; Holland, Paul W.
2010-01-01
In this study, eight statistical strategies were evaluated for selecting the parameterizations of loglinear models for smoothing the bivariate test score distributions used in nonequivalent groups with anchor test (NEAT) equating. Four of the strategies were based on significance tests of chi-square statistics (Likelihood Ratio, Pearson,…
Can patients interpret health information? An assessment of the medical data interpretation test.
Schwartz, Lisa M; Woloshin, Steven; Welch, H Gilbert
2005-01-01
To establish the reliability/validity of an 18-item test of patients' medical data interpretation skills. Survey with retest after 2 weeks. Subjects. 178 people recruited from advertisements in local newspapers, an outpatient clinic, and a hospital open house. The percentage of correct answers to individual items ranged from 20% to 87%, and medical data interpretation test scores (on a 0- 100 scale) were normally distributed (median 61.1, mean 61.0, range 6-94). Reliability was good (test-retest correlation=0.67, Cronbach's alpha=0.71). Construct validity was supported in several ways. Higher scores were found among people with highest versus lowest numeracy (71 v. 36, P<0.001), highest quantitative literacy (65 v. 28, P<0.001), and highest education (69 v. 42, P=0.004). Scores for 15 physician experts also completing the survey were significantly higher than participants with other postgraduate degrees (mean score 89 v. 69, P<0.001). The medical data interpretation test is a reliable and valid measure of the ability to interpret medical statistics.
Zero-inflated Conway-Maxwell Poisson Distribution to Analyze Discrete Data.
Sim, Shin Zhu; Gupta, Ramesh C; Ong, Seng Huat
2018-01-09
In this paper, we study the zero-inflated Conway-Maxwell Poisson (ZICMP) distribution and develop a regression model. Score and likelihood ratio tests are also implemented for testing the inflation/deflation parameter. Simulation studies are carried out to examine the performance of these tests. A data example is presented to illustrate the concepts. In this example, the proposed model is compared to the well-known zero-inflated Poisson (ZIP) and the zero- inflated generalized Poisson (ZIGP) regression models. It is shown that the fit by ZICMP is comparable or better than these models.
Comparison of an expert system with other clinical scores for the evaluation of severity of asthma.
Gautier, V; Rédier, H; Pujol, J L; Bousquet, J; Proudhon, H; Michel, C; Daurès, J P; Michel, F B; Godard, P
1996-01-01
"Asthmaexpert" was produced at the special request of several clinicians in order to obtain a better understanding of the medical decisions taken by clinical experts in the management of asthmatic patients. In order to assess the severity of asthma, a new score called Artificial Intelligence score (AI score), produced by Asthmaexpert, was compared with three other scores (Aas, Hargreave and Brooks). One hundred patients were enrolled prospectively in the study during their first consultation in the out-patient clinic. Distribution of severity level according to the different scores was studied, and the reliability between AI and other scores was evaluated by Kappa and MacNemar tests. Correlations with functional parameters were performed. The AI score assessed higher levels of severity than the other scores (Kappa = 18, 28 and 10% for Aas, Hargreave and Brooks, respectively) with significant MacNemar test in all cases. There was a significant correlation between AI score and forced expiratory volume in one second (FEV1) (r = 0.73). These data indicate that the AI score is a severity score which defines higher levels of severity than the chosen scores. Correlations for functional parameters are good. This score appears easy to use for the first consultation of an asthmatic patient.
Cold chain monitoring of OPV at transit levels in India: correlation of VVM and potency status.
Jain, R; Sahu, A K; Tewari, S; Malik, N; Singh, S; Khare, S; Bhatia, R
2003-12-01
We have conducted a study to analyze monitoring of the cold chain of 674 OPV field samples collected at four different levels of vaccine distribution viz., immunization clinics, district stores, hospitals and Primary Health Centers (PHC) from states of Uttar Pradesh, Madhya Pradesh, and Delhi. The study design included: collection and scoring of vaccine vial monitor (VVM) status of the samples and testing for total oral polio virus concentration (TOPV) by standard WHO protocol. Ten samples each were exposed to 25 degrees C and 37 degrees C, and 10 samples as controls were kept at -20 degrees C. VVM were scored daily till they attained grade 4 and each sample was subsequently subjected to potency testing for individual polio serotypes 1, 2 and 3, and TOPV. Of the 674 samples tested it was observed that: samples from immunization clinics and district stores had an acceptable VVM score of grade 1 and 2; however the probable risk that a sub potent vaccine could have been administered was 2.15%. In 2.5% samples received from district stores vaccine had a VVM score of grade 3 (i.e., discard point), although vaccine when tested was found to be potent (i.e., leading to the vaccine wastage). With exposure to higher temperatures, VVM changed score to grade 2 and 3 when the vaccine was kept at 25 degrees C/37 degrees C, and the titres of individual serotypes 1, 2 and 3 and TOPV were beyond the acceptable limits. Important observations at the different levels of vaccine distribution network and correlation of VVM and potency status of OPV are discussed in the paper which will be of help to the EPI program managers at different transit levels.
Gitau, Tabither M; Micklesfield, Lisa K; Pettifor, John M; Norris, Shane A
2014-01-01
This cross-sectional study of urban high schools in Johannesburg, South Africa, sought to examine eating attitudes, body image and self-esteem among male adolescents (n = 391). Anthropometric measurements, Eating Attitudes Test-26 (EAT-26), Rosenberg self-esteem, body image satisfaction and perception of females were collected at age 13, 15 and 17 years. Descriptive analysis was done to describe the sample, and non-parametric Wilcoxon Mann-Whitney test was used to test for significant differences between data that were not normally distributed (EAT-26). Spearman's rank correlation coefficient analyses were conducted to test for associations between self-esteem scores and eating attitudes, body mass indices and body image satisfaction scores. To assess the differences between groups that were normally distributed chi-square tests were carried out. Ethnic differences significantly affected adolescent boys' body mass index (BMI), eating attitudes and self-esteem; White boys had higher self-esteem, BMI and normal eating attitudes than the Black boys did. BMI was positively associated with self-esteem (p = 0.01, r = 0.134) and negatively with dieting behaviour in White boys (p = 0.004, r = -0.257), and with lower EAT-26 bulimic and oral control scores in Black boys. In conclusion, the findings highlight ethnic differences and a need to better understand cultural differences that influence adolescent attitudes and behaviour.
Bohn, Justin; Eddings, Wesley; Schneeweiss, Sebastian
2017-03-15
Distributed networks of health-care data sources are increasingly being utilized to conduct pharmacoepidemiologic database studies. Such networks may contain data that are not physically pooled but instead are distributed horizontally (separate patients within each data source) or vertically (separate measures within each data source) in order to preserve patient privacy. While multivariable methods for the analysis of horizontally distributed data are frequently employed, few practical approaches have been put forth to deal with vertically distributed health-care databases. In this paper, we propose 2 propensity score-based approaches to vertically distributed data analysis and test their performance using 5 example studies. We found that these approaches produced point estimates close to what could be achieved without partitioning. We further found a performance benefit (i.e., lower mean squared error) for sequentially passing a propensity score through each data domain (called the "sequential approach") as compared with fitting separate domain-specific propensity scores (called the "parallel approach"). These results were validated in a small simulation study. This proof-of-concept study suggests a new multivariable analysis approach to vertically distributed health-care databases that is practical, preserves patient privacy, and warrants further investigation for use in clinical research applications that rely on health-care databases. © The Author 2017. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Analyzing Test-Taking Behavior: Decision Theory Meets Psychometric Theory.
Budescu, David V; Bo, Yuanchao
2015-12-01
We investigate the implications of penalizing incorrect answers to multiple-choice tests, from the perspective of both test-takers and test-makers. To do so, we use a model that combines a well-known item response theory model with prospect theory (Kahneman and Tversky, Prospect theory: An analysis of decision under risk, Econometrica 47:263-91, 1979). Our results reveal that when test-takers are fully informed of the scoring rule, the use of any penalty has detrimental effects for both test-takers (they are always penalized in excess, particularly those who are risk averse and loss averse) and test-makers (the bias of the estimated scores, as well as the variance and skewness of their distribution, increase as a function of the severity of the penalty).
ERIC Educational Resources Information Center
Wei, Youhua; Morgan, Rick
2016-01-01
As an alternative to common-item equating when common items do not function as expected, the single-group growth model (SGGM) scaling uses common examinees or repeaters to link test scores on different forms. The SGGM scaling assumes that, for repeaters taking adjacent administrations, the conditional distribution of scale scores in later…
Families at the Century's Turn: The Troubling Economic Trends. Family Review.
ERIC Educational Resources Information Center
Lindjord, Denise
2000-01-01
Discusses U.S. economic trends for the past century. Notes that distribution of wealth is more concentrated at top than is distribution of income, with income inequality growing worse in the 1990s. Maintains that wealth disparity explains achievement test score gaps between white and minority students. Presents proposals for asset-building,…
Computerized Algorithms: Evaluation of Capability to Predict Graduation from Air Force Training.
1980-09-01
Distribution of the ASVAB Administrative Aptitude Test Scores for the 1976 AFSC 64530 Population ..................................... 73 ASO Distribution...2.34 2.15 PDA 4.11 3.12 I83 ... Table .. (on LAioi; M.iix of the Ind ’pedttit Variab Independent us .l Variable Merh Adm Gel Elec AF)T Ed Aig Bi Math
de Azeredo Passos, Valéria Maria; Giatti, Luana; Bensenor, Isabela; Tiemeier, Henning; Ikram, M Arfan; de Figueiredo, Roberta Carvalho; Chor, Dora; Schmidt, Maria Inês; Barreto, Sandhi Maria
2015-10-09
Brazil has gone through fast demographic, epidemiologic and nutritional transitions and, despite recent improvements in wealth distribution, continues to present a high level of social and economic inequality. The ELSA-Brasil, a cohort study, aimed at investigating cardiovascular diseases and diabetes, offers a great opportunity to assess cognitive decline in this aging population through time-sequential analyses drawn from the same battery of tests over time. The purpose of this study is to analyze the influence of sex, age and education on cognitive tests performance of the participants at baseline. Analyses pertain to 14,594 participants with aged 35 to 74 years, who were functionally independent and had no history of stroke or use of neuroleptics, anticonvulsants, cholinesterase inhibitors or antiparkinsonian agents. Mean age was 52.0 ± 9.0 years and 54.2% of participants were women. Cognitive tests included the word memory tests (retention, recall and recognition), verbal fluency tests (VFT, animals and letter F) and Trail Making Test B. Multivariable linear regression analysis was used to determine the influence of sociodemographic characteristics on the distribution of the final score of each test. Women had significant and slightly higher scores than men in all memory tests and VFT, but took more time to perform Trail B. Reduced performance in all tests was seen with an increase age and, more importantly, with decrease level of education. The word list and VFT scores decreased at about one word for every 10 years of age; whereas higher-educated participants scored four words more on the word list test, and six or seven more correct words on VFT, when compared to lower-educated participants. Additionally, the oldest and less educated participants showed significant lower response rates in all tests. The higher influence of education than age in this Brazilian population reinforce the need for caution in analyzing and diagnosing cognitive impairments based on traditional cognitive tests and the importance of searching for education-free cognitive tests, especially in low and middle-income countries.
Albin, Thomas J; Vink, Peter
2015-01-01
Anthropometric data are assumed to have a Gaussian (Normal) distribution, but if non-Gaussian, accommodation estimates are affected. When data are limited, users may choose to combine anthropometric elements by Combining Percentiles (CP) (adding or subtracting), despite known adverse effects. This study examined whether global anthropometric data are Gaussian distributed. It compared the Median Correlation Method (MCM) of combining anthropometric elements with unknown correlations to CP to determine if MCM provides better estimates of percentile values and accommodation. Percentile values of 604 male and female anthropometric data drawn from seven countries worldwide were expressed as standard scores. The standard scores were tested to determine if they were consistent with a Gaussian distribution. Empirical multipliers for determining percentile values were developed.In a test case, five anthropometric elements descriptive of seating were combined in addition and subtraction models. Percentile values were estimated for each model by CP, MCM with Gaussian distributed data, or MCM with empirically distributed data. The 5th and 95th percentile values of a dataset of global anthropometric data are shown to be asymmetrically distributed. MCM with empirical multipliers gave more accurate estimates of 5th and 95th percentiles values. Anthropometric data are not Gaussian distributed. The MCM method is more accurate than adding or subtracting percentiles.
Haran, F Jay; Dretsch, Michael N; Slaboda, Jill C; Johnson, Dagny E; Adam, Octavian R; Tsao, Jack W
2016-01-01
To examine differences between the baseline-referenced and norm-referenced approaches for determining decrements in Automated Neuropsychological Assessment Metrics Version 4 TBI-MIL (ANAM) performance following mild traumatic brain injury (mTBI). ANAM data were reviewed for 616 US Service members, with 528 of this sample having experienced an mTBI and 88 were controls. Post-injury change scores were calculated for each sub-test: (1) normative change score = in-theater score - normative mean and (2) baseline change score = in-theater score - pre-deployment baseline. Reliable change cut-scores were applied to the change and the resulting frequency distributions were compared using McNemar tests. Receiver operator curves (ROC) using both samples (i.e. mTBI and control) were calculated for the change scores for each approach to determine the discriminate ability of the ANAM. There were no statistical differences, p < 0.05 (Bonferonni-Holm corrected), between the approaches. When the area under the curve for the ROCs were averaged across sub-tests, there were no significant differences between either the norm-referenced (0.65) or baseline-referenced (0.66) approaches, p > 0.05. Overall, the findings suggest there is no clear advantage of using the baseline-referenced approach over norm-referenced approach.
Kambas, A; Venetsanou, F; Giannakidou, D; Fatouros, I G; Avloniti, A; Chatzinikolaou, A; Draganidis, D; Zimmer, R
2012-01-01
Given the negative influence of motor difficulties on people's quality of life their early identification seems to be crucial and consequently the information provided by a sound assessment tool is of great importance. The aim of this study was to examine the suitability of the MOT 4-6 (Zimmer & Volkamer, 1987) for use with preschoolers in Greece. Seven hundred and seventy-eight Greek children aged 48-71 months participated in the study. The two-way ANOVA used on total MOT performance revealed significant differences among the age groups formed in preschool age within Greeks, while boys' and girls' scores were quite similar. From the comparisons of Greeks' scores with the German standardization sample's ones, statistically significant differences were found in two age groups. However according to the Cohen's d effect size they were not of great importance. The distribution of Greeks' scores according to the test cut-offs, revealed that the MOT can differentiate all levels of performance, although a slight deviation from the distribution of Germans' scores was noticed. Finally, both the test-retest reliability and internal consistency of the test were found to be excellent. The MOT 4-6 seems to be a valuable motor assessment tool for Greek preschoolers. Regarding its norms, despite the minor differences that were noticed between the motor development of Greek and German preschoolers, their adjustment was thought to be unnecessary. Instead of lowering the norms, efforts for preventing the motor performance decline should be enhanced. Copyright © 2012 Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
Denbleyker, John Nickolas
2012-01-01
The shortcomings of the proportion above cut (PAC) statistic used so prominently in the educational landscape renders it a very problematic measure for making correct inferences with student test data. The limitations of PAC-based statistics are more pronounced with cross-test comparisons due to their dependency on cut-score locations. A better…
Nakata, Bruce Negrello; Cavalini, Worens; Bonin, Eduardo A; Salvalaggio, Paolo R; Loureiro, Marcelo P
2017-10-01
Minimally invasive surgery (MIS) requires the mastery of manual skills and a specific training is required. Apart from residencies and fellowships in MIS, other learning opportunities utilize massive training, mainly with use of simulators in short courses. A long-term postgraduate course represents an opportunity to learn through training using distributed practice. The objective of this study is to assess the use of distributed practice for acquisition of basic minimally invasive skills in surgeons who participated in a long-term MIS postgraduate course. A prospective, longitudinal and quantitative study was conducted among surgeons who attended a 1-year postgraduate course of MIS in Brazil, from 2012 to 2014. They were tested through five different exercises in box trainers (peg-transfer, passing, cutting, intracorporeal knot, and suture) in the first (t0), fourth (t1) and last, eighth, (t2) meetings of this course. The time and penalties of each exercise were collected for each participant. Participant skills were assessed based on time and accuracy on a previously tested score. Fifty-seven surgeons (participants) from three consecutive groups participated in this study. There was a significant improvement in scores in all exercises. The average increase in scores between t0 and t2 was 88% for peg-transfer, 174% for passing, 149% for cutting, 130% for intracorporeal knot, and 120% for suture (p < 0.001 for all exercises). Learning through distributed practice is effective and should be integrated into a MIS postgraduate course curriculum for acquisition of core skills.
van der Maas, Nico Arie
2017-03-16
The Multiple Sclerosis Questionnaire for Physical Therapists (MSQPT) is a patient-rated outcome questionnaire for evaluating the rehabilitation of persons with multiple sclerosis (MS). Responsiveness was evaluated, and minimal important difference (MID) estimates were calculated to provide thresholds for clinical change for four items, three sections and the total score of the MSQPT. This multicentre study used a combined distribution- and anchor-based approach with multiple anchors and multiple rating of change questions. Responsiveness was evaluated using effect size, standardized response mean (SRM), modified SRM and relative efficiency. For distribution-based MID estimates, 0.2 and 0.33 standard deviations (SD), standard error of measurement (SEM) and minimal detectable change were used . Triangulation of anchor- and distribution-based MID estimates provided a range of MID values for each of the four items, the three sections and the total score of the MSQPT. The MID values were tested for their sensitivity and specificity for amelioration and deterioration for each of the four items, the three sections and the total score of the MSQPT. The MID values of each item and section and of the total score with the best sensitivity and specificity were selected as thresholds for clinical change. The outcome measures were the MSQPT, Hamburg Quality of Life Questionnaire for Multiple Sclerosis (HAQUAMS), rating of change questionnaires, Expanded Disability Status Scale, 6-metre timed walking test, Berg Balance Scale and 6-minute walking test. The effect size ranged from 0.46 to 1.49. The SRM data showed comparable results. The modified SRM ranged from 0.00 to 0.60. Anchor-based MID estimates were very low and were comparable with SD- and SEM-based estimates. The MSQPT was more responsive than the HAQUAMS in detecting improvement but less responsive in finding deterioration. The best MID estimates of the items, sections and total score, expressed in percentage of their maximum score, were between 5.4% (activity) and 22% (item 10) change for improvement and between 5.7% (total score) and 22% (item 10) change for deterioration. The MSQPT is a responsive questionnaire with an adequate MID that may be used as threshold for change during rehabilitation of MS patients. This trial was retrospectively (01/24/2015) registered in ClinicalTrials.gov as NCT02346279.
Poisson Approximation-Based Score Test for Detecting Association of Rare Variants.
Fang, Hongyan; Zhang, Hong; Yang, Yaning
2016-07-01
Genome-wide association study (GWAS) has achieved great success in identifying genetic variants, but the nature of GWAS has determined its inherent limitations. Under the common disease rare variants (CDRV) hypothesis, the traditional association analysis methods commonly used in GWAS for common variants do not have enough power for detecting rare variants with a limited sample size. As a solution to this problem, pooling rare variants by their functions provides an efficient way for identifying susceptible genes. Rare variant typically have low frequencies of minor alleles, and the distribution of the total number of minor alleles of the rare variants can be approximated by a Poisson distribution. Based on this fact, we propose a new test method, the Poisson Approximation-based Score Test (PAST), for association analysis of rare variants. Two testing methods, namely, ePAST and mPAST, are proposed based on different strategies of pooling rare variants. Simulation results and application to the CRESCENDO cohort data show that our methods are more powerful than the existing methods. © 2016 John Wiley & Sons Ltd/University College London.
Estimating High Tech Army Recruiting Markets
1992-09-01
SCORES: 1963-1988 11 TABLE 4 AVERAGE AMERICAN COLLEGE TESTING ( ACT ) SCORES: 1970-1988 12 TABLE 5 DISTRIBUTION OF THE NLSY SAMPLE BY GENDER AND RACE 21...training, competent leaders, sufficient resources and funds to equip the force 1 . In order to maximize the quality of training, recruiting success is...training policies to current conditions in the ’educational training market 1 . Recruiting success is highly dependent on the nature of the civilian
ERIC Educational Resources Information Center
Sullins, Walter L.
Five-hundred dichotomously scored response patterns were generated with sequentially independent (SI) items and 500 with dependent (SD) items for each of thirty-six combinations of sampling parameters (i.e., three test lengths, three sample sizes, and four item difficulty distributions). KR-20, KR-21, and Split-Half (S-H) reliabilities were…
Neelon, Brian; Gelfand, Alan E.; Miranda, Marie Lynn
2013-01-01
Summary Researchers in the health and social sciences often wish to examine joint spatial patterns for two or more related outcomes. Examples include infant birth weight and gestational length, psychosocial and behavioral indices, and educational test scores from different cognitive domains. We propose a multivariate spatial mixture model for the joint analysis of continuous individual-level outcomes that are referenced to areal units. The responses are modeled as a finite mixture of multivariate normals, which accommodates a wide range of marginal response distributions and allows investigators to examine covariate effects within subpopulations of interest. The model has a hierarchical structure built at the individual level (i.e., individuals are nested within areal units), and thus incorporates both individual- and areal-level predictors as well as spatial random effects for each mixture component. Conditional autoregressive (CAR) priors on the random effects provide spatial smoothing and allow the shape of the multivariate distribution to vary flexibly across geographic regions. We adopt a Bayesian modeling approach and develop an efficient Markov chain Monte Carlo model fitting algorithm that relies primarily on closed-form full conditionals. We use the model to explore geographic patterns in end-of-grade math and reading test scores among school-age children in North Carolina. PMID:26401059
Gui, Jiang; Moore, Jason H.; Williams, Scott M.; Andrews, Peter; Hillege, Hans L.; van der Harst, Pim; Navis, Gerjan; Van Gilst, Wiek H.; Asselbergs, Folkert W.; Gilbert-Diamond, Diane
2013-01-01
We present an extension of the two-class multifactor dimensionality reduction (MDR) algorithm that enables detection and characterization of epistatic SNP-SNP interactions in the context of a quantitative trait. The proposed Quantitative MDR (QMDR) method handles continuous data by modifying MDR’s constructive induction algorithm to use a T-test. QMDR replaces the balanced accuracy metric with a T-test statistic as the score to determine the best interaction model. We used a simulation to identify the empirical distribution of QMDR’s testing score. We then applied QMDR to genetic data from the ongoing prospective Prevention of Renal and Vascular End-Stage Disease (PREVEND) study. PMID:23805232
Classical Testing in Functional Linear Models.
Kong, Dehan; Staicu, Ana-Maria; Maity, Arnab
2016-01-01
We extend four tests common in classical regression - Wald, score, likelihood ratio and F tests - to functional linear regression, for testing the null hypothesis, that there is no association between a scalar response and a functional covariate. Using functional principal component analysis, we re-express the functional linear model as a standard linear model, where the effect of the functional covariate can be approximated by a finite linear combination of the functional principal component scores. In this setting, we consider application of the four traditional tests. The proposed testing procedures are investigated theoretically for densely observed functional covariates when the number of principal components diverges. Using the theoretical distribution of the tests under the alternative hypothesis, we develop a procedure for sample size calculation in the context of functional linear regression. The four tests are further compared numerically for both densely and sparsely observed noisy functional data in simulation experiments and using two real data applications.
Classical Testing in Functional Linear Models
Kong, Dehan; Staicu, Ana-Maria; Maity, Arnab
2016-01-01
We extend four tests common in classical regression - Wald, score, likelihood ratio and F tests - to functional linear regression, for testing the null hypothesis, that there is no association between a scalar response and a functional covariate. Using functional principal component analysis, we re-express the functional linear model as a standard linear model, where the effect of the functional covariate can be approximated by a finite linear combination of the functional principal component scores. In this setting, we consider application of the four traditional tests. The proposed testing procedures are investigated theoretically for densely observed functional covariates when the number of principal components diverges. Using the theoretical distribution of the tests under the alternative hypothesis, we develop a procedure for sample size calculation in the context of functional linear regression. The four tests are further compared numerically for both densely and sparsely observed noisy functional data in simulation experiments and using two real data applications. PMID:28955155
Mallett, Susan; Halligan, Steve; Collins, Gary S.; Altman, Doug G.
2014-01-01
Background Different methods of evaluating diagnostic performance when comparing diagnostic tests may lead to different results. We compared two such approaches, sensitivity and specificity with area under the Receiver Operating Characteristic Curve (ROC AUC) for the evaluation of CT colonography for the detection of polyps, either with or without computer assisted detection. Methods In a multireader multicase study of 10 readers and 107 cases we compared sensitivity and specificity, using radiological reporting of the presence or absence of polyps, to ROC AUC calculated from confidence scores concerning the presence of polyps. Both methods were assessed against a reference standard. Here we focus on five readers, selected to illustrate issues in design and analysis. We compared diagnostic measures within readers, showing that differences in results are due to statistical methods. Results Reader performance varied widely depending on whether sensitivity and specificity or ROC AUC was used. There were problems using confidence scores; in assigning scores to all cases; in use of zero scores when no polyps were identified; the bimodal non-normal distribution of scores; fitting ROC curves due to extrapolation beyond the study data; and the undue influence of a few false positive results. Variation due to use of different ROC methods exceeded differences between test results for ROC AUC. Conclusions The confidence scores recorded in our study violated many assumptions of ROC AUC methods, rendering these methods inappropriate. The problems we identified will apply to other detection studies using confidence scores. We found sensitivity and specificity were a more reliable and clinically appropriate method to compare diagnostic tests. PMID:25353643
Mallett, Susan; Halligan, Steve; Collins, Gary S; Altman, Doug G
2014-01-01
Different methods of evaluating diagnostic performance when comparing diagnostic tests may lead to different results. We compared two such approaches, sensitivity and specificity with area under the Receiver Operating Characteristic Curve (ROC AUC) for the evaluation of CT colonography for the detection of polyps, either with or without computer assisted detection. In a multireader multicase study of 10 readers and 107 cases we compared sensitivity and specificity, using radiological reporting of the presence or absence of polyps, to ROC AUC calculated from confidence scores concerning the presence of polyps. Both methods were assessed against a reference standard. Here we focus on five readers, selected to illustrate issues in design and analysis. We compared diagnostic measures within readers, showing that differences in results are due to statistical methods. Reader performance varied widely depending on whether sensitivity and specificity or ROC AUC was used. There were problems using confidence scores; in assigning scores to all cases; in use of zero scores when no polyps were identified; the bimodal non-normal distribution of scores; fitting ROC curves due to extrapolation beyond the study data; and the undue influence of a few false positive results. Variation due to use of different ROC methods exceeded differences between test results for ROC AUC. The confidence scores recorded in our study violated many assumptions of ROC AUC methods, rendering these methods inappropriate. The problems we identified will apply to other detection studies using confidence scores. We found sensitivity and specificity were a more reliable and clinically appropriate method to compare diagnostic tests.
Rane, Shruti; Caroselli, Jerome Silvio; Dickinson, Mercedes; Tran, Kim; Kuang, Fanny; Hiscock, Merrill
2016-01-01
The Trail Making Test (TMT), a widely used neuropsychological test, is highly effective in detecting brain damage. A shortcoming of the test is that it requires drawing lines and thus is impractical for use with persons suffering manual impairment. The 3 studies described herein were designed to describe and evaluate a nonmanual Trail Making Test (NMTMT) that would be suitable for use with manually impaired individuals. The NMTMT utilizes color to permit oral reporting of the stimuli constituting a series of numbers (Part A) or alternating series of numbers and letters (Part B). The studies, which involved a total of 200 university students, indicate that the standard TMT and the NMTMT are moderately related to each other and have similar patterns of association and nonassociation with other neuropsychological measures. Participants with scores falling near the bottom of the NMTMT distribution have a high probability of scoring at least 1 standard deviation below the mean of the TMT distribution for Part B. The clinically important relationship of Part A to Part B seems to be retained in the NMTMT. It is concluded that the NMTMT shows promise as a substitute for the TMT when the TMT cannot be used.
Item response theory scoring and the detection of curvilinear relationships.
Carter, Nathan T; Dalal, Dev K; Guan, Li; LoPilato, Alexander C; Withrow, Scott A
2017-03-01
Psychologists are increasingly positing theories of behavior that suggest psychological constructs are curvilinearly related to outcomes. However, results from empirical tests for such curvilinear relations have been mixed. We propose that correctly identifying the response process underlying responses to measures is important for the accuracy of these tests. Indeed, past research has indicated that item responses to many self-report measures follow an ideal point response process-wherein respondents agree only to items that reflect their own standing on the measured variable-as opposed to a dominance process, wherein stronger agreement, regardless of item content, is always indicative of higher standing on the construct. We test whether item response theory (IRT) scoring appropriate for the underlying response process to self-report measures results in more accurate tests for curvilinearity. In 2 simulation studies, we show that, regardless of the underlying response process used to generate the data, using the traditional sum-score generally results in high Type 1 error rates or low power for detecting curvilinearity, depending on the distribution of item locations. With few exceptions, appropriate power and Type 1 error rates are achieved when dominance-based and ideal point-based IRT scoring are correctly used to score dominance and ideal point response data, respectively. We conclude that (a) researchers should be theory-guided when hypothesizing and testing for curvilinear relations; (b) correctly identifying whether responses follow an ideal point versus dominance process, particularly when items are not extreme is critical; and (c) IRT model-based scoring is crucial for accurate tests of curvilinearity. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Atayero, Aderemi A; Popoola, Segun I; Egeonu, Jesse; Oludayo, Olumuyiwa
2018-08-01
Citation is one of the important metrics that are used in measuring the relevance and the impact of research publications. The potentials of citation analytics may be exploited to understand the gains of publishing scholarly peer-reviewed research outputs in either Open Access (OA) sources or Subscription-Based (SB) sources in the bid to increase citation impact. However, relevant data required for such comparative analysis must be freely accessible for evidence-based findings and conclusions. In this data article, citation scores ( CiteScores ) of 2542 OA sources and 15,040 SB sources indexed in Scopus from 2014 to 2016 were presented and analyzed based on a set of five inclusion criteria. A robust dataset, which contains the CiteScores of OA and SB publication sources included, is attached as supplementary material to this data article to facilitate further reuse. Descriptive statistics and frequency distributions of OA CiteScores and SB CiteScores are presented in tables. Boxplot representations and scatter plots are provided to show the statistical distributions of OA CiteScores and SB CiteScores across the three sub-categories (Book Series, Journal, and Trade Journal). Correlation coefficient and p-value matrices are made available within the data article. In addition, Probability Density Functions (PDFs) and Cumulative Distribution Functions (CDFs) of OA CiteScores and SB CiteScores are computed and the results are presented using tables and graphs. Furthermore, Analysis of Variance (ANOVA) and multiple comparison post-hoc tests are conducted to understand the statistical difference (and its significance, if any) in the citation impact of OA publication sources and SB publication source based on CiteScore . In the long run, the data provided in this article will help policy makers and researchers in Higher Education Institutions (HEIs) to identify the appropriate publication source type and category for dissemination of scholarly research findings with maximum citation impact.
Eddy, Sean R.
2008-01-01
Sequence database searches require accurate estimation of the statistical significance of scores. Optimal local sequence alignment scores follow Gumbel distributions, but determining an important parameter of the distribution (λ) requires time-consuming computational simulation. Moreover, optimal alignment scores are less powerful than probabilistic scores that integrate over alignment uncertainty (“Forward” scores), but the expected distribution of Forward scores remains unknown. Here, I conjecture that both expected score distributions have simple, predictable forms when full probabilistic modeling methods are used. For a probabilistic model of local sequence alignment, optimal alignment bit scores (“Viterbi” scores) are Gumbel-distributed with constant λ = log 2, and the high scoring tail of Forward scores is exponential with the same constant λ. Simulation studies support these conjectures over a wide range of profile/sequence comparisons, using 9,318 profile-hidden Markov models from the Pfam database. This enables efficient and accurate determination of expectation values (E-values) for both Viterbi and Forward scores for probabilistic local alignments. PMID:18516236
Consequences of Violated Equating Assumptions under the Equivalent Groups Design
ERIC Educational Resources Information Center
Lyren, Per-Erik; Hambleton, Ronald K.
2011-01-01
The equal ability distribution assumption associated with the equivalent groups equating design was investigated in the context of a selection test for admission to higher education. The purpose was to assess the consequences for the test-takers in terms of receiving improperly high or low scores compared to their peers, and to find strong…
An Investigation of the Raudenbush (1988) Test for Studying Variance Heterogeneity.
ERIC Educational Resources Information Center
Harwell, Michael
1997-01-01
The meta-analytic method proposed by S. W. Raudenbush (1988) for studying variance heterogeneity was studied. Results of a Monte Carlo study indicate that the Type I error rate of the test is sensitive to even modestly platykurtic score distributions and to the ratio of study sample size to the number of studies. (SLD)
Fort, Alfredo L; Deussom, Rachel; Burlew, Randi; Gilroy, Kate; Nelson, David
2017-07-19
Despite its importance, the field of human resources for health (HRH) has lagged in developing methods to measure its status and progress in low- and middle-income countries suffering a workforce crisis. Measures of professional health worker densities and distribution are purely numerical, unreliable, and do not represent the full spectrum of workers providing health services. To provide more information on the multi-dimensional characteristics of human resources for health, in 2013-2014, the global USAID-funded CapacityPlus project, led by IntraHealth International, developed and tested a 79-item HRH Effort Index modeled after the widely used Family Planning Effort Index. The index includes seven recognized HRH dimensions: Leadership and Advocacy; Policy and Governance; Finance; Education and Training; Recruitment, Distribution, and Retention; Human Resources Management; and Monitoring, Evaluation, and Information Systems. Each item is scored from 1 to 10 and scores are averaged with equal weights for each dimension and overall. The questionnaire is applied to knowledgeable informants from public, nongovernmental organization, and private sectors in each country. A pilot test among 49 respondents in Kenya and Nigeria provided useful information to improve, combine, and streamline questions. CapacityPlus applied the revised 50-item questionnaire in 2015 in Burkina Faso, Dominican Republic, Ghana, and Mali, among 92 respondents. Additionally, the index was applied subnationally in the Dominican Republic (16 respondents) and in a consensus-building meeting in Mali (43 respondents) after the national application. The results revealed a range of scores between 3.7 and 6.2 across dimensions, for overall scores between 4.8 and 5.5. Dimensions with lower scores included Recruitment, Distribution, and Retention, while Leadership and Advocacy had higher scores. The tool proved to be well understood and provided key qualitative information on the health workforce to assist in health systems strengthening. It is expected that subsequent applications should provide more information for comparison purposes, to refine aspects of the questionnaire and to correlate scores with measures of service outputs and outcomes.
Fang, Mingying; Oremus, Mark; Tarride, Jean-Eric; Raina, Parminder
2016-07-18
The use of the EQ-5D to asses the economic benefits of health technologies has led to questions about the cross-population transferability of preference weights to calculate health utility scores. The aim of this study is to investigate whether the use of UK and Canadian preference weights will lead to the calculation of different health utility scores in a sample of persons with Alzheimer's disease (AD) and their primary informal caregivers. We recruited 216 patient-caregiver dyads from nine geriatric and memory clinics across Canada. Participants used the EQ-5D-3L to rate their health-related quality-of-life (HRQoL). EQ-5D-3L responses were transformed into health utility scores using UK and Canadian preference weights. The levels of agreement between the two sets of scores were assessed using intraclass correlation coefficients (ICCs). Bland-Altman plots depicted individual-level differences between the two sets of scores. Differences in health utility scores were tested using the Wilcoxon signed rank sum test. A generalized linear model with a gamma distribution was used to examine whether participants' socio-demographic characteristics were associated with their health utility scores. The distributions of health utility scores derived from both the UK and Canadian preference weights were skewed to the left. The intraclass correlation coefficient was 0.94 (95 % CI: 0.92, 0.95) for persons with AD and 0.92 (95 % CI: 0.88, 0.94) for the caregivers. The Canadian weights yielded slightly higher median health utility scores than the UK weights for caregivers (median difference: 0.009; 95 % confidence interval: 0.007, 0.013). This finding persisted after stratifying by disease severity. Few socio-demographic characteristics were associated with the two sets of health utility scores. Health utility scores exhibited small and clinically unimportant differences when calculated with UK versus Canadian preference weights in persons with AD and their caregivers. The original UK and Canadian population samples used to obtain the preference weights valued health states similarly.
Guilloux, Jean-Philippe; Seney, Marianne; Edgar, Nicole; Sibille, Etienne
2011-01-01
Defining anxiety- and depressive-like states in mice (“emotionality”) is best characterized by the use of complementary tests, leading sometimes to puzzling discrepancies and lack of correlation between similar paradigms. To address this issue, we hypothesized that integrating measures along the same behavioral dimensions in different tests would reduce the intrinsic variability of single tests and provide a robust characterization of the underlying “emotionality” of individual mouse, similarly as mood and related syndromes are defined in humans through various related symptoms over time. We describe the use of simple mathematical and integrative tools to help phenotype animals across related behavioral tests (syndrome diagnosis) and experiments (meta-analysis). We applied z-normalization across complementary measures of emotionality in different behavioral tests after unpredictable chronic mild stress (UCMS) or prolonged corticosterone exposure - two approaches to induce anxious-/depressive-like states in mice. Combining z-normalized test values, lowered the variance of emotionality measurement, enhanced the reliability of behavioral phenotyping, and increased analytical opportunities. Comparing integrated emotionality scores across studies revealed a robust sexual dimorphism in the vulnerability to develop high emotionality, manifested as higher UCMS-induced emotionality z-scores, but lower corticosterone-induced scores in females compared to males. Interestingly, the distribution of individual z-scores revealed a pattern of increased baseline emotionality in female mice, reminiscent of what is observed in humans. Together, we show that the z-scoring method yields robust measures of emotionality across complementary tests for individual mice and experimental groups, hence facilitating the comparison across studies and refining the translational applicability of these models. PMID:21277897
Guilloux, Jean-Philippe; Seney, Marianne; Edgar, Nicole; Sibille, Etienne
2011-04-15
Defining anxiety- and depressive-like states in mice (emotionality) is best characterized by the use of complementary tests, leading sometimes to puzzling discrepancies and lack of correlation between similar paradigms. To address this issue, we hypothesized that integrating measures along the same behavioral dimensions in different tests would reduce the intrinsic variability of single tests and provide a robust characterization of the underlying "emotionality" of individual mouse, similarly as mood and related syndromes are defined in humans through various related symptoms over time. We describe the use of simple mathematical and integrative tools to help phenotype animals across related behavioral tests (syndrome diagnosis) and experiments (meta-analysis). We applied z-normalization across complementary measures of emotionality in different behavioral tests after unpredictable chronic mild stress (UCMS) or prolonged corticosterone exposure - two approaches to induce anxious-/depressive-like states in mice. Combining z-normalized test values, lowered the variance of emotionality measurement, enhanced the reliability of behavioral phenotyping, and increased analytical opportunities. Comparing integrated emotionality scores across studies revealed a robust sexual dimorphism in the vulnerability to develop high emotionality, manifested as higher UCMS-induced emotionality z-scores, but lower corticosterone-induced scores in females compared to males. Interestingly, the distribution of individual z-scores revealed a pattern of increased baseline emotionality in female mice, reminiscent of what is observed in humans. Together, we show that the z-scoring method yields robust measures of emotionality across complementary tests for individual mice and experimental groups, hence facilitating the comparison across studies and refining the translational applicability of these models. Copyright © 2011 Elsevier B.V. All rights reserved.
Measuring professional identity formation early in medical school.
Kalet, Adina; Buckvar-Keltz, Lynn; Harnik, Victoria; Monson, Verna; Hubbard, Steven; Crowe, Ruth; Song, Hyuksoon S; Yingling, Sandra
2017-03-01
To assess the feasibility and utility of measuring baseline professional identity formation (PIF) in a theory-based professionalism curriculum for early medical students. All 132 entering students completed the professional identity essay (PIE) and the defining issues test (DIT2). Students received score reports with individualized narrative feedback and wrote a structured reflection after a large-group session in which the PIF construct was reviewed. Analysis of PIEs resulted in assignment of a full or transitional PIF stage (1-5). The DIT2 score reflects the proportion of the time students used universal ethical principles to justify a response to 6 moral dilemma cases. Students' reflections were content analyzed. PIF scores were distributed across stage 2/3, stage 3, stage 3/4, and stage 4. No student scores were in stages 1, 2, 4/5, or 5. The mean DIT2 score was 53% (range 9.7?76.5%); the correlation between PIF stage and DIT score was ρ = 0.18 (p = 0.03). Students who took an analytic approach to the data and demonstrated both awareness that they are novices and anticipation of continued PIF tended to respond more positively to the feedback. These PIF scores distributed similarly to novice students in other professions. Developmental-theory based PIF and moral reasoning measures are related. Students reflected on these measures in meaningful ways suggesting utility of measuring PIF scores in medical education.
PQScal (Power Quality Score Calculation for Distribution Systems with DER Integration)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Power Quality is of great importance to evaluate the “health” of a distribution system, especially when the distributed energy resource (DER) penetration becomes more significant. The individual components that make up power quality, such as voltage magnitude and unbalance, can be measured in simulations or in the field, however, a comprehensive method to incorporate all of these values into a single score doesn't exist. As a result, we propose a methodology to quantify the power quality health using the single number value, named as Power Quality Score (PQS). The PQS is dependent on six metrics that are developed based onmore » both components that directly impact power quality and those are often reference in the context of power quality. These six metrics are named as System Average Voltage Magnitude Violation Index (SAVMVI), System Average Voltage Fluctuation Index (SAVFI), System Average Voltage Unbalance Index (SAVUI), System Control Device Operation Index (SCDOI), System Reactive Power Demand Index (SRPDI) and System Energy Loss Index (SELI). This software tool, PQScal, is developed based on this novel PQS methodology. Besides of traditional distribution systems, PQScal can also measure the power quality for distribution systems with various DER penetrations. PQScal has been tested on two utility distribution feeders with distinct model characteristics and its effectiveness has been proved. In sum, PQScal can help utilities or other parties to measure the power quality of distribution systems with DER integration easily and effectively.« less
Knowledge of European orthodontic postgraduate students on biostatistics.
Polychronopoulou, Argy; Eliades, Theodore; Taoufik, Konstantina; Papadopoulos, Moschos A; Athanasiou, Athanasios E
2011-08-01
The purpose of this study was to explore the level of knowledge in biostatistics of orthodontic postgraduate students. A four-section questionnaire, which included a knowledge test/quiz on biostatistics and epidemiology, was developed. This questionnaire was distributed to postgraduate programme directors of European universities to be delivered to students for completion under mock examination conditions (in-class session). The frequency distributions of demographic characteristics were examined, the percentages of participants who agreed or strongly agreed with each attitudinal statement were calculated, and the percentages of participants who felt fairly to highly confident for each statement were determined. Knowledge scores were calculated by the percentage of correct answers; missing values were counted as incorrect answers. The Student's t-test or one-way analysis of variance, where appropriate, was utilized to determine the participants' characteristics associated with mean knowledge scores. Data were further analysed with multiple linear regression modelling to determine the adjusted/unconfounded effect of possible knowledge score predictors. A two-tailed P-value of 0.05 was considered statistically significant with a 95 percent confidence interval (CI). One hundred and twenty seven from a total of 129 orthodontic students who replied completed the questionnaire. The mean correct answers of the participants were 43.8 percent with a 95 percent CI of 40.2-47.3 percent. This score was not influenced by gender, years elapsed from graduation, other advanced degree, or year of study; the sole parameter, which seemed to influence this score was attendance at a biostatistics/epidemiology course (51.9 versus 39.5 percent score of participants who had previously taken a course versus those who had not, P<0.001). A surprising finding was the inability of the responders to identify the appropriate use of the chi-square test (11.8 percent, 95 percent CI: 6.1-17.5 percent). The knowledge on biostatistics of orthodontic postgraduate students in Europe is only influenced by previous relevant education.
NASA Astrophysics Data System (ADS)
Sanchez, Gerardo
A flipped laboratory model involves significant preparation by the students on lab material prior to entry to the laboratory. This allows laboratory time to be focused on active learning through experiments. The aim of this study was to observe changes in student performance through the transition from a traditional laboratory format, to a flipped format. The data showed that for both Anatomy and Physiology (I and II) laboratories a more normal distribution of grades was observed once labs were flipped and lecture grade averages increased. Chi square and analysis of variance tests showed grade changes to a statistically significant degree, with a p value of less than 0.05 on both analyses. Regression analyses gave decreasing numbers after the flipped labs were introduced with an r. 2 value of .485 for A&P I, and .564 for A&P II. Results indicate improved scores for the lecture part of the A&P course, decreased outlying scores above 100, and all score distributions approached a more normal distribution.
Quality Control of Direct Molecular Diagnostics for Methicillin-Resistant Staphylococcus aureus▿
van Belkum, Alex; Niesters, Hubert G. M.; MacKay, William G.; van Leeuwen, Willem B.
2007-01-01
Ten samples containing various amounts of methicillin-resistant Staphylococcus aureus (MRSA), methicillin-susceptible S. aureus, methicillin-resistant Staphylococcus epidermidis (MRSE), and combinations thereof were distributed to 51 laboratories for molecular diagnostics testing. Samples containing 102 to 103 MRSA cells were frequently reported to be negative. MRSE samples were scored as negative by all commercial tests but by only two out of three in-house tests. PMID:17581936
Conway-Habes, Erin E; Herbst, Brian F; Herbst, Lori A; Kinnear, Benjamin; Timmons, Kristen; Horewitz, Deborah; Falgout, Rachel; O'Toole, Jennifer K; Vossmeyer, Michael
2017-03-01
The population of adults with childhood-onset chronic illness is growing across children's hospitals and constitutes a high risk population. National Early Warning Score (NEWS) is among the most recently validated adult early warning scores (EWSs) for early recognition of and response to clinical deterioration. Our aim was to implement and standardize NEWS scoring in 80% of patients age 21 and older admitted to a children's hospital. Our intervention was tested on a single unit of our children's hospital. The primary process measure was the percentage of NEWS documented within 1 hour of routine nursing assessments, and was tracked using a run chart. Improvement activities focused on effective training, key stakeholder buy-in, increased awareness, real-time mitigation of failures, accountability for adherence, and action-oriented response. We also tracked the distribution of NEWS values and medical emergency team calls. The percentage of NEWS documented with routine nursing assessments for patients age 21 and over increased from 0% to 90% within 15 weeks and remained at 77% or greater for 17 weeks. Our distribution of NEWS values was similar to previously reported NEWS distribution. A nurse-driven adult early warning system for inpatients age 21 and older at a children's hospital can be achieved through a standardized EWS assessment process, incorporation into the electronic health record, and charge nurse and key stakeholder oversight. Furthermore, implementation of an adult EWS being used at a pediatric institution and our distribution of NEWS values were comparable to distribution published from adult hospitals. Copyright © 2017 by the American Academy of Pediatrics.
Xing, Chao; Elston, Robert C
2006-07-01
The multipoint lod score and mod score methods have been advocated for their superior power in detecting linkage. However, little has been done to determine the distribution of multipoint lod scores or to examine the properties of mod scores. In this paper we study the distribution of multipoint lod scores both analytically and by simulation. We also study by simulation the distribution of maximum multipoint lod scores when maximized over different penetrance models. The multipoint lod score is approximately normally distributed with mean and variance that depend on marker informativity, marker density, specified genetic model, number of pedigrees, pedigree structure, and pattern of affection status. When the multipoint lod scores are maximized over a set of assumed penetrances models, an excess of false positive indications of linkage appear under dominant analysis models with low penetrances and under recessive analysis models with high penetrances. Therefore, caution should be taken in interpreting results when employing multipoint lod score and mod score approaches, in particular when inferring the level of linkage significance and the mode of inheritance of a trait.
Power analysis to detect treatment effects in longitudinal clinical trials for Alzheimer's disease.
Huang, Zhiyue; Muniz-Terrera, Graciela; Tom, Brian D M
2017-09-01
Assessing cognitive and functional changes at the early stage of Alzheimer's disease (AD) and detecting treatment effects in clinical trials for early AD are challenging. Under the assumption that transformed versions of the Mini-Mental State Examination, the Clinical Dementia Rating Scale-Sum of Boxes, and the Alzheimer's Disease Assessment Scale-Cognitive Subscale tests'/components' scores are from a multivariate linear mixed-effects model, we calculated the sample sizes required to detect treatment effects on the annual rates of change in these three components in clinical trials for participants with mild cognitive impairment. Our results suggest that a large number of participants would be required to detect a clinically meaningful treatment effect in a population with preclinical or prodromal Alzheimer's disease. We found that the transformed Mini-Mental State Examination is more sensitive for detecting treatment effects in early AD than the transformed Clinical Dementia Rating Scale-Sum of Boxes and Alzheimer's Disease Assessment Scale-Cognitive Subscale. The use of optimal weights to construct powerful test statistics or sensitive composite scores/endpoints can reduce the required sample sizes needed for clinical trials. Consideration of the multivariate/joint distribution of components' scores rather than the distribution of a single composite score when designing clinical trials can lead to an increase in power and reduced sample sizes for detecting treatment effects in clinical trials for early AD.
Verbal and visual memory in patients with early Parkinson's disease: effect of levodopa.
Singh, Sumit; Behari, Madhuri
2006-03-01
The effect of initiation of levodopa therapy on the memory functions in patients with Parkinson's disease remains poorly understood. To evaluate the effect of initiation of levodopa therapy on memory, in patients with early Parkinson's disease. Prospective case control study. Seventeen patients with early Parkinson's disease were evaluated for verbal memory using Rey's auditory verbal learning test, and visual memory using the Benton's visual retention test and Form sequence learning test. UPDRS scores, Hoehn and Yahr's Staging and Schwab and England scores of Activities of daily living. Hamilton's depression rating scale and MMSE were also evaluated. Six controls were also evaluated according to similar study protocol. Levodopa was then prescribed to the cases. Same tests were repeated on all the subjects after 12 weeks. The mean age of the patients was 59.8 (+ 12.9 yrs); mean disease duration of 3.26 (+ 2.06 yrs). The mean UPDRS scores of patients were 36.52 (+ 15.84). Controls were of a similar age and sex distribution. A statistically significant improvement in the scores on the UPDRS, Hamilton's depression scale, Schwab and England scale, and a statistically significant deterioration in the scores of visual memory was observed in patients with PD after starting levodopa, as compared to their baseline scores. There was no correlation between degree of deterioration and the dose of levodopa. Initiation of levodopa therapy in patients with early and stable Parkinson's disease is associated with deterioration in visual memory functions, with relative preservation of the verbal memory.
Normative data for the Maryland CNC Test.
Mendel, Lisa Lucks; Mustain, William D; Magro, Jessica
2014-09-01
The Maryland consonant-vowel nucleus-consonant (CNC) Test is routinely used in Veterans Administration medical centers, yet there is a paucity of published normative data for this test. The purpose of this study was to provide information on the means and distribution of word-recognition scores on the Maryland CNC Test as a function of degree of hearing loss for a veteran population. A retrospective, descriptive design was conducted. The sample consisted of records from veterans who had Compensation and Pension (C&P) examinations at a Veterans Administration medical center (N = 1,760 ears). Audiometric records of veterans who had C&P examinations during a 10 yr period were reviewed, and the pure-tone averages (PTA4) at four frequencies (1000, 2000, 3000, and 4000 Hz) were documented. The maximum word-recognition score (PBmax) was determined from the performance-intensity functions obtained using the Maryland CNC Test. Correlations were made between PBmax and PTA4. A wide range of word-recognition scores were obtained at all levels of PTA4 for this population. In addition, a strong negative correlation between the PBmax and the PTA4 was observed, indicating that as PTA4 increased, PBmax decreased. Word-recognition scores decreased significantly as hearing loss increased beyond a mild hearing loss. Although threshold was influenced by age, no statistically significant relationship was found between word-recognition score and the age of the participants. RESULTS from this study provide normative data in table and figure format to assist audiologists in interpreting patient results on the Maryland CNC test for a veteran population. These results provide a quantitative method for audiologists to use to interpret word-recognition scores based on pure-tone hearing loss. American Academy of Audiology.
ERIC Educational Resources Information Center
Puhan, Gautam; Moses, Tim P.; Yu, Lei; Dorans, Neil J.
2007-01-01
The purpose of the current study was to examine whether log-linear smoothing of observed score distributions in small samples results in more accurate differential item functioning (DIF) estimates under the simultaneous item bias test (SIBTEST) framework. Data from a teacher certification test were analyzed using White candidates in the reference…
Music therapy career aptitude and generalized self-efficacy in music therapy students.
Lim, Hayoung A; Befi, Cathy M
2014-01-01
While the Music Therapy Career Aptitude Test (MTCAT) provides a measure of student aptitude, measures of perceived self-efficacy may provide additional information about a students' suitability for a music therapy career. As a first step in determining whether future studies examining combined scores from the MTCAT and the Generalized Self-Efficacy (GSE) scale would be useful to help predict academic success in music therapy, we explored the internal reliability of these two measures in a sample of undergraduate students, and the relationship (concurrent validity) of the measures to one another. Eighty undergraduate music therapy students (14 male; 66 female) completed the MTCAT and GSE. To determine internal reliability we conducted tests of normality and calculated Cronbach's Coefficient Alpha for each measure. Pearson correlation coefficients were calculated to ascertain the strength of the relationship between the MTCAT and GSE. MTCAT scores were normally distributed and had high internal consistency (Cronbach's α = 0.706). GSE scores were not normally distributed, but had high internal consistency (Cronbach's α = 0.748). The correlation coefficient analysis revealed that MTCAT and GSE scores were moderately correlated ((r = 0.426, p < 0.0001). MTCAT scores can be used to partially determine perceived self-efficacy in undergraduate music therapy students; however, a more complete picture of student suitability for music therapy may be determined by administering the GSE alongside the MTCAT. Future studies are needed to determine whether combined MTCAT and GSE scores can be used to predict student success in an undergraduate music therapy program. © the American Music Therapy Association 2014. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Using Minimum Acceptable GRE Scores for Graduate Admissions Suppresses Diversity
NASA Astrophysics Data System (ADS)
Miller, Casey
2014-01-01
I will present data showing that significant performance disparities on the GRE general test exist based on the test taker's race and gender [1]. Because of the belief that high GRE scores qualify one for graduate studies, the diversity issues faced by STEM fields may originate, at least in part, in misuse of the GRE scores by graduate admissions committees. I will quantitatively demonstrate this by showing that the combination of a hard cut-off and the different score distributions leads to the systematic underrepresentation of certain groups. I will present data from USF’s PhD program that shows a lack of correlation between GRE scores and research ability; similar null results are emerging from numerous other programs. I will then discuss how assessing non-cognitive competencies in the selection process may lead to a more enlightened search for the next generation of scientists. [1] C. W. Miller, "Admissions Criteria and Diversity in Graduate School", APS News Vol 22, Issue 2, The Back Page (2013) http://www.aps.org/publications/apsnews/201302/backpage.cfm
Improving IQ measurement in intellectual disabilities using true deviation from population norms
2014-01-01
Background Intellectual disability (ID) is characterized by global cognitive deficits, yet the very IQ tests used to assess ID have limited range and precision in this population, especially for more impaired individuals. Methods We describe the development and validation of a method of raw z-score transformation (based on general population norms) that ameliorates floor effects and improves the precision of IQ measurement in ID using the Stanford Binet 5 (SB5) in fragile X syndrome (FXS; n = 106), the leading inherited cause of ID, and in individuals with idiopathic autism spectrum disorder (ASD; n = 205). We compared the distributional characteristics and Q-Q plots from the standardized scores with the deviation z-scores. Additionally, we examined the relationship between both scoring methods and multiple criterion measures. Results We found evidence that substantial and meaningful variation in cognitive ability on standardized IQ tests among individuals with ID is lost when converting raw scores to standardized scaled, index and IQ scores. Use of the deviation z- score method rectifies this problem, and accounts for significant additional variance in criterion validation measures, above and beyond the usual IQ scores. Additionally, individual and group-level cognitive strengths and weaknesses are recovered using deviation scores. Conclusion Traditional methods for generating IQ scores in lower functioning individuals with ID are inaccurate and inadequate, leading to erroneously flat profiles. However assessment of cognitive abilities is substantially improved by measuring true deviation in performance from standardization sample norms. This work has important implications for standardized test development, clinical assessment, and research for which IQ is an important measure of interest in individuals with neurodevelopmental disorders and other forms of cognitive impairment. PMID:26491488
Improving IQ measurement in intellectual disabilities using true deviation from population norms.
Sansone, Stephanie M; Schneider, Andrea; Bickel, Erika; Berry-Kravis, Elizabeth; Prescott, Christina; Hessl, David
2014-01-01
Intellectual disability (ID) is characterized by global cognitive deficits, yet the very IQ tests used to assess ID have limited range and precision in this population, especially for more impaired individuals. We describe the development and validation of a method of raw z-score transformation (based on general population norms) that ameliorates floor effects and improves the precision of IQ measurement in ID using the Stanford Binet 5 (SB5) in fragile X syndrome (FXS; n = 106), the leading inherited cause of ID, and in individuals with idiopathic autism spectrum disorder (ASD; n = 205). We compared the distributional characteristics and Q-Q plots from the standardized scores with the deviation z-scores. Additionally, we examined the relationship between both scoring methods and multiple criterion measures. We found evidence that substantial and meaningful variation in cognitive ability on standardized IQ tests among individuals with ID is lost when converting raw scores to standardized scaled, index and IQ scores. Use of the deviation z- score method rectifies this problem, and accounts for significant additional variance in criterion validation measures, above and beyond the usual IQ scores. Additionally, individual and group-level cognitive strengths and weaknesses are recovered using deviation scores. Traditional methods for generating IQ scores in lower functioning individuals with ID are inaccurate and inadequate, leading to erroneously flat profiles. However assessment of cognitive abilities is substantially improved by measuring true deviation in performance from standardization sample norms. This work has important implications for standardized test development, clinical assessment, and research for which IQ is an important measure of interest in individuals with neurodevelopmental disorders and other forms of cognitive impairment.
Robust joint score tests in the application of DNA methylation data analysis.
Li, Xuan; Fu, Yuejiao; Wang, Xiaogang; Qiu, Weiliang
2018-05-18
Recently differential variability has been showed to be valuable in evaluating the association of DNA methylation to the risks of complex human diseases. The statistical tests based on both differential methylation level and differential variability can be more powerful than those based only on differential methylation level. Anh and Wang (2013) proposed a joint score test (AW) to simultaneously detect for differential methylation and differential variability. However, AW's method seems to be quite conservative and has not been fully compared with existing joint tests. We proposed three improved joint score tests, namely iAW.Lev, iAW.BF, and iAW.TM, and have made extensive comparisons with the joint likelihood ratio test (jointLRT), the Kolmogorov-Smirnov (KS) test, and the AW test. Systematic simulation studies showed that: 1) the three improved tests performed better (i.e., having larger power, while keeping nominal Type I error rates) than the other three tests for data with outliers and having different variances between cases and controls; 2) for data from normal distributions, the three improved tests had slightly lower power than jointLRT and AW. The analyses of two Illumina HumanMethylation27 data sets GSE37020 and GSE20080 and one Illumina Infinium MethylationEPIC data set GSE107080 demonstrated that three improved tests had higher true validation rates than those from jointLRT, KS, and AW. The three proposed joint score tests are robust against the violation of normality assumption and presence of outlying observations in comparison with other three existing tests. Among the three proposed tests, iAW.BF seems to be the most robust and effective one for all simulated scenarios and also in real data analyses.
On computation of p-values in parametric linkage analysis.
Kurbasic, Azra; Hössjer, Ola
2004-01-01
Parametric linkage analysis is usually used to find chromosomal regions linked to a disease (phenotype) that is described with a specific genetic model. This is done by investigating the relations between the disease and genetic markers, that is, well-characterized loci of known position with a clear Mendelian mode of inheritance. Assume we have found an interesting region on a chromosome that we suspect is linked to the disease. Then we want to test the hypothesis of no linkage versus the alternative one of linkage. As a measure we use the maximal lod score Z(max). It is well known that the maximal lod score has asymptotically a (2 ln 10)(-1) x (1/2 chi2(0) + 1/2 chi2(1)) distribution under the null hypothesis of no linkage when only one point (one marker) on the chromosome is studied. In this paper, we show, both by simulations and theoretical arguments, that the null hypothesis distribution of Zmax has no simple form when more than one marker is used (multipoint analysis). In fact, the distribution of Zmax depends on the number of families, their structure, the assumed genetic model, marker denseness, and marker informativity. This means that a constant critical limit of Zmax leads to tests associated with different significance levels. Because of the above-mentioned problems, from the statistical point of view the maximal lod score should be supplemented by a p-value when results are reported. Copyright (c) 2004 S. Karger AG, Basel.
1981-02-01
monotonic increasing function of true ability or performance score. A cumulative probability function is * then very convenient for describiny; one’s...possible outcomes such as test scores, grade-point averages or other common outcome variables. Utility is usually a monotonic increasing function of true ...r(0) is negative for 8 <i and positive for 0 > M, U(o) is risk-prone for low 0 values and risk-averse for high 0 values. This property is true for
Berndl, K; von Cranach, M; Grüsser, O J
1986-01-01
The perception and recognition of faces, mimic expression and gestures were investigated in normal subjects and schizophrenic patients by means of a movie test described in a previous report (Berndl et al. 1986). The error scores were compared with results from a semi-quantitative evaluation of psychopathological symptoms and with some data from the case histories. The overall error scores found in the three groups of schizophrenic patients (paranoic, hebephrenic, schizo-affective) were significantly increased (7-fold) over those of normals. No significant difference in the distribution of the error scores in the three different patient groups was found. In 10 different sub-tests following the movie the deficiencies found in the schizophrenic patients were analysed in detail. The error score for the averbal test was on average higher in paranoic patients than in the two other groups of patients, while the opposite was true for the error scores found in the verbal tests. Age and sex had some impact on the test results. In normals, female subjects were somewhat better than male. In schizophrenic patients the reverse was true. Thus female patients were more affected by the disease than male patients with respect to the task performance. The correlation between duration of the disease and error score was small; less than 10% of the error scores could be attributed to factors related to the duration of illness. Evaluation of psychopathological symptoms indicated that the stronger the schizophrenic defect, the higher the error score, but again this relationship was responsible for not more than 10% of the errors. The estimated degree of acute psychosis and overall sum of psychopathological abnormalities as scored in a semi-quantitative exploration did not correlate with the error score, but with each other. Similarly, treatment with psychopharmaceuticals, previous misuse of drugs or of alcohol had practically no effect on the outcome of the test data. The analysis of performance and test data of schizophrenic patients indicated that our findings are most likely not due to a "non-specific" impairment of cognitive function in schizophrenia, but point to a fairly selective defect in elementary cognitive visual functions necessary for averbal social communication. Some possible explanations of the data are discussed in relation to neuropsychological and neurophysiological findings on "face-specific" cortical areas located in the primate temporal lobe.
Standardized UXO Technology Demonstration Site Scoring Record No. 946
2017-07-01
VA 22350 U.S. Army Test and Evaluation Command Aberdeen Proving Ground, MD 21005-5001 Distribution Unlimited, July 2017. The use of a...Address . . . . . . . . . . . . . . . 4 2.1.2 System Description ...4 2.1.3 Data Processing Description . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 2.1.4 Data Submission
Validity of the Dictionary of Occupational Titles for Assessing Upper Extremity Work Demands
Opsteegh, Lonneke; Soer, Remko; Reinders-Messelink, Heleen A.; Reneman, Michiel F.; van der Sluis, Corry K.
2010-01-01
Objectives The Dictionary of Occupational Titles (DOT) is used in vocational rehabilitation to guide decisions about the ability of a person with activity limitations to perform activities at work. The DOT has categorized physical work demands in five categories. The validity of this categorization is unknown. Aim of this study was to investigate whether the DOT could be used validly to guide decisions for patients with injuries to the upper extremities. Four hypotheses were tested. Methods A database including 701 healthy workers was used. All subjects filled out the Dutch Musculoskeletal Questionnaire, from which an Upper Extremity Work Demands score (UEWD) was derived. First, relation between the DOT-categories and UEWD-score was analysed using Spearman correlations. Second, variance of the UEWD-score in occupational groups was tested by visually inspecting boxplots and assessing kurtosis of the distribution. Third, it was investigated whether occupations classified in one DOT-category, could significantly differ on UEWD-scores. Fourth, it was investigated whether occupations in different DOT-categories could have similar UEWD-scores using Mann Whitney U-tests (MWU). Results Relation between the DOT-categories and the UEWD-score was weak (rsp = 0.40; p<.01). Overlap between categories was found. Kurtosis exceeded ±1.0 in 3 occupational groups, indicating large variance. UEWD-scores were significantly different within one DOT-category (MWU = 1.500; p<.001). UEWD scores between DOT-categories were not significantly different (MWU = 203.000; p = .49). Conclusion All four hypotheses could not be rejected. The DOT appears to be invalid for assessing upper extremity work demands. PMID:21151934
Ahmed, Haitham M; Al-Mallah, Mouaz H; McEvoy, John W; Nasir, Khurram; Blumenthal, Roger S; Jones, Steven R; Brawner, Clinton A; Keteyian, Steven J; Blaha, Michael J
2015-03-01
To determine which routinely collected exercise test variables most strongly correlate with survival and to derive a fitness risk score that can be used to predict 10-year survival. This was a retrospective cohort study of 58,020 adults aged 18 to 96 years who were free of established heart disease and were referred for an exercise stress test from January 1, 1991, through May 31, 2009. Demographic, clinical, exercise, and mortality data were collected on all patients as part of the Henry Ford ExercIse Testing (FIT) Project. Cox proportional hazards models were used to identify exercise test variables most predictive of survival. A "FIT Treadmill Score" was then derived from the β coefficients of the model with the highest survival discrimination. The median age of the 58,020 participants was 53 years (interquartile range, 45-62 years), and 28,201 (49%) were female. Over a median of 10 years (interquartile range, 8-14 years), 6456 patients (11%) died. After age and sex, peak metabolic equivalents of task and percentage of maximum predicted heart rate achieved were most highly predictive of survival (P<.001). Subsequent addition of baseline blood pressure and heart rate, change in vital signs, double product, and risk factor data did not further improve survival discrimination. The FIT Treadmill Score, calculated as [percentage of maximum predicted heart rate + 12(metabolic equivalents of task) - 4(age) + 43 if female], ranged from -200 to 200 across the cohort, was near normally distributed, and was found to be highly predictive of 10-year survival (Harrell C statistic, 0.811). The FIT Treadmill Score is easily attainable from any standard exercise test and translates basic treadmill performance measures into a fitness-related mortality risk score. The FIT Treadmill Score should be validated in external populations. Copyright © 2015 Mayo Foundation for Medical Education and Research. Published by Elsevier Inc. All rights reserved.
IQ variations across time, race, and nationality: an artifact of differences in literacy skills.
Marks, David F
2010-06-01
A body of data on IQ collected over 50 years has revealed that average population IQ varies across time, race, and nationality. An explanation for these differences may be that intelligence test performance requires literacy skills not present in all people to the same extent. In eight analyses, population mean full scale IQ and literacy scores yielded correlations ranging from .79 to .99. In cohort studies, significantly larger improvements in IQ occurred in the lower half of the IQ distribution, affecting the distribution variance and skewness in the predicted manner. In addition, three Verbal subscales on the WAIS show the largest Flynn effect sizes and all four Verbal subscales are among those showing the highest racial IQ differences. This pattern of findings supports the hypothesis that both secular and racial differences in intelligence test scores have an environmental explanation: secular and racial differences in IQ are an artifact of variation in literacy skills. These findings suggest that racial IQ distributions will converge if opportunities are equalized for different population groups to achieve the same high level of literacy skills. Social justice requires more effective implementation of policies and programs designed to eliminate inequities in IQ and literacy.
ERIC Educational Resources Information Center
Feldt, Leonard S.
2011-01-01
This article presents a simple, computer-assisted method of determining the extent to which increases in reliability increase the power of the "F" test of equality of means. The method uses a derived formula that relates the changes in the reliability coefficient to changes in the noncentrality of the relevant "F" distribution. A readily available…
Is there an association between astrological data and personality?
Hume, N
1977-07-01
A test was made of the hypothesis that personality characteristics can be predicted on the basis of various features of the individual's astrological chart. Astrological charts were prepared for 196 college-age Ss who also were administered the MMPI and the Leary Interpersonal Check List. Ss were divided into those who had extreme scores on any of the 13 personality variables studied and those who did not. For each personality variable, comparisons were made on a large number of astrological dimensions between distributions of Ss with and without extreme test scores. Six hundred thirty-two such comparisons were made and evaluated with chi-square tests. In that the obtained number of statistically significnat chi-squares was less than what would be expected on a chance basis, the hypothesis was rejected.
Analytic score distributions for a spatially continuous tridirectional Monte Carol transport problem
DOE Office of Scientific and Technical Information (OSTI.GOV)
Booth, T.E.
1996-01-01
The interpretation of the statistical error estimates produced by Monte Carlo transport codes is still somewhat of an art. Empirically, there are variance reduction techniques whose error estimates are almost always reliable, and there are variance reduction techniques whose error estimates are often unreliable. Unreliable error estimates usually result from inadequate large-score sampling from the score distribution`s tail. Statisticians believe that more accurate confidence interval statements are possible if the general nature of the score distribution can be characterized. Here, the analytic score distribution for the exponential transform applied to a simple, spatially continuous Monte Carlo transport problem is provided.more » Anisotropic scattering and implicit capture are included in the theory. In large part, the analytic score distributions that are derived provide the basis for the ten new statistical quality checks in MCNP.« less
The first OSCE; does students' experience of performing in public affect their results?
Chan, Michael; Bax, Nigel; Woodley, Caroline; Jennings, Michael; Nicolson, Rod; Chan, Philip
2015-03-26
Personal qualities have been shown to affect students' exam results. We studied the effect of experience, and level, of public performance in music, drama, dance, sport, and debate at the time of admission to medical school as a predictor of student achievement in their first objective structured clinical examination (OSCE). A single medical school cohort (n = 265) sitting their first clinical exam in 2011 as third year students were studied. Pre-admission statements made at the time of application were coded for their stated achievements in the level of public performance; participation in each activity was scored 0-3, where 0 was no record, 1 = leisure time activity, 2 = activity at school or local level, 3 = activity at district, regional or national level. These scores were correlated to OSCE results by linear regression and t-test. Comparison was made between the highest scoring students in each area, and students scoring zero by t-test. There was a bell shaped distribution in public performance score in this cohort. There was no significant linear regression relationship between OSCE results and overall performance score, or between any subgroups. There was a significant difference between students with high scores in theatre, debate and vocal music areas, grouped together as verbal performance, and students scoring zero in these areas. (p < 0.05, t-test) with an effect size of 0.4. We found modest effects from pre-admission experience of verbal performance on students' scores in the OSCE examination. As these data are taken from students' admission statements, we call into question the received wisdom that such statements are unreliable.
Lodeiro-Fernández, Leire; Lorenzo-López, Laura; Maseda, Ana; Núñez-Naveira, Laura; Rodríguez-Villamil, José Luis; Millán-Calenti, José Carlos
2015-01-01
Purpose The possible relationship between audiometric hearing thresholds and cognitive performance on language tests was analyzed in a cross-sectional cohort of older adults aged ≥65 years (N=98) with different degrees of cognitive impairment. Materials and methods Participants were distributed into two groups according to Reisberg’s Global Deterioration Scale (GDS): a normal/predementia group (GDS scores 1–3) and a moderate/moderately severe dementia group (GDS scores 4 and 5). Hearing loss (pure-tone audiometry) and receptive and production-based language function (Verbal Fluency Test, Boston Naming Test, and Token Test) were assessed. Results Results showed that the dementia group achieved significantly lower scores than the predementia group in all language tests. A moderate negative correlation between hearing loss and verbal comprehension (r=−0.298; P<0.003) was observed in the predementia group (r=−0.363; P<0.007). However, no significant relationship between hearing loss and verbal fluency and naming scores was observed, regardless of cognitive impairment. Conclusion In the predementia group, reduced hearing level partially explains comprehension performance but not language production. In the dementia group, hearing loss cannot be considered as an explanatory factor of poor receptive and production-based language performance. These results are suggestive of cognitive rather than simply auditory problems to explain the language impairment in the elderly. PMID:25914528
Schoenmakers, Birgitte; Wens, Johan
2014-03-04
To investigate if the psychometric qualities of an OSCE consisting of more complex simulated patient encounters remain valid and reliable in the assessment of postgraduate trainees in general practice. In this intervention study without control group, the traditional OSCE was formally replaced by the new, complex version. The study population was composed by all postgraduate trainees (second and third phase) in general practice during the ongoing academic year. Data were handled and collected as part of the formal assessment program. Univariate analyses, the variance of scores and multivariate analyses were performed to assess the test qualities. A total of 340 students participated. Average final scores were slightly higher for third-phase students (t-test, p =0.05). Overall test scores were equally distributed on station level, circuit level and phase level. A multiple regression analysis revealed that test scores were dependent on the stations and circuits, but not on the master phase. In a changing learning environment, assessment and evaluation strategies require reorientation. The reliability and validity of the OSCE remain subject to discussion. In particular, when it comes to content and design, the traditional OSCE might underestimate the performance level of postgraduate trainees in general practice. A reshaping of this OSCE to a more sophisticated design with more complex patient encounters appears to restore the validity of the test results.
Storey, Jennifer E; Hart, Stephen D; Cooke, David J; Michie, Christine
2016-04-01
The Hare Psychopathy Checklist-Revised (PCL-R; Hare, 2003) is a commonly used psychological test for assessing traits of psychopathic personality disorder. Despite the abundance of research using the PCL-R, the vast majority of research used samples of convenience rather than systematic methods to minimize sampling bias and maximize the generalizability of findings. This potentially complicates the interpretation of test scores and research findings, including the "norms" for offenders from the United States and Canada included in the PCL-R manual. In the current study, we evaluated the psychometric properties of PCL-R scores for all male offenders admitted to a regional reception center of the Correctional Service of Canada during a 1-year period (n = 375). Because offenders were admitted for assessment prior to institutional classification, they comprise a sample that was heterogeneous with respect to correctional risks and needs yet representative of all offenders in that region of the service. We examined the distribution of PCL-R scores, classical test theory indices of its structural reliability, the factor structure of test items, and the external correlates of test scores. The findings were highly consistent with those typically reported in previous studies. We interpret these results as indicating it is unlikely any sampling limitations of past research using the PCL-R resulted in findings that were, overall, strongly biased or unrepresentative. (c) 2016 APA, all rights reserved).
Psychometric properties of a scale to measure alexithymia.
Blanchard, E B; Arena, J G; Pallmeyer, T P
1981-01-01
Four studies were conducted on a sample of 230 undergraduates to determine the psychometric properties of a measure of alexithymia, the Schalling-Sifneos Scale. In the first study it was found that scores on the scale are approximately normally distributed for each sex with 8.2% of males and 1.8% of females in the alexithymia range. In the second study a factor analysis of the scale revealed three distinct factors: (1) 'difficulty in expression of feelings'; (2) 'the importance of feelings especially about people'; (3) 'day-dreaming or introspection'. In the second factor analytic study, scores from several standard psychological tests on the same subjects were introduced with the scale items. Two factors in this analysis were comprised almost entirely of the other test scores: a 'general psychological distress factor' and a 'concerns about physical symptoms factor'. The other two factors were similar to factors 1 and 2 above in terms of items. The Rathus Assertiveness Scale loaded positively on the equivalent of factor 1. In the lst study, it was shown that Schalling-Sifneos Scale score is relatively orthogonal to other psychological tests with the exception of a Psychosomatic Symptom Checklist and thus is measuring something other than depression, anxiety, etc.
Gärtner, Fania R; de Miranda, Esteriek; Rijnders, Marlies E; Freeman, Liv M; Middeldorp, Johanna M; Bloemenkamp, Kitty W M; Stiggelbout, Anne M; van den Akker-van Marle, M Elske
2015-10-01
To validate the Labor and Delivery Index (LADY-X), a new delivery-specific utility measure. In a test-retest design, women were surveyed online, 6 to 8 weeks postpartum and again 1 to 2 weeks later. For reliability testing, we assessed the standard error of measurement (S.E.M.) and the intraclass correlation coefficient (ICC). For construct validity, we tested hypotheses on the association with comparison instruments (Mackey Childbirth Satisfaction Rating Scale and Wijma Delivery Experience Questionnaire), both on domain and total score levels. We assessed known-group differences using eight obstetrical indicators: method and place of birth, induction, transfer, control over pain medication, complications concerning mother and child, and experienced control. The questionnaire was completed by 308 women, 257 (83%) completed the retest. The distribution of LADY-X scores was skewed. The reliability was good, as the ICC exceeded 0.80 and the S.E.M. was 0.76. Requirements for good construct validity were fulfilled: all hypotheses for convergent and divergent validity were confirmed, and six of eight hypotheses for known-group differences were confirmed as all differences were statistically significant (P-values: <0.001-0.023), but for two tests, difference scores did not exceed the S.E.M. The LADY-X demonstrates good reliability and construct validity. Despite its skewed distribution, the LADY-X can discriminate between groups. With the preference weights available, the LADY-X might fulfill the need for a utility measure for cost-effectiveness studies for perinatal care interventions. Copyright © 2015 Elsevier Inc. All rights reserved.
The Use of Propensity Scores in Mediation Analysis
ERIC Educational Resources Information Center
Jo, Booil; Stuart, Elizabeth A.; MacKinnon, David P.; Vinokur, Amiram D.
2011-01-01
Mediation analysis uses measures of hypothesized mediating variables to test theory for how a treatment achieves effects on outcomes and to improve subsequent treatments by identifying the most efficient treatment components. Most current mediation analysis methods rely on untested distributional and functional form assumptions for valid…
Arneson, Justin J; Sackett, Paul R; Beatty, Adam S
2011-10-01
The nature of the relationship between ability and performance is of critical importance for admission decisions in the context of higher education and for personnel selection. Although previous research has supported the more-is-better hypothesis by documenting linearity of ability-performance relationships, such research has not been sensitive enough to detect deviations at the top ends of the score distributions. An alternative position receiving considerable attention is the good-enough hypothesis, which suggests that although higher levels of ability may result in better performance up to a threshold, above this threshold greater ability does not translate to better performance. In this study, the nature of the relationship between cognitive ability and performance was examined throughout the score range in four large-scale data sets. Monotonicity was maintained in all instances. Contrary to the good-enough hypothesis, the ability-performance relationship was commonly stronger at the top end of the score distribution than at the bottom end.
Multisample adjusted U-statistics that account for confounding covariates.
Satten, Glen A; Kong, Maiying; Datta, Somnath
2018-06-19
Multisample U-statistics encompass a wide class of test statistics that allow the comparison of 2 or more distributions. U-statistics are especially powerful because they can be applied to both numeric and nonnumeric data, eg, ordinal and categorical data where a pairwise similarity or distance-like measure between categories is available. However, when comparing the distribution of a variable across 2 or more groups, observed differences may be due to confounding covariates. For example, in a case-control study, the distribution of exposure in cases may differ from that in controls entirely because of variables that are related to both exposure and case status and are distributed differently among case and control participants. We propose to use individually reweighted data (ie, using the stratification score for retrospective data or the propensity score for prospective data) to construct adjusted U-statistics that can test the equality of distributions across 2 (or more) groups in the presence of confounding covariates. Asymptotic normality of our adjusted U-statistics is established and a closed form expression of their asymptotic variance is presented. The utility of our approach is demonstrated through simulation studies, as well as in an analysis of data from a case-control study conducted among African-Americans, comparing whether the similarity in haplotypes (ie, sets of adjacent genetic loci inherited from the same parent) occurring in a case and a control participant differs from the similarity in haplotypes occurring in 2 control participants. Copyright © 2018 John Wiley & Sons, Ltd.
Domina, Thurston; Penner, Emily; Hoynes, Hilary
2014-01-01
We use quantile treatment effects estimation to examine the consequences of the random-assignment New York City School Choice Scholarship Program (NYCSCSP) across the distribution of student achievement. Our analyses suggest that the program had negligible and statistically insignificant effects across the skill distribution. In addition to contributing to the literature on school choice, the paper illustrates several ways in which distributional effects estimation can enrich educational research: First, we demonstrate that moving beyond a focus on mean effects estimation makes it possible to generate and test new hypotheses about the heterogeneity of educational treatment effects that speak to the justification for many interventions. Second, we demonstrate that distributional effects can uncover issues even with well-studied datasets by forcing analysts to view their data in new ways. Finally, such estimates highlight where in the overall national achievement distribution test scores of children exposed to particular interventions lie; this is important for exploring the external validity of the intervention’s effects. PMID:26207158
Donegan, Thomas M.
2018-01-01
Abstract Existing models for assigning species, subspecies, or no taxonomic rank to populations which are geographically separated from one another were analyzed. This was done by subjecting over 3,000 pairwise comparisons of vocal or biometric data based on birds to a variety of statistical tests that have been proposed as measures of differentiation. One current model which aims to test diagnosability (Isler et al. 1998) is highly conservative, applying a hard cut-off, which excludes from consideration differentiation below diagnosis. It also includes non-overlap as a requirement, a measure which penalizes increases to sample size. The “species scoring” model of Tobias et al. (2010) involves less drastic cut-offs, but unlike Isler et al. (1998), does not control adequately for sample size and attributes scores in many cases to differentiation which is not statistically significant. Four different models of assessing effect sizes were analyzed: using both pooled and unpooled standard deviations and controlling for sample size using t-distributions or omitting to do so. Pooled standard deviations produced more conservative effect sizes when uncontrolled for sample size but less conservative effect sizes when so controlled. Pooled models require assumptions to be made that are typically elusive or unsupported for taxonomic studies. Modifications to improving these frameworks are proposed, including: (i) introducing statistical significance as a gateway to attributing any weighting to findings of differentiation; (ii) abandoning non-overlap as a test; (iii) recalibrating Tobias et al. (2010) scores based on effect sizes controlled for sample size using t-distributions. A new universal method is proposed for measuring differentiation in taxonomy using continuous variables and a formula is proposed for ranking allopatric populations. This is based first on calculating effect sizes using unpooled standard deviations, controlled for sample size using t-distributions, for a series of different variables. All non-significant results are excluded by scoring them as zero. Distance between any two populations is calculated using Euclidian summation of non-zeroed effect size scores. If the score of an allopatric pair exceeds that of a related sympatric pair, then the allopatric population can be ranked as species and, if not, then at most subspecies rank should be assigned. A spreadsheet has been programmed and is being made available which allows this and other tests of differentiation and rank studied in this paper to be rapidly analyzed. PMID:29780266
Menary, Kyle; Collins, Paul F.; Porter, James N.; Muetzel, Ryan; Olson, Elizabeth A.; Kumar, Vipin; Steinbach, Michael; Lim, Kelvin O.; Luciana, Monica
2013-01-01
Neuroimaging research indicates that human intellectual ability is related to brain structure including the thickness of the cerebral cortex. Most studies indicate that general intelligence is positively associated with cortical thickness in areas of association cortex distributed throughout both brain hemispheres. In this study, we performed a cortical thickness mapping analysis on data from 182 healthy typically developing males and females ages 9 to 24 years to identify correlates of general intelligence (g) scores. To determine if these correlates also mediate associations of specific cognitive abilities with cortical thickness, we regressed specific cognitive test scores on g scores and analyzed the residuals with respect to cortical thickness. The effect of age on the association between cortical thickness and intelligence was examined. We found a widely distributed pattern of positive associations between cortical thickness and g scores, as derived from the first unrotated principal factor of a factor analysis of Wechsler Abbreviated Scale of Intelligence (WASI) subtest scores. After WASI specific cognitive subtest scores were regressed on g factor scores, the residual score variances did not correlate significantly with cortical thickness in the full sample with age covaried. When participants were grouped at the age median, significant positive associations of cortical thickness were obtained in the older group for g-residualized scores on Block Design (a measure of visual-motor integrative processing) while significant negative associations of cortical thickness were observed in the younger group for g-residualized Vocabulary scores. These results regarding correlates of general intelligence are concordant with the existing literature, while the findings from younger versus older subgroups have implications for future research on brain structural correlates of specific cognitive abilities, as well as the cognitive domain specificity of behavioral performance correlates of normative gray matter thinning during adolescence. PMID:24744452
Yang, Huixia; Wei, Yumei; Su, Rina; Wang, Chen; Meng, Wenying; Wang, Yongqing; Shang, Lixin; Cai, Zhenyu; Ji, Liping; Wang, Yunfeng; Sun, Ying; Liu, Jiaxiu; Wei, Li; Sun, Yufeng; Zhang, Xueying; Luo, Tianxia; Chen, Haixia; Yu, Lijun
2016-01-01
Objective To use Z-scores to compare different charts of femur length (FL) applied to our population with the aim of identifying the most appropriate chart. Methods A retrospective study was conducted in Beijing. Fifteen hospitals in Beijing were chosen as clusters using a systemic cluster sampling method, in which 15,194 pregnant women delivered from June 20th to November 30th, 2013. The measurements of FL in the second and third trimester were recorded, as well as the last measurement obtained before delivery. Based on the inclusion and exclusion criteria, we identified FL measurements from 19996 ultrasounds from 7194 patients between 11 and 42 weeks gestation. The FL data were then transformed into Z-scores that were calculated using three series of reference equations obtained from three reports: Leung TN, Pang MW et al (2008); Chitty LS, Altman DG et al (1994); and Papageorghiou AT et al (2014). Each Z-score distribution was presented as the mean and standard deviation (SD). Skewness and kurtosis and were compared with the standard normal distribution using the Kolmogorov-Smirnov test. The histogram of their distributions was superimposed on the non-skewed standard normal curve (mean = 0, SD = 1) to provide a direct visual impression. Finally, the sensitivity and specificity of each reference chart for identifying fetuses <5th or >95th percentile (based on the observed distribution of Z-scores) were calculated. The Youden index was also listed. A scatter diagram with the 5th, 50th, and 95th percentile curves calculated from and superimposed on each reference chart was presented to provide a visual impression. Results The three Z-score distribution curves appeared to be normal, but none of them matched the expected standard normal distribution. In our study, the Papageorghiou reference curve provided the best results, with a sensitivity of 100% for identifying fetuses with measurements < 5th and > 95th percentile, and specificities of 99.9% and 81.5%, respectively. Conclusions It is important to choose an appropriate reference curve when defining what is normal. The Papageorghiou reference curve for FL seems to be the best fit for our population. Perhaps it is time to change our reference curve for femur length. PMID:27458922
Maity, Arnab; Carroll, Raymond J; Mammen, Enno; Chatterjee, Nilanjan
2009-01-01
Motivated from the problem of testing for genetic effects on complex traits in the presence of gene-environment interaction, we develop score tests in general semiparametric regression problems that involves Tukey style 1 degree-of-freedom form of interaction between parametrically and non-parametrically modelled covariates. We find that the score test in this type of model, as recently developed by Chatterjee and co-workers in the fully parametric setting, is biased and requires undersmoothing to be valid in the presence of non-parametric components. Moreover, in the presence of repeated outcomes, the asymptotic distribution of the score test depends on the estimation of functions which are defined as solutions of integral equations, making implementation difficult and computationally taxing. We develop profiled score statistics which are unbiased and asymptotically efficient and can be performed by using standard bandwidth selection methods. In addition, to overcome the difficulty of solving functional equations, we give easy interpretations of the target functions, which in turn allow us to develop estimation procedures that can be easily implemented by using standard computational methods. We present simulation studies to evaluate type I error and power of the method proposed compared with a naive test that does not consider interaction. Finally, we illustrate our methodology by analysing data from a case-control study of colorectal adenoma that was designed to investigate the association between colorectal adenoma and the candidate gene NAT2 in relation to smoking history.
Parker, Michael; Goldberg, Ross F; Dinkins, Maryane M; Asbun, Horacio J; Daniel Smith, C; Preissler, Susanne; Bowers, Steven P
2011-11-01
Outcomes after ventral incisional hernia (VIH) repair are measured by recurrence rate and subjective measures. No objective metrics evaluate functional outcomes after abdominal wall reconstruction. This study aimed to develop testing of abdominal wall strength (AWS) that could be validated as a useful metric. Data were prospectively collected during 9 months from 35 patients. A total of 10 patients were evaluated before and after VIH repair, for a total of 45 encounters. The patients were tested simultaneously or in succession by two of three examiners. Data were collected for three tests: double leg lowering (DLL), trunk raising (TR), and supine reaching (SR). Raw data were compared and tested for validity, and continuous data were transformed to categorical data. Agreement was measured using the intraclass correlation coefficient (ICC) for DLL and using kappa for the ordinal measures. Simultaneous testing yielded the following interobserver reliability: DLL (0.96 and 0.87), TR (1.00 and 0.95), and SR (0.76). Reproducibility was assessed by consecutive tests, with correlation as follows: DLL (0.81), TR (0.81), and RCH (0.21). Due to poor interobserver reliability for the SR test compared with the DLL and TR tests, the SR test was excluded from calculation of an overall score. Based on raw data distribution from the DLL and TR tests, the DLL data were categorized into 10º increments, allowing construction of a 10-point score. The median AWS score was 5 (interquartile range [IQR], 4-7), and there was agreement within 1 point for 42 of the 45 encounters (93%). The findings from this study demonstrate that the 10-point AWS score may measure AWS in an accurate and reproducible fashion, with potential for objective description of abdominal wall function of VIH patients. This score may help to identify patients suited for abdominal wall reconstruction while measuring progress after VIH repair. Further longitudinal outcomes studies are needed.
Competitiveness Improvement Project Informational Workshop
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sinclair, Karin C; Preus, Robert W; Dana, Scott
This presentation was given at the Competitiveness Improvement Project (CIP) Informational Workshop on December 6, 2017. Topics covered during the workshop include an overview of the CIP, past projects, scoring criteria, technical support opportunities, certification body requirements, standards applicable to distributed wind generators, information on the National Electric Code, certification testing requirements, test site requirements, National Environmental Policy Act, design review, levelized cost of energy, procurement/contracting, project management/deliverables, and outreach materials.
Elderly quality of life impacted by traditional chinese medicine techniques
Figueira, Helena A; Figueira, Olivia A; Figueira, Alan A; Figueira, Joana A; Giani, Tania S; Dantas, Estélio HM
2010-01-01
Background: The shift in age structure is having a profound impact, suggesting that the aged should be consulted as reporters on the quality of their own lives. Objectives: The aim of this research was to establish the possible impact of traditional Chinese medicine (TCM) techniques on the quality of life (QOL) of the elderly. Sample: Two non-selected, volunteer groups of Rio de Janeiro municipality inhabitants: a control group (36 individuals), not using TCM, and an experimental group (28 individuals), using TCM at ABACO/Sohaku-in Institute, Brazil. Methods: A questionnaire on elderly QOL devised by the World Health Organization, the WHOQOL-Old, was adopted and descriptive statistical techniques were used: mean and standard deviation. The Shapiro–Wilk test checked the normality of the distribution. Furthermore, based on its normality distribution for the intergroup comparison, the Student t test was applied to facets 2, 4, 5, 6, and total score, and the Mann–Whitney U rank test to facets 1 and 3, both tests aiming to analyze the P value between experimental and control groups. The significance level utilized was 95% (P < 0.05). Results: The experimental group reported the highest QOL for every facet and the total score. Conclusions: The results suggest that TCM raises the level of QOL. PMID:21103400
Assessment of numeracy in sports and exercise science students at an Australian university
NASA Astrophysics Data System (ADS)
Green, Simon; McGlynn, Susan; Stuart, Deidre; Fahey, Paul; Pettigrew, Jim; Clothier, Peter
2018-05-01
The effect of high school study of mathematics on numeracy performance of sports and exercise science (SES) students is not clear. To investigate this further, we tested the numeracy skills of 401 students enrolled in a Bachelor of Health Sciences degree in SES using a multiple-choice survey consisting of four background questions and 39 numeracy test questions. Background questions (5-point scale) focused on highest level of mathematics studied at high school, self-perception of mathematics proficiency, perceived importance of mathematics to SES and likelihood of seeking help with mathematics. Numeracy questions focused on rational number, ratios and rates, basic algebra and graph interpretation. Numeracy performance was based on answers to these questions (1 mark each) and represented by the total score (maximum = 39). Students from first (n = 212), second (n = 78) and third (n = 111) years of the SES degree completed the test. The distribution of numeracy test scores for the entire cohort was negatively skewed with a median (IQR) score of 27(11). We observed statistically significant associations between test scores and the highest level of mathematics studied (P < 0.05), being lowest in students who studied Year 10 Mathematics (20 (9)), intermediate in students who studied Year 12 General Mathematics (26 (8)) and highest in two groups of students who studied higher-level Year 12 Mathematics (31 (9), 31 (6)). There were statistically significant associations between test scores and level of self-perception of mathematics proficiency and also likelihood of seeking help with mathematics (P < 0.05) but not with perceived importance of mathematics to SES. These findings reveal that the level of mathematics studied in high school is a critical factor determining the level of numeracy performance in SES students.
Fitting and Testing Conditional Multinormal Partial Credit Models
ERIC Educational Resources Information Center
Hessen, David J.
2012-01-01
A multinormal partial credit model for factor analysis of polytomously scored items with ordered response categories is derived using an extension of the Dutch Identity (Holland in "Psychometrika" 55:5-18, 1990). In the model, latent variables are assumed to have a multivariate normal distribution conditional on unweighted sums of item…
Heterogeneous Trends in U.S. Teacher Quality 1980-2010
ERIC Educational Resources Information Center
Richey, Jeremiah
2015-01-01
This paper documents changes in the entire ability distribution of individuals entering the teaching profession using the 1979 and 1997 cohorts of the National Longitudinal Survey of Youth and a constructed Armed Force Qualifying Test score that allows direct comparison of ability between cohorts. Such direct comparison between cohorts was…
After-School Tutoring and the Distribution of Student Performance
ERIC Educational Resources Information Center
Huang, Min-Hsiung
2013-01-01
As more primary and secondary students worldwide seek after-school tutoring in academic subjects, concerns are being raised about whether after-school tutoring can raise average test scores without widening the variability in student performance, and whether students of certain ability levels may benefit more than others from after-school…
The Mandarin Childhood Autism Spectrum Test (CAST): Sex Differences
ERIC Educational Resources Information Center
Sun, Xiang; Allison, Carrie; Auyeung, Bonnie; Matthews, Fiona E.; Sharp, Stephen J.; Baron-Cohen, Simon; Brayne, Carol
2014-01-01
Sex differences in social and communication behaviours related to autism spectrum conditions (ASC) have been investigated mainly in Western populations. Little research has been done in Chinese populations. This study explored sex differences related to ASC characteristics by examining differences in item responses and score distributions in…
Extensions of Rasch's Multiplicative Poisson Model.
ERIC Educational Resources Information Center
Jansen, Margo G. H.; van Duijn, Marijtje A. J.
1992-01-01
A model developed by G. Rasch that assumes scores on some attainment tests can be realizations of a Poisson process is explained and expanded by assuming a prior distribution, with fixed but unknown parameters, for the subject parameters. How additional between-subject and within-subject factors can be incorporated is discussed. (SLD)
Statistical Summary of Missouri Higher Education, 1996-1997.
ERIC Educational Resources Information Center
Missouri Coordinating Board for Higher Education, Jefferson City.
Extensive data tables on higher education in Missouri present information on: the academic preparation of college freshmen (fall 1996), including distribution of American College Testing (ACT) scores and high school rankings; tuition, fees, and financial aid (state and federal, by aid type, including merit-based scholarships) and trends;…
Missouri Higher Education: 1995-1996 Statistical Summary.
ERIC Educational Resources Information Center
Missouri State Coordinating Board for Higher Education, Jefferson City.
Extensive data tables on higher education in Missouri present information on: the academic preparation of college freshmen (fall 1995), including distribution of American College Testing (ACT) scores and high school rankings; tuition, fees, and financial aid (state and federal, by aid type, including merit-based scholarships) and trends;…
Liu, Liu; Li, Shunping; Wang, Min; Chen, Gang
2017-01-01
The objective of this study was to compare the differences in the five-level EuroQol-5 dimensions (EQ-5D-5L) health state utility scores derived from Chinese, Japanese, Korean, and UK tariffs. Six hundred and twenty-one breast cancer patients were invited for a face-to-face interview in Qingdao Municipal Hospital, China. EQ-5D-5L was scored using tariffs from China, Japan, Korea, and the UK. The null hypothesis of normal distribution of the EQ-5D-5L utility score was tested by the Shapiro-Wilk test. Nonparametric Friedman test and Wilcoxon signed-rank test were used to determine the difference among the four tariffs. The intraclass correlation coefficients (ICCs) and Bland-Altman plots were used to study the agreement among the four EQ-5D-5L scores. Known-groups validity was studied using a regression framework. There were 608 participants in the final analysis, with a mean ± standard deviation (SD) age of 48.0±9.6 years. EQ-5D-5L utility scores were non-normally distributed. The means (median) ± SD of EQ-5D-5L utilities derived from Chinese, Japanese, Korean, and UK tariffs were 0.828 (0.879) ±0.184, 0.802 (0.823) ±0.164, 0.831 (0.829) ±0.137, and 0.838 (0.866) ±0.154, respectively. Among pairwise comparisons, the difference of median EQ-5D-5L utility scores was only insignificant between Chinese and UK tariffs. Excellent agreements (with ICCs >0.9) were found among the four tariffs albeit the limits of agreement between each pair of tariffs were wide. Known-groups validity was supported. Although four country-specific EQ-5D-5L tariffs have shown an overall high level of correlation and agreement, none of them could be regarded as interchangeable. The higher correlation and agreement between Chinese and UK tariffs may be due to the similar functions that were used in the tariff development. In the absence of Chinese-specific tariff, the UK tariff is the second-best option to be applied in the Chinese population. Results of this study further contribute to the explanation of variations among country-specific tariffs.
Effects of Saccular Function on Recovery of Subjective Dizziness After Vestibular Rehabilitation.
Jeong, Junhui; Jung, Jinsei; Lee, Jeon Mi; Suh, Michelle J; Kwak, Sang Hyun; Kim, Sung Huhn
2017-08-01
We attempted to investigate whether the integrity of saccular function influences the severity of subjective dizziness after vestibular rehabilitation in vestibular neuritis. Retrospective analysis. Tertiary referral center. Forty-six patients with acute unilateral vestibular neuritis were included. Diagnostic, therapeutic, and rehabilitative. All the patients completed vestibular rehabilitation therapy until their computerized dynamic posturography and rotary chair test results were significantly improved. The rehabilitation patients were classified into the normal to mild subjective dizziness and moderate to severe subjective dizziness groups according to the dizziness handicap inventory score (cutoff of 40). Differences between the two groups were analyzed. After rehabilitation, 32.6% of the patients still complained of moderate to severe dizziness. Age, sex distribution, the presence of comorbidities, caloric weakness, pre- and postrehabilitation gain values in rotary chair test, postrehabilitation composite scores in posturography, and the duration of rehabilitation were not significantly different between the two groups. However, initial dizziness handicap inventory (DHI) score and composite score in dynamic posturography were worse and the proportion of patients with absent cervical vestibular-evoked myogenic potential in the moderate to severe group was much higher (93.3% vs. 35.5%, p < 0.001). After multiple regression analysis of those factors, initial DHI score and absent cervical vestibular-evoked myogenic potential response were identified as being associated with higher postrehabilitation DHI score. Saccular dysfunction in acute vestibular neuritis can contribute to persistent subjective dizziness, even after the objective parameters of vestibular function tests have been improved by vestibular rehabilitation.
Bidwell, L. Cinnamon; Palmer, Rohan H.C.; Brick, Leslie; Madden, Pamela A.F.; Heath, Andrew C.; Knopik, Valerie S.
2016-01-01
When examining the effects of prenatal exposure to maternal smoking during pregnancy (MSDP) on later offspring substance use, it is critical to consider familial environments confounded with MSDP. The purpose of this study was to examine the effect of MSDP on offspring's initial reactions to cigarettes and alcohol, which are indicators of future substance-use related problems. We tested these effects using two propensity score approaches (1) by controlling for confounding using the MSDP propensity score and 2) examining effects of MSDP across the MSDP risk distribution by grouping individuals into quantiles based on their MSDP propensity score. This study used data from 829 unrelated mothers with a reported lifetime history of smoking to determine the propensity for smoking only during their first trimester (MSDP-E) or throughout their entire pregnancy (MSDP-T). Propensity score analyses focused on the offspring (N=1616 female twins) of a large subset of these mothers. We examined the effects of levels of MSDP-E/T on offspring initial reactions to their first experiences with alcohol and cigarettes, across the distribution of liability for MSDP-E/T. MSDP-E/T emerged as significant predictors of offspring reactions to alcohol and cigarettes, but the effects were confounded by the familial liability for MSDP. Further, the unique MSDP effects that emerged were not uniform across the MSDP familial risk distribution. Our findings underscore the importance of properly accounting for correlated familial risk factors when examining the effects of MSDP on substance related outcomes. PMID:27098899
Divers, Jasmin; Hugenschmidt, Christina; Sink, Kaycee M; Williamson, Jeffrey D; Ge, Yaorong; Smith, S Carrie; Bowden, Donald W; Whitlow, Christopher T; Lyders, Eric; Maldjian, Joseph A; Freedman, Barry I
2013-10-01
Previous studies involving inner city populations detected higher cerebral white matter hyperintensity (WMH) scores in African Americans (AAs) compared with European Americans (EAs). This finding might be attributable to the higher prevalence of cardiovascular disease (CVD) risk factors and poorer access to healthcare in AAs. Despite racial differences in CVD risk factor profiles, AAs have paradoxically lower levels of subclinical CVD. We hypothesized that AAs with diabetes and good access to healthcare would have comparable or lower levels of WMH as EAs. Racial differences in the distribution of WMH were analyzed in 46 AAs and 156 EAs with type 2 diabetes enrolled in the Diabetes Heart Study (DHS)-Mind, and replicated in a sample of 113 AAs and 61 EAs patients who had clinically indicated cerebral magnetic resonance imaging. Wilcoxon 2-sample tests and linear models were used to compare the distribution of WMH in AAs and EAs and to test for association between WMH and race. The unadjusted mean WMH score from the Diabetes Heart Study-Mind was 1.9 in AAs and 2.3 in EAs (P = .3244). Among those with clinically indicated magnetic resonance imaging, the mean WMH score was 2.9 in AAs and 3.9 in EAs (P = .0503). Adjustment for age and sex produced no statistically significant differences in WMH score between AAs and EAs. These independent datasets reveal comparable WMH scores in AAs and EAs, suggesting that disparities in access to healthcare and environmental exposures likely underlie the previously reported excess burden of WMH in AAs. Copyright © 2013 National Stroke Association. Published by Elsevier Inc. All rights reserved.
Labad, Javier; Martorell, Lourdes; Gaviria, Ana; Bayón, Carmen; Vilella, Elisabet; Cloninger, C. Robert
2015-01-01
Objectives. The psychometric properties regarding sex and age for the revised version of the Temperament and Character Inventory (TCI-R) and its derived short version, the Temperament and Character Inventory (TCI-140), were evaluated with a randomized sample from the community. Methods. A randomized sample of 367 normal adult subjects from a Spanish municipality, who were representative of the general population based on sex and age, participated in the current study. Descriptive statistics and internal consistency according to α coefficient were obtained for all of the dimensions and facets. T-tests and univariate analyses of variance, followed by Bonferroni tests, were conducted to compare the distributions of the TCI-R dimension scores by age and sex. Results. On both the TCI-R and TCI-140, women had higher scores for Harm Avoidance, Reward Dependence and Cooperativeness than men, whereas men had higher scores for Persistence. Age correlated negatively with Novelty Seeking, Reward Dependence and Cooperativeness and positively with Harm Avoidance and Self-transcendence. Young subjects between 18 and 35 years had higher scores than older subjects in NS and RD. Subjects between 51 and 77 years scored higher in both HA and ST. The alphas for the dimensions were between 0.74 and 0.87 for the TCI-R and between 0.63 and 0.83 for the TCI-140. Conclusion. Results, which were obtained with a randomized sample, suggest that there are specific distributions of personality traits by sex and age. Overall, both the TCI-R and the abbreviated TCI-140 were reliable in the ‘good-to-excellent’ range. A strength of the current study is the representativeness of the sample. PMID:26713237
el Galta, Rachid; Uitte de Willige, Shirley; de Visser, Marieke C H; Helmer, Quinta; Hsu, Li; Houwing-Duistermaat, Jeanine J
2007-09-24
In this paper, we propose a one degree of freedom test for association between a candidate gene and a binary trait. This method is a generalization of Terwilliger's likelihood ratio statistic and is especially powerful for the situation of one associated haplotype. As an alternative to the likelihood ratio statistic, we derive a score statistic, which has a tractable expression. For haplotype analysis, we assume that phase is known. By means of a simulation study, we compare the performance of the score statistic to Pearson's chi-square statistic and the likelihood ratio statistic proposed by Terwilliger. We illustrate the method on three candidate genes studied in the Leiden Thrombophilia Study. We conclude that the statistic follows a chi square distribution under the null hypothesis and that the score statistic is more powerful than Terwilliger's likelihood ratio statistic when the associated haplotype has frequency between 0.1 and 0.4 and has a small impact on the studied disorder. With regard to Pearson's chi-square statistic, the score statistic has more power when the associated haplotype has frequency above 0.2 and the number of variants is above five.
Rank-Based Inference without Symmetric Errors.
1982-06-01
a rank test statistic for testing H : 8=0. The distributional properties0 of S+ were studied in great detail by Hajek and Sidak (1967). The test...fn (x)dx, where F(x) is the integral of f (X). On the other hand, Schuster (1974) and Ahmad (1976) studied ff n(x)dFn(x), where Fn (x) is the empirical...the results cited in the previous sections. In the case of Wilcoxon scores, Aubuchon (1982) proved consistency of y and studied its behavior. Further
Shi, Xiaohu; Zhang, Jingfen; He, Zhiquan; Shang, Yi; Xu, Dong
2011-09-01
One of the major challenges in protein tertiary structure prediction is structure quality assessment. In many cases, protein structure prediction tools generate good structural models, but fail to select the best models from a huge number of candidates as the final output. In this study, we developed a sampling-based machine-learning method to rank protein structural models by integrating multiple scores and features. First, features such as predicted secondary structure, solvent accessibility and residue-residue contact information are integrated by two Radial Basis Function (RBF) models trained from different datasets. Then, the two RBF scores and five selected scoring functions developed by others, i.e., Opus-CA, Opus-PSP, DFIRE, RAPDF, and Cheng Score are synthesized by a sampling method. At last, another integrated RBF model ranks the structural models according to the features of sampling distribution. We tested the proposed method by using two different datasets, including the CASP server prediction models of all CASP8 targets and a set of models generated by our in-house software MUFOLD. The test result shows that our method outperforms any individual scoring function on both best model selection, and overall correlation between the predicted ranking and the actual ranking of structural quality.
A Supervised Learning Process to Validate Online Disease Reports for Use in Predictive Models.
Patching, Helena M M; Hudson, Laurence M; Cooke, Warrick; Garcia, Andres J; Hay, Simon I; Roberts, Mark; Moyes, Catherine L
2015-12-01
Pathogen distribution models that predict spatial variation in disease occurrence require data from a large number of geographic locations to generate disease risk maps. Traditionally, this process has used data from public health reporting systems; however, using online reports of new infections could speed up the process dramatically. Data from both public health systems and online sources must be validated before they can be used, but no mechanisms exist to validate data from online media reports. We have developed a supervised learning process to validate geolocated disease outbreak data in a timely manner. The process uses three input features, the data source and two metrics derived from the location of each disease occurrence. The location of disease occurrence provides information on the probability of disease occurrence at that location based on environmental and socioeconomic factors and the distance within or outside the current known disease extent. The process also uses validation scores, generated by disease experts who review a subset of the data, to build a training data set. The aim of the supervised learning process is to generate validation scores that can be used as weights going into the pathogen distribution model. After analyzing the three input features and testing the performance of alternative processes, we selected a cascade of ensembles comprising logistic regressors. Parameter values for the training data subset size, number of predictors, and number of layers in the cascade were tested before the process was deployed. The final configuration was tested using data for two contrasting diseases (dengue and cholera), and 66%-79% of data points were assigned a validation score. The remaining data points are scored by the experts, and the results inform the training data set for the next set of predictors, as well as going to the pathogen distribution model. The new supervised learning process has been implemented within our live site and is being used to validate the data that our system uses to produce updated predictive disease maps on a weekly basis.
Ravi, Deepthi; Prabhu, S Smitha; Rao, Raghavendra; Balachandran, C; Bairy, Indira
2017-01-01
Background: Pemphigus is an acquired immunobullous disorder in which antibodies are directed against epidermal cadherins. Despite the commercial availability and less cost of enzyme-linked immunosorbent assays (ELISAs) to detect antidesmoglein 1 (Dsg1) and anti-Dsg3, immunofluorescence is still widely used for confirmation of diagnosis. Aims: (1) To compare the usefulness of indirect immunofluorescence (IIF) and ELISA tests in the diagnosis of pemphigus. (2) To find the clinical correlation between the tests and severity of the disease. Materials and Methods: Sixty-one patients (27 women and 34 men, age distribution from 20 to 75) were clinically diagnosed as pemphigus (pemphigus foliaceus - 11, pemphigus vulgaris - 50) and were recruited for the study. IIF and Dsg ELISA were performed and the findings were compared with each other and with the pemphigus area activity score. Data were entered in SPSS and were analyzed using Kruskal–Wallis test. Results: There was a moderate positive correlation between the cutaneous score and Dsg1 titer, and mucosal score and Dsg3 titer. The titer of IIF showed statistically significant positive correlation with the cutaneous score but not the mucosal score. Dsg ELISA showed higher sensitivity (90.2%) than IIF (75.4%) in the diagnosis of pemphigus. Conclusions: Dsg ELISA is a more sensitive method than IIF and shows more correlation with the disease severity. PMID:28400637
Healthcare teams as complex adaptive systems: Focus on interpersonal interaction.
Pype, Peter; Krystallidou, Demi; Deveugele, Myriam; Mertens, Fien; Rubinelli, Sara; Devisch, Ignaas
2017-11-01
The aim of this study is to test the feasibility of a tool to objectify the functioning of healthcare teams operating in the complexity zone, and to evaluate its usefulness in identifying areas for team quality improvement. We distributed The Complex Adaptive Leadership (CAL™) Organisational Capability Questionnaire (OCQ) to all members of one palliative care team (n=15) and to palliative care physicians in Flanders, Belgium (n=15). Group discussions were held on feasibility aspects and on the low scoring topics. Data was analysed calculating descriptive statistics (sum score, mean and standard deviation). The one sample T-Test was used to detect differences within each group. Both groups of participants reached mean scores ranging from good to excellent. The one sample T test showed statistically significant differences between participants' sum scores within each group (p<0,001). Group discussion led to suggestions for quality improvement e.g. enhanced feedback strategies between team members. The questionnaire used in our study shows to be a feasible and useful instrument for the evaluation of the palliative care teams' day-to-day operations and to identify areas for quality improvement. The CAL™OCQ is a promising instrument to evaluate any healthcare team functioning. A group discussion on the questionnaire scores can serve as a starting point to identify targets for quality improvement initiatives. Copyright © 2017 Elsevier B.V. All rights reserved.
2017-04-06
Pressure Level (SPL) background pink noise. The speech intelligibility tests shall result in a Modified Rhyme Test (MRT) score as listed below...Speech intelligibility testing shall be measured per ANSI S3.2 for each background pink noise level using a minimum of ten talkers and of ten...listeners. The test shall be conducted wearing the JSAM-TA using appropriate communication 6 DISTRIBUTION STATEMENT A: Approved for public release
Thandassery, Ragesh B; Al Kaabi, Saad; Soofi, Madiha E; Mohiuddin, Syed A; John, Anil K; Al Mohannadi, Muneera; Al Ejji, Khalid; Yakoob, Rafie; Derbala, Moutaz F; Wani, Hamidullah; Sharma, Manik; Al Dweik, Nazeeh; Butt, Mohammed T; Kamel, Yasser M; Sultan, Khaleel; Pasic, Fuad; Singh, Rajvir
2016-07-01
Many indirect noninvasive scores to predict liver fibrosis are calculated from routine blood investigations. Only limited studies have compared their efficacy head to head. We aimed to compare these scores with liver biopsy fibrosis stages in patients with chronic hepatitis C. From blood investigations of 1602 patients with chronic hepatitis C who underwent a liver biopsy before initiation of antiviral treatment, 19 simple noninvasive scores were calculated. The area under the receiver operating characteristic curves and diagnostic accuracy of each of these scores were calculated (with reference to the Scheuer staging) and compared. The mean age of the patients was 41.8±9.6 years (1365 men). The most common genotype was genotype 4 (65.6%). Significant fibrosis, advanced fibrosis, and cirrhosis were seen in 65.1%, 25.6, and 6.6% of patients, respectively. All the scores except the aspartate transaminase (AST) alanine transaminase ratio, Pohl score, mean platelet volume, fibro-alpha, and red cell distribution width to platelet count ratio index showed high predictive accuracy for the stages of fibrosis. King's score (cutoff, 17.5) showed the highest predictive accuracy for significant and advanced fibrosis. King's score, Göteborg university cirrhosis index, APRI (the AST/platelet count ratio index), and Fibrosis-4 (FIB-4) had the highest predictive accuracy for cirrhosis, with the APRI (cutoff, 2) and FIB-4 (cutoff, 3.25) showing the highest diagnostic accuracy.We derived the study score 8.5 - 0.2(albumin, g/dL) +0.01(AST, IU/L) -0.02(platelet count, 10/L), which at a cutoff of >4.7 had a predictive accuracy of 0.868 (95% confidence interval, 0.833-0.904) for cirrhosis. King's score for significant and advanced fibrosis and the APRI or FIB-4 score for cirrhosis could be the best simple indirect noninvasive scores.
Tomitaka, Shinichiro; Kawasaki, Yohei; Ide, Kazuki; Akutagawa, Maiko; Yamada, Hiroshi; Furukawa, Toshiaki A; Ono, Yutaka
2016-01-01
Previously, we proposed a model for ordinal scale scoring in which individual thresholds for each item constitute a distribution by each item. This lead us to hypothesize that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores follow a common mathematical model, which is expressed as the product of the frequency of the total depressive symptom scores and the probability of the cumulative distribution function of each item threshold. To verify this hypothesis, we investigated the boundary curves of the distribution of total depressive symptom scores in a general population. Data collected from 21,040 subjects who had completed the Center for Epidemiologic Studies Depression Scale (CES-D) questionnaire as part of a national Japanese survey were analyzed. The CES-D consists of 20 items (16 negative items and four positive items). The boundary curves of adjacent item scores in the distribution of total depressive symptom scores for the 16 negative items were analyzed using log-normal scales and curve fitting. The boundary curves of adjacent item scores for a given symptom approximated a common linear pattern on a log normal scale. Curve fitting showed that an exponential fit had a markedly higher coefficient of determination than either linear or quadratic fits. With negative affect items, the gap between the total score curve and boundary curve continuously increased with increasing total depressive symptom scores on a log-normal scale, whereas the boundary curves of positive affect items, which are not considered manifest variables of the latent trait, did not exhibit such increases in this gap. The results of the present study support the hypothesis that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores commonly follow the predicted mathematical model, which was verified to approximate an exponential mathematical pattern.
Kawasaki, Yohei; Akutagawa, Maiko; Yamada, Hiroshi; Furukawa, Toshiaki A.; Ono, Yutaka
2016-01-01
Background Previously, we proposed a model for ordinal scale scoring in which individual thresholds for each item constitute a distribution by each item. This lead us to hypothesize that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores follow a common mathematical model, which is expressed as the product of the frequency of the total depressive symptom scores and the probability of the cumulative distribution function of each item threshold. To verify this hypothesis, we investigated the boundary curves of the distribution of total depressive symptom scores in a general population. Methods Data collected from 21,040 subjects who had completed the Center for Epidemiologic Studies Depression Scale (CES-D) questionnaire as part of a national Japanese survey were analyzed. The CES-D consists of 20 items (16 negative items and four positive items). The boundary curves of adjacent item scores in the distribution of total depressive symptom scores for the 16 negative items were analyzed using log-normal scales and curve fitting. Results The boundary curves of adjacent item scores for a given symptom approximated a common linear pattern on a log normal scale. Curve fitting showed that an exponential fit had a markedly higher coefficient of determination than either linear or quadratic fits. With negative affect items, the gap between the total score curve and boundary curve continuously increased with increasing total depressive symptom scores on a log-normal scale, whereas the boundary curves of positive affect items, which are not considered manifest variables of the latent trait, did not exhibit such increases in this gap. Discussion The results of the present study support the hypothesis that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores commonly follow the predicted mathematical model, which was verified to approximate an exponential mathematical pattern. PMID:27761346
A critique of the use of indicator-species scores for identifying thresholds in species responses
Cuffney, Thomas F.; Qian, Song S.
2013-01-01
Identification of ecological thresholds is important both for theoretical and applied ecology. Recently, Baker and King (2010, King and Baker 2010) proposed a method, threshold indicator analysis (TITAN), to calculate species and community thresholds based on indicator species scores adapted from Dufrêne and Legendre (1997). We tested the ability of TITAN to detect thresholds using models with (broken-stick, disjointed broken-stick, dose-response, step-function, Gaussian) and without (linear) definitive thresholds. TITAN accurately and consistently detected thresholds in step-function models, but not in models characterized by abrupt changes in response slopes or response direction. Threshold detection in TITAN was very sensitive to the distribution of 0 values, which caused TITAN to identify thresholds associated with relatively small differences in the distribution of 0 values while ignoring thresholds associated with large changes in abundance. Threshold identification and tests of statistical significance were based on the same data permutations resulting in inflated estimates of statistical significance. Application of bootstrapping to the split-point problem that underlies TITAN led to underestimates of the confidence intervals of thresholds. Bias in the derivation of the z-scores used to identify TITAN thresholds and skewedness in the distribution of data along the gradient produced TITAN thresholds that were much more similar than the actual thresholds. This tendency may account for the synchronicity of thresholds reported in TITAN analyses. The thresholds identified by TITAN represented disparate characteristics of species responses that, when coupled with the inability of TITAN to identify thresholds accurately and consistently, does not support the aggregation of individual species thresholds into a community threshold.
Brennan, Peter A; Croke, David T; Reed, Malcolm; Smith, Lee; Munro, Euan; Foulkes, John; Arnett, Richard
2016-01-01
Objective structured clinical examinations (OSCE) are widely used for summative assessment in surgery. Despite standardizing these as much as possible, variation, including examiner scoring, can occur which may affect reliability. In study of a high-stakes UK postgraduate surgical OSCE, we investigated whether examiners changing stations once during a long examining day affected marking, reliability, and overall candidates' scores compared with examiners who examined the same scenario all day. An observational study of 18,262 examiner-candidate interactions from the UK Membership of the Royal College of Surgeons examination was carried at 3 Surgical Colleges across the United Kingdom. Scores between examiners were compared using analysis of variance. Examination reliability was assessed with Cronbach's alpha, and the comparative distribution of total candidates' scores for each day was evaluated using t-tests of unit-weighted z scores. A significant difference was found in absolute scores differences awarded in the morning and afternoon sessions between examiners who changed stations at lunchtime and those who did not (p < 0.001). No significant differences were found for the main effects of either broad content area (p = 0.290) or station content area (p = 0.450). The reliability of each day was not affected by examiner switching (p = 0.280). Overall, no difference was found in z-score distribution of total candidate scores and categories of examiner switching. This large study has found that although the range of marks awarded varied when examiners change OSCE stations, examination reliability and the likely candidate outcome were not affected. These results may have implications for examination design and examiner experience in surgical OSCEs and beyond. Copyright © 2016 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
Tomitaka, Shinichiro; Kawasaki, Yohei; Ide, Kazuki; Yamada, Hiroshi; Miyake, Hirotsugu; Furukawa, Toshiaki A; Furukaw, Toshiaki A
2016-01-01
In a previous study, we reported that the distribution of total depressive symptoms scores according to the Center for Epidemiologic Studies Depression Scale (CES-D) in a general population is stable throughout middle adulthood and follows an exponential pattern except for at the lowest end of the symptom score. Furthermore, the individual distributions of 16 negative symptom items of the CES-D exhibit a common mathematical pattern. To confirm the reproducibility of these findings, we investigated the distribution of total depressive symptoms scores and 16 negative symptom items in a sample of Japanese employees. We analyzed 7624 employees aged 20-59 years who had participated in the Northern Japan Occupational Health Promotion Centers Collaboration Study for Mental Health. Depressive symptoms were assessed using the CES-D. The CES-D contains 20 items, each of which is scored in four grades: "rarely," "some," "much," and "most of the time." The descriptive statistics and frequency curves of the distributions were then compared according to age group. The distribution of total depressive symptoms scores appeared to be stable from 30-59 years. The right tail of the distribution for ages 30-59 years exhibited a linear pattern with a log-normal scale. The distributions of the 16 individual negative symptom items of the CES-D exhibited a common mathematical pattern which displayed different distributions with a boundary at "some." The distributions of the 16 negative symptom items from "some" to "most" followed a linear pattern with a log-normal scale. The distributions of the total depressive symptoms scores and individual negative symptom items in a Japanese occupational setting show the same patterns as those observed in a general population. These results show that the specific mathematical patterns of the distributions of total depressive symptoms scores and individual negative symptom items can be reproduced in an occupational population.
Guenther, Patricia M; Kirkpatrick, Sharon I; Reedy, Jill; Krebs-Smith, Susan M; Buckman, Dennis W; Dodd, Kevin W; Casavale, Kellie O; Carroll, Raymond J
2014-03-01
The Healthy Eating Index (HEI), a measure of diet quality, was updated to reflect the 2010 Dietary Guidelines for Americans and the accompanying USDA Food Patterns. To assess the validity and reliability of the HEI-2010, exemplary menus were scored and 2 24-h dietary recalls from individuals aged ≥2 y from the 2003-2004 NHANES were used to estimate multivariate usual intake distributions and assess whether the HEI-2010 1) has a distribution wide enough to detect meaningful differences in diet quality among individuals, 2) distinguishes between groups with known differences in diet quality by using t tests, 3) measures diet quality independently of energy intake by using Pearson correlation coefficients, 4) has >1 underlying dimension by using principal components analysis (PCA), and 5) is internally consistent by calculating Cronbach's coefficient α. HEI-2010 scores were at or near the maximum levels for the exemplary menus. The distribution of scores among the population was wide (5th percentile = 31.7; 95th percentile = 70.4). As predicted, men's diet quality (mean HEI-2010 total score = 49.8) was poorer than women's (52.7), younger adults' diet quality (45.4) was poorer than older adults' (56.1), and smokers' diet quality (45.7) was poorer than nonsmokers' (53.3) (P < 0.01). Low correlations with energy were observed for HEI-2010 total and component scores (|r| ≤ 0.21). Cronbach's coefficient α was 0.68, supporting the reliability of the HEI-2010 total score as an indicator of overall diet quality. Nonetheless, PCA indicated multiple underlying dimensions, highlighting the fact that the component scores are equally as important as the total. A comparable reevaluation of the HEI-2005 yielded similar results. This study supports the validity and the reliability of both versions of the HEI.
Verification of forecast ensembles in complex terrain including observation uncertainty
NASA Astrophysics Data System (ADS)
Dorninger, Manfred; Kloiber, Simon
2017-04-01
Traditionally, verification means to verify a forecast (ensemble) with the truth represented by observations. The observation errors are quite often neglected arguing that they are small when compared to the forecast error. In this study as part of the MesoVICT (Mesoscale Verification Inter-comparison over Complex Terrain) project it will be shown, that observation errors have to be taken into account for verification purposes. The observation uncertainty is estimated from the VERA (Vienna Enhanced Resolution Analysis) and represented via two analysis ensembles which are compared to the forecast ensemble. For the whole study results from COSMO-LEPS provided by Arpae-SIMC Emilia-Romagna are used as forecast ensemble. The time period covers the MesoVICT core case from 20-22 June 2007. In a first step, all ensembles are investigated concerning their distribution. Several tests have been executed (Kolmogorov-Smirnov-Test, Finkelstein-Schafer Test, Chi-Square Test etc.) showing no exact mathematical distribution. So the main focus is on non-parametric statistics (e.g. Kernel density estimation, Boxplots etc.) and also the deviation between "forced" normal distributed data and the kernel density estimations. In a next step the observational deviations due to the analysis ensembles are analysed. In a first approach scores are multiple times calculated with every single ensemble member from the analysis ensemble regarded as "true" observation. The results are presented as boxplots for the different scores and parameters. Additionally, the bootstrapping method is also applied to the ensembles. These possible approaches to incorporating observational uncertainty into the computation of statistics will be discussed in the talk.
Tuli, Sanjeev Y; Thompson, Lindsay A; Saliba, Heidi; Black, Erik W; Ryan, Kathleen A; Kelly, Maria N; Novak, Maureen; Mellott, Jane; Tuli, Sonal S
2011-12-01
Board certification is an important professional qualification and a prerequisite for credentialing, and the Accreditation Council for Graduate Medical Education (ACGME) assesses board certification rates as a component of residency program effectiveness. To date, research has shown that preresidency measures, including National Board of Medical Examiners scores, Alpha Omega Alpha Honor Medical Society membership, or medical school grades poorly predict postresidency board examination scores. However, learning styles and temperament have been identified as factors that 5 affect test-taking performance. The purpose of this study is to characterize the learning styles and temperaments of pediatric residents and to evaluate their relationships to yearly in-service and postresidency board examination scores. This cross-sectional study analyzed the learning styles and temperaments of current and past pediatric residents by administration of 3 validated tools: the Kolb Learning Style Inventory, the Keirsey Temperament Sorter, and the Felder-Silverman Learning Style test. These results were compared with known, normative, general and medical population data and evaluated for correlation to in-service examination and postresidency board examination scores. The predominant learning style for pediatric residents was converging 44% (33 of 75 residents) and the predominant temperament was guardian 61% (34 of 56 residents). The learning style and temperament distribution of the residents was significantly different from published population data (P = .002 and .04, respectively). Learning styles, with one exception, were found to be unrelated to standardized test scores. The predominant learning style and temperament of pediatric residents is significantly different than that of the populations of general and medical trainees. However, learning styles and temperament do not predict outcomes on standardized in-service and board examinations in pediatric residents.
Tuli, Sanjeev Y.; Thompson, Lindsay A.; Saliba, Heidi; Black, Erik W.; Ryan, Kathleen A.; Kelly, Maria N.; Novak, Maureen; Mellott, Jane; Tuli, Sonal S.
2011-01-01
Background Board certification is an important professional qualification and a prerequisite for credentialing, and the Accreditation Council for Graduate Medical Education (ACGME) assesses board certification rates as a component of residency program effectiveness. To date, research has shown that preresidency measures, including National Board of Medical Examiners scores, Alpha Omega Alpha Honor Medical Society membership, or medical school grades poorly predict postresidency board examination scores. However, learning styles and temperament have been identified as factors that 5 affect test-taking performance. The purpose of this study is to characterize the learning styles and temperaments of pediatric residents and to evaluate their relationships to yearly in-service and postresidency board examination scores. Methods This cross-sectional study analyzed the learning styles and temperaments of current and past pediatric residents by administration of 3 validated tools: the Kolb Learning Style Inventory, the Keirsey Temperament Sorter, and the Felder-Silverman Learning Style test. These results were compared with known, normative, general and medical population data and evaluated for correlation to in-service examination and postresidency board examination scores. Results The predominant learning style for pediatric residents was converging 44% (33 of 75 residents) and the predominant temperament was guardian 61% (34 of 56 residents). The learning style and temperament distribution of the residents was significantly different from published population data (P = .002 and .04, respectively). Learning styles, with one exception, were found to be unrelated to standardized test scores. Conclusions The predominant learning style and temperament of pediatric residents is significantly different than that of the populations of general and medical trainees. However, learning styles and temperament do not predict outcomes on standardized in-service and board examinations in pediatric residents. PMID:23205211
Length of internship influences performance on medical residency exam.
Santos, Itamar de Souza; Vieira, Joaquim Edson; Nunes, Maria do Patrocínio Tenório
2009-01-01
Medical education encompasses globally diverse context and conditions. The Brazilian scenario seemed a natural environment to study the influence of medical education programs and internship duration on the entrance exam for medical residency. This investigation evaluates some methods used during the entrance exam for medical residency as a means to make a distinction between candidates with longer clerkships. Candidates selected for a residency program performed a multiple-choice (MC), an open question (OQ) and OSCE-like tests, an interview and a curriculum analysis for participation in scientific meetings, papers published and voluntary activities. Groups were compared for gender, year of graduation, tests and OSCE scores. Participants were distributed into two groups based on clerkship duration: 2 years or less than 2 years. There was no difference for the MCT score among groups or any of the activities from interview and curriculum analysis. The 2 years clerkship group showed significantly higher OQ (p=0.009) and OSCE-like affective (p=0.025) and knowledge (p=0.002) scores. The OSCE test identified some aspects related to competence acquisition and assessed basic skills and attitudes essential to the supervised practice of medicine during residency. OSCE discriminated aspects not perceived by the sole use of knowledge tests.
Score distributions of gapped multiple sequence alignments down to the low-probability tail
NASA Astrophysics Data System (ADS)
Fieth, Pascal; Hartmann, Alexander K.
2016-08-01
Assessing the significance of alignment scores of optimally aligned DNA or amino acid sequences can be achieved via the knowledge of the score distribution of random sequences. But this requires obtaining the distribution in the biologically relevant high-scoring region, where the probabilities are exponentially small. For gapless local alignments of infinitely long sequences this distribution is known analytically to follow a Gumbel distribution. Distributions for gapped local alignments and global alignments of finite lengths can only be obtained numerically. To obtain result for the small-probability region, specific statistical mechanics-based rare-event algorithms can be applied. In previous studies, this was achieved for pairwise alignments. They showed that, contrary to results from previous simple sampling studies, strong deviations from the Gumbel distribution occur in case of finite sequence lengths. Here we extend the studies to multiple sequence alignments with gaps, which are much more relevant for practical applications in molecular biology. We study the distributions of scores over a large range of the support, reaching probabilities as small as 10-160, for global and local (sum-of-pair scores) multiple alignments. We find that even after suitable rescaling, eliminating the sequence-length dependence, the distributions for multiple alignment differ from the pairwise alignment case. Furthermore, we also show that the previously discussed Gaussian correction to the Gumbel distribution needs to be refined, also for the case of pairwise alignments.
Berres, M; Kukull, W A; Miserez, A R; Monsch, A U; Monsell, S E; Spiegel, R
2014-01-01
The PGSA (Placebo Group Simulation Approach) aims at avoiding problems of sample representativeness and ethical issues typical of placebo-controlled secondary prevention trials with MCI patients. The PGSA uses mathematical modeling to forecast the distribution of quantified outcomes of MCI patient groups based on their own baseline data established at the outset of clinical trials. These forecasted distributions are then compared with the distribution of actual outcomes observed on candidate treatments, thus substituting for a concomitant placebo group. Here we investigate whether a PGSA algorithm that was developed from the MCI population of ADNI 1*, can reliably simulate the distribution of composite neuropsychological outcomes from a larger, independently selected MCI subject sample. Data available from the National Alzheimer's Coordinating Center (NACC) were used. We included 1523 patients with single or multiple domain amnestic mild cognitive impairment (aMCI) and at least two follow-ups after baseline. In order to strengthen the analysis and to verify whether there was a drift over time in the neuropsychological outcomes, the NACC subject sample was split into 3 subsamples of similar size. The previously described PGSA algorithm for the trajectory of a composite neuropsychological test battery (NTB) score was adapted to the test battery used in NACC. Nine demographic, clinical, biological and neuropsychological candidate predictors were included in a mixed model; this model and its error terms were used to simulate trajectories of the adapted NTB. The distributions of empirically observed and simulated data after 1, 2 and 3 years were very similar, with some over-estimation of decline in all 3 subgroups. The by far most important predictor of the NTB trajectories is the baseline NTB score. Other significant predictors are the MMSE baseline score and the interactions of time with ApoE4 and FAQ (functional abilities). These are essentially the same predictors as determined for the original NTB score. An algorithm comprising a small number of baseline variables, notably cognitive performance at baseline, forecasts the group trajectory of cognitive decline in subsequent years with high accuracy. The current analysis of 3 independent subgroups of aMCI patients from the NACC database supports the validity of the PGSA longitudinal algorithm for a NTB. Use of the PGSA in long-term secondary AD prevention trials deserves consideration.
Hammad, Shaza M.; El-Wassefy, Noha; Maher, Ahmed; Fawakerji, Shafik M.
2017-01-01
ABSTRACT Objective: To evaluate the effect of silica dioxide (SiO2) nanofillers in different bonding systems on shear bond strength (SBS) and mode of failure of orthodontic brackets at two experimental times. Methods: Ninety-six intact premolars were divided into four groups: A) Conventional acid-etch and primer Transbond XT; B) Transbond Plus self-etch primer; and two self-etch bonding systems reinforced with silica dioxide nanofiller at different concentrations: C) Futurabond DC at 1%; D) Optibond All-in-One at 7%. Each group was allocated into two subgroups (n = 12) according to experimental time (12 and 24 hours). SBS test was performed using a universal testing machine. ARI scores were determined under a stereomicroscope. Scanning electron microscopy (SEM) and transmission electron microscopy (TEM) were used to determine the size and distribution of nanofillers. One-way ANOVA was used to compare SBS followed by the post-hoc Tukey test. The chi-square test was used to evaluate ARI scores. Results: Mean SBS of Futurabond DC and Optibond All-in-One were significantly lower than conventional system, and there were no significant differences between means SBS obtained with all self-etch bonding systems used in the study. Lower ARI scores were found for Futurabond DC and Optibond All-in-One. There was no significant difference of SBS and ARI obtained at either time points for all bonding systems. Relative homogeneous distribution of the fillers was observed with the bonding systems. Conclusion: Two nanofilled systems revealed the lowest bond strengths, but still clinically acceptable and less adhesive was left on enamel. It is advisable not to load the brackets immediately to the maximum. PMID:28444018
NASA Astrophysics Data System (ADS)
Sarti, E.; Zamuner, S.; Cossio, P.; Laio, A.; Seno, F.; Trovato, A.
2013-12-01
In protein structure prediction it is of crucial importance, especially at the refinement stage, to score efficiently large sets of models by selecting the ones that are closest to the native state. We here present a new computational tool, BACHSCORE, that allows its users to rank different structural models of the same protein according to their quality, evaluated by using the BACH++ (Bayesian Analysis Conformation Hunt) scoring function. The original BACH statistical potential was already shown to discriminate with very good reliability the protein native state in large sets of misfolded models of the same protein. BACH++ features a novel upgrade in the solvation potential of the scoring function, now computed by adapting the LCPO (Linear Combination of Pairwise Orbitals) algorithm. This change further enhances the already good performance of the scoring function. BACHSCORE can be accessed directly through the web server: bachserver.pd.infn.it. Catalogue identifier: AEQD_v1_0 Program summary URL:http://cpc.cs.qub.ac.uk/summaries/AEQD_v1_0.html Program obtainable from: CPC Program Library, Queen’s University, Belfast, N. Ireland Licensing provisions: GNU General Public License version 3 No. of lines in distributed program, including test data, etc.: 130159 No. of bytes in distributed program, including test data, etc.: 24 687 455 Distribution format: tar.gz Programming language: C++. Computer: Any computer capable of running an executable produced by a g++ compiler (4.6.3 version). Operating system: Linux, Unix OS-es. RAM: 1 073 741 824 bytes Classification: 3. Nature of problem: Evaluate the quality of a protein structural model, taking into account the possible “a priori” knowledge of a reference primary sequence that may be different from the amino-acid sequence of the model; the native protein structure should be recognized as the best model. Solution method: The contact potential scores the occurrence of any given type of residue pair in 5 possible contact classes (α-helical contact, parallel β-sheet contact, anti-parallel β-sheet contact, side-chain contact, no contact). The solvation potential scores the occurrence of any residue type in 2 possible environments: buried and solvent exposed. Residue environment is assigned by adapting the LCPO algorithm. Residues present in the reference primary sequence and not present in the model structure contribute to the model score as solvent exposed and as non contacting all other residues. Restrictions: Input format file according to the Protein Data Bank standard Additional comments: Parameter values used in the scoring function can be found in the file /folder-to-bachscore/BACH/examples/bach_std.par. Running time: Roughly one minute to score one hundred structures on a desktop PC, depending on their size.
[Cognitive markers to discriminate between mild cognitive impairment and normal ageing].
Rodríguez Rodríguez, Nely; Juncos-Rabadán, Onésimo; Facal Mayo, David
2008-01-01
mild cognitive impairment (MCI) has been characterized as a transitional stage between normal ageing and dementia. The aim of the present study was to examine differences between normal ageing and MCI in the performance of several cognitive tests. These differences might serve as differential markers. we performed a longitudinal study (24 months) with two evaluations at 12-monthly intervals using the CAMCOG-R and a verbal learning test [test de aprendizaje verbal España-Complutense (TAVEC)]. The sample was composed of 25 persons aged more than 50 years old (five men and 20 women), distributed into two groups: the control group and the MCI group. To assign persons to either of the two groups, Petersen's MCI criteria were applied to Mini-Mental State Examination (MMSE) scores. repeated measures ANOVA (2 groups x 2 assessment) showed significant differences between the MCI and control group in the CAMCOG-R scores in orientation, language, memory, abstract thinking, executive function and global score and in the TAVEC scores for immediate recall and short- and long-term free and clued recall. No significant differences were found between the first and second assessment or in the interaction group assessment. the results of the present study confirm that the CAMCOG-R and the TAVEC effectively discriminate between normal ageing and MCI and can be used complementarily.
Gros, Auriane; Manera, Valeria; Daumas, Anaïs; Guillemin, Sophie; Rouaud, Olivier; Martin, Martine Lemesle; Giroud, Maurice; Béjot, Yannick
2016-01-01
Objective: At present emotional experience and implicit emotion regulation (IER) abilities are mainly assessed though self-reports, which are subjected to several biases. The aim of the present studies was to validate the Clock’N test, a recently developed time estimation task employing emotional priming to assess implicitly emotional reactivity and IER. Methods: In Study 1, the Clock’N test was administered to 150 healthy participants with different age, laterality and gender, in order to ascertain whether these factors affected the test results. In phase 1 participant were asked to judge the duration of seven sounds. In phase 2, before judging the duration of the same sounds, participants were presented with short arousing video-clip used as emotional priming stimuli. Time warp was calculated as the difference in time estimation between phase 2 and phase 1, and used to assess how emotions affected subjective time estimations. In study 2, a representative sample was selected to provide normative scores to be employed to assess emotional reactivity (Score 1) and IER (Score 2), and to calculate statistical cutoffs, based on the 10th and 90th score distribution percentiles. Results: Converging with previous findings, the results of study 1 suggested that the Clock’N test can be employed to assess both emotional reactivity, as indexed by an initial time underestimation, and IER, as indexed by a progressive shift to time overestimation. No effects of gender, age and laterality were found. Conclusions: These results suggest that the Clock’N test is adapted to assess emotional reactivity and IER. After collection of data on the test discriminant and convergent validity, this test may be employed to assess deficits in these abilities in different clinical populations. PMID:26903825
Bowker, Matthew A.; Maestre, Fernando T.
2012-01-01
Dryland vegetation is inherently patchy. This patchiness goes on to impact ecology, hydrology, and biogeochemistry. Recently, researchers have proposed that dryland vegetation patch sizes follow a power law which is due to local plant facilitation. It is unknown what patch size distribution prevails when competition predominates over facilitation, or if such a pattern could be used to detect competition. We investigated this question in an alternative vegetation type, mosses and lichens of biological soil crusts, which exhibit a smaller scale patch-interpatch configuration. This micro-vegetation is characterized by competition for space. We proposed that multiplicative effects of genetics, environment and competition should result in a log-normal patch size distribution. When testing the prevalence of log-normal versus power law patch size distributions, we found that the log-normal was the better distribution in 53% of cases and a reasonable fit in 83%. In contrast, the power law was better in 39% of cases, and in 8% of instances both distributions fit equally well. We further hypothesized that the log-normal distribution parameters would be predictably influenced by competition strength. There was qualitative agreement between one of the distribution's parameters (μ) and a novel intransitive (lacking a 'best' competitor) competition index, suggesting that as intransitivity increases, patch sizes decrease. The correlation of μ with other competition indicators based on spatial segregation of species (the C-score) depended on aridity. In less arid sites, μ was negatively correlated with the C-score (suggesting smaller patches under stronger competition), while positive correlations (suggesting larger patches under stronger competition) were observed at more arid sites. We propose that this is due to an increasing prevalence of competition transitivity as aridity increases. These findings broaden the emerging theory surrounding dryland patch size distributions and, with refinement, may help us infer cryptic ecological processes from easily observed spatial patterns in the field.
Speaking of Salaries: What It Will Take to Get Qualified, Effective Teachers in All Communities
ERIC Educational Resources Information Center
Adamson, Frank; Darling-Hammond, Linda
2011-01-01
The fact that well-qualified teachers are inequitably distributed to students in the United States has received growing public attention. By every measure of qualifications--certification, subject matter background, pedagogical training, selectivity of college attended, test scores, or experience--less-qualified teachers tend to be found in…
The Relationship of Maternal and Infant Variables to School Readiness.
ERIC Educational Resources Information Center
Rubin, Rosalyn A.; And Others
A prospective longitudinal investigation related 76 maternal and infant variables to performance on the Metropolitan Readiness Tests (MRT) at age six. The 1,245 study subjects have been followed since birth. Their distribution on measures of intelligence and socioeconomic status is essentially normal. Subjects with high MRT scores were found to…
The Inverted Student Density and Test Scores.
ERIC Educational Resources Information Center
Boldt, Robert F.
The inverted density is one whose contour lines are spheroidal as in the normal distribution, but whose moments differ from those of the normal in that its conditional arrays are not homoscedastic, being quadratic functions of the values of the linear regression functions. It is also platykurtic, its measure of kurtosis ranging from that of the…
In Search of Good Teachers: Patterns of Teacher Quality in Two Mexican States
ERIC Educational Resources Information Center
Luschei, Thomas F.
2012-01-01
This study uses longitudinal data from Mexico's Carrera Magisterial teacher incentive program to identify teacher attributes that are positively associated with student test scores and to describe how teachers with these attributes are distributed across schools in two diverse Mexican states, Aguascalientes and Sonora. I find that teachers' scores…
ERIC Educational Resources Information Center
Lockwood, J. R.; Castellano, Katherine E.
2017-01-01
Student Growth Percentiles (SGPs) increasingly are being used in the United States for inferences about student achievement growth and educator effectiveness. Emerging research has indicated that SGPs estimated from observed test scores have large measurement errors. As such, little is known about "true" SGPs, which are defined in terms…
Gender Differences in Mathematical Achievement at the Norwegian Elementary-School Level.
ERIC Educational Resources Information Center
Manger, Terje
1995-01-01
The relationship between gender and mathematical achievement was investigated in 440 female and 480 male Norwegian third graders. Boys had higher test scores, but the effect size was small. Boys performed better in numeracy, mental arithmetic, and measurement problems. Marked gender differences were found at extreme tails of the distribution.…
Quality Differences of Higher Education and Its Determinants in a Less-Developed Country
ERIC Educational Resources Information Center
Sarmiento Espinel, Jaime Andrés; Silva Arias, Adriana Carolina; Van Gameren, Edwin
2015-01-01
Two key measures to determine the quality of higher education are the performance of students and the accreditation of a programme's quality. We analyse the difference in the distributions of the student's scores in a standardised test of economics knowledge between accredited and non-accredited undergraduate economics programmes in a…
Podczeck, Fridrun; Newton, J Michael; Fromme, Paul
2014-12-30
Flat, round tablets may have a breaking ("score") line. Pharmacopoeial tablet breaking load tests are diametral in their design, and industrially used breaking load testers often have automatic tablet feeding systems, which position the tablets between the loading platens of the machine with the breaking lines in random orientation to the applied load. The aim of this work was to ascertain the influence of the position of the breaking line in a diametral compression test using finite element methodology (FEM) and to compare the theoretical results with practical findings using commercially produced bevel-edged, scored tablets. Breaking line test positions at an angle of 0°, 22.5°, 45°, 67.5° and 90° relative to the loading plane were studied. FEM results obtained for fully elastic and elasto-plastic tablets were fairly similar, but they highlighted large differences in stress distributions depending on the position of the breaking line. The stress values at failure were predicted to be similar for tablets tested at an angle of 45° or above, whereas at lower test angles the predicted breaking loads were up to three times larger. The stress distributions suggested that not all breaking line angles would result in clean tensile failure. Practical results, however, did not confirm the differences in the predicted breaking loads, but they confirmed differences in the way tablets broke. The results suggest that it is not advisable to convert breaking loads obtained on scored tablets into tablet tensile strength values, and comparisons between different tablets or batches should carefully consider the orientation of the breaking line with respect to the loading plane, as the failure mechanisms appear to vary. Copyright © 2014 Elsevier B.V. All rights reserved.
Tariq, Nabia; Tayyab, Ali; Jaffery, Tara
2018-04-01
To measure mean empathy scores of Pakistani medical students and to explore any association of empathy scores with gender, medical school year and future career choice. Cross-sectional survey. Shifa College of Medicine, Shifa Tameer-e-Millat University, during the academic year 2015-2016. The student version of Jefferson Scale of Physician Empathy (JSPE) was distributed to the students electronically via the student portal. Response that were completed in full were included in the study. Descriptive statistics was used to analyse student demographic data. The student score on the JSPE was reported as the mean (out of 7) of each item. Independent samples t-test was employed to check the significant differences between genders. Empathy score with advancing year of study was investigated using ANOVA. ANOVA with post-hoc Tukey's test was used to study the relationship between career choice and empathy score. The response rate was 70.94%. The mean score was 4.51 ±0.69. Females obtained greater, but statistically insignificant (p=0.08) empathy score (4.58) as compared to the male students (4.45). No statistically significant difference was seen between scores on the survey across the five academic years (F=0.88, p=0.47). Students who selected medicine and allied as career choice showed a significantly higher empathy score than those who opted for surgery. The internal consistency reliability (Cronbach's alpha) was 0.78. There were low levels of empathy in Pakistani medical students. Students with interest in medicine and allied showed higher empathy scores compared to surgical or technical specialties. No association of empathy scores with gender and medical school year was observed.
Kawasaki, Yohei; Ide, Kazuki; Akutagawa, Maiko; Yamada, Hiroshi; Furukawa, Toshiaki A.; Ono, Yutaka
2016-01-01
Background Several studies have shown that total depressive symptom scores in the general population approximate an exponential pattern, except for the lower end of the distribution. The Center for Epidemiologic Studies Depression Scale (CES-D) consists of 20 items, each of which may take on four scores: “rarely,” “some,” “occasionally,” and “most of the time.” Recently, we reported that the item responses for 16 negative affect items commonly exhibit exponential patterns, except for the level of “rarely,” leading us to hypothesize that the item responses at the level of “rarely” may be related to the non-exponential pattern typical of the lower end of the distribution. To verify this hypothesis, we investigated how the item responses contribute to the distribution of the sum of the item scores. Methods Data collected from 21,040 subjects who had completed the CES-D questionnaire as part of a Japanese national survey were analyzed. To assess the item responses of negative affect items, we used a parameter r, which denotes the ratio of “rarely” to “some” in each item response. The distributions of the sum of negative affect items in various combinations were analyzed using log-normal scales and curve fitting. Results The sum of the item scores approximated an exponential pattern regardless of the combination of items, whereas, at the lower end of the distributions, there was a clear divergence between the actual data and the predicted exponential pattern. At the lower end of the distributions, the sum of the item scores with high values of r exhibited higher scores compared to those predicted from the exponential pattern, whereas the sum of the item scores with low values of r exhibited lower scores compared to those predicted. Conclusions The distributional pattern of the sum of the item scores could be predicted from the item responses of such items. PMID:27806132
The relationship between clinical and standardized tests for hand-arm vibration syndrome.
Poole, C J M; Mason, H; Harding, A-H
2016-06-01
Standardized laboratory tests are undertaken to assist the diagnosis and staging of hand-arm vibration syndrome (HAVS), but the strength of the relationship between the tests and clinical stages of HAVS is unknown. To assess the relationship between the results of thermal aesthesiometry (TA), vibrotactile (VT) thresholds and cold provocation (CP) tests with the modified Stockholm scales for HAVS and to determine whether the relationship is affected by finger skin temperature. Consecutive records of workers referred to a Tier 5 HAVS assessment centre from 2006 to 2015 were identified. The diagnosis and staging of cases was undertaken from the clinical information contained in the records. Cases with alternative or mixed diagnoses were excluded and staging performed according to the modified Stockholm scale without knowledge of the results of the standardized laboratory tests. A total of 279 cases of HAVS were analysed. Although there was a significant trend for sensorineural (SN) and vascular scores to increase with clinical stage (P < 0.01), there was no significant difference in scores between 2SN early and 2SN late or between 2SN late and 3SN. There was moderate correlation between the TA and VT scores and the clinical SN stages (r = 0.6). This correlation did not change when subjects were divided into those with a finger skin temperature <30 and >30°C. CP scores distributed bimodally and correlated poorly with clinical staging (r = 0.2). Standardized SN tests distinguish between the lower Stockholm stages, but not above 2SN early. This has implications for health surveillance and UK policy. © Crown copyright 2016.
Video quality pooling adaptive to perceptual distortion severity.
Park, Jincheol; Seshadrinathan, Kalpana; Lee, Sanghoon; Bovik, Alan Conrad
2013-02-01
It is generally recognized that severe video distortions that are transient in space and/or time have a large effect on overall perceived video quality. In order to understand this phenomena, we study the distribution of spatio-temporally local quality scores obtained from several video quality assessment (VQA) algorithms on videos suffering from compression and lossy transmission over communication channels. We propose a content adaptive spatial and temporal pooling strategy based on the observed distribution. Our method adaptively emphasizes "worst" scores along both the spatial and temporal dimensions of a video sequence and also considers the perceptual effect of large-area cohesive motion flow such as egomotion. We demonstrate the efficacy of the method by testing it using three different VQA algorithms on the LIVE Video Quality database and the EPFL-PoliMI video quality database.
The efficacy of commercially available veterinary diets recommended for dogs with atopic dermatitis.
Glos, Katharina; Linek, Monika; Loewenstein, Christine; Mayer, Ursula; Mueller, Ralf S
2008-10-01
The classical treatments for dogs with atopic dermatitis have traditionally been oral antipruritic drugs, allergen-specific immunotherapy and topical therapy. Fifty dogs with atopic dermatitis were included in this multicentred, double-blinded, randomized study to compare clinical response to an 8-week period of feeding one of three commercial veterinary foods marketed for dogs with atopic dermatitis (diets A-C) or a widely distributed supermarket food (diet D). Atopic dermatitis was diagnosed using Willemse's criteria and through the exclusion of differential diagnoses. Fourteen dogs were assigned to diet A and 12 dogs each to diet B, C or D. Flea and tick control using a monthly fipronil spot-on product was administered for a minimum of 4 weeks prior to inclusion in the study and during the study period. Evaluations were made monthly. These included lesion scores, using an established scoring system (canine atopic dermatitis extent and severity index, CADESI-03) and owner evaluation of pruritus level using a visual analogue scale. After 8 weeks on the new diets, there was a significant improvement in CADESI and pruritus scores with diet B (Wilcoxon test, P = 0.043 and paired t-test, P = 0.012, respectively), in pruritus scores with diet A (paired t-test, P = 0.019) and in CADESI scores with diet D (Wilcoxon test, P = 0.037). No significant changes were detected with diet C. Based on the results of this study, in addition to the conventional therapies, changing the diet of dogs with atopic dermatitis may be a useful adjunctive therapeutic measure.
CASPASE-12 and rheumatoid arthritis in African-Americans
Marshall, Laura; Obaidullah, Mohammad; Fuchs, Trista; Fineberg, Naomi S.; Brinkley, Garland; Mikuls, Ted R.; Bridges, S. Louis; Hermel, Evan
2014-01-01
CASPASE-12 (CASP12) has a down-regulatory function during infection, and thus may protect against inflammatory disease. We investigated the distribution of CASP12 alleles (#rs497116) in African-Americans (AA) with rheumatoid arthritis (RA). CASP12 alleles were genotyped in 953 RA patients and 342 controls. Statistical analyses comparing genotype groups were performed using Kruskal-Wallis non-parametric ANOVA with Mann-Whitney U tests and chi-square tests. There was no significant difference in the overall distribution of CASP12 genotypes within AA with RA, but CASP12 homozygous patients had lower baseline joint narrowing scores. CASP12 homozygosity appears to be a subtle protective factor for some aspects of RA in AA patients. PMID:24515649
Shin, Saemi; Moon, Hyung-Il; Lee, Kwon Seob; Hong, Mun Ki; Byeon, Sang-Hoon
2014-11-20
This study aimed to devise a method for prioritizing hazardous chemicals for further regulatory action. To accomplish this objective, we chose appropriate indicators and algorithms. Nine indicators from the Globally Harmonized System of Classification and Labeling of Chemicals were used to identify categories to which the authors assigned numerical scores. Exposure indicators included handling volume, distribution, and exposure level. To test the method devised by this study, sixty-two harmful substances controlled by the Occupational Safety and Health Act in Korea, including acrylamide, acrylonitrile, and styrene were ranked using this proposed method. The correlation coefficients between total score and each indicator ranged from 0.160 to 0.641, and those between total score and hazard indicators ranged from 0.603 to 0.641. The latter were higher than the correlation coefficients between total score and exposure indicators, which ranged from 0.160 to 0.421. Correlations between individual indicators were low (-0.240 to 0.376), except for those between handling volume and distribution (0.613), suggesting that each indicator was not strongly correlated. The low correlations between each indicator mean that the indicators and independent and were well chosen for prioritizing harmful chemicals. This method proposed by this study can improve the cost efficiency of chemical management as utilized in occupational regulatory systems.
Students' Midprogram Content Area Performance as a Predictor of End-of-Program NCLEX Readiness.
Brussow, Jennifer A; Dunham, Michelle
2017-12-22
Many programs have implemented end-of-program predictive testing to identify students at risk of NCLEX-RN failure. Unfortunately, for many students, end-of-program testing comes too late. Regression and relative importance analysis were used to explore relationships between 9 content area assessments and an end-of-program assessment shown to be predictive of NCLEX-RN success. Results indicate that scores on assessments for content areas such as medical surgical nursing and care of children are predictive of end-of-program test scores, suggesting that instructors should provide remediation at the first sign of lagging performance.This is an open-access article distributed under the terms of the Creative Commons Attribution-Non Commercial-No Derivatives License 4.0 (CCBY-NC-ND), where it is permissible to download and share the work provided it is properly cited. The work cannot be changed in anyway or used commercially without permission from the journal.
Evaluation of probabilistic forecasts with the scoringRules package
NASA Astrophysics Data System (ADS)
Jordan, Alexander; Krüger, Fabian; Lerch, Sebastian
2017-04-01
Over the last decades probabilistic forecasts in the form of predictive distributions have become popular in many scientific disciplines. With the proliferation of probabilistic models arises the need for decision-theoretically principled tools to evaluate the appropriateness of models and forecasts in a generalized way in order to better understand sources of prediction errors and to improve the models. Proper scoring rules are functions S(F,y) which evaluate the accuracy of a forecast distribution F , given that an outcome y was observed. In coherence with decision-theoretical principles they allow to compare alternative models, a crucial ability given the variety of theories, data sources and statistical specifications that is available in many situations. This contribution presents the software package scoringRules for the statistical programming language R, which provides functions to compute popular scoring rules such as the continuous ranked probability score for a variety of distributions F that come up in applied work. For univariate variables, two main classes are parametric distributions like normal, t, or gamma distributions, and distributions that are not known analytically, but are indirectly described through a sample of simulation draws. For example, ensemble weather forecasts take this form. The scoringRules package aims to be a convenient dictionary-like reference for computing scoring rules. We offer state of the art implementations of several known (but not routinely applied) formulas, and implement closed-form expressions that were previously unavailable. Whenever more than one implementation variant exists, we offer statistically principled default choices. Recent developments include the addition of scoring rules to evaluate multivariate forecast distributions. The use of the scoringRules package is illustrated in an example on post-processing ensemble forecasts of temperature.
Dexter, Franklin; Ledolter, Johannes; Hindman, Bradley J
2017-06-01
Our department monitors the quality of anesthesiologists' clinical supervision and provides each anesthesiologist with periodic feedback. We hypothesized that greater differentiation among anesthesiologists' supervision scores could be obtained by adjusting for leniency of the rating resident. From July 1, 2013 to December 31, 2015, our department has utilized the de Oliveira Filho unidimensional nine-item supervision scale to assess the quality of clinical supervision provided by faculty as rated by residents. We examined all 13,664 ratings of the 97 anesthesiologists (ratees) by the 65 residents (raters). Testing for internal consistency among answers to questions (large Cronbach's alpha > 0.90) was performed to rule out that one or two questions accounted for leniency. Mixed-effects logistic regression was used to compare ratees while controlling for rater leniency vs using Student t tests without rater leniency. The mean supervision scale score was calculated for each combination of the 65 raters and nine questions. The Cronbach's alpha was very large (0.977). The mean score was calculated for each of the 3,421 observed combinations of resident and anesthesiologist. The logits of the percentage of scores equal to the maximum value of 4.00 were normally distributed (residents, P = 0.24; anesthesiologists, P = 0.50). There were 20/97 anesthesiologists identified as significant outliers (13 with below average supervision scores and seven with better than average) using the mixed-effects logistic regression with rater leniency entered as a fixed effect but not by Student's t test. In contrast, there were three of 97 anesthesiologists identified as outliers (all three above average) using Student's t tests but not by logistic regression with leniency. The 20 vs 3 was significant (P < 0.001). Use of logistic regression with leniency results in greater detection of anesthesiologists with significantly better (or worse) clinical supervision scores than use of Student's t tests (i.e., without adjustment for rater leniency).
Baldi, Pierre
2010-01-01
As repositories of chemical molecules continue to expand and become more open, it becomes increasingly important to develop tools to search them efficiently and assess the statistical significance of chemical similarity scores. Here we develop a general framework for understanding, modeling, predicting, and approximating the distribution of chemical similarity scores and its extreme values in large databases. The framework can be applied to different chemical representations and similarity measures but is demonstrated here using the most common binary fingerprints with the Tanimoto similarity measure. After introducing several probabilistic models of fingerprints, including the Conditional Gaussian Uniform model, we show that the distribution of Tanimoto scores can be approximated by the distribution of the ratio of two correlated Normal random variables associated with the corresponding unions and intersections. This remains true also when the distribution of similarity scores is conditioned on the size of the query molecules in order to derive more fine-grained results and improve chemical retrieval. The corresponding extreme value distributions for the maximum scores are approximated by Weibull distributions. From these various distributions and their analytical forms, Z-scores, E-values, and p-values are derived to assess the significance of similarity scores. In addition, the framework allows one to predict also the value of standard chemical retrieval metrics, such as Sensitivity and Specificity at fixed thresholds, or ROC (Receiver Operating Characteristic) curves at multiple thresholds, and to detect outliers in the form of atypical molecules. Numerous and diverse experiments carried in part with large sets of molecules from the ChemDB show remarkable agreement between theory and empirical results. PMID:20540577
Student assessment by objective structured examination in a neurology clerkship
Adesoye, Taiwo; Smith, Sandy; Blood, Angela; Brorson, James R.
2012-01-01
Objectives: We evaluated the reliability and predictive ability of an objective structured clinical examination (OSCE) in the assessment of medical students at the completion of a neurology clerkship. Methods: We analyzed data from 195 third-year medical students who took the OSCE. For each student, the OSCE consisted of 2 standardized patient encounters. The scores obtained from each encounter were compared. Faculty clinical evaluations of each student for 2 clinical inpatient rotations were also compared. Hierarchical regression analysis was applied to test the ability of the averaged OSCE scores to predict standardized written examination scores and composite clinical scores. Results: Students' OSCE scores from the 2 standardized patient encounters were significantly correlated with each other (r = 0.347, p < 0.001), and the scores for all students were normally distributed. In contrast, students' faculty clinical evaluation scores from 2 different clinical inpatient rotations were uncorrelated, and scores were skewed toward the highest ratings. After accounting for clerkship order, better OSCE scores were predictive of better National Board of Medical Examiners standardized examination scores (R2Δ = 0.131, p < 0.001) and of better faculty clinical scores (R2Δ = 0.078, p < 0.001). Conclusions: Student assessment by an OSCE provides a reliable and predictive objective assessment of clinical performance in a neurology clerkship. PMID:22855865
Walhain, Fenna; van Gorp, Marloes; Lamur, Kenneth S; Veeger, Dirkjan H E J; Ledebt, Annick
2016-10-01
Health-related fitness (HRF) and motor coordination (MC) can be influenced by children's environment and lifestyle behavior. This study evaluates the association between living environment and HRF, MC, and physical and sedentary activities of children in Suriname. Tests were performed for HRF (morphological, muscular, and cardiorespiratory component), gross MC (Körperkoordinations Test für Kinder), fine MC (Movement Assessment Battery for Children), and self-reported activities in 79 urban and 77 rural 7-year-old Maroon children. Urban-rural differences were calculated by an independent sample t test (Mann-Whitney U test if not normally distributed) and χ 2 test. No difference was found in body mass index, muscle strength, and the overall score of gross and fine MC. However, urban children scored lower in HRF on the cardiorespiratory component (P ≤ .001), in gross MC on walking backward (P = .014), and jumping sideways (P = 0.011). They scored higher in the gross MC component moving sideways (P ≤ .001) and lower in fine MC on the trail test (P = .036) and reported significantly more sedentary and fewer physical activities than rural children. Living environment was associated with certain components of HRF, MC, and physical and sedentary activities of 7-year-old children in Suriname. Further research is needed to evaluate the development of urban children to provide information for possible intervention and prevention strategies.
Accelerometry-enabled measurement of walking performance with a robotic exoskeleton: a pilot study.
Lonini, Luca; Shawen, Nicholas; Scanlan, Kathleen; Rymer, William Z; Kording, Konrad P; Jayaraman, Arun
2016-03-31
Clinical scores for evaluating walking skills with lower limb exoskeletons are often based on a single variable, such as distance walked or speed, even in cases where a host of features are measured. We investigated how to combine multiple features such that the resulting score has high discriminatory power, in particular with few patients. A new score is introduced that allows quantifying the walking ability of patients with spinal cord injury when using a powered exoskeleton. Four spinal cord injury patients were trained to walk over ground with the ReWalk™ exoskeleton. Body accelerations during use of the device were recorded by a wearable accelerometer and 4 features to evaluate walking skills were computed. The new score is the Gaussian naïve Bayes surprise, which evaluates patients relative to the features' distribution measured in 7 expert users of the ReWalk™. We compared our score based on all the features with a standard outcome measure, which is based on number of steps only. All 4 patients improved over the course of training, as their scores trended towards the expert users' scores. The combined score (Gaussian naïve surprise) was considerably more discriminative than the one using only walked distance (steps). At the end of training, 3 out of 4 patients were significantly different from the experts, according to the combined score (p < .001, Wilcoxon Signed-Rank Test). In contrast, all but one patient were scored as experts when number of steps was the only feature. Integrating multiple features could provide a more robust metric to measure patients' skills while they learn to walk with a robotic exoskeleton. Testing this approach with other features and more subjects remains as future work.
Age- and Gender-Specific Normative Information from Children Assessed with a Dichotic Words Test.
Moncrieff, Deborah
2015-01-01
The most widely used assessment in the clinical auditory processing disorder (APD) battery is the dichotic listening test. New tests with normative information are helpful for assessment and cross-check of results for reliable diagnosis. The Dichotic Words Test was developed for use in the clinical test battery for diagnosis of APD. The test stimuli were common single syllable words matched for average root-mean-square amplitude and each pair was temporally aligned at both onset and offset. The study was conducted to collect performance results from typically developing children to create normative information for the test. The study follows a cross-sectional design. Typically developing children (n = 416) between the ages of 5 and 12 yr were recruited from schools in the community. There were 217 males and 199 females in the study sample. Only children who passed a hearing screening were eligible to participate. Scores for each ear were recorded during administration of the first free recall version of the test. Ear advantages based on results recorded for left and right ears were used to measure prevalence of right, left, and no ear advantages. Results for each listener's dominant and non-dominant ears and the absolute difference between them were put into the data analysis. Results were analyzed for normality and because no results were normally distributed, all further analyses were done with nonparametric statistical tests. Normative data for dominant and non-dominant ear scores and ear advantages were determined at the 95% confidence interval through bootstrapping methods with 1,000 samples. Children were divided into four age groups based on results in their dominant ears. Females generally performed better than males and the prevalence of a right-ear advantage was ∼60% across all children tested. Normative lower-bound cut-off scores were established for males and females within each age group for dominant and non-dominant ear scores. Normative upper-bound cut-off scores were established for males and females within each age group for ear advantage scores. Normative information specific to age group and gender will be useful in clinical assessment for APD. Prevalence of left-ear advantage results in the sample may have been partly due to uncontrolled influences of voice-onset time in arranging the dichotic pairs. American Academy of Audiology.
Cross-cultural validity of a dietary questionnaire for studies of dental caries risk in Japanese.
Shinga-Ishihara, Chikako; Nakai, Yukie; Milgrom, Peter; Murakami, Kaori; Matsumoto-Nakano, Michiyo
2014-01-02
Diet is a major modifiable contributing factor in the etiology of dental caries. The purpose of this paper is to examine the reliability and cross-cultural validity of the Japanese version of the Food Frequency Questionnaire to assess dietary intake in relation to dental caries risk in Japanese. The 38-item Food Frequency Questionnaire, in which Japanese food items were added to increase content validity, was translated into Japanese, and administered to two samples. The first sample comprised 355 pregnant women with mean age of 29.2 ± 4.2 years for the internal consistency and criterion validity analyses. Factor analysis (principal components with Varimax rotation) was used to determine dimensionality. The dietary cariogenicity score was calculated from the Food Frequency Questionnaire and used for the analyses. Salivary mutans streptococci level was used as a semi-quantitative assessment of dental caries risk and measured by Dentocult SM. Dentocult SM scores were compared with the dietary cariogenicity score computed from the Food Frequency Questionnaire to examine criterion validity, and assessed by Spearman's correlation coefficient (rs) and Kruskal-Wallis test. Test-retest reliability of the Food Frequency Questionnaire was assessed with a second sample of 25 adults with mean age of 34.0 ± 3.0 years by using the intraclass correlation coefficient analysis. The Japanese language version of the Food Frequency Questionnaire showed high test-retest reliability (ICC = 0.70) and good criterion validity assessed by relationship with salivary mutans streptococci levels (rs = 0.22; p < 0.001). Factor analysis revealed four subscales that construct the questionnaire (solid sugars, solid and starchy sugars, liquid and semisolid sugars, sticky and slowly dissolving sugars). Internal consistency were low to acceptable (Cronbach's alpha = 0.67 for the total scale, 0.46-0.61 for each subscale). Mean dietary cariogenicity scores were 50.8 ± 19.5 in the first sample, 47.4 ± 14.1, and 40.6 ± 11.3 for the first and second administrations in the second sample. The distribution of Dentocult SM score was 6.8% (score = 0), 34.4% (score = 1), 39.4% (score = 2), and 19.4% (score = 3). Participants with higher scores were more likely to have higher dietary cariogenicity scores (p < 0.001; Kruskal-Wallis test). These results provide the preliminary evidence for the reliability and validity of the Japanese language Food Frequency Questionnaire.
Risk scores for outcome in bacterial meningitis: Systematic review and external validation study.
Bijlsma, Merijn W; Brouwer, Matthijs C; Bossuyt, Patrick M; Heymans, Martijn W; van der Ende, Arie; Tanck, Michael W T; van de Beek, Diederik
2016-11-01
To perform an external validation study of risk scores, identified through a systematic review, predicting outcome in community-acquired bacterial meningitis. MEDLINE and EMBASE were searched for articles published between January 1960 and August 2014. Performance was evaluated in 2108 episodes of adult community-acquired bacterial meningitis from two nationwide prospective cohort studies by the area under the receiver operating characteristic curve (AUC), the calibration curve, calibration slope or Hosmer-Lemeshow test, and the distribution of calculated risks. Nine risk scores were identified predicting death, neurological deficit or death, or unfavorable outcome at discharge in bacterial meningitis, pneumococcal meningitis and invasive meningococcal disease. Most studies had shortcomings in design, analyses, and reporting. Evaluation showed AUCs of 0.59 (0.57-0.61) and 0.74 (0.71-0.76) in bacterial meningitis, 0.67 (0.64-0.70) in pneumococcal meningitis, and 0.81 (0.73-0.90), 0.82 (0.74-0.91), 0.84 (0.75-0.93), 0.84 (0.76-0.93), 0.85 (0.75-0.95), and 0.90 (0.83-0.98) in meningococcal meningitis. Calibration curves showed adequate agreement between predicted and observed outcomes for four scores, but statistical tests indicated poor calibration of all risk scores. One score could be recommended for the interpretation and design of bacterial meningitis studies. None of the existing scores performed well enough to recommend routine use in individual patient management. Copyright © 2016 The British Infection Association. Published by Elsevier Ltd. All rights reserved.
van Geel, Nanja; Lommerts, Janny E; Bekkenk, Marcel W; Prinsen, Cecilia A C; Eleftheriadou, Viktoria; Taieb, Alain; Picardo, Mauro; Ezzedine, Khaled; Wolkerstorfer, Albert; Speeckaert, Reinhart
2017-03-01
The Vitiligo Extent Score (VES) has recently been introduced as a physicians' score for the clinical assessment of the extent of vitiligo, but a good patient self-assessment score is lacking. The objective is to develop and validate a simplified version of the VES as a patient-reported outcome measure (PROM). After extensive pilot testing, patients were asked to score their vitiligo extent twice with an interval of 2 weeks using the Self Assessment Vitiligo Extent Score (SA-VES). The scores were compared with the physicians' evaluation (VES). The SA-VES demonstrated very good test-retest reliability (intraclass correlation = 0.948, 95% confidence interval [CI]: 0.911-0.970) that was not affected by age, skin type, or vitiligo distribution pattern. According to patients, this evaluation method was easy to use (22% very easy; 49% easy; 29% normal) and required <5 minutes in the majority of patients (73%, <5 minutes; 24%, 5-10 minutes; 2%, 10-15 minutes). Comparison of the SA-VES and the VES demonstrated excellent correlation (r = 0.986, P <.001). Few patients had a dark skin type. The results demonstrate excellent reliability of the SA-VES and excellent correlation with its investigator-reported counterpart (VES). This patient-oriented evaluation method provides a useful tool for the assessment of vitiligo extent. Copyright © 2016 American Academy of Dermatology, Inc. Published by Elsevier Inc. All rights reserved.
McLachlan, G J; Bean, R W; Jones, L Ben-Tovim
2006-07-01
An important problem in microarray experiments is the detection of genes that are differentially expressed in a given number of classes. We provide a straightforward and easily implemented method for estimating the posterior probability that an individual gene is null. The problem can be expressed in a two-component mixture framework, using an empirical Bayes approach. Current methods of implementing this approach either have some limitations due to the minimal assumptions made or with more specific assumptions are computationally intensive. By converting to a z-score the value of the test statistic used to test the significance of each gene, we propose a simple two-component normal mixture that models adequately the distribution of this score. The usefulness of our approach is demonstrated on three real datasets.
Preparedness for pandemics: does variation among states affect the nation as a whole?
Potter, Margaret A; Brown, Shawn T; Lee, Bruce Y; Grefenstette, John; Keane, Christopher R; Lin, Chyongchiou J; Quinn, Sandra C; Stebbins, Samuel; Sweeney, Patricia M; Burke, Donald S
2012-01-01
Since states' public health systems differ as to pandemic preparedness, this study explored whether such heterogeneity among states could affect the nation's overall influenza rate. The Centers for Disease Control and Prevention produced a uniform set of scores on a 100-point scale from its 2008 national evaluation of state preparedness to distribute materiel from the Strategic National Stockpile (SNS). This study used these SNS scores to represent each state's relative preparedness to distribute influenza vaccine in a timely manner and assumed that "optimal" vaccine distribution would reach at least 35% of the state's population within 4 weeks. The scores were used to determine the timing of vaccine distribution for each state: each 10-point decrement of score below 90 added an additional delay increment to the distribution time. A large-scale agent-based computational model simulated an influenza pandemic in the US population. In this synthetic population each individual or agent had an assigned household, age, workplace or school destination, daily commute, and domestic intercity air travel patterns. Simulations compared influenza case rates both nationally and at the state level under 3 scenarios: no vaccine distribution (baseline), optimal vaccine distribution in all states, and vaccine distribution time modified according to state-specific SNS score. Between optimal and SNS-modified scenarios, attack rates rose not only in low-scoring states but also in high-scoring states, demonstrating an interstate spread of infections. Influenza rates were sensitive to variation of the SNS-modified scenario (delay increments of 1 day versus 5 days), but the interstate effect remained. The effectiveness of a response activity such as vaccine distribution could benefit from national standards and preparedness funding allocated in part to minimize interstate disparities.
Correlation between plasma homocysteine levels and craving in alcohol dependent stabilized patients.
Coppola, Maurizio; Mondola, Raffaella
2018-06-01
Homocysteine is a sulfur amino acid strictly related with alcohol consumption. In alcoholics, hyperhomocysteinemia can increase the risk of various alcohol-related disorders such as: brain atrophy, epileptic seizures during withdrawal, and mood disorders. To evaluate the correlation among serum homocysteine concentrations, craving, hazardous and harmful patterns of alcohol consumption in patients stabilized for withdrawal symptoms. Participants were adult outpatients accessed at the Addiction Treatment Unit. Alcoholism was assessed using the following tools: Mini-International Neuropsychiatric Interview Plus (MINI Plus), Alcohol Use Disorder Identification test (AUDIT), Visual Analogic Scale for craving (VAS). Furthermore, during the first visit a blood sample was taken from all patients to measure the plasma concentration of both homocysteine and Carboxy Deficient Transferrin (CDT). Differences between groups in socio-demographic and clinical characteristics were analyzed using the t-test and the Mann-Whitney's U test for normally and non-normally distributed data, respectively. Correlation between clinical scale scores and plasma concentration of homocysteine and CDT was evaluated using the Pearson's correlation coefficient and the Kendall's Tau-b bivariate correlation coefficient for normally and non-normally distributed data, respectively. Our study included 92 patients. No difference was found in socio-demographic characteristics between groups. The group with high homocysteine had higher prevalence of mood disorders (p < 0.001), plasma CDT percentage (p < 0.001), VAS score (p < 0.001) and AUDIT score (p < 0.001) than group with normal homocysteine. Plasma homocysteine showed a positive correlation with both VAS score (p < 0.001), and AUDIT score (p < 0.05). In our study, plasma homocysteine concentration is associated with craving, hazardous and harmful patterns of alcohol consumption. In particular, homocysteine is correlated with alcoholism in a bidirectional manner because its level appears to be related with alcohol degree, but simultaneously, hyperhomocysteinemia could enhance the alcohol consumption increasing the severity of craving in a circular self reinforcing mechanism. Copyright © 2017 Elsevier Ltd and European Society for Clinical Nutrition and Metabolism. All rights reserved.
ERIC Educational Resources Information Center
Peña, Elizabeth D.; Bedore, Lisa M.; Kester, Ellen S.
2016-01-01
Background: Significant progress has been made in the identification of language impairment in children are bilingual. Bilingual children's vocabulary knowledge may be distributed across languages. Thus, when testing bilingual children it is difficult to know how to weigh each language for diagnostic purposes. Even when conceptual scoring is used…
Could situational judgement tests be used for selection into dental foundation training?
Patterson, F; Ashworth, V; Mehra, S; Falcon, H
2012-07-13
To pilot and evaluate a machine-markable situational judgement test (SJT) designed to select candidates into UK dental foundation training. Single centre pilot study. UK postgraduate deanery in 2010. Seventy-four candidates attending interview for dental foundation training in Oxford and Wessex Deaneries volunteered to complete the situational judgement test. The situational judgement test was developed to assess relevant professional attributes for dentistry (for example, empathy and integrity) in a machine-markable format. Test content was developed by subject matter experts working with experienced psychometricians. Evaluation of psychometric properties of the pilot situational judgement test (for example, reliability, validity and fairness). Scores in the dental foundation training selection process (short-listing and interviews) were used to examine criterion-related validity. Candidates completed an evaluation questionnaire to examine candidate reactions and face validity of the new test. Forty-six candidates were female and 28 male; mean age was 23.5-years-old (range 22-32). Situational judgement test scores were normally distributed and the test showed good internal reliability when corrected for test length (α = 0.74). Situational judgement test scores positively correlated with the management, leadership and professionalism interview (N = 50; r = 0.43, p <0.01) but not with the clinical skills interview, providing initial evidence of criterion-related validity as the situational judgement test is designed to test non-cognitive professional attributes beyond clinical knowledge. Most candidates perceived the situational judgement test as relevant to dentistry, appropriate for their training level, and fair. This initial pilot study suggests that a situational judgement test is an appropriate and innovative method to measure professional attributes (eg empathy and integrity) for selection into foundation training. Further research will explore the long-term predictive validity of the situational judgement test once candidates have entered training.
Fear the serpent: A psychometric study of snake phobia.
Polák, Jakub; Sedláčková, Kristýna; Nácar, David; Landová, Eva; Frynta, Daniel
2016-08-30
Millions of people worldwide suffer from specific phobias. Almost any stimulus may trigger a phobic reaction, but snakes are among the most feared objects. Half of the population feel anxious about snakes and 2-3% meet the diagnostic criteria for snake phobia. Despite such a high ratio, only one instrument is commonly used, the Snake Questionnaire (SNAQ). The aim of this study was to develop a standardized Czech translation, describe its psychometric properties and analyze the distribution of snake fears. In a counter-balanced design 755 respondents were asked to complete the English and Czech SNAQ (first or last) with a 2-3 month delay; 300 of them completed both instruments. We found excellent test-retest reliability (0.94), although the total scores differed significantly when the English version was administered first. The mean score was 5.80 and Generalized Linear Models revealed significant effects of sex and field of study (women and people with no biology education scored higher than men and biologists). A cut-off point for snake phobia as derived from a previous study identified 2.6% of the subjects as phobic. Finally, the score distribution was similar to other countries supporting the view that fear of snakes is universal. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Eating Disorders Among Female Students of Taif University, Saudi Arabia.
Abd El-Azeem Taha, Azza Ali; Abu-Zaid, Hany Ahmed; El-Sayed Desouky, Dalia
2018-03-01
Eating disorders are a common health problem among adolescents, and females are especially vulnerable to them. There is lack of information on the prevalence of eating disorders in Saudi Arabia. The current study aimed to investigate the prevalence of eating disorders among female undergraduate university students in Taif city, Saudi Arabia. The study was undertaken in the female section at Taif university from November 1, 2016 to March 30, 2017. Eating Attitudes Test (EAT-26) was used to determine the prevalence of eating disorders. The questionnaire was distributed among undergraduate students and their anthropometric measurements were assessed after obtaining their consent. The sample included 1200 university students with a median age of 21 years (range 17-33). Nonparametric tests were used to assess relationship between variables. Chi-squared test was used to compare items of the disordered eating attitudes and behaviors between positive and negative EAT respondents. Using the cutoff score of 20 on EAT-26 test, 35.4% of the students were classified at risk for eating disorders. Medical and obese students achieved the highest significant EAT scores. A high prevalence of eating disorders was found among females at Taif university, Kingdom of Saudi Arabia. Our findings call for prevention of these disorders and we recommend establishing a national screening program among Saudi university female students for early detection and management of these problems. © 2018 The Author(s). This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Genome-Wide Polygenic Scores Predict Reading Performance Throughout the School Years.
Selzam, Saskia; Dale, Philip S; Wagner, Richard K; DeFries, John C; Cederlöf, Martin; O'Reilly, Paul F; Krapohl, Eva; Plomin, Robert
2017-07-04
It is now possible to create individual-specific genetic scores, called genome-wide polygenic scores (GPS). We used a GPS for years of education ( EduYears ) to predict reading performance assessed at UK National Curriculum Key Stages 1 (age 7), 2 (age 12) and 3 (age 14) and on reading tests administered at ages 7 and 12 in a UK sample of 5,825 unrelated individuals. EduYears GPS accounts for up to 5% of the variance in reading performance at age 14. GPS predictions remained significant after accounting for general cognitive ability and family socioeconomic status. Reading performance of children in the lowest and highest 12.5% of the EduYears GPS distribution differed by a mean growth in reading ability of approximately two school years. It seems certain that polygenic scores will be used to predict strengths and weaknesses in education.
Genome-Wide Polygenic Scores Predict Reading Performance Throughout the School Years
Selzam, Saskia; Dale, Philip S.; Wagner, Richard K.; DeFries, John C.; Cederlöf, Martin; O’Reilly, Paul F.; Krapohl, Eva; Plomin, Robert
2017-01-01
ABSTRACT It is now possible to create individual-specific genetic scores, called genome-wide polygenic scores (GPS). We used a GPS for years of education (EduYears) to predict reading performance assessed at UK National Curriculum Key Stages 1 (age 7), 2 (age 12) and 3 (age 14) and on reading tests administered at ages 7 and 12 in a UK sample of 5,825 unrelated individuals. EduYears GPS accounts for up to 5% of the variance in reading performance at age 14. GPS predictions remained significant after accounting for general cognitive ability and family socioeconomic status. Reading performance of children in the lowest and highest 12.5% of the EduYears GPS distribution differed by a mean growth in reading ability of approximately two school years. It seems certain that polygenic scores will be used to predict strengths and weaknesses in education. PMID:28706435
Use of multivariate measures of disability in health surveys.
Charlton, J R; Patrick, D L; Peach, H
1983-01-01
It has been claimed that the aggregation of information from several areas of life into a small set of global measures has certain advantages for describing disability. Global measures of disability were constructed from a modified version of an existing health survey instrument and the sickness impact profile (SIP) and their properties were tested. The disability items grouped satisfactorily into five global measures (physical, psychosocial, eating, communication, and work). All disability measures (global and original category scores) were poor predictors of service use by individuals but were related as expected to age and number of medical conditions. The global measures generally had lower standard errors and better repeatability. All scores exhibit J-shaped distributions for cross sectional data but the change in global measures over time was consistent with the normal distribution. Preferably, both global and category measures should be used for comparing changes over time between groups of individuals. PMID:6655420
Assessing the predictive value of the American Board of Family Practice In-training Examination.
Replogle, William H; Johnson, William D
2004-03-01
The American Board of Family Practice In-training Examination (ABFP ITE) is a cognitive examination similar in content to the ABFP Certification Examination (CE). The ABFP ITE is widely used in family medicine residency programs. It was originally developed and intended to be used for assessment of groups of residents. Despite lack of empirical support, however, some residency programs are using ABFP ITE scores as individual resident performance indicators. This study's objective was to estimate the positive predictive value of the ABFP ITE for identifying residents at risk for poor performance on the ABFP CE or a subsequent ABFP ITE. We used a normal distribution model for correlated test scores and Monte Carlo simulation to investigate the effect of test reliability (measurement errors) on the positive predictive value of the ABFP ITE. The positive predictive value of the composite score was .72. The positive predictive value of the eight specialty subscales ranged from .26 to .57. Only the composite score of the ABFP ITE has acceptable positive predictive value to be used as part of a comprehension resident evaluation system. The ABFP ITE specialty subscales do not have sufficient positive predictive value or reliability to warrant use as performance indicators.
Martinková, Patrícia; Drabinová, Adéla; Liaw, Yuan-Ling; Sanders, Elizabeth A.; McFarland, Jenny L.; Price, Rebecca M.
2017-01-01
We provide a tutorial on differential item functioning (DIF) analysis, an analytic method useful for identifying potentially biased items in assessments. After explaining a number of methodological approaches, we test for gender bias in two scenarios that demonstrate why DIF analysis is crucial for developing assessments, particularly because simply comparing two groups’ total scores can lead to incorrect conclusions about test fairness. First, a significant difference between groups on total scores can exist even when items are not biased, as we illustrate with data collected during the validation of the Homeostasis Concept Inventory. Second, item bias can exist even when the two groups have exactly the same distribution of total scores, as we illustrate with a simulated data set. We also present a brief overview of how DIF analysis has been used in the biology education literature to illustrate the way DIF items need to be reevaluated by content experts to determine whether they should be revised or removed from the assessment. Finally, we conclude by arguing that DIF analysis should be used routinely to evaluate items in developing conceptual assessments. These steps will ensure more equitable—and therefore more valid—scores from conceptual assessments. PMID:28572182
Emotional intelligence among nursing students: Findings from a cross-sectional study.
Štiglic, Gregor; Cilar, Leona; Novak, Žiga; Vrbnjak, Dominika; Stenhouse, Rosie; Snowden, Austyn; Pajnkihar, Majda
2018-07-01
Emotional intelligence in nursing is of global interest. International studies identify that emotional intelligence influences nurses' work and relationships with patients. It is associated with compassion and care. Nursing students scored higher on measures of emotional intelligence compared to students of other study programmes. The level of emotional intelligence increases with age and tends to be higher in women. This study aims to measure the differences in emotional intelligence between nursing students with previous caring experience and those without; to examine the effects of gender on emotional intelligence scores; and to test whether nursing students score higher than engineering colleagues on emotional intelligence measures. A cross-sectional descriptive study design was used. The study included 113 nursing and 104 engineering students at the beginning of their first year of study at a university in Slovenia. Emotional intelligence was measured using the Trait Emotional Intelligence Questionnaire (TEIQue) and Schutte Self Report Emotional Intelligence Test (SSEIT). Shapiro-Wilk's test of normality was used to test the sample distribution, while the differences in mean values were tested using Student t-test of independent samples. Emotional intelligence was higher in nursing students (n = 113) than engineering students (n = 104) in both measures [TEIQue t = 3.972; p < 0.001; SSEIT t = 8.288; p < 0.001]. Although nursing female students achieved higher emotional intelligence scores than male students on both measures, the difference was not statistically significant [TEIQue t = -0.839; p = 0.403; SSEIT t = -1.159; p = 0.249]. EI scores in nursing students with previous caring experience were not higher compared to students without such experience for any measure [TEIQue t = -1.633; p = 0.105; SSEIT t = -0.595; p = 0.553]. Emotional intelligence was higher in nursing than engineering students, and slightly higher in women than men. It was not associated with previous caring experience. Copyright © 2018 Elsevier Ltd. All rights reserved.
Asymmetric bias in perception of facial affect among Roman and Arabic script readers.
Heath, Robin L; Rouhana, Aida; Ghanem, Dana Abi
2005-01-01
The asymmetric chimeric faces test is used frequently as an indicator of right hemisphere involvement in the perception of facial affect, as the test is considered free of linguistic elements. Much of the original research with the asymmetric chimeric faces test was conducted with subjects reading left-to-right Roman script, i.e., English. As readers of right-to-left scripts, such as Arabic, demonstrated a mixed or weak rightward bias in judgements of facial affect, the influence of habitual scanning direction was thought to intersect with laterality. We administered the asymmetric chimeric faces test to 1239 adults who represented a range of script experience, i.e., Roman script readers (English and French), Arabic readers, bidirectional readers of Roman and Arabic scripts, and illiterates. Our findings supported the hypothesis that the bias in facial affect judgement is rooted in laterality, but can be influenced by script direction. Specifically, right-handed readers of Roman script demonstrated the greatest mean leftward score, and mixed-handed Arabic script readers demonstrated the greatest mean rightward score. Biliterates showed a gradual shift in asymmetric perception, as their scores fell between those of Roman and Arabic script readers, basically distributed in the order expected by their handedness and most often used script. Illiterates, whose only directional influence was laterality, showed a slight leftward bias.
Football goal distributions and extremal statistics
NASA Astrophysics Data System (ADS)
Greenhough, J.; Birch, P. C.; Chapman, S. C.; Rowlands, G.
2002-12-01
We analyse the distributions of the number of goals scored by home teams, away teams, and the total scored in the match, in domestic football games from 169 countries between 1999 and 2001. The probability density functions (PDFs) of goals scored are too heavy-tailed to be fitted over their entire ranges by Poisson or negative binomial distributions which would be expected for uncorrelated processes. Log-normal distributions cannot include zero scores and here we find that the PDFs are consistent with those arising from extremal statistics. In addition, we show that it is sufficient to model English top division and FA Cup matches in the seasons of 1970/71-2000/01 on Poisson or negative binomial distributions, as reported in analyses of earlier seasons, and that these are not consistent with extremal statistics.
Biases and power for groups comparison on subjective health measurements.
Hamel, Jean-François; Hardouin, Jean-Benoit; Le Neel, Tanguy; Kubis, Gildas; Roquelaure, Yves; Sébille, Véronique
2012-01-01
Subjective health measurements are increasingly used in clinical research, particularly for patient groups comparisons. Two main types of analytical strategies can be used for such data: so-called classical test theory (CTT), relying on observed scores and models coming from Item Response Theory (IRT) relying on a response model relating the items responses to a latent parameter, often called latent trait. Whether IRT or CTT would be the most appropriate method to compare two independent groups of patients on a patient reported outcomes measurement remains unknown and was investigated using simulations. For CTT-based analyses, groups comparison was performed using t-test on the scores. For IRT-based analyses, several methods were compared, according to whether the Rasch model was considered with random effects or with fixed effects, and the group effect was included as a covariate or not. Individual latent traits values were estimated using either a deterministic method or by stochastic approaches. Latent traits were then compared with a t-test. Finally, a two-steps method was performed to compare the latent trait distributions, and a Wald test was performed to test the group effect in the Rasch model including group covariates. The only unbiased IRT-based method was the group covariate Wald's test, performed on the random effects Rasch model. This model displayed the highest observed power, which was similar to the power using the score t-test. These results need to be extended to the case frequently encountered in practice where data are missing and possibly informative.
Karabuva, Svjetlana; Carević, Vedran; Radić, Mislav; Fabijanić, Damir
2013-01-01
The aim of study was to: 1) examine the relationship between ABO blood groups and extent of coronary atherosclerosis in patients with chronic coronary artery disease (CAD), 2) compare ABO blood groups distribution in CAD patients and general population, 3) examine possible differences in traditional risk factors frequency in CAD patients with different ABO blood groups. In the 646 chronic CAD patients (72.4% males) coronary angiograms were scored by quantitative assessment using multiple angiographic scoring system, Traditional risk factors were self reported or measured by standard methods. ABO blood distribution of patients was compared with group of 651 healthy blood donors (74.6% males). Among all ABO blood group patients there was no significant difference between the extent of coronary atherosclerosis with regard to all the three scoring systems: number of affected coronary arteries (P = 0.857), Gensini score (P = 0.818), and number of segments narrowed > 50% (P = 0.781). There was no significant difference in ABO blood group distribution between CAD patients and healthy blood donors. Among CAD patients, men with blood group AB were significantly younger than their pairs with non-AB blood groups (P = 0.008). Among CAD patients with AB blood group, males < 50 yrs were significantly overrepresented when compared with the non-AB groups (P = 0.003). No association between ABO blood groups and the extent of coronary atherosclerosis in Croatian CAD patients is observed. Observation that AB blood group might possibly identify Croatian males at risk to develop the premature CAD has to be tested in larger cohort of patients.
Hazel, Susan J; Signal, Tania D; Taylor, Nicola
2011-01-01
Attitudes toward animals are important in influencing how animals are treated. Few studies have investigated attitudes toward animals in veterinary or animal-science students, and no studies have compared attitudes to animals before and after a course teaching animal welfare and ethics. In this study, students enrolled in veterinary (first-year) or animal-science (first- and third-year) programs completed a questionnaire on attitudes toward different categories of animals before and after the course. Higher attitude scores suggest a person more concerned about how an animal is treated. Normally distributed data were compared using parametric statistics, and non-normally distributed data were compared using non-parametric tests, with significance p < .05. Attitudes toward pets (45.5-47.6) were higher than those toward pests (34.2-38.4) or profit animals (30.3-32.1). Attitude scores increased from before to after the course in the veterinary cohort on the Pest (36.9 vs. 38.4, respectively, n = 27, p < .05) and Profit (30.3 vs. 32.1, respectively, n = 28, p < .05) subscales, but not in the animal-science cohorts. Attitude scores in all categories were higher for women than for men. Currently having an animal was associated with higher pet scores (46.8 vs. 43.8, ns = 120 and 13, respectively, p < .05), and having an animal as a child was associated with higher profit scores (31.0 vs. 26.6, ns = 129 and 8, respectively, p < .05). Students electing to work with livestock had lower scores on the Pest and Profit subscales, and students wanting to work with wildlife had significantly higher scores on the Pest and Profit subscales. This study demonstrates attitudinal changes after an animal-welfare course, with significant increases in veterinary but not animal-science students.
Roitberg, Ben Z; Kania, Patrick; Luciano, Cristian; Dharmavaram, Naga; Banerjee, Pat
2015-01-01
Manual skill is an important attribute for any surgeon. Current methods to evaluate sensory-motor skills in neurosurgical residency applicants are limited. We aim to develop an objective multifaceted measure of sensory-motor skills using a virtual reality surgical simulator. A set of 3 tests of sensory-motor function was performed using a 3-dimensional surgical simulator with head and arm tracking, collocalization, and haptic feedback. (1) Trajectory planning: virtual reality drilling of a pedicle. Entry point, target point, and trajectory were scored-evaluating spatial memory and orientation. (2) Motor planning: sequence, timing, and precision: hemostasis in a postresection cavity in the brain. (3) Haptic perception: touching virtual spheres to determine which is softest of the group, with progressive difficulty. Results were analyzed individually and for a combined score of all the tasks. The University of Chicago Hospital's tertiary care academic center. A total of 95 consecutive applicants interviewed at a neurosurgery residency program over 2 years were offered anonymous participation in the study; in 2 cohorts, 36 participants in year 1 and 27 participants in year 2 (validation cohort) agreed and completed all the tasks. We also tested 10 first-year medical students and 4 first- and second-year neurosurgery residents. A cumulative score was generated from the 3 tests. The mean score was 14.47 (standard deviation = 4.37), median score was 13.42, best score was 8.41, and worst score was 30.26. Separate analysis of applicants from each of 2 years yielded nearly identical results. Residents tended to cluster on the better performance side, and first-year students were not different from applicants. (1) Our cumulative score measures sensory-motor skills in an objective and reproducible way. (2) Better performance by residents hints at validity for neurosurgery. (3) We were able to demonstrate good psychometric qualities and generate a proposed sensory-motor quotient distribution in our tested population. Copyright © 2015 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
Kim, Roger H; Kurtzman, Scott H; Collier, Ashley N; Shabahang, Mohsen M
Learning styles theory posits that learners have distinct preferences for how they assimilate new information. The VARK model categorizes learners based on combinations of 4 learning preferences: visual (V), aural (A), read/write (R), and kinesthetic (K). A previous single institution study demonstrated that the VARK preferences of applicants who interview for general surgery residency are different from that of the general population and that learning preferences were associated with performance on standardized tests. This multiinstitutional study was conducted to determine the distribution of VARK preferences among interviewees for general surgery residency and the effect of those preferences on United States Medical Licensing Examination (USMLE) scores. The VARK learning inventory was administered to applicants who interviewed at 3 general surgery programs during the 2014 to 2015 academic year. The distribution of VARK learning preferences among interviewees was compared with that of the general population of VARK respondents. Performance on USMLE Step 1 and Step 2 Clinical Knowledge was analyzed for associations with VARK learning preferences. Chi-square, analysis of variance, and Dunnett's test were used for statistical analysis, with p < 0.05 considered statistically significant. The VARK inventory was completed by a total of 140 residency interviewees. Sixty-four percent of participants were male, and 41% were unimodal, having a preference for a single learning modality. The distribution of VARK preferences of interviewees was different than that of the general population (p = 0.02). By analysis of variance, there were no overall differences in USMLE Step 1 and Step 2 Clinical Knowledge scores by VARK preference (p = 0.06 and 0.21, respectively). However, multiple comparison analysis using Dunnett's test revealed that interviewees with R preferences had significantly higher scores than those with multimodal preferences on USMLE Step 1 (239 vs. 222, p = 0.02). Applicants who interview for general surgery residency have a different pattern of VARK preferences than that of the general population. Interviewees with preferences for read/write learning modalities have higher scores on the USMLE Step 1 than those with multimodal preferences. Learning preferences may have impact on residency applicant selection and represents a topic that warrants further investigation. Copyright © 2016 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
The Effect of Foot Reflexology on Anxiety, Pain, and Outcomes of the Labor in Primigravida Women.
Moghimi-Hanjani, Soheila; Mehdizadeh-Tourzani, Zahra; Shoghi, Mahnaz
2015-08-01
Reflexology is a technique used widely as one of non-pharmacological pain management techniques. The present study aimed to review and determine the effect of foot reflexology on anxiety, pain and outcomes of the labor in primigravida women. This clinical trial study was conducted on 80 primigravida mothers who were divided randomly into an intervention group (Foot reflexology applied for 40 min, n=40) and control group (n=40). The pain intensity was scored immediately after the end of intervention and at 30,60 and 120 min after the intervention in both groups, based on McGill Questionnaire for Pain Rating Index (PRI). Spielberger State-Trait Anxiety Inventory (STAI) was completed before and after intervention in both groups. Duration of labor phases, the type of labor and Apgar scores of the infant at the first and fifth minute were recorded in both groups. Descriptive and inferential statistics methods (t-test and chi-square test) were applied in analyzing data. Application of reflexology technique decreased pain intensity (at 30, 60 and 120 min after intervention) and duration of labor as well as anxiety level significantly (P<0.001). Furthermore, a significant difference was observed between two groups in terms of the frequency distribution of the type of labor and Apgar score (P<0.001). Results of this study show that reflexology reduces labor pain intensity, duration of labor, anxiety, frequency distribution of natural delivery and increases Apgar scores. Using this non-invasive technique, obstetricians can achieve, to some extent, to one of the most important goals of midwifery as pain relief and reducing anxiety during labor and encourage the mothers to have a vaginal delivery.
Alford, Timothy J; Roberts, W Eugene; Hartsfield, James K; Eckert, George J; Snyder, Ronald J
2011-05-01
Utilize American Board of Orthodontics (ABO) cast/radiographic evaluation (CRE) to compare a series of 63 consecutive patients, finished with manual wire bending (conventional) treatment, vs a subsequent series of 69 consecutive patients, finished by the same orthodontist using the SureSmile™ (SS) method. Records of 132 nonextraction patients were scored by a calibrated examiner blinded to treatment mode. Age and discrepancy index (DI) between groups were compared by t-tests. A chi-square test was used to compare for differences in sex and whether the patient was treated using braces only (no orthopedic correction). Analysis of covariance tested for differences in CRE outcomes and treatment times, with sex and DI included as covariates. A logarithmic transformation of CRE outcomes and treatment times was used because their distributions were skewed. Significance was defined as P < .05. Compared with conventional finishing, SS patients had significantly lower DI scores, less treatment time (∼7 months), and better CRE scores for first-order alignment-rotation and interproximal space closure; however, second-order root angulation (RA) was inferior. SS patients were treated in less time to better CRE scores for first-order rotation (AR) and interproximal space closure (IC) but on the average, malocclusions were less complex and second order root alignment was inferior, compared with patients finished with manual wire bending.
Alford, Timothy J.; Roberts, W. Eugene; Hartsfield, James K.; Eckert, George J.; Snyder, Ronald J.
2016-01-01
Objective Utilize American Board of Orthodontics (ABO) cast/radiographic evaluation (CRE) to compare a series of 63 consecutive patients, finished with manual wire bending (conventional) treatment, vs a subsequent series of 69 consecutive patients, finished by the same orthodontist using the SureSmile™ (SS) method. Materials and Methods Records of 132 nonextraction patients were scored by a calibrated examiner blinded to treatment mode. Age and discrepancy index (DI) between groups were compared by t-tests. A chi-square test was used to compare for differences in sex and whether the patient was treated using braces only (no orthopedic correction). Analysis of covariance tested for differences in CRE outcomes and treatment times, with sex and DI included as covariates. A logarithmic transformation of CRE outcomes and treatment times was used because their distributions were skewed. Significance was defined as P < .05. Results Compared with conventional finishing, SS patients had significantly lower DI scores, less treatment time (~7 months), and better CRE scores for first-order alignment-rotation and interproximal space closure; however, second-order root angulation (RA) was inferior. Conclusion SS patients were treated in less time to better CRE scores for first-order rotation (AR) and interproximal space closure (IC) but on the average, malocclusions were less complex and second order root alignment was inferior, compared with patients finished with manual wire bending. PMID:21261488
scoringRules - A software package for probabilistic model evaluation
NASA Astrophysics Data System (ADS)
Lerch, Sebastian; Jordan, Alexander; Krüger, Fabian
2016-04-01
Models in the geosciences are generally surrounded by uncertainty, and being able to quantify this uncertainty is key to good decision making. Accordingly, probabilistic forecasts in the form of predictive distributions have become popular over the last decades. With the proliferation of probabilistic models arises the need for decision theoretically principled tools to evaluate the appropriateness of models and forecasts in a generalized way. Various scoring rules have been developed over the past decades to address this demand. Proper scoring rules are functions S(F,y) which evaluate the accuracy of a forecast distribution F , given that an outcome y was observed. As such, they allow to compare alternative models, a crucial ability given the variety of theories, data sources and statistical specifications that is available in many situations. This poster presents the software package scoringRules for the statistical programming language R, which contains functions to compute popular scoring rules such as the continuous ranked probability score for a variety of distributions F that come up in applied work. Two main classes are parametric distributions like normal, t, or gamma distributions, and distributions that are not known analytically, but are indirectly described through a sample of simulation draws. For example, Bayesian forecasts produced via Markov Chain Monte Carlo take this form. Thereby, the scoringRules package provides a framework for generalized model evaluation that both includes Bayesian as well as classical parametric models. The scoringRules package aims to be a convenient dictionary-like reference for computing scoring rules. We offer state of the art implementations of several known (but not routinely applied) formulas, and implement closed-form expressions that were previously unavailable. Whenever more than one implementation variant exists, we offer statistically principled default choices.
Gender differences in illness behavior after cardiac surgery.
Modica, Maddalena; Ferratini, Maurizio; Spezzaferri, Rosa; De Maria, Renata; Previtali, Emanuele; Castiglioni, Paolo
2014-01-01
Differences in the ways male and female patients confront their illness after cardiac surgery may contribute to previously observed gender differences in the outcomes of cardiac rehabilitation. The aim of this cross-sectional study was to verify whether there are gender-related differences in illness behavior (IB) soon after cardiac surgery and before entering cardiac rehabilitation. Patients (N = 1323) completed the IB Questionnaire and Hospital Anxiety and Depression Scale (HADS) 9 ± 5 (mean ± SD) days after cardiac surgery. The scores were tested for gender differences in score distributions (Mann-Whitney U test) and in prevalence of clinically relevant scores (the Pearson χ² test). Multivariate regression analyses were made with IB Questionnaire and HADS scores as independent variables, and gender, age, education, marital status, and type of surgery as predictors. Denial was significantly (P < .01) prevalent among the men (3.6 ± 1.4) versus women (3.2 ± 1.6), whereas disease conviction (men = 2.1 ± 1.5, women = 2.5 ± 1.6), dysphoria (men = 1.5 ± 1.5, women = 2.0 ± 1.6), anxiety (men = 6.0 ± 3.6, women = 6.9 ± 3.9), and depression (men = 5.3 ± 3.8, women = 6.5 ± 4.0) were significantly more prevalent among women. The prevalences of clinically relevant scores for disease conviction, anxiety, and depression were also significantly higher in women. Multivariate analysis showed that gender predicted these scores even after the removal of confounders. Gender differences exist in denial, disease conviction, and dysphoria, probably depending on the culturally assigned roles of men and women. As these aspects of IB may compromise treatment compliance and the quality of life, the efficacy of cardiac rehabilitation programs might be improved taking into account the different prevalences in men and women.
Luschin-Ebengreuth, Marion; Dimai, Hans P; Ithaler, Daniel; Neges, Heide M; Reibnegger, Gilbert
2015-03-14
In the framework of medical university admission procedures the assessment of non-cognitive abilities is increasingly demanded. As tool for assessing personal qualities or the ability to handle theoretical social constructs in complex situations, the Situational Judgment Test (SJT), among other measurement instruments, is discussed in the literature. This study focuses on the development and the results of the SJT as part of the admission test for the study of human medicine and dentistry at one medical university in Austria. Observational investigation focusing on the results of the SJT. 4741 applicants were included in the study. To yield comparable results for the different test parts, "relative scores" for each test part were calculated. Performance differences between women and men in the various test parts are analyzed using effect sizes based on comparison of mean values (Cohen's d). The associations between the relative scores achieved in the various test parts were assessed by computing pairwise linear correlation coefficients between all test parts and visualized by bivariate scatterplots. Among successful candidates, men consistently outperform women. Men perform better in physics and mathematics. Women perform better in the SJT part. The least discriminatory test part was the SJT. A strong correlation between biology and chemistry and moderate correlations between the other test parts except SJT is obvious. The relative scores are not symmetrically distributed. The cognitive loading of the performed SJTs points to the low correlation between the SJTs and cognitive abilities. Adding the SJT part into the admission test, in order to cover more than only knowledge and understanding of natural sciences among the applicants has been quite successful.
The Relation of Education and Income to Cognitive Function among Professional Women
Lee, Sunmin; Buring, Julie E.; Cook, Nancy R.; Grodstein, Francine
2005-01-01
We investigated the relation of educational attainment and annual household income to cognitive function and cognitive decline in community-dwelling women aged 66 years or older. Subjects were 6,314 health professionals participating in the Women’s Health Study, among whom information on education and income was self-reported. From 1998 to 2000, we administered five cognitive tests, measuring general cognition, episodic memory and verbal fluency, using a validated telephone interview. Second cognitive assessments were conducted approximately two years later; information was complete for 5,573 women at the time of analysis, with 94% follow-up. We used linear and logistic regression to calculate multivariate-adjusted mean differences, and odds of cognitive impairment (defined as worst 10% of test distribution) and of substantial decline in performance (worst 10% of distribution), across various levels of education and income. After adjusting for numerous potential confounding factors, we found strong trends of increasing mean cognitive performance with increasing level of education (p-trend<0.0005 on all cognitive measures). Odds of cognitive impairment also consistently decreased with increasing education (eg, on summary score combining all tests, OR=0.6, 95% CI 0.3–0.9 comparing those with a doctoral degree to those with a 3-year associate’s degree). For income, we found significant trends of increasing mean cognitive performance with increasing income on the summary score and on episodic memory (p-trends<0.0001). For example, the OR was 0.6 (95% CI 0.4–0.8) comparing those with the highest income to the lowest income on the summary score. Results were generally similar for cognitive decline over two years, although somewhat weaker. Thus, in these well-educated, professional women, educational attainment and income both predicted cognitive function and decline. PMID:16352912
Nguyen, Allison M; Arbuckle, Rob; Korver, Tjeerd; Chen, Fang; Taylor, Beverley; Turnbull, Alice; Norquist, Josephine M
2017-08-01
The objective of this study was to evaluate the psychometric properties of the Dysmenorrhea Daily Diary (DysDD), an electronic patient-reported outcome, in a sample of 355 women with primary dysmenorrhea enrolled in a phase IIb, multicenter, randomized, partially blinded, placebo-controlled trial for treatment of dysmenorrhea. Subjects completed the DysDD over three menstrual cycles, one pre-treatment baseline cycle and two treatment cycles. The DysDD was administered alongside the Menstrual Distress Questionnaire (MDQ), the Short-Form 36 Version 2.0 (SF-36v2), and a Global Assessment of Change (GAC). Item response distributions, test-retest reliability, concurrent and known groups validity, responsiveness, and minimally important difference (MID) were evaluated for the DysDD. As expected, item response distributions varied throughout the menstrual period for all items, with the response scales fully utilized. Within-cycle test-retest reliability was adequate (weighted kappa: 0.5-0.7), although between-cycle test-retest was poor (weighted kappa: 0.1-0.5), most likely due to the highly variable nature of dysmenorrhea between cycles rather than limitations of the measure. Correlations with the MDQ and SF-36v2 were low-moderate, but in the predicted direction, supporting concurrent validity. There were significant differences in DysDD scores across severity groups based on pain medication use. The DysDD was responsive to changes in patients' dysmenorrhea with significantly different changes in scores between change groups (p < 0.0001). MID analyses suggest changes on the DysDD 0-10 pelvic pain score of three points can be considered clinically meaningful. Overall, findings indicate that the DysDD has acceptable reliability and is a valid and responsive instrument for assessing dysmenorrhea.
Upadhyay, Dinesh Kumar; Mohamed Ibrahim, Mohamed Izham; Mishra, Pranaya; Alurkar, Vijay M
2015-02-12
Patient satisfaction is the ultimate goal of healthcare system which can be achieved from good patient-healthcare professional relationship and quality of healthcare services provided. Study was conducted to determine the baseline satisfaction level of newly diagnosed diabetics and to explore the impact of pharmaceutical care intervention on patients' satisfaction during their follow-ups in a tertiary care teaching hospital in Nepal. An interventional, pre-post non-clinical randomised controlled study was designed among randomly distributed 162 [control group (n = 54), test 1 group (n = 54) and test 2 group (n = 54)] newly diagnosed diabetes mellitus patients by consecutive sampling method for 18 months. Diabetes Patient Satisfaction Questionnaire was used to evaluate patient's satisfaction scores at baseline, three, six, nine and, twelve months' follow-ups. Test groups patients were provided pharmaceutical care whereas control group patients only received their usual care from physician/nurses. The responses were entered in SPSS version 16. Data distribution was not normal on Kolmogorov-Smirnov test. Non-parametric tests i.e. Friedman test, Mann-Whitney U test and Wilcoxon signed rank test were used to find the differences among the groups before and after the intervention (p ≤0.05). There were significant (p < 0.001) improvements in patients' satisfaction scores in the test groups on Friedman test. Mann-Whitney U test identified the significant differences in satisfaction scores between test 1 and test 2 groups, control and test 1 groups and, control and test 2 groups at 3-months (p = 0.008), (p < 0.001) and (p < 0.001), 6-months (p = 0.010), (p < 0.001) and (p < 0.001), 9-months (p < 0.001), (p < 0.001) and (p < 0.001) and, 12-months (p < 0.001), (p < 0.001) and (p < 0.001) follow-ups respectively. Pharmaceutical care intervention significantly improved the satisfaction level of diabetics in the test groups compare to the control group. Diabetic kit demonstration strengthened the satisfaction level among the test 2 group patients. Therefore, pharmacist can act as a counsellor through pharmaceutical care program and assist the patients in managing their disease. This will not only modify the patients' related outcomes and their level of satisfaction but also improve the healthcare system.
ERIC Educational Resources Information Center
Lee, Yi-Hsuan; von Davier, Alina A.
2008-01-01
The kernel equating method (von Davier, Holland, & Thayer, 2004) is based on a flexible family of equipercentile-like equating functions that use a Gaussian kernel to continuize the discrete score distributions. While the classical equipercentile, or percentile-rank, equating method carries out the continuization step by linear interpolation,…
ERIC Educational Resources Information Center
Ruta, Liliana; Mazzone, Domenico; Mazzone, Luigi; Wheelwright, Sally; Baron-Cohen, Simon
2012-01-01
The Autism Spectrum Quotient (AQ) has been used to define the "broader" (BAP), "medium" (MAP) and "narrow" autism phenotypes (NAP). We used a new Italian version of the AQ to test if difference on AQ scores and the distribution of BAP, MAP and NAP in autism parents (n = 245) versus control parents (n = 300) were…
A psychometric comparison of three scales and a single-item measure to assess sexual satisfaction.
Mark, Kristen P; Herbenick, Debby; Fortenberry, J Dennis; Sanders, Stephanie; Reece, Michael
2014-01-01
This study was designed to systematically compare and contrast the psychometric properties of three scales developed to measure sexual satisfaction and a single-item measure of sexual satisfaction. The Index of Sexual Satisfaction (ISS), Global Measure of Sexual Satisfaction (GMSEX), and the New Sexual Satisfaction Scale-Short (NSSS-S) were compared to one another and to a single-item measure of sexual satisfaction. Conceptualization of the constructs, distribution of scores, internal consistency, convergent validity, test-retest reliability, and factor structure were compared between the measures. A total of 211 men and 214 women completed the scales and a measure of relationship satisfaction, with 33% (n = 139) of the sample reassessed two months later. All scales demonstrated appropriate distribution of scores and adequate internal consistency. The GMSEX, NSSS-S, and the single-item measure demonstrated convergent validity. Test-retest reliability was demonstrated by the ISS, GMSEX, and NSSS-S, but not the single-item measure. Taken together, the GMSEX received the strongest psychometric support in this sample for a unidimensional measure of sexual satisfaction and the NSSS-S received the strongest psychometric support in this sample for a bidimensional measure of sexual satisfaction.
The reliability and validity of the SF-8 with a conflict-affected population in northern Uganda.
Roberts, Bayard; Browne, John; Ocaka, Kaducu Felix; Oyok, Thomas; Sondorp, Egbert
2008-12-02
The SF-8 is a health-related quality of life instrument that could provide a useful means of assessing general physical and mental health amongst populations affected by conflict. The purpose of this study was to test the validity and reliability of the SF-8 with a conflict-affected population in northern Uganda. A cross-sectional multi-staged, random cluster survey was conducted with 1206 adults in camps for internally displaced persons in Gulu and Amuru districts of northern Uganda. Data quality was assessed by analysing the number of incomplete responses to SF-8 items. Response distribution was analysed using aggregate endorsement frequency. Test-retest reliability was assessed in a separate smaller survey using the intraclass correlation test. Construct validity was measured using principal component analysis, and the Pearson Correlation test for item-summary score correlation and inter-instrument correlations. Known groups validity was assessed using a two sample t-test to evaluates the ability of the SF-8 to discriminate between groups known to have, and not have, physical and mental health problems. The SF-8 showed excellent data quality. It showed acceptable item response distribution based upon analysis of aggregate endorsement frequencies. Test-retest showed a good intraclass correlation of 0.61 for PCS and 0.68 for MCS. The principal component analysis indicated strong construct validity and concurred with the results of the validity tests by the SF-8 developers. The SF-8 also showed strong construct validity between the 8 items and PCS and MCS summary score, moderate inter-instrument validity, and strong known groups validity. This study provides evidence on the reliability and validity of the SF-8 amongst IDPs in northern Uganda.
The reliability and validity of the SF-8 with a conflict-affected population in northern Uganda
Roberts, Bayard; Browne, John; Ocaka, Kaducu Felix; Oyok, Thomas; Sondorp, Egbert
2008-01-01
Background The SF-8 is a health-related quality of life instrument that could provide a useful means of assessing general physical and mental health amongst populations affected by conflict. The purpose of this study was to test the validity and reliability of the SF-8 with a conflict-affected population in northern Uganda. Methods A cross-sectional multi-staged, random cluster survey was conducted with 1206 adults in camps for internally displaced persons in Gulu and Amuru districts of northern Uganda. Data quality was assessed by analysing the number of incomplete responses to SF-8 items. Response distribution was analysed using aggregate endorsement frequency. Test-retest reliability was assessed in a separate smaller survey using the intraclass correlation test. Construct validity was measured using principal component analysis, and the Pearson Correlation test for item-summary score correlation and inter-instrument correlations. Known groups validity was assessed using a two sample t-test to evaluates the ability of the SF-8 to discriminate between groups known to have, and not have, physical and mental health problems. Results The SF-8 showed excellent data quality. It showed acceptable item response distribution based upon analysis of aggregate endorsement frequencies. Test-retest showed a good intraclass correlation of 0.61 for PCS and 0.68 for MCS. The principal component analysis indicated strong construct validity and concurred with the results of the validity tests by the SF-8 developers. The SF-8 also showed strong construct validity between the 8 items and PCS and MCS summary score, moderate inter-instrument validity, and strong known groups validity. Conclusion This study provides evidence on the reliability and validity of the SF-8 amongst IDPs in northern Uganda. PMID:19055716
The Effect of Empathy Training on the Empathic Skills of Nurses.
Kahriman, Ilknur; Nural, Nesrin; Arslan, Umit; Topbas, Murat; Can, Gamze; Kasim, Suheyla
2016-06-01
The profound impact of empathy training on quality nursing care has been recognized. Studies have shown that there has been little improvement in nurses' communication skills, and that they should work to enhance this area. Relevant training will lead to an improvement in nurses' empathic skills, which in turn, will enable them to understand their patients better, establish positive interpersonal relationships with them, and boost their professional satisfaction. To reveal the effect of empathy training on the empathic skills of nurses. This study was conducted as an experimental design. The research sample consisted of 48 nurses working at the pediatric clinics of Farabi hospital of Karadeniz Technical University in Turkey (N = 83). Two groups, an experimental group (group 1) and a control group (group 2) were determined after questionnaires were supplied to all nurses in the study sample. At first, it was intended to select these groups using a random method. However, since this may have meant that the experimental and control groups were formed from nurses working in the same service, the two groups were selected from different services to avoid possible interaction between them. The nurses in the Group 1 were provided with empathy training through group and creative drama techniques. Pre-tests and post-tests were conducted on both groups. Data was collected via a questionnaire designed around the topic "empathic skill scale-ESS", developed by Dokmen. The Kolmogorov Smirnov test was employed to assess whether the measurable data was suitable for normal distribution. Data was presented as numbers and percentage distributions, as mean ± standard deviation and Chi-square, and as student t tests and paired t tests. The level of significance was accepted as P < 0.05. The nurses in the experimental group had a mean score of 146.7 ± 38.8 and 169.5 ± 22.1 in the ESS pre-test and post-test, respectively. Although the nurses in the control group had a pre-test mean score of 133.7 ± 37.1, which increased to 135.1 ± 51.7 after the training, no statistically significant difference was found (P = 0.886). A comparison of the groups indicated that they scored similarly in the pre-test. However, the experimental group scored significantly higher than the control group in the post-test (P = 0.270 and P = 0.015, respectively). In the light of these findings, it is recommended that communication skills should be widely included in in-service training programs; similar studies should be conducted on broader control groups formed through randomization; and a comparison should be made between the findings.
The Effect of Empathy Training on the Empathic Skills of Nurses
Kahriman, Ilknur; Nural, Nesrin; Arslan, Umit; Topbas, Murat; Can, Gamze; Kasim, Suheyla
2016-01-01
Background The profound impact of empathy training on quality nursing care has been recognized. Studies have shown that there has been little improvement in nurses’ communication skills, and that they should work to enhance this area. Relevant training will lead to an improvement in nurses’ empathic skills, which in turn, will enable them to understand their patients better, establish positive interpersonal relationships with them, and boost their professional satisfaction. Objectives To reveal the effect of empathy training on the empathic skills of nurses. Patients and Methods This study was conducted as an experimental design. The research sample consisted of 48 nurses working at the pediatric clinics of Farabi hospital of Karadeniz Technical University in Turkey (N = 83). Two groups, an experimental group (group 1) and a control group (group 2) were determined after questionnaires were supplied to all nurses in the study sample. At first, it was intended to select these groups using a random method. However, since this may have meant that the experimental and control groups were formed from nurses working in the same service, the two groups were selected from different services to avoid possible interaction between them. The nurses in the Group 1 were provided with empathy training through group and creative drama techniques. Pre-tests and post-tests were conducted on both groups. Data was collected via a questionnaire designed around the topic “empathic skill scale-ESS”, developed by Dokmen. The Kolmogorov Smirnov test was employed to assess whether the measurable data was suitable for normal distribution. Data was presented as numbers and percentage distributions, as mean ± standard deviation and Chi-square, and as student t tests and paired t tests. The level of significance was accepted as P < 0.05. Results The nurses in the experimental group had a mean score of 146.7 ± 38.8 and 169.5 ± 22.1 in the ESS pre-test and post-test, respectively. Although the nurses in the control group had a pre-test mean score of 133.7 ± 37.1, which increased to 135.1 ± 51.7 after the training, no statistically significant difference was found (P = 0.886). A comparison of the groups indicated that they scored similarly in the pre-test. However, the experimental group scored significantly higher than the control group in the post-test (P = 0.270 and P = 0.015, respectively). Conclusions In the light of these findings, it is recommended that communication skills should be widely included in in-service training programs; similar studies should be conducted on broader control groups formed through randomization; and a comparison should be made between the findings. PMID:27621922
Study on Diagnosing Three Dimensional Cloud Region
NASA Astrophysics Data System (ADS)
Cai, M., Jr.; Zhou, Y., Sr.
2017-12-01
Cloud mask and relative humidity (RH) provided by Cloudsat products from 2007 to 2008 are statistical analyzed to get RH Threshold between cloud and clear sky and its variation with height. A diagnosis method is proposed based on reanalysis data and applied to three-dimensional cloud field diagnosis of a real case. Diagnostic cloud field was compared to satellite, radar and other cloud precipitation observation. Main results are as follows. 1.Cloud region where cloud mask is bigger than 20 has a good space and time corresponding to the high value relative humidity region, which is provide by ECWMF AUX product. Statistical analysis of the RH frequency distribution within and outside cloud indicated that, distribution of RH in cloud at different height range shows single peak type, and the peak is near a RH value of 100%. Local atmospheric environment affects the RH distribution outside cloud, which leads to TH distribution vary in different region or different height. 2. RH threshold and its vertical distribution used for cloud diagnostic was analyzed from Threat Score method. The method is applied to a three dimension cloud diagnosis case study based on NCEP reanalysis data and th diagnostic cloud field is compared to satellite, radar and cloud precipitation observation on ground. It is found that, RH gradient is very big around cloud region and diagnosed cloud area by RH threshold method is relatively stable. Diagnostic cloud area has a good corresponding to updraft region. The cloud and clear sky distribution corresponds to satellite the TBB observations overall. Diagnostic cloud depth, or sum cloud layers distribution consists with optical thickness and precipitation on ground better. The cloud vertical profile reveals the relation between cloud vertical structure and weather system clearly. Diagnostic cloud distribution correspond to cloud observations on ground very well. 3. The method is improved by changing the vertical interval from altitude to temperature. The result shows that, the five factors , including TS score for clear sky, empty forecast, missed forecast, and especially TS score for cloud region and the accurate rate increased obviously. So, the RH threshold and its vertical distribution with temperature is better than with altitude. More tests and comparision should be done to assess the diagnosis method.
Huang, J; Vieland, V J
2001-01-01
It is well known that the asymptotic null distribution of the homogeneity lod score (LOD) does not depend on the genetic model specified in the analysis. When appropriately rescaled, the LOD is asymptotically distributed as 0.5 chi(2)(0) + 0.5 chi(2)(1), regardless of the assumed trait model. However, because locus heterogeneity is a common phenomenon, the heterogeneity lod score (HLOD), rather than the LOD itself, is often used in gene mapping studies. We show here that, in contrast with the LOD, the asymptotic null distribution of the HLOD does depend upon the genetic model assumed in the analysis. In affected sib pair (ASP) data, this distribution can be worked out explicitly as (0.5 - c)chi(2)(0) + 0.5chi(2)(1) + cchi(2)(2), where c depends on the assumed trait model. E.g., for a simple dominant model (HLOD/D), c is a function of the disease allele frequency p: for p = 0.01, c = 0.0006; while for p = 0.1, c = 0.059. For a simple recessive model (HLOD/R), c = 0.098 independently of p. This latter (recessive) distribution turns out to be the same as the asymptotic distribution of the MLS statistic under the possible triangle constraint, which is asymptotically equivalent to the HLOD/R. The null distribution of the HLOD/D is close to that of the LOD, because the weight c on the chi(2)(2) component is small. These results mean that the cutoff value for a test of size alpha will tend to be smaller for the HLOD/D than the HLOD/R. For example, the alpha = 0.0001 cutoff (on the lod scale) for the HLOD/D with p = 0.05 is 3.01, while for the LOD it is 3.00, and for the HLOD/R it is 3.27. For general pedigrees, explicit analytical expression of the null HLOD distribution does not appear possible, but it will still depend on the assumed genetic model. Copyright 2001 S. Karger AG, Basel
Bacchini, Dario; Licenziati, Maria Rosaria; Affuso, Gaetana; Garrasi, Alessandra; Corciulo, Nicola; Driul, Daniela; Tanas, Rita; Fiumani, Perla Maria; Di Pietro, Elena; Pesce, Sabino; Crinò, Antonino; Maltoni, Giulio; Iughetti, Lorenzo; Sartorio, Alessandro; Deiana, Manuela; Lombardi, Francesca; Valerio, Giuliana
2017-06-01
Research has provided evidence that obesity is associated with peer victimization and low levels of self-concept. No study has examined the relationship between BMI z-score, self-concept in multiple domains, and peer victimization. The aim of the research was to investigate the interplay between BMI z-score, self-concept in multiple domains (physical, athletic, social), and peer victimization, testing direct, mediated, and moderated associations. Eighty hundred fifteen outpatient children and adolescents were consecutively recruited in 14 hospitals distributed over the Italian country. The sample consisted of 419 males and 396 females; mean age 10.91 ± 1.97 years (range 6-14 years) and mean BMI z-score 1.85 ± 0.74 (range -0.97 ± 3.27). Peer victimization and self-concept were assessed with a revised Olweus Bully/Victim Questionnaire and with the Self-Perception Profile for Children. A structural equation model approach was used to determine the associations among variables, testing two competing models. In both models, path analysis revealed that BMI z-score was directly associated with peer victimization and self-concept in multiple domains. In the first model, peer victimization mediated the relationship between BMI-score and self-concept, whereas in the alternative model, self-concept mediated the relationship between BMI z-score and peer victimization. Interaction analyses revealed that social competence moderated the relationship between BMI z-score and peer victimization and that peer victimization moderated the relationship between BMI z-score and physical appearance. Higher levels of BMI z-score are a risk factor for peer victimization and poor self-concept. When high levels of BMI z-score are associated with a negative self-concept, the risk of victimization increases. Preventive and supportive interventions are needed to avoid negative consequences on quality of life in children and adolescents with obesity.
Jaipuria, Jiten; Suryavanshi, Manav; Sen, Tridib K
2016-12-01
To assess the reliability of the Guy's Stone Score, the Seoul National University Renal Stone Complexity (S-ReSC) score and the S.T.O.N.E. scores in percutaneous nephrolithotomy (PCNL), and assess their utility in discriminating outcomes [stone free rate (SFR), complications, need for multiple PCNL sessions, and auxiliary procedures] valid across parameters of experience of surgeon, independence from surgical approach, and variations in institution-specific instrumentation. A prospectively maintained database of two tertiary institutions was analysed (606 cases). Institutes differed in instrumentation, while the overall surgical team comprised: two trainees (experience <100 cases), two junior consultants (experience 100-200 cases), and two senior surgeons (experience >1000 cases). Scores were assigned and re-assigned after 4 months by one trainee and an expert surgeon. Inter-rater and test-retest agreement were analysed by Cohen's κ and intraclass correlation coefficient. Multivariate logistic regression models were created adjusting outcomes for the institution, comorbidity, Amplatz size, access tract location, the number of punctures, the experience level of the surgeon, and individual scoring system, and receiver operating curves were analysed for comparison. Despite some areas of inconsistencies, individually all scores had excellent inter-rater and test-retest concordance. On multivariable analyses, while the experience of the surgeon and surgical approach characteristics (such as access tract location, Amplatz size, and number of punctures) remained independently associated with different outcomes in varying combinations, calculus complexity scores were found consistently to be independently associated with all outcomes. The S-ReSC score had a superior association with SFR, the need for multiple PCNL sessions, and auxiliary procedures. Individually all scoring systems performed well. On cross comparison, the S-ReSC score consistently emerged to be more superiorly associated with all outcomes, signifying the importance of the distributional complexity of the calculus (which also indirectly amalgamates the influence of stone number, size, and anatomical location) in discriminating outcomes. Our study proves the utility of scoring systems in prognosticating multiple outcomes and also clarifies important aspects of their practical application including future roles such as benchmarking, audit, training, and objective assessment of surgical technique modifications. © 2016 The Authors BJU International © 2016 BJU International Published by John Wiley & Sons Ltd.
Verdaasdonk, E G G; Stassen, L P S; van Wijk, R P J; Dankelman, J
2007-02-01
Psychomotor skills for endoscopic surgery can be trained with virtual reality simulators. Distributed training is more effective than massed training, but it is unclear whether distributed training over several days is more effective than distributed training within 1 day. This study aimed to determine which of these two options is the most effective for training endoscopic psychomotor skills. Students with no endoscopic experience were randomly assigned either to distributed training on 3 consecutive days (group A, n = 10) or distributed training within 1 day (group B, n = 10). For this study the SIMENDO virtual reality simulator for endoscopic skills was used. The training involved 12 repetitions of three different exercises (drop balls, needle manipulation, 30 degree endoscope) in differently distributed training schedules. All the participants performed a posttraining test (posttest) for the trained tasks 7 days after the training. The parameters measured were time, nontarget environment collisions, and instrument path length. There were no significant differences between the groups in the first training session for all the parameters. In the posttest, group A (training over several days) performed 18.7% faster than group B (training on 1 day) (p = 0.013). The collision and path length scores for group A did not differ significantly from the scores for group B. The distributed group trained over several days was faster, with the same number of errors and the same instrument path length used. Psychomotor skill training for endoscopic surgery distributed over several days is superior to training on 1 day.
Zeitler, Daniel M; Dorman, Michael F; Natale, Sarah J; Loiselle, Louise; Yost, William A; Gifford, Rene H
2015-09-01
To assess improvements in sound source localization and speech understanding in complex listening environments after unilateral cochlear implantation for single-sided deafness (SSD). Nonrandomized, open, prospective case series. Tertiary referral center. Nine subjects with a unilateral cochlear implant (CI) for SSD (SSD-CI) were tested. Reference groups for the task of sound source localization included young (n = 45) and older (n = 12) normal-hearing (NH) subjects and 27 bilateral CI (BCI) subjects. Unilateral cochlear implantation. Sound source localization was tested with 13 loudspeakers in a 180 arc in front of the subject. Speech understanding was tested with the subject seated in an 8-loudspeaker sound system arrayed in a 360-degree pattern. Directionally appropriate noise, originally recorded in a restaurant, was played from each loudspeaker. Speech understanding in noise was tested using the Azbio sentence test and sound source localization quantified using root mean square error. All CI subjects showed poorer-than-normal sound source localization. SSD-CI subjects showed a bimodal distribution of scores: six subjects had scores near the mean of those obtained by BCI subjects, whereas three had scores just outside the 95th percentile of NH listeners. Speech understanding improved significantly in the restaurant environment when the signal was presented to the side of the CI. Cochlear implantation for SSD can offer improved speech understanding in complex listening environments and improved sound source localization in both children and adults. On tasks of sound source localization, SSD-CI patients typically perform as well as BCI patients and, in some cases, achieve scores at the upper boundary of normal performance.
Validation of the tablet-administered Brief Assessment of Cognition (BAC App).
Atkins, Alexandra S; Tseng, Tina; Vaughan, Adam; Twamley, Elizabeth W; Harvey, Philip; Patterson, Thomas; Narasimhan, Meera; Keefe, Richard S E
2017-03-01
Computerized tests benefit from automated scoring procedures and standardized administration instructions. These methods can reduce the potential for rater error. However, especially in patients with severe mental illnesses, the equivalency of traditional and tablet-based tests cannot be assumed. The Brief Assessment of Cognition in Schizophrenia (BACS) is a pen-and-paper cognitive assessment tool that has been used in hundreds of research studies and clinical trials, and has normative data available for generating age- and gender-corrected standardized scores. A tablet-based version of the BACS called the BAC App has been developed. This study compared performance on the BACS and the BAC App in patients with schizophrenia and healthy controls. Test equivalency was assessed, and the applicability of paper-based normative data was evaluated. Results demonstrated the distributions of standardized composite scores for the tablet-based BAC App and the pen-and-paper BACS were indistinguishable, and the between-methods mean differences were not statistically significant. The discrimination between patients and controls was similarly robust. The between-methods correlations for individual measures in patients were r>0.70 for most subtests. When data from the Token Motor Test was omitted, the between-methods correlation of composite scores was r=0.88 (df=48; p<0.001) in healthy controls and r=0.89 (df=46; p<0.001) in patients, consistent with the test-retest reliability of each measure. Taken together, results indicate that the tablet-based BAC App generates results consistent with the traditional pen-and-paper BACS, and support the notion that the BAC App is appropriate for use in clinical trials and clinical practice. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
Teaching children about bicycle safety: an evaluation of the New Jersey Bike School program.
Lachapelle, Ugo; Noland, Robert B; Von Hagen, Leigh Ann
2013-03-01
There are multiple health and environmental benefits associated with increasing bicycling among children. However, the use of bicycles is also associated with severe injuries and fatalities. In order to reduce bicycle crashes, a bicycling education program was implemented in selected New Jersey schools and summer camps as part of the New Jersey Safe Routes to School Program. Using a convenience sample of participants to the program, an opportunistic study was designed to evaluate the effectiveness of two bicycle education programs, the first a more-structured program delivered in a school setting, with no on-road component, and the other a less structured program delivered in a summer camp setting that included an on-road component. Tests administered before and after training were designed to assess knowledge acquired during the training. Questions assessed children's existing knowledge of helmet use and other equipment, bicycle safety, as well as their ability to discriminate hazards and understand rules of the road. Participating children (n=699) also completed a travel survey that assessed their bicycling behavior and their perception of safety issues. Response to individual questions, overall pre- and post-training test scores, and changes in test scores were compared using comparison of proportion, t-tests, and ordinary least-squares (OLS) regression. Improvements between the pre-training and post-training test are apparent from the frequency distribution of test results and from t-tests. Both summer camps and school-based programs recorded similar improvements in test results. Children who bicycled with their parents scored higher on the pre-training test but did not improve as much on the post-training test. Without evaluating long-term changes in behavior, it is difficult to ascertain how successful the program is on eventual behavioral and safety outcomes. Copyright © 2012 Elsevier Ltd. All rights reserved.
Psychometric properties of the Stroke Impairment Assessment Set (SIAS).
Liu, Meigen; Chino, Naoichi; Tuji, Testuya; Masakado, Yoshihisa; Hase, Kimitaka; Kimura, Akio
2002-12-01
To review the psychometric properties of the Stroke Impairment Assessment Set (SAS), which was developed in 1990 as a comprehensive instrument to assess stroke impairment. Articles related to the SIAS were retrieved from the MEDLINE and the Folia Centro Japonica. Thirty-five articles were retrieved and analyzed. 1) Scale quality: Rasch analysis demonstrated the unidimensionality of the SIAS. Factor analysis produced factors corresponding to the 6 SIAS subscales. 2) Interrater reliability: The weighted kappas were high except for the unaffected side quadriceps item for which the score distribution was skewed. 3) Concurrent validity: Significant correlations were found between a) SIAS motor items and the Motricity Index or the Brunnstrom stage, b) SIAS lower extremity scores and the Functional Independence Measure (FIMSM) locomotion scores, c) trunk scores and abdominal manual muscle testing, d) visuospatial scores and line bisection and copying task scores, and e) speech scores and the FIMSM communication scores. 4) Predictive validity: Three studies attempting to predict discharge functional status demonstrated that adding the SIAS as one of the predictors enhanced the predictive power 5) Responsiveness: The SIAS was more responsive to changes than the Motricity Index, the Brunnstrom stage, or the National Institutes of Health Stroke Scale. The SIAS is a useful measure of stroke impairment with well-established psychometric properties.
Prevost, Marie; Carrier, Marie-Eve; Chowne, Gabrielle; Zelkowitz, Phyllis; Joseph, Lawrence; Gold, Ian
2014-01-01
The first aim of our study was to validate the French version of the Reading the Mind in the Eyes test, a theory of mind test. The second aim was to test whether cultural differences modulate performance on this test. A total of 109 participants completed the original English version and 97 participants completed the French version. Another group of 30 participants completed the French version twice, one week apart. We report a similar overall distribution of scores in both versions and no differences in the mean scores between them. However, 2 items in the French version did not collect a majority of responses, which differed from the results of the English version. Test-retest showed good stability of the French version. As expected, participants who do not speak French or English at home, and those born in Asia, performed worse than North American participants, and those who speak English or French at home. We report a French version with acceptable validity and good stability. The cultural differences observed support the idea that Asian culture does not use theory of mind to explain people's behaviours as much as North American people do.
PROCOS: computational analysis of protein-protein complexes.
Fink, Florian; Hochrein, Jochen; Wolowski, Vincent; Merkl, Rainer; Gronwald, Wolfram
2011-09-01
One of the main challenges in protein-protein docking is a meaningful evaluation of the many putative solutions. Here we present a program (PROCOS) that calculates a probability-like measure to be native for a given complex. In contrast to scores often used for analyzing complex structures, the calculated probabilities offer the advantage of providing a fixed range of expected values. This will allow, in principle, the comparison of models corresponding to different targets that were solved with the same algorithm. Judgments are based on distributions of properties derived from a large database of native and false complexes. For complex analysis PROCOS uses these property distributions of native and false complexes together with a support vector machine (SVM). PROCOS was compared to the established scoring schemes of ZRANK and DFIRE. Employing a set of experimentally solved native complexes, high probability values above 50% were obtained for 90% of these structures. Next, the performance of PROCOS was tested on the 40 binary targets of the Dockground decoy set, on 14 targets of the RosettaDock decoy set and on 9 targets that participated in the CAPRI scoring evaluation. Again the advantage of using a probability-based scoring system becomes apparent and a reasonable number of near native complexes was found within the top ranked complexes. In conclusion, a novel fully automated method is presented that allows the reliable evaluation of protein-protein complexes. Copyright © 2011 Wiley Periodicals, Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Merchant, Thomas E., E-mail: thomas.merchant@stjude.org; Schreiber, Jane E.; Wu, Shengjie
Purpose: To prospectively follow children treated with craniospinal irradiation to determine critical combinations of radiation dose and volume that would predict for cognitive effects. Methods and Materials: Between 1996 and 2003, 58 patients (median age 8.14 years, range 3.99-20.11 years) with medulloblastoma received risk-adapted craniospinal irradiation followed by dose-intense chemotherapy and were followed longitudinally with multiple cognitive evaluations (through 5 years after treatment) that included intelligence quotient (estimated intelligence quotient, full-scale, verbal, and performance) and academic achievement (math, reading, spelling) tests. Craniospinal irradiation consisted of 23.4 Gy for average-risk patients (nonmetastatic) and 36-39.6 Gy for high-risk patients (metastatic or residual disease >1.5 cm{sup 2}). The primary sitemore » was treated using conformal or intensity modulated radiation therapy using a 2-cm clinical target volume margin. The effect of clinical variables and radiation dose to different brain volumes were modeled to estimate cognitive scores after treatment. Results: A decline with time for all test scores was observed for the entire cohort. Sex, race, and cerebrospinal fluid shunt status had a significant impact on baseline scores. Age and mean radiation dose to specific brain volumes, including the temporal lobes and hippocampi, had a significant impact on longitudinal scores. Dichotomized dose distributions at 25 Gy, 35 Gy, 45 Gy, and 55 Gy were modeled to show the impact of the high-dose volume on longitudinal test scores. The 50% risk of a below-normal cognitive test score was calculated according to mean dose and dose intervals between 25 Gy and 55 Gy at 10-Gy increments according to brain volume and age. Conclusions: The ability to predict cognitive outcomes in children with medulloblastoma using dose-effects models for different brain subvolumes will improve treatment planning, guide intervention, and help estimate the value of newer methods of irradiation.« less
Confidence Intervals for True Scores Using the Skew-Normal Distribution
ERIC Educational Resources Information Center
Garcia-Perez, Miguel A.
2010-01-01
A recent comparative analysis of alternative interval estimation approaches and procedures has shown that confidence intervals (CIs) for true raw scores determined with the Score method--which uses the normal approximation to the binomial distribution--have actual coverage probabilities that are closest to their nominal level. It has also recently…
Biases and Power for Groups Comparison on Subjective Health Measurements
Hamel, Jean-François; Hardouin, Jean-Benoit; Le Neel, Tanguy; Kubis, Gildas; Roquelaure, Yves; Sébille, Véronique
2012-01-01
Subjective health measurements are increasingly used in clinical research, particularly for patient groups comparisons. Two main types of analytical strategies can be used for such data: so-called classical test theory (CTT), relying on observed scores and models coming from Item Response Theory (IRT) relying on a response model relating the items responses to a latent parameter, often called latent trait. Whether IRT or CTT would be the most appropriate method to compare two independent groups of patients on a patient reported outcomes measurement remains unknown and was investigated using simulations. For CTT-based analyses, groups comparison was performed using t-test on the scores. For IRT-based analyses, several methods were compared, according to whether the Rasch model was considered with random effects or with fixed effects, and the group effect was included as a covariate or not. Individual latent traits values were estimated using either a deterministic method or by stochastic approaches. Latent traits were then compared with a t-test. Finally, a two-steps method was performed to compare the latent trait distributions, and a Wald test was performed to test the group effect in the Rasch model including group covariates. The only unbiased IRT-based method was the group covariate Wald’s test, performed on the random effects Rasch model. This model displayed the highest observed power, which was similar to the power using the score t-test. These results need to be extended to the case frequently encountered in practice where data are missing and possibly informative. PMID:23115620
The effect of constructivist teaching strategies on science test scores of middle school students
NASA Astrophysics Data System (ADS)
Vaca, James L., Jr.
International studies show that the United States is lagging behind other industrialized countries in science proficiency. The studies revealed how American students showed little significant gain on standardized tests in science between 1995 and 2005. Little information is available regarding how reform in American teaching strategies in science could improve student performance on standardized testing. The purpose of this quasi-experimental quantitative study using a pretest/posttest control group design was to examine how the use of a hands-on, constructivist teaching approach with low achieving eighth grade science students affected student achievement on the 2007 Ohio Eighth Grade Science Achievement Test posttest (N = 76). The research question asked how using constructivist teaching strategies in the science classroom affected student performance on standardized tests. Two independent samples of 38 students each consisting of low achieving science students as identified by seventh grade science scores and scores on the Ohio Eighth Grade Science Half-Length Practice Test pretest were used. Four comparisons were made between the control group receiving traditional classroom instruction and the experimental group receiving constructivist instruction including: (a) pretest/posttest standard comparison, (b) comparison of the number of students who passed the posttest, (c) comparison of the six standards covered on the posttest, (d) posttest's sample means comparison. A Mann-Whitney U Test revealed that there was no significant difference between the independent sample distributions for the control group and the experimental group. These findings contribute to positive social change by investigating science teaching strategies that could be used in eighth grade science classes to improve student achievement in science.
Comparing student learning with multiple research-based conceptual surveys: CSEM and BEMA.
NASA Astrophysics Data System (ADS)
Pollock, S. J.
2008-10-01
We present results demonstrating similar distributions of student scores, and statistically indistinguishable gains on two popular research-based assessment tools: the Brief Electricity and Magnetism Assessment (BEMA) and the Conceptual Survey of Electricity and Magnetism(CSEM). To deepen our understanding of student learning in our course environment and of these assessment tools as measures of student learning, we identify systematic trends and differences in results from these two instruments. We investigate correlations of both pre- and post- conceptual scores with other measures including traditional exam scores and course grades, student background (earlier grades), gender, a pretest of scientific reasoning, and tests of attitudes and beliefs about science and learning science. Overall, for practical purposes, we find the BEMA and CSEM are roughly equivalently useful instruments for measuring student learning in our course.
Quantification of type I error probabilities for heterogeneity LOD scores.
Abreu, Paula C; Hodge, Susan E; Greenberg, David A
2002-02-01
Locus heterogeneity is a major confounding factor in linkage analysis. When no prior knowledge of linkage exists, and one aims to detect linkage and heterogeneity simultaneously, classical distribution theory of log-likelihood ratios does not hold. Despite some theoretical work on this problem, no generally accepted practical guidelines exist. Nor has anyone rigorously examined the combined effect of testing for linkage and heterogeneity and simultaneously maximizing over two genetic models (dominant, recessive). The effect of linkage phase represents another uninvestigated issue. Using computer simulation, we investigated type I error (P value) of the "admixture" heterogeneity LOD (HLOD) score, i.e., the LOD score maximized over both recombination fraction theta and admixture parameter alpha and we compared this with the P values when one maximizes only with respect to theta (i.e., the standard LOD score). We generated datasets of phase-known and -unknown nuclear families, sizes k = 2, 4, and 6 children, under fully penetrant autosomal dominant inheritance. We analyzed these datasets (1) assuming a single genetic model, and maximizing the HLOD over theta and alpha; and (2) maximizing the HLOD additionally over two dominance models (dominant vs. recessive), then subtracting a 0.3 correction. For both (1) and (2), P values increased with family size k; rose less for phase-unknown families than for phase-known ones, with the former approaching the latter as k increased; and did not exceed the one-sided mixture distribution xi = (1/2) chi1(2) + (1/2) chi2(2). Thus, maximizing the HLOD over theta and alpha appears to add considerably less than an additional degree of freedom to the associated chi1(2) distribution. We conclude with practical guidelines for linkage investigators. Copyright 2002 Wiley-Liss, Inc.
Shanks, Carmen Byker; Smith, Teresa; Ahmed, Selena; Hunts, Holly
2017-01-01
Objective To assess the nutritional quality of food packages offered in the Food Distribution Program on Indian Reservations (FDPIR) program using Healthy Eating Index-2010 (HEI-2010). Design Data were collected from the list of the food products provided by the United States Department of Agriculture’s Food and Nutrition Handbook 501 for FDPIR. Nutritional quality was measured through a cross-sectional analysis of five randomly selected food packages offered through FDPIR. HEI-2010 component and total scores were calculated for each food package. Analysis of variance and t-tests assessed significant differences between food packages and HEI-2010 maximum scores, respectively. Setting This study took place in the United States. Subjects Study units included food products offered through FDPIR. Results The mean total HEI-2010 score for the combined FDPIR food packages was significantly lower than the total HEI-2010 maximum score of 100 (66.38, SD=11.60; p<0.01). Mean scores for total fruit (3.52, SD=0.73; p<0.05), total vegetables (2.58, SD=0.15; p<0.001), greens and beans (0.92, SD=1.00; p<0.001), dairy (5.12, SD=0.63; p<0.001), total protein foods (4.14, SD=0.56; p<0.05), and refined grains (3.04, SD=2.90; p<0.001) were all significantly lower than the maximum values. Conclusions The FDPIR food package HEI-2010 score was notably higher than other federal food assistance and nutrition programs. Study findings highlight opportunities for the FDPIR to modify its offerings to best support lifestyles towards prevention of diet-related chronic disease. PMID:26298513
Methods to Assess the Utility of Proxies
1990-08-01
PROXIES Gottfredson [5 has reviewed ways to analyze potential proxies for the National Academy of Science committee that oversees the work of the...distribution might get the highest possible scores on the test. Together, the Gottfredson and Allred papers suggest that different analyses are required...summarizes the implications of the Gottfredson and Allred papers. These two papers suggest that different kinds of analyses should be done, depending
ERIC Educational Resources Information Center
Miyake, Misao
This document reports the state of science achievement of Japanese students based on the Second International Science study conducted in 1983-84. Results are compared to the first study conducted in 1970. The target populations, samples, and structure of the second study are described. Test results including score distribution and high and low…
A Random Walk Picture of Basketball
NASA Astrophysics Data System (ADS)
Gabel, Alan; Redner, Sidney
2012-02-01
We analyze NBA basketball play-by-play data and found that scoring is well described by a weakly-biased, anti-persistent, continuous-time random walk. The time between successive scoring events follows an exponential distribution, with little memory between events. We account for a wide variety of statistical properties of scoring, such as the distribution of the score difference between opponents and the fraction of game time that one team is in the lead.
Distribution of lod scores in oligogenic linkage analysis.
Williams, J T; North, K E; Martin, L J; Comuzzie, A G; Göring, H H; Blangero, J
2001-01-01
In variance component oligogenic linkage analysis it can happen that the residual additive genetic variance bounds to zero when estimating the effect of the ith quantitative trait locus. Using quantitative trait Q1 from the Genetic Analysis Workshop 12 simulated general population data, we compare the observed lod scores from oligogenic linkage analysis with the empirical lod score distribution under a null model of no linkage. We find that zero residual additive genetic variance in the null model alters the usual distribution of the likelihood-ratio statistic.
Comparison of Clinpro Cario L-Pop estimates with CIA lactic acid estimates of the oral microflora.
Gerardu, Véronique; Heijnsbroek, Muriel; Buijs, Mark; van der Weijden, Fridus; Ten Cate, Bob; van Loveren, Cor
2006-04-01
Clinpro Cario L-Pop (CCLP) is a semiquantitive test claimed to determine the general potential for caries development and to monitor the individual caries risk. This test translates the capacity of the tongue microflora to produce lactic acid into a score of 1-9, indicating a low, medium or high risk for caries development. The aim of this randomized crossover, clinical trial was to evaluate the CCLP on its variation over time and its capacity to monitor the effect of three different oral hygiene procedures. The CCLP readings were compared with measurements of lactic acid in tongue biofilm and plaque samples by capillary ion electrophoresis (CIA). After four washout periods, the distribution of scores in the low-, medium-, and high-risk categories was 10%, 16%, and 74%, respectively. Out of 30 subjects, 11 scored consistently in the same category. The coefficients of variance of lactic acid concentrations were 31% for tongue samples and 25% for plaque samples. After using antimicrobial toothpaste and mouthwash, the number of high-risk scores was reduced to 33%; reduced acidogenicity was also found in tongue and plaque samples. We conclude that CCLP can be used to monitor and stimulate compliance to an antimicrobial oral hygiene protocol.
ERIC Educational Resources Information Center
Zimmerman, Donald W.
2011-01-01
This study investigated how population parameters representing heterogeneity of variance, skewness, kurtosis, bimodality, and outlier-proneness, drawn from normal and eleven non-normal distributions, also characterized the ranks corresponding to independent samples of scores. When the parameters of population distributions from which samples were…
2013-01-01
Background Few studies regarding Knowledge, Attitude and Practice (KAP) towards medicines among school teachers have been carried out in Nepal. Obtaining baseline KAP is important to note deficiencies and plan appropriate interventions. School teachers have to know about medicines as they can be an important source of information about rational and safe use of medicines. The department of Clinical Pharmacology, KIST Medical College, Lalitpur, conducted a study regarding KAP of school teachers about medicines before and after an educational intervention from April 2011 to December 2011. Methods The study was done in selected schools of Lalitpur district. Teachers were selected on a voluntary basis after obtaining written informed consent. Gender, ethnic or caste group, native place, age, educational qualifications, subject taught were noted. An educational intervention using a combination of methods like presentations, brainstorming sessions, interactive discussions using posters and distribution of information leaflets about the use of medicines was conducted. The KAP and overall scores among subgroups according to gender, age, level of education, subject, ethnicity, type of school (primary vs. secondary and government vs. private school) were studied. KAP and overall scores before and after the intervention was compared using Wilcoxon signed ranks test as the scores were not normally distributed. Results A total of 393 teachers participated before and after the intervention. The median (interquartile range) knowledge, attitude and practice scores before the intervention were 63 (10), 23 (5) and 270 (48) respectively while the overall score was 356. The median knowledge, attitude and practice scores after the intervention were 71 (10), 28 (5) and 270 (48) respectively while the overall score increased to 369. Maximum possible score of knowledge, attitude and practice were 100, 40 and 320 respectively. Scores improved significantly for knowledge (p<0.001), attitude (p<0.001) and total scores (p<0.001) but not for practice (p=0.528). Conclusion The intervention was effective in improving knowledge and attitude of the teachers. More studies among school teachers about their knowledge, attitude and practice about medicines are required in Nepal. PMID:23849402
High throughput nonparametric probability density estimation.
Farmer, Jenny; Jacobs, Donald
2018-01-01
In high throughput applications, such as those found in bioinformatics and finance, it is important to determine accurate probability distribution functions despite only minimal information about data characteristics, and without using human subjectivity. Such an automated process for univariate data is implemented to achieve this goal by merging the maximum entropy method with single order statistics and maximum likelihood. The only required properties of the random variables are that they are continuous and that they are, or can be approximated as, independent and identically distributed. A quasi-log-likelihood function based on single order statistics for sampled uniform random data is used to empirically construct a sample size invariant universal scoring function. Then a probability density estimate is determined by iteratively improving trial cumulative distribution functions, where better estimates are quantified by the scoring function that identifies atypical fluctuations. This criterion resists under and over fitting data as an alternative to employing the Bayesian or Akaike information criterion. Multiple estimates for the probability density reflect uncertainties due to statistical fluctuations in random samples. Scaled quantile residual plots are also introduced as an effective diagnostic to visualize the quality of the estimated probability densities. Benchmark tests show that estimates for the probability density function (PDF) converge to the true PDF as sample size increases on particularly difficult test probability densities that include cases with discontinuities, multi-resolution scales, heavy tails, and singularities. These results indicate the method has general applicability for high throughput statistical inference.
High throughput nonparametric probability density estimation
Farmer, Jenny
2018-01-01
In high throughput applications, such as those found in bioinformatics and finance, it is important to determine accurate probability distribution functions despite only minimal information about data characteristics, and without using human subjectivity. Such an automated process for univariate data is implemented to achieve this goal by merging the maximum entropy method with single order statistics and maximum likelihood. The only required properties of the random variables are that they are continuous and that they are, or can be approximated as, independent and identically distributed. A quasi-log-likelihood function based on single order statistics for sampled uniform random data is used to empirically construct a sample size invariant universal scoring function. Then a probability density estimate is determined by iteratively improving trial cumulative distribution functions, where better estimates are quantified by the scoring function that identifies atypical fluctuations. This criterion resists under and over fitting data as an alternative to employing the Bayesian or Akaike information criterion. Multiple estimates for the probability density reflect uncertainties due to statistical fluctuations in random samples. Scaled quantile residual plots are also introduced as an effective diagnostic to visualize the quality of the estimated probability densities. Benchmark tests show that estimates for the probability density function (PDF) converge to the true PDF as sample size increases on particularly difficult test probability densities that include cases with discontinuities, multi-resolution scales, heavy tails, and singularities. These results indicate the method has general applicability for high throughput statistical inference. PMID:29750803
Direct and contextual effects of individual values on organizational citizenship behavior in teams.
Arthaud-Day, Marne L; Rode, Joseph C; Turnley, William H
2012-07-01
The authors use Schwartz's values theory as an integrative framework for testing the relationship between individual values and peer-reported organizational citizenship behavior (OCB) in teams, controlling for sex, satisfaction, and personality traits. Using hierarchical linear modeling in a sample of 582 students distributed across 135 class project teams, the authors find positive, direct effects for achievement on citizenship behaviors directed toward individuals (OCB-I), for benevolence on citizenship behaviors directed toward the group (OCB-O), and for self-direction on both OCB-I and OCB-O. Applying relational demography techniques to test for contextual effects, the authors find that group mean power scores negatively moderate the relationship between individual power and OCB-I, whereas group mean self-direction scores positively moderate the relationship between self-direction and both OCB-I and OCB-O. (PsycINFO Database Record (c) 2012 APA, all rights reserved).
White Matter Changes and Confrontation Naming in Retired Aging National Football League Athletes.
Strain, Jeremy F; Didehbani, Nyaz; Spence, Jeffrey; Conover, Heather; Bartz, Elizabeth K; Mansinghani, Sethesh; Jeroudi, Myrtle K; Rao, Neena K; Fields, Lindy M; Kraut, Michael A; Cullum, C Munro; Hart, John; Womack, Kyle B
2017-01-15
Using diffusion tensor imaging (DTI), we assessed the relationship of white matter integrity and performance on the Boston Naming Test (BNT) in a group of retired professional football players and a control group. We examined correlations between fractional anisotropy (FA) and mean diffusivity (MD) with BNT T-scores in an unbiased voxelwise analysis processed with tract-based spatial statistics (TBSS). We also analyzed the DTI data by grouping voxels together as white matter tracts and testing each tract's association with BNT T-scores. Significant voxelwise correlations between FA and BNT performance were only seen in the retired football players (p < 0.02). Two tracts had mean FA values that significantly correlated with BNT performance: forceps minor and forceps major. White matter integrity is important for distributed cognitive processes, and disruption correlates with diminished performance in athletes exposed to concussive and subconcussive brain injuries, but not in controls without such exposure.
Cognitive effects of pregabalin in the treatment of long-term benzodiazepine-use and dependence.
Oulis, Panagiotis; Kalogerakou, Stamatina; Anyfandi, Eleni; Konstantakopoulos, George; Papakosta, Vassiliki-Maria; Masdrakis, Vasilios; Tsaltas, Eleftheria
2014-05-01
Long-term benzodiazepine (BDZ) use and dependence affect cognitive functioning adversely and partly irreversibly. Emerging evidence suggests that pregabalin (PGB) might be a safe and efficacious treatment of long-term BDZ use. The aim of the present study was to investigate the changes in several core cognitive functions after successful treatment of long-term BDZ use and dependence with PGB. Fourteen patients with long-term BDZ use (mean duration >15 years) underwent neuropsychological assessment with the mini-mental state examination and four tests from the Cambridge Neuropsychological Test Automated Battery (CANTAB) battery before the initiation of PGB treatment and at a two months follow-up after the cessation of BDZs. Patients' CANTAB percentile score distributions were compared with normative CANTAB data. Patients improved on cognitive measures of global cognitive functioning, time orientation, psychomotor speed, and visuospatial memory and learning with strong effect sizes. By contrast, they failed to improve on measures of attentional flexibility. Despite their significant improvement, patients' scores on most tests remained still at the lower percentiles of CANTAB normative scores. Although preliminary, our findings suggest that successful treatment of long-term BDZ use with PGB is associated with a substantial, though only partial, recovery of BDZ-compromised neuropsychological functioning, at least at a 2-month follow-up. Copyright © 2014 John Wiley & Sons, Ltd.
Madeni, Frida; Horiuchi, Shigeko; Iida, Mariko
2011-06-27
Sub-Saharan Africa is among the countries where 10% of girls become mothers by the age of 16 years old. The United Republic of Tanzania located in Sub-Saharan Africa is one country where teenage pregnancy is a problem facing adolescent girls. Adolescent pregnancy has been identified as one of the reasons for girls dropping out from school. This study's purpose was to evaluate a reproductive health awareness program for the improvement of reproductive health for adolescents in urban Tanzania. A quasi-experimental pre-test and post-test research design was conducted to evaluate adolescents' knowledge, attitude, and behavior about reproductive health before and after the program. Data were collected from students aged 11 to 16, at Ilala Municipal, Dar es Salaam, Tanzania. An anonymous 23-item questionnaire provided the data. The program was conducted using a picture drama, reproductive health materials and group discussion. In total, 313 questionnaires were distributed and 305 (97.4%) were useable for the final analysis. The mean age for girls was 12.5 years and 13.2 years for boys. A large minority of both girls (26.8%) and boys (41.4%) had experienced sex and among the girls who had experienced sex, 51.2% reported that it was by force. The girls' mean score in the knowledge pre-test was 5.9, and 6.8 in post-test, which increased significantly (t=7.9, p=0.000). The mean behavior pre-test score was 25.8 and post-test was 26.6, which showed a significant increase (t=3.0, p=0.003). The boys' mean score in the knowledge pre-test was 6.4 and 7.0 for the post-test, which increased significantly (t=4.5, p=0.000). The mean behavior pre-test score was 25.6 and 26.4 in post-test, which showed a significant increase (t=2.4, p=0.019). However, the pre-test and post-test attitude scores showed no statistically significant difference for either girls or boys. Teenagers have sexual experiences including sexual violence. Both of these phenomena are prevalent among school-going adolescents. The reproductive health program improved the students' knowledge and behavior about sexuality and decision-making after the program for both girls and boys. However, their attitudes about reproductive health were not likely to change based on the educational intervention as designed for this study.
2011-01-01
Background Sub-Saharan Africa is among the countries where 10% of girls become mothers by the age of 16 years old. The United Republic of Tanzania located in Sub-Saharan Africa is one country where teenage pregnancy is a problem facing adolescent girls. Adolescent pregnancy has been identified as one of the reasons for girls dropping out from school. This study's purpose was to evaluate a reproductive health awareness program for the improvement of reproductive health for adolescents in urban Tanzania. Methods A quasi-experimental pre-test and post-test research design was conducted to evaluate adolescents' knowledge, attitude, and behavior about reproductive health before and after the program. Data were collected from students aged 11 to 16, at Ilala Municipal, Dar es Salaam, Tanzania. An anonymous 23-item questionnaire provided the data. The program was conducted using a picture drama, reproductive health materials and group discussion. Results In total, 313 questionnaires were distributed and 305 (97.4%) were useable for the final analysis. The mean age for girls was 12.5 years and 13.2 years for boys. A large minority of both girls (26.8%) and boys (41.4%) had experienced sex and among the girls who had experienced sex, 51.2% reported that it was by force. The girls' mean score in the knowledge pre-test was 5.9, and 6.8 in post-test, which increased significantly (t = 7.9, p = 0.000). The mean behavior pre-test score was 25.8 and post-test was 26.6, which showed a significant increase (t = 3.0, p = 0.003). The boys' mean score in the knowledge pre-test was 6.4 and 7.0 for the post-test, which increased significantly (t = 4.5, p = 0.000). The mean behavior pre-test score was 25.6 and 26.4 in post-test, which showed a significant increase (t = 2.4, p = 0.019). However, the pre-test and post-test attitude scores showed no statistically significant difference for either girls or boys. Conclusions Teenagers have sexual experiences including sexual violence. Both of these phenomena are prevalent among school-going adolescents. The reproductive health program improved the students' knowledge and behavior about sexuality and decision-making after the program for both girls and boys. However, their attitudes about reproductive health were not likely to change based on the educational intervention as designed for this study. PMID:21707996
Kelay, Tanika; Chan, Kah Leong; Ako, Emmanuel; Yasin, Mohammad; Costopoulos, Charis; Gold, Matthew; Kneebone, Roger K; Malik, Iqbal S; Bello, Fernando
2017-01-01
Distributed Simulation is the concept of portable, high-fidelity immersive simulation. Here, it is used for the development of a simulation-based training programme for cardiovascular specialities. We present an evidence base for how accessible, portable and self-contained simulated environments can be effectively utilised for the modelling, development and testing of a complex training framework and assessment methodology. Iterative user feedback through mixed-methods evaluation techniques resulted in the implementation of the training programme. Four phases were involved in the development of our immersive simulation-based training programme: ( 1) initial conceptual stage for mapping structural criteria and parameters of the simulation training framework and scenario development ( n = 16), (2) training facility design using Distributed Simulation , (3) test cases with clinicians ( n = 8) and collaborative design, where evaluation and user feedback involved a mixed-methods approach featuring (a) quantitative surveys to evaluate the realism and perceived educational relevance of the simulation format and framework for training and (b) qualitative semi-structured interviews to capture detailed feedback including changes and scope for development. Refinements were made iteratively to the simulation framework based on user feedback, resulting in (4) transition towards implementation of the simulation training framework, involving consistent quantitative evaluation techniques for clinicians ( n = 62). For comparative purposes, clinicians' initial quantitative mean evaluation scores for realism of the simulation training framework, realism of the training facility and relevance for training ( n = 8) are presented longitudinally, alongside feedback throughout the development stages from concept to delivery, including the implementation stage ( n = 62). Initially, mean evaluation scores fluctuated from low to average, rising incrementally. This corresponded with the qualitative component, which augmented the quantitative findings; trainees' user feedback was used to perform iterative refinements to the simulation design and components (collaborative design), resulting in higher mean evaluation scores leading up to the implementation phase. Through application of innovative Distributed Simulation techniques, collaborative design, and consistent evaluation techniques from conceptual, development, and implementation stages, fully immersive simulation techniques for cardiovascular specialities are achievable and have the potential to be implemented more broadly.
A quantitative trait locus mixture model that avoids spurious LOD score peaks.
Feenstra, Bjarke; Skovgaard, Ib M
2004-01-01
In standard interval mapping of quantitative trait loci (QTL), the QTL effect is described by a normal mixture model. At any given location in the genome, the evidence of a putative QTL is measured by the likelihood ratio of the mixture model compared to a single normal distribution (the LOD score). This approach can occasionally produce spurious LOD score peaks in regions of low genotype information (e.g., widely spaced markers), especially if the phenotype distribution deviates markedly from a normal distribution. Such peaks are not indicative of a QTL effect; rather, they are caused by the fact that a mixture of normals always produces a better fit than a single normal distribution. In this study, a mixture model for QTL mapping that avoids the problems of such spurious LOD score peaks is presented. PMID:15238544
A quantitative trait locus mixture model that avoids spurious LOD score peaks.
Feenstra, Bjarke; Skovgaard, Ib M
2004-06-01
In standard interval mapping of quantitative trait loci (QTL), the QTL effect is described by a normal mixture model. At any given location in the genome, the evidence of a putative QTL is measured by the likelihood ratio of the mixture model compared to a single normal distribution (the LOD score). This approach can occasionally produce spurious LOD score peaks in regions of low genotype information (e.g., widely spaced markers), especially if the phenotype distribution deviates markedly from a normal distribution. Such peaks are not indicative of a QTL effect; rather, they are caused by the fact that a mixture of normals always produces a better fit than a single normal distribution. In this study, a mixture model for QTL mapping that avoids the problems of such spurious LOD score peaks is presented.
IS THE SUICIDE RATE A RANDOM WALK?
Yang, Bijou; Lester, David; Lyke, Jennifer; Olsen, Robert
2015-06-01
The yearly suicide rates for the period 1933-2010 and the daily suicide numbers for 1990 and 1991 were examined for whether the distribution of difference scores (from year to year and from day to day) fitted a normal distribution, a characteristic of stochastic processes that follow a random walk. If the suicide rate were a random walk, then any disturbance to the suicide rate would have a permanent effect and national suicide prevention efforts would likely fail. The distribution of difference scores from day to day (but not the difference scores from year to year) fitted a normal distribution and, therefore, were consistent with a random walk.
Al Nozha, Omar Mansour; Fadel, Hani T
2017-01-01
Taibah University offers regular nursing (RNP) and nursing bridging (NBP) bachelor programs. We evaluated student perception of the learning environment as one means of quality assurance. To assess nursing student perception of their educational environment, to compare the perceptions of regular and bridging students, and to compare the perceptions of students in the old and new curricula. Cross-sectional survey. College of Nursing at Taibah University, Madinah, Saudi Arabia. The Dundee Ready Educational Environment Measure (DREEM) instrument was distributed to over 714 nursing students to assess perception of the educational environment. Independent samples t test and Pearson's chi square were used to compare the programs and curricula. The DREEM inventory score. Of 714 students, 271 (38%) were RNP students and 443 (62%) were NBP students. The mean (standard deviation) DREEM score was 111 (25). No significant differences were observed between the programs except for the domain "academic self-perceptions" being higher in RNP students (P < .001). Higher mean DREEM scores were observed among students studying the new curriculum in the RNP (P < .001) and NBP (P > .05). Nursing students generally perceived their learning environment as more positive than negative. Regular students were more positive than bridging students. Students who experienced the new curriculum were more positive towards learning. The cross-sectional design and unequal gender and study level distributions may limit generalizability of the results. Longitudinal, large-scale studies with more even distributions of participant characteristics are needed.
NASA Astrophysics Data System (ADS)
Christensen, Hannah; Moroz, Irene; Palmer, Tim
2015-04-01
Forecast verification is important across scientific disciplines as it provides a framework for evaluating the performance of a forecasting system. In the atmospheric sciences, probabilistic skill scores are often used for verification as they provide a way of unambiguously ranking the performance of different probabilistic forecasts. In order to be useful, a skill score must be proper -- it must encourage honesty in the forecaster, and reward forecasts which are reliable and which have good resolution. A new score, the Error-spread Score (ES), is proposed which is particularly suitable for evaluation of ensemble forecasts. It is formulated with respect to the moments of the forecast. The ES is confirmed to be a proper score, and is therefore sensitive to both resolution and reliability. The ES is tested on forecasts made using the Lorenz '96 system, and found to be useful for summarising the skill of the forecasts. The European Centre for Medium-Range Weather Forecasts (ECMWF) ensemble prediction system (EPS) is evaluated using the ES. Its performance is compared to a perfect statistical probabilistic forecast -- the ECMWF high resolution deterministic forecast dressed with the observed error distribution. This generates a forecast that is perfectly reliable if considered over all time, but which does not vary from day to day with the predictability of the atmospheric flow. The ES distinguishes between the dynamically reliable EPS forecasts and the statically reliable dressed deterministic forecasts. Other skill scores are tested and found to be comparatively insensitive to this desirable forecast quality. The ES is used to evaluate seasonal range ensemble forecasts made with the ECMWF System 4. The ensemble forecasts are found to be skilful when compared with climatological or persistence forecasts, though this skill is dependent on region and time of year.
Newcomb, Tara L; Bruhn, Ann M; Ulmer, Loreta H; Diawara, Norou
2015-10-01
Mass fatality incidents can overwhelm local, state and national resources quickly. Dental hygienists are widely distributed and have the potential to increase response teams' capacity. However, appropriate training is required. The literature is void of addressing this type of training for dental hygienists and scant in dentistry. Hence, the purpose of this study was to assess one facet of such training: Whether the use of multimedia is likely to enhance educational outcomes related to mass fatality training. A randomized, double-blind, pre- and post-test design was used to evaluate the effectiveness of comparable educational modules for 2 groups: a control group (n=19) that received low media training and a treatment group (n=20) that received multimedia training. Participants were second-year, baccalaureate dental hygiene students. Study instruments included a multiple-choice examination, a clinical competency-based radiology lab scored via a standardized rubric, and an assessment of interest in mass fatality education as a specialty. ANOVA was used to analyze results. Participants' pre- and post-test scores and clinical competency-based radiology lab scores increased following both educational approaches. Interest in mass fatality training also increased significantly for all participants (p=0.45). There was no significant difference in pre- and post-test multiple choice scores (p=0.6455), interest (p=0.9133) or overall competency-based radiology lab scores (p=0.997) between groups. Various educational technique may be effective for mass fatality training. However, mass fatality training that incorporates multimedia is an appropriate avenue for training instruction. Continued research about multimedia's role in this specialty area is encouraged. Copyright © 2015 The American Dental Hygienists’ Association.
Job stress and burnout among urban and rural hospital physicians in Japan.
Saijo, Yasuaki; Chiba, Shigeru; Yoshioka, Eiji; Kawanishi, Yasuyuki; Nakagi, Yoshihiko; Ito, Toshihiro; Sugioka, Yoshihiko; Kitaoka-Higashiguchi, Kazuyo; Yoshida, Takahiko
2013-08-01
To elucidate the differences in job stress and burnout status of Japanese hospital physicians between large cities, small cities, and towns and villages. Cross-sectional study. Postal self-administered questionnaires were distributed to 2937 alumni of Asahikawa Medical University. Four hundred and twenty-two hospital physicians. The Brief Job Stress Questionnaire was used to evaluate job demand, job control and social support. The Japanese version of the Maslach Burnout Inventory-General Survey (MBI-GS) was used to evaluate burnout. An analysis of covariance was conducted on the mean scores on the Brief Job Stress Questionnaire and the MBI-GS scales after adjusting for sex, age and specialties. In adjusted analyses, the job demand score was significantly different among physicians in the three areas. In Bonferroni post-hoc tests, scores in large cities was significantly higher than those in small cities and towns and villages. The job control score showed a significant difference and a marginally significant trend, with large cities associated with lower job control. There were significant differences in support from supervisors and that from family/friends, and scores in large cities was significantly higher than those in small cities in the post-hoc test. There was a significant effect on the exhaustion scale of the MBI-GS, with large cities associated with higher exhaustion, and scores in large cities was significantly higher than those in small cities. Urban hospital physicians had more job demand, less job control and exhaustion caused by burnout, and rural hospital physicians had less social support. © 2013 The Authors. Australian Journal of Rural Health © National Rural Health Alliance Inc.
Hermida-Ameijeiras, Á; López-Paz, J E; Riveiro-Cruz, M A; Calvo-Gómez, C
2016-01-01
Carotid intima-media thickness (cIMT) has been suggested as a further tool for risk function charts. The aim of this study was to describethe relationship between cIMT and cardiovascular risk (CVR) estimation according to Framingham-REGICOR and SCORE equations. Observational, cross-sectional cohort study from 362 hypertensive subjects. Demographic and clinical information were collected as well as laboratory, ultrasonographic and CVR estimation by the Framingham-REGICOR and SCORE functions. Statistical analysis was performed using SPSS software (version 20,0). To analyze the data, statistical tests such as Chi-square, T-test, ANOVA, and Pearson correlation coefficient were used. According to both functions, differences on mean cIMT were found between low CVR group and intermediate to high groups. No differences were found between intermediate and high risk groups (cIMT: 0,73mm low risk patients vs. 0,89 or 0,88mm respectively according to SCORE function and cIMT: 0,73 vs. 0,85 or 0,87mm respectively according to Framingham-REGICOR function). cIMT correlated positively with CVR estimation according to both SCORE (r=0,421; P<.01), and Framingham-REGICOR functions (r=0,363; P<.01). cIMT correlates positively with CVR estimated by SCORE and Framingham-REGICOR functions. cIMT in those subjects at intermediate risk is similar to those at high risk. Our findings highlight the importance of carotid ultrasound in identifying silent target-organ damage in those patients at intermediate CVR. Copyright © 2015 SEHLELHA. Published by Elsevier España, S.L.U. All rights reserved.
Silberstein, M.; Tzemach, A.; Dovgolevsky, N.; Fishelson, M.; Schuster, A.; Geiger, D.
2006-01-01
Computation of LOD scores is a valuable tool for mapping disease-susceptibility genes in the study of Mendelian and complex diseases. However, computation of exact multipoint likelihoods of large inbred pedigrees with extensive missing data is often beyond the capabilities of a single computer. We present a distributed system called “SUPERLINK-ONLINE,” for the computation of multipoint LOD scores of large inbred pedigrees. It achieves high performance via the efficient parallelization of the algorithms in SUPERLINK, a state-of-the-art serial program for these tasks, and through the use of the idle cycles of thousands of personal computers. The main algorithmic challenge has been to efficiently split a large task for distributed execution in a highly dynamic, nondedicated running environment. Notably, the system is available online, which allows computationally intensive analyses to be performed with no need for either the installation of software or the maintenance of a complicated distributed environment. As the system was being developed, it was extensively tested by collaborating medical centers worldwide on a variety of real data sets, some of which are presented in this article. PMID:16685644
Rasch model based analysis of the Force Concept Inventory
NASA Astrophysics Data System (ADS)
Planinic, Maja; Ivanjek, Lana; Susac, Ana
2010-06-01
The Force Concept Inventory (FCI) is an important diagnostic instrument which is widely used in the field of physics education research. It is therefore very important to evaluate and monitor its functioning using different tools for statistical analysis. One of such tools is the stochastic Rasch model, which enables construction of linear measures for persons and items from raw test scores and which can provide important insight in the structure and functioning of the test (how item difficulties are distributed within the test, how well the items fit the model, and how well the items work together to define the underlying construct). The data for the Rasch analysis come from the large-scale research conducted in 2006-07, which investigated Croatian high school students’ conceptual understanding of mechanics on a representative sample of 1676 students (age 17-18 years). The instrument used in research was the FCI. The average FCI score for the whole sample was found to be (27.7±0.4)% , indicating that most of the students were still non-Newtonians at the end of high school, despite the fact that physics is a compulsory subject in Croatian schools. The large set of obtained data was analyzed with the Rasch measurement computer software WINSTEPS 3.66. Since the FCI is routinely used as pretest and post-test on two very different types of population (non-Newtonian and predominantly Newtonian), an additional predominantly Newtonian sample ( N=141 , average FCI score of 64.5%) of first year students enrolled in introductory physics course at University of Zagreb was also analyzed. The Rasch model based analysis suggests that the FCI has succeeded in defining a sufficiently unidimensional construct for each population. The analysis of fit of data to the model found no grossly misfitting items which would degrade measurement. Some items with larger misfit and items with significantly different difficulties in the two samples of students do require further examination. The analysis revealed some problems with item distribution in the FCI and suggested that the FCI may function differently in non-Newtonian and predominantly Newtonian population. Some possible improvements of the test are suggested.
A Directed Acyclic Graph-Large Margin Distribution Machine Model for Music Symbol Classification
Wen, Cuihong; Zhang, Jing; Rebelo, Ana; Cheng, Fanyong
2016-01-01
Optical Music Recognition (OMR) has received increasing attention in recent years. In this paper, we propose a classifier based on a new method named Directed Acyclic Graph-Large margin Distribution Machine (DAG-LDM). The DAG-LDM is an improvement of the Large margin Distribution Machine (LDM), which is a binary classifier that optimizes the margin distribution by maximizing the margin mean and minimizing the margin variance simultaneously. We modify the LDM to the DAG-LDM to solve the multi-class music symbol classification problem. Tests are conducted on more than 10000 music symbol images, obtained from handwritten and printed images of music scores. The proposed method provides superior classification capability and achieves much higher classification accuracy than the state-of-the-art algorithms such as Support Vector Machines (SVMs) and Neural Networks (NNs). PMID:26985826
A Directed Acyclic Graph-Large Margin Distribution Machine Model for Music Symbol Classification.
Wen, Cuihong; Zhang, Jing; Rebelo, Ana; Cheng, Fanyong
2016-01-01
Optical Music Recognition (OMR) has received increasing attention in recent years. In this paper, we propose a classifier based on a new method named Directed Acyclic Graph-Large margin Distribution Machine (DAG-LDM). The DAG-LDM is an improvement of the Large margin Distribution Machine (LDM), which is a binary classifier that optimizes the margin distribution by maximizing the margin mean and minimizing the margin variance simultaneously. We modify the LDM to the DAG-LDM to solve the multi-class music symbol classification problem. Tests are conducted on more than 10000 music symbol images, obtained from handwritten and printed images of music scores. The proposed method provides superior classification capability and achieves much higher classification accuracy than the state-of-the-art algorithms such as Support Vector Machines (SVMs) and Neural Networks (NNs).
Analysis of the Korean Navy Selection Process for the Naval Post Graduate School
1988-06-01
OUTCOME OF ECL TESTING SCORE..........................54 C. OUTCOME OF TOEFL TESTING SCORE.......................55 D. PLOT OF NPS GRADE WITH ECL...TESTING SCORE..............55 E. PLOT OF NPS GRADE WIHT NA GRADE......................56 F. PLOT OF NPS GRADE WITH TOEFL TESTING SCORE............56...OF ECL TESTING SCORE ............. 30 Table S. EXPECTANCY TABLE OF NAG ............................ 31 Table 9. EXPECTANCY TABLE OF TOEFL TESTING SCORE
Executive function assessment in New Zealand 2-year olds born at risk of neonatal hypoglycemia.
Ansell, Judith M; Wouldes, Trecia A; Harding, Jane E
2017-01-01
A growing number of babies are born with perinatal risk factors that may impair later development. These children are often assessed at 2 years to help predict outcome and direct support services. Executive function is an important predictor of academic achievement and behavior, but there are limited assessments of executive function in 2-year-olds and few have been tested in at-risk populations. Therefore, we developed a battery of four age-appropriate tasks to assess executive function in 2-year-olds. At 24 months' corrected age 368 children completed tasks assessing attention, inhibition, working memory and cognitive flexibility. Scores on different tasks were weakly correlated, suggesting that they measured separate aspects of executive function, with combined scores for this cohort approximating a normal distribution. Significantly more boys (67%) than girls (57%) were unable to inhibit their behavior on the Snack Delay Task and girls (M = 3.24, SD = 2.4) had higher mean scores than boys (M = 2.7, SD = 2.7) on the Ducks and Buckets Reverse Categorization Task of working memory. Performance was significantly affected by family socioeconomic status. Mean scores were lower on all four individual tasks and on the global score of overall performance in children from a low household income (<$40,000) compared to those from medium ($40,001-$70,000) and high income households (>$70,001). Maternal education was only associated with scores on the working memory task and the global score; and a measure of neighborhood deprivation was only associated with scores on the two inhibitory tasks and the global score. Our findings confirm the feasibility of assessing executive function in 2-year-olds, and its ability to discriminate effects of socioeconomic status, a common confounder in child development research. Further development and standardization of this test battery comparing at-risk children with a normative population would provide a much-needed measure of executive function in early childhood.
Executive function assessment in New Zealand 2-year olds born at risk of neonatal hypoglycemia
2017-01-01
A growing number of babies are born with perinatal risk factors that may impair later development. These children are often assessed at 2 years to help predict outcome and direct support services. Executive function is an important predictor of academic achievement and behavior, but there are limited assessments of executive function in 2-year-olds and few have been tested in at-risk populations. Therefore, we developed a battery of four age-appropriate tasks to assess executive function in 2-year-olds. At 24 months’ corrected age 368 children completed tasks assessing attention, inhibition, working memory and cognitive flexibility. Scores on different tasks were weakly correlated, suggesting that they measured separate aspects of executive function, with combined scores for this cohort approximating a normal distribution. Significantly more boys (67%) than girls (57%) were unable to inhibit their behavior on the Snack Delay Task and girls (M = 3.24, SD = 2.4) had higher mean scores than boys (M = 2.7, SD = 2.7) on the Ducks and Buckets Reverse Categorization Task of working memory. Performance was significantly affected by family socioeconomic status. Mean scores were lower on all four individual tasks and on the global score of overall performance in children from a low household income (<$40,000) compared to those from medium ($40,001-$70,000) and high income households (>$70,001). Maternal education was only associated with scores on the working memory task and the global score; and a measure of neighborhood deprivation was only associated with scores on the two inhibitory tasks and the global score. Our findings confirm the feasibility of assessing executive function in 2-year-olds, and its ability to discriminate effects of socioeconomic status, a common confounder in child development research. Further development and standardization of this test battery comparing at-risk children with a normative population would provide a much-needed measure of executive function in early childhood. PMID:29166407
ERIC Educational Resources Information Center
Zhang, Mo; Williamson, David M.; Breyer, F. Jay; Trapani, Catherine
2012-01-01
This article describes two separate, related studies that provide insight into the effectiveness of "e-rater" score calibration methods based on different distributional targets. In the first study, we developed and evaluated a new type of "e-rater" scoring model that was cost-effective and applicable under conditions of absent human rating and…
ERIC Educational Resources Information Center
Smith, Ruth Suessmuth
The purpose of this study was to determine the effect of the Macmillan Tutorial System when used as a supplement to regular classroom instruction in beginning reading. The experimental subjects were first grade children who ranked in the lower third of the distribution scores on the Macmillan Reading Readiness Test or who, in the opinion of their…
Do Examinees Understand Score Reports for Alternate Methods of Scoring Computer Based Tests?
ERIC Educational Resources Information Center
Whittaker, Tiffany A.; Williams, Natasha J.; Dodd, Barbara G.
2011-01-01
This study assessed the interpretability of scaled scores based on either number correct (NC) scoring for a paper-and-pencil test or one of two methods of scoring computer-based tests: an item pattern (IP) scoring method and a method based on equated NC scoring. The equated NC scoring method for computer-based tests was proposed as an alternative…
Corgnet, Brice; Espín, Antonio M.; Hernán-González, Roberto
2017-01-01
Groups make decisions on both the production and the distribution of resources. These decisions typically involve a tension between increasing the total level of group resources (i.e. social efficiency) and distributing these resources among group members (i.e. individuals' relative shares). This is the case because the redistribution process may destroy part of the resources, thus resulting in socially inefficient allocations. Here we apply a dual-process approach to understand the cognitive underpinnings of this fundamental tension. We conducted a set of experiments to examine the extent to which different allocation decisions respond to intuition or deliberation. In a newly developed approach, we assess intuition and deliberation at both the trait level (using the Cognitive Reflection Test, henceforth CRT) and the state level (through the experimental manipulation of response times). To test for robustness, experiments were conducted in two countries: the USA and India. Despite absolute-level differences across countries, in both locations we show that: (i) time pressure and low CRT scores are associated with individuals' concerns for their relative shares and (ii) time delay and high CRT scores are associated with individuals' concerns for social efficiency. These findings demonstrate that deliberation favours social efficiency by overriding individuals' intuitive tendency to focus on relative shares. PMID:28386421
Usami, Masahide; Iwadare, Yoshitaka; Watanabe, Kyota; Kodaira, Masaki; Ushijima, Hirokage; Tanaka, Tetsuya; Harada, Maiko; Tanaka, Hiromi; Sasaki, Yoshinori; Saito, Kazuhiko
2014-01-01
On March 11, 2011, Japan was struck by a massive earthquake and tsunami. The tsunami caused tremendous damage and traumatized several people, including children. The aim of this study was to assess changes in traumatic symptoms 8, 20, and 30 months of the 2011 tsunami. The study comprised three groups. Copies of the Post-Traumatic Stress Symptoms for Children 15 items (PTSSC-15), a self-rating questionnaire on traumatic symptoms, were distributed to 12,524 children (8-month period), 12,193 children (20-month period), and 11,819 children (30-month period). An effective response of children 8 months, 20 months, and 30 month after the disaster was obtained in 11,639 (92.9%), 10,597 (86.9%), and 10,812 children (91.4%), respectively. We calculated the total score, PTSD subscale, and Depression subscale of PTSSC-15. We calculated the total score, PTSD subscale, and Depression subscale of PTSSC-15. The PTSSC-15 total score and PTSD subscale of children belonging to 1st-9th grade groups who were tested 30 and 20 months after the tsunami significantly decreased compared with those of children tested 8 months after the tsunami. The PTSSC-15 total score and PTSD subscale of children in 1st-9th grade groups tested after 30 months did not decrease significantly compared with those of children tested after 20 months. The PTSSC-15 Depression subscale and PTSD subscale of children in 1st-9th grade groups tested after 30 months significantly decreased compared with those of children tested 8 months after the tsunami. The PTSSC-15 Depression subscale of children in 1st-9th grade groups evaluated after 30 months significantly decreased compared with those of children evaluated after 20 months. This study demonstrates that the traumatic symptoms of children who survived the massive tsunami improved with time. Nonetheless, the traumatic symptoms, which in some cases did not improve with time.
Yildirim-Gorter, Margina; Groot, Djahill; Hermens, Linda; Diesfeldt, Han; Scherder, Erik
2018-06-01
Alzheimer's Dementia (AD) may be associated with symptoms of depression. In AD, problems of language expression or understanding will arise sooner or later. The aim of this study was to determine whether elderly persons with AD, with or without a language disorder, experience difficulties understanding and answering mood related questions. In addition to this, it was our object to test the validity of the answers of nurses as informants, on the mood of an elderly client. 53 elderly persons, living in care homes, and their nurses, took part in the study. 25 participants had been diagnosed with Alzheimer's disease, 28 participants had no cognitive impairment. Language skills were tested using the SAN-test (Stichting Afasie Nederland) and subtests of the Aachen Aphasia Test (AAT). Mood was assessed with the Beck Depression Inventory-second edition (BDI-II-NL) and the Geriatric Depression Scale (GDS-30). There were no significant differences in scores on the mood related questionnaires between participants without cognitive impairment and participants with Alzheimer's disease, with or without a language disorder. The correlation between self- and informant-rating was very limited. In general, nurses reported more depressive symptoms than the elderly persons did themselves. Disparities between self- and informant-ratings varied from informant scores overestimating low self-ratings of depression to informant scores underestimating high self-ratings. Alzheimer's disease, whether or not it is complicated by a language disorder, does not disturb the normal score distribution on either test (BDI or GDS). This means that elderly persons with Alzheimer's disease are capable of adequately answering questions related to their own mood. However, considerable discrepancies were found between observer- and self-ratings of emotional wellbeing. Therefore it is important to not only take into account the information of an informant when testing for depression, but also the elderly person's own assessment of their mood.
McDonnell, Jeffrey; Haddow, Lewis; Daskalopoulou, Marina; Lampe, Fiona; Speakman, Andrew; Gilson, Richard; Phillips, Andrew; Sherr, Lorraine; Wayal, Sonali; Harrison, John; Antinori, Andrea; Maruff, Paul; Schembri, Adrian; Johnson, Margaret; Collins, Simon; Rodger, Alison
2014-10-01
To determine the prevalence of neurocognitive impairment (NCI) in UK HIV-positive and HIV-negative men who have sex with men (MSM). HIV-positive and HIV-negative participants were recruited to a cross-sectional study from 2 London clinics and completed computer-assisted neuropsychological tests and questionnaires of depression, anxiety, and activities of daily living. Published definitions of HIV-associated neurocognitive disorders (HAND) and global deficit scores were used. Age- and education-adjusted neuropsychological test scores were directly compared with reference population data. A total of 248 HIV-positive and 45 HIV-negative MSM participated. In the HIV-positive group, median time since diagnosis was 9.4 years, median CD4 count was 550 cells per cubic millimeter, and 88% were on antiretroviral therapy. Prevalence of HAND was 21.0% in HIV-positive MSM (13.7% asymptomatic neurocognitive impairment, 6.5% mild neurocognitive disorder, and 0.8% HIV-associated dementia). Using a global deficit score threshold of 0.5, the prevalence of NCI was 31.5% (when averaged over 5 neuropsychological domains) and 40.3% (over 10 neuropsychological test scores). These results were not significantly different from the HIV-negative study sample. No consistent pattern of impairment was seen in HIV-positive patients relative to general male population data (n = 380). We found a prevalence of HAND and degree of impairment on neuropsychological testing of HIV-positive MSM that could represent a normal population distribution. These findings suggest that NCI may be overestimated in HIV-positive MSM, and that the attribution of NCI to HIV infection implied by the term HAND requires revision.
Bassani, Diego G; Corsi, Daniel J; Gaffey, Michelle F; Barros, Aluisio J D
2014-01-01
Worse health outcomes including higher morbidity and mortality are most often observed among the poorest fractions of a population. In this paper we present and validate national, regional and state-level distributions of national wealth index scores, for urban and rural populations, derived from household asset data collected in six survey rounds in India between 1992-3 and 2007-8. These new indices and their sub-national distributions allow for comparative analyses of a standardized measure of wealth across time and at various levels of population aggregation in India. Indices were derived through principal components analysis (PCA) performed using standardized variables from a correlation matrix to minimize differences in variance. Valid and simple indices were constructed with the minimum number of assets needed to produce scores with enough variability to allow definition of unique decile cut-off points in each urban and rural area of all states. For all indices, the first PCA components explained between 36% and 43% of the variance in household assets. Using sub-national distributions of national wealth index scores, mean height-for-age z-scores increased from the poorest to the richest wealth quintiles for all surveys, and stunting prevalence was higher among the poorest and lower among the wealthiest. Urban and rural decile cut-off values for India, for the six regions and for the 24 major states revealed large variability in wealth by geographical area and level, and rural wealth score gaps exceeded those observed in urban areas. The large variability in sub-national distributions of national wealth index scores indicates the importance of accounting for such variation when constructing wealth indices and deriving score distribution cut-off points. Such an approach allows for proper within-sample economic classification, resulting in scores that are valid indicators of wealth and correlate well with health outcomes, and enables wealth-related analyses at whichever geographical area and level may be most informative for policy-making processes.
Smoothing of the bivariate LOD score for non-normal quantitative traits.
Buil, Alfonso; Dyer, Thomas D; Almasy, Laura; Blangero, John
2005-12-30
Variance component analysis provides an efficient method for performing linkage analysis for quantitative traits. However, type I error of variance components-based likelihood ratio testing may be affected when phenotypic data are non-normally distributed (especially with high values of kurtosis). This results in inflated LOD scores when the normality assumption does not hold. Even though different solutions have been proposed to deal with this problem with univariate phenotypes, little work has been done in the multivariate case. We present an empirical approach to adjust the inflated LOD scores obtained from a bivariate phenotype that violates the assumption of normality. Using the Collaborative Study on the Genetics of Alcoholism data available for the Genetic Analysis Workshop 14, we show how bivariate linkage analysis with leptokurtotic traits gives an inflated type I error. We perform a novel correction that achieves acceptable levels of type I error.
Hughes, Carmel M; Donnelly, Ailis; Moyes, Simon A; Peri, Kathy; Scahill, Shane; Chen, Charlotte; McCormack, Brendan; Kerse, Ngaire
2012-05-01
In this study, we sought to measure treatment culture (beliefs, values, and normative practices associated with medication prescribing and administration) in two samples of nursing homes (in Northern Ireland and New Zealand) and to document the range of scoring achieved by staff in both countries. Responses between nurse managers and registered nurses were also compared. A cross-sectional study using an adapted treatment culture questionnaire was distributed by mail (in June and September 2008) to 159 nursing homes in Northern Ireland and completed by the nurse manager and registered nurses. In New Zealand, staff in 14 facilities participated and questionnaires were distributed by a research assistant who visited the homes (March to November 2008). Completed questionnaires were scored using a prespecified scoring system, with a higher score indicating a more resident-centered treatment culture and a lower score indicating a more traditional approach to care. The maximum score possible was 75. Scores were compared between countries and between different categories of staff. Views were also sought and knowledge tested (from structured questions) on the use of psychotropic prescribing in the nursing home environment. The response rates for nurse managers and nurses in Northern Ireland were 35.5% and 10.1%, respectively; in New Zealand, the response rate was 90.9% for managers and 71% for nurses. The mean score for the Northern Ireland and New Zealand homes was 39.5 and 39.1, respectively (P > .05). There were also no differences between scores achieved by nurse managers and registered nurses between and across both countries. There were some cross-country differences on the approach to challenging behavior in residents and nurses (in both countries) were more likely than nurse managers to report (incorrectly) that haloperidol is indicated for short-term insomnia. This quantitative assessment has raised interesting issues in relation to the measurement of treatment culture in the nursing home setting in two countries. Further insights into the importance of treatment culture will be pursued in qualitative studies. Copyright © 2012 American Medical Directors Association, Inc. Published by Elsevier Inc. All rights reserved.
Tian, Guo-Liang; Li, Hui-Qiong
2017-08-01
Some existing confidence interval methods and hypothesis testing methods in the analysis of a contingency table with incomplete observations in both margins entirely depend on an underlying assumption that the sampling distribution of the observed counts is a product of independent multinomial/binomial distributions for complete and incomplete counts. However, it can be shown that this independency assumption is incorrect and can result in unreliable conclusions because of the under-estimation of the uncertainty. Therefore, the first objective of this paper is to derive the valid joint sampling distribution of the observed counts in a contingency table with incomplete observations in both margins. The second objective is to provide a new framework for analyzing incomplete contingency tables based on the derived joint sampling distribution of the observed counts by developing a Fisher scoring algorithm to calculate maximum likelihood estimates of parameters of interest, the bootstrap confidence interval methods, and the bootstrap testing hypothesis methods. We compare the differences between the valid sampling distribution and the sampling distribution under the independency assumption. Simulation studies showed that average/expected confidence-interval widths of parameters based on the sampling distribution under the independency assumption are shorter than those based on the new sampling distribution, yielding unrealistic results. A real data set is analyzed to illustrate the application of the new sampling distribution for incomplete contingency tables and the analysis results again confirm the conclusions obtained from the simulation studies.
Raymond, Mark R; Clauser, Brian E; Furman, Gail E
2010-10-01
The use of standardized patients to assess communication skills is now an essential part of assessing a physician's readiness for practice. To improve the reliability of communication scores, it has become increasingly common in recent years to use statistical models to adjust ratings provided by standardized patients. This study employed ordinary least squares regression to adjust ratings, and then used generalizability theory to evaluate the impact of these adjustments on score reliability and the overall standard error of measurement. In addition, conditional standard errors of measurement were computed for both observed and adjusted scores to determine whether the improvements in measurement precision were uniform across the score distribution. Results indicated that measurement was generally less precise for communication ratings toward the lower end of the score distribution; and the improvement in measurement precision afforded by statistical modeling varied slightly across the score distribution such that the most improvement occurred in the upper-middle range of the score scale. Possible reasons for these patterns in measurement precision are discussed, as are the limitations of the statistical models used for adjusting performance ratings.
ERIC Educational Resources Information Center
Ruscio, John; Walters, Glenn D.
2009-01-01
Factor-analytic research is common in the study of constructs and measures in psychological assessment. Latent factors can represent traits as continuous underlying dimensions or as discrete categories. When examining the distributions of estimated scores on latent factors, one would expect unimodal distributions for dimensional data and bimodal…
A multicenter examination and strategic revisions of the Yale Global Tic Severity Scale.
McGuire, Joseph F; Piacentini, John; Storch, Eric A; Murphy, Tanya K; Ricketts, Emily J; Woods, Douglas W; Walkup, John W; Peterson, Alan L; Wilhelm, Sabine; Lewin, Adam B; McCracken, James T; Leckman, James F; Scahill, Lawrence
2018-05-08
To examine the internal consistency and distribution of the Yale Global Tic Severity Scale (YGTSS) scores to inform modification of the measure. This cross-sectional study included 617 participants with a tic disorder (516 children and 101 adults), who completed an age-appropriate diagnostic interview and the YGTSS to evaluate tic symptom severity. The distributions of scores on YGTSS dimensions were evaluated for normality and skewness. For dimensions that were skewed across motor and phonic tics, a modified Delphi consensus process was used to revise selected anchor points. Children and adults had similar clinical characteristics, including tic symptom severity. All participants were examined together. Strong internal consistency was identified for the YGTSS Motor Tic score (α = 0.80), YGTSS Phonic Tic score (α = 0.87), and YGTSS Total Tic score (α = 0.82). The YGTSS Total Tic and Impairment scores exhibited relatively normal distributions. Several subscales and individual item scales departed from a normal distribution. Higher scores were more often used on the Motor Tic Number, Frequency, and Intensity dimensions and the Phonic Tic Frequency dimension. By contrast, lower scores were more often used on Motor Tic Complexity and Interference, and Phonic Tic Number, Intensity, Complexity, and Interference. The YGTSS exhibits good internal consistency across children and adults. The parallel findings across Motor and Phonic Frequency, Complexity, and Interference dimensions prompted minor revisions to the anchor point description to promote use of the full range of scores in each dimension. Specific minor revisions to the YGTSS Phonic Tic Symptom Checklist were also proposed. © 2018 American Academy of Neurology.
Al-Ghatani, Ali M; Obonsawin, Marc C; Binshaig, Basmah A; Al-Moutaery, Khalaf R
2011-01-01
There are 2 aims for this study: first, to collect normative data for the Wisconsin Card Sorting Test (WCST), Stroop test, Test of Non-verbal Intelligence (TONI-3), Picture Completion (PC) and Vocabulary (VOC) sub-test of the Wechsler Adult Intelligence Scale-Revised for use in a Saudi Arabian culture, and second, to use the normative data provided to generate the regression equations. To collect the normative data and generate the regression equations, 198 healthy individuals were selected to provide a representative distribution for age, gender, years of education, and socioeconomic class. The WCST, Stroop test, TONI-3, PC, and VOC were administrated to the healthy individuals. This study was carried out at the Department of Clinical Neurosciences, Riyadh Military Hospital, Riyadh, Kingdom of Saudi Arabia from January 2000 to July 2002. Normative data were obtained for all tests, and tables were constructed to interpret scores for different age groups. Regression equations to predict performance on the 3 tests of frontal function from scores on tests of fluid (TONI-3) and premorbid intelligence were generated from the data from the healthy individuals. The data collected in this study provide normative tables for 3 tests of frontal lobe function and for tests of general intellectual ability for use in Saudi Arabia. The data also provide a method to estimate pre-injury ability without the use of verbally based tests.
Dale, Philip S; Rice, Mabel L; Rimfeld, Kaili; Hayiou-Thomas, Marianna E
2018-01-22
There is a need for well-defined language phenotypes suitable for adolescents in twin studies and other large-scale research projects. Rice, Hoffman, and Wexler (2009) have developed a grammatical judgment measure as a clinical marker of language impairment, which has an extended developmental range to adolescence. We conducted the first twin analysis, along with associated phenotypic analyses of validity, of an abridged, 20-item version of this grammatical judgment measure (GJ-20), based on telephone administration using prerecorded stimuli to 405 pairs of 16-year-olds (148 monozygotic and 257 dizygotic) drawn from the Twins Early Development Study (Haworth, Davis, & Plomin, 2012). The distribution of scores is markedly skewed negatively, as expected for a potential clinical marker. Low performance on GJ-20 is associated with lower maternal education, reported learning disability (age 7 years), and low scores on language tests administered via the Twins Early Development Study (age 16 years) as well as General Certificate of Secondary Education English and Math examination performance (age 16 years). Liability threshold estimates for the genetic influence on low performance on GJ-20 are substantial, ranging from 36% with a lowest 10% criterion to 74% for a lowest 5% criterion. The heritability of GJ-20 scores, especially at more extreme cutoffs, along with the score distribution and association with other indicators of language impairments, provides additional evidence for the potential value of this measure as a clinical marker of specific language impairment.
Effect of the lung allocation score on lung transplantation in the United States.
Egan, Thomas M; Edwards, Leah B
2016-04-01
On May 4, 2005, the system for allocation of deceased donor lungs for transplant in the United States changed from allocation based on waiting time to allocation based on the lung allocation score (LAS). We sought to determine the effect of the LAS on lung transplantation in the United States. Organ Procurement and Transplantation Network data on listed and transplanted patients were analyzed for 5 calendar years before implementation of the LAS (2000-2004), and compared with data from 6 calendar years after implementation (2006-2011). Counts were compared between eras using the Wilcoxon rank sum test. The rates of transplant increase within each era were compared using an F-test. Survival rates computed using the Kaplan-Meier method were compared using the log-rank test. After introduction of the LAS, waitlist deaths decreased significantly, from 500/year to 300/year; the number of lung transplants increased, with double the annual increase in rate of lung transplants, despite no increase in donors; the distribution of recipient diagnoses changed dramatically, with significantly more patients with fibrotic lung disease receiving transplants; age of recipients increased significantly; and 1-year survival had a small but significant increase. Allocating lungs for transplant based on urgency and benefit instead of waiting time was associated with fewer waitlist deaths, more transplants performed, and a change in distribution of recipient diagnoses to patients more likely to die on the waiting list. Copyright © 2016 International Society for Heart and Lung Transplantation. All rights reserved.
ERIC Educational Resources Information Center
Wingersky, Marilyn S.; and others
1969-01-01
One in a series of nine articles in a section entitled, "Electronic Computer Program and Accounting Machine Procedures. Research supported in part by contract Nonr-2752(00) from the Office of Naval Research.
Kuselman, Ilya; Pennecchi, Francesca; Epstein, Malka; Fajgelj, Ales; Ellison, Stephen L R
2014-12-01
Monte Carlo simulation of expert judgments on human errors in a chemical analysis was used for determination of distributions of the error quantification scores (scores of likelihood and severity, and scores of effectiveness of a laboratory quality system in prevention of the errors). The simulation was based on modeling of an expert behavior: confident, reasonably doubting and irresolute expert judgments were taken into account by means of different probability mass functions (pmfs). As a case study, 36 scenarios of human errors which may occur in elemental analysis of geological samples by ICP-MS were examined. Characteristics of the score distributions for three pmfs of an expert behavior were compared. Variability of the scores, as standard deviation of the simulated score values from the distribution mean, was used for assessment of the score robustness. A range of the score values, calculated directly from elicited data and simulated by a Monte Carlo method for different pmfs, was also discussed from the robustness point of view. It was shown that robustness of the scores, obtained in the case study, can be assessed as satisfactory for the quality risk management and improvement of a laboratory quality system against human errors. Copyright © 2014 Elsevier B.V. All rights reserved.
Hojat, Mohammadreza; Gonnella, Joseph S
2015-01-01
This study was designed to provide typical descriptive statistics, score distributions and percentile ranks of the Jefferson Scale of Empathy-Medical Student version (JSE-S) of male and female medical school matriculants to serve as proxy norm data and tentative cutoff scores. The participants were 2,637 students (1,336 women and 1,301 men) who matriculated at Sidney Kimmel (formerly Jefferson) Medical College between 2002 and 2012, and completed the JSE at the beginning of medical school. Information extracted from descriptive statistics, score distributions and percentile ranks for male and female matriculants were used to develop proxy norm data and tentative cutoff scores. The score distributions of the JSE tended to be moderately skewed and platykurtic. Women obtained a significantly higher mean score (116.2 ± 9.7) than men (112.3 ± 10.8) on the JSE-S (t2,635 = 9.9, p < 0.01). It was suggested that percentile ranks can be used as proxy norm data. The tentative cutoff score to identify low scorers was ≤ 95 for men and ≤ 100 for women. Our findings provide norm data and cutoff scores for admission decisions under certain conditions and for identifying students in need of enhancing their empathy. © 2015 S. Karger AG, Basel.
Retrospective study of a TTR FAP cohort to modify NIS+7 for therapeutic trials.
Suanprasert, N; Berk, J L; Benson, M D; Dyck, P J B; Klein, C J; Gollob, J A; Bettencourt, B R; Karsten, V; Dyck, P J
2014-09-15
Protein stabilization and oligonucleotide therapies are being tested in transthyretin amyloid polyneuropathy (TTR FAP) trials. From retrospective analysis of 97 untreated TTR FAP patients, we test the adequacy of Neuropathy Impairment Score+7 tests (NIS+7) and modifications to comprehensively score impairments for use in such therapeutic trials. Our data confirms that TTR FAP usually is a sensorimotor polyneuropathy with autonomic features which usually is symmetric, length dependent, lower limb predominant and progressive. NIS+7 adequately assesses weakness and muscle stretch reflexes without ceiling effects but not sensation loss, autonomic dysfunction or nerve conduction abnormalities. Three modifications of NIS+7 are suggested: 1) use of Smart Somatotopic Quantitative Sensation Testing (S ST QSTing); 2) choice of new autonomic assessments, e.g., sudomotor testing of distributed anatomical sites; and 3) use of only compound muscle action potential amplitudes (of ulnar, peroneal and tibial nerves) and sensory nerve action potentials of ulnar and sural nerve - than the previously recommended attributes suggested for the sensitive detection of diabetic sensorimotor polyneuropathy. These modifications of NIS+7 if used in therapeutic trials should improve characterization and quantification of sensation and autonomic impairment in TTR FAP and provide better nerve conduction tests. Copyright © 2014 Elsevier B.V. All rights reserved.
Complex dynamics in the distribution of players’ scoring performance in Rugby Union world cups
NASA Astrophysics Data System (ADS)
Seuront, Laurent
2013-09-01
The evolution of the scoring performance of Rugby Union players is investigated over the seven rugby world cups (RWC) that took place from 1987 to 2011, and a specific attention is given to how they may have been impacted by the switch from amateurism to professionalism that occurred in 1995. The distribution of the points scored by individual players, Ps, ranked in order of performance were well described by the simplified canonical law Ps∝(, where r is the rank, and ϕ and α are the parameters of the distribution. The parameter α did not significantly change from 1987 to 2007 (α=0.92±0.03), indicating a negligible effect of professionalism on players’ scoring performance. In contrast, the parameter ϕ significantly increased from ϕ=1.32 for 1987 RWC, ϕ=2.30 for 1999 to 2003 RWC and ϕ=5.60 for 2007 RWC, suggesting a progressive decrease in the relative performance of the best players. Finally, the sharp decreases observed in both α(α=0.38) and ϕ(ϕ=0.70) in the 2011 RWC indicate a more even distribution of the performance of individuals among scorers, compared to the more heterogeneous distributions observed from 1987 to 2007, and suggest a sharp increase in the level of competition leading to an increase in the average quality of players and a decrease in the relative skills of the top players. Note that neither α nor ϕ significantly correlate with traditional performance indicators such as the number of points scored by the best players, the number of games played by the best players, the number of points scored by the team of the best players or the total number of points scored over each RWC. This indicates that the dynamics of the scoring performance of Rugby Union players is influenced by hidden processes hitherto inaccessible through standard performance metrics; this suggests that players’ scoring performance is connected to ubiquitous phenomena such as anomalous diffusion.
Govindarajan, Rangaswamy; Posey, James; Chao, Calvin Y; Lu, Ruixiao; Jadhav, Trafina; Javed, Ahmed Y; Javed, Awais; Mahmoud, Fade A; Osarogiagbon, Raymond U; Manne, Upender
2016-06-18
African American (AA) colon cancer patients have a worse prognosis than Caucasian (CA) colon cancer patients, however, reasons for this disparity are not well understood. To determine if tumor biology might contribute to differential prognosis, we measured recurrence risk and gene expression using the Oncotype DX® Colon Cancer Assay (12-gene assay) and compared the Recurrence Score results and gene expression profiles between AA patients and CA patients with stage II colon cancer. We retrieved demographic, clinical, and archived tumor tissues from stage II colon cancer patients at four institutions. The 12-gene assay and mismatch repair (MMR) status were performed by Genomic Health (Redwood City, California). Student's t-test and the Wilcoxon rank sum test were used to compare Recurrence Score data and gene expression data from AA and CA patients (SAS Enterprise Guide 5.1). Samples from 122 AA and 122 CA patients were analyzed. There were 118 women (63 AA, 55 CA) and 126 men (59 AA, 67 CA). Median age was 66 years for AA patients and 68 for CA patients. Age, gender, year of surgery, pathologic T-stage, tumor location, the number of lymph nodes examined, lymphovascular invasion, and MMR status were not significantly different between groups (p = 0.93). The mean Recurrence Score result for AA patients (27.9 ± 12.8) and CA patients (28.1 ± 11.8) was not significantly different and the proportions of patients with high Recurrence Score values (≥41) were similar between the groups (17/122 AA; 15/122 CA). None of the gene expression variables, either single genes or gene groups (cell cycle group, stromal group, BGN1, FAP, INHBA1, Ki67, MYBL2, cMYC and GADD45B), was significantly different between the racial groups. After controlling for clinical and pathologic covariates, the means and distributions of Recurrence Score results and gene expression profiles showed no statistically significant difference between patient groups. The distribution of Recurrence Score results and gene expression data was similar in a cohort of AA and CA patients with stage II colon cancer and similar clinical characteristics, suggesting that tumor biology, as represented by the 12-gene assay, did not differ between patient groups.
Applications of computerized adaptive testing (CAT) to the assessment of headache impact.
Ware, John E; Kosinski, Mark; Bjorner, Jakob B; Bayliss, Martha S; Batenhorst, Alice; Dahlöf, Carl G H; Tepper, Stewart; Dowson, Andrew
2003-12-01
To evaluate the feasibility of computerized adaptive testing (CAT) and the reliability and validity of CAT-based estimates of headache impact scores in comparison with 'static' surveys. Responses to the 54-item Headache Impact Test (HIT) were re-analyzed for recent headache sufferers (n = 1016) who completed telephone interviews during the National Survey of Headache Impact (NSHI). Item response theory (IRT) calibrations and the computerized dynamic health assessment (DYNHA) software were used to simulate CAT assessments by selecting the most informative items for each person and estimating impact scores according to pre-set precision standards (CAT-HIT). Results were compared with IRT estimates based on all items (total-HIT), computerized 6-item dynamic estimates (CAT-HIT-6), and a developmental version of a 'static' 6-item form (HIT-6-D). Analyses focused on: respondent burden (survey length and administration time), score distributions ('ceiling' and 'floor' effects), reliability and standard errors, and clinical validity (diagnosis, level of severity). A random sample (n = 245) was re-assessed to test responsiveness. A second study (n = 1103) compared actual CAT surveys and an improved 'static' HIT-6 among current headache sufferers sampled on the Internet. Respondents completed measures from the first study and the generic SF-8 Health Survey; some (n = 540) were re-tested on the Internet after 2 weeks. In the first study, simulated CAT-HIT and total-HIT scores were highly correlated (r = 0.92) without 'ceiling' or 'floor' effects and with a substantial reduction (90.8%) in respondent burden. Six of the 54 items accounted for the great majority of item administrations (3603/5028, 77.6%). CAT-HIT reliability estimates were very high (0.975-0.992) in the range where 95% of respondents scored, and relative validity (RV) coefficients were high for diagnosis (RV = 0.87) and severity (RV = 0.89); patient-level classifications were accurate 91.3% for a diagnosis of migraine. For all three criteria of change, CAT-HIT scores were more responsive than all other measures. In the second study, estimates of respondent burden, item usage, reliability and clinical validity were replicated. The test-retest reliability of CAT-HIT was 0.79 and alternate forms coefficients ranged from 0.85 to 0.91. All correlations with the generic SF-8 were negative. CAT-based administrations of headache impact items achieved very large reductions in respondent burden without compromising validity for purposes of patient screening or monitoring changes in headache impact over time. IRT models and CAT-based dynamic health assessments warrant testing among patients with other conditions.
Daskivich, Timothy; Luu, Michael; Noah, Benjamin; Fuller, Garth; Anger, Jennifer; Spiegel, Brennan
2018-05-09
Health care consumers are increasingly using online ratings to select providers, but differences in the distribution of scores across specialties and skew of the data have the potential to mislead consumers about the interpretation of ratings. The objective of our study was to determine whether distributions of consumer ratings differ across specialties and to provide specialty-specific data to assist consumers and clinicians in interpreting ratings. We sampled 212,933 health care providers rated on the Healthgrades consumer ratings website, representing 29 medical specialties (n=128,678), 15 surgical specialties (n=72,531), and 6 allied health (nonmedical, nonnursing) professions (n=11,724) in the United States. We created boxplots depicting distributions and tested the normality of overall patient satisfaction scores. We then determined the specialty-specific percentile rank for scores across groupings of specialties and individual specialties. Allied health providers had higher median overall satisfaction scores (4.5, interquartile range [IQR] 4.0-5.0) than physicians in medical specialties (4.0, IQR 3.3-4.5) and surgical specialties (4.2, IQR 3.6-4.6, P<.001). Overall satisfaction scores were highly left skewed (normal between -0.5 and 0.5) for all specialties, but skewness was greatest among allied health providers (-1.23, 95% CI -1.280 to -1.181), followed by surgical (-0.77, 95% CI -0.787 to -0.755) and medical specialties (-0.64, 95% CI -0.648 to -0.628). As a result of the skewness, the percentages of overall satisfaction scores less than 4 were only 23% for allied health, 37% for surgical specialties, and 50% for medical specialties. Percentile ranks for overall satisfaction scores varied across specialties; percentile ranks for scores of 2 (0.7%, 2.9%, 0.8%), 3 (5.8%, 16.6%, 8.1%), 4 (23.0%, 50.3%, 37.3%), and 5 (63.9%, 89.5%, 86.8%) differed for allied health, medical specialties, and surgical specialties, respectively. Online consumer ratings of health care providers are highly left skewed, fall within narrow ranges, and differ by specialty, which precludes meaningful interpretation by health care consumers. Specialty-specific percentile ranks may help consumers to more meaningfully assess online physician ratings. ©Timothy Daskivich, Michael Luu, Benjamin Noah, Garth Fuller, Jennifer Anger, Brennan Spiegel. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 09.05.2018.
Exploring a Source of Uneven Score Equity across the Test Score Range
ERIC Educational Resources Information Center
Huggins-Manley, Anne Corinne; Qiu, Yuxi; Penfield, Randall D.
2018-01-01
Score equity assessment (SEA) refers to an examination of population invariance of equating across two or more subpopulations of test examinees. Previous SEA studies have shown that score equity may be present for examinees scoring at particular test score ranges but absent for examinees scoring at other score ranges. No studies to date have…
Teaching the Anatomy of Oncology: Evaluating the Impact of a Dedicated Oncoanatomy Course
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chino, Junzo P., E-mail: junzo.chino@duke.ed; Lee, W. Robert; Madden, Richard
Purpose: Anatomic considerations are often critical in multidisciplinary cancer care. We developed an anatomy-focused educational program for radiation oncology residents integrating cadaver dissection into the didactic review of diagnostic, surgical, radiologic, and treatment planning, and herein assess its efficacy. Methods and Materials: Monthly, anatomic-site based educational modules were designed and implemented during the 2008-2009 academic year at Duke University Medical Center. Ten radiation oncology residents participated in these modules consisting of a 1-hour didactic introduction followed by a 1-hour session in the gross anatomy lab with cadavers prepared by trained anatomists. Pretests and posttests were given for six modules, andmore » post-module feedback surveys were distributed. Additional review questions testing knowledge from prior sessions were integrated into the later testing to evaluate knowledge retention. Paired analyses of pretests and postests were performed by Wilcoxon signed-rank test. Results: Ninety tests were collected and scored with 35 evaluable pretest and posttest pairs for six site-specific sessions. Posttests had significantly higher scores (median percentage correct 66% vs. 85%, p < 0.001). Of 47 evaluable paired pretest and review questions given 1-3 months after the intervention, correct responses rates were significantly higher for the later (59% vs. 86%, p = 0.008). Resident course satisfaction was high, with a median rating of 9 of 10 (IQR 8-9); with 1 being 'less effective than most educational interventions' and 10 being 'more effective than most educational interventions.' Conclusions: An integrated oncoanatomy course is associated with improved scores on post-intervention tests, sustained knowledge retention, and high resident satisfaction.« less
Assessment of the acceptance and effectiveness of peer-assisted learning in pediatrics.
Awasthi, Shally; Yadav, Krishna Kumar
2015-08-01
Peer-assisted learning (PAL) is the development of knowledge and skill through active help and support of equals. However, this has not been tested in medical education in India. To assess the effectiveness of PAL on improvement in cognitive assessment scores and its acceptance among undergraduate medical students in one public teaching medical university in North India. After approval from the Institutional Ethics Committee, three PAL sessions, 1 per week, each on specific topic, were conducted using small group discussion methodology with a faculty contact and student leader and 4-6 peer-learners, in 9(th) semester MBBS students. A pretest with multiple choice questions (MCQs) was followed by distribution of learning objectives and list of resource material. PAL session was conducted after 72 h, followed by posttest by MCQs and then focus group discussion (FGD) on students' experiences. Of the 26 students enrolled, three PAL sessions was completed by 22 (84.6%) students. The correlation coefficient between pre- and post-test scores was 0.48 (P < 0.0001), with a 24.2% improvement in posttest scores. In the nine FGDs most said that PALs helped in the better preparation of the topic, clarifying doubts, lessened examination anxiety, improved communication skills, and increased self-confidence. PAL was well accepted, and it improved assessment scores. Therefore, it can be adopted for teaching selected topics across all subjects of MBBS course.
Development of a risk index for prediction of abnormal pap test results in Serbia.
Vukovic, Dejana; Antic, Ljiljana; Vasiljevic, Mladenko; Antic, Dragan; Matejic, Bojana
2015-01-01
Serbia is one of the countries with highest incidence and mortality rates for cervical cancer in Central and South Eastern Europe. Introducing a risk index could provide a powerful means for targeting groups at high likelihood of having an abnormal cervical smear and increase efficiency of screening. The aim of the present study was to create and assess validity ofa index for prediction of an abnormal Pap test result. The study population was drawn from patients attending Departments for Women's Health in two primary health care centers in Serbia. Out of 525 respondents 350 were randomly selected and data obtained from them were used as the index creation dataset. Data obtained from the remaining 175 were used as an index validation data set. Age at first intercourse under 18, more than 4 sexual partners, history of STD and multiparity were attributed statistical weights 16, 15, 14 and 13, respectively. The distribution of index scores in index-creation data set showed that most respondents had a score 0 (54.9%). In the index-creation dataset mean index score was 10.3 (SD-13.8), and in the validation dataset the mean was 9.1 (SD=13.2). The advantage of such scoring system is that it is simple, consisting of only four elements, so it could be applied to identify women with high risk for cervical cancer that would be referred for further examination.
Rademakers, Jany; Jansen, Daphne; van der Hoek, Lucas; Heijmans, Monique
2015-04-03
The aim of this study was to test the Dutch version of the Clinician Support for Patient Activation Measure (CS-PAM), to explore the beliefs of Dutch clinicians about patients' self-management, and to establish whether there are differences in this respect between general practitioners and other primary care providers. The CS-PAM was translated in Dutch and data were collected in a sample of 489 general practitioners and other primary care providers. Statistical analyses (RASCH, Cronbach's α) were performed to establish the psychometric properties of the instrument. The psychometric scores of the Dutch CS-PAM were acceptable to good, and the difficulty level and structure was comparable to that of the original instrument. The average score of Dutch clinicians on the CS-PAM was 65.1 (SD 10.7), somewhat lower compared to their colleagues in the US (69; SD 12.1) and the UK (69, SD 12.8). Dutch general practitioners scored significantly lower on the CS-PAM compared to other primary care providers. The Dutch CS-PAM is a reliable instrument to measure beliefs of clinicians regarding patient self-management. Further validation studies are necessary to establish the distribution of scores in specific provider populations and to assess the clinical relevance of the instrument for different outcomes.
Quintas, Rui; Pagani, Marco; Brock, Stefano; Visintini, Sergio; Cusin, Alberto; Schiariti, Marco; Broggi, Morgan; Ferroli, Paolo; Leonardi, Matilde
2014-01-01
Background. The aim of this paper is to present the preliminary results of QoL, well-being, disability, and coping strategies of patients before neurosurgical procedure. Methods. We analysed data on preoperative quality of life (EUROHIS-QoL), disability (WHODAS-II), well-being (PGWB-S), coping strategies (Brief COPE), and functional status (KPS score) of a sample of patients with brain tumours and cerebrovascular and spinal degenerative disease admitted to Neurological Institute Carlo Besta. Statistical analysis was performed to illustrate the distribution of sociodemographic and clinical data, to compare mean test scores to the respective normative samples, and to investigate the differences between diagnoses, the correlation between tests, and the predictive power of sociodemographic and clinical variables of QoL. Results. 198 patients were included in the study. PGWB-S and EUROHIS-QoL scores were significantly lower than normative population. Patients with spinal diseases reported higher scores in WHODAS-II compared with oncological and cerebrovascular groups. Finally sociodemographic and clinical variables were significant predictors of EUROHIS-QoL, in particular PGWB-S and WHODAS-II. Conclusion. Our preliminary results show that preoperatory period is critical and the evaluation of coping strategies, quality of life, disability, and well-being is useful to plan tailored intervention and for a better management of each patient. PMID:25538963
Suryavanshi, Moushumi; Mehta, Anurag; Jaipuria, Jiten; Kumar, Dushyant; Vishwakarma, Gayatri; Panigrahi, Manoj Kumar; Verma, Haristuti; Saifi, Mumtaz; Sharma, Sanjeev; Tandon, Simran; Doval, D C; Das, Bhudev C
2018-02-09
IHC and FISH are used for categorizing HER 2 status in breast cancer at the protein and DNA level, respectively. HER 2 expression at the RNA level is quantitative, cheaper, easier to standardize and free from interobserver variation. 115 consecutive patients were tested by IHC, FISH and RT-PCR (test cohort). Assuming FISH result to be the response variable, ROC curves for RT-PCR ratio were analyzed to label HER 2 negative, equivocal and positive cases as RT-PCR score 1, 2 and 3, respectively. Inter-relationships between RT-PCR, IHC and FISH were defined. 'Clinical benefit' of a test was defined as proportion of patients labeled unequivocally as HER 2 positive or negative. Population for 1 year was simulated constraint to previous reports of HER 2 positivity and IHC category distribution by a meta-analysis of previous studies that evaluated concordance between IHC and FISH to determine HER 2 status (simulation cohort). Four diagnostic pathways in the simulation cohort were defined-(1) initial IHC, followed by FISH (conventional pathway); (2) initial RT-PCR, followed by FISH; (3) initial IHC, followed by RT-PCR and then by FISH; (4) initial RT-PCR, followed by IHC and then by FISH. The clinical benefit of IHC and RT-PCR in the four pathways was analyzed and sensitivity analysis for incremental cost-effectiveness ratio and cost-benefit comapring RT-PCR against IHC, both as first-line tests and among those with IHC score 2 as a reflex second-line test was performed by the Monte Carlo technique. 115 patients comprised the study population. While none with IHC score of 0 or 1 was FISH positive for HER 2, all cases with IHC score of 3 were FISH positive. 43 cases were assigned IHC score of 2. Thus, 72 patients benefited from the initial IHC testing [clinical benefit 62.6%], with the overall concordance between IHC and FISH being 100% for those with IHC score of 0, 1 and 3 (conclusive IHC categories). For RT-PCR with 100% concordance, 15.7% (115-97 = 18) patients would have benefited from RT-PCR testing if it was used as a first-line test. If RT-PCR would have been used as a second-line test among those with IHC score 2 (n = 43), then only 6 patients would have been assigned a conclusive RT-PCR category (category 1 or 3) translating to a clinical benefit of 14% (6/43) as a second-line test. As a second-line test it had 51% probability to prove more cost-effective than the conventional pathway, provided the cost of RT-PCR was 0.4 times the cost of IHC. Also in a three-step pathway, RT-PCR upfront would have 56% probability of higher cost-benefit provided the cost of RT-PCR was 0.1 times the cost of IHC. RT-PCR results were found to be suboptimal to IHC in terms of discriminative ability and clinical benefit; thus, it is unlikely to replace IHC as a first-line test in the near future.
Ren, Li; Peng, Lihua; Qin, Peipei; Min, Su
2015-07-01
To evaluate the efficacy of continuous femoral block on the postoperative analgesia and functional recovery after total knee arthroplasty (TKA). Two hundreds and eighty patients who underwent TKA were randomized into two groups:the group receiving continuous femoral block (CFNB) and the group receiving patient controlled intravenous analgesia (PCIA), each group included 140 participants. Femoral nerve block with ropivacaine by ultrasonic guidance was performed in group CFNB and group PCIA were administrated with patient controlled intravenous analgesia. Numerical rating scale (NRS) scores at rest and in motion at 24, 48, 72 h, 3, 6 and 12 months postoperatively, also the NRS scores at hospital discharge were recorded. The incidence of moderate-severity pain, as well as the degree of knee flexion and the WOMAC scores at 3, 6 and 12 months after surgery were analyzed. The rescue analgesic administration and analgesia-related adverse effects were also recorded. Data were expressed as mean± standard deviation (SD) for normally distributed continuous variables and total number (percent frequency) for categorical variables. If non-normally distributed, data were expressed median inter-quartile range. Student's t-test, Wilcoxon rank test were used to compare results for continuous variables, when appropriate. Chi-square test was used to compare results for categorical variable, Fisher exact test was used for categorical variables when the number of event was less than 5. NRS scores of group CFNB in motion was 3 (3-4) at discharge time, and 3 (2-4), 3 (2-3) at 3 months and 6 months postoperatively, while the scores of group PCIA was 4 (4-4), 3 (3-4), 3 (3-4), respectively. And at rest, NRS scores of group CFNB was 3 (2-3), 1 (1-2), 1 (1-1) at discharge time, and 3, 6 months postoperatively. Compared with group PCIA, NRS scores in motion of group CFNB at discharge time (Z=-5.174, P<0.05) and 3 months (Z=2.308, P=0.021), as well as 6 months postoperatively (Z=-2.495, P=0.013), were significantly lower,also for the NRS scores at rest (Z=-2.405, P=0.016; Z=-4.360, P<0.05; Z=-9.268, P<0.05). The degree of knee flexion of group CFNB at 3 and 6 months postoperatively was 92 (88-97), 103 (99-106), while the degree of knee flexion of group PCIA was 89 (86-95), 100 (97-105); the WOMAC scores of group CFNB at 3 and 6 months postoperatively was 21 (18-26), 18 (16-22), while the scores of group PCIA was 24 (20-27), 21 (17-24). WOMAC scores of group CFNB was lower compared with group PCIA at 3 (Z=-2.467, P=0.014) and 6 (Z=-2.537, P=0.011) months postoperatively while the degree of knee flexion of group CFNB was higher (Z=-2.175, P=0.030; Z=-2.471, P=0.013). Moreover, the frequency of bolus and frequency of rescue of group CFNB was 2.3 and 0.6, while the frequency of group PCIA was 2.6 and 1.1, the frequency of bolus and frequency of rescue were lower in group CFNB (t=-2.984, P=0.003; t=-3.213, P=0.002). The incidence of adverse events such muscle weakness of low limbs,nausea and vomiting were similar in two groups (P>0.05). CFNB can alleviate the postoperative pain after TKA with safety, help improving the short-middle-term functions of knee and quality of patients' lives.
Briefer assessment of social network drinking: A test of the Important People Instrument-5 (IP-5).
Hallgren, Kevin A; Barnett, Nancy P
2016-12-01
The Important People instrument (IP; Longabaugh et al., 2010) is one of the most commonly used measures of social network drinking. Although its reliability and validity are well-supported, the length of the instrument may limit its use in many settings. The present study evaluated whether a briefer, 5-person version of the IP (IP-5) adequately reproduces scores from the full IP. College freshmen (N = 1,053) reported their own past-month drinking, alcohol-related consequences, and information about drinking in their close social networks at baseline and 1 year later. From this we derived network members' drinking frequency, percentage of drinkers, and percentage of heavy drinkers, assessed for up to 10 (full IP) or 5 (IP-5) network members. We first modeled the expected concordance between full-IP scores and scores from simulated shorter IP instruments by sampling smaller subsets of network members from full IP data. Then, using quasi-experimental methods, we administered the full IP and IP-5 and compared the 2 instruments' score distributions and concurrent and year-lagged associations with participants' alcohol consumption and consequences. Most of the full-IP variance was reproduced from simulated shorter versions of the IP (ICCs ≥ 0.80). The full IP and IP-5 yielded similar score distributions, concurrent associations with drinking (r = 0.22 to 0.52), and year-lagged associations with drinking. The IP-5 retains most of the information about social network drinking from the full IP. The shorter instrument may be useful in clinical and research settings that require frequent measure administration, yielding greater temporal resolution for monitoring social network drinking. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Stingone, Jeanette A.; McVeigh, Katharine H.; Claudio, Luz
2016-01-01
The objective of this research was to determine if prenatal exposure to two common urban air pollutants, diesel and perchloroethylene, affects children's 3rd grade standardized test scores in mathematics and English language arts (ELA). Exposure estimates consisted of annual average ambient concentrations of diesel particulate matter and perchloroethylene obtained from the Environmental Protection Agency's 1996 National Air Toxics Assessment for the residential census tract at birth. Outcome data consisted of linked birth and educational records for 201,559 singleton, non-anomalous children born between 1994-1998 who attended New York City public schools. Quantile regression models were used to estimate the effects of these exposures on multiple points within the continuous distribution of standardized test scores. Modified Poisson regression models were used to calculate risk ratios (RR) and 95% confidence intervals (CI) of failing to meet curricula standards, an indicator derived from test scores. Models were adjusted for a number of maternal, neighborhood and childhood factors. Results showed that math scores were approximately 6% of a standard deviation lower for children exposed to the highest levels of both pollutants as compared to children with low levels of both pollutants. Children exposed to high levels of both pollutants also had the largest risk of failing to meet math test standards when compared to children with low levels of exposure to the pollutants (RR 1.10 95%CI 1.07,1.12 RR high perchloroethylene only 1.03 95%CI 1.00,1.06; RR high diesel PM only 1.02 95%CI 0.99,1.06). There was no association observed between exposure to only one of the pollutants and failing to meet ELA standards. This study provides preliminary evidence of associations between prenatal exposure to urban air pollutants and lower academic outcomes. Additionally, these findings suggest that individual pollutants may additively impact health and point to the need to study the collective effects of air pollutant mixtures. Key Words: air toxics, academic outcomes, urban health, tetrachloroethylene, air pollutant mixtures PMID:27058443
Stingone, Jeanette A; McVeigh, Katharine H; Claudio, Luz
2016-07-01
The objective of this research was to determine if prenatal exposure to two common urban air pollutants, diesel and perchloroethylene, affects children's 3rd grade standardized test scores in mathematics and English language arts (ELA). Exposure estimates consisted of annual average ambient concentrations of diesel particulate matter and perchloroethylene obtained from the Environmental Protection Agency's 1996 National Air Toxics Assessment for the residential census tract at birth. Outcome data consisted of linked birth and educational records for 201,559 singleton, non-anomalous children born between 1994 and 1998 who attended New York City public schools. Quantile regression models were used to estimate the effects of these exposures on multiple points within the continuous distribution of standardized test scores. Modified Poisson regression models were used to calculate risk ratios (RR) and 95% confidence intervals (CI) of failing to meet curricula standards, an indicator derived from test scores. Models were adjusted for a number of maternal, neighborhood and childhood factors. Results showed that math scores were approximately 6% of a standard deviation lower for children exposed to the highest levels of both pollutants as compared to children with low levels of both pollutants. Children exposed to high levels of both pollutants also had the largest risk of failing to meet math test standards when compared to children with low levels of exposure to the pollutants (RR 1.10 95%CI 1.07,1.12 RR high perchloroethylene only 1.03 95%CI 1.00,1.06; RR high diesel PM only 1.02 95%CI 0.99,1.06). There was no association observed between exposure to the pollutants and failing to meet ELA standards. This study provides preliminary evidence of associations between prenatal exposure to urban air pollutants and lower academic outcomes. Additionally, these findings suggest that individual pollutants may additively impact health and point to the need to study the collective effects of air pollutant mixtures. air toxics, academic outcomes, urban health, tetrachloroethylene, air pollutant mixtures. Copyright © 2016 Elsevier Inc. All rights reserved.
Evaluation of the Biotyper MALDI-TOF MS system for identification of Staphylococcus species.
Zhu, Wenming; Sieradzki, Krzysztof; Albrecht, Valerie; McAllister, Sigrid; Lin, Wen; Stuchlik, Olga; Limbago, Brandi; Pohl, Jan; Kamile Rasheed, J
2015-10-01
The Bruker Biotyper MALDI-TOF MS (Biotyper) system, with a modified 30 minute formic acid extraction method, was evaluated by its ability to identify 216 clinical Staphylococcus isolates from the CDC reference collection comprising 23 species previously identified by conventional biochemical tests. 16S rDNA sequence analysis was used to resolve discrepancies. Of these, 209 (96.8%) isolates were correctly identified: 177 (84.7%) isolates had scores ≥2.0, while 32 (15.3%) had scores between 1.70 and 1.99. The Biotyper identification was inconsistent with the biochemical identification for seven (3.2%) isolates, but the Biotyper identifications were confirmed by 16S rDNA analysis. The distribution of low scores was strongly species-dependent, e.g. only 5% of Staphylococcus epidermidis and 4.8% of Staphylococcus aureus isolates scored below 2.0, while 100% of Staphylococcus cohnii, 75% of Staphylococcus sciuri, and 60% of Staphylococcus caprae produced low but accurate Biotyper scores. Our results demonstrate that the Biotyper can reliably identify Staphylococcus species with greater accuracy than conventional biochemicals. Broadening of the reference database by inclusion of additional examples of under-represented species could further optimize Biotyper results. Published by Elsevier B.V.
ERIC Educational Resources Information Center
Krueger, Alan; Rothstein, Jesse; Turner, Sarah
2006-01-01
In Grutter v. Bollinger (2003), Justice Sandra Day O'Connor conjectured that in 25 years affirmative action in college admissions will be unnecessary. We project the test score distribution of black and white college applicants 25 years from now, focusing on the role of black-white family income gaps. Economic progress alone is unlikely to narrow…
Shallow Water UXO Technology Demonstration Site Scoring Record No. 7
2007-05-01
categories: ferrous, nonferrous , and mixed metals . The ferrous and nonferrous items are further divided into the three weight zones as presented in... nonferrous component and could reasonably be encountered in a range area. The mixed- metals clutter was placed in the open water area only. TABLE 1-3...Table 1-4, and distributed throughout all test areas. Most of this clutter is composed of ordnance components; however, industrial scrap metal and
Expression of calcium binding protein S100 A7 (psoriasin) in laryngeal carcinoma.
Tiveron, Rogério Costa; de Freitas, Luiz Carlos Conti; Figueiredo, David L; Serafini, Luciano N; Mamede, Rui Celso Martins; Zago, Marco A
2012-01-01
Many studies have reported increased expression of S100 A7 (psoriasin) in neoplastic lesions. Among them are studies on breast carcinoma, bladder squamous cell carcinoma, skin tumors and oral cavity squamous cell carcinoma. The expression of S100 A7 has not been described for laryngeal cancer. This study aims to identify the expression of the calcium-binding protein S100 A7 and its correlation with squamous cell carcinomas of the larynx. Specimens from 63 patients were submitted to immunohistochemistry testing with antibody S100 A7. Results were classified and compared. The group with highly differentiated tumors had the highest treatment failure scores. Moderately differentiated tumors had higher treatment failure scores than poorly differentiated tumors. Higher scores were predominantly seen on stages I and II in moderately differentiated tumors, whereas score distribution was more homogeneous in advanced stage disease (III and IV). Regarding failure in treatment, the group scoring zero (3/4 complications: 75%) differed significantly from the remaining groups (13/59: 22%). S100 A7 marker was expressed in 93.7% of laryngeal cancer cases, with higher positive correlation rates in more differentiated tumors and significantly lower rates of treatment failure. Scores had no impact on survival rates.
Better prognostic marker in ICU - APACHE II, SOFA or SAP II!
Naqvi, Iftikhar Haider; Mahmood, Khalid; Ziaullaha, Syed; Kashif, Syed Mohammad; Sharif, Asim
2016-01-01
This study was designed to determine the comparative efficacy of different scoring system in assessing the prognosis of critically ill patients. This was a retrospective study conducted in medical intensive care unit (MICU) and high dependency unit (HDU) Medical Unit III, Civil Hospital, from April 2012 to August 2012. All patients over age 16 years old who have fulfilled the criteria for MICU admission were included. Predictive mortality of APACHE II, SAP II and SOFA were calculated. Calibration and discrimination were used for validity of each scoring model. A total of 96 patients with equal gender distribution were enrolled. The average APACHE II score in non-survivors (27.97+8.53) was higher than survivors (15.82+8.79) with statistically significant p value (<0.001). The average SOFA score in non-survivors (9.68+4.88) was higher than survivors (5.63+3.63) with statistically significant p value (<0.001). SAP II average score in non-survivors (53.71+19.05) was higher than survivors (30.18+16.24) with statistically significant p value (<0.001). All three tested scoring models (APACHE II, SAP II and SOFA) would be accurate enough for a general description of our ICU patients. APACHE II has showed better calibration and discrimination power than SAP II and SOFA.
Cross-cultural comparison of motor competence in children from Australia and Belgium
Bardid, Farid; Rudd, James R.; Lenoir, Matthieu; Polman, Remco; Barnett, Lisa M.
2015-01-01
Motor competence in childhood is an important determinant of physical activity and physical fitness in later life. However, childhood competence levels in many countries are lower than desired. Due to the many different motor skill instruments in use, children's motor competence across countries is rarely compared. The purpose of this study was to evaluate the motor competence of children from Australia and Belgium using the Körperkoordinationstest für Kinder (KTK). The sample consisted of 244 (43.4% boys) Belgian children and 252 (50.0% boys) Australian children, aged 6–8 years. A MANCOVA for the motor scores showed a significant country effect. Belgian children scored higher on jumping sideways, moving sideways and hopping for height but not for balancing backwards. Moreover, a Chi squared test revealed significant differences between the Belgian and Australian score distribution with 21.3% Belgian and 39.3% Australian children scoring “below average.” The very low levels reported by Australian children may be the result of cultural differences in physical activity contexts such as physical education and active transport. When compared to normed scores, both samples scored significantly worse than children 40 years ago. The decline in children's motor competence is a global issue, largely influenced by increasing sedentary behavior and a decline in physical activity. PMID:26217282
Bazelet, Corinna S; Thompson, Aileen C; Naskrecki, Piotr
2016-01-01
The use of endemism and vascular plants only for biodiversity hotspot delineation has long been contested. Few studies have focused on the efficacy of global biodiversity hotspots for the conservation of insects, an important, abundant, and often ignored component of biodiversity. We aimed to test five alternative diversity measures for hotspot delineation and examine the efficacy of biodiversity hotspots for conserving a non-typical target organism, South African katydids. Using a 1° fishnet grid, we delineated katydid hotspots in two ways: (1) count-based: grid cells in the top 10% of total, endemic, threatened and/or sensitive species richness; vs. (2) score-based: grid cells with a mean value in the top 10% on a scoring system which scored each species on the basis of its IUCN Red List threat status, distribution, mobility and trophic level. We then compared katydid hotspots with each other and with recognized biodiversity hotspots. Grid cells within biodiversity hotspots had significantly higher count-based and score-based diversity than non-hotspot grid cells. There was a significant association between the three types of hotspots. Of the count-based measures, endemic species richness was the best surrogate for the others. However, the score-based measure out-performed all count-based diversity measures. Species richness was the least successful surrogate of all. The strong performance of the score-based method for hotspot prediction emphasizes the importance of including species' natural history information for conservation decision-making, and is easily adaptable to other organisms. Furthermore, these results add empirical support for the efficacy of biodiversity hotspots in conserving non-target organisms.
Work capability during isolation.
Gushin, V I; Efimov, V A; Smirnova, T M
1996-01-01
The aim of this study was to investigate the effects of prolonged isolation on the higher psychic functions, like working memory, attention concentration, and intellect (problem solving and decision making), and on sensory-motor skills and stress resistance. Previous Soviet simulation studies and the ISEMSI isolation experiment have indicated that prolonged isolation can affect higher psychic functions. A set of psychological tests in the form of a computer game was presented each workday to the chamber crew and to the ground crew serving as a control group. In analyzing the data it was taken into account that performance can be affected not only by the influence of isolation, but also by a learning process and by subject motivation. In addition, a distinction was made between absolute score and stability (range) of the score. Analysis of the chamber crew's work capability as a function of time showed the occurrence of three distinct periods of adaptation: (1) a period of acute adaptation in week 1, (2) a period of stable adaptation during weeks 3-6, and (3) a period of "final effort" in weeks 8-9. While in general the effect of isolation on the absolute scores was minor, larger ranges for the scores in "working memory," "attention concentration," and "calculation under time deficit" tests are an indication of increased instability, probably due to stress resistance. The 4 female subjects of the combined groups scored significantly higher than the 5 males in "attention concentration/distribution," "spatial orientation," "intuition in visual search," and "logical decision making under time deficit." Males presented higher scores in "calculation under time deficit" and working memory, and higher stability in "attention concentration" and "calculation under time deficit."
Bazelet, Corinna S.; Thompson, Aileen C.; Naskrecki, Piotr
2016-01-01
The use of endemism and vascular plants only for biodiversity hotspot delineation has long been contested. Few studies have focused on the efficacy of global biodiversity hotspots for the conservation of insects, an important, abundant, and often ignored component of biodiversity. We aimed to test five alternative diversity measures for hotspot delineation and examine the efficacy of biodiversity hotspots for conserving a non-typical target organism, South African katydids. Using a 1° fishnet grid, we delineated katydid hotspots in two ways: (1) count-based: grid cells in the top 10% of total, endemic, threatened and/or sensitive species richness; vs. (2) score-based: grid cells with a mean value in the top 10% on a scoring system which scored each species on the basis of its IUCN Red List threat status, distribution, mobility and trophic level. We then compared katydid hotspots with each other and with recognized biodiversity hotspots. Grid cells within biodiversity hotspots had significantly higher count-based and score-based diversity than non-hotspot grid cells. There was a significant association between the three types of hotspots. Of the count-based measures, endemic species richness was the best surrogate for the others. However, the score-based measure out-performed all count-based diversity measures. Species richness was the least successful surrogate of all. The strong performance of the score-based method for hotspot prediction emphasizes the importance of including species’ natural history information for conservation decision-making, and is easily adaptable to other organisms. Furthermore, these results add empirical support for the efficacy of biodiversity hotspots in conserving non-target organisms. PMID:27631131
Rau, Cheng-Shyuan; Wu, Shao-Chun; Kuo, Pao-Jen; Chen, Yi-Chun; Chien, Peng-Chen; Hsieh, Hsiao-Yun; Hsieh, Ching-Hua
2017-09-11
Background: Polytrauma patients are expected to have a higher risk of mortality than that obtained by the summation of expected mortality owing to their individual injuries. This study was designed to investigate the outcome of patients with polytrauma, which was defined using the new Berlin definition, as cases with an Abbreviated Injury Scale (AIS) ≥ 3 for two or more different body regions and one or more additional variables from five physiologic parameters (hypotension [systolic blood pressure ≤ 90 mmHg], unconsciousness [Glasgow Coma Scale score ≤ 8], acidosis [base excess ≤ -6.0], coagulopathy [partial thromboplastin time ≥ 40 s or international normalized ratio ≥ 1.4], and age [≥70 years]). Methods: We retrieved detailed data on 369 polytrauma patients and 1260 non-polytrauma patients with an overall Injury Severity Score (ISS) ≥ 18 who were hospitalized between 1 January 2009 and 31 December 2015 for the treatment of all traumatic injuries, from the Trauma Registry System at a level I trauma center. Patients with burn injury or incomplete registered data were excluded. Categorical data were compared with two-sided Fisher exact or Pearson chi-square tests. The unpaired Student t -test and the Mann-Whitney U -test was used to analyze normally distributed continuous data and non-normally distributed data, respectively. Propensity-score matched cohort in a 1:1 ratio was allocated using the NCSS software with logistic regression to evaluate the effect of polytrauma on patient outcomes. Results: The polytrauma patients had a significantly higher ISS than non-polytrauma patients (median (interquartile range Q1-Q3), 29 (22-36) vs. 24 (20-25), respectively; p < 0.001). Polytrauma patients had a 1.9-fold higher odds of mortality than non-polytrauma patients (95% CI 1.38-2.49; p < 0.001). Compared to non-polytrauma patients, polytrauma patients had a substantially longer hospital length of stay (LOS). In addition, a higher proportion of polytrauma patients were admitted to the intensive care unit (ICU), spent longer LOS in the ICU, and had significantly higher total medical expenses. Among 201 selected propensity score-matched pairs of polytrauma and non-polytrauma patients who showed no significant difference in sex, age, co-morbidity, AIS ≥ 3, and Injury Severity Score (ISS), the polytrauma patients had a significantly higher mortality rate (OR 17.5, 95% CI 4.21-72.76; p < 0.001), and a higher proportion of patients admitted to the ICU (84.1% vs. 74.1%, respectively; p = 0.013) with longer stays in the ICU (10.3 days vs. 7.5 days, respectively; p = 0.003). The total medical expenses for polytrauma patients were 35.1% higher than those of non-polytrauma patients. However, there was no significant difference in the LOS between polytrauma and non-polytrauma patients (21.1 days vs. 19.8 days, respectively; p = 0.399). Conclusions: The findings of this propensity-score matching study suggest that the new Berlin definition of polytrauma is feasible and applicable for trauma patients.
Rau, Cheng-Shyuan; Wu, Shao-Chun; Kuo, Pao-Jen; Chen, Yi-Chun; Chien, Peng-Chen; Hsieh, Hsiao-Yun; Hsieh, Ching-Hua
2017-01-01
Background: Polytrauma patients are expected to have a higher risk of mortality than that obtained by the summation of expected mortality owing to their individual injuries. This study was designed to investigate the outcome of patients with polytrauma, which was defined using the new Berlin definition, as cases with an Abbreviated Injury Scale (AIS) ≥ 3 for two or more different body regions and one or more additional variables from five physiologic parameters (hypotension [systolic blood pressure ≤ 90 mmHg], unconsciousness [Glasgow Coma Scale score ≤ 8], acidosis [base excess ≤ −6.0], coagulopathy [partial thromboplastin time ≥ 40 s or international normalized ratio ≥ 1.4], and age [≥70 years]). Methods: We retrieved detailed data on 369 polytrauma patients and 1260 non-polytrauma patients with an overall Injury Severity Score (ISS) ≥ 18 who were hospitalized between 1 January 2009 and 31 December 2015 for the treatment of all traumatic injuries, from the Trauma Registry System at a level I trauma center. Patients with burn injury or incomplete registered data were excluded. Categorical data were compared with two-sided Fisher exact or Pearson chi-square tests. The unpaired Student t-test and the Mann–Whitney U-test was used to analyze normally distributed continuous data and non-normally distributed data, respectively. Propensity-score matched cohort in a 1:1 ratio was allocated using the NCSS software with logistic regression to evaluate the effect of polytrauma on patient outcomes. Results: The polytrauma patients had a significantly higher ISS than non-polytrauma patients (median (interquartile range Q1–Q3), 29 (22–36) vs. 24 (20–25), respectively; p < 0.001). Polytrauma patients had a 1.9-fold higher odds of mortality than non-polytrauma patients (95% CI 1.38–2.49; p < 0.001). Compared to non-polytrauma patients, polytrauma patients had a substantially longer hospital length of stay (LOS). In addition, a higher proportion of polytrauma patients were admitted to the intensive care unit (ICU), spent longer LOS in the ICU, and had significantly higher total medical expenses. Among 201 selected propensity score-matched pairs of polytrauma and non-polytrauma patients who showed no significant difference in sex, age, co-morbidity, AIS ≥ 3, and Injury Severity Score (ISS), the polytrauma patients had a significantly higher mortality rate (OR 17.5, 95% CI 4.21–72.76; p < 0.001), and a higher proportion of patients admitted to the ICU (84.1% vs. 74.1%, respectively; p = 0.013) with longer stays in the ICU (10.3 days vs. 7.5 days, respectively; p = 0.003). The total medical expenses for polytrauma patients were 35.1% higher than those of non-polytrauma patients. However, there was no significant difference in the LOS between polytrauma and non-polytrauma patients (21.1 days vs. 19.8 days, respectively; p = 0.399). Conclusions: The findings of this propensity-score matching study suggest that the new Berlin definition of polytrauma is feasible and applicable for trauma patients. PMID:28891977
Transformational leadership and moral reasoning.
Turner, Nick; Barling, Julian; Epitropaki, Olga; Butcher, Vicky; Milner, Caroline
2002-04-01
Terms such as moral and ethical leadership are used widely in theory, yet little systematic research has related a sociomoral dimension to leadership in organizations. This study investigated whether managers' moral reasoning (n = 132) was associated with the transformational and transactional leadership behaviors they exhibited as perceived by their subordinates (n = 407). Managers completed the Defining Issues Test (J. R. Rest, 1990), whereas their subordinates completed the Multifactor Leadership Questionnaire (B. M. Bass & B. J. Avolio, 1995). Analysis of covariance indicated that managers scoring in the highest group of the moral-reasoning distribution exhibited more transformational leadership behaviors than leaders scoring in the lowest group. As expected, there was no relationship between moral-reasoning group and transactional leadership behaviors. Implications for leadership development are discussed.
Kobayashi, Tohru; Fuse, Shigeto; Sakamoto, Naoko; Mikami, Masashi; Ogawa, Shunichi; Hamaoka, Kenji; Arakaki, Yoshio; Nakamura, Tsuneyuki; Nagasawa, Hiroyuki; Kato, Taichi; Jibiki, Toshiaki; Iwashima, Satoru; Yamakawa, Masaru; Ohkubo, Takashi; Shimoyama, Shinya; Aso, Kentaro; Sato, Seiichi; Saji, Tsutomu
2016-08-01
Several coronary artery Z score models have been developed. However, a Z score model derived by the lambda-mu-sigma (LMS) method has not been established. Echocardiographic measurements of the proximal right coronary artery, left main coronary artery, proximal left anterior descending coronary artery, and proximal left circumflex artery were prospectively collected in 3,851 healthy children ≤18 years of age and divided into developmental and validation data sets. In the developmental data set, smooth curves were fitted for each coronary artery using linear, logarithmic, square-root, and LMS methods for both sexes. The relative goodness of fit of these models was compared using the Bayesian information criterion. The best-fitting model was tested for reproducibility using the validation data set. The goodness of fit of the selected model was visually compared with that of the previously reported regression models using a Q-Q plot. Because the internal diameter of each coronary artery was not similar between sexes, sex-specific Z score models were developed. The LMS model with body surface area as the independent variable showed the best goodness of fit; therefore, the internal diameter of each coronary artery was transformed into a sex-specific Z score on the basis of body surface area using the LMS method. In the validation data set, a Q-Q plot of each model indicated that the distribution of Z scores in the LMS models was closer to the normal distribution compared with previously reported regression models. Finally, the final models for each coronary artery in both sexes were developed using the developmental and validation data sets. A Microsoft Excel-based Z score calculator was also created, which is freely available online (http://raise.umin.jp/zsp/calculator/). Novel LMS models with which to estimate the sex-specific Z score of each internal coronary artery diameter were generated and validated using a large pediatric population. Copyright © 2016 American Society of Echocardiography. Published by Elsevier Inc. All rights reserved.
Al-Dorzi, Hasan M; Cherfan, Antoine; Al-Harbi, Shmylan; Al-Askar, Ahmad; Al-Azzam, Saleh; Hroub, Ahmad; Olivier, Joan; Al-Hameed, Fahad; Al-Moamary, Mohamed; Abdelaal, Mohamed; Poff, Gregory A; Arabi, Yaseen M
2013-07-01
Didactic lectures are frequently used to improve compliance with practice guidelines. This study assessed the knowledge of health-care providers (HCPs) at a tertiary-care hospital of its evidence-based thromboprophylaxis guidelines and the impact of didactic lectures on their knowledge. The hospital launched a multifaceted approach to improve thromboprophylaxis practices, which included posters, a pocket-size guidelines summary and didactic lectures during the annual thromboprophylaxis awareness days. A self-administered questionnaire was distributed to HCPs before and after lectures on thromboprophylaxis guidelines (June 2010). The questionnaire, formulated and validated by two physicians, two nurses and a clinical pharmacist, covered various subjects such as risk stratification, anticoagulant dosing and the choice of anticoagulants in specific clinical situations. Seventy-two and 63 HCPs submitted the pre- and post-test, respectively (62% physicians, 28% nurses, from different clinical disciplines). The mean scores were 7.8 ± 2.1 (median = 8.0, range = 2-12, maximum possible score = 15) for the pre-test and 8.4 ± 1.8 for the post-test, P = 0.053. There was no significant difference in the pre-test scores of nurses and physicians (7.9 ± 1.7 and 8.2 ± 2.4, respectively, P = 0.67). For the 35 HCPs who completed the pre- and post-tests, their scores were 7.7 ± 1.7 and 8.8 ± 1.6, respectively, P = 0.003. Knowledge of appropriate anticoagulant administration in specific clinical situations was frequently inadequate, with approximately two-thirds of participants failing to adjust low-molecular-weight heparin doses in patients with renal failure. Education via didactic lectures resulted in a modest improvement of HCPs' knowledge of thromboprophylaxis guidelines. This supports the need for a multifaceted approach to improve the awareness and implementation of thromboprophylaxis guidelines.
Al-Dorzi, Hasan M.; Cherfan, Antoine; Al-Harbi, Shmylan; Al-Askar, Ahmad; Al-Azzam, Saleh; Hroub, Ahmad; Olivier, Joan; Al-Hameed, Fahad; Al-Moamary, Mohamed; Abdelaal, Mohamed; Poff, Gregory A.; Arabi, Yaseen M.
2013-01-01
BACKGROUND: Didactic lectures are frequently used to improve compliance with practice guidelines. This study assessed the knowledge of health-care providers (HCPs) at a tertiary-care hospital of its evidence-based thromboprophylaxis guidelines and the impact of didactic lectures on their knowledge. METHODS: The hospital launched a multifaceted approach to improve thromboprophylaxis practices, which included posters, a pocket-size guidelines summary and didactic lectures during the annual thromboprophylaxis awareness days. A self-administered questionnaire was distributed to HCPs before and after lectures on thromboprophylaxis guidelines (June 2010). The questionnaire, formulated and validated by two physicians, two nurses and a clinical pharmacist, covered various subjects such as risk stratification, anticoagulant dosing and the choice of anticoagulants in specific clinical situations. RESULTS: Seventy-two and 63 HCPs submitted the pre- and post-test, respectively (62% physicians, 28% nurses, from different clinical disciplines). The mean scores were 7.8 ± 2.1 (median = 8.0, range = 2-12, maximum possible score = 15) for the pre-test and 8.4 ± 1.8 for the post-test, P = 0.053. There was no significant difference in the pre-test scores of nurses and physicians (7.9 ± 1.7 and 8.2 ± 2.4, respectively, P = 0.67). For the 35 HCPs who completed the pre- and post-tests, their scores were 7.7 ± 1.7 and 8.8 ± 1.6, respectively, P = 0.003. Knowledge of appropriate anticoagulant administration in specific clinical situations was frequently inadequate, with approximately two-thirds of participants failing to adjust low-molecular-weight heparin doses in patients with renal failure. CONCLUSIONS: Education via didactic lectures resulted in a modest improvement of HCPs′ knowledge of thromboprophylaxis guidelines. This supports the need for a multifaceted approach to improve the awareness and implementation of thromboprophylaxis guidelines. PMID:23922612
[Value of brain MR imaging in infants with a severe idiopathic apparent life threatening event].
Christophe, C; Boutemy, R; Christiaens, F; Fonteyne, C; Ziereisen, F; Dan, B
2000-01-01
Prognostic value of a magnetic resonance imaging (MRI) scoring system in infants with a severe apparent life threatening event (ALTE). Ten infants with an ALTE (aged between 6 and 31 weeks) were clinically graded according to the PRISM score and evaluated with EEG, evoked potentials and MRI. The 18 MRIs obtained were distributed in 3 classes according to the delay after which they were obtained; class A (n=5): within the first 48 hours after the event, class B (n=7): between day 3 and 8 and class C (n=6): between day 9 and 50. The 18 MRIs were evaluated retrospectively using a scoring system based on 3 categories of lesions: edema, basal ganglia injury and watershed injuries. Five infants died between day 2 and day 15 after the event. The five surviving infants had follow up neurodevelopmental testing after 38 to 77 months. There was no correlation between the 5 MRIs of class A and the neurological outcome. For the MRIs of class B and C, the scoring system can be of great value when combined with the scores of EEG, EP and PRISM. The scoring system for MRI performed within 48 hours after the event is falsely reassuring. MRI can be helpful as early as 3 days after the event when combined with the score of the electrophysiological investigations and the PRISM.
Kawasaki, Yohei; Ide, Kazuki; Akutagawa, Maiko; Yamada, Hiroshi; Yutaka, Ono; Furukawa, Toshiaki A.
2017-01-01
Background Several recent studies have shown that total scores on depressive symptom measures in a general population approximate an exponential pattern except for the lower end of the distribution. Furthermore, we confirmed that the exponential pattern is present for the individual item responses on the Center for Epidemiologic Studies Depression Scale (CES-D). To confirm the reproducibility of such findings, we investigated the total score distribution and item responses of the Kessler Screening Scale for Psychological Distress (K6) in a nationally representative study. Methods Data were drawn from the National Survey of Midlife Development in the United States (MIDUS), which comprises four subsamples: (1) a national random digit dialing (RDD) sample, (2) oversamples from five metropolitan areas, (3) siblings of individuals from the RDD sample, and (4) a national RDD sample of twin pairs. K6 items are scored using a 5-point scale: “none of the time,” “a little of the time,” “some of the time,” “most of the time,” and “all of the time.” The pattern of total score distribution and item responses were analyzed using graphical analysis and exponential regression model. Results The total score distributions of the four subsamples exhibited an exponential pattern with similar rate parameters. The item responses of the K6 approximated a linear pattern from “a little of the time” to “all of the time” on log-normal scales, while “none of the time” response was not related to this exponential pattern. Discussion The total score distribution and item responses of the K6 showed exponential patterns, consistent with other depressive symptom scales. PMID:28289560
Kant, Kamal; Lal, Uma Ranjan; Ghosh, Manik
2018-01-01
To date, efforts for the prevention and treatment of human respiratory syncytial virus (RSV) infection have been still vain, and there is no safe and effective clinical accepted vaccine. Arisaema genus has claimed for various traditional bioactivities, but scientific assessments are quite limited. This encouraged us to carry out our present study on around 60 phytoconstituents of different Arisaema species as a natural inhibitor against the human RSV. Selected 60 phytochemical entities were evaluated on the docking behavior of human RSV receptor (PDB: 4UCC) using Maestro 9.3 (Schrödinger, LLC, Cambridge, USA). Furthermore, kinetic properties and toxicity nature of top graded ligands were analyzed through QikProp and ProTox tools. Notably, rutin (glide score: -8.49), schaftoside (glide score: -8.18) and apigenin-6,8-di-C-β-D-galactoside (glide score - 7.29) have resulted in hopeful natural lead hits with an ideal range of kinetic descriptors values. ProTox tool (oral rodent toxicity) has resulted in likely toxicity targets of apex-graded tested ligands. Finally, the whole efforts can be explored further as a model to confirm its anti-human RSV potential with wet laboratory experiments. Rutin, schaftoside, and apigenin-6,8-di-C-β-D-galactoside showed promising top hits docking profile against human respiratory syncytial virusMoreover, absorption, distribution, metabolism, excretion properties (QikProp) of top hits resulted within an ideal range of kinetic descriptorsProTox tool highlighted toxicity class ranges, LD 50 values, and possible toxicity targets of apex-graded tested ligands. Abbreviations used: RSV: Respiratory syncytial virus, PRRSV: Porcine respiratory and reproductive syndrome virus, ADME-T: Absorption, distribution, metabolism, excretion, and toxicity.
Yaseen, Zimri S.; Kopeykina, Irina; Gutkovich, Zinoviy; Bassirnia, Anahita; Cohen, Lisa J.; Galynker, Igor I.
2014-01-01
Background The greatly increased risk of suicide after psychiatric hospitalization is a critical problem, yet we are unable to identify individuals who would attempt suicide upon discharge. The Suicide Trigger Scale v.3 (STS-3), was designed to measure the construct of an affective ‘suicide trigger state’ hypothesized to precede a suicide attempt (SA). This study aims to test the predictive validity of the STS-3 for post-discharge SA on a high-risk psychiatric-inpatient sample. Methods The STS-3, and a psychological test battery measuring suicidality, mood, impulsivity, trauma history, and attachment style were administered to 161 adult psychiatric patients hospitalized following suicidal ideation (SI) or SA. Receiver Operator Characteristic and logistic regression analyses were used to assess prediction of SA in the 6-month period following discharge from hospitalization. Results STS-3 scores for the patients who made post-discharge SA followed a bimodal distribution skewed to high and low scores, thus a distance from median transform was applied to the scores. The transformed score was a significant predictor of post-discharge SA (AUC 0.731), and a subset of six STS-3 scale items was identified that produced improved prediction of post-discharge SA (AUC 0.814). Scores on C-SSRS and BSS were not predictive. Patients with ultra-high (90th percentile) STS-3 scores differed significantly from ultra-low (10th percentile) scorers on measures of affective intensity, depression, impulsiveness, abuse history, and attachment security. Conclusion STS-3 transformed scores at admission to the psychiatric hospital predict suicide attempts following discharge among the high-risk group of suicidal inpatients. Patients with high transformed scores appear to comprise two clinically distinct groups; an impulsive, affectively intense, fearfully attached group with high raw STS-3 scores and a low-impulsivity, low affect and low trauma-reporting group with low raw STS-3 scores. These groups may correspond to low-plan and planned suicide attempts, respectively, but this remains to be established by future research. PMID:24466229
An Approach to Scoring and Equating Tests with Binary Items: Piloting With Large-Scale Assessments
ERIC Educational Resources Information Center
Dimitrov, Dimiter M.
2016-01-01
This article describes an approach to test scoring, referred to as "delta scoring" (D-scoring), for tests with dichotomously scored items. The D-scoring uses information from item response theory (IRT) calibration to facilitate computations and interpretations in the context of large-scale assessments. The D-score is computed from the…
Hojat, Mohammadreza; Gonnella, Joseph S.
2015-01-01
Objective This study was designed to provide typical descriptive statistics, score distributions and percentile ranks of the Jefferson Scale of Empathy-Medical Student version (JSE-S) of male and female medical school matriculants to serve as proxy norm data and tentative cutoff scores. Subjects and Methods The participants were 2,637 students (1,336 women and 1,301 men) who matriculated at Sidney Kimmel (formerly Jefferson) Medical College between 2002 and 2012, and completed the JSE at the beginning of medical school. Information extracted from descriptive statistics, score distributions and percentile ranks for male and female matriculants were used to develop proxy norm data and tentative cutoff scores. Results The score distributions of the JSE tended to be moderately skewed and platykurtic. Women obtained a significantly higher mean score (116.2 ± 9.7) than men (112.3 ± 10.8) on the JSE-S (t2,635 = 9.9, p < 0.01). It was suggested that percentile ranks can be used as proxy norm data. The tentative cutoff score to identify low scorers was ≤95 for men and ≤100 for women. Conclusions Our findings provide norm data and cutoff scores for admission decisions under certain conditions and for identifying students in need of enhancing their empathy. PMID:25924560
Bassani, Diego G.; Corsi, Daniel J.; Gaffey, Michelle F.; Barros, Aluisio J. D.
2014-01-01
Background Worse health outcomes including higher morbidity and mortality are most often observed among the poorest fractions of a population. In this paper we present and validate national, regional and state-level distributions of national wealth index scores, for urban and rural populations, derived from household asset data collected in six survey rounds in India between 1992–3 and 2007–8. These new indices and their sub-national distributions allow for comparative analyses of a standardized measure of wealth across time and at various levels of population aggregation in India. Methods Indices were derived through principal components analysis (PCA) performed using standardized variables from a correlation matrix to minimize differences in variance. Valid and simple indices were constructed with the minimum number of assets needed to produce scores with enough variability to allow definition of unique decile cut-off points in each urban and rural area of all states. Results For all indices, the first PCA components explained between 36% and 43% of the variance in household assets. Using sub-national distributions of national wealth index scores, mean height-for-age z-scores increased from the poorest to the richest wealth quintiles for all surveys, and stunting prevalence was higher among the poorest and lower among the wealthiest. Urban and rural decile cut-off values for India, for the six regions and for the 24 major states revealed large variability in wealth by geographical area and level, and rural wealth score gaps exceeded those observed in urban areas. Conclusions The large variability in sub-national distributions of national wealth index scores indicates the importance of accounting for such variation when constructing wealth indices and deriving score distribution cut-off points. Such an approach allows for proper within-sample economic classification, resulting in scores that are valid indicators of wealth and correlate well with health outcomes, and enables wealth-related analyses at whichever geographical area and level may be most informative for policy-making processes. PMID:25356667
Roshetsky, Lisa M; Coltri, Ainoa; Flores, Andrea; Vekhter, Ben; Humphrey, Holly J; Meltzer, David O; Arora, Vineet M
2013-09-01
Understanding the association between attending physicians' workload and teaching is critical to preserving residents' learning experience. The authors tested the association between attending physicians' self-reported workload and perceptions of time for teaching before and after the 2003 resident duty hours regulations. From 2001 to 2008, the authors surveyed all inpatient general medicine attending physicians at a teaching hospital. To measure workload, they used a conceptual framework to create a composite score from six domains (mental demand, physical demand, temporal demand, effort, performance, frustration). They measured time for teaching using (1) open-ended responses to hours per week spent doing didactic teaching and (2) responses (agree, strongly agree) to the statement "I had enough time for teaching." They conducted multivariate logistic regression analyses, controlling for month, year, and clustering by attending physicians, to test the association between workload scores and time for teaching. Of 738 eligible attending physicians, 482 (65%) completed surveys. Respondents spent a median of three hours per week dedicated to teaching. Less than half (198; 43%) reporting enough time for teaching. The composite workload scores were normally distributed (median score of 15) and demonstrated a weak positive correlation with actual patient volume (r = 0.25). The odds of an attending physician reporting enough time for teaching declined by 21% for each point increase in composite workload score (odds ratio = 0.79 [95% confidence interval 0.69-0.91]; P = .001). The authors found that attending physicians' greater self-perceived workload was associated with decreased time for teaching.
Evaluation of the COPD Assessment Test and GOLD patient types: a cross-sectional analysis.
Lopez-Campos, Jose Luis; Fernandez-Villar, Alberto; Calero-Acuña, Carmen; Represas-Represas, Cristina; Lopez-Ramírez, Cecilia; Fernández, Virginia Leiro; Soler-Cataluña, Juan Jose; Casamor, Ricard
2015-01-01
The COPD Assessment Test (CAT) has been recently developed to quantify COPD impact in routine practice. However, no relationship with other measures in the Global Initiative for Obstructive Lung Disease (GOLD) strategy has been evaluated. The present study aimed to evaluate the relationship of the CAT with other GOLD multidimensional axes, patient types, and the number of comorbidities. This was a cross-sectional analysis of the Clinical presentation, diagnosis, and course of chronic obstructive pulmonary disease (On-Sint) study. The CAT score was administered to all participants at the inclusion visit. A GOLD 2011 strategy consisting of modified Medical Research Council scale (MRC) scores was devised to study the relationship between the CAT, and GOLD 2011 axes and patient types. The relationship with comorbidities was assessed using the Charlson comorbidity index, grouped as zero, one to two, and three or more. The CAT questionnaire was completed by 1,212 patients with COPD. The CAT maintained a relationship with all the three axes, with a ceiling effect for dyspnea and no distinction between mild and moderate functional impairment. The CAT score increased across GOLD 2011 patient types A-D, with similar scores for types B and C. Within each GOLD 2011 patient type, there was a considerably wide distribution of CAT values. Our study indicates a correlation between CAT and the GOLD 2011 classification axes as well as the number of comorbidities. The CAT score can help clinicians, as a complementary tool to evaluate patients with COPD within the different GOLD patient types.
Donini, Lorenzo Maria; Rosano, Aldo; Di Lazzaro, Luca; Poggiogalle, Eleonora; Lubrano, Carla; Migliaccio, Silvia; Carbonelli, Mariagrazia; Pinto, Alessandro; Lenzi, Andrea
2017-05-15
Obesity is associated to increased risk of metabolic comorbidity as well as increased mortality. Notably, obesity is also associated to the impairment of the psychological status and of quality of life. Only three questionnaires are available in the Italian language evaluating the health-related quality of life in subjects with obesity. The aim of the present study was to test the validity and reliability of the Italian version of the Laval Questionnaire. The original French version was translated into Italian and back-translated by a French native speaker. 273 subjects with obesity (Body Mass Index ≥ 30 kg/m 2 ) were enrolled; the Italian version of the Laval Questionnaire and the O.R.Well-97 questionnaire were administered in order to assess health- related quality of life. The Laval questionnaire consists of 44 items distributed in 6 domains (symptoms, activity/mobility, personal hygiene/clothing, emotions, social interaction, sexual life). Disability and overall psychopathology levels were assessed through the TSD-OC test (SIO test for obesity correlated disabilities) and the SCL-90 (Symptom Checklist-90) questionnaire, respectively. To verify the validity of the Italian version, the analysis of internal consistency, test-retest reliability, and construct validity were performed. The observed proportion of agreement concordance of results was 50.2% with Cohen's K = 0.336 (CI 95%: 0.267-0.404), indicating a fair agreement between the two tests. Test-retest correlation was statistically significant (ρ = 0.82; p < 0.01); validity (standardized Chronbach's alpha) was considered reliable (α > 0.70). The analysis of construct validity showed a statistically significant association in terms of both total score (ρ = -0.66) and scores at each single domain (p < 0.01). A high correlation (p < 0.01) was observed between Laval questionnaire total and single domain scores and other related measures (Body Mass Index, TSD-OC scores, SCL-90 global severity index), revealing a high construct validity of the test. The Italian version of the Laval Questionnaire is a valid and reliable measure to assess the health-related quality of life in subjects with obesity.
Christensen, Stacy
2014-01-01
An experimental study was conducted using a 2-group randomized control pretest/ posttest design to determine if knowledge about Pap testing could be increased through use of a nurse-designed mobile smartphone app developed to educate individuals about the Pap test. A 14-item pretest survey of knowledge about Pap tests was distributed to women attending a university in New England. Participants in the intervention group were provided with an Android device on which a digital health education application on Pap testing had been downloaded. The control group was given a standard pamphlet on Pap testing., Paired t test results demonstrated that knowledge scores on the posttest increased significantly in both groups, but were significantly higher in the intervention group. User satisfaction with the app was high. The results of this study may enhance nursing care by informing nurses about a unique way of learning about Pap testing to recommend to patients.
Testing homogeneity in Weibull-regression models.
Bolfarine, Heleno; Valença, Dione M
2005-10-01
In survival studies with families or geographical units it may be of interest testing whether such groups are homogeneous for given explanatory variables. In this paper we consider score type tests for group homogeneity based on a mixing model in which the group effect is modelled as a random variable. As opposed to hazard-based frailty models, this model presents survival times that conditioned on the random effect, has an accelerated failure time representation. The test statistics requires only estimation of the conventional regression model without the random effect and does not require specifying the distribution of the random effect. The tests are derived for a Weibull regression model and in the uncensored situation, a closed form is obtained for the test statistic. A simulation study is used for comparing the power of the tests. The proposed tests are applied to real data sets with censored data.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Parrado, G., E-mail: gparrado@sgc.gov.co; Cañón, Y.; Peña, M., E-mail: mlpena@sgc.gov.co
The Neutron Activation Analysis (NAA) laboratory at the Colombian Geological Survey has developed a technique for multi-elemental analysis of soil and plant matrices, based on Instrumental Neutron Activation Analysis (INAA) using the comparator method. In order to evaluate the analytical capabilities of the technique, the laboratory has been participating in inter-comparison tests organized by Wepal (Wageningen Evaluating Programs for Analytical Laboratories). In this work, the experimental procedure and results for the multi-elemental analysis of four soil and four plant samples during participation in the first round on 2015 of Wepal proficiency test are presented. Only elements with radioactive isotopes withmore » medium and long half-lives have been evaluated, 15 elements for soils (As, Ce, Co, Cr, Cs, Fe, K, La, Na, Rb, Sb, Sc, Th, U and Zn) and 7 elements for plants (Br, Co, Cr, Fe, K, Na and Zn). The performance assessment by Wepal based on Z-score distributions showed that most results obtained |Z-scores| ≤ 3.« less
Decreasing assault occurrence on a psychogeriatric ward: an agitation management model.
Savage, Troy; Crawford, Ian; Nashed, Yousery
2004-05-01
An agitation management model providing staff education, quantitative assessment of agitation, and emphasized psychosocial interventions was introduced on a geriatric psychiatry ward for male patients. A within-subjects comparison was made of Cohen-Mansfield Agitation Inventory (CMAI) scores and frequency of committing assault under pre- and post-intervention conditions. Among participants (N = 8) who finished the 72-week study, CMAI scores did not differ significantly under either of the study conditions (p > .05, two-tailed t test). Twenty-nine assaults occurred during the pre-intervention time period and six assaults occurred during the post-intervention time period. According to analysis with the Wilcoxon signed ranks test, the distribution of assaults differed significantly between the two time periods (p < .05, two-tailed). Among individuals who were excluded from the intervention because of lack of consent, assaults increased over the same two time periods. Psychosocial interventions intended to reduce agitation among elderly men with dementia may not necessarily serve to decrease agitation, but may serve to decrease assault occurrence.
NASA Astrophysics Data System (ADS)
Parrado, G.; Cañón, Y.; Peña, M.; Sierra, O.; Porras, A.; Alonso, D.; Herrera, D. C.; Orozco, J.
2016-07-01
The Neutron Activation Analysis (NAA) laboratory at the Colombian Geological Survey has developed a technique for multi-elemental analysis of soil and plant matrices, based on Instrumental Neutron Activation Analysis (INAA) using the comparator method. In order to evaluate the analytical capabilities of the technique, the laboratory has been participating in inter-comparison tests organized by Wepal (Wageningen Evaluating Programs for Analytical Laboratories). In this work, the experimental procedure and results for the multi-elemental analysis of four soil and four plant samples during participation in the first round on 2015 of Wepal proficiency test are presented. Only elements with radioactive isotopes with medium and long half-lives have been evaluated, 15 elements for soils (As, Ce, Co, Cr, Cs, Fe, K, La, Na, Rb, Sb, Sc, Th, U and Zn) and 7 elements for plants (Br, Co, Cr, Fe, K, Na and Zn). The performance assessment by Wepal based on Z-score distributions showed that most results obtained |Z-scores| ≤ 3.
Conditional Standard Errors of Measurement for Composite Scores Using IRT
ERIC Educational Resources Information Center
Kolen, Michael J.; Wang, Tianyou; Lee, Won-Chan
2012-01-01
Composite scores are often formed from test scores on educational achievement test batteries to provide a single index of achievement over two or more content areas or two or more item types on that test. Composite scores are subject to measurement error, and as with scores on individual tests, the amount of error variability typically depends on…
Evaluation of Low-Voltage Distribution Network Index Based on Improved Principal Component Analysis
NASA Astrophysics Data System (ADS)
Fan, Hanlu; Gao, Suzhou; Fan, Wenjie; Zhong, Yinfeng; Zhu, Lei
2018-01-01
In order to evaluate the development level of the low-voltage distribution network objectively and scientifically, chromatography analysis method is utilized to construct evaluation index model of low-voltage distribution network. Based on the analysis of principal component and the characteristic of logarithmic distribution of the index data, a logarithmic centralization method is adopted to improve the principal component analysis algorithm. The algorithm can decorrelate and reduce the dimensions of the evaluation model and the comprehensive score has a better dispersion degree. The clustering method is adopted to analyse the comprehensive score because the comprehensive score of the courts is concentrated. Then the stratification evaluation of the courts is realized. An example is given to verify the objectivity and scientificity of the evaluation method.
Analytical workflow profiling gene expression in murine macrophages
Nixon, Scott E.; González-Peña, Dianelys; Lawson, Marcus A.; McCusker, Robert H.; Hernandez, Alvaro G.; O’Connor, Jason C.; Dantzer, Robert; Kelley, Keith W.
2015-01-01
Comprehensive and simultaneous analysis of all genes in a biological sample is a capability of RNA-Seq technology. Analysis of the entire transcriptome benefits from summarization of genes at the functional level. As a cellular response of interest not previously explored with RNA-Seq, peritoneal macrophages from mice under two conditions (control and immunologically challenged) were analyzed for gene expression differences. Quantification of individual transcripts modeled RNA-Seq read distribution and uncertainty (using a Beta Negative Binomial distribution), then tested for differential transcript expression (False Discovery Rate-adjusted p-value < 0.05). Enrichment of functional categories utilized the list of differentially expressed genes. A total of 2079 differentially expressed transcripts representing 1884 genes were detected. Enrichment of 92 categories from Gene Ontology Biological Processes and Molecular Functions, and KEGG pathways were grouped into 6 clusters. Clusters included defense and inflammatory response (Enrichment Score = 11.24) and ribosomal activity (Enrichment Score = 17.89). Our work provides a context to the fine detail of individual gene expression differences in murine peritoneal macrophages during immunological challenge with high throughput RNA-Seq. PMID:25708305
Distributed neural system for emotional intelligence revealed by lesion mapping.
Barbey, Aron K; Colom, Roberto; Grafman, Jordan
2014-03-01
Cognitive neuroscience has made considerable progress in understanding the neural architecture of human intelligence, identifying a broadly distributed network of frontal and parietal regions that support goal-directed, intelligent behavior. However, the contributions of this network to social and emotional aspects of intellectual function remain to be well characterized. Here we investigated the neural basis of emotional intelligence in 152 patients with focal brain injuries using voxel-based lesion-symptom mapping. Latent variable modeling was applied to obtain measures of emotional intelligence, general intelligence and personality from the Mayer, Salovey, Caruso Emotional Intelligence Test (MSCEIT), the Wechsler Adult Intelligence Scale and the Neuroticism-Extroversion-Openness Inventory, respectively. Regression analyses revealed that latent scores for measures of general intelligence and personality reliably predicted latent scores for emotional intelligence. Lesion mapping results further indicated that these convergent processes depend on a shared network of frontal, temporal and parietal brain regions. The results support an integrative framework for understanding the architecture of executive, social and emotional processes and make specific recommendations for the interpretation and application of the MSCEIT to the study of emotional intelligence in health and disease.
Distributed neural system for emotional intelligence revealed by lesion mapping
Colom, Roberto; Grafman, Jordan
2014-01-01
Cognitive neuroscience has made considerable progress in understanding the neural architecture of human intelligence, identifying a broadly distributed network of frontal and parietal regions that support goal-directed, intelligent behavior. However, the contributions of this network to social and emotional aspects of intellectual function remain to be well characterized. Here we investigated the neural basis of emotional intelligence in 152 patients with focal brain injuries using voxel-based lesion-symptom mapping. Latent variable modeling was applied to obtain measures of emotional intelligence, general intelligence and personality from the Mayer, Salovey, Caruso Emotional Intelligence Test (MSCEIT), the Wechsler Adult Intelligence Scale and the Neuroticism-Extroversion-Openness Inventory, respectively. Regression analyses revealed that latent scores for measures of general intelligence and personality reliably predicted latent scores for emotional intelligence. Lesion mapping results further indicated that these convergent processes depend on a shared network of frontal, temporal and parietal brain regions. The results support an integrative framework for understanding the architecture of executive, social and emotional processes and make specific recommendations for the interpretation and application of the MSCEIT to the study of emotional intelligence in health and disease. PMID:23171618
Makkar, Steve R; Williamson, Anna; D'Este, Catherine; Redman, Sally
2017-12-19
Few measures of research use in health policymaking are available, and the reliability of such measures has yet to be evaluated. A new measure called the Staff Assessment of Engagement with Evidence (SAGE) incorporates an interview that explores policymakers' research use within discrete policy documents and a scoring tool that quantifies the extent of policymakers' research use based on the interview transcript and analysis of the policy document itself. We aimed to conduct a preliminary investigation of the usability, sensitivity, and reliability of the scoring tool in measuring research use by policymakers. Nine experts in health policy research and two independent coders were recruited. Each expert used the scoring tool to rate a random selection of 20 interview transcripts, and each independent coder rated 60 transcripts. The distribution of scores among experts was examined, and then, interrater reliability was tested within and between the experts and independent coders. Average- and single-measure reliability coefficients were computed for each SAGE subscales. Experts' scores ranged from the limited to extensive scoring bracket for all subscales. Experts as a group also exhibited at least a fair level of interrater agreement across all subscales. Single-measure reliability was at least fair except for three subscales: Relevance Appraisal, Conceptual Use, and Instrumental Use. Average- and single-measure reliability among independent coders was good to excellent for all subscales. Finally, reliability between experts and independent coders was fair to excellent for all subscales. Among experts, the scoring tool was comprehensible, usable, and sensitive to discriminate between documents with varying degrees of research use. Secondly, the scoring tool yielded scores with good reliability among the independent coders. There was greater variability among experts, although as a group, the tool was fairly reliable. The alignment between experts' and independent coders' ratings indicates that the independent coders were scoring in a manner comparable to health policy research experts. If the present findings are replicated in a larger sample, end users (e.g. policy agency staff) could potentially be trained to use SAGE to reliably score research use within their agencies, which would provide a cost-effective and time-efficient approach to utilising this measure in practice.
2011-01-01
to include body mass index or the presence of gestational diabetes and other medical conditions. However, to our knowledge the strongest predictor of...The distribution and predictive value of Bishop scores in nulliparas between 37 and 42 weeks gestation PETER E. NIELSEN, BOBBY C. HOWARD, TAMI...accuracy of Bishop scores was evaluated to predict cesarean delivery (CD) in nulliparas between 37 and 42 weeks gestation . Study design. Subjects underwent
A Unified Mixed-Effects Model for Rare-Variant Association in Sequencing Studies
Sun, Jianping; Zheng, Yingye; Hsu, Li
2013-01-01
For rare-variant association analysis, due to extreme low frequencies of these variants, it is necessary to aggregate them by a prior set (e.g., genes and pathways) in order to achieve adequate power. In this paper, we consider hierarchical models to relate a set of rare variants to phenotype by modeling the effects of variants as a function of variant characteristics while allowing for variant-specific effect (heterogeneity). We derive a set of two score statistics, testing the group effect by variant characteristics and the heterogeneity effect. We make a novel modification to these score statistics so that they are independent under the null hypothesis and their asymptotic distributions can be derived. As a result, the computational burden is greatly reduced compared with permutation-based tests. Our approach provides a general testing framework for rare variants association, which includes many commonly used tests, such as the burden test [Li and Leal, 2008] and the sequence kernel association test [Wu et al., 2011], as special cases. Furthermore, in contrast to these tests, our proposed test has an added capacity to identify which components of variant characteristics and heterogeneity contribute to the association. Simulations under a wide range of scenarios show that the proposed test is valid, robust and powerful. An application to the Dallas Heart Study illustrates that apart from identifying genes with significant associations, the new method also provides additional information regarding the source of the association. Such information may be useful for generating hypothesis in future studies. PMID:23483651
Davies, John R; Chang, Yu-mei; Bishop, D Timothy; Armstrong, Bruce K; Bataille, Veronique; Bergman, Wilma; Berwick, Marianne; Bracci, Paige M; Elwood, J Mark; Ernstoff, Marc S; Green, Adele; Gruis, Nelleke A; Holly, Elizabeth A; Ingvar, Christian; Kanetsky, Peter A; Karagas, Margaret R; Lee, Tim K; Le Marchand, Loïc; Mackie, Rona M; Olsson, Håkan; Østerlind, Anne; Rebbeck, Timothy R; Reich, Kristian; Sasieni, Peter; Siskind, Victor; Swerdlow, Anthony J; Titus, Linda; Zens, Michael S; Ziegler, Andreas; Gallagher, Richard P.; Barrett, Jennifer H; Newton-Bishop, Julia
2015-01-01
Background We report the development of a cutaneous melanoma risk algorithm based upon 7 factors; hair colour, skin type, family history, freckling, nevus count, number of large nevi and history of sunburn, intended to form the basis of a self-assessment webtool for the general public. Methods Predicted odds of melanoma were estimated by analysing a pooled dataset from 16 case-control studies using logistic random coefficients models. Risk categories were defined based on the distribution of the predicted odds in the controls from these studies. Imputation was used to estimate missing data in the pooled datasets. The 30th, 60th and 90th centiles were used to distribute individuals into four risk groups for their age, sex and geographic location. Cross-validation was used to test the robustness of the thresholds for each group by leaving out each study one by one. Performance of the model was assessed in an independent UK case-control study dataset. Results Cross-validation confirmed the robustness of the threshold estimates. Cases and controls were well discriminated in the independent dataset (area under the curve 0.75, 95% CI 0.73-0.78). 29% of cases were in the highest risk group compared with 7% of controls, and 43% of controls were in the lowest risk group compared with 13% of cases. Conclusion We have identified a composite score representing an estimate of relative risk and successfully validated this score in an independent dataset. Impact This score may be a useful tool to inform members of the public about their melanoma risk. PMID:25713022
Edwards, Robert D; Crisp, Michael D; Cook, Dianne H; Cook, Lyn G
2017-01-01
To test whether novel and previously hypothesized biogeogaphic barriers in the Australian Tropics represent significant disjunction points or hard barriers, or both, to the distribution of plants. Australian tropics: Australian Monsoon Tropics and Australian Wet Tropics. The presence or absence of 6,861 plant species was scored across 13 putative biogeographic barriers in the Australian Tropics, including two that have not previously been recognised. Randomizations of these data were used to test whether more species showed disjunctions (gaps in distribution) or likely barriers (range limits) at these points than expected by chance. Two novel disjunctions in the Australian Tropics flora are identified in addition to eleven putative barriers previously recognized for animals. Of these, eleven disjunction points (all within the Australian Monsoon Tropics) were found to correspond to range-ending barriers to a significant number of species, while neither of the two disjunctions found within the Australian Wet Tropics limited a significant number of species' ranges. Biogeographic barriers present significant distributional limits to native plant species in the Australian Monsoon Tropics but not in the Australian Wet Tropics.
Internet cognitive testing of large samples needed in genetic research.
Haworth, Claire M A; Harlaar, Nicole; Kovas, Yulia; Davis, Oliver S P; Oliver, Bonamy R; Hayiou-Thomas, Marianna E; Frances, Jane; Busfield, Patricia; McMillan, Andrew; Dale, Philip S; Plomin, Robert
2007-08-01
Quantitative and molecular genetic research requires large samples to provide adequate statistical power, but it is expensive to test large samples in person, especially when the participants are widely distributed geographically. Increasing access to inexpensive and fast Internet connections makes it possible to test large samples efficiently and economically online. Reliability and validity of Internet testing for cognitive ability have not been previously reported; these issues are especially pertinent for testing children. We developed Internet versions of reading, language, mathematics and general cognitive ability tests and investigated their reliability and validity for 10- and 12-year-old children. We tested online more than 2500 pairs of 10-year-old twins and compared their scores to similar internet-based measures administered online to a subsample of the children when they were 12 years old (> 759 pairs). Within 3 months of the online testing at 12 years, we administered standard paper and pencil versions of the reading and mathematics tests in person to 30 children (15 pairs of twins). Scores on Internet-based measures at 10 and 12 years correlated .63 on average across the two years, suggesting substantial stability and high reliability. Correlations of about .80 between Internet measures and in-person testing suggest excellent validity. In addition, the comparison of the internet-based measures to ratings from teachers based on criteria from the UK National Curriculum suggests good concurrent validity for these tests. We conclude that Internet testing can be reliable and valid for collecting cognitive test data on large samples even for children as young as 10 years.
Doyle, Orla; McGlanaghy, Edel; O’Farrelly, Christine; Tremblay, Richard E.
2016-01-01
This study examined the impact of a targeted Irish early intervention program on children’s emotional and behavioral development using multiple methods to test the robustness of the results. Data on 164 Preparing for Life participants who were randomly assigned into an intervention group, involving home visits from pregnancy onwards, or a control group, was used to test the impact of the intervention on Child Behavior Checklist scores at 24-months. Using inverse probability weighting to account for differential attrition, permutation testing to address small sample size, and quantile regression to characterize the distributional impact of the intervention, we found that the few treatment effects were largely concentrated among boys most at risk of developing emotional and behavioral problems. The average treatment effect identified a 13% reduction in the likelihood of falling into the borderline clinical threshold for Total Problems. The interaction and subgroup analysis found that this main effect was driven by boys. The distributional analysis identified a 10-point reduction in the Externalizing Problems score for boys at the 90th percentile. No effects were observed for girls or for the continuous measures of Total, Internalizing, and Externalizing problems. These findings suggest that the impact of this prenatally commencing home visiting program may be limited to boys experiencing the most difficulties. Further adoption of the statistical methods applied here may help to improve the internal validity of randomized controlled trials and contribute to the field of evaluation science more generally. Trial Registration: ISRCTN Registry ISRCTN04631728 PMID:27253184
Dias, Raylene; Baliarsing, Lipika; Barnwal, Neeraj Kumar; Mogal, Shweta; Gujjar, Pinakin
2016-01-01
Background and Aims: A high incidence of anxiety has been reported in patients in the operation theatre set up. We developed a short visual clip of 206 s duration depicting the procedure of spinal anaesthesia (SAB) and aimed to compare the effect of this video on perioperative anxiety in patients undergoing procedures under SAB. Methods: A prospective randomised study of 200 patients undergoing surgery under SAB was conducted. Patients were allotted to either the nonvideo group (Group NV - those who were not shown the video) or the video group (Group V - those who were shown the video). Anxiety was assessed using the Spielberger State-Trait Anxiety Inventory during the pre-anaesthetic check-up and before surgery. Haemodynamic parameters such as heart rate (HR) and mean arterial pressure (MAP) were also noted. Student's t-test was used for normally distributed and Mann–Whitney U-test for nonnormally distributed quantitative data. Chi-square test was used for categorical data. Results: Both groups were comparable with respect to baseline anxiety scores and haemodynamic parameters. The nonvideo group showed a significant increase in state anxiety scores before administration of SAB (P < 0.001). Patients in the video group had significantly lower HR and MAP preoperatively (P < 0.001). The prevalence of ‘high anxiety’ for SAB was 81% in our study which decreased to 66% in the video group before surgery. Conclusion: Multimedia information in the form of a short audiovisual clip is an effective and feasible method to reduce perioperative anxiety related to SAB. PMID:27942059
Cardiovascular disease risk in women with migraine
2013-01-01
Background Studies suggest a higher prevalence of unfavourable cardiovascular risk factors amongst migraineurs, but results have been conflicting. The aim of this study was to investigate traditional and newly recognized risk factors as well as other surrogate markers of cardiovascular risk in obese and normal weight women with migraine. Methods Fifty-nine adult female probands participated in this case–control study. The sample was divided into normal weight and obese migraineurs and age- and body mass index-matched control groups. The following cardiovascular risk factors were analyzed: serum levels of lipids, fasting glucose, and insulin; insulin resistance; blood pressure; smoking (categorized as current, past or never); Framingham 10-year risk of general cardiovascular disease score; C-reactive protein; family history of cardiovascular disease; physical activity; sleep disturbances; depression; and bioelectrical impedance phase angle. The means of continuous variables were compared using Student’s t-test for independent samples or the Mann–Whitney U-test (for 2 groups) and ANOVA or the Kruskal-Wallis test (for 4 groups) depending on the distribution of data. Results All migraineurs were sedentary irrespective of nutritional status. Migraineurs had higher depression scores and shorter sleep duration, and obese migraineurs, in particular, had worse sleep quality scores. Insulin resistance and insulinaemia were associated with obesity, and obese migraineurs had lower HDL-c than normal weight controls and migraineurs. Also, the Framingham risk score was higher in obese migraineurs. Conclusion These findings suggest that female migraineurs experience marked inactivity, depression, and some sleep disturbance, that higher insulin resistance and insulinaemia are related to obesity, and that obesity and migraine probably exert overlapping effects on HDL-c levels and Framingham 10-year cardiovascular risk. PMID:24011175
Minakuchi, Hajime; Sogawa, Chiharu; Hara, Emilio Satoshi; Miki, Haruna; Maekawa, Kenji; Sogawa, Norio; Kitayama, Shigeo; Matsuka, Yoshizo; Clark, Glenn T; Kuboki, Takuo
2014-10-01
The aim of this study was to evaluate the correlation between sleep bruxism (SB) frequency and serotonin transporter (SERT)-driven serotonin (5-HT)-uptake in platelets. Subjects were dental trainee residents and faculty members of Okayama University Hospital who were aware of having severe or no SB. SB frequency was assessed for 3-consecutive nights by a self-contained electromyographic detector/analyzer, which indicated individual SB levels as one of four grades (score 0, 1, 2 and 3). Subjects were classified as normal control (NC) when SB scores indicated only 0 or 1 during the 3 nights, or as severe SB for scores 2 or 3. Those subjects whose scores fluctuated from 0 to 3 during the 3 nights were omitted from further analysis. Fasting peripheral venous blood samples were collected in the morning following the final SB assessment. Amounts of SERTs proteins collected from peripheral platelets were quantified using ELISA, and SERTs transport activity was assessed by uptake assay using [3H]-5-HT. Thirteen severe SB subjects and 7 NC subjects were eligible. Gender distribution, mean age, 5-HT concentration and total amounts of SERT protein in platelets showed no significant differences between NC and severe SB (p=0.85: Chi-squared test; p=0.64, 0.26, 0.46: t-test). However, [3H]-5-HT uptake by platelets was significantly greater in NC compared to severe SB subjects (12.79±1.97, 8.27±1.91 fmol/10(5) platelets/min, p<0.001, t-test). The results of this pilot study suggest a possible correlation between peripheral platelet serotonin transporter uptake ability and SB severity. Copyright © 2014 Japan Prosthodontic Society. Published by Elsevier Ltd. All rights reserved.
Rosselli, M; Ardila, A; Bateman, J R; Guzmán, M
2001-01-01
Limited information is currently available about performance of Spanish-speaking children on different neuropsychological tests. This study was designed to (a) analyze the effects of age and sex on different neuropsychological test scores of a randomly selected sample of Spanish-speaking children, (b) analyze the value of neuropsychological test scores for predicting school performance, and (c) describe the neuropsychological profile of Spanish-speaking children with learning disabilities (LD). Two hundred ninety (141 boys, 149 girls) 6- to 11-year-old children were selected from a school in Bogotá, Colombia. Three age groups were distinguished: 6- to 7-, 8- to 9-, and 10- to 11-year-olds. Performance was measured utilizing the following neuropsychological tests: Seashore Rhythm Test, Finger Tapping Test (FTT), Grooved Pegboard Test, Children's Category Test (CCT), California Verbal Learning Test-Children's Version (CVLT-C), Benton Visual Retention Test (BVRT), and Bateria Woodcock Psicoeducativa en Español (Woodcock, 1982). Normative scores were calculated. Age effect was significant for most of the test scores. A significant sex effect was observed for 3 test scores. Intercorrelations were performed between neuropsychological test scores and academic areas (science, mathematics, Spanish, social studies, and music). In a post hoc analysis, children presenting very low scores on the reading, writing, and arithmetic achievement scales of the Woodcock battery were identified in the sample, and their neuropsychological test scores were compared with a matched normal group. Finally, a comparison was made between Colombian and American norms.
Zahiruddin, Kowser; Banu, Shaj; Dharmarajan, Ramya; Kulothungan, Vaitheeswaran; Vijayan, Deepa; Raman, Rajiv; Sharma, Tarun
2010-06-01
To evaluate a customized, portable Farnsworth-Munsell 100 (FM 100) hue viewing booth for compliance with colour vision testing standards and to compare it with room illumination in subjects with normal colour vision (trichromats), subjects with acquired colour vision defects (secondary to diabetes mellitus), and subjects with congenital colour vision defects (dichromats). Discrete wavelengths of the tube in the customized booth were measured using a spectrometer using the normal incident method and were compared with the spectral distribution of sunlight. Forty-eight subjects were recruited for the study and were divided into 3 groups: Group 1, Normal Trichromats (30 eyes); Group 2, Congenital Colour Vision Defects (16 eyes); and Group 3, Diabetes Mellitus (20 eyes). The FM 100 hue test performance was compared using two illumination conditions, booth illumination and room illumination. Total error scores of the classical method in Group 2 as mean+/-SD for room and booth illumination was 243.05+/-85.96 and 149.85+/-54.50 respectively (p=0.0001). Group 2 demonstrated lesser correlation (r=0.50, 0.55), lesser reliability (Cronbach's alpha, 0.625, 0.662) and greater variability (Bland & Altman value, 10.5) in total error scores for the classical method and the moment of inertia method between the two illumination conditions when compared to the other two groups. The customized booth demonstrated illumination meeting CIE standards. The total error scores were overestimated by the classical and moment of inertia methods in all groups for room illumination compared with booth illumination, however overestimation was more significant in the diabetes group.
Dufour, Simon; Latour, Sylvie; Chicoine, Yvan; Fecteau, Gilles; Forget, Sylvain; Moreau, Jean; Trépanier, André
2012-01-01
A script concordance test (SCT) was developed measuring clinical reasoning of food-ruminant practitioners for whom potential clinical competence difficulties were identified by their provincial professional organization. The SCT was designed to be used as part of a broader evaluation procedure. A scoring key was developed based on answers from a reference panel of 12 experts and using the modified aggregate method commonly used for SCTs. A convenient sample of 29 food-ruminant practitioners was constituted to assess the reliability and precision of the SCT and to determine a fair threshold value for success. Cronbach's α coefficients were computed to evaluate internal reliability. To evaluate SCT precision, a test-retest methodology was used and measures of agreement beyond chance were computed at question and test levels. After optimization, the 36-question SCT yielded acceptable internal reliability (Cronbach's α=0.70). Precision of the SCT at question level was excellent with 33 questions (92%) yielding moderate to almost perfect agreement between administrations. At test level, fair agreement (concordance correlation coefficient=0.32) was observed between administrations. A slight SCT score improvement (M=+2.8 points) on the second administration was in part responsible for some of the disagreement and was potentially a result of an adaptation to the SCT format. Scores distribution was used to determine a fair threshold value for success, while considering the underlying objectives of the examination. The data suggest that the developed SCT can be used as a reliable and precise measurement of clinical reasoning of food-ruminant practitioners.
Gonzalez, Carlos; Gomes, Elisabete; Kazachkova, Nadiya; Bettencourt, Conceição; Raposo, Mafalda; Kay, Teresa Taylor; MacLeod, Patrick; Vasconcelos, João; Lima, Manuela
2012-12-01
The present study on long-term outcome of presymptomatic testing for Machado-Joseph disease (MJD) aimed to evaluate the psychological well-being and the familial satisfaction of subjects that 5 years prior received an unfavorable result in the predictive testing (PT). The study included 47 testees of Azorean origin (23 from the island of Flores and 24 from S. Miguel) that completed the fourth evaluation session of the MJD protocol, and undertook a neurological examination at the moment of participation in the study. Nearly 50% of testees were symptomatic at the time of the study. Psychological well-being of the 47 participants was evaluated using the Psychological General Well-Being Index (PGWB). The family satisfaction scale by adjectives was applied to obtain information on family dynamics. The average PGWB score of the total participants was of 73.3, a value indicative of psychological well-being. Nearly half of the testees presented scores indicating psychological well-being, whereas scores indicating moderate (28.9%) or severe (23.7%) stress were found in the remaining. The average score in the PGWB scale was lower in symptomatic than in asymptomatic subjects; moreover, the distinct distribution of the well-being categories seen in the two groups shows an impact of the appearance of first symptoms on the psychological state. Motives for undertaking the test, provided 5 years prior, failed to show an impact in well-being. The average score for familial satisfaction was of 134, a value compatible with high familial satisfaction, which represented the most frequent category (59.6%). Results demonstrate that well-being and family satisfaction need to be monitored in confirmed carriers of the MJD mutation. The inclusion of acceptance studies, after PT, as well as the development of acceptance training actions, should be of major importance to anticipate the possibility of psychological damage.
ERIC Educational Resources Information Center
Powers, Donald; Schedl, Mary; Papageorgiou, Spiros
2017-01-01
The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…
Hackethal, A; Immenroth, M; Bürger, T
2006-04-01
The Minimally Invasive Surgical Trainer-Virtual Reality (MIST-VR) simulator is validated for laparoscopy training, but benchmarks and target scores for assessing single tasks are needed. Control data for the MIST-VR traversal task scenario were collected from 61 novices who performed the task 10 times over 3 days (1 h daily). Data were collected on the time taken, error score, economy of movement, and total score. Test differences were analyzed through percentage scores and t-tests for paired samples. Improvement was greatest over tests 1 to 5 (improvement: test(1.2), 38.07%; p = 0.000; test(4.5), 10.66%; p = 0.010): between tests 5 and 10, improvement slowed and scores stabilized. Variation in participants' performance fell steadily over the 10 tests. Trainees should perform at least 10 tests of the traversal task-five to get used to the equipment and task (automation phase; target total score, 95.16) and five to stabilize and consolidate performance (test 10 target total score, 74.11).
Genetic variants linked to education predict longevity
Marioni, Riccardo E.; Ritchie, Stuart J.; Joshi, Peter K.; Hagenaars, Saskia P.; Fischer, Krista; Adams, Mark J.; Hill, W. David; Davies, Gail; Nagy, Reka; Amador, Carmen; Läll, Kristi; Metspalu, Andres; Liewald, David C.; Wilson, James F.; Hayward, Caroline; Esko, Tõnu; Porteous, David J.; Gale, Catharine R.; Deary, Ian J.
2016-01-01
Educational attainment is associated with many health outcomes, including longevity. It is also known to be substantially heritable. Here, we used data from three large genetic epidemiology cohort studies (Generation Scotland, n = ∼17,000; UK Biobank, n = ∼115,000; and the Estonian Biobank, n = ∼6,000) to test whether education-linked genetic variants can predict lifespan length. We did so by using cohort members’ polygenic profile score for education to predict their parents’ longevity. Across the three cohorts, meta-analysis showed that a 1 SD higher polygenic education score was associated with ∼2.7% lower mortality risk for both mothers (total ndeaths = 79,702) and ∼2.4% lower risk for fathers (total ndeaths = 97,630). On average, the parents of offspring in the upper third of the polygenic score distribution lived 0.55 y longer compared with those of offspring in the lower third. Overall, these results indicate that the genetic contributions to educational attainment are useful in the prediction of human longevity. PMID:27799538
Genetic variants linked to education predict longevity.
Marioni, Riccardo E; Ritchie, Stuart J; Joshi, Peter K; Hagenaars, Saskia P; Okbay, Aysu; Fischer, Krista; Adams, Mark J; Hill, W David; Davies, Gail; Nagy, Reka; Amador, Carmen; Läll, Kristi; Metspalu, Andres; Liewald, David C; Campbell, Archie; Wilson, James F; Hayward, Caroline; Esko, Tõnu; Porteous, David J; Gale, Catharine R; Deary, Ian J
2016-11-22
Educational attainment is associated with many health outcomes, including longevity. It is also known to be substantially heritable. Here, we used data from three large genetic epidemiology cohort studies (Generation Scotland, n = ∼17,000; UK Biobank, n = ∼115,000; and the Estonian Biobank, n = ∼6,000) to test whether education-linked genetic variants can predict lifespan length. We did so by using cohort members' polygenic profile score for education to predict their parents' longevity. Across the three cohorts, meta-analysis showed that a 1 SD higher polygenic education score was associated with ∼2.7% lower mortality risk for both mothers (total n deaths = 79,702) and ∼2.4% lower risk for fathers (total n deaths = 97,630). On average, the parents of offspring in the upper third of the polygenic score distribution lived 0.55 y longer compared with those of offspring in the lower third. Overall, these results indicate that the genetic contributions to educational attainment are useful in the prediction of human longevity.
Faust, Kevin; Xie, Quin; Han, Dominick; Goyle, Kartikay; Volynskaya, Zoya; Djuric, Ugljesa; Diamandis, Phedias
2018-05-16
There is growing interest in utilizing artificial intelligence, and particularly deep learning, for computer vision in histopathology. While accumulating studies highlight expert-level performance of convolutional neural networks (CNNs) on focused classification tasks, most studies rely on probability distribution scores with empirically defined cutoff values based on post-hoc analysis. More generalizable tools that allow humans to visualize histology-based deep learning inferences and decision making are scarce. Here, we leverage t-distributed Stochastic Neighbor Embedding (t-SNE) to reduce dimensionality and depict how CNNs organize histomorphologic information. Unique to our workflow, we develop a quantitative and transparent approach to visualizing classification decisions prior to softmax compression. By discretizing the relationships between classes on the t-SNE plot, we show we can super-impose randomly sampled regions of test images and use their distribution to render statistically-driven classifications. Therefore, in addition to providing intuitive outputs for human review, this visual approach can carry out automated and objective multi-class classifications similar to more traditional and less-transparent categorical probability distribution scores. Importantly, this novel classification approach is driven by a priori statistically defined cutoffs. It therefore serves as a generalizable classification and anomaly detection tool less reliant on post-hoc tuning. Routine incorporation of this convenient approach for quantitative visualization and error reduction in histopathology aims to accelerate early adoption of CNNs into generalized real-world applications where unanticipated and previously untrained classes are often encountered.
Estimating Total-Test Scores from Partial Scores in a Matrix Sampling Design.
ERIC Educational Resources Information Center
Sachar, Jane; Suppes, Patrick
1980-01-01
The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students and 60 items of the 110-item Stanford Mental Arithmetic Test. Three methods yielded fairly good estimates of the total-test score. (Author/RL)
Porto, C L Lascasas; Milhomens, A L M; Pires, C E; Xavier, S Salles; Sicuro, F; Bottino, D A; Bouskela, E
2009-06-01
To evaluate changes on venous diameter and perimeter of lower limbs in chronic venous disorder (CVD) patients after different clinical treatments for four weeks. Fifty-two female patients classified as C2,s or C2,3,s (CEAP classification) were allocated consecutively in three groups: Cirkan (40 mg of the root extract of Ruscus aculeatus + 100 mg of flavonoid hesperidine methylchalcone + 200 mg of vitamin C per pill); elastic compression stockings (ECS) and no treatment (NT). Diameters were determined by duplex ultrasound and perimeter with Leg-O-Meter. After treatment, Cirkan significantly decreased popliteal vein and great saphenous vein (GSV) diameters bilaterally and ECS decreased popliteal vein diameter bilaterally and GSV and varices only on the left limb. Perimeters changed only with ECS. Clinical scores changed between Cirkan x NT and ECS x Cirkan. Disability score varied for ECS x NT and Cirkan x NT. chi2 test detected different distribution frequency for C3 and C2 classes according to treatment: ECS (both limbs) and Cirkan (only left limb). Varices and anatomical scores did not change. ECS emerges as the most effective clinical treatment tested but improvements with Cirkan on vein diameter and CEAP class were also observed. Clinical scores improved due to pain relief and edema reduction (ECS). These findings point to a positive effect of Cirkan, suggesting that venotonic drugs should be taken into account in the treatment of CVD.
Weaver, K F; Morales, V; Nelson, M; Weaver, P F; Toledo, A; Godde, K
2016-01-01
This study examines the relationship between the introduction of a four-course writing-intensive capstone series and improvement in inquiry and analysis skills of biology senior undergraduates. To measure the impact of the multicourse write-to-learn and peer-review pedagogy on student performance, we used a modified Valid Assessment of Learning in Undergraduate Education rubric for Inquiry and Analysis and Written Communication to score senior research theses from 2006 to 2008 (pretreatment) and 2009 to 2013 (intervention). A Fisher-Freeman-Halton test and a two-sample Student's t test were used to evaluate individual rubric dimensions and composite rubric scores, respectively, and a randomized complete block design analysis of variance was carried out on composite scores to examine the impact of the intervention across ethnicity, legacy (e.g., first-generation status), and research laboratory. The results show an increase in student performance in rubric scoring categories most closely associated with science literacy and critical-thinking skills, in addition to gains in students' writing abilities. © 2016 K. F. Weaver et al. CBE—Life Sciences Education © 2016 The American Society for Cell Biology. This article is distributed by The American Society for Cell Biology under license from the author(s). It is available to the public under an Attribution–Noncommercial–Share Alike 3.0 Unported Creative Commons License (http://creativecommons.org/licenses/by-nc-sa/3.0).
Usami, Masahide; Iwadare, Yoshitaka; Watanabe, Kyota; Kodaira, Masaki; Ushijima, Hirokage; Tanaka, Tetsuya; Harada, Maiko; Tanaka, Hiromi; Sasaki, Yoshinori; Saito, Kazuhiko
2014-01-01
Background On March 11, 2011, Japan was struck by a massive earthquake and tsunami. The tsunami caused tremendous damage and traumatized several people, including children. The aim of this study was to assess changes in traumatic symptoms 8, 20, and 30 months of the 2011 tsunami. Methods The study comprised three groups. Copies of the Post-Traumatic Stress Symptoms for Children 15 items (PTSSC-15), a self-rating questionnaire on traumatic symptoms, were distributed to 12,524 children (8-month period), 12,193 children (20-month period), and 11,819 children (30-month period). An effective response of children 8 months, 20 months, and 30 month after the disaster was obtained in 11,639 (92.9%), 10,597 (86.9%), and 10,812 children (91.4%), respectively. We calculated the total score, PTSD subscale, and Depression subscale of PTSSC-15. We calculated the total score, PTSD subscale, and Depression subscale of PTSSC-15. Results The PTSSC-15 total score and PTSD subscale of children belonging to 1st–9th grade groups who were tested 30 and 20 months after the tsunami significantly decreased compared with those of children tested 8 months after the tsunami. The PTSSC-15 total score and PTSD subscale of children in 1st–9th grade groups tested after 30 months did not decrease significantly compared with those of children tested after 20 months. The PTSSC-15 Depression subscale and PTSD subscale of children in 1st–9th grade groups tested after 30 months significantly decreased compared with those of children tested 8 months after the tsunami. The PTSSC-15 Depression subscale of children in 1st–9th grade groups evaluated after 30 months significantly decreased compared with those of children evaluated after 20 months. Conclusions This study demonstrates that the traumatic symptoms of children who survived the massive tsunami improved with time. Nonetheless, the traumatic symptoms, which in some cases did not improve with time. PMID:25340759
Girão, V C C; Nunes-Pinheiro, D C S; Morais, S M; Sequeira, J L; Gioso, M A
2003-05-30
We evaluated the effect of a mouth-rinse prepared using Lippia sidoides essential oil (EO) in dogs with marginal gingivitis. German Shepherd dogs were distributed in two groups: control (control mouth-rinse) and EO (EO mouth-rinse). Both mouth-rinses were applied on the dogs' teeth every 2 days for 2 weeks. At day 0 and day 15, the scores for plaque-bacteria (P), calculus (C), gingivitis (G) and the inflammatory infiltrate (INF) were evaluated blindly. The results were analyzed by the Wilcoxon signed-rank and Mann-Whitney tests (P=0.05). P, C, G, and INF did not show any alteration in the control group, while in the EO group there were significant reductions in these scores.
Experience of handicap and anxiety in phobic postural vertigo.
Holmberg, Johan; Karlberg, Mikael; Harlacher, Uwe; Magnusson, Mans
2005-03-01
We found a difference in gender distribution in a population of phobic postural vertigo patients compared with dizzy patients seen in general neuro-otological practice. It appears as if women with phobic postural vertigo suffer more and are more handicapped by dizziness than both men with phobic postural vertigo and a population with dizziness. These differences may reflect other causes of phobic postural vertigo besides anxiety, such as gender-related coping behaviour and postural strategy. Anxiety influences the degree of suffering and handicap in dizzy patients. Experiences of anxiety and handicap were investigated among a population with phobic postural vertigo. Using the Dizziness Handicap Inventory, the Vertigo Symptom Scale and the Vertigo Handicap Questionnaire, 34 consecutive patients with phobic postural vertigo were compared with a population of 95 consecutive patients seen at a balance disorder clinic. Patients with phobic postural vertigo scored higher than the control subjects with respect to all parameters with the exception of the physical subscale of the Dizziness Handicap Inventory. Because there were significantly more women in the control group we performed a gender-specific analysis of the results. The higher test scores among patients with phobic postural vertigo can be explained by the higher scores among women in this group, while the test results for men were more similar to those of the control group.
Radha, G; Swathi, V; Jha, Abhishek
2016-01-01
This study explores the association of disabilities and oral health. The aim of the study was to assess the salivary and plaque pH and oral health status of children with and without disabilities. A total of 100 schoolchildren (50 with disabilities and 50 without disabilities) were examined from 9 to 15 years age group. Saliva and plaque pH analysis were done to both the groups. Clinical data were collected on periodontal status, dental caries using WHO criteria. pH values of different groups, difference between the means were calculated using independent t-test, and frequency distribution was analyzed using Chi-square test. Statistical significance, P value was set at 0.05. Mean plaque and salivary pH scores were lesser (5.73 and 5.67) in children with intellectual disabilities (IDs) (P< 0.001). Subjects with disabilities had also statistically significant higher CPI scores and decayed, missing, and filled scores than their healthy counterparts (P< 0.001). There is a statistically significant difference in plaque and salivary pH among children with and without ID with lower plaque and salivary pH among children with ID. In addition to this, the oral health was also more compromised in children with ID, which confirms a need for preventive treatment for these children.
Li, Kuan-Yi; Lin, Keh-Chung; Wang, Tien-Ni; Wu, Ching-Yi; Huang, Yan-Hua; Ouyang, Pei
2012-01-01
This investigation examined the demographic characteristics along with 3 measures of motor function in determining outcomes in activities of daily living (ADL) after distributed constraint-induced therapy (dCIT). The study recruited 69 stroke patients who received 3 weeks of dCIT for 2 hours daily, 5 days a week. The self-reported outcome measures for daily function were the Motor Activity Log (MAL) including the amount of use (AOU) and quality of movement (QOM), Nottingham Extended Activities of Daily Living Questionnaire (NEADL), and the Stroke Impact Scale (SIS). Age, sex, onset, side of stroke, Fugl-Meyer assessment (FMA), Wolf Motor Function Test (WMFT), and Action Research Arm Test (ARAT) were the potential predictors. The ARAT grasp-grip-pinch score was the most dominant predictor for MAL-AOU and NEADL (P< 0.05), and the ARAT total score for the subscore of the ADL/instrumental ADL section of the SIS (P< 0.05). The FMA wrist-hand score was a significant predictor for MAL-QOM (P< 0.05). Age was the only demographic factor that significantly predicted NEADL performance (P< 0.05). Among the 3 commonly used measures of motor function after stroke, ARAT was the strongest determinant in predicting MAL-AOU, MAL-QOM, and SIS-ADL/instrumental ADL after dCIT.
Bright, Peter; Hale, Emily; Gooch, Vikki Jayne; Myhill, Thomas; van der Linde, Ian
2018-09-01
Since publication in 1982, the 50-item National Adult Reading Test (NART; Nelson, 1982; NART-R; Nelson & Willison, 1991) has remained a widely adopted method for estimating premorbid intelligence both for clinical and research purposes. However, the NART has not been standardised against the most recent revisions of the Wechsler Adult Intelligence Scale (WAIS-III; Wechsler, 1997, and WAIS-IV; Wechsler, 2008). Our objective, therefore, was to produce reliable standardised estimates of WAIS-IV IQ from the NART. Ninety-two neurologically healthy British adults were assessed and regression equations calculated to produce population estimates of WAIS-IV full-scale IQ (FSIQ) and constituent index scores. Results showed strong NART/WAIS-IV FSIQ correlations with more moderate correlations observed between NART error and constituent index scores. FSIQ estimates were closely similar to the published WAIS and WAIS-R estimates at the high end of the distribution, but at the lower end were approximately equidistant from the highly discrepant WAIS (low) and WAIS-R (high) values. We conclude that the NART is likely to remain an important tool for estimating the impact of neurological damage on general cognitive ability. We advise caution in the use of older published WAIS and/or WAIS-R estimates for estimating premorbid WAIS-IV FSIQ, particularly for those with low NART scores.
A validation study of the psychometric properties of the Groningen Reflection Ability Scale.
Andersen, Nina Bjerre; O'Neill, Lotte; Gormsen, Lise Kirstine; Hvidberg, Line; Morcke, Anne Mette
2014-10-10
Reflection, the ability to examine critically one's own learning and functioning, is considered important for 'the good doctor'. The Groningen Reflection Ability Scale (GRAS) is an instrument measuring student reflection, which has not yet been validated beyond the original Dutch study. The aim of this study was to adapt GRAS for use in a Danish setting and to investigate the psychometric properties of GRAS-DK. We performed a cross-cultural adaptation of GRAS from Dutch to Danish. Next, we collected primary data online, performed a retest, analysed data descriptively, estimated measurement error, performed an exploratory and a confirmatory factor analysis to test the proposed three-factor structure. 361 (69%) of 523 invited students completed GRAS-DK. Their mean score was 88 (SD = 11.42; scale maximum 115). Scores were approximately normally distributed. Measurement error and test-retest score differences were acceptable, apart from a few extreme outliers. However, the confirmatory factor analysis did not replicate the original three-factor model and neither could a one-dimensional structure be confirmed. GRAS is already in use, however we advise that use of GRAS-DK for effect measurements and group comparison awaits further review and validation studies. Our negative finding might be explained by a weak conceptualisation of personal reflection.
Bardid, Farid; Huyben, Floris; Lenoir, Matthieu; Seghers, Jan; De Martelaer, Kristine; Goodway, Jacqueline D; Deconinck, Frederik J A
2016-06-01
This study aimed to understand the fundamental motor skills (FMS) of Belgian children using the process-oriented Test of Gross Motor Development, Second Edition (TGMD-2) and to investigate the suitability of using the United States (USA) test norms in Belgium. FMS were assessed using the TGMD-2. Gender, age and motor performance were examined in 1614 Belgian children aged 3-8 years (52.1% boys) and compared with the US reference sample. More proficient FMS performance was found with increasing age, from 3 to 6 years for locomotor skills and 3 to 7 years for object control skills. Gender differences were observed in object control skills, with boys performing better than girls. In general, Belgian children had lower levels of motor competence than the US reference sample, specifically for object control skills. The score distribution of the Belgian sample was skewed, with 37.4% scoring below average and only 6.9% scoring above average. This study supported the usefulness of the TGMD-2 as a process-oriented instrument to measure gross motor development in early childhood in Belgium. However, it also demonstrated that caution is warranted when using the US reference norms. ©2016 Foundation Acta Paediatrica. Published by John Wiley & Sons Ltd.
The minimal important difference of exercise tests in severe COPD
Puhan, M.A.; Chandra, D.; Mosenifar, Z.; Ries, A.; Make, B.; Hansel, N.N.; Wise, R.A.; Sciurba, F.
2017-01-01
Our aim was to determine the minimal important difference (MID) for 6-min walk distance (6MWD) and maximal cycle exercise capacity (MCEC) in patients with severe chronic obstructive pulmonary disease (COPD). 1,218 patients enrolled in the National Emphysema Treatment Trial completed exercise tests before and after 4–6 weeks of pre-trial rehabilitation, and 6 months after randomisation to surgery or medical care. The St George’s Respiratory Questionnaire (domain and total scores) and University of California San Diego Shortness of Breath Questionnaire (total score) served as anchors for anchor-based MID estimates. In order to calculate distribution-based estimates, we used the standard error of measurement, Cohen’s effect size and the empirical rule effect size. Anchor-based estimates for the 6MWD were 18.9 m (95% CI 18.1–20.1 m), 24.2 m (95% CI 23.4–25.4 m), 24.6 m (95% CI 23.4–25.7 m) and 26.4 m (95% CI 25.4–27.4 m), which were similar to distribution-based MID estimates of 25.7, 26.8 and 30.6 m. For MCEC, anchor-based estimates for the MID were 2.2 W (95% CI 2.0–2.4 W), 3.2 W (95% CI 3.0–3.4 W), 3.2 W (95% CI 3.0–3.4 W) and 3.3 W (95% CI 3.0–3.5 W), while distribution-based estimates were 5.3 and 5.5 W. We suggest a MID of 26±2 m for 6MWD and 4±1 W for MCEC for patients with severe COPD. PMID:20693247
Adjorlolo, Samuel
2018-06-01
The sociocultural differences between Western and sub-Saharan African countries make it imperative to standardize neuropsychological tests in the latter. However, Western-normed tests are frequently administered in sub-Saharan Africa because of challenges hampering standardization efforts. Yet a salient topical issue in the cross-cultural neuropsychology literature relates to the utility of Western-normed neuropsychological tests in minority groups, non-Caucasians, and by extension Ghanaians. Consequently, this study investigates the diagnostic accuracy, sensitivity, and specificity of executive function (EF) tests (The Stroop Test, Trail Making Test, and Controlled Oral Word Association Test), and a Revised Quick Cognitive Screening Test (RQCST) in a sample of 50 patients diagnosed with moderate traumatic brain injury and 50 healthy controls in Ghana. The EF test scores showed good diagnostic accuracy, with area under the curve (AUC) values of the Trail Making Test scores ranging from .746 to .902. With respect to the Stroop Test scores, the AUC values ranged from .793 to .898, while Controlled Oral Word Association Test had AUC value of .787. The RQCST scores discriminated between the groups, with AUC values ranging from .674 to .912. The AUC values of composite EF score and a neuropsychological score created from EF and RQCST scores were .936 and. 942, respectively. Additionally, the Stroop Test, Trail Making Test, EF composite score, and RQCST scores showed good to excellent sensitivities and specificities. In general, this study has shown that commonly used EF tests in Western countries have diagnostic accuracy, sensitivity, and specificity when administered in Ghanaian samples. The findings and implications of the study are discussed.
Spörlein, Christoph; Schlueter, Elmar
2018-01-01
Here we examine a conceptualization of immigrant assimilation that is based on the more general notion that distributional differences erode across generations. We explore this idea by reinvestigating the efficiency-equality trade-off hypothesis, which posits that stratified education systems educate students more efficiently at the cost of increasing inequality in overall levels of competence. In the context of ethnic inequality in math achievement, this study explores the extent to which an education system's characteristics are associated with ethnic inequality in terms of both the group means and group variances in achievement. Based on data from the 2012 PISA and mixed-effect location scale models, our analyses revealed two effects: on average, minority students had lower math scores than majority students, and minority students' scores were more concentrated at the lower end of the distribution. However, the ethnic inequality in the distribution of scores declined across generations. We did not find compelling evidence that stratified education systems increase mean differences in competency between minority and majority students. However, our analyses revealed that in countries with early educational tracking, minority students' math scores tended to cluster at the lower end of the distribution, regardless of compositional and school differences between majority and minority students.
Spörlein, Christoph
2018-01-01
Here we examine a conceptualization of immigrant assimilation that is based on the more general notion that distributional differences erode across generations. We explore this idea by reinvestigating the efficiency-equality trade-off hypothesis, which posits that stratified education systems educate students more efficiently at the cost of increasing inequality in overall levels of competence. In the context of ethnic inequality in math achievement, this study explores the extent to which an education system’s characteristics are associated with ethnic inequality in terms of both the group means and group variances in achievement. Based on data from the 2012 PISA and mixed-effect location scale models, our analyses revealed two effects: on average, minority students had lower math scores than majority students, and minority students’ scores were more concentrated at the lower end of the distribution. However, the ethnic inequality in the distribution of scores declined across generations. We did not find compelling evidence that stratified education systems increase mean differences in competency between minority and majority students. However, our analyses revealed that in countries with early educational tracking, minority students’ math scores tended to cluster at the lower end of the distribution, regardless of compositional and school differences between majority and minority students. PMID:29494677
Universality in the distance between two teams in a football tournament
NASA Astrophysics Data System (ADS)
da Silva, Roberto; Dahmen, Silvio R.
2014-03-01
Is football (soccer) a universal sport? Beyond the question of geographical distribution, where the answer is most certainly yes, when looked at from a mathematical viewpoint the scoring process during a match can be thought of, in a first approximation, as being modeled by a Poisson distribution. Recently, it was shown that the scoring of real tournaments can be reproduced by means of an agent-based model (da Silva et al. (2013) [24]) based on two simple hypotheses: (i) the ability of a team to win a match is given by the rate of a Poisson distribution that governs its scoring during a match; and (ii) such ability evolves over time according to results of previous matches. In this article we are interested in the question of whether the time series represented by the scores of teams have universal properties. For this purpose we define a distance between two teams as the square root of the sum of squares of the score differences between teams over all rounds in a double-round-robin-system and study how this distance evolves over time. Our results suggest a universal distance distribution of tournaments of different major leagues which is better characterized by an exponentially modified Gaussian (EMG). This result is corroborated by our agent-based model.
2013-01-01
Background Asthma is becoming increasingly prevalent among children in China. Poor parent knowledge and attitudes often contribute to inappropriate management practices, leading to deficiencies in the care process. We aimed to document the knowledge, attitudes and practices (KAP) of parents of children with asthma and analyze how knowledge and attitudes relate to practices. Our secondary objective was to identify the factors associated with parent KAP scores. Methods A KAP questionnaire was distributed to parents caring for 2960 children (0–14 years) diagnosed with asthma for at least 3 months from China’s 29 provinces. A 50-item questionnaire was devised for this cross-sectional survey based on a comprehensive review of the subject. Questionnaires were scored on 30 items regarding parent asthma-related KAP, with one point for every correct response and a possible range of 0–13 for knowledge, 0–7 for attitudes and 0–10 for practices. Higher scores indicated better KAP. Chi-squared tests and logistic regression were used to identify factors associated with practices and combined KAP scores. Results The response rate was 83.95% (2485/2960). Only 18.31% (455/2485) of parents correctly answered ≥ 60% of the knowledge questions (mean = 5.69). Most (89.85%; 2226/2485) gave positive responses to ≥ 60% of the attitude questions (mean = 5.23) while 67.89% (1687/2485) correctly answered ≥ 60% of the practices questions (mean = 6.19). Knowledge and attitudes were positively associated with pulmonary function testing, regular physician visits, monitoring with a peak flow meter and the Children’s Asthma Control Test questionnaire, avoidance of asthma triggers, using an inhaled β2 receptor agonist and adherence to medication regimen (p ≤ 0.05). Attitudes were also associated with allergen testing. In logistic regression analysis, high KAP scores (dichotomized by a cut-off score of 18) were positively associated with food allergy, rhinitis, physician visits, frequency of visits and parent education (p < 0.05, OR > 1). Conclusions Generally, the parents’ KAP were poor. A gap between recommended and actual practice was observed, which may be related to inadequate knowledge about and poor attitudes toward childhood asthma. Improving knowledge and attitudes may encourage better practices among parents of children with asthma. PMID:23379859
Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions
ERIC Educational Resources Information Center
Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J.
2010-01-01
Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…
2007-01-01
Background The US Food and Drug Administration approved the Charité artificial disc on October 26, 2004. This approval was based on an extensive analysis and review process; 20 years of disc usage worldwide; and the results of a prospective, randomized, controlled clinical trial that compared lumbar artificial disc replacement to fusion. The results of the investigational device exemption (IDE) study led to a conclusion that clinical outcomes following lumbar arthroplasty were at least as good as outcomes from fusion. Methods The author performed a new analysis of the Visual Analog Scale pain scores and the Oswestry Disability Index scores from the Charité artificial disc IDE study and used a nonparametric statistical test, because observed data distributions were not normal. The analysis included all of the enrolled subjects in both the nonrandomized and randomized phases of the study. Results Subjects from both the treatment and control groups improved from the baseline situation (P < .001) at all follow-up times (6 weeks to 24 months). Additionally, these pain and disability levels with artificial disc replacement were superior (P < .05) to the fusion treatment at all follow-up times including 2 years. Conclusions The a priori statistical plan for an IDE study may not adequately address the final distribution of the data. Therefore, statistical analyses more appropriate to the distribution may be necessary to develop meaningful statistical conclusions from the study. A nonparametric statistical analysis of the Charité artificial disc IDE outcomes scores demonstrates superiority for lumbar arthroplasty versus fusion at all follow-up time points to 24 months. PMID:25802574
Geostatistical interpolation of available copper in orchard soil as influenced by planting duration.
Fu, Chuancheng; Zhang, Haibo; Tu, Chen; Li, Lianzhen; Luo, Yongming
2018-01-01
Mapping the spatial distribution of available copper (A-Cu) in orchard soils is important in agriculture and environmental management. However, data on the distribution of A-Cu in orchard soils is usually highly variable and severely skewed due to the continuous input of fungicides. In this study, ordinary kriging combined with planting duration (OK_PD) is proposed as a method for improving the interpolation of soil A-Cu. Four normal distribution transformation methods, namely, the Box-Cox, Johnson, rank order, and normal score methods, were utilized prior to interpolation. A total of 317 soil samples were collected in the orchards of the Northeast Jiaodong Peninsula. Moreover, 1472 orchards were investigated to obtain a map of planting duration using Voronoi tessellations. The soil A-Cu content ranged from 0.09 to 106.05 with a mean of 18.10 mg kg -1 , reflecting the high availability of Cu in the soils. Soil A-Cu concentrations exhibited a moderate spatial dependency and increased significantly with increasing planting duration. All the normal transformation methods successfully decreased the skewness and kurtosis of the soil A-Cu and the associated residuals, and also computed more robust variograms. OK_PD could generate better spatial prediction accuracy than ordinary kriging (OK) for all transformation methods tested, and it also provided a more detailed map of soil A-Cu. Normal score transformation produced satisfactory accuracy and showed an advantage in ameliorating smoothing effect derived from the interpolation methods. Thus, normal score transformation prior to kriging combined with planting duration (NSOK_PD) is recommended for the interpolation of soil A-Cu in this area.
A comparison of KABCO and AIS injury severity metrics using CODES linked data.
Burch, Cynthia; Cook, Lawrence; Dischinger, Patricia
2014-01-01
The research objective is to compare the consistency of distributions between crash assigned (KABCO) and hospital assigned (Abbreviated Injury Scale, AIS) injury severity scoring systems for 2 states. The hypothesis is that AIS scores will be more consistent between the 2 studied states (Maryland and Utah) than KABCO. The analysis involved Crash Outcome Data Evaluation System (CODES) data from 2 states, Maryland and Utah, for years 2006-2008. Crash report and hospital inpatient data were linked probabilistically and International Classification of Diseases (CMS 2013) codes from hospital records were translated into AIS codes. KABCO scores from police crash reports were compared to those AIS scores within and between the 2 study states. Maryland appears to have the more severe crash report KABCO scoring for injured crash participants, with close to 50 percent of all injured persons being coded as a level B or worse, and Utah observes approximately 40 percent in this group. When analyzing AIS scores, some fluctuation was seen within states over time, but the distribution of MAIS is much more comparable between states. Maryland had approximately 85 percent of hospitalized injured cases coded as MAIS = 1 or minor. In Utah this percentage was close to 80 percent for all 3 years. This is quite different from the KABCO distributions, where Maryland had a smaller percentage of cases in the lowest injury severity category as compared to Utah. This analysis examines the distribution of 2 injury severity metrics different in both design and collection and found that both classifications are consistent within each state from 2006 to 2008. However, the distribution of both KABCO and Maximum Abbreviated Injury Scale (MAIS) varies between the states. MAIS was found to be more consistent between states than KABCO.
ERIC Educational Resources Information Center
Dorans, Neil J.
2002-01-01
The history of SAT® score scales is summarized, and the need for realigning SAT score scales is demonstrated. The process employed to produce the conversions that take scores from the original SAT scales to recentered scales in which reference group scores are centered near the midpoint of the score-reporting range is laid out. For the purposes of…
Reitan, Ralph M; Wolfson, Deborah
2004-03-01
This study explores the use of the Progressive Figures Test as an instrument for broad initial screening of children in the 6- through 8-year age range with respect to the possible need for more definitive neuropsychological evaluation. Considering earlier results obtained in comparison of brain-damaged and control children [Clinical Neuropsychology: Current Applications, Hemisphere Publishing Corp., Washington, DC, 1974, p. 53; Proceedings of the Conference on Minimal Brain Dysfunction, New York Academy of Sciences, New York, 1973, p. 65], the Progressive Figures Test seemed potentially useful as a first step in determining whether a comprehensive neuropsychological evaluation is indicated. In this investigation, three groups were studied: (1) children with definitive evidence of brain damage or disease who, when compared with normal controls, help to establish the limits of neuropsychological functioning, (2) a group of children who had normal neurological examinations but also had academic problems of significant concern to both parents and teachers, and (3) a normal control group. Statistically significant differences were present in comparing each pair of groups, with the brain-damaged children performing most poorly and the controls performing best. Score distributions for the three groups make it possible to identify a score-range that represented a borderline or "gray" area and to suggest a cutting score that identified children whose academic problems might have a neurological basis and for whom additional neuropsychological evaluation appeared to be indicated.
Labeau, S; Vandijck, D; Rello, J; Adam, S; Rosa, A; Wenisch, C; Bäckman, C; Agbaht, K; Csomos, A; Seha, M; Dimopoulos, G; Vandewoude, K H; Blot, S
2008-10-01
As part of a needs analysis preceding the development of an e-learning platform on infection prevention, European intensive care unit (ICU) nurses were subjected to a knowledge test on evidence-based guidelines for preventing ventilator-associated pneumonia (VAP). A validated multiple-choice questionnaire was distributed to 22 European countries between October 2006 and March 2007. Demographics included nationality, gender, ICU experience, number of ICU beds and acquisition of a specialised degree in intensive care. We collected 3329 questionnaires (response rate 69.1%). The average score was 45.1%. Fifty-five percent of respondents knew that the oral route is recommended for intubation; 35% knew that ventilator circuits should be changed for each new patient; 38% knew that heat and moisture exchangers were the recommended humidifier type, but only 21% knew that these should be changed once weekly; closed suctioning systems were recommended by 46%, and 18% knew that these must be changed for each new patient only; 51% and 57%, respectively, recognised that subglottic drainage and kinetic beds reduce VAP incidence. Most (85%) knew that semi-recumbent positioning prevents VAP. Professional seniority and number of ICU beds were shown to be independently associated with better test scores. Further research may determine whether low scores are related to a lack of knowledge, deficiencies in training, differences in what is regarded as good practice, and/or a lack of consistent policy.
Arias, Ana; Scott, Raymond; Peters, Ove A; McClain, Elizabeth; Gluskin, Alan H
2016-04-01
The aim of this prospective quantitative study was to compare the effect of different instructional formats on dental students' skills and knowledge acquisition for access cavity preparation. All first-year dental students were invited to participate in this study conducted during the four consecutive two-week endodontic rotation courses at the University of the Pacific Arthur A. Dugoni School of Dentistry in spring semester 2015. Four alphabetically distributed intact groups of students were randomly allocated to two groups (n=70 each) that participated in either small-group discussion or a traditional lecture on access preparation. The first outcome measure was skill acquisition, measured by the quality of access cavities prepared in extracted teeth at the conclusion of the session. Two blinded raters scored direct observations on a continuous scale. Knowledge, the second outcome measure, was scored with a multiple-choice and open-ended question test at the end of each two-week session. Data were obtained for 134 of the 140 students, for a 96% response rate. The results showed that students in the small-group discussion groups scored significantly higher than those in the lecture groups when skill performance was tested (p=8.9 × 10(-7)). However, no significant differences were found in the acquisition of knowledge between the two groups on the written test. Active student participation was significantly related to improved manual skill acquisition, but the format of the session does not seem to have had a direct influence on acquired knowledge.
Protocadherin α (PCDHA) as a novel susceptibility gene for autism
Anitha, Ayyappan; Thanseem, Ismail; Nakamura, Kazuhiko; Yamada, Kazuo; Iwayama, Yoshimi; Toyota, Tomoko; Iwata, Yasuhide; Suzuki, Katsuaki; Sugiyama, Toshiro; Tsujii, Masatsugu; Yoshikawa, Takeo; Mori, Norio
2013-01-01
Background Synaptic dysfunction has been shown to be involved in the pathogenesis of autism. We hypothesized that the protocadherin α gene cluster (PCDHA), which is involved in synaptic specificity and in serotonergic innervation of the brain, could be a suitable candidate gene for autism. Methods We examined 14 PCDHA single nucleotide polymorphisms (SNPs) for genetic association with autism in DNA samples of 3211 individuals (841 families, including 574 multiplex families) obtained from the Autism Genetic Resource Exchange. Results Five SNPs (rs251379, rs1119032, rs17119271, rs155806 and rs17119346) showed significant associations with autism. The strongest association (p < 0.001) was observed for rs1119032 (z score of risk allele G = 3.415) in multiplex families; SNP associations withstand multiple testing correction in multiplex families (p = 0.041). Haplotypes involving rs1119032 showed very strong associations with autism, withstanding multiple testing corrections. In quantitative transmission disequilibrium testing of multiplex families, the G allele of rs1119032 showed a significant association (p = 0.033) with scores on the Autism Diagnostic Interview–Revised (ADI-R)_D (early developmental abnormalities). We also found a significant difference in the distribution of ADI-R_A (social interaction) scores between the A/A, A/G and G/G genotypes of rs17119346 (p = 0.002). Limitations Our results should be replicated in an independent population and/or in samples of different racial backgrounds. Conclusion Our study provides strong genetic evidence of PCDHA as a potential candidate gene for autism. PMID:23031252
Løchting, Ida; Grotle, Margreth; Storheim, Kjersti; Werner, Erik L; Garratt, Andrew M
2014-09-01
To evaluate the reliability and validity of the improved version of the Patient Generated Index (PGI) in patients with low back pain. The PGI was administered to 90 patients attending care in 1 of 6 institutions in Norway and evaluated for reliability and validity. The questionnaire was given out to 61 patients for re-test purposes. The PGI was completed correctly by 80 (88.9%) patients and, of the 61 patients responding to the re-test, 50 (82.0%) completed both surveys correctly. PGI scores were approximately normally distributed, with a median of 40 (range 80), where 100 is the best possible quality of life. There were no floor or ceiling effects. The 5 most frequently listed areas affecting quality of life were pain, sleep, stiffness, socializing and housework. The test-retest intraclass correlation coefficient was 0.73. The smallest detectable changes for individual and group purposes were 32.8 and 4.6, respectively. The correlations between PGI scores and other instrument scores followed a priori hypotheses of low to moderate correlations. The PGI has evidence for reliability and validity in Norwegian patients with low back pain at the group level and may be considered for application in intervention studies when a comprehensive evaluation of quality of life is important. However, the smallest detectable change, of approximately 30 points, may be considered too large for individual purposes in clinical applications.
Jaiprakash, Heethal; Min, Aung Ko Ko; Ghosh, Sarmishtha
2016-03-01
This paper is aimed at finding if there was a change of correlation between the written test score and tutors' performance test scores in the assessment of medical students during a problem-based learning (PBL) course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group's tutors did not receive tutor training; while the second group's tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors' performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors' scores in group 1 was 0.099 (p<0.001) and for group 2 was 0.305 (p<0.001). The higher correlation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course.
Road March Performance of Special Operations Soldiers Carrying Various Loads and Load Distributions
1993-01-01
groups were used (Ramos and Knaplk, 1979; Knapik et al,, 1980 ; Hermansen et al., 1972), In the hand-grip test, the 7 soldier, in a seated position...Inventory (DIshman et al,, 1980 ). The POMS was a 65-item questionnaire which provided measures of six mood states, Soldiers scored each item on a five-point...estimates require individual calibration (Acheson et al., 1980 ) and heart rate can be influenced by a number of factors including training state (Saltin
Wang, Ying-Chih; Wickstrom, Rick; Yen, Sheng-Che; Kapellusch, Jay; Grogan, Kimberly A
2017-05-10
Cross-sectional study. The WorkAbility Rate of Manipulation Test (WRMT), an adaptation of the Minnesota Manual Dexterity Test (MMDT), contains a revised board and protocols to improve its utility for therapy or fitness assessment. To describe the development and preliminary psychometric properties of WRMT. Sixty-six healthy participants completed MMDT and WRMT in a random order followed by a user experience survey. We compared tests using repeated-measures analysis of variance, test-retest reliability, and examined agreement between tests. Despite the similarities of these 2 instruments, the different administration protocols resulted in statistically different score distributions (P < .001). Results supported good test-retest reliability of WRMT (placing test ICC = 0.88-0.90 and turning test ICC = 0.68-0.82). The WRMT correlated moderately with MMDT (r = 0.81 in placing test and r = 0.44-0.57 in turning test). Bland-Altman plot showed that the differences in completion time were 3.8 seconds between placing tests and 19.6 (both hands), 0.3 (right hand), and 3.9 (left hand) seconds between turning tests. Overall, participants felt that the instruction of WRMT was easier to follow (44%) and preferred its setup, color, and depth of the test board (49%). Time required to complete 1 panel of 20 disks correlated highly with the time needed to finish a complete trial of 60 disks in both MMDT (r = 0.91-0.97) and WRMT (r = 0.88-0.95). Caution is warranted in comparing scores from these 2 test variants. 3b. Copyright © 2017 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Confidence Intervals for Weighted Composite Scores under the Compound Binomial Error Model
ERIC Educational Resources Information Center
Kim, Kyung Yong; Lee, Won-Chan
2018-01-01
Reporting confidence intervals with test scores helps test users make important decisions about examinees by providing information about the precision of test scores. Although a variety of estimation procedures based on the binomial error model are available for computing intervals for test scores, these procedures assume that items are randomly…
Self-affirmation model for football goal distributions
NASA Astrophysics Data System (ADS)
Bittner, E.; Nußbaumer, A.; Janke, W.; Weigel, M.
2007-06-01
Analyzing football score data with statistical techniques, we investigate how the highly co-operative nature of the game is reflected in averaged properties such as the distributions of scored goals for the home and away teams. It turns out that in particular the tails of the distributions are not well described by independent Bernoulli trials, but rather well modeled by negative binomial or generalized extreme value distributions. To understand this behavior from first principles, we suggest to modify the Bernoulli random process to include a simple component of self-affirmation which seems to describe the data surprisingly well and allows to interpret the observed deviation from Gaussian statistics. The phenomenological distributions used before can be understood as special cases within this framework. We analyzed historical football score data from many leagues in Europe as well as from international tournaments and found the proposed models to be applicable rather universally. In particular, here we compare men's and women's leagues and the separate German leagues during the cold war times and find some remarkable differences.
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Utah
ERIC Educational Resources Information Center
Center on Education Policy, 2010
2010-01-01
This paper profiles Utah's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 8 reading. In grade 4 reading, the percentage scoring proficient on the state test showed a…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Washington
ERIC Educational Resources Information Center
Center on Education Policy, 2010
2010-01-01
This paper profiles Washington's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) decreased in grade 4 reading. In grade 4 math, the percentage scoring proficient on the state test decreased…
ERIC Educational Resources Information Center
Goldhaber, Dan; Gratz, Trevor; Theobald, Roddy
2016-01-01
We investigate the predictive validity of teacher credential test scores for student performance in secondary STEM classrooms in Washington state. After replicating earlier findings that teacher basic skills licensure test scores are a modest and statistically significant predictor of student math test score gains in elementary grades, we focus on…
Dobson, Cassandra
2015-11-01
The aim of this study was to describe self-efficacy as a theoretical component of behavior change in various therapeutic treatments such as the management of SCD pain. The participants were prepared to self-initiate the GI for 5 to 10 minutes three times each day regardless of pain and also during each pain episode. As part of the GI training a tape or CD with guided imagery messages was provided. Participants were monitored for 4 weeks pre and 4 weeks post intervention (GI training). Children kept a daily record of pain episodes. During this time, children continued to record as before in their personal study diary: pain episodes (intensity and treatment), school attendance, and also the frequency of GI use. At the conclusion of this 4-week period, usual pain patterns (PAT), visual imagery ability (KIAQ), and disease specific self-efficacy scale were measured again. The Sickle Cell Self-Efficacy Scale (SCSES) is a new nine-item scale measuring disease-specific perceptions of self-efficacy. The instrument's developers established internal consistency by Cronbach's alpha of 0.89. H1: Children with SCD who are trained in guided imagery will have greater disease-specific self-efficacy following the training than they had prior to learning guided imagery; the hypothesis was tested and supported using t-tests of mean interval-level scores on the SCSES. Eighteen children had positive gained scores and sixteen children raised their scores more than one standard deviation above the mean score for this sample distribution. Greater self-efficacy scores are associated with better physical and psychological functioning. Copyright © 2015 Elsevier Inc. All rights reserved.
Lai, Jin-Shei; Bregman, Corey; Zelko, Frank; Nowinski, Cindy; Cella, David; Beaumont, Jennifer J; Goldman, Stewart
2017-09-01
Cognitive dysfunction is a major concern for children with brain tumors. A valid, user-friendly screening tool could facilitate prompt referral for comprehensive neuropsychological assessments and therefore early intervention. Applications of the pediatric perceived cognitive function item bank (pedsPCF) such as computerized adaptive testing can potentially serve as such a tool given its brevity and user-friendly nature. This study aimed to evaluate whether pedsPCF was a valid indicator of cerebral compromise using the criterion of structural brain changes indicated by leukoencephalopathy grades. Data from 99 children (mean age = 12.6 years) with brain tumors and their parents were analyzed. Average time since diagnosis was 5.8 years; time since last treatment was 4.3 years. Leukoencephalopathy grade (range 0-4) was based on white matter damage and degree of deep white matter volume loss shown on MRI. Parents of patients completed the pedsPCF. Scores were based on the US general population-based T-score metric (mean = 50; SD = 10). Higher scores reflect better function. Leukoencephalopathy grade distributions were as follows: 36 grade 0, 27 grade 1, 22 grade 2, 13 grade 3, and 1 grade 4. The mean pedsPCF T-score was 48.3 (SD = 8.3; range 30.5-63.7). The pedsPCF scores significantly discriminated patients with different leukoencephalopathy grades, F = 4.14, p = 0.0084. Effect sizes ranged from 0.09 (grade 0 vs. 1) to 1.22 (grade 0 vs. 3/4). This study demonstrates that the pedsPCF is a valid indicator of leukoencephalopathy and provides support for its use as a screening tool for more comprehensive neurocognitive testing.
Cross-cultural validity of a dietary questionnaire for studies of dental caries risk in Japanese
2014-01-01
Background Diet is a major modifiable contributing factor in the etiology of dental caries. The purpose of this paper is to examine the reliability and cross-cultural validity of the Japanese version of the Food Frequency Questionnaire to assess dietary intake in relation to dental caries risk in Japanese. Methods The 38-item Food Frequency Questionnaire, in which Japanese food items were added to increase content validity, was translated into Japanese, and administered to two samples. The first sample comprised 355 pregnant women with mean age of 29.2 ± 4.2 years for the internal consistency and criterion validity analyses. Factor analysis (principal components with Varimax rotation) was used to determine dimensionality. The dietary cariogenicity score was calculated from the Food Frequency Questionnaire and used for the analyses. Salivary mutans streptococci level was used as a semi-quantitative assessment of dental caries risk and measured by Dentocult SM. Dentocult SM scores were compared with the dietary cariogenicity score computed from the Food Frequency Questionnaire to examine criterion validity, and assessed by Spearman’s correlation coefficient (rs) and Kruskal-Wallis test. Test-retest reliability of the Food Frequency Questionnaire was assessed with a second sample of 25 adults with mean age of 34.0 ± 3.0 years by using the intraclass correlation coefficient analysis. Results The Japanese language version of the Food Frequency Questionnaire showed high test-retest reliability (ICC = 0.70) and good criterion validity assessed by relationship with salivary mutans streptococci levels (rs = 0.22; p < 0.001). Factor analysis revealed four subscales that construct the questionnaire (solid sugars, solid and starchy sugars, liquid and semisolid sugars, sticky and slowly dissolving sugars). Internal consistency were low to acceptable (Cronbach’s alpha = 0.67 for the total scale, 0.46-0.61 for each subscale). Mean dietary cariogenicity scores were 50.8 ± 19.5 in the first sample, 47.4 ± 14.1, and 40.6 ± 11.3 for the first and second administrations in the second sample. The distribution of Dentocult SM score was 6.8% (score = 0), 34.4% (score = 1), 39.4% (score = 2), and 19.4% (score = 3). Participants with higher scores were more likely to have higher dietary cariogenicity scores (p < 0.001; Kruskal-Wallis test). Conclusions These results provide the preliminary evidence for the reliability and validity of the Japanese language Food Frequency Questionnaire. PMID:24383547
ERIC Educational Resources Information Center
Feldt, Leonard S.
2004-01-01
In some settings, the validity of a battery composite or a test score is enhanced by weighting some parts or items more heavily than others in the total score. This article describes methods of estimating the total score reliability coefficient when differential weights are used with items or parts.
KAVEH, MOHAMMAD HOSSIEN; MORADI, LEILA; GHAHREMANI, LEILA; TABATABAEE, HAMID REZA
2014-01-01
Introduction: One of the main determinants of adolescents’ life satisfaction is parenting skills. Due to the lack of educational trials in this field, this research was done to evaluate the effect of a parenting education program on girls’ life satisfaction in governmental guidance schools of Shiraz. Methods: This study is an educational randomized controlled trial. At first, 152 female students in 2nd grade of governmental guidance schools and 304 parents (152 mother and 152 father) were selected by multistage random cluster sampling method. Then, they were categorized into experimental and control groups. Before and after the intervention, data were collected from two groups using multidimensional students’ life satisfaction scale with stability (Cronbach's alpha=0.89), test–retest and correlation coefficient (r=0.70). Educational intervention for parents was performed in the experimental group through presentations with question and answer, discussion in small groups and distribution of educational booklets in 5 volumes. Finally, the data were analyzed using SPSS 14 and through Mann-Whitney test, Chi-square test, Fisher’s Exact test, Wilcoxon test. Results: Before the intervention, the experimental and control groups did not show a statistically significant difference based on the demographic variables. Thetotal of life satisfaction scores and also its subscales in the experimental and controlgroup, before and six weeks afterthe educational interventiondid showstatisticallysignificant difference (p<0.001). The scores of differences (pre-test/post-test) in total life satisfaction between the experimental and control groups were statistically significant difference (p<0.001). Conclusion: According to low scores of the students in the pre-test, especially in the control group which didn’t undergo any educational program, holding scheduled educational intervention is necessary. This study not only supports the effectiveness of educational intervention but also recommends further educational research to develop knowledge regarding patterns of parenting education. PMID:25512913
Leroy, Vincent; Sturm, Nathalie; Faure, Patrice; Trocme, Candice; Marlu, Alice; Hilleret, Marie-Noëlle; Morel, Françoise; Zarski, Jean-Pierre
2014-07-01
Fibrosis blood tests have been validated in chronic hepatitis C. Their diagnostic accuracy is less documented in hepatitis B. The aim of this study was to describe the diagnostic performance of FibroTest®, FibroMeter®, and HepaScore® for liver fibrosis in hepatitis B compared to hepatitis C. 510 patients mono-infected with hepatitis B or C and matched on fibrosis stage were included. Blood tests were performed the day of the liver biopsy. Histological lesions were staged according to METAVIR. Fibrosis stages were distributed as followed: F0 n=76, F1 n=192, F2 n=132, F3 n=54, F4 n=56. Overall diagnostic performance of blood tests were similar between hepatitis B and C with AUROC ranging from 0.75 to 0.84 for significant fibrosis, 0.82 to 0.85 for extensive fibrosis and 0.84 to 0.87 for cirrhosis. Optimal cut-offs were consistently lower in hepatitis B compared to hepatitis C, especially for the diagnosis of extensive fibrosis and cirrhosis, with decreased sensitivity and negative predictive values. More hepatitis B than C patients with F ⩾3 were underestimated: FibroTest®: 47% vs. 26%, FibroMeter®: 24% vs. 6%, HepaScore®: 41% vs. 24%, p<0.01. Multivariate analysis showed that hepatitis B (0R 3.4, 95% CI 1.2-19.2, p<0.02) and low γGT (OR 7.3, 95% CI 2.0-27.0, p<0.003) were associated with fibrosis underestimation. Overall the diagnostic performance of blood tests is similar in hepatitis B and C. The risk of underestimating significant fibrosis and cirrhosis is however greater in hepatitis B and cannot be entirely corrected by the use of more stringent cut-offs. Copyright © 2014 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.
Nguyen, Shon; Ramos, Artur; Chang, Joy; Li, Bin; Shanmugam, Vedapuri; Boeras, Debrah; Nkengasong, John N; Yang, Chunfu; Ellenberger, Dennis
2015-04-01
HIV-1 viral load (VL) levels are used for monitoring disease progression and antiretroviral therapy outcomes in HIV-infected patients. To assess the performance of laboratories conducting HIV-1 VL testing in resource-limited settings, the U.S. Centers for Disease Control and Prevention implemented a voluntary, free-of-charge, external quality assurance program using dried tube specimens (DTSs). Between 2010 and 2012, DTS proficiency testing (PT) panels consisting of 5 specimens were distributed at ambient temperature to participants. The results from the participants (n≥6) using the same assay were grouped, analyzed, and graded as acceptable within a group mean±3 standard deviations. Mean proficiency scores were calculated by dividing the combined PT scores by the number of testing cycles using a linear regression model. Between 2010 and 2012, the number of participants enrolled increased from 32 in 16 countries to 114 in 44 countries. A total of 78.2% of the participants reported results using 10 different VL assays. The rates of reporting of acceptable results by the participants were 96.6% for the Abbott assay, 96.3% for the Roche Cobas assay, 94.5% for the Roche Amplicor assay, 93.0% for the Biocentric assay, and 89.3% for the NucliSens assay. The overall mean proficiency scores improved over time (P=0.024). DTSs are a good alternative specimen type to plasma specimens for VL PT programs, as they do not require cold chain transportation and can be used on PCR-based assays. Our data suggest that the CDC HIV-1 VL PT program using DTSs positively impacts the testing performance of the participants, which might translate into better and more accurate VL testing services for patients. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
Comparison of two scores for allocating resources to doctors in deprived areas.
Hutchinson, A; Foy, C; Sandhu, B
1989-11-04
Current proposals in the general practitioner contract include additional payments to doctors working among deprived populations. The underprivileged area score will be used to identify local authority wards with the greatest levels of deprivation, thus acting as the basis for distributing considerable resources. Two methods of identifying deprived populations--the underprivileged area score and the material deprivation score--were compared to determine whether they result in similar allocation of resources to regions. Financial allocations to regions based on figures derived from the contract differed considerably if the material deprivation score was used instead of the underprivileged area score: Northern and Mersey regions gained over 50% of their allocation whereas East Anglia, Oxford, and South West Thames regions lost more than 30% of theirs. Such differences have considerable implications for doctors working among deprived populations as up to 60m pounds each year might be distributed by these payments.
Pearman, Timothy; Yanez, Betina; Peipert, John; Wortman, Katy; Beaumont, Jennifer; Cella, David
2014-09-15
Health-related quality of life (HRQOL) measures are commonly used in oncology research. Interest in their use for monitoring or screening is increasing. The Functional Assessment of Cancer Therapy (FACT) is one of the most widely used HRQOL instruments. Consequently, oncology researchers and practitioners have an increasing need for reference values for the Functional Assessment of Cancer Therapy-General (FACT-G) and its 7-item rapid version, the Functional Assessment of Cancer Therapy-General 7 (FACT-G7), to compare FACT scores across specific subgroups of patients in research trials and practice. The objectives of this study are to provide 1) reference values from a sample of the general US adult population and a sample of adults diagnosed with cancer and 2) cutoff scores for quality of life. A sample of the general US population (N = 1075) and a sample of patients with cancer from 12 studies (N = 5065) were analyzed. Cutoff scores were established using distribution- and anchor-based methods. Mean values for the cancer sample were analyzed by performance status, cancer type, and disease status. Also, t tests and established criteria for meaningful differences were used to compare values. FACT-G and FACT-G7 scores in the general US population sample and cancer sample were generally comparable. Among the sample of patients with cancer, FACT-G and FACT-G7 scores worsened with declining performance status and increasing disease status. These data will aid interpretation of the magnitude and meaning of FACT scores, and allow for comparisons of scores across studies. © 2014 American Cancer Society.
Anastario, Michael P; Rodriguez, Hector P; Gallagher, Patricia M; Cleary, Paul D; Shaller, Dale; Rogers, William H; Bogen, Karen; Safran, Dana Gelb
2010-01-01
Objective To assess the effect of survey distribution protocol (mail versus handout) on data quality and measurement of patient care experiences. Data Sources/Study Setting Multisite randomized trial of survey distribution protocols. Analytic sample included 2,477 patients of 15 clinicians at three practice sites in New York State. Data Collection/Extraction Methods Mail and handout distribution modes were alternated weekly at each site for 6 weeks. Principal Findings Handout protocols yielded an incomplete distribution rate (74 percent) and lower overall response rates (40 percent versus 58 percent) compared with mail. Handout distribution rates decreased over time and resulted in more favorable survey scores compared with mailed surveys. There were significant mode–physician interaction effects, indicating that data cannot simply be pooled and adjusted for mode. Conclusions In-office survey distribution has the potential to bias measurement and comparison of physicians and sites on patient care experiences. Incomplete distribution rates observed in-office, together with between-office differences in distribution rates and declining rates over time suggest staff may be burdened by the process and selective in their choice of patients. Further testing with a larger physician and site sample is important to definitively establish the potential role for in-office distribution in obtaining reliable, valid assessment of patient care experiences. PMID:20579126
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP
ERIC Educational Resources Information Center
Chudowsky, Naomi; Chudowsky, Victor
2010-01-01
In recent years, scores on the annual state reading and mathematics tests used for accountability have gone up in most states. These trends in state test scores do not always coincide, however, with trends on the National Assessment of Educational Progress (NAEP), the federally sponsored assessment that is administered periodically to…
ERIC Educational Resources Information Center
Doppelt, Jerome E.
1956-01-01
The standard error of measurement as a means for estimating the margin of error that should be allowed for in test scores is discussed. The true score measures the performance that is characteristic of the person tested; the variations, plus and minus, around the true score describe a characteristic of the test. When the standard deviation is used…
ERIC Educational Resources Information Center
Bell, Michael L.; Roubinek, Darrell L.
1989-01-01
Compares fourth-graders' subtest scores on the Stanford Achievement Test (SAT), the Iowa Test of Basic Skills (ITBS), and the Metropolitan Achievement Test (MAT). Finds right-brain dominant students scored better on four SAT subtests, and left-brain dominant students scored better on four ITBS subtests and two MAT subtests. (NH)
Developing Test Score Reports that Work: The Process and Best Practices for Effective Communication
ERIC Educational Resources Information Center
Zenisky, April L.; Hambleton, Ronald K.
2012-01-01
Test scores matter these days. Test-takers want to understand how they performed, and test score reports, particularly those for individual examinees, are the vehicles by which most people get the bulk of this information. Historically, score reports have not always met the examinees' information or usability needs, but this is clearly changing…
Rubínová, Eva; Nikolai, Tomáš; Marková, Hana; Siffelová, Kamila; Laczó, Jan; Hort, Jakub; Vyhnálek, Martin
2014-01-01
The Clock Drawing Test is a frequently used cognitive screening test with several scoring systems in elderly populations. We compare simple and complex scoring systems and evaluate the usefulness of the combination of the Clock Drawing Test with the Mini-Mental State Examination to detect patients with mild cognitive impairment. Patients with amnestic mild cognitive impairment (n = 48) and age- and education-matched controls (n = 48) underwent neuropsychological examinations, including the Clock Drawing Test and the Mini-Mental State Examination. Clock drawings were scored by three blinded raters using one simple (6-point scale) and two complex (17- and 18-point scales) systems. The sensitivity and specificity of these scoring systems used alone and in combination with the Mini-Mental State Examination were determined. Complex scoring systems, but not the simple scoring system, were significant predictors of the amnestic mild cognitive impairment diagnosis in logistic regression analysis. At equal levels of sensitivity (87.5%), the Mini-Mental State Examination showed higher specificity (31.3%, compared with 12.5% for the 17-point Clock Drawing Test scoring scale). The combination of Clock Drawing Test and Mini-Mental State Examination scores increased the area under the curve (0.72; p < .001) and increased specificity (43.8%), but did not increase sensitivity, which remained high (85.4%). A simple 6-point scoring system for the Clock Drawing Test did not differentiate between healthy elderly and patients with amnestic mild cognitive impairment in our sample. Complex scoring systems were slightly more efficient, yet still were characterized by high rates of false-positive results. We found psychometric improvement using combined scores from the Mini-Mental State Examination and the Clock Drawing Test when complex scoring systems were used. The results of this study support the benefit of using combined scores from simple methods.
Li, Li; Xiong, De-fu; Liu, Jia-wen; Li, Zi-xin; Zeng, Guang-cheng; Li, Hua-liang
2014-03-01
We aimed to evaluate the interference of 50 Hz extremely low frequency electromagnetic field (ELF-EMF) occupational exposure on the neurobehavior tests of workers performing tour-inspection close to transformers and distribution power lines. Occupational short-term "spot" measurements were carried out. 310 inspection workers and 300 logistics staff were selected as exposure and control. The neurobehavior tests were performed through computer-based neurobehavior evaluation system, including mental arithmetic, curve coincide, simple visual reaction time, visual retention, auditory digit span and pursuit aiming. In 500 kV areas electric field intensity at 71.98% of total measured 590 spots were above 5 kV/m (national occupational standard), while in 220 kV areas electric field intensity at 15.69% of total 701 spots were above 5 kV/m. Magnetic field flux density at all the spots was below 1,000 μT (ICNIRP occupational standard). The neurobehavior score changes showed no statistical significance. Results of neurobehavior tests among different age, seniority groups showed no significant changes. Neurobehavior changes caused by daily repeated ELF-EMF exposure were not observed in the current study.
Gasquoine, Philip Gerard; Gonzalez, Cassandra Dayanira
2012-05-01
Conventional neuropsychological norms developed for monolinguals likely overestimate normal performance in bilinguals on language but not visual-perceptual format tests. This was studied by comparing neuropsychological false-positive rates using the 50th percentile of conventional norms and individual comparison standards (Picture Vocabulary or Matrix Reasoning scores) as estimates of preexisting neuropsychological skill level against the number expected from the normal distribution for a consecutive sample of 56 neurologically intact, bilingual, Hispanic Americans. Participants were tested in separate sessions in Spanish and English in the counterbalanced order on La Bateria Neuropsicologica and the original English language tests on which this battery was based. For language format measures, repeated-measures multivariate analysis of variance showed that individual estimates of preexisting skill level in English generated the mean number of false positives most approximate to that expected from the normal distribution, whereas the 50th percentile of conventional English language norms did the same for visual-perceptual format measures. When using conventional Spanish or English monolingual norms for language format neuropsychological measures with bilingual Hispanic Americans, individual estimates of preexisting skill level are recommended over the 50th percentile.
Gelesko, Savannah; Long, Leann; Faulk, Jan; Phillips, Ceib; Dicus, Carolyn; White, Raymond P.
2013-01-01
Purpose To assess the impact of cryotherapy or topical minocycline on patients’ perceptions of recovery from pain after third molar surgery in an exploratory comparative-effectiveness study. Patients and Methods Subjects aged at least 14 years who were having all 4 third molars removed were enrolled in 3 separate institutional review board–approved studies. Study groups included subjects treated with a passively applied cold wrap for 24 hours postoperatively, subjects treated with topical minocycline during surgery, and subjects enrolled in a nonconcurrent comparison group who had received neither topical minocycline nor directed cryotherapy. Third molar surgery was performed in all cases by trained surgeons using the same protocol. An exact Kruskal-Wallis test was used to compare the distributions of the worst and average pain scores and a Fisher exact test to compare verbal responses from Gracely pain scales among the 3 groups for postsurgical days (PSDs) 1 to 3. Results This study comprised 51 cryotherapy subjects (2005–2009), 63 minocycline subjects (2003–2004), and 92 comparison-group subjects (2002–2006) who were treated at academic centers and in community practices across the United States (N = 206). Demographic descriptors were similar among all groups. For PSDs 1 through 3 (unadjusted), the highest scores for worst pain (6–7 [out of 7] on Likert-type scale) were reported less frequently in each of the study groups than in subjects in the comparison group, although the numbers of subjects reporting the highest scores were few. The distribution of pain outcomes was significantly different among the 3 groups for worst pain and affective words on PSD 1 (P = .04 for both). However, the small number of subjects who reported the highest pain scores precluded adequate multivariate statistical analyses for all outcomes on PSD 1 to 3. Conclusions Data from this exploratory study suggest that adjunctive therapy to decrease postoperative pain—cryotherapy or topical minocycline—might be effective at moderating the patient’s highest pain levels after third molar surgery. The topic should be studied further in a multicenter, prospective, randomized trial. PMID:21802812
Rodrigues, Rosalina Aparecida Partezani; Robazzi, Maria Lúcia do Carmo Cruz; Erdmann, Alacoque Lorenzini; Fernandes, Josicélia Dumet; de Barros, Alba Lucia Bottura Leite; Ramos, Flávia Regina Souza
2015-01-01
The Millennium Development Goals are centered around combatting poverty and other social evils all over the world. Thus, this study seeks to identify the Millennium Development Goals as an object of study in theses from Postgraduate Nursing Programs in Brazil scoring 5 (national excellence) and 6 or 7 (international excellence), and evaluate the association between the score for the program and achieving the Millennium Development Goals. Exploratory descriptive document research. Data were collected from the Notes on Indicators/Coordination for Higher Education Personnel Improvement for the 15 Postgraduate Nursing Courses scoring between 5 and 7 in the three-year-period of 2010/2012. of the 8 Millennium Development Objectives, 6 were dealt with in the theses. There was an association (Fisher's exact test p-value=0.0059) between the distribution of the theses and the program scores in relation to the Millennium Development Objectives (p-valor=0.0347)CONCLUSION: the doctoral theses were slightly related to the Millennium Development Objectives, covering the population's economic development, health conditions and quality of life. It is recommended that Postgraduate Programs in Nursing pay closer attention to the Millennium Development Objectives.
Rodrigues, Rosalina Aparecida Partezani; Robazzi, Maria Lúcia do Carmo Cruz; Erdmann, Alacoque Lorenzini; Fernandes, Josicélia Dumet; de Barros, Alba Lucia Bottura Leite; Ramos, Flávia Regina Souza
2015-01-01
OBJECTIVES: The Millennium Development Goals are centered around combatting poverty and other social evils all over the world. Thus, this study seeks to identify the Millennium Development Goals as an object of study in theses from Postgraduate Nursing Programs in Brazil scoring 5 (national excellence) and 6 or 7 (international excellence), and evaluate the association between the score for the program and achieving the Millennium Development Goals. METHOD: Exploratory descriptive document research. Data were collected from the Notes on Indicators/Coordination for Higher Education Personnel Improvement for the 15 Postgraduate Nursing Courses scoring between 5 and 7 in the three-year-period of 2010/2012. RESULTS: of the 8 Millennium Development Objectives, 6 were dealt with in the theses. There was an association (Fisher's exact test p-value=0.0059) between the distribution of the theses and the program scores in relation to the Millennium Development Objectives (p-valor=0.0347) CONCLUSION: the doctoral theses were slightly related to the Millennium Development Objectives, covering the population's economic development, health conditions and quality of life. It is recommended that Postgraduate Programs in Nursing pay closer attention to the Millennium Development Objectives.. PMID:26312631
Liu, Hui; Liu, Wei; Lin, Ying; Liu, Teng; Ma, Zhaowu; Li, Mo; Zhang, Hong-Mei; Kenneth Wang, Qing; Guo, An-Yuan
2015-05-27
Scoring the correlation between two genes by their shared properties is a common and basic work in biological study. A prospective way to score this correlation is to quantify the overlap between the two sets of homogeneous properties of the two genes. However the proper model has not been decided, here we focused on studying the quantification of overlap and proposed a more effective model after theoretically compared 7 existing models. We defined three characteristic parameters (d, R, r) of an overlap, which highlight essential differences among the 7 models and grouped them into two classes. Then the pros and cons of the two groups of model were fully examined by their solution space in the (d, R, r) coordinate system. Finally we proposed a new model called OScal (Overlap Score calculator), which was modified on Poisson distribution (one of 7 models) to avoid its disadvantages. Tested in assessing gene relation using different data, OScal performs better than existing models. In addition, OScal is a basic mathematic model, with very low computation cost and few restrictive conditions, so it can be used in a wide-range of research areas to measure the overlap or similarity of two entities.
Balicza, Peter; Terebessy, Andras; Grosz, Zoltan; Varga, Noemi Agnes; Gal, Aniko; Fekete, Balint Andras; Molnar, Maria Judit
2018-03-01
Next-generation sequencing is increasingly utilized worldwide as a research and diagnostic tool and is anticipated to be implemented into everyday clinical practice. Since Central-Eastern European attitude toward genetic testing, especially broad genetic testing, is not well known, we performed a survey on this issue among Hungarian participants. A self-administered questionnaire was distributed among patients and patient relatives at our neurogenetic outpatient clinic. Members of the general population were also recruited via public media. We used chi-square testing and binary logistic regression to examine factors influencing attitude. We identified a mixed attitude toward genetic testing. Access to physician consultation positively influenced attitude. A higher self-determined genetic familiarity score associated with higher perceived genetic influence score, which in turn associated with greater willingness to participate in genetic testing. Medical professionals constituted a skeptical group. We think that given the controversies and complexities of the next-generation sequencing field, the optimal clinical translation of NGS data should be performed in institutions which have the unique capability to provide interprofessional health education, transformative biomedical research, and crucial patient care. With optimization of the clinical translational process, improvement of genetic literacy may increase patient engagement and empowerment. The paper highlights that in countries with relatively low-genetic literacy, a special strategy is needed to enhance the implementation of personalized medicine.
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Arkansas
ERIC Educational Resources Information Center
Center on Education Policy, 2010
2010-01-01
This paper profiles Arkansas's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) went up in math at grades 4 and 8. In reading, the percentages scoring proficient on the state test went up at…
What Do Test Score Really Mean? A Latent Class Analysis of Danish Test Score Performance
ERIC Educational Resources Information Center
McIntosh, James; Munk, Martin D.
2014-01-01
Latent class Poisson count models are used to analyse a sample of Danish test score results from a cohort of individuals born in 1954-1955, tested in 1968, and followed until 2011. The procedure takes account of unobservable effects as well as excessive zeros in the data. We show that the test scores measure manifest or measured ability as it has…
NASA Astrophysics Data System (ADS)
Mungan, Muhittin; Rador, Tonguç
2008-02-01
We study the dynamics and resulting score distribution of three-agent games where after each competition a single agent wins and scores a point. A single competition is described by a triplet of numbers p, t and q denoting the probabilities that the team with the highest, middle or lowest accumulated score wins. The three-agent game can be regarded as a social model where a player can be favored or disfavored for advancement, based on his/her accumulated score. We study the full family of solutions in the regime, where the number of agents and competitions is large, which can be regarded as a hydrodynamic limit. Depending on the parameter values (p, q, t), we find six qualitatively different asymptotic score distributions and we provide a qualitative explanation of these results. We also compare our analytical results against numerical simulations of the microscopic model and find these to be in excellent agreement. It is possible to decide the outcome of a three-agent game through a mini-tournament of two-agent competitions among the participating players and it turns out that the resulting possible score distributions are a subset of those obtained for the general three-agent games. We discuss how one can add a steady and democratic decline rate to the model and present a simple geometric construction that allows one to obtain the score evolution equations for n-agent games.
Exploratory study of the relations between spatial ability and drawing from memory.
Czarnolewski, Mark Y; Eliot, John
2012-04-01
Test scores of 119 students, attending either a public four-year college or a technical school, were related to their proportionality and detail drawing scores on the Memory for Designs Test. In regression models, the ETS Maze Tracing, Eliot-Price Mental Rotations, and Bender-Gestalt tests were consistent predictors of proportionality scores, with the latter two tests uniquely related to these. The ETS Shapes Memory Test and the Form Board Test were the strongest predictors for detail accuracy scores. The Shapes test predicted proportionality when the CTY Visual Memory Test BB was excluded. The models then provided support for the hypothesis that drawing designs from memory, a critical skill in drawing, regardless of whether one focuses on accuracy for proportionality scores or for detail scores, is jointly related to the measures of recognition, production, and traditional spatial ability measures. This study identified multifaceted skills in drawing from memory.
1988-05-01
C and Task Reference List 42 APPENDIX C: FE Tasks, Rating Scores , and ID Codes for Forms A and B 54 APPENDIX D: Nonstandard ADP Systems From Form B...DISTRIBUTION 4a "U p o:.U TABLES Number Page 1 Questionnaire Distribution and Response Rate 12 2 Mean Rating Scores for Standard System 13 3 Frequency of...Standard System Use 14 4 Use of System by Division: Standard Systems 16 5 Mean Rating Scores for Nonstandard Systems 22 6 Frequency of Nonstandard
Maenner, Matthew J; Greenberg, Jan S; Mailick, Marsha R
2015-05-01
Lower (versus higher) IQ scores have been shown to increase the risk of early mortality, however, the underlying mechanisms are poorly understood and previous studies underrepresent individuals with intellectual disability (ID) and women. This study followed one third of all senior-year students (approximately aged 17) attending public high school in Wisconsin, U.S. in 1957 (n = 10,317) until 2011. Men and women with the lowest IQ test scores (i.e., IQ scores ≤ 85) had increased rates of mortality compared to people with the highest IQ test scores, particularly for cardiovascular disease. Importantly, when educational attainment was held constant, people with lower IQ test scores did not have higher mortality by age 70 than people with higher IQ test scores. Individuals with lower IQ test scores likely experience multiple disadvantages throughout life that contribute to increased risk of early mortality.
Santos, Itamar S; Goulart, Alessandra C; Pereira, Alexandre C; Lotufo, Paulo A; Benseñor, Isabela M
2016-12-01
The American Heart Association aims to reduce the burden of cardiovascular disease in this decade by improving seven ideal cardiovascular health (CVH) characteristics in the population. The aim of this study was to quantify the association between the American Heart Association's CVH score and values for carotid intima-media thickness (CIMT) in the Brazilian Longitudinal Study of Adult Health baseline assessment. The Brazilian Longitudinal Study of Adult Health is a multicenter cohort study of civil servants aged 35 to 74 years in Brazil. In this study, the investigators analyzed 9,662 individuals with no previous cardiovascular disease. The distribution of CIMT values (categorized into age-, sex-, and race-specific quartiles) was analyzed according to CVH scores using χ 2 trend tests. Linear and multinomial regression models were built to evaluate the association between CIMT and CVH score. A significant increase was observed in the proportion of individuals within the first and second CIMT quartiles, as well as a decrease within the fourth quartile with higher CVH score strata (P for trend < .001). A 1-point increase in CVH score was associated in adjusted models with a decrease of 0.011 mm in CIMT and an odds ratio of 0.79 (95% CI, 0.77-0.81) of having CIMT in the fourth quartile. However, nearly 16% of individuals with optimal CVH scores had CIMT values in the highest quartile. In this study, significant associations were found between CIMT and CVH score in a large sample of middle-aged adults. However, a high CVH score did not warrant the absence of a significant subclinical atherosclerotic burden. Copyright © 2016 American Society of Echocardiography. Published by Elsevier Inc. All rights reserved.
High ABCD2 Scores and In-Hospital Interventions following Transient Ischemic Attack
Cutting, Shawna; Regan, Elizabeth; Lee, Vivien H.; Prabhakaran, Shyam
2016-01-01
Background and Purpose Following transient ischemic attack (TIA), there is increased risk for ischemic stroke. The American Heart Association recommends admission of patients with ABCD2 scores ≥3 for observation, rapid performance of diagnostic tests, and potential acute intervention. We aimed to determine if there is a relationship between ABCD2 scores, in-hospital ischemic events, and in-hospital treatments after TIA admission. Methods We reviewed consecutive patients admitted between 2006 and 2011 following a TIA, defined as transient focal neurological symptoms attributed to a specific vascular distribution and lasting <24 h. Three interventions were prespecified: anticoagulation for atrial fibrillation, carotid or intracranial revascularization, and intravenous or intra-arterial reperfusion therapies. We compared rates of in-hospital recurrent TIA or ischemic stroke and the receipt of interventions among patients with low (<3) versus high (≥3) ABCD2 scores. Results Of 249 patients, 11 patients (4.4%) had recurrent TIAs or strokes during their stay (8 TIAs, 3 strokes). All 11 had ABCD2 scores ≥3, and no neurological events occurred in patients with lower scores (5.1 vs. 0%; p = 0.37). Twelve patients (4.8%) underwent revascularization for large artery stenosis, 16 (6.4%) were started on anticoagulants, and no patient received intravenous or intra-arterial reperfusion therapy. The ABCD2 score was not associated with anticoagulation (p = 0.59) or revascularization (p = 0.20). Conclusions Higher ABCD2 scores may predict early ischemic events after TIA but do not predict the need for intervention. Outpatient evaluation for those with scores <3 would potentially have delayed revascularization or anticoagulant treatment in nearly one-fifth of ‘low-risk’ patients. PMID:27721312
[The Freiburg monosyllable word test in postoperative cochlear implant diagnostics].
Hey, M; Brademann, G; Ambrosch, P
2016-08-01
The Freiburg monosyllable word test represents a central tool of postoperative cochlear implant (CI) diagnostics. The objective of this study is to test the equivalence of different word lists by analysing word comprehension. For patients whose CI has been implanted for more than 5 years, the distribution of suprathreshold speech intelligibility outcomes will also be analysed. In a retrospective data analysis, speech understanding for 626 CI users word correct scores were evaluated using a total of 5211 lists with 20 words each. The analysis of word comprehension within each list shows differences in mean and in the kind of distribution function. There are lists which show a significant difference of their mean word recognition to the overall mean. The Freiburg monosyllable word test is easy to administer at suprathreshold speech level for CI recipients, and typically has a saturation level above 80 %. The Freiburg monosyllable word test can be performed successfully by the majority of CI patients. The limited balance of the test lists elicits the conclusion that an adaptive test procedure with the Freiburg monosyllable test does not make sense. The Freiburg monosyllable test can be restructured by resorting all words across lists, or by omitting individual words of a test list to increase the reliability of the test. The results show that speech intelligibility in quiet should also be investigated in CI recipients al levels below 70 dB.
Technical analysis of the Slosson Written Expression Test.
Erford, Bradley T; Hofler, Donald B
2004-06-01
The Slosson Written Expression Test was designed to assess students ages 8-17 years at risk for difficulties in written expression. Scores from three independent samples were used to evaluate the test's reliability and validity for measuring students' written expression. Test-retest reliability of the SWET subscales ranged from .80 to .94 (n = 151), and .95 for the Written Expression Total Standard Scores. The median alternate-form reliability for students' Written Expression Total Standard Scores was .81 across the three forms. Scores on the Slosson test yielded concurrent validity coefficients (n = 143) of .60 with scores from the Woodcock-Johnson: Tests of Achievement-Third Edition Broad Written Language Domain and .49 with scores on the Test of Written Language-Third Edition Spontaneous Writing Quotient. Exploratory factor analytic procedures suggested the Slosson test is comprised of two dimensions, Writing Mechanics and Writing Maturity (47.1% and 20.1% variance accounted for, respectively). In general, the Slosson Written Expression Test presents with sufficient technical characteristics to be considered a useful written expression screening test.