A weighted generalized score statistic for comparison of predictive values of diagnostic tests
Kosinski, Andrzej S.
2013-01-01
Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations which are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we present, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic which incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, it always reduces to the score statistic in the independent samples situation, and it preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the weighted generalized score test statistic in a general GEE setting. PMID:22912343
A weighted generalized score statistic for comparison of predictive values of diagnostic tests.
Kosinski, Andrzej S
2013-03-15
Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations that are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we presented, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic that incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, always reduces to the score statistic in the independent samples situation, and preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe that the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the WGS test statistic in a general GEE setting. Copyright © 2012 John Wiley & Sons, Ltd.
Standard Errors and Confidence Intervals of Norm Statistics for Educational and Psychological Tests.
Oosterhuis, Hannah E M; van der Ark, L Andries; Sijtsma, Klaas
2016-11-14
Norm statistics allow for the interpretation of scores on psychological and educational tests, by relating the test score of an individual test taker to the test scores of individuals belonging to the same gender, age, or education groups, et cetera. Given the uncertainty due to sampling error, one would expect researchers to report standard errors for norm statistics. In practice, standard errors are seldom reported; they are either unavailable or derived under strong distributional assumptions that may not be realistic for test scores. We derived standard errors for four norm statistics (standard deviation, percentile ranks, stanine boundaries and Z-scores) under the mild assumption that the test scores are multinomially distributed. A simulation study showed that the standard errors were unbiased and that corresponding Wald-based confidence intervals had good coverage. Finally, we discuss the possibilities for applying the standard errors in practical test use in education and psychology. The procedure is provided via the R function check.norms, which is available in the mokken package.
The Probability of Obtaining Two Statistically Different Test Scores as a Test Index
ERIC Educational Resources Information Center
Muller, Jorg M.
2006-01-01
A new test index is defined as the probability of obtaining two randomly selected test scores (PDTS) as statistically different. After giving a concept definition of the test index, two simulation studies are presented. The first analyzes the influence of the distribution of test scores, test reliability, and sample size on PDTS within classical…
Factors related to student performance in statistics courses in Lebanon
NASA Astrophysics Data System (ADS)
Naccache, Hiba Salim
The purpose of the present study was to identify factors that may contribute to business students in Lebanese universities having difficulty in introductory and advanced statistics courses. Two statistics courses are required for business majors at Lebanese universities. Students are not obliged to be enrolled in any math courses prior to taking statistics courses. Drawing on recent educational research, this dissertation attempted to identify the relationship between (1) students’ scores on Lebanese university math admissions tests; (2) students’ scores on a test of very basic mathematical concepts; (3) students’ scores on the survey of attitude toward statistics (SATS); (4) course performance as measured by students’ final scores in the course; and (5) their scores on the final exam. Data were collected from 561 students enrolled in multiple sections of two courses: 307 students in the introductory statistics course and 260 in the advanced statistics course in seven campuses across Lebanon over one semester. The multiple regressions results revealed four significant relationships at the introductory level: between students’ scores on the math quiz with their (1) final exam scores; (2) their final averages; (3) the Cognitive subscale of the SATS with their final exam scores; and (4) their final averages. These four significant relationships were also found at the advanced level. In addition, two more significant relationships were found between students’ final average and the two subscales of Effort (5) and Affect (6). No relationship was found between students’ scores on the admission math tests and both their final exam scores and their final averages in both the introductory and advanced level courses. On the other hand, there was no relationship between students’ scores on Lebanese admissions tests and their final achievement. Although these results were consistent across course formats and instructors, they may encourage Lebanese universities to assess the effectiveness of prerequisite math courses. Moreover, these findings may lead the Lebanese Ministry of Education to make changes to the admissions exams, course prerequisites, and course content. Finally, to enhance the attitude of students, new learning techniques, such as group work during class meetings can be helpful, and future research should aim to test the effectiveness of these pedagogical techniques on students’ attitudes toward statistics.
Can Percentiles Replace Raw Scores in the Statistical Analysis of Test Data?
ERIC Educational Resources Information Center
Zimmerman, Donald W.; Zumbo, Bruno D.
2005-01-01
Educational and psychological testing textbooks typically warn of the inappropriateness of performing arithmetic operations and statistical analysis on percentiles instead of raw scores. This seems inconsistent with the well-established finding that transforming scores to ranks and using nonparametric methods often improves the validity and power…
Kiekkas, Panagiotis; Panagiotarou, Aliki; Malja, Alvaro; Tahirai, Daniela; Zykai, Rountina; Bakalis, Nick; Stefanopoulos, Nikolaos
2015-12-01
Although statistical knowledge and skills are necessary for promoting evidence-based practice, health sciences students have expressed anxiety about statistics courses, which may hinder their learning of statistical concepts. To evaluate the effects of a biostatistics course on nursing students' attitudes toward statistics and to explore the association between these attitudes and their performance in the course examination. One-group quasi-experimental pre-test/post-test design. Undergraduate nursing students of the fifth or higher semester of studies, who attended a biostatistics course. Participants were asked to complete the pre-test and post-test forms of The Survey of Attitudes Toward Statistics (SATS)-36 scale at the beginning and end of the course respectively. Pre-test and post-test scale scores were compared, while correlations between post-test scores and participants' examination performance were estimated. Among 156 participants, post-test scores of the overall SATS-36 scale and of the Affect, Cognitive Competence, Interest and Effort components were significantly higher than pre-test ones, indicating that the course was followed by more positive attitudes toward statistics. Among 104 students who participated in the examination, higher post-test scores of the overall SATS-36 scale and of the Affect, Difficulty, Interest and Effort components were significantly but weakly correlated with higher examination performance. Students' attitudes toward statistics can be improved through appropriate biostatistics courses, while positive attitudes contribute to higher course achievements and possibly to improved statistical skills in later professional life. Copyright © 2015 Elsevier Ltd. All rights reserved.
Ho, Andrew D; Yu, Carol C
2015-06-01
Many statistical analyses benefit from the assumption that unconditional or conditional distributions are continuous and normal. More than 50 years ago in this journal, Lord and Cook chronicled departures from normality in educational tests, and Micerri similarly showed that the normality assumption is met rarely in educational and psychological practice. In this article, the authors extend these previous analyses to state-level educational test score distributions that are an increasingly common target of high-stakes analysis and interpretation. Among 504 scale-score and raw-score distributions from state testing programs from recent years, nonnormal distributions are common and are often associated with particular state programs. The authors explain how scaling procedures from item response theory lead to nonnormal distributions as well as unusual patterns of discreteness. The authors recommend that distributional descriptive statistics be calculated routinely to inform model selection for large-scale test score data, and they illustrate consequences of nonnormality using sensitivity studies that compare baseline results to those from normalized score scales.
Derivation and Applicability of Asymptotic Results for Multiple Subtests Person-Fit Statistics
Albers, Casper J.; Meijer, Rob R.; Tendeiro, Jorge N.
2016-01-01
In high-stakes testing, it is important to check the validity of individual test scores. Although a test may, in general, result in valid test scores for most test takers, for some test takers, test scores may not provide a good description of a test taker’s proficiency level. Person-fit statistics have been proposed to check the validity of individual test scores. In this study, the theoretical asymptotic sampling distribution of two person-fit statistics that can be used for tests that consist of multiple subtests is first discussed. Second, simulation study was conducted to investigate the applicability of this asymptotic theory for tests of finite length, in which the correlation between subtests and number of items in the subtests was varied. The authors showed that these distributions provide reasonable approximations, even for tests consisting of subtests of only 10 items each. These results have practical value because researchers do not have to rely on extensive simulation studies to simulate sampling distributions. PMID:29881053
The score statistic of the LD-lod analysis: detecting linkage adaptive to linkage disequilibrium.
Huang, J; Jiang, Y
2001-01-01
We study the properties of a modified lod score method for testing linkage that incorporates linkage disequilibrium (LD-lod). By examination of its score statistic, we show that the LD-lod score method adaptively combines two sources of information: (a) the IBD sharing score which is informative for linkage regardless of the existence of LD and (b) the contrast between allele-specific IBD sharing scores which is informative for linkage only in the presence of LD. We also consider the connection between the LD-lod score method and the transmission-disequilibrium test (TDT) for triad data and the mean test for affected sib pair (ASP) data. We show that, for triad data, the recessive LD-lod test is asymptotically equivalent to the TDT; and for ASP data, it is an adaptive combination of the TDT and the ASP mean test. We demonstrate that the LD-lod score method has relatively good statistical efficiency in comparison with the ASP mean test and the TDT for a broad range of LD and the genetic models considered in this report. Therefore, the LD-lod score method is an interesting approach for detecting linkage when the extent of LD is unknown, such as in a genome-wide screen with a dense set of genetic markers. Copyright 2001 S. Karger AG, Basel
el Galta, Rachid; Uitte de Willige, Shirley; de Visser, Marieke C H; Helmer, Quinta; Hsu, Li; Houwing-Duistermaat, Jeanine J
2007-09-24
In this paper, we propose a one degree of freedom test for association between a candidate gene and a binary trait. This method is a generalization of Terwilliger's likelihood ratio statistic and is especially powerful for the situation of one associated haplotype. As an alternative to the likelihood ratio statistic, we derive a score statistic, which has a tractable expression. For haplotype analysis, we assume that phase is known. By means of a simulation study, we compare the performance of the score statistic to Pearson's chi-square statistic and the likelihood ratio statistic proposed by Terwilliger. We illustrate the method on three candidate genes studied in the Leiden Thrombophilia Study. We conclude that the statistic follows a chi square distribution under the null hypothesis and that the score statistic is more powerful than Terwilliger's likelihood ratio statistic when the associated haplotype has frequency between 0.1 and 0.4 and has a small impact on the studied disorder. With regard to Pearson's chi-square statistic, the score statistic has more power when the associated haplotype has frequency above 0.2 and the number of variants is above five.
Alternative Statistical Frameworks for Student Growth Percentile Estimation
ERIC Educational Resources Information Center
Lockwood, J. R.; Castellano, Katherine E.
2015-01-01
This article suggests two alternative statistical approaches for estimating student growth percentiles (SGP). The first is to estimate percentile ranks of current test scores conditional on past test scores directly, by modeling the conditional cumulative distribution functions, rather than indirectly through quantile regressions. This would…
RAId_DbS: Peptide Identification using Database Searches with Realistic Statistics
Alves, Gelio; Ogurtsov, Aleksey Y; Yu, Yi-Kuo
2007-01-01
Background The key to mass-spectrometry-based proteomics is peptide identification. A major challenge in peptide identification is to obtain realistic E-values when assigning statistical significance to candidate peptides. Results Using a simple scoring scheme, we propose a database search method with theoretically characterized statistics. Taking into account possible skewness in the random variable distribution and the effect of finite sampling, we provide a theoretical derivation for the tail of the score distribution. For every experimental spectrum examined, we collect the scores of peptides in the database, and find good agreement between the collected score statistics and our theoretical distribution. Using Student's t-tests, we quantify the degree of agreement between the theoretical distribution and the score statistics collected. The T-tests may be used to measure the reliability of reported statistics. When combined with reported P-value for a peptide hit using a score distribution model, this new measure prevents exaggerated statistics. Another feature of RAId_DbS is its capability of detecting multiple co-eluted peptides. The peptide identification performance and statistical accuracy of RAId_DbS are assessed and compared with several other search tools. The executables and data related to RAId_DbS are freely available upon request. PMID:17961253
Testing for independence in J×K contingency tables with complex sample survey data.
Lipsitz, Stuart R; Fitzmaurice, Garrett M; Sinha, Debajyoti; Hevelone, Nathanael; Giovannucci, Edward; Hu, Jim C
2015-09-01
The test of independence of row and column variables in a (J×K) contingency table is a widely used statistical test in many areas of application. For complex survey samples, use of the standard Pearson chi-squared test is inappropriate due to correlation among units within the same cluster. Rao and Scott (1981, Journal of the American Statistical Association 76, 221-230) proposed an approach in which the standard Pearson chi-squared statistic is multiplied by a design effect to adjust for the complex survey design. Unfortunately, this test fails to exist when one of the observed cell counts equals zero. Even with the large samples typical of many complex surveys, zero cell counts can occur for rare events, small domains, or contingency tables with a large number of cells. Here, we propose Wald and score test statistics for independence based on weighted least squares estimating equations. In contrast to the Rao-Scott test statistic, the proposed Wald and score test statistics always exist. In simulations, the score test is found to perform best with respect to type I error. The proposed method is motivated by, and applied to, post surgical complications data from the United States' Nationwide Inpatient Sample (NIS) complex survey of hospitals in 2008. © 2015, The International Biometric Society.
The effect of rare variants on inflation of the test statistics in case-control analyses.
Pirie, Ailith; Wood, Angela; Lush, Michael; Tyrer, Jonathan; Pharoah, Paul D P
2015-02-20
The detection of bias due to cryptic population structure is an important step in the evaluation of findings of genetic association studies. The standard method of measuring this bias in a genetic association study is to compare the observed median association test statistic to the expected median test statistic. This ratio is inflated in the presence of cryptic population structure. However, inflation may also be caused by the properties of the association test itself particularly in the analysis of rare variants. We compared the properties of the three most commonly used association tests: the likelihood ratio test, the Wald test and the score test when testing rare variants for association using simulated data. We found evidence of inflation in the median test statistics of the likelihood ratio and score tests for tests of variants with less than 20 heterozygotes across the sample, regardless of the total sample size. The test statistics for the Wald test were under-inflated at the median for variants below the same minor allele frequency. In a genetic association study, if a substantial proportion of the genetic variants tested have rare minor allele frequencies, the properties of the association test may mask the presence or absence of bias due to population structure. The use of either the likelihood ratio test or the score test is likely to lead to inflation in the median test statistic in the absence of population structure. In contrast, the use of the Wald test is likely to result in under-inflation of the median test statistic which may mask the presence of population structure.
Datta, Rakesh; Datta, Karuna; Venkatesh, M D
2015-07-01
The classical didactic lecture has been the cornerstone of the theoretical undergraduate medical education. Their efficacy however reduces due to reduced interaction and short attention span of the students. It is hypothesized that the interactive response pad obviates some of these drawbacks. The aim of this study was to evaluate the effectiveness of an interactive response system by comparing it with conventional classroom teaching. A prospective comparative longitudinal study was conducted on 192 students who were exposed to either conventional or interactive teaching over 20 classes. Pre-test, Post-test and retentions test (post 8-12 weeks) scores were collated and statistically analysed. An independent observer measured number of student interactions in each class. Pre-test scores from both groups were similar (p = 0.71). There was significant improvement in both post test scores when compared to pre-test scores in either method (p < 0.001). The interactive post-test score was better than conventional post test score (p < 0.001) by 8-10% (95% CI-difference of means - 8.2%-9.24%-10.3%). The interactive retention test score was better than conventional retention test score (p < 0.001) by 15-18% (95% CI-difference of means - 15.0%-16.64%-18.2%). There were 51 participative events in the interactive group vs 25 in the conventional group. The Interactive Response Pad method was efficacious in teaching. Students taught with the interactive method were likely to score 8-10% higher (statistically significant) in the immediate post class time and 15-18% higher (statistically significant) after 8-12 weeks. The number of student-teacher interactions increases when using the interactive response pads.
A Comparison of Student Understanding of Seasons Using Inquiry and Didactic Teaching Methods
NASA Astrophysics Data System (ADS)
Ashcraft, Paul G.
2006-02-01
Student performance on open-ended questions concerning seasons in a university physical science content course was examined to note differences between classes that experienced inquiry using a 5-E lesson planning model and those that experienced the same content with a traditional, didactic lesson. The class examined is a required content course for elementary education majors and understanding the seasons is part of the university's state's elementary science standards. The two self-selected groups of students showed no statistically significant differences in pre-test scores, while there were statistically significant differences between the groups' post-test scores with those who participated in inquiry-based activities scoring higher. There were no statistically significant differences between the pre-test and the post-test for the students who experienced didactic teaching, while there were statistically significant improvements for the students who experienced the 5-E lesson.
ERIC Educational Resources Information Center
Goldhaber, Dan; Gratz, Trevor; Theobald, Roddy
2016-01-01
We investigate the predictive validity of teacher credential test scores for student performance in secondary STEM classrooms in Washington state. After replicating earlier findings that teacher basic skills licensure test scores are a modest and statistically significant predictor of student math test score gains in elementary grades, we focus on…
LD Score Regression Distinguishes Confounding from Polygenicity in Genome-Wide Association Studies
Bulik-Sullivan, Brendan K.; Loh, Po-Ru; Finucane, Hilary; Ripke, Stephan; Yang, Jian; Patterson, Nick; Daly, Mark J.; Price, Alkes L.; Neale, Benjamin M.
2015-01-01
Both polygenicity (i.e., many small genetic effects) and confounding biases, such as cryptic relatedness and population stratification, can yield an inflated distribution of test statistics in genome-wide association studies (GWAS). However, current methods cannot distinguish between inflation from true polygenic signal and bias. We have developed an approach, LD Score regression, that quantifies the contribution of each by examining the relationship between test statistics and linkage disequilibrium (LD). The LD Score regression intercept can be used to estimate a more powerful and accurate correction factor than genomic control. We find strong evidence that polygenicity accounts for the majority of test statistic inflation in many GWAS of large sample size. PMID:25642630
Score tests for independence in semiparametric competing risks models.
Saïd, Mériem; Ghazzali, Nadia; Rivest, Louis-Paul
2009-12-01
A popular model for competing risks postulates the existence of a latent unobserved failure time for each risk. Assuming that these underlying failure times are independent is attractive since it allows standard statistical tools for right-censored lifetime data to be used in the analysis. This paper proposes simple independence score tests for the validity of this assumption when the individual risks are modeled using semiparametric proportional hazards regressions. It assumes that covariates are available, making the model identifiable. The score tests are derived for alternatives that specify that copulas are responsible for a possible dependency between the competing risks. The test statistics are constructed by adding to the partial likelihoods for the individual risks an explanatory variable for the dependency between the risks. A variance estimator is derived by writing the score function and the Fisher information matrix for the marginal models as stochastic integrals. Pitman efficiencies are used to compare test statistics. A simulation study and a numerical example illustrate the methodology proposed in this paper.
Test anxiety and academic performance in chiropractic students.
Zhang, Niu; Henderson, Charles N R
2014-01-01
Objective : We assessed the level of students' test anxiety, and the relationship between test anxiety and academic performance. Methods : We recruited 166 third-quarter students. The Test Anxiety Inventory (TAI) was administered to all participants. Total scores from written examinations and objective structured clinical examinations (OSCEs) were used as response variables. Results : Multiple regression analysis shows that there was a modest, but statistically significant negative correlation between TAI scores and written exam scores, but not OSCE scores. Worry and emotionality were the best predictive models for written exam scores. Mean total anxiety and emotionality scores for females were significantly higher than those for males, but not worry scores. Conclusion : Moderate-to-high test anxiety was observed in 85% of the chiropractic students examined. However, total test anxiety, as measured by the TAI score, was a very weak predictive model for written exam performance. Multiple regression analysis demonstrated that replacing total anxiety (TAI) with worry and emotionality (TAI subscales) produces a much more effective predictive model of written exam performance. Sex, age, highest current academic degree, and ethnicity contributed little additional predictive power in either regression model. Moreover, TAI scores were not found to be statistically significant predictors of physical exam skill performance, as measured by OSCEs.
A Nonparametric Framework for Comparing Trends and Gaps across Tests
ERIC Educational Resources Information Center
Ho, Andrew Dean
2009-01-01
Problems of scale typically arise when comparing test score trends, gaps, and gap trends across different tests. To overcome some of these difficulties, test score distributions on the same score scale can be represented by nonparametric graphs or statistics that are invariant under monotone scale transformations. This article motivates and then…
The effects of academic grouping on student performance in science
NASA Astrophysics Data System (ADS)
Scoggins, Sally Smykla
The current action research study explored how student placement in heterogeneous or homogeneous classes in seventh-grade science affected students' eighth-grade Science State of Texas Assessment of Academic Readiness (STAAR) scores, and how ability grouping affected students' scores based on race and socioeconomic status. The population included all eighth-grade students in the target district who took the regular eighth-grade science STAAR over four academic school years. The researcher ran three statistical tests: a t-test for independent samples, a one-way between subjects analysis of variance (ANOVA) and a two-way between subjects ANOVA. The results showed no statistically significant difference between eighth-grade Pre-AP students from seventh-grade Pre-AP classes and eighth-grade Pre-AP students from heterogeneous seventh-grade classes and no statistically significant difference between Pre-AP students' scores based on socioeconomic status. There was no statistically significant interaction between socioeconomic status and the seventh-grade science classes. The scores between regular eighth-grade students who were in heterogeneous seventh-grade classes were statistically significantly higher than the scores of regular eighth-grade students who were in regular seventh-grade classes. The results also revealed that the scores of students who were White were statistically significantly higher than the scores of students who were Black and Hispanic. Black and Hispanic scores did not differ significantly. Further results indicated that the STAAR Level II and Level III scores were statistically significantly higher for the Pre-AP eighth-grade students who were in heterogeneous seventh-grade classes than the STAAR Level II and Level III scores of Pre-AP eighth-grade students who were in Pre-AP seventh-grade classes.
Evidence-based practice knowledge, attitudes, and practice of online graduate nursing students.
Rojjanasrirat, Wilaiporn; Rice, Jan
2017-06-01
This study aimed to evaluate changes in evidence-based practice (EBP) knowledge, attitudes, and practice of nursing students before and after completing an online, graduate level, introductory research/EBP course. A prospective one-group pretest-posttest design. A private university in the Midwestern, USA. Sixty-three online nurse practitioner students in Master's program. A convenient sample of online graduate nursing students who enrolled in the research/EBP course was invited to participate in the study. Study outcomes were measured using the Evidence-Based Practice Questionnaire (EBPQ) before and after completing the course. Descriptive statistics and paired-Samples t-test was used to assess the mean differences between pre-and post-test scores. Overall, students' post-test EBP scores were significantly improved over pre-test scores, t(63)=-9.034, p<0.001). Statistically significant differences were found for practice of EBP mean scores t(63)=-12.78, p=0.001). No significant differences were found between pre and post-tests on knowledge and attitudes toward EBP scores. Most frequently cited barriers to EBP were lack of understanding of statistics, interpretation of findings, lack of time, and lack of library resources. Copyright © 2017 Elsevier Ltd. All rights reserved.
Meijer, Rob R; Niessen, A Susan M; Tendeiro, Jorge N
2016-02-01
Although there are many studies devoted to person-fit statistics to detect inconsistent item score patterns, most studies are difficult to understand for nonspecialists. The aim of this tutorial is to explain the principles of these statistics for researchers and clinicians who are interested in applying these statistics. In particular, we first explain how invalid test scores can be detected using person-fit statistics; second, we provide the reader practical examples of existing studies that used person-fit statistics to detect and to interpret inconsistent item score patterns; and third, we discuss a new R-package that can be used to identify and interpret inconsistent score patterns. © The Author(s) 2015.
Montague, J R; Frei, J K
1993-04-01
To determine whether significant correlations existed among quantitative and qualitative predictors of students' academic success and quantitative outcomes of such success over a 12-year period in a small university's premedical program. A database was assembled from information on the 199 graduates who earned BS degrees in biology from Barry University's School of Natural and Health Sciences from 1980 through 1991. The quantitative variables were year of BS degree, total score on the Scholastic Aptitude Test (SAT), various measures of undergraduate grade-point averages (GPAs), and total score on the Medical College Admission Test (MCAT); and the qualitative variables were minority (54% of the students) or majority status and transfer (about one-third of the students) or nontransfer status. The statistical methods were multiple analysis of variance and stepwise multiple regression. Statistically significant positive correlations were found among SAT total scores, final GPAs, biology GPAs versus nonbiology GPAs, and MCAT total scores. These correlations held for transfer versus nontransfer students and for minority versus majority students. Over the 12-year period there were significant fluctuations in mean MCAT scores. The students' SAT scores and GPAs proved to be statistically reliable predictors of MCAT scores, but the minority or majority status and the transfer or nontransfer status of the students were statistically insignificant.
On the Performance of the Marginal Homogeneity Test to Detect Rater Drift.
Sgammato, Adrienne; Donoghue, John R
2018-06-01
When constructed response items are administered repeatedly, "trend scoring" can be used to test for rater drift. In trend scoring, raters rescore responses from the previous administration. Two simulation studies evaluated the utility of Stuart's Q measure of marginal homogeneity as a way of evaluating rater drift when monitoring trend scoring. In the first study, data were generated based on trend scoring tables obtained from an operational assessment. The second study tightly controlled table margins to disentangle certain features present in the empirical data. In addition to Q , the paired t test was included as a comparison, because of its widespread use in monitoring trend scoring. Sample size, number of score categories, interrater agreement, and symmetry/asymmetry of the margins were manipulated. For identical margins, both statistics had good Type I error control. For a unidirectional shift in margins, both statistics had good power. As expected, when shifts in the margins were balanced across categories, the t test had little power. Q demonstrated good power for all conditions and identified almost all items identified by the t test. Q shows substantial promise for monitoring of trend scoring.
Linking the Smarter Balanced Assessments to NWEA MAP Assessments
ERIC Educational Resources Information Center
Northwest Evaluation Association, 2015
2015-01-01
Concordance tables have been used for decades to relate scores on different tests measuring similar but distinct constructs. These tables, typically derived from statistical linking procedures, provide a direct link between scores on different tests and serve various purposes. Aside from describing how a score on one test relates to performance on…
ERIC Educational Resources Information Center
Jacob, Brian A.
2016-01-01
Contrary to popular belief, modern cognitive assessments--including the new Common Core tests--produce test scores based on sophisticated statistical models rather than the simple percent of items a student answers correctly. While there are good reasons for this, it means that reported test scores depend on many decisions made by test designers,…
ERIC Educational Resources Information Center
King, Molly Elizabeth
2016-01-01
The purpose of this quantitative, causal-comparative study was to compare the effect elementary music and visual arts lessons had on third through sixth grade standardized mathematics test scores. Inferential statistics were used to compare the differences between test scores of students who took in-school, elementary, music instruction during the…
Sequi, Marco; Campi, Rita; Clavenna, Antonio; Bonati, Maurizio
2013-03-01
To evaluate the quality of data reporting and statistical methods performed in drug utilization studies in the pediatric population. Drug utilization studies evaluating all drug prescriptions to children and adolescents published between January 1994 and December 2011 were retrieved and analyzed. For each study, information on measures of exposure/consumption, the covariates considered, descriptive and inferential analyses, statistical tests, and methods of data reporting was extracted. An overall quality score was created for each study using a 12-item checklist that took into account the presence of outcome measures, covariates of measures, descriptive measures, statistical tests, and graphical representation. A total of 22 studies were reviewed and analyzed. Of these, 20 studies reported at least one descriptive measure. The mean was the most commonly used measure (18 studies), but only five of these also reported the standard deviation. Statistical analyses were performed in 12 studies, with the chi-square test being the most commonly performed test. Graphs were presented in 14 papers. Sixteen papers reported the number of drug prescriptions and/or packages, and ten reported the prevalence of the drug prescription. The mean quality score was 8 (median 9). Only seven of the 22 studies received a score of ≥10, while four studies received a score of <6. Our findings document that only a few of the studies reviewed applied statistical methods and reported data in a satisfactory manner. We therefore conclude that the methodology of drug utilization studies needs to be improved.
ERIC Educational Resources Information Center
Gidey, Mu'uz
2015-01-01
This action research is carried out in a practical class room setting to devise an innovative way of administering tutorial classes to improve students' learning competence with particular reference to gendered test scores. A before-after test score analyses of mean and standard deviations along with t-statistical tests of hypotheses of second…
Grade Equivalents: We Report Them, You Should Too.
ERIC Educational Resources Information Center
Ligon, Glynn; Battaile, Richard
In certain situations, grade equivalent scores are the most appropriate statistic available for reporting achievement test data. It is noted that testing practitioners have found that raw scores, normal curve equivalents, stanines, and standard scores are very useful. However, it is best to convert to either grade equivalents or percentiles before…
Verification of learner’s differences by team-based learning in biochemistry classes
2017-01-01
Purpose We tested the effect of team-based learning (TBL) on medical education through the second-year premedical students’ TBL scores in biochemistry classes over 5 years. Methods We analyzed the results based on test scores before and after the students’ debate. The groups of students for statistical analysis were divided as follows: group 1 comprised the top-ranked students, group 3 comprised the low-ranked students, and group 2 comprised the medium-ranked students. Therefore, group T comprised 382 students (the total number of students in group 1, 2, and 3). To calibrate the difficulty of the test, original scores were converted into standardized scores. We determined the differences of the tests using Student t-test, and the relationship between scores before, and after the TBL using linear regression tests. Results Although there was a decrease in the lowest score, group T and 3 showed a significant increase in both original and standardized scores; there was also an increase in the standardized score of group 3. There was a positive correlation between the pre- and the post-debate scores in group T, and 2. And the beta values of the pre-debate scores and “the changes between the pre- and post-debate scores” were statistically significant in both original and standardized scores. Conclusion TBL is one of the educational methods for helping students improve their grades, particularly those of low-ranked students. PMID:29207457
Predicting occupational personality test scores.
Furnham, A; Drakeley, R
2000-01-01
The relationship between students' actual test scores and their self-estimated scores on the Hogan Personality Inventory (HPI; R. Hogan & J. Hogan, 1992), an omnibus personality questionnaire, was examined. Despite being given descriptive statistics and explanations of each of the dimensions measured, the students tended to overestimate their scores; yet all correlations between actual and estimated scores were positive and significant. Correlations between self-estimates and actual test scores were highest for sociability, ambition, and adjustment (r = .62 to r = .67). The results are discussed in terms of employers' use and abuse of personality assessment for job recruitment.
Effects of ozone (O3) therapy on cisplatin-induced ototoxicity in rats.
Koçak, Hasan Emre; Taşkın, Ümit; Aydın, Salih; Oktay, Mehmet Faruk; Altınay, Serdar; Çelik, Duygu Sultan; Yücebaş, Kadir; Altaş, Bengül
2016-12-01
The aim of this study is to investigate the effect of rectal ozone and intratympanic ozone therapy on cisplatin-induced ototoxicity in rats. Eighteen female Wistar albino rats were included in our study. External auditory canal and tympanic membrane examinations were normal in all rats. The rats were randomly divided into three groups. Initially, all the rats were tested with distortion product otoacoustic emissions (DPOAE), and emissions were measured normally. All rats were injected with 5-mg/kg/day cisplatin for 3 days intraperitoneally. Ototoxicy had developed in all rats, as confirmed with DPOAE after 1 week. Rectal and intratympanic ozone therapy group was Group 1. No treatment was administered for the rats in Group 2 as the control group. The rats in Group 3 were treated with rectal ozone. All the rats were tested with DPOAE under general anesthesia, and all were sacrificed for pathological examination 1 week after ozone administration. Their cochleas were removed. The outer hair cell damage and stria vascularis damage were examined. In the statistical analysis conducted, a statistically significant difference between Group 1 and Group 2 was observed in all frequencies according to the DPOAE test. In addition, between Group 2 and Group 3, a statistically significant difference was observed in the DPOAE test. However, a statistically significant difference was not observed between Group 1 and Group 3 according to the DPOAE test. According to histopathological scoring, the outer hair cell damage score was statistically significantly high in Group 2 compared with Group 1. In addition, the outer hair cell damage score was also statistically significantly high in Group 2 compared with Group 3. Outer hair cell damage scores were low in Group 1 and Group 3, but there was no statistically significant difference between these groups. There was no statistically significant difference between the groups in terms of stria vascularis damage score examinations. Systemic ozone gas therapy is effective in the treatment of cell damage in cisplatin-induced ototoxicity. The intratympanic administration of ozone gas does not have any additional advantage over the rectal administration.
A Seven-Year Follow-Up of Intelligence Test Scores of Foster Grandparents
ERIC Educational Resources Information Center
Troll, Lillian E.; And Others
1976-01-01
After seven years, a group (N=32) of originally nonemployed poverty-level older people (over 60) now employed as foster grandparents were retested with the WAIS. Three subtest scores showed stability and Digit Span showed a statistically significant drop. Neither age nor initial level of health or WAIS scores was related to test-score changes over…
An Evaluation of the Euroncap Crash Test Safety Ratings in the Real World
Segui-Gomez, Maria; Lopez-Valdes, Francisco J.; Frampton, Richard
2007-01-01
We investigated whether the rating obtained in the EuroNCAP test procedures correlates with injury protection to vehicle occupants in real crashes using data in the UK Cooperative Crash Injury Study (CCIS) database from 1996 to 2005. Multivariate Poisson regression models were developed, using the Abbreviated Injury Scale (AIS) score by body region as the dependent variable and the EuroNCAP score for that particular body region, seat belt use, mass ratio and Equivalent Test Speed (ETS) as independent variables. Our models identified statistically significant relationships between injury severity and safety belt use, mass ratio and ETS. We could not identify any statistically significant relationships between the EuroNCAP body region scores and real injury outcome except for the protection to pelvis-femur-knee in frontal impacts where scoring “green” is significantly better than scoring “yellow” or “red”.
General Framework for Meta-analysis of Rare Variants in Sequencing Association Studies
Lee, Seunggeun; Teslovich, Tanya M.; Boehnke, Michael; Lin, Xihong
2013-01-01
We propose a general statistical framework for meta-analysis of gene- or region-based multimarker rare variant association tests in sequencing association studies. In genome-wide association studies, single-marker meta-analysis has been widely used to increase statistical power by combining results via regression coefficients and standard errors from different studies. In analysis of rare variants in sequencing studies, region-based multimarker tests are often used to increase power. We propose meta-analysis methods for commonly used gene- or region-based rare variants tests, such as burden tests and variance component tests. Because estimation of regression coefficients of individual rare variants is often unstable or not feasible, the proposed method avoids this difficulty by calculating score statistics instead that only require fitting the null model for each study and then aggregating these score statistics across studies. Our proposed meta-analysis rare variant association tests are conducted based on study-specific summary statistics, specifically score statistics for each variant and between-variant covariance-type (linkage disequilibrium) relationship statistics for each gene or region. The proposed methods are able to incorporate different levels of heterogeneity of genetic effects across studies and are applicable to meta-analysis of multiple ancestry groups. We show that the proposed methods are essentially as powerful as joint analysis by directly pooling individual level genotype data. We conduct extensive simulations to evaluate the performance of our methods by varying levels of heterogeneity across studies, and we apply the proposed methods to meta-analysis of rare variant effects in a multicohort study of the genetics of blood lipid levels. PMID:23768515
Patients and medical statistics. Interest, confidence, and ability.
Woloshin, Steven; Schwartz, Lisa M; Welch, H Gilbert
2005-11-01
People are increasingly presented with medical statistics. There are no existing measures to assess their level of interest or confidence in using medical statistics. To develop 2 new measures, the STAT-interest and STAT-confidence scales, and assess their reliability and validity. Survey with retest after approximately 2 weeks. Two hundred and twenty-four people were recruited from advertisements in local newspapers, an outpatient clinic waiting area, and a hospital open house. We developed and revised 5 items on interest in medical statistics and 3 on confidence understanding statistics. Study participants were mostly college graduates (52%); 25% had a high school education or less. The mean age was 53 (range 20 to 84) years. Most paid attention to medical statistics (6% paid no attention). The mean (SD) STAT-interest score was 68 (17) and ranged from 15 to 100. Confidence in using statistics was also high: the mean (SD) STAT-confidence score was 65 (19) and ranged from 11 to 100. STAT-interest and STAT-confidence scores were moderately correlated (r=.36, P<.001). Both scales demonstrated good test-retest repeatability (r=.60, .62, respectively), internal consistency reliability (Cronbach's alpha=0.70 and 0.78), and usability (individual item nonresponse ranged from 0% to 1.3%). Scale scores correlated only weakly with scores on a medical data interpretation test (r=.15 and .26, respectively). The STAT-interest and STAT-confidence scales are usable and reliable. Interest and confidence were only weakly related to the ability to actually use data.
Stability of scores for the Slosson Full-Range Intelligence Test.
Williams, Thomas O; Eaves, Ronald C; Woods-Groves, Suzanne; Mariano, Gina
2007-08-01
The test-retest stability of the Slosson Full-Range Intelligence Test by Algozzine, Eaves, Mann, and Vance was investigated with test scores from a sample of 103 students. With a mean interval of 13.7 mo. and different examiners for each of the two test administrations, the test-retest reliability coefficients for the Full-Range IQ, Verbal Reasoning, Abstract Reasoning, Quantitative Reasoning, and Memory were .93, .85, .80, .80, and .83, respectively. Mean differences from the test-retest scores were not statistically significantly different for any of the scales. Results suggest that Slosson scores are stable over time even when different examiners administer the test.
ERIC Educational Resources Information Center
Denbleyker, John Nickolas
2012-01-01
The shortcomings of the proportion above cut (PAC) statistic used so prominently in the educational landscape renders it a very problematic measure for making correct inferences with student test data. The limitations of PAC-based statistics are more pronounced with cross-test comparisons due to their dependency on cut-score locations. A better…
How White Teachers Experience and Think about Race in Professional Development
ERIC Educational Resources Information Center
Marcy, Renee
2010-01-01
The public educational system in the United States fails to proficiently educate a majority of African American, Latino/a, and students from low-income backgrounds. Test score statistics show an average scaled score gap of twenty-six points between African American and White students (National Center for Education Statistics, 2007). The term…
Rank score and permutation testing alternatives for regression quantile estimates
Cade, B.S.; Richards, J.D.; Mielke, P.W.
2006-01-01
Performance of quantile rank score tests used for hypothesis testing and constructing confidence intervals for linear quantile regression estimates (0 ≤ τ ≤ 1) were evaluated by simulation for models with p = 2 and 6 predictors, moderate collinearity among predictors, homogeneous and hetero-geneous errors, small to moderate samples (n = 20–300), and central to upper quantiles (0.50–0.99). Test statistics evaluated were the conventional quantile rank score T statistic distributed as χ2 random variable with q degrees of freedom (where q parameters are constrained by H 0:) and an F statistic with its sampling distribution approximated by permutation. The permutation F-test maintained better Type I errors than the T-test for homogeneous error models with smaller n and more extreme quantiles τ. An F distributional approximation of the F statistic provided some improvements in Type I errors over the T-test for models with > 2 parameters, smaller n, and more extreme quantiles but not as much improvement as the permutation approximation. Both rank score tests required weighting to maintain correct Type I errors when heterogeneity under the alternative model increased to 5 standard deviations across the domain of X. A double permutation procedure was developed to provide valid Type I errors for the permutation F-test when null models were forced through the origin. Power was similar for conditions where both T- and F-tests maintained correct Type I errors but the F-test provided some power at smaller n and extreme quantiles when the T-test had no power because of excessively conservative Type I errors. When the double permutation scheme was required for the permutation F-test to maintain valid Type I errors, power was less than for the T-test with decreasing sample size and increasing quantiles. Confidence intervals on parameters and tolerance intervals for future predictions were constructed based on test inversion for an example application relating trout densities to stream channel width:depth.
Testing the non-unity of rate ratio under inverse sampling.
Tang, Man-Lai; Liao, Yi Jie; Ng, Hong Keung Tony; Chan, Ping Shing
2007-08-01
Inverse sampling is considered to be a more appropriate sampling scheme than the usual binomial sampling scheme when subjects arrive sequentially, when the underlying response of interest is acute, and when maximum likelihood estimators of some epidemiologic indices are undefined. In this article, we study various statistics for testing non-unity rate ratios in case-control studies under inverse sampling. These include the Wald, unconditional score, likelihood ratio and conditional score statistics. Three methods (the asymptotic, conditional exact, and Mid-P methods) are adopted for P-value calculation. We evaluate the performance of different combinations of test statistics and P-value calculation methods in terms of their empirical sizes and powers via Monte Carlo simulation. In general, asymptotic score and conditional score tests are preferable for their actual type I error rates are well controlled around the pre-chosen nominal level, and their powers are comparatively the largest. The exact version of Wald test is recommended if one wants to control the actual type I error rate at or below the pre-chosen nominal level. If larger power is expected and fluctuation of sizes around the pre-chosen nominal level are allowed, then the Mid-P version of Wald test is a desirable alternative. We illustrate the methodologies with a real example from a heart disease study. (c) 2007 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
Evaluation of "e-rater"® for the "Praxis I"®Writing Test. Research Report. ETS RR-15-03
ERIC Educational Resources Information Center
Ramineni, Chaitanya; Trapani, Catherine S.; Williamson, David M.
2015-01-01
Automated scoring models were trained and evaluated for the essay task in the "Praxis I"® writing test. Prompt-specific and generic "e-rater"® scoring models were built, and evaluation statistics, such as quadratic weighted kappa, Pearson correlation, and standardized differences in mean scores, were examined to evaluate the…
ERIC Educational Resources Information Center
Baggerly, Jennifer; Ferretti, Larissa K.
2008-01-01
What is the impact of natural disasters on students' statewide assessment scores? To answer this question, Florida Comprehensive Assessment Test (FCAT) scores of 55,881 students in grades 4 through 10 were analyzed to determine if there were significant decreases after the 2004 hurricanes. Results reveal that there was statistical but no practical…
The effects of hands-on-science instruction on the science achievement of middle school students
NASA Astrophysics Data System (ADS)
Wiggins, Felita
Student achievement in the Twenty First Century demands a new rigor in student science knowledge, since advances in science and technology require students to think and act like scientists. As a result, students must acquire proficient levels of knowledge and skills to support a knowledge base that is expanding exponentially with new scientific advances. This study examined the effects of hands-on-science instruction on the science achievement of middle school students. More specifically, this study was concerned with the influence of hands-on science instruction versus traditional science instruction on the science test scores of middle school students. The subjects in this study were one hundred and twenty sixth-grade students in six classes. Instruction involved lecture/discussion and hands-on activities carried out for a three week period. Specifically, the study ascertained the influence of the variables gender, ethnicity, and socioeconomic status on the science test scores of middle school students. Additionally, this study assessed the effect of the variables gender, ethnicity, and socioeconomic status on the attitudes of sixth grade students toward science. The two instruments used to collect data for this study were the Prentice Hall unit ecosystem test and the Scientific Work Experience Programs for Teachers Study (SWEPT) student's attitude survey. Moreover, the data for the study was treated using the One-Way Analysis of Covariance and the One-Way Analysis of Variance. The following findings were made based on the results: (1) A statistically significant difference existed in the science performance of middle school students exposed to hands-on science instruction. These students had significantly higher scores than the science performance of middle school students exposed to traditional instruction. (2) A statistically significant difference did not exist between the science scores of male and female middle school students. (3) A statistically significant difference did not exist between the science scores of African American and non-African American middle school students. (4) A statistically significant difference existed in the socioeconomic status of students who were not provided with assisted lunches. Students with unassisted lunches had significantly higher science scores than those middle school students who were provided with assisted lunches. (5) A statistically significant difference was not found in the attitude scores of middle school students who were exposed to hands-on or traditional science instruction. (6) A statistically significant difference was not found in the observed attitude scores of middle school students who were exposed to either hands-on or traditional science instruction by their socioeconomic status. (7) A statistically significant difference was not found in the observed attitude scores of male and female students. (8) A statistically significant difference was not found in the observed attitude scores of African American and non African American students.
Caruso, J C
2001-06-01
The unreliability of difference scores is a well documented phenomenon in the social sciences and has led researchers and practitioners to interpret differences cautiously, if at all. In the case of the Kaufman Adult and Adolescent Intelligence Test (KAIT), the unreliability of the difference between the Fluid IQ and the Crystallized IQ is due to the high correlation between the two scales. The consequences of the lack of precision with which differences are identified are wide confidence intervals and unpowerful significance tests (i.e., large differences are required to be declared statistically significant). Reliable component analysis (RCA) was performed on the subtests of the KAIT in order to address these problems. RCA is a new data reduction technique that results in uncorrelated component scores with maximum proportions of reliable variance. Results indicate that the scores defined by RCA have discriminant and convergent validity (with respect to the equally weighted scores) and that differences between the scores, derived from a single testing session, were more reliable than differences derived from equal weighting for each age group (11-14 years, 15-34 years, 35-85+ years). This reliability advantage results in narrower confidence intervals around difference scores and smaller differences required for statistical significance.
Intelligence--Group Administered, Grades 7 and Above. Annotated Bibliography of Tests.
ERIC Educational Resources Information Center
Educational Testing Service, Princeton, NJ. Test Collection.
Most of the 47 tests included in this bibliography assess intelligence and provide an actual I.Q. score or other score with similar statistical properties. Many of the tests are designed to measure occupational qualifications or to aid in career guidance. Although all ages are represented, the majority of tests are targeted to grade 7 and above. A…
ERIC Educational Resources Information Center
Moses, Tim
2008-01-01
Nine statistical strategies for selecting equating functions in an equivalent groups design were evaluated. The strategies of interest were likelihood ratio chi-square tests, regression tests, Kolmogorov-Smirnov tests, and significance tests for equated score differences. The most accurate strategies in the study were the likelihood ratio tests…
How to Compare Parametric and Nonparametric Person-Fit Statistics Using Real Data
ERIC Educational Resources Information Center
Sinharay, Sandip
2017-01-01
Person-fit assessment (PFA) is concerned with uncovering atypical test performance as reflected in the pattern of scores on individual items on a test. Existing person-fit statistics (PFSs) include both parametric and nonparametric statistics. Comparison of PFSs has been a popular research topic in PFA, but almost all comparisons have employed…
Stein, Marjorie W; Frank, Susan J; Roberts, Jeffrey H; Finkelstein, Malka; Heo, Moonseong
2016-05-01
The aim of this study was to determine whether group-based or didactic teaching is more effective to teach ACR Appropriateness Criteria to medical students. An identical pretest, posttest, and delayed multiple-choice test was used to evaluate the efficacy of the two teaching methods. Descriptive statistics comparing test scores were obtained. On the posttest, the didactic group gained 12.5 points (P < .0001), and the group-based learning students gained 16.3 points (P < .0001). On the delayed test, the didactic group gained 14.4 points (P < .0001), and the group-based learning students gained 11.8 points (P < .001). The gains in scores on both tests were statistically significant for both groups. However, the differences in scores were not statistically significant comparing the two educational methods. Compared with didactic lectures, group-based learning is more enjoyable, time efficient, and equally efficacious. The choice of educational method can be individualized for each institution on the basis of group size, time constraints, and faculty availability. Copyright © 2016 American College of Radiology. Published by Elsevier Inc. All rights reserved.
Morris, Roisin; MacNeela, Padraig; Scott, Anne; Treacy, Pearl; Hyde, Abbey; O'Brien, Julian; Lehwaldt, Daniella; Byrne, Anne; Drennan, Jonathan
2008-04-01
In a study to establish the interrater reliability of the Irish Nursing Minimum Data Set (I-NMDS) for mental health difficulties relating to the choice of reliability test statistic were encountered. The objective of this paper is to highlight the difficulties associated with testing interrater reliability for an ordinal scale using a relatively homogenous sample and the recommended kw statistic. One pair of mental health nurses completed the I-NMDS for mental health for a total of 30 clients attending a mental health day centre over a two-week period. Data was analysed using the kw and percentage agreement statistics. A total of 34 of the 38 I-NMDS for mental health variables with lower than acceptable levels of kw reliability scores achieved acceptable levels of reliability according to their percentage agreement scores. The study findings implied that, due to the homogeneity of the sample, low variability within the data resulted in the 'base rate problem' associated with the use of kw statistic. Conclusions point to the interpretation of kw in tandem with percentage agreement scores. Suggestions that kw scores were low due to chance agreement and that one should strive to use a study sample with known variability are queried.
A Note on Three Statistical Tests in the Logistic Regression DIF Procedure
ERIC Educational Resources Information Center
Paek, Insu
2012-01-01
Although logistic regression became one of the well-known methods in detecting differential item functioning (DIF), its three statistical tests, the Wald, likelihood ratio (LR), and score tests, which are readily available under the maximum likelihood, do not seem to be consistently distinguished in DIF literature. This paper provides a clarifying…
Using Microsoft Excel[R] to Calculate Descriptive Statistics and Create Graphs
ERIC Educational Resources Information Center
Carr, Nathan T.
2008-01-01
Descriptive statistics and appropriate visual representations of scores are important for all test developers, whether they are experienced testers working on large-scale projects, or novices working on small-scale local tests. Many teachers put in charge of testing projects do not know "why" they are important, however, and are utterly convinced…
ERIC Educational Resources Information Center
Moses, Tim; Holland, Paul W.
2010-01-01
In this study, eight statistical strategies were evaluated for selecting the parameterizations of loglinear models for smoothing the bivariate test score distributions used in nonequivalent groups with anchor test (NEAT) equating. Four of the strategies were based on significance tests of chi-square statistics (Likelihood Ratio, Pearson,…
NASA Astrophysics Data System (ADS)
Anderson, Pamela Bennett
Purpose. The purpose of the first study was to ascertain the extent to which differences were present in the STAAR Mathematics and Science test scores by Grade 5 and Grade 8 student economic status. The purpose of the second study was to examine differences in Grade 5 STAAR Mathematics and Science test performance by gender and by ethnicity/race (i.e., Asian, Black, Hispanic, and White). Finally, with respect to the third study in this journal-ready dissertation, the purpose was to investigate the STAAR Mathematics and Science test scores of Grade 8 students by gender and by ethnicity/race (i.e., Asian, Black, Hispanic, and White). Method. For this journal-ready dissertation, a non-experimental, causal-comparative research design (Creswell, 2009) was used in all three studies. Grade 5 and Grade 8 STAAR Mathematics and Science test data were analyzed for the 2011-2012 through the 2014-2015 school years. The dependent variables were the STAAR Mathematics and Science test scores for Grade 5 and Grade 8. The independent variables analyzed in these studies were student economic status, gender, and ethnicity/race. Findings. Regarding the first study, statistically significant differences were present in Grade 5 and Grade 8 STAAR Mathematics and Science test scores by student economic status for each year. Moderate effect sizes (Cohen's d) were present for each year of the study for the Grade 5 STAAR Mathematics and Science exams, Grade 8 Science exams, and the 2014-2015 Grade 8 STAAR Mathematics exam. However, a small effect size was present for the 2011-2012 through 2013-2014 Grade 8 STAAR Mathematics exam. Regarding the second and third study, statistically significant differences were revealed for Grade 5 and Grade 8 STAAR Mathematics and Science test scores based on gender, with trivial effect sizes. Furthermore, statistically significant differences were present in these test scores by ethnicity/race, with moderate effects for each year of the study. With regard to each year for both studies, Asian students had the highest average test scores, followed by White, Hispanic, and Black students, respectively. Thus, a stairstep achievement gap (Carpenter, Ramirez, & Severn, 2006) was present.
Immersive Theater - a Proven Way to Enhance Learning Retention
NASA Astrophysics Data System (ADS)
Reiff, P. H.; Zimmerman, L.; Spillane, S.; Sumners, C.
2014-12-01
The portable immersive theater has gone from our first demonstration at fall AGU 2003 to a product offered by multiple companies in various versions to literally millions of users per year. As part of our NASA funded outreach program, we conducted a test of learning in a portable Discovery Dome as contrasted with learning the same materials (visuals and sound track) on a computer screen. We tested 200 middle school students (primarily underserved minorities). Paired t-tests and an independent t-test were used to compare the amount of learning that students achieved. Interest questionnaires were administered to participants in formal (public school) settings and focus groups were conducted in informal (museum camp and educational festival) settings. Overall results from the informal and formal educational setting indicated that there was a statistically significant increase in test scores after viewing We Choose Space. There was a statistically significant increase in test scores for students who viewed We Choose Space in the portable Discovery Dome (9.75) as well as with the computer (8.88). However, long-term retention of the material tested on the questionnaire indicated that for students who watched We Choose Space in the portable Discovery Dome, there was a statistically significant long-term increase in test scores (10.47), whereas, six weeks after learning on the computer, the improvements over the initial baseline (3.49) were far less and were not statistically significant. The test score improvement six weeks after learning in the dome was essentially the same as the post test immediately after watching the show, demonstrating virtually no loss of gained information in the six week interval. In the formal educational setting, approximately 34% of the respondents indicated that they wanted to learn more about becoming a scientist, while 35% expressed an interest in a career in space science. In the informal setting, 26% indicated that they were interested in pursuing a career in space science.
Lippert, Christoph; Xiang, Jing; Horta, Danilo; Widmer, Christian; Kadie, Carl; Heckerman, David; Listgarten, Jennifer
2014-11-15
Set-based variance component tests have been identified as a way to increase power in association studies by aggregating weak individual effects. However, the choice of test statistic has been largely ignored even though it may play an important role in obtaining optimal power. We compared a standard statistical test-a score test-with a recently developed likelihood ratio (LR) test. Further, when correction for hidden structure is needed, or gene-gene interactions are sought, state-of-the art algorithms for both the score and LR tests can be computationally impractical. Thus we develop new computationally efficient methods. After reviewing theoretical differences in performance between the score and LR tests, we find empirically on real data that the LR test generally has more power. In particular, on 15 of 17 real datasets, the LR test yielded at least as many associations as the score test-up to 23 more associations-whereas the score test yielded at most one more association than the LR test in the two remaining datasets. On synthetic data, we find that the LR test yielded up to 12% more associations, consistent with our results on real data, but also observe a regime of extremely small signal where the score test yielded up to 25% more associations than the LR test, consistent with theory. Finally, our computational speedups now enable (i) efficient LR testing when the background kernel is full rank, and (ii) efficient score testing when the background kernel changes with each test, as for gene-gene interaction tests. The latter yielded a factor of 2000 speedup on a cohort of size 13 500. Software available at http://research.microsoft.com/en-us/um/redmond/projects/MSCompBio/Fastlmm/. heckerma@microsoft.com Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.
Evidential Value That Exercise Improves BMI z-Score in Overweight and Obese Children and Adolescents
Kelley, George A.; Kelley, Kristi S.
2015-01-01
Background. Given the cardiovascular disease (CVD) related importance of understanding the true effects of exercise on adiposity in overweight and obese children and adolescents, this study examined whether there is evidential value to rule out excessive and inappropriate reporting of statistically significant results, a major problem in the published literature, with respect to exercise-induced improvements in BMI z-score among overweight and obese children and adolescents. Methods. Using data from a previous meta-analysis of 10 published studies that included 835 overweight and obese children and adolescents, a novel, recently developed approach (p-curve) was used to test for evidential value and rule out selective reporting of findings. Chi-squared tests (χ 2) were used to test for statistical significance with alpha (p) values <0.05 considered statistically significant. Results. Six of 10 findings (60%) were statistically significant. Statistically significant right-skew to rule out selective reporting was found (χ 2 = 38.8, p = 0.0001). Conversely, studies neither lacked evidential value (χ 2 = 6.8, p = 0.87) nor lacked evidential value and were intensely p-hacked (χ 2 = 4.3, p = 0.98). Conclusion. Evidential value results confirm that exercise reduces BMI z-score in overweight and obese children and adolescents, an important therapeutic strategy for treating and preventing CVD. PMID:26509145
Kelley, George A; Kelley, Kristi S
2015-01-01
Background. Given the cardiovascular disease (CVD) related importance of understanding the true effects of exercise on adiposity in overweight and obese children and adolescents, this study examined whether there is evidential value to rule out excessive and inappropriate reporting of statistically significant results, a major problem in the published literature, with respect to exercise-induced improvements in BMI z-score among overweight and obese children and adolescents. Methods. Using data from a previous meta-analysis of 10 published studies that included 835 overweight and obese children and adolescents, a novel, recently developed approach (p-curve) was used to test for evidential value and rule out selective reporting of findings. Chi-squared tests (χ (2)) were used to test for statistical significance with alpha (p) values <0.05 considered statistically significant. Results. Six of 10 findings (60%) were statistically significant. Statistically significant right-skew to rule out selective reporting was found (χ (2) = 38.8, p = 0.0001). Conversely, studies neither lacked evidential value (χ (2) = 6.8, p = 0.87) nor lacked evidential value and were intensely p-hacked (χ (2) = 4.3, p = 0.98). Conclusion. Evidential value results confirm that exercise reduces BMI z-score in overweight and obese children and adolescents, an important therapeutic strategy for treating and preventing CVD.
Goodpaster, Aaron M.; Kennedy, Michael A.
2015-01-01
Currently, no standard metrics are used to quantify cluster separation in PCA or PLS-DA scores plots for metabonomics studies or to determine if cluster separation is statistically significant. Lack of such measures makes it virtually impossible to compare independent or inter-laboratory studies and can lead to confusion in the metabonomics literature when authors putatively identify metabolites distinguishing classes of samples based on visual and qualitative inspection of scores plots that exhibit marginal separation. While previous papers have addressed quantification of cluster separation in PCA scores plots, none have advocated routine use of a quantitative measure of separation that is supported by a standard and rigorous assessment of whether or not the cluster separation is statistically significant. Here quantification and statistical significance of separation of group centroids in PCA and PLS-DA scores plots are considered. The Mahalanobis distance is used to quantify the distance between group centroids, and the two-sample Hotelling's T2 test is computed for the data, related to an F-statistic, and then an F-test is applied to determine if the cluster separation is statistically significant. We demonstrate the value of this approach using four datasets containing various degrees of separation, ranging from groups that had no apparent visual cluster separation to groups that had no visual cluster overlap. Widespread adoption of such concrete metrics to quantify and evaluate the statistical significance of PCA and PLS-DA cluster separation would help standardize reporting of metabonomics data. PMID:26246647
The power to detect linkage in complex disease by means of simple LOD-score analyses.
Greenberg, D A; Abreu, P; Hodge, S E
1998-01-01
Maximum-likelihood analysis (via LOD score) provides the most powerful method for finding linkage when the mode of inheritance (MOI) is known. However, because one must assume an MOI, the application of LOD-score analysis to complex disease has been questioned. Although it is known that one can legitimately maximize the maximum LOD score with respect to genetic parameters, this approach raises three concerns: (1) multiple testing, (2) effect on power to detect linkage, and (3) adequacy of the approximate MOI for the true MOI. We evaluated the power of LOD scores to detect linkage when the true MOI was complex but a LOD score analysis assumed simple models. We simulated data from 14 different genetic models, including dominant and recessive at high (80%) and low (20%) penetrances, intermediate models, and several additive two-locus models. We calculated LOD scores by assuming two simple models, dominant and recessive, each with 50% penetrance, then took the higher of the two LOD scores as the raw test statistic and corrected for multiple tests. We call this test statistic "MMLS-C." We found that the ELODs for MMLS-C are >=80% of the ELOD under the true model when the ELOD for the true model is >=3. Similarly, the power to reach a given LOD score was usually >=80% that of the true model, when the power under the true model was >=60%. These results underscore that a critical factor in LOD-score analysis is the MOI at the linked locus, not that of the disease or trait per se. Thus, a limited set of simple genetic models in LOD-score analysis can work well in testing for linkage. PMID:9718328
The power to detect linkage in complex disease by means of simple LOD-score analyses.
Greenberg, D A; Abreu, P; Hodge, S E
1998-09-01
Maximum-likelihood analysis (via LOD score) provides the most powerful method for finding linkage when the mode of inheritance (MOI) is known. However, because one must assume an MOI, the application of LOD-score analysis to complex disease has been questioned. Although it is known that one can legitimately maximize the maximum LOD score with respect to genetic parameters, this approach raises three concerns: (1) multiple testing, (2) effect on power to detect linkage, and (3) adequacy of the approximate MOI for the true MOI. We evaluated the power of LOD scores to detect linkage when the true MOI was complex but a LOD score analysis assumed simple models. We simulated data from 14 different genetic models, including dominant and recessive at high (80%) and low (20%) penetrances, intermediate models, and several additive two-locus models. We calculated LOD scores by assuming two simple models, dominant and recessive, each with 50% penetrance, then took the higher of the two LOD scores as the raw test statistic and corrected for multiple tests. We call this test statistic "MMLS-C." We found that the ELODs for MMLS-C are >=80% of the ELOD under the true model when the ELOD for the true model is >=3. Similarly, the power to reach a given LOD score was usually >=80% that of the true model, when the power under the true model was >=60%. These results underscore that a critical factor in LOD-score analysis is the MOI at the linked locus, not that of the disease or trait per se. Thus, a limited set of simple genetic models in LOD-score analysis can work well in testing for linkage.
Testing manifest monotonicity using order-constrained statistical inference.
Tijmstra, Jesper; Hessen, David J; van der Heijden, Peter G M; Sijtsma, Klaas
2013-01-01
Most dichotomous item response models share the assumption of latent monotonicity, which states that the probability of a positive response to an item is a nondecreasing function of a latent variable intended to be measured. Latent monotonicity cannot be evaluated directly, but it implies manifest monotonicity across a variety of observed scores, such as the restscore, a single item score, and in some cases the total score. In this study, we show that manifest monotonicity can be tested by means of the order-constrained statistical inference framework. We propose a procedure that uses this framework to determine whether manifest monotonicity should be rejected for specific items. This approach provides a likelihood ratio test for which the p-value can be approximated through simulation. A simulation study is presented that evaluates the Type I error rate and power of the test, and the procedure is applied to empirical data.
Lippert, Christoph; Xiang, Jing; Horta, Danilo; Widmer, Christian; Kadie, Carl; Heckerman, David; Listgarten, Jennifer
2014-01-01
Motivation: Set-based variance component tests have been identified as a way to increase power in association studies by aggregating weak individual effects. However, the choice of test statistic has been largely ignored even though it may play an important role in obtaining optimal power. We compared a standard statistical test—a score test—with a recently developed likelihood ratio (LR) test. Further, when correction for hidden structure is needed, or gene–gene interactions are sought, state-of-the art algorithms for both the score and LR tests can be computationally impractical. Thus we develop new computationally efficient methods. Results: After reviewing theoretical differences in performance between the score and LR tests, we find empirically on real data that the LR test generally has more power. In particular, on 15 of 17 real datasets, the LR test yielded at least as many associations as the score test—up to 23 more associations—whereas the score test yielded at most one more association than the LR test in the two remaining datasets. On synthetic data, we find that the LR test yielded up to 12% more associations, consistent with our results on real data, but also observe a regime of extremely small signal where the score test yielded up to 25% more associations than the LR test, consistent with theory. Finally, our computational speedups now enable (i) efficient LR testing when the background kernel is full rank, and (ii) efficient score testing when the background kernel changes with each test, as for gene–gene interaction tests. The latter yielded a factor of 2000 speedup on a cohort of size 13 500. Availability: Software available at http://research.microsoft.com/en-us/um/redmond/projects/MSCompBio/Fastlmm/. Contact: heckerma@microsoft.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25075117
Mautner, Kenneth; Sussman, Walter I; Axtman, Matthew; Al-Farsi, Yahya; Al-Adawi, Samir
2015-07-01
To investigate whether attention deficit hyperactivity disorder (ADHD) influences postconcussion recovery, as measured by computerized neurocognitive testing. This is a retrospective case control study. Computer laboratories across 10 high schools in the greater Atlanta, Georgia area. Immediate postconcussion assessment and cognitive testing (ImPACT) scores of 70 athletes with a self-reported diagnosis of ADHD and who sustained a sport-related concussion were compared with a randomly selected age-matched control group. Immediate postconcussion assessment and cognitive testing scores over a 5-year interval were reviewed for inclusion. Postconcussion recovery was defined as a return to equivalent baseline neurocognitive score on the ImPACT battery, and a concussion symptom score of ≤7. Athletes with ADHD had on average a longer time to recovery when compared with the control group (16.5 days compared with 13.5 days), although not statistically significant. The number of previous concussions did not have any effect on the rate of recovery in the ADHD or the control group. In addition, baseline neurocognitive testing did not statistically differ between the 2 groups, except in verbal memory. Although not statistically significant, youth athletes with ADHD took on average 3 days longer to return to baseline neurocognitive testing compared with a control group without ADHD. Youth athletes with ADHD may have a marginally prolonged recovery as indexed by neurocognitive testing and should be considered when prognosticating time to recovery in this subset of student athletes.
Munsawaengsub, Chokchai; Yimklib, Somkid; Nanthamongkolchai, Sutham; Apinanthavech, Suporn
2009-12-01
To study the effect of promoting self-esteem by participatory learning program on emotional intelligence among early adolescents. The quasi-experimental study was conducted in grade 9 students from two schools in Bangbuathong district, Nonthaburi province. Each experimental and comparative group consisted of 34 students with the lowest score of emotional intelligence. The instruments were questionnaires, Program to Develop Emotional Intelligence and Handbook of Emotional Intelligence Development. The experimental group attended 8 participatory learning activities in 4 weeks to Develop Emotional Intelligence while the comparative group received the handbook for self study. Assessment the effectiveness of program was done by pre-test and post-test immediately and 4 weeks apart concerning the emotional intelligence. Implementation and evaluation was done during May 24-August 12, 2005. Data were analyzed by frequency, percentage, mean, standard deviation, Chi-square, independent sample t-test and paired sample t-test. Before program implementation, both groups had no statistical difference in mean score of emotional intelligence. After intervention, the experimental group had higher mean score of emotional intelligence both immediately and 4 weeks later with statistical significant (p = 0.001 and < 0.001). At 4 weeks after experiment, the mean score in experimental group was higher than the mean score at immediate after experiment with statistical significance (p < 0.001). The program to promote self-esteem by participatory learning process could enhance the emotional intelligence in early-adolescent. This program could be modified and implemented for early adolescent in the community.
Vajawat, Mayuri; Deepika, P. C.; Kumar, Vijay; Rajeshwari, P.
2015-01-01
Aim: To compare the efficacy of powered toothbrushes in improving gingival health and reducing salivary red complex counts as compared to manual toothbrushes, among autistic individuals. Materials and Methods: Forty autistics was selected. Test group received powered toothbrushes, and control group received manual toothbrushes. Plaque index and gingival index were recorded. Unstimulated saliva was collected for analysis of red complex organisms using polymerase chain reaction. Results: A statistically significant reduction in the plaque scores was seen over a period of 12 weeks in both the groups (P < 0.001 for tests and P = 0.002 for controls). This reduction was statistically more significant in the test group (P = 0.024). A statistically significant reduction in the gingival scores was seen over a period of 12 weeks in both the groups (P < 0.001 for tests and P = 0.001 for controls). This reduction was statistically more significant in the test group (P = 0.042). No statistically significant reduction in the detection rate of red complex organisms were seen at 4 weeks in both the groups. Conclusion: Powered toothbrushes result in a significant overall improvement in gingival health when constant reinforcement of oral hygiene instructions is given. PMID:26681855
ERIC Educational Resources Information Center
Guo, Hongwen; Zu, Jiyun; Kyllonen, Patrick; Schmitt, Neal
2016-01-01
In this report, systematic applications of statistical and psychometric methods are used to develop and evaluate scoring rules in terms of test reliability. Data collected from a situational judgment test are used to facilitate the comparison. For a well-developed item with appropriate keys (i.e., the correct answers), agreement among various…
Milic, Natasa M.; Trajkovic, Goran Z.; Bukumiric, Zoran M.; Cirkovic, Andja; Nikolic, Ivan M.; Milin, Jelena S.; Milic, Nikola V.; Savic, Marko D.; Corac, Aleksandar M.; Marinkovic, Jelena M.; Stanisavljevic, Dejana M.
2016-01-01
Background Although recent studies report on the benefits of blended learning in improving medical student education, there is still no empirical evidence on the relative effectiveness of blended over traditional learning approaches in medical statistics. We implemented blended along with on-site (i.e. face-to-face) learning to further assess the potential value of web-based learning in medical statistics. Methods This was a prospective study conducted with third year medical undergraduate students attending the Faculty of Medicine, University of Belgrade, who passed (440 of 545) the final exam of the obligatory introductory statistics course during 2013–14. Student statistics achievements were stratified based on the two methods of education delivery: blended learning and on-site learning. Blended learning included a combination of face-to-face and distance learning methodologies integrated into a single course. Results Mean exam scores for the blended learning student group were higher than for the on-site student group for both final statistics score (89.36±6.60 vs. 86.06±8.48; p = 0.001) and knowledge test score (7.88±1.30 vs. 7.51±1.36; p = 0.023) with a medium effect size. There were no differences in sex or study duration between the groups. Current grade point average (GPA) was higher in the blended group. In a multivariable regression model, current GPA and knowledge test scores were associated with the final statistics score after adjusting for study duration and learning modality (p<0.001). Conclusion This study provides empirical evidence to support educator decisions to implement different learning environments for teaching medical statistics to undergraduate medical students. Blended and on-site training formats led to similar knowledge acquisition; however, students with higher GPA preferred the technology assisted learning format. Implementation of blended learning approaches can be considered an attractive, cost-effective, and efficient alternative to traditional classroom training in medical statistics. PMID:26859832
Milic, Natasa M; Trajkovic, Goran Z; Bukumiric, Zoran M; Cirkovic, Andja; Nikolic, Ivan M; Milin, Jelena S; Milic, Nikola V; Savic, Marko D; Corac, Aleksandar M; Marinkovic, Jelena M; Stanisavljevic, Dejana M
2016-01-01
Although recent studies report on the benefits of blended learning in improving medical student education, there is still no empirical evidence on the relative effectiveness of blended over traditional learning approaches in medical statistics. We implemented blended along with on-site (i.e. face-to-face) learning to further assess the potential value of web-based learning in medical statistics. This was a prospective study conducted with third year medical undergraduate students attending the Faculty of Medicine, University of Belgrade, who passed (440 of 545) the final exam of the obligatory introductory statistics course during 2013-14. Student statistics achievements were stratified based on the two methods of education delivery: blended learning and on-site learning. Blended learning included a combination of face-to-face and distance learning methodologies integrated into a single course. Mean exam scores for the blended learning student group were higher than for the on-site student group for both final statistics score (89.36±6.60 vs. 86.06±8.48; p = 0.001) and knowledge test score (7.88±1.30 vs. 7.51±1.36; p = 0.023) with a medium effect size. There were no differences in sex or study duration between the groups. Current grade point average (GPA) was higher in the blended group. In a multivariable regression model, current GPA and knowledge test scores were associated with the final statistics score after adjusting for study duration and learning modality (p<0.001). This study provides empirical evidence to support educator decisions to implement different learning environments for teaching medical statistics to undergraduate medical students. Blended and on-site training formats led to similar knowledge acquisition; however, students with higher GPA preferred the technology assisted learning format. Implementation of blended learning approaches can be considered an attractive, cost-effective, and efficient alternative to traditional classroom training in medical statistics.
NASA Astrophysics Data System (ADS)
Kartono; Suryadi, D.; Herman, T.
2018-01-01
This study aimed to analyze the enhancement of non-linear learning (NLL) in the online tutorial (OT) content to students’ knowledge of normal distribution application (KONDA). KONDA is a competence expected to be achieved after students studied the topic of normal distribution application in the course named Education Statistics. The analysis was performed by quasi-experiment study design. The subject of the study was divided into an experimental class that was given OT content in NLL model and a control class which was given OT content in conventional learning (CL) model. Data used in this study were the results of online objective tests to measure students’ statistical prior knowledge (SPK) and students’ pre- and post-test of KONDA. The statistical analysis test of a gain score of KONDA of students who had low and moderate SPK’s scores showed students’ KONDA who learn OT content with NLL model was better than students’ KONDA who learn OT content with CL model. Meanwhile, for students who had high SPK’s scores, the gain score of students who learn OT content with NLL model had relatively similar with the gain score of students who learn OT content with CL model. Based on those findings it could be concluded that the NLL model applied to OT content could enhance KONDA of students in low and moderate SPK’s levels. Extra and more challenging didactical situation was needed for students in high SPK’s level to achieve the significant gain score.
Oral azithromycin for treatment of posterior blepharitis.
Igami, Thais Zamudio; Holzchuh, Ricardo; Osaki, Tammy Hentona; Santo, Ruth Miyuki; Kara-Jose, Newton; Hida, Richard Y
2011-10-01
To evaluate the effects of oral azithromycin in patients with posterior blepharitis. Twenty-six eyes of 13 patients with posterior blepharitis diagnosed by a qualified ophthalmologist were enrolled in this study. Patients were instructed to use oral azithromycin 500 mg per day for 3 days in 3 cycles with 7-day intervals. Subjective clinical outcomes were graded and scored 1 day before and 30 days after the end of the treatment (53 days after initiating the treatment) based on severity scores of: (1) eyelid debris; (2) eyelid telangiectasia; (3) swelling of the eyelid margin; (4) redness of the eyelid margin; and (5) ocular mucus secretion. For the assessment of global efficacy, patients were asked by the investigator to rate the subjective symptoms (eyelid itching, ocular itching, eyelid hyperemia, ocular hyperemia, ocular mucus secretion, photophobia, foreign body sensation, and dry eye sensation) on a scale of 0 (no symptoms) to 5 (severe symptoms). Break-up time, Schirmer I test, corneal fluorescein staining score, and rose bengal staining score were also performed in all patients. All clinical outcomes scoring showed statistically significant improvement after oral azithromycin, except for eyelid swelling. Average subjective symptom grading improved statistically after treatment with oral azithromycin, except for eyelid hyperemia, photophobia, and foreign body sensation. Average tear film break-up time values showed statistically significant improvement after the treatment with oral azithromycin. No statistically significant improvement was observed on average values of Schirmer I test, corneal fluorescein staining score, and rose bengal staining score. The combination of multiple clinical parameters shown in this study supports the clinical efficacy of pulsed oral azithromycin therapy for the management of posterior blepharitis.
Graphical method for comparative statistical study of vaccine potency tests.
Pay, T W; Hingley, P J
1984-03-01
Producers and consumers are interested in some of the intrinsic characteristics of vaccine potency assays for the comparative evaluation of suitable experimental design. A graphical method is developed which represents the precision of test results, the sensitivity of such results to changes in dosage, and the relevance of the results in the way they reflect the protection afforded in the host species. The graphs can be constructed from Producer's scores and Consumer's scores on each of the scales of test score, antigen dose and probability of protection against disease. A method for calculating these scores is suggested and illustrated for single and multiple component vaccines, for tests which do or do not employ a standard reference preparation, and for tests which employ quantitative or quantal systems of scoring.
Gerber, S; Rodolphe, F
1994-06-01
The first step in the construction of a linkage map involves the estimation and test for linkage between all possible pairs of markers. The lod score method is used in many linkage studies for the latter purpose. In contrast with classical statistical tests, this method does not rely on the choice of a first-type error level. We thus provide a comparison between the lod score and a χ (2) test on linkage data from a gymnosperm, the maritime pine. The lod score appears to be a very conservative test with the usual thresholds. Its severity depends on the type of data used.
Statistical Assessment of Estimated Transformations in Observed-Score Equating
ERIC Educational Resources Information Center
Wiberg, Marie; González, Jorge
2016-01-01
Equating methods make use of an appropriate transformation function to map the scores of one test form into the scale of another so that scores are comparable and can be used interchangeably. The equating literature shows that the ways of judging the success of an equating (i.e., the score transformation) might differ depending on the adopted…
ERIC Educational Resources Information Center
Longford, Nicholas T.
This study is a critical evaluation of the roles for coding and scoring of missing responses to multiple-choice items in educational tests. The focus is on tests in which the test-takers have little or no motivation; in such tests omitting and not reaching (as classified by the currently adopted operational rules) is quite frequent. Data from the…
The Michigan Alcoholism Screening Test (MAST): A Statistical Validation Analysis
ERIC Educational Resources Information Center
Laux, John M.; Newman, Isadore; Brown, Russ
2004-01-01
This study extends the Michigan Alcoholism Screening Test (MAST; M. L. Selzer, 1971) literature base by examining 4 issues related to the validity of the MAST scores. Specifically, the authors examine the validity of the MAST scores in light of the presence of impression management, participant demographic variables, and item endorsement…
Klein, A A; Collier, T; Yeates, J; Miles, L F; Fletcher, S N; Evans, C; Richards, T
2017-09-01
A simple and accurate scoring system to predict risk of transfusion for patients undergoing cardiac surgery is lacking. We identified independent risk factors associated with transfusion by performing univariate analysis, followed by logistic regression. We then simplified the score to an integer-based system and tested it using the area under the receiver operator characteristic (AUC) statistic with a Hosmer-Lemeshow goodness-of-fit test. Finally, the scoring system was applied to the external validation dataset and the same statistical methods applied to test the accuracy of the ACTA-PORT score. Several factors were independently associated with risk of transfusion, including age, sex, body surface area, logistic EuroSCORE, preoperative haemoglobin and creatinine, and type of surgery. In our primary dataset, the score accurately predicted risk of perioperative transfusion in cardiac surgery patients with an AUC of 0.76. The external validation confirmed accuracy of the scoring method with an AUC of 0.84 and good agreement across all scores, with a minor tendency to under-estimate transfusion risk in very high-risk patients. The ACTA-PORT score is a reliable, validated tool for predicting risk of transfusion for patients undergoing cardiac surgery. This and other scores can be used in research studies for risk adjustment when assessing outcomes, and might also be incorporated into a Patient Blood Management programme. © The Author 2017. Published by Oxford University Press on behalf of the British Journal of Anaesthesia. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Physique and Performance of Young Wheelchair Basketball Players in Relation with Classification
Zancanaro, Carlo
2015-01-01
The relationships among physical characteristics, performance, and functional ability classification of younger wheelchair basketball players have been barely investigated to date. The purpose of this work was to assess anthropometry, body composition, and performance in sport-specific field tests in a national sample of Italian younger wheelchair basketball players as well as to evaluate the association of these variables with the players’ functional ability classification and game-related statistics. Several anthropometric measurements were obtained for 52 out of 91 eligible players nationwide. Performance was assessed in seven sport-specific field tests (5m sprint, 20m sprint with ball, suicide, maximal pass, pass for accuracy, spot shot and lay-ups) and game-related statistics (free-throw points scored per match, two- and three-point field-goals scored per match, and their sum). Association between variables, and predictivity was assessed by correlation and regression analysis, respectively. Players were grouped into four Classes of increasing functional ability (A-D). One-way ANOVA with Bonferroni’s correction for multiple comparisons was used to assess differences between Classes. Sitting height and functional ability Class especially correlated with performance outcomes, but wheelchair basketball experience and skinfolds did not. Game-related statistics and sport-specific field-test scores all showed significant correlation with each other. Upper arm circumference and/or maximal pass and lay-ups test scores were able to explain 42 to 59% of variance in game-related statistics (P<0.001). A clear difference in performance was only found for functional ability Class A and D. Conclusion: In younger wheelchair basketball players, sitting height positively contributes to performance. The maximal pass and lay-ups test should be carefully considered in younger wheelchair basketball training plans. Functional ability Class reflects to a limited extent the actual differences in performance. PMID:26606681
Comparison between the Laser-Badal and Vernier Optometers.
1988-09-01
naval aviators (SNAs). We also measured dark vcrgence in the same sample of SNAs. THE FINDINGS There was no statistically significant difference found...relatively inexperienced operator. 7. The difference between mean scores on the vernier and laser-Badal optometers was statistically significant...thus indicating that test results were reliable within instru- menrts. TAbLE 1. Test and Retest Statistics . Measure Mean SD n t-value Dark vergence
Tarescavage, Anthony M; Alosco, Michael L; Ben-Porath, Yossef S; Wood, Arcangela; Luna-Jones, Lynn
2015-04-01
We investigated the internal structure comparability of Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF) scores derived from the MMPI-2 and MMPI-2-RF booklets in a sample of 320 criminal defendants (229 males and 54 females). After exclusion of invalid protocols, the final sample consisted of 96 defendants who were administered the MMPI-2-RF booklet and 83 who completed the MMPI-2. No statistically significant differences in MMPI-2-RF invalidity rates were observed between the two forms. Individuals in the final sample who completed the MMPI-2-RF did not statistically differ on demographics or referral question from those who were administered the MMPI-2 booklet. Independent t tests showed no statistically significant differences between MMPI-2-RF scores generated with the MMPI-2 and MMPI-2-RF booklets on the test's substantive scales. Statistically significant small differences were observed on the revised Variable Response Inconsistency (VRIN-r) and True Response Inconsistency (TRIN-r) scales. Cronbach's alpha and standard errors of measurement were approximately equal between the booklets for all MMPI-2-RF scales. Finally, MMPI-2-RF intercorrelations produced from the two forms yielded mostly small and a few medium differences, indicating that discriminant validity and test structure are maintained. Overall, our findings reflect the internal structure comparability of MMPI-2-RF scale scores generated from MMPI-2 and MMPI-2-RF booklets. Implications of these results and limitations of these findings are discussed. © The Author(s) 2014.
Broët, Philippe; Tsodikov, Alexander; De Rycke, Yann; Moreau, Thierry
2004-06-01
This paper presents two-sample statistics suited for testing equality of survival functions against improper semi-parametric accelerated failure time alternatives. These tests are designed for comparing either the short- or the long-term effect of a prognostic factor, or both. These statistics are obtained as partial likelihood score statistics from a time-dependent Cox model. As a consequence, the proposed tests can be very easily implemented using widely available software. A breast cancer clinical trial is presented as an example to demonstrate the utility of the proposed tests.
Generalized functional linear models for gene-based case-control association studies.
Fan, Ruzong; Wang, Yifan; Mills, James L; Carter, Tonia C; Lobach, Iryna; Wilson, Alexander F; Bailey-Wilson, Joan E; Weeks, Daniel E; Xiong, Momiao
2014-11-01
By using functional data analysis techniques, we developed generalized functional linear models for testing association between a dichotomous trait and multiple genetic variants in a genetic region while adjusting for covariates. Both fixed and mixed effect models are developed and compared. Extensive simulations show that Rao's efficient score tests of the fixed effect models are very conservative since they generate lower type I errors than nominal levels, and global tests of the mixed effect models generate accurate type I errors. Furthermore, we found that the Rao's efficient score test statistics of the fixed effect models have higher power than the sequence kernel association test (SKAT) and its optimal unified version (SKAT-O) in most cases when the causal variants are both rare and common. When the causal variants are all rare (i.e., minor allele frequencies less than 0.03), the Rao's efficient score test statistics and the global tests have similar or slightly lower power than SKAT and SKAT-O. In practice, it is not known whether rare variants or common variants in a gene region are disease related. All we can assume is that a combination of rare and common variants influences disease susceptibility. Thus, the improved performance of our models when the causal variants are both rare and common shows that the proposed models can be very useful in dissecting complex traits. We compare the performance of our methods with SKAT and SKAT-O on real neural tube defects and Hirschsprung's disease datasets. The Rao's efficient score test statistics and the global tests are more sensitive than SKAT and SKAT-O in the real data analysis. Our methods can be used in either gene-disease genome-wide/exome-wide association studies or candidate gene analyses. © 2014 WILEY PERIODICALS, INC.
Generalized Functional Linear Models for Gene-based Case-Control Association Studies
Mills, James L.; Carter, Tonia C.; Lobach, Iryna; Wilson, Alexander F.; Bailey-Wilson, Joan E.; Weeks, Daniel E.; Xiong, Momiao
2014-01-01
By using functional data analysis techniques, we developed generalized functional linear models for testing association between a dichotomous trait and multiple genetic variants in a genetic region while adjusting for covariates. Both fixed and mixed effect models are developed and compared. Extensive simulations show that Rao's efficient score tests of the fixed effect models are very conservative since they generate lower type I errors than nominal levels, and global tests of the mixed effect models generate accurate type I errors. Furthermore, we found that the Rao's efficient score test statistics of the fixed effect models have higher power than the sequence kernel association test (SKAT) and its optimal unified version (SKAT-O) in most cases when the causal variants are both rare and common. When the causal variants are all rare (i.e., minor allele frequencies less than 0.03), the Rao's efficient score test statistics and the global tests have similar or slightly lower power than SKAT and SKAT-O. In practice, it is not known whether rare variants or common variants in a gene are disease-related. All we can assume is that a combination of rare and common variants influences disease susceptibility. Thus, the improved performance of our models when the causal variants are both rare and common shows that the proposed models can be very useful in dissecting complex traits. We compare the performance of our methods with SKAT and SKAT-O on real neural tube defects and Hirschsprung's disease data sets. The Rao's efficient score test statistics and the global tests are more sensitive than SKAT and SKAT-O in the real data analysis. Our methods can be used in either gene-disease genome-wide/exome-wide association studies or candidate gene analyses. PMID:25203683
Foxton, C R; Black, D; Muhlschlegel, J; Jardine, A
2014-12-01
To assess whether there is a difference in ENT knowledge amongst nurses caring for patients on a dedicated ENT ward and nurses caring for ENT patients in a similar hospital without a dedicated ENT ward. A test of theoretical knowledge of ENT nursing care was devised and administered to nurses working on a dedicated ENT ward and then to nurses working on generic non-subspecialist wards regularly caring for ENT patients in a hospital without a dedicated ENT ward. The test scores were then compared. A single specialist ENT/Maxillo-Facial/Opthalmology ward in hospital A and 3 generic surgical wards in hospital B. Both hospitals are comparable district general hospitals in the south west of England. Nursing staff working in hospital A and hospital B on the relevant wards were approached during the working day. 11 nurses on ward 1, 10 nurses on ward 2, 11 nurses on ward 3 and 10 nurses on ward 4 (the dedicated ENT ward). Each individual test score was used to generate an average score per ward and these scores compared to see if there was a significant difference. The average score out of 10 on ward 1 was 6.8 (+/-1.6). The average score on ward two was 4.8 (+/-1.6). The average score on ward three was 5.5 (+/-2.1). The average score on ward 4, which is the dedicated ENT ward, was 9.7 (+/-0.5). The differences in average test score between the dedicated ENT ward and all of the other wards are statistically significant. Nurses working on a dedicated ENT ward have an average higher score in a test of knowledge than nurses working on generic surgical wards. This difference is statistically significant and persists despite banding or training. © 2014 John Wiley & Sons Ltd.
A more powerful exact test of noninferiority from binary matched-pairs data.
Lloyd, Chris J; Moldovan, Max V
2008-08-15
Assessing the therapeutic noninferiority of one medical treatment compared with another is often based on the difference in response rates from a matched binary pairs design. This paper develops a new exact unconditional test for noninferiority that is more powerful than available alternatives. There are two new elements presented in this paper. First, we introduce the likelihood ratio statistic as an alternative to the previously proposed score statistic of Nam (Biometrics 1997; 53:1422-1430). Second, we eliminate the nuisance parameter by estimation followed by maximization as an alternative to the partial maximization of Berger and Boos (Am. Stat. Assoc. 1994; 89:1012-1016) or traditional full maximization. Based on an extensive numerical study, we recommend tests based on the score statistic, the nuisance parameter being controlled by estimation followed by maximization. 2008 John Wiley & Sons, Ltd
Race, Socioeconomic Status, and Implicit Bias: Implications for Closing the Achievement Gap
NASA Astrophysics Data System (ADS)
Schlosser, Elizabeth Auretta Cox
This study accessed the relationship between race, socioeconomic status, age and the race implicit bias held by middle and high school science teachers in Mobile and Baldwin County Public School Systems. Seventy-nine participants were administered the race Implicit Association Test (race IAT), created by Greenwald, A. G., Nosek, B. A., & Banaji, M. R., (2003) and a demographic survey. Quantitative analysis using analysis of variances, ANOVA and t-tests were used in this study. An ANOVA was performed comparing the race IAT scores of African American science teachers and their Caucasian counterparts. A statically significant difference was found (F = .4.56, p = .01). An ANOVA was also performed using the race IAT scores comparing the age of the participants; the analysis yielded no statistical difference based on age. A t-test was performed comparing the race IAT scores of African American teachers who taught at either Title I or non-Title I schools; no statistical difference was found between groups (t = -17.985, p < .001). A t-test was also performed comparing the race IAT scores of Caucasian teachers who taught at either Title I or non-Title I schools; a statistically significant difference was found between groups ( t = 2.44, p > .001). This research examines the implications of the achievement gap among African American and Caucasian students in science.
Peña-Casanova, Jordi; Quiñones-Ubeda, Sonia; Gramunt-Fombuena, Nina; Aguilar, Miquel; Casas, Laura; Molinuevo, José Luis; Robles, Alfredo; Rodríguez, Dolores; Barquero, María Sagrario; Antúnez, Carmen; Martínez-Parra, Carlos; Frank-García, Anna; Fernández, Manuel; Molano, Ana; Alfonso, Verónica; Sol, Josep M; Blesa, Rafael
2009-06-01
As part of the Spanish Multicenter Normative Studies (NEURONORMA project), we provide age- and education-adjusted norms for the Boston naming test and Token test. The sample consists of 340 and 348 participants, respectively, who are cognitively normal, community-dwelling, and ranging in age from 50 to 94 years. Tables are provided to convert raw scores to age-adjusted scaled scores. These were further converted into education-adjusted scaled scores by applying regression-based adjustments. Age and education affected the score of the both tests, but sex was found to be unrelated to naming and verbal comprehension efficiency. Our norms should provide clinically useful data for evaluating elderly Spaniards. The normative data presented here were obtained from the same study sample as all the other NEURONORMA norms and the same statistical procedures for data analyses were applied. These co-normed data allow clinicians to compare scores from one test with all tests.
Trickey, Amber W; Crosby, Moira E; Singh, Monika; Dort, Jonathan M
2014-12-01
The application of evidence-based medicine to patient care requires unique skills of the physician. Advancing residents' abilities to accurately evaluate the quality of evidence is built on understanding of fundamental research concepts. The American Board of Surgery In-Training Examination (ABSITE) provides a relevant measure of surgical residents' knowledge of research design and statistics. We implemented a research education curriculum in an independent academic medical center general residency program, and assessed the effect on ABSITE scores. The curriculum consisted of five 1-hour monthly research and statistics lectures. The lectures were presented before the 2012 and 2013 examinations. Forty residents completing ABSITE examinations from 2007 to 2013 were included in the study. Two investigators independently identified research-related item topics from examination summary reports. Correct and incorrect responses were compared precurriculum and postcurriculum. Regression models were calculated to estimate improvement in postcurriculum scores, adjusted for individuals' scores over time and postgraduate year level. Residents demonstrated significant improvement in postcurriculum examination scores for research and statistics items. Correct responses increased 27% (P < .001). Residents were 5 times more likely to achieve a perfect score on research and statistics items postcurriculum (P < .001). Residents at all levels demonstrated improved research and statistics scores after receiving the curriculum. Because the ABSITE includes a wide spectrum of research topics, sustained improvements suggest a genuine level of understanding that will promote lifelong evaluation and clinical application of the surgical literature.
Menkes, Daniel L; Reed, Mary
2008-01-01
To determine the effectiveness of didactic case-based instruction methodology to improve medical student comprehension of common neurological illnesses and neurological emergencies. Neurology department, academic university. 415 third and fourth year medical students performing a required four week neurology clerkship. Raw test scores on a 1 hour, 50-item clinical vignette based examination and open-ended questions in a post-clerkship feedback session. There was a statistically significant improvement in overall test scores (p<0.001). Didactic teaching sessions have a significant positive impact on neurology student clerkship test score performance and perception of their educational experience. Confirmation of these results across multiple specialties in a multi-center trial is warranted.
Gibson, Todd A; Oller, D Kimbrough; Jarmulowicz, Linda
2018-03-01
Receptive standardized vocabulary scores have been found to be much higher than expressive standardized vocabulary scores in children with Spanish as L1, learning L2 (English) in school (Gibson et al., 2012). Here we present evidence suggesting the receptive-expressive gap may be harder to evaluate than previously thought because widely-used standardized tests may not offer comparable normed scores. Furthermore monolingual Spanish-speaking children tested in Mexico and monolingual English-speaking children in the US showed other, yet different statistically significant discrepancies between receptive and expressive scores. Results suggest comparisons across widely used standardized tests in attempts to assess a receptive-expressive gap are precarious.
Jones, Loretta; Bazargan, Mohsen; Lucas-Wright, Anna; Vadgama, Jaydutt V; Vargas, Roberto; Smith, James; Otoukesh, Salman; Maxwell, Annette E
2013-01-01
Most theoretical formulations acknowledge that knowledge and awareness of cancer screening and prevention recommendations significantly influence health behaviors. This study compares perceived knowledge of cancer prevention and screening with test-based knowledge in a community sample. We also examine demographic variables and self-reported cancer screening and prevention behaviors as correlates of both knowledge scores, and consider whether cancer related knowledge can be accurately assessed using just a few, simple questions in a short and easy-to-complete survey. We used a community-partnered participatory research approach to develop our study aims and a survey. The study sample was composed of 180 predominantly African American and Hispanic community individuals who participated in a full-day cancer prevention and screening promotion conference in South Los Angeles, California, on July 2011. Participants completed a self-administered survey in English or Spanish at the beginning of the conference. Our data indicate that perceived and test-based knowledge scores are only moderately correlated. Perceived knowledge score shows a stronger association with demographic characteristics and other cancer related variables than the test-based score. Thirteen out of twenty variables that are examined in our study showed a statistically significant correlation with the perceived knowledge score, however, only four variables demonstrated a statistically significant correlation with the test-based knowledge score. Perceived knowledge of cancer prevention and screening was assessed with fewer items than test-based knowledge. Thus, using this assessment could potentially reduce respondent burden. However, our data demonstrate that perceived and test-based knowledge are separate constructs.
Interrater reliability: the kappa statistic.
McHugh, Mary L
2012-01-01
The kappa statistic is frequently used to test interrater reliability. The importance of rater reliability lies in the fact that it represents the extent to which the data collected in the study are correct representations of the variables measured. Measurement of the extent to which data collectors (raters) assign the same score to the same variable is called interrater reliability. While there have been a variety of methods to measure interrater reliability, traditionally it was measured as percent agreement, calculated as the number of agreement scores divided by the total number of scores. In 1960, Jacob Cohen critiqued use of percent agreement due to its inability to account for chance agreement. He introduced the Cohen's kappa, developed to account for the possibility that raters actually guess on at least some variables due to uncertainty. Like most correlation statistics, the kappa can range from -1 to +1. While the kappa is one of the most commonly used statistics to test interrater reliability, it has limitations. Judgments about what level of kappa should be acceptable for health research are questioned. Cohen's suggested interpretation may be too lenient for health related studies because it implies that a score as low as 0.41 might be acceptable. Kappa and percent agreement are compared, and levels for both kappa and percent agreement that should be demanded in healthcare studies are suggested.
Tariq, Nabia; Tayyab, Ali; Jaffery, Tara
2018-04-01
To measure mean empathy scores of Pakistani medical students and to explore any association of empathy scores with gender, medical school year and future career choice. Cross-sectional survey. Shifa College of Medicine, Shifa Tameer-e-Millat University, during the academic year 2015-2016. The student version of Jefferson Scale of Physician Empathy (JSPE) was distributed to the students electronically via the student portal. Response that were completed in full were included in the study. Descriptive statistics was used to analyse student demographic data. The student score on the JSPE was reported as the mean (out of 7) of each item. Independent samples t-test was employed to check the significant differences between genders. Empathy score with advancing year of study was investigated using ANOVA. ANOVA with post-hoc Tukey's test was used to study the relationship between career choice and empathy score. The response rate was 70.94%. The mean score was 4.51 ±0.69. Females obtained greater, but statistically insignificant (p=0.08) empathy score (4.58) as compared to the male students (4.45). No statistically significant difference was seen between scores on the survey across the five academic years (F=0.88, p=0.47). Students who selected medicine and allied as career choice showed a significantly higher empathy score than those who opted for surgery. The internal consistency reliability (Cronbach's alpha) was 0.78. There were low levels of empathy in Pakistani medical students. Students with interest in medicine and allied showed higher empathy scores compared to surgical or technical specialties. No association of empathy scores with gender and medical school year was observed.
ERIC Educational Resources Information Center
Ho, Andrew D.; Yu, Carol C.
2015-01-01
Many statistical analyses benefit from the assumption that unconditional or conditional distributions are continuous and normal. More than 50 years ago in this journal, Lord and Cook chronicled departures from normality in educational tests, and Micerri similarly showed that the normality assumption is met rarely in educational and psychological…
Global, Local, and Graphical Person-Fit Analysis Using Person-Response Functions
ERIC Educational Resources Information Center
Emons, Wilco H. M.; Sijtsma, Klaas; Meijer, Rob R.
2005-01-01
Person-fit statistics test whether the likelihood of a respondent's complete vector of item scores on a test is low given the hypothesized item response theory model. This binary information may be insufficient for diagnosing the cause of a misfitting item-score vector. The authors propose a comprehensive methodology for person-fit analysis in the…
Hong, Hye Jeong; Kim, Jin Sung; Seo, Wan Seok; Koo, Bon Hoon; Bai, Dai Seg; Jeong, Jin Young
2010-01-01
Objective We investigated executive functions (EFs), as evaluated by the Wisconsin Card Sorting Test (WCST), and other EF between lower grades (LG) and higher grades (HG) in elementary-school-age attention deficit hyperactivity disorder (ADHD) children. Methods We classified a sample of 112 ADHD children into 4 groups (composed of 28 each) based on age (LG vs. HG) and WCST performance [lower vs. higher performance on WCST, defined by the number of completed categories (CC)] Participants in each group were matched according to age, gender, ADHD subtype, and intelligence. We used the Wechsler intelligence Scale for Children 3rd edition to test intelligence and the Computerized Neurocognitive Function Test-IV, which included the WCST, to test EF. Results Comparisons of EFs scores in LG ADHD children showed statistically significant differences in performing digit spans backward, some verbal learning scores, including all memory scores, and Stroop test scores. However, comparisons of EF scores in HG ADHD children did not show any statistically significant differences. Correlation analyses of the CC and EF variables and stepwise multiple regression analysis in LG ADHD children showed a combination of the backward form of the Digit span test and Visual span test in lower-performance ADHD participants significantly predicted the number of CC (R2=0.273, p<0.001). Conclusion This study suggests that the design of any battery of neuropsychological tests for measuring EF in ADHD children should first consider age before interpreting developmental variations and neuropsychological test results. Researchers should consider the dynamics of relationships within EF, as measured by neuropsychological tests. PMID:20927306
The Motivated Strategies for Learning Questionnaire: score validity among medicine residents.
Cook, David A; Thompson, Warren G; Thomas, Kris G
2011-12-01
The Motivated Strategies for Learning Questionnaire (MSLQ) purports to measure motivation using the expectancy-value model. Although it is widely used in other fields, this instrument has received little study in health professions education. The purpose of this study was to evaluate the validity of MSLQ scores. We conducted a validity study evaluating the relationships of MSLQ scores to other variables and their internal structure (reliability and factor analysis). Participants included 210 internal medicine and family medicine residents participating in a web-based course on ambulatory medicine at an academic medical centre. Measurements included pre-course MSLQ scores, pre- and post-module motivation surveys, post-module knowledge test and post-module Instructional Materials Motivation Survey (IMMS) scores. Internal consistency was universally high for all MSLQ items together (Cronbach's α = 0.93) and for each domain (α ≥ 0.67). Total MSLQ scores showed statistically significant positive associations with post-test knowledge scores. For example, a 1-point rise in total MSLQ score was associated with a 4.4% increase in post-test scores (β = 4.4; p < 0.0001). Total MSLQ scores showed moderately strong, statistically significant associations with several other measures of effort, motivation and satisfaction. Scores on MSLQ domains demonstrated associations that generally aligned with our hypotheses. Self-efficacy and control of learning belief scores demonstrated the strongest domain-specific relationships with knowledge scores (β = 2.9 for both). Confirmatory factor analysis showed a borderline model fit. Follow-up exploratory factor analysis revealed the scores of five factors (self-efficacy, intrinsic interest, test anxiety, extrinsic goals, attribution) demonstrated psychometric and predictive properties similar to those of the original scales. Scores on the MSLQ are reliable and predict meaningful outcomes. However, the factor structure suggests a simplified model might better fit the empiric data. Future research might consider how assessing and responding to motivation could enhance learning. © Blackwell Publishing Ltd 2011.
NASA Astrophysics Data System (ADS)
Leonardi, Marcelo
The primary purpose of this study was to examine the impact of a scheduling change from a trimester 4x4 block schedule to a modified hybrid schedule on student achievement in ninth grade biology courses. This study examined the impact of the scheduling change on student achievement through teacher created benchmark assessments in Genetics, DNA, and Evolution and on the California Standardized Test in Biology. The secondary purpose of this study examined the ninth grade biology teacher perceptions of ninth grade biology student achievement. Using a mixed methods research approach, data was collected both quantitatively and qualitatively as aligned to research questions. Quantitative methods included gathering data from departmental benchmark exams and California Standardized Test in Biology and conducting multiple analysis of covariance and analysis of covariance to determine significance differences. Qualitative methods include journal entries questions and focus group interviews. The results revealed a statistically significant increase in scores on both the DNA and Evolution benchmark exams. DNA and Evolution benchmark exams showed significant improvements from a change in scheduling format. The scheduling change was responsible for 1.5% of the increase in DNA benchmark scores and 2% of the increase in Evolution benchmark scores. The results revealed a statistically significant decrease in scores on the Genetics Benchmark exam as a result of the scheduling change. The scheduling change was responsible for 1% of the decrease in Genetics benchmark scores. The results also revealed a statistically significant increase in scores on the CST Biology exam. The scheduling change was responsible for .7% of the increase in CST Biology scores. Results of the focus group discussions indicated that all teachers preferred the modified hybrid schedule over the trimester schedule and that it improved student achievement.
Medical ethical standards in dermatology: an analytical study of knowledge, attitudes and practices.
Mostafa, W Z; Abdel Hay, R M; El Lawindi, M I
2015-01-01
Dermatology practice has not been ethically justified at all times. The objective of the study was to find out dermatologists' knowledge about medical ethics, their attitudes towards regulatory measures and their practices, and to study the different factors influencing the knowledge, the attitude and the practices of dermatologists. This is a cross-sectional comparative study conducted among 214 dermatologists, from five Academic Universities and from participants in two conferences. A 54 items structured anonymous questionnaire was designed to describe the demographical characteristics of the study group as well as their knowledge, attitude and practices regarding the medical ethics standards in clinical and research settings. Five scoring indices were estimated regarding knowledge, attitude and practice. Inferential statistics were used to test differences between groups as indicated. The Student's t-test and analysis of variance were carried out for quantitative variables. The chi-squared test was conducted for qualitative variables. The results were considered statistically significant at a P > 0.05. Analysis of the possible factors having impact on the overall scores revealed that the highest knowledge scores were among dermatologists who practice in an academic setting plus an additional place; however, this difference was statistically non-significant (P = 0.060). Female dermatologists showed a higher attitude score compared to males (P = 0.028). The highest significant attitude score (P = 0.019) regarding clinical practice was recorded among those practicing cosmetic dermatology. The different studied groups of dermatologists revealed a significant impact on the attitude score (P = 0.049), and the evidence-practice score (P < 0.001). Ethical practices will improve the quality and integrity of dermatology research. © 2014 European Academy of Dermatology and Venereology.
Impact of Measurement Error on Statistical Power: Review of an Old Paradox.
ERIC Educational Resources Information Center
Williams, Richard H.; And Others
1995-01-01
The paradox that a Student t-test based on pretest-posttest differences can attain its greatest power when the difference score reliability is zero was explained by demonstrating that power is not a mathematical function of reliability unless either true score variance or error score variance is constant. (SLD)
Defensiveness in Female College Students and Its Impact on Their MAST and CAGE Scores
ERIC Educational Resources Information Center
Laux, John M.; Salyers, Kathleen M.; Jones, Amy L.
2007-01-01
This study found a statistically significant inverse relationship between defensiveness and female college students' Michigan Alcoholism Screening Test (M. L. Selzer, 1971) and CAGE (J. A. Ewing, 198 ) scores. Female college students who produce negative screening scores were more defensive than those whose alcohol use screens were positive.…
Mathiasen, Ross; Hogrefe, Christopher; Harland, Kari; Peterson, Andrew; Smoot, M Kyle
2018-02-15
The Balance Error Scoring System (BESS) is a commonly used concussion assessment tool. Recent studies have questioned the stability and reliability of baseline BESS scores. The purpose of this longitudinal prospective cohort study is to examine differences in yearly baseline BESS scores in athletes participating on an NCAA Division-I football team. NCAA Division-I freshman football athletes were videotaped performing the BESS test at matriculation and after 1 year of participation in the football program. Twenty-three athletes were enrolled in year 1 of the study, and 25 athletes were enrolled in year 2. Those athletes enrolled in year 1 were again videotaped after year 2 of the study. The paired t-test was used to assess for change in score over time for the firm surface, foam surface, and the cumulative BESS score. Additionally, inter- and intrarater reliability values were calculated. Cumulative errors on the BESS significantly decreased from a mean of 20.3 at baseline to 16.8 after 1 year of participation. The mean number of errors following the second year of participation was 15.0. Inter-rater reliability for the cumulative score ranged from 0.65 to 0.75. Intrarater reliability was 0.81. After 1 year of participation, there is a statistically and clinically significant improvement in BESS scores in an NCAA Division-I football program. Although additional improvement in BESS scores was noted after a second year of participation, it did not reach statistical significance. Football athletes should undergo baseline BESS testing at least yearly if the BESS is to be optimally useful as a diagnostic test for concussion.
NASA Astrophysics Data System (ADS)
Jacek, Laura Lee
This dissertation details an experiment designed to identify gender differences in learning using three experimental treatments: animation, static graphics, and verbal instruction alone. Three learning presentations were used in testing of 332 university students. Statistical analysis was performed using ANOVA, binomial tests for differences of proportion, and descriptive statistics. Results showed that animation significantly improved women's long-term learning over static graphics (p = 0.067), but didn't significantly improve men's long-term learning over static graphics. In all cases, women's scores improved with animation over both other forms of instruction for long-term testing, indicating that future research should not abandon the study of animation as a tool that may promote gender equity in science. Short-term test differences were smaller, and not statistically significant. Variation present in short-term scores was related more to presentation topic than treatment. This research also details characteristics of each of the three presentations, to identify variables (e.g. level of abstraction in presentation) affecting score differences within treatments. Differences between men's and women's scores were non-standard between presentations, but these differences were not statistically significant (long-term p = 0.2961, short-term p = 0.2893). In future research, experiments might be better designed to test these presentational variables in isolation, possibly yielding more distinctive differences between presentational scores. Differences in confidence interval overlaps between presentations suggested that treatment superiority may be somewhat dependent on the design or topic of the learning presentation. Confidence intervals greatly overlap in all situations. This undercut, to some degree, the surety of conclusions indicating superiority of one treatment type over the others. However, confidence intervals for animation were smaller, overlapped nearly completely for men and women (there was less overlap between the genders for the other two treatments), and centered around slightly higher means, lending further support to the conclusion that animation helped equalize men's and women's learning. The most important conclusion identified in this research is that gender is an important variable experimental populations testing animation as a learning device. Averages indicated that both men and women prefer to work with animation over either static graphics or verbal instruction alone.
BROËT, PHILIPPE; TSODIKOV, ALEXANDER; DE RYCKE, YANN; MOREAU, THIERRY
2010-01-01
This paper presents two-sample statistics suited for testing equality of survival functions against improper semi-parametric accelerated failure time alternatives. These tests are designed for comparing either the short- or the long-term effect of a prognostic factor, or both. These statistics are obtained as partial likelihood score statistics from a time-dependent Cox model. As a consequence, the proposed tests can be very easily implemented using widely available software. A breast cancer clinical trial is presented as an example to demonstrate the utility of the proposed tests. PMID:15293627
A prognostic scoring system for arm exercise stress testing.
Xie, Yan; Xian, Hong; Chandiramani, Pooja; Bainter, Emily; Wan, Leping; Martin, Wade H
2016-01-01
Arm exercise stress testing may be an equivalent or better predictor of mortality outcome than pharmacological stress imaging for the ≥50% for patients unable to perform leg exercise. Thus, our objective was to develop an arm exercise ECG stress test scoring system, analogous to the Duke Treadmill Score, for predicting outcome in these individuals. In this retrospective observational cohort study, arm exercise ECG stress tests were performed in 443 consecutive veterans aged 64.1 (11.1) years. (mean (SD)) between 1997 and 2002. From multivariate Cox models, arm exercise scores were developed for prediction of 5-year and 12-year all-cause and cardiovascular mortality and 5-year cardiovascular mortality or myocardial infarction (MI). Arm exercise capacity in resting metabolic equivalents (METs), 1 min heart rate recovery (HRR) and ST segment depression ≥1 mm were the stress test variables independently associated with all-cause and cardiovascular mortality by step-wise Cox analysis (all p<0.01). A score based on the relation HRR (bpm)+7.3×METs-10.5×ST depression (0=no; 1=yes) prognosticated 5-year cardiovascular mortality with a C-statistic of 0.81 before and 0.88 after adjustment for significant demographic and clinical covariates. Arm exercise scores for the other outcome end points yielded C-statistic values of 0.77-0.79 before and 0.82-0.86 after adjustment for significant covariates versus 0.64-0.72 for best fit pharmacological myocardial perfusion imaging models in a cohort of 1730 veterans who were evaluated over the same time period. Arm exercise scores, analogous to the Duke Treadmill Score, have good power for prediction of mortality or MI in patients who cannot perform leg exercise.
Does arthroscopic rotator cuff repair improve patients' activity levels?
Baumgarten, Keith M; Chang, Peter S; Dannenbring, Tasha M; Foley, Elaine K
2018-06-04
Rotator cuff repair decreases pain, improves range of motion, and increases strength. Whether these improvements translate to an improvement in a patient's activity level postoperatively remains unknown. The Shoulder Activity Level is a valid and reliable outcomes survey that can be used to measure a patient's shoulder-specific activity level. Currently, there are no studies that examine the effect of rotator cuff repair on shoulder activity level. Preoperative patient-determined outcomes scores collected prospectively on patients undergoing rotator cuff repair were compared with postoperative scores at a minimum of 2 years. These scores included the Shoulder Activity Level, Western Ontario Rotator Cuff Index, American Shoulder and Elbow Surgeons Standardized Shoulder Assessment Form, Single Assessment Numeric Evaluation, and simple shoulder test. Inclusion criteria were patients undergoing arthroscopic rotator cuff repair. Included were 281 shoulders from 273 patients with a mean follow-up of 3.7 years. The postoperative median Western Ontario Rotator Cuff Index (42 vs. 94), American Shoulder and Elbow Surgeons (41 vs. 95), Single Assessment Numeric Evaluation (30 vs. 95), and simple shoulder test (4 vs. 11) scores were statistically significantly improved compared with preoperative scores (P < .0001). The postoperative median Shoulder Activity Level score decreased compared with the preoperative score (12 vs. 11; P < .0001). Patients reported a statistically significant deterioration of their Shoulder Activity Level score after rotator cuff repair compared with their preoperative scores, although disease-specific and joint-specific quality of life scores all had statistically significantly improvement. This study suggests that patients generally have (1) significant improvements in their quality of life and (2) small deteriorations in activity level after arthroscopic rotator cuff repair. Copyright © 2018 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Elsevier Inc. All rights reserved.
The assessment of fetal brain function in fetuses with ventrikulomegaly: the role of the KANET test.
Talic, Amira; Kurjak, Asim; Stanojevic, Milan; Honemeyer, Ulrich; Badreldeen, Ahmed; DiRenzo, Gian Carlo
2012-08-01
To assess differences in fetal behavior in both normal fetuses and fetuses with cerebral ventriculomegaly (VM). In a period of eighteen months, in a longitudinal prospective cohort study, Kurjak Antenatal NeuorogicalTest (KANET) was applied to assess fetal behavior in both normal pregnancies and pregnancies with cerebral VM using four-dimensional ultrasound (4D US). According to the degree of enlargement of the ventricles, VM was divided into three groups: mild, moderate and severe. Moreover fetuses with isolated VM were separated from those with additional abnormalities. According to the KANET, fetuses with scores ≥ 14 were considered normal, those with scores 6-13 borderline and abnormal if the score was ≤ 5. Differences between two groups were examined by Fisher's exact test. Differences within the subgroups were examined by Kruskal-Wallis test and contingency table test. KANET scores in normal pregnancies and pregnancies with VM showed statistically significant differences. Most of the abnormal KANET scores as well as most of the borderline-scores were found among the fetuses with severe VM associated with additional abnormalities. There were no statistically significant differences between the control group and the groups with isolated and mild and /or moderate VM. Evaluation of the fetal behavior in fetuses with cerebral VM using KANET test has the potential to detect fetuses with abnormal behavior, and to add the dimension of CNS function to the morphological criteria of VM. Long-term postnatal neurodevelopmental follow-up should confirm the data from prenatal investigation of fetal behavior.
Gallob, John; Petrone, Dolores M; Mateo, Luis R; Chaknis, Patricia; Morrison, Boyce M; Williams, Malcolm; Panagakos, Foti
2016-06-01
Evaluation of the efficacy of a soft toothbrush with tapered-tip bristles (Test Toothbrush) and an ADA reference soft toothbrush (ADA Toothbrush) on established gingivitis and supragingival plaque over a 12-week period. This randomized, single-center, examiner-blind, two-cell, parallel clinical research study assessed plaque removal by the comparison of pre- to- post-brushing after a single use, and again after six- and 12-weeks' use, using the Quigley-Hein Plaque Index, Turesky Modification. The study also assessed gingivitis after six weeks and 12 weeks using the Löe & Silness Gingival Index. Adult male and female subjects from the Central New Jersey, USA area refrained from all oral hygiene procedures for 24 hours. They reported to the study site after refraining from eating, drinking, and smoking for four hours. Subjects had the study procedure explained to them both orally and by written instructions. Subjects then gave written consent to participate before entry into the study. Following an examination for plaque (pre-brushing) and gingivitis (baseline), the subjects were randomized into two balanced groups, each group assigned to one of the two study toothbrushes. Subjects were instructed to brush their teeth for one minute under supervision with their assigned toothbrush and a commercially available fluoride toothpaste (Colgate© Cavity Protection Toothpaste), after which they were again evaluated for plaque (post-brushing). Subjects were dismissed from the study site with their assigned toothbrush and toothpaste, and instructed to brush twice daily at home for the next 12 weeks. The subjects were instructed to brush for one minute during each tooth brushing. The subjects reported to the study site after six weeks and 12 weeks of product use, at which time they were evaluated for plaque and gingivitis. Seventy-one (71) subjects complied with the protocol and completed the clinical study. Compared to the ADA Toothbrush, the Test Toothbrush provided statistically significantly (p < 0.05) greater reductions of 71.1% in whole mouth plaque index scores, 43.8% in plaque severity index scores, and 81.3% in interproximal sites plaque scores after a single tooth brushing. After six weeks' use, the Test Toothbrush provided statistically significantly (p < 0.05) greater reductions of 700% in whole mouth gingival index scores, 700% in gingivitis severity index scores, and 400% in interproximal sites gingival scores compared to the ADA Toothbrush. Also after six weeks' use, the Test Toothbrush provided statistically significantly (p < 0.05) greater reductions of 188.9% in whole mouth plaque index scores, 165% in plaque severity index scores, and 203% in interproximal sites plaque scores compared to the ADA Toothbrush. After 12 weeks' use, the Test Toothbrush provided statistically significantly (p < 0.05) greater reductions of 266.7% in whole mouth gingival index scores, 300% in gingivitis severity index scores, and 250% in interproximal sites gingival scores compared to the ADA Toothbrush. Also after 12 weeks' use, the Test Toothbrush provided statistically significantly (p < 0.05) greater reductions of 158.1% in whole mouth plaque index scores, 143.5% in plaque severity index scores, and 145.4% in interproximal sites plaque scores compared to the ADA Toothbrush. This study demonstrated that a soft toothbrush with tapered-tip bristles provided a significantly greater reduction in supragingival plaque after a single tooth brushing, as well as after six and 12 weeks of twice-daily use, compared to the ADA Toothbrush. After six and 12 weeks of twice-daily use, it also provided a significantly greater reduction in gingivitis as compared to the ADA Toothbrush.
Austin, Peter C
2007-11-01
I conducted a systematic review of the use of propensity score matching in the cardiovascular surgery literature. I examined the adequacy of reporting and whether appropriate statistical methods were used. I examined 60 articles published in the Annals of Thoracic Surgery, European Journal of Cardio-thoracic Surgery, Journal of Cardiovascular Surgery, and the Journal of Thoracic and Cardiovascular Surgery between January 1, 2004, and December 31, 2006. Thirty-one of the 60 studies did not provide adequate information on how the propensity score-matched pairs were formed. Eleven (18%) of studies did not report on whether matching on the propensity score balanced baseline characteristics between treated and untreated subjects in the matched sample. No studies used appropriate methods to compare baseline characteristics between treated and untreated subjects in the propensity score-matched sample. Eight (13%) of the 60 studies explicitly used statistical methods appropriate for the analysis of matched data when estimating the effect of treatment on the outcomes. Two studies used appropriate methods for some outcomes, but not for all outcomes. Thirty-nine (65%) studies explicitly used statistical methods that were inappropriate for matched-pairs data when estimating the effect of treatment on outcomes. Eleven studies did not report the statistical tests that were used to assess the statistical significance of the treatment effect. Analysis of propensity score-matched samples tended to be poor in the cardiovascular surgery literature. Most statistical analyses ignored the matched nature of the sample. I provide suggestions for improving the reporting and analysis of studies that use propensity score matching.
ERIC Educational Resources Information Center
Rubin, Rosalyn; And Others
Scores on the Coopersmith Self-Esteem Inventory were related to scores on achievement and intelligence tests, and to socioeconomic level and to teachers' ratings of student behavior, in order to test the hypothesis that student self esteem would have a positive effect on academic achievement. There was a small but statistically significant…
ERIC Educational Resources Information Center
Bokossa, Maxime C.; Huang, Gary G.
This report describes the imputation procedures used to deal with missing data in the National Education Longitudinal Study of 1988 (NELS:88), the only current National Center for Education Statistics (NCES) dataset that contains scores from cognitive tests given the same set of students at multiple time points. As is inevitable, cognitive test…
ERIC Educational Resources Information Center
Fryer, Roland G., Jr.; Levitt, Steven D.
In previous research, a substantial gap in test scores between white and black students persists, even after controlling for a wide range of observable characteristics. Using a data set made available by the National Center for Education Statistics, the Early Childhood Longitudinal Study, this paper demonstrates that in stark contrast to earlier…
An Empirical Investigation of Methods for Assessing Item Fit for Mixed Format Tests
ERIC Educational Resources Information Center
Chon, Kyong Hee; Lee, Won-Chan; Ansley, Timothy N.
2013-01-01
Empirical information regarding performance of model-fit procedures has been a persistent need in measurement practice. Statistical procedures for evaluating item fit were applied to real test examples that consist of both dichotomously and polytomously scored items. The item fit statistics used in this study included the PARSCALE's G[squared],…
Page, G P; Amos, C I; Boerwinkle, E
1998-04-01
We present a test statistic, the quantitative LOD (QLOD) score, for the testing of both linkage and exclusion of quantitative-trait loci in randomly selected human sibships. As with the traditional LOD score, the boundary values of 3, for linkage, and -2, for exclusion, can be used for the QLOD score. We investigated the sample sizes required for inferring exclusion and linkage, for various combinations of linked genetic variance, total heritability, recombination distance, and sibship size, using fixed-size sampling. The sample sizes required for both linkage and exclusion were not qualitatively different and depended on the percentage of variance being linked or excluded and on the total genetic variance. Information regarding linkage and exclusion in sibships larger than size 2 increased as approximately all possible pairs n(n-1)/2 up to sibships of size 6. Increasing the recombination (theta) distance between the marker and the trait loci reduced empirically the power for both linkage and exclusion, as a function of approximately (1-2theta)4.
An explorative study of school performance and antipsychotic medication.
van der Schans, J; Vardar, S; Çiçek, R; Bos, H J; Hoekstra, P J; de Vries, T W; Hak, E
2016-09-21
Antipsychotic therapy can reduce severe symptoms of psychiatric disorders, however, data on school performance among children on such treatment are lacking. The objective was to explore school performance among children using antipsychotic drugs at the end of primary education. A cross-sectional study was conducted using the University Groningen pharmacy database linked to academic achievement scores at the end of primary school (Dutch Cito-test) obtained from Statistics Netherlands. Mean Cito-test scores and standard deviations were obtained for children on antipsychotic therapy and reference children, and statistically compared using analyses of covariance. In addition, differences in subgroups as boys versus girls, ethnicity, household income, and late starters (start date within 12 months of the Cito-test) versus early starters (start date > 12 months before the Cito-test) were tested. In all, data from 7994 children could be linked to Cito-test scores. At the time of the Cito-test, 45 (0.6 %) were on treatment with antipsychotics. Children using antipsychotics scored on average 3.6 points lower than the reference peer group (534.5 ± 9.5). Scores were different across gender and levels of household income (p < 0.05). Scores of early starters were significantly higher than starters within 12 months (533.7 ± 1.7 vs. 524.1 ± 2.6). This first exploration showed that children on antipsychotic treatment have lower school performance compared to the reference peer group at the end of primary school. This was most noticeable for girls, but early starters were less affected than later starters. Due to the observational cross-sectional nature of this study, no causality can be inferred, but the results indicate that school performance should be closely monitored and causes of underperformance despite treatment warrants more research.
Assessment Alternatives for a High Skill MOS
1975-12-01
tests. CR measurement advocates frequently claim that variance dependent statistics are inapplicable In CR test- ing because CR test scores have...rather than statistically . The Spearman- Brown reliability coefficient was .70. 17 In 1964, Shriver, Fink and Trexler (76) modified the M-33...ATTN: ATSW-SE-L 1 USA Cmd ft General Stf C- IVge . Ft Leavenworth, ATTN: Ed Advisor 1 USA Combined Arms Cmbt Dev Act, Ft Leavenworth, ATTN: DepCdr
Academic self-concept of ability and cortisol reactivity.
Minkley, N; Westerholt, D M; Kirchner, W H
2014-05-01
The present study aimed to clarify the relationship between a school-specific trait (academic self-concept of ability [ASCA]) and hormonal stress response by using a trait-compatible stressor (test). First, we determined 52 students' ASCA scores for biology and measured their salivary cortisol concentration before and after a biology test (experimental group, n=28) or a free writing task (control group, n=24). For participants who took the test, statistical analysis indicated a significant negative correlation between ASCA score and cortisol response. In contrast, the control group showed a decrease in cortisol concentrations between test times and no correlation between cortisol concentration and ASCA scores were found. These findings indicated an interaction between ASCA scores and hormonal stress response when an academic-related stressor is present. Furthermore, these variables might influence each other adversely: high cortisol concentrations during a test situation may lead to greater feelings of insecurity, resulting in low ASCA scores and awareness of these low scores may lead to a further increase in cortisol, creating a vicious cycle.
Patients and Medical Statistics
Woloshin, Steven; Schwartz, Lisa M; Welch, H Gilbert
2005-01-01
BACKGROUND People are increasingly presented with medical statistics. There are no existing measures to assess their level of interest or confidence in using medical statistics. OBJECTIVE To develop 2 new measures, the STAT-interest and STAT-confidence scales, and assess their reliability and validity. DESIGN Survey with retest after approximately 2 weeks. SUBJECTS Two hundred and twenty-four people were recruited from advertisements in local newspapers, an outpatient clinic waiting area, and a hospital open house. MEASURES We developed and revised 5 items on interest in medical statistics and 3 on confidence understanding statistics. RESULTS Study participants were mostly college graduates (52%); 25% had a high school education or less. The mean age was 53 (range 20 to 84) years. Most paid attention to medical statistics (6% paid no attention). The mean (SD) STAT-interest score was 68 (17) and ranged from 15 to 100. Confidence in using statistics was also high: the mean (SD) STAT-confidence score was 65 (19) and ranged from 11 to 100. STAT-interest and STAT-confidence scores were moderately correlated (r=.36, P<.001). Both scales demonstrated good test–retest repeatability (r=.60, .62, respectively), internal consistency reliability (Cronbach's α=0.70 and 0.78), and usability (individual item nonresponse ranged from 0% to 1.3%). Scale scores correlated only weakly with scores on a medical data interpretation test (r=.15 and .26, respectively). CONCLUSION The STAT-interest and STAT-confidence scales are usable and reliable. Interest and confidence were only weakly related to the ability to actually use data. PMID:16307623
Loring, David W; Larrabee, Glenn J
2006-06-01
The Halstead-Reitan Battery has been instrumental in the development of neuropsychological practice in the United States. Although Reitan administered both the Wechsler-Bellevue Intelligence Scale and Halstead's test battery when evaluating Halstead's theory of biologic intelligence, the relative sensitivity of each test battery to brain damage continues to be an area of controversy. Because Reitan did not perform direct parametric analysis to contrast group performances, we reanalyze Reitan's original validation data from both Halstead (Reitan, 1955) and Wechsler batteries (Reitan, 1959a) and calculate effect sizes and probability levels using traditional parametric approaches. Eight of the 10 tests comprising Halstead's original Impairment Index, as well as the Impairment Index itself, statistically differentiated patients with unequivocal brain damage from controls. In addition, 13 of 14 Wechsler measures including Full-Scale IQ also differed statistically between groups (Brain Damage Full-Scale IQ = 96.2; Control Group Full Scale IQ = 112.6). We suggest that differences in the statistical properties of each battery (e.g., raw scores vs. standardized scores) likely contribute to classification characteristics including test sensitivity and specificity.
Koami, Hiroyuki; Sakamoto, Yuichiro; Sakurai, Ryota; Ohta, Miho; Imahase, Hisashi; Yahata, Mayuko; Umeka, Mitsuru; Miike, Toru; Nagashima, Futoshi; Iwamura, Takashi; Yamada, Kosuke Chris; Inoue, Satoshi
2016-08-01
The aim of this study is to evaluate the hematological differences between septic and traumatic disseminated intravascular coagulation (DIC) using the rotational thromboelastometry (ROTEM).This retrospective study includes all sepsis or severe trauma patients transported to our emergency department who underwent ROTEM from 2013 to 2014. All patients were divided into 2 groups based on the presence of DIC diagnosed by the Japanese Association for Acute Medicine (JAAM) DIC score. We statistically analyzed the demographics, clinical characteristics, laboratory data, ROTEM findings (EXTEM and FIBTEM), and outcome.Fifty-seven patients (30 sepsis and 27 severe trauma) were included in primary analysis. Sepsis cases were significantly older and had higher systemic inflammatory response syndrome (SIRS) scores, whereas there were no significant differences in other parameters including Acute Physiology and Chronic Health Evaluation (APACHE) II score, sequential organ failure assessment (SOFA) score. Twenty-six patients (14 sepsis and 12 severe trauma) were diagnosed with DIC. The Septic DIC (S-DIC) group was significantly older and had higher DIC scores than the traumatic DIC (T-DIC) group. Hematologic examination revealed significantly higher CRP, fibrinogen, lower FDP, DD, and higher FDP/DD ratio were found in the S-DIC group in comparison with the T-DIC group. ROTEM findings showed that the A10, A20, and MCF in the FIBTEM test were significantly higher in the S-DIC group. However, no statistical differences were confirmed in the LI30, LI45, and ML in EXTEM test.The plasma fibrinogen level and fibrinogen based clot firmness in whole-blood test revealed statistical significance between septic and traumatic DIC patients.
Applications of "Integrated Data Viewer'' (IDV) in the classroom
NASA Astrophysics Data System (ADS)
Nogueira, R.; Cutrim, E. M.
2006-06-01
Conventionally, weather products utilized in synoptic meteorology reduce phenomena occurring in four dimensions to a 2-dimensional form. This constitutes a road-block for non-atmospheric-science majors who need to take meteorology as a non-mathematical and complementary course to their major programs. This research examines the use of Integrated Data Viewer-IDV as a teaching tool, as it allows a 4-dimensional representation of weather products. IDV was tested in the teaching of synoptic meteorology, weather analysis, and weather map interpretation to non-science students in the laboratory sessions of an introductory meteorology class at Western Michigan University. Comparison of student exam scores according to the laboratory teaching techniques, i.e., traditional lab manual and IDV was performed for short- and long-term learning. Results of the statistical analysis show that the Fall 2004 students in the IDV-based lab session retained learning. However, in the Spring 2005 the exam scores did not reflect retention in learning when compared with IDV-based and MANUAL-based lab scores (short term learning, i.e., exam taken one week after the lab exercise). Testing the long-term learning, seven weeks between the two exams in the Spring 2005, show no statistically significant difference between IDV-based group scores and MANUAL-based group scores. However, the IDV group obtained exam score average slightly higher than the MANUAL group. Statistical testing of the principal hypothesis in this study, leads to the conclusion that the IDV-based method did not prove to be a better teaching tool than the traditional paper-based method. Future studies could potentially find significant differences in the effectiveness of both manual and IDV methods if the conditions had been more controlled. That is, students in the control group should not be exposed to the weather analysis using IDV during lecture.
Friedrich, Orsolya; Hemmerling, Kay; Kuehlmeyer, Katja; Nörtemann, Stefanie; Fischer, Martin; Marckmann, Georg
2017-03-03
Recent findings suggest that medical students' moral competence decreases throughout medical school. This pilot study gives preliminary insights into the effects of two educational interventions in ethics classes on moral competence among medical students in Munich, Germany. Between 2012 and 2013, medical students were tested using Lind's Moral Competence Test (MCT) prior to and after completing different ethics classes. The experimental group (EG, N = 76) participated in principle-based structured case discussions (PBSCDs) and was compared with a control group with theory-based case discussions (TBCDs) (CG, N = 55). The pre/post C-scores were compared using a Wilcoxon Test, ANOVA and effect-size calculation. The C-score improved by around 3.2 C-points in the EG, and by 0.2 C-points in the CG. The mean C-score difference was not statistically significant for the EG (P = 0.14) or between the two groups (P = 0.34). There was no statistical significance for the teachers' influence (P = 0.54) on C-score. In both groups, students with below-average (M = 29.1) C-scores improved and students with above-average C-scores regressed. The increase of the C-Index was greater in the EG than in the CG. The absolute effect-size of the EG compared with the CG was 3.0 C-points, indicating a relevant effect. Teaching ethics with PBSCDs did not provide a statistically significant influence on students' moral competence, compared with TBCDs. Yet, the effect size suggests that PBSCDs may improve moral competence among medical students more effectively. Further research with larger and completely randomized samples is needed to gain definite explanations for the results.
Verbal and visual memory in patients with early Parkinson's disease: effect of levodopa.
Singh, Sumit; Behari, Madhuri
2006-03-01
The effect of initiation of levodopa therapy on the memory functions in patients with Parkinson's disease remains poorly understood. To evaluate the effect of initiation of levodopa therapy on memory, in patients with early Parkinson's disease. Prospective case control study. Seventeen patients with early Parkinson's disease were evaluated for verbal memory using Rey's auditory verbal learning test, and visual memory using the Benton's visual retention test and Form sequence learning test. UPDRS scores, Hoehn and Yahr's Staging and Schwab and England scores of Activities of daily living. Hamilton's depression rating scale and MMSE were also evaluated. Six controls were also evaluated according to similar study protocol. Levodopa was then prescribed to the cases. Same tests were repeated on all the subjects after 12 weeks. The mean age of the patients was 59.8 (+ 12.9 yrs); mean disease duration of 3.26 (+ 2.06 yrs). The mean UPDRS scores of patients were 36.52 (+ 15.84). Controls were of a similar age and sex distribution. A statistically significant improvement in the scores on the UPDRS, Hamilton's depression scale, Schwab and England scale, and a statistically significant deterioration in the scores of visual memory was observed in patients with PD after starting levodopa, as compared to their baseline scores. There was no correlation between degree of deterioration and the dose of levodopa. Initiation of levodopa therapy in patients with early and stable Parkinson's disease is associated with deterioration in visual memory functions, with relative preservation of the verbal memory.
Association between Optimism, Psychosocial Well Being and Oral Health: A Cross-Sectional Study.
Thiruvenkadam, G; Asokan, Sharath; Baby John, J; Geetha Priya, P R
The aim of the study was to assess the association of optimism and psychosocial well being of school going children on their oral health status. The study included 12- to 15-year-old school going children (N = 2014) from Tamilnadu, India. Optimism was measured using the revised version of the Life Orientation Test (LOT-R). A questionnaire was sent to the parents regarding their child's psychosocial behavior which included shyness, feeling inferiority, unhappiness and friendliness. Clinical examination for each child was done to assess the DMFT score and OHI-S score. The data obtained were statistically analyzed using Pearson Chi-Square test, Mann-Whitney test and Kruskal-Wallis test with the aid of SPSS software (version 17). Odds Ratio (OR) was calculated with 95% Confidence Interval (CI). The p value ≤ 0.05 was considered statistically significant. Boys with high optimism had significantly lesser DMFT score than the boys with low optimism (p=0.001). Girls with high optimism had significantly higher DMFT score (p=0.001). In psychosocial outcomes, inferiority (p=0.002) and friendliness (p=0.001) showed significant association with DMFT score. Among the boys, children who felt less inferior (p=0.001), less unhappy (p=0.029) and more friendly (p=0.001) had lesser DMFT score. Among the psychosocial outcomes assessed, inferiority and friendliness had significant association with oral health of the children and hence, can be used as a proxy measures oral health.
ERIC Educational Resources Information Center
De Ball, Suzanne; Sullivan, Kathleen; Horine, Julie; Duncan, William K.; Replogle, William
2002-01-01
Comapred University of Mississippi dental student scores on the Dental Admission Test (DAT) and Part I of the National Board Dental Examinations (NBDE) and found that DAT reading comprehension was a statistically significant predictor of all four subtests of the NBDE. Also found that DAT biology and organic chemistry scores were predictors of NBDE…
ERIC Educational Resources Information Center
Weltman, David; Whiteside, Mary
2010-01-01
This research shows that active learning is not universally effective and, in fact, may inhibit learning for certain types of students. The results of this study show that as increased levels of active learning are utilized, student test scores decrease for those with a high grade point average. In contrast, test scores increase as active learning…
Nathoo, Salim; Mateo, Luis R; Chaknis, Patricia; Kemp, James H; Gatzemeyer, John; Morrison, Boyce M; Panagakos, Fotinos
2014-01-01
To evaluate the efficacy of a power toothbrush with distinct multi-directional cleaning action using two different heads (Colgate ProClinical C200 toothbrush with either a triple clean head or a sensitive head) as compared to a manual flat-trim toothbrush (Oral B Indicator toothbrush) on supragingival plaque and established gingivitis. This examiner-blind, randomized, controlled, three-treatment, parallel-group clinical research study assessed plaque removal via the comparison of pre- to post-brushing after a single use and again after four weeks of use, using the Rustogi Modified Navy Plaque Index. This study also assessed gingivitis at four weeks using the Löe-Silness Gingival Index. Qualifying adult male and female subjects from the central New Jersey, USA area reported to the study site after refraining from any oral hygiene procedures for 24 hours, and from eating, drinking, and smoking for four hours. Following an examination for plaque and gingivitis, they were randomized into three balanced groups. Subjects were instructed to brush their teeth for two minutes under supervision with their assigned toothbrush and a commercially available toothpaste (Colgate Cavity Protection toothpaste), after which they were again evaluated for plaque. Subjects were dismissed from the study site with the toothpaste and their assigned toothbrush to use at home twice daily for the next four weeks. They reported to the study site after four weeks of product use, at which time they were evaluated for plaque and gingivitis. One hundred twenty (120) enrolled subjects complied with the protocol and completed the clinical study. The results of the study indicated that all three test products provided statistically significant reductions in pre-brushing to post-brushing plaque scores for whole mouth and interproximal sites after a single use. For gingival margin plaque sites, only the Colgate ProClinical C200 toothbrush, with either the triple clean head or the sensitive head, provided statistically significant reductions in pre- to post-brushing plaque scores. After four weeks of product use, all three test products provided statistically significant reductions in baseline to four-week whole mouth and interproximal site plaque scores, but only the Colgate ProClinical C200 toothbrush, with either the triple clean head or the sensitive head, provided a statistically significant reduction in plaque scores at gingival margin sites. All three test products provided statistically significant reductions in gingival and gingivitis severity index scores after four weeks of product use. Relative to the manual toothbrush group, after a single tooth brushing the Colgate ProClinical C200 toothbrush, with either the triple clean head or sensitive head, provided statistically significantly greater reductions in whole mouth plaque index scores (51.9% and 59.3%, respectively), in gingival margin plaque index scores (700% and 650%, respectively), and interproximal plaque index scores (64.2% and 60.4%, respectively). Relative to the manual toothbrush group, after four weeks of use the Colgate ProClinical C200 toothbrush, with either the triple clean head or sensitive head, provided statistically significantly greater reductions in whole mouth plaque index scores (78.6%, and 82.1%, respectively), in gingival margin plaque index scores (3700% and 3400%, respectively), and interproximal plaque index scores (50.8% and 52.5%, respectively). Relative to the manual toothbrush group, after four weeks of use the Colgate ProClinical C200 toothbrush, with either the triple clean head or sensitive head, provided statistically significantly greater reductions in gingival index scores of 900% and 833%, respectively, and in gingivitis severity index scores of 466.7% and 600%, respectively. All statistically significant reductions were at the p ≤ 0.05 level. There were no statistically significant differences between the scores of the Colgate ProClinical C200 toothbrush with triple clean head and the scores of the Colgate ProClinical C200 toothbrush with sensitive head at any comparison time point. The Colgate ProClinicaI C200 toothbrush, with either a triple clean head or a sensitive head, provides statistically significant and clinically relevant levels of efficacy in the removal of supragingival dental plaque in the whole mouth, at the gingival margin, and interproximally after a single tooth brushing and after four weeks of use, as well as a statistically significantly greater level of efficacy in the reduction of gingivitis and gingival bleeding when compared to a manual flat-trim toothbrush.
2016-12-22
included assessments and instruments, descriptive statistics were calculated. Independent-samples t-tests were conducted using participant survey scores...integrity tests within a multimodal system. Both conditions included the Military Acute Concussion Evaluation (MACE) and an Ease-of-Use survey . Mean scores...for the Ease-of-Use survey and mean test administration times for each measure were compared. Administrative feedback was also considered for
Thrall, Grace C; Coverdale, John H; Benjamin, Sophiya; Wiggins, Anna; Lane, Christianne Joy; Pato, Michele T
2016-10-01
This goal of this study was to evaluate the efficacy of team-based learning (TBL) on knowledge retention compared to traditional lectures with small break-out group discussion (teaching as usual (TAU)) using a randomized controlled trial. This randomized controlled trial was conducted during a daylong conference for psychiatric educators on attention-deficit hyperactivity disorder and the research literacy topic of efficacy versus effectiveness trials. Learners (n = 115) were randomized with concealed allocation to either TBL or TAU. Knowledge was measured prior to the intervention, immediately afterward, and 2 months later via multiple-choice tests. Participants were necessarily unblinded. Data enterers, data analysts, and investigators were blinded to group assignment in data analysis. Per-protocol analyses of test scores were performed using change in knowledge from baseline. The primary endpoint was test scores at 2 months. At baseline, there were no statistically significant differences between groups in pre-test knowledge. At immediate post-test, both TBL and TAU groups showed improved knowledge scores compared with their baseline scores. The TBL group performed better statistically on the immediate post-test than the TAU group (Cohen's d = 0.73; p < 0.001), although the differences in knowledge scores were not educationally meaningful, averaging just one additional test question correct (out of 15). On the 2-month remote post-test, there were no group differences in knowledge retention among the 42 % of participants who returned the 2-month test. Both TBL and TAU learners acquired new knowledge at the end of the intervention and retained knowledge over 2 months. At the end of the intervention day and after 2 months, knowledge test scores were not meaningfully different between TBL and TAU completers. In conclusion, this study failed to demonstrate the superiority of TBL over TAU on the primary outcome of knowledge retention at 2 months post-intervention.
Nursing students' mathematic calculation skills.
Rainboth, Lynde; DeMasi, Chris
2006-12-01
This mixed method study used a pre-test/post-test design to evaluate the efficacy of a teaching strategy in improving beginning nursing student learning outcomes. During a 4-week student teaching period, a convenience sample of 54 sophomore level nursing students were required to complete calculation assignments, taught one calculation method, and mandated to attend medication calculation classes. These students completed pre- and post-math tests and a major medication mathematic exam. Scores from the intervention student group were compared to those achieved by the previous sophomore class. Results demonstrated a statistically significant improvement from pre- to post-test and the students who received the intervention had statistically significantly higher scores on the major medication calculation exam than did the students in the control group. The evaluation completed by the intervention group showed that the students were satisfied with the method and outcome.
An Asian validation of the TIMI risk score for ST-segment elevation myocardial infarction.
Selvarajah, Sharmini; Fong, Alan Yean Yip; Selvaraj, Gunavathy; Haniff, Jamaiyah; Uiterwaal, Cuno S P M; Bots, Michiel L
2012-01-01
Risk stratification in ST-elevation myocardial infarction (STEMI) is important, such that the most resource intensive strategy is used to achieve the greatest clinical benefit. This is essential in developing countries with wide variation in health care facilities, scarce resources and increasing burden of cardiovascular diseases. This study sought to validate the Thrombolysis In Myocardial Infarction (TIMI) risk score for STEMI in a multi-ethnic developing country. Data from a national, prospective, observational registry of acute coronary syndromes was used. The TIMI risk score was evaluated in 4701 patients who presented with STEMI. Model discrimination and calibration was tested in the overall population and in subgroups of patients that were at higher risk of mortality; i.e., diabetics and those with renal impairment. Compared to the TIMI population, this study population was younger, had more chronic conditions, more severe index events and received treatment later. The TIMI risk score was strongly associated with 30-day mortality. Discrimination was good for the overall study population (c statistic 0.785) and in the high risk subgroups; diabetics (c statistic 0.764) and renal impairment (c statistic 0.761). Calibration was good for the overall study population and diabetics, with χ2 goodness of fit test p value of 0.936 and 0.983 respectively, but poor for those with renal impairment, χ2 goodness of fit test p value of 0.006. The TIMI risk score is valid and can be used for risk stratification of STEMI patients for better targeted treatment.
Filipiak, Katarzyna; Klein, Daniel; Roy, Anuradha
2017-01-01
The problem of testing the separability of a covariance matrix against an unstructured variance-covariance matrix is studied in the context of multivariate repeated measures data using Rao's score test (RST). The RST statistic is developed with the first component of the separable structure as a first-order autoregressive (AR(1)) correlation matrix or an unstructured (UN) covariance matrix under the assumption of multivariate normality. It is shown that the distribution of the RST statistic under the null hypothesis of any separability does not depend on the true values of the mean or the unstructured components of the separable structure. A significant advantage of the RST is that it can be performed for small samples, even smaller than the dimension of the data, where the likelihood ratio test (LRT) cannot be used, and it outperforms the standard LRT in a number of contexts. Monte Carlo simulations are then used to study the comparative behavior of the null distribution of the RST statistic, as well as that of the LRT statistic, in terms of sample size considerations, and for the estimation of the empirical percentiles. Our findings are compared with existing results where the first component of the separable structure is a compound symmetry (CS) correlation matrix. It is also shown by simulations that the empirical null distribution of the RST statistic converges faster than the empirical null distribution of the LRT statistic to the limiting χ 2 distribution. The tests are implemented on a real dataset from medical studies. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Fold assessment for comparative protein structure modeling.
Melo, Francisco; Sali, Andrej
2007-11-01
Accurate and automated assessment of both geometrical errors and incompleteness of comparative protein structure models is necessary for an adequate use of the models. Here, we describe a composite score for discriminating between models with the correct and incorrect fold. To find an accurate composite score, we designed and applied a genetic algorithm method that searched for a most informative subset of 21 input model features as well as their optimized nonlinear transformation into the composite score. The 21 input features included various statistical potential scores, stereochemistry quality descriptors, sequence alignment scores, geometrical descriptors, and measures of protein packing. The optimized composite score was found to depend on (1) a statistical potential z-score for residue accessibilities and distances, (2) model compactness, and (3) percentage sequence identity of the alignment used to build the model. The accuracy of the composite score was compared with the accuracy of assessment by single and combined features as well as by other commonly used assessment methods. The testing set was representative of models produced by automated comparative modeling on a genomic scale. The composite score performed better than any other tested score in terms of the maximum correct classification rate (i.e., 3.3% false positives and 2.5% false negatives) as well as the sensitivity and specificity across the whole range of thresholds. The composite score was implemented in our program MODELLER-8 and was used to assess models in the MODBASE database that contains comparative models for domains in approximately 1.3 million protein sequences.
Relationship of the functional movement screen in-line lunge to power, speed, and balance measures.
Hartigan, Erin H; Lawrence, Michael; Bisson, Brian M; Torgerson, Erik; Knight, Ryan C
2014-05-01
The in-line lunge of the Functional Movement Screen (FMS) evaluates lateral stability, balance, and movement asymmetries. Athletes who score poorly on the in-line lunge should avoid activities requiring power or speed until scores are improved, yet relationships between the in-line lunge scores and other measures of balance, power, and speed are unknown. (1) Lunge scores will correlate with center of pressure (COP), maximum jump height (MJH), and 36.6-meter sprint time and (2) there will be no differences between limbs on lunge scores, MJH, or COP. Descriptive laboratory study. Level 3. Thirty-seven healthy, active participants completed the first 3 tasks of the FMS (eg, deep squat, hurdle step, in-line lunge), unilateral drop jumps, and 36.6-meter sprints. A 3-dimensional motion analysis system captured MJH. Force platforms measured COP excursion. A laser timing system measured 36.6-m sprint time. Statistical analyses were used to determine whether a relationship existed between lunge scores and COP, MJH, and 36.6-m speed (Spearman rho tests) and whether differences existed between limbs in lunge scores (Wilcoxon signed-rank test), MJH, and COP (paired t tests). Lunge scores were not significantly correlated with COP, MJH, or 36.6-m sprint time. Lunge scores, COP excursion, and MJH were not statistically different between limbs. Performance on the FMS in-line lunge was not related to balance, power, or speed. Healthy participants were symmetrical in lunging measures and MJH. Scores on the FMS in-line lunge should not be attributed to power, speed, or balance performance without further examination. However, assessing limb symmetry appears to be clinically relevant.
Integral criteria for large-scale multiple fingerprint solutions
NASA Astrophysics Data System (ADS)
Ushmaev, Oleg S.; Novikov, Sergey O.
2004-08-01
We propose the definition and analysis of the optimal integral similarity score criterion for large scale multmodal civil ID systems. Firstly, the general properties of score distributions for genuine and impostor matches for different systems and input devices are investigated. The empirical statistics was taken from the real biometric tests. Then we carry out the analysis of simultaneous score distributions for a number of combined biometric tests and primary for ultiple fingerprint solutions. The explicit and approximate relations for optimal integral score, which provides the least value of the FRR while the FAR is predefined, have been obtained. The results of real multiple fingerprint test show good correspondence with the theoretical results in the wide range of the False Acceptance and the False Rejection Rates.
Understanding a Widely Misunderstood Statistic: Cronbach's "Alpha"
ERIC Educational Resources Information Center
Ritter, Nicola L.
2010-01-01
It is important to explore score reliability in virtually all studies, because tests are not reliable. The present paper explains the most frequently used reliability estimate, coefficient alpha, so that the coefficient's conceptual underpinnings will be understood. Researchers need to understand score reliability because of the possible impact…
Keedy, Alexander W; Durack, Jeremy C; Sandhu, Parmbir; Chen, Eric M; O'Sullivan, Patricia S; Breiman, Richard S
2011-01-01
This study was designed to determine whether an interactive three-dimensional presentation depicting liver and biliary anatomy is more effective for teaching medical students than a traditional textbook format presentation of the same material. Forty-six medical students volunteered for participation in this study. Baseline demographic information, spatial ability, and knowledge of relevant anatomy were measured. Participants were randomized into two groups and presented with a computer-based interactive learning module comprised of animations and still images to highlight various anatomical structures (3D group), or a computer-based text document containing the same images and text without animation or interactive features (2D group). Following each teaching module, students completed a satisfaction survey and nine-item anatomic knowledge post-test. The 3D group scored higher on the post-test than the 2D group, with a mean score of 74% and 64%, respectively; however, when baseline differences in pretest scores were accounted for, this difference was not statistically significant (P = 0.33). Spatial ability did not statistically significantly correlate with post-test scores for the 3D group or the 2D group. In the post-test satisfaction survey the 3D group expressed a statistically significantly higher overall satisfaction rating compared to students in the 2D control group (4.5 versus 3.7 out of 5, P = 0.02). While the interactive 3D multimedia module received higher satisfaction ratings from students, it neither enhanced nor inhibited learning of complex hepatobiliary anatomy compared to an informationally equivalent traditional textbook style approach. . Copyright © 2011 American Association of Anatomists.
Sargin, Mehmet Akif; Yassa, Murat; Taymur, Bilge Dogan; Taymur, Bulent; Akca, Gizem; Tug, Niyazi
2017-04-01
To compare the status of female sexual dysfunction (FSD) between women with a history of previous gestational diabetes mellitus (GDM) and those with follow-up of a healthy pregnancy, using the female sexual function index (FSFI) questionnaire. Cross-sectional study. Department of Obstetrics and Gynecology, Fatih Sultan Mehmet Training and Research Hospital, Istanbul, Turkey, from September to December 2015. Healthy sexually active adult parous females were included. Participants were asked to complete the validated Turkish versions of the FSFI and Hospital Anxiety and Depression Scale (HADS) questionnaires. Student's t-test was used for two-group comparisons of normally distributed variables and quantitative data. Mann-Whitney U-test was used for two-group comparisons of non-normally distributed variables. Pearson's chi-squared test, the Fisher-FreemanHalton test, Fisher's exact test, and Yates' continuity correction test were used for comparison of qualitative data. The mean FSFI scores of the 179 participants was 23.50 ±3.94. FSFI scores and scores of desire, arousal, lubrication, orgasm, satisfaction, and pain were not statistically significantly different (p>0.05), according to a history of GDM and types of FSD (none, mild, severe). HADS scores and anxiety and depression types did not statistically significantly differ according to the history of GDM (p>0.05). An association could not be found in FSFI scores between participants with both the history of previous GDM and with healthy pregnancy; subclinical sexual dysfunction may be observed in the late postpartum period among women with a history of previous GDM. This may adversely affect their sexual health.
AlFaleh, Hussam F; Alsheikh-Ali, Alawi A; Ullah, Anhar; AlHabib, Khalid F; Hersi, Ahmad; Suwaidi, Jassim Al; Sulaiman, Kadhim; Saif, Shukri Al; Almahmeed, Wael; Asaad, Nidal; Amin, Haitham; Al-Motarreb, Ahmed; Kashour, Tarek
2015-09-01
Several risk scores have been developed for acute coronary syndrome (ACS) patients, but their use is limited by their complexity. The new Canada Acute Coronary Syndrome (C-ACS) risk score is a simple risk-assessment tool for ACS patients. This study assessed the performance of the C-ACS risk score in predicting hospital mortality in a contemporary Middle Eastern ACS cohort. The C-ACS score accurately predicts hospital mortality in ACS patients. The baseline risk of 7929 patients from 6 Arab countries who were enrolled in the Gulf RACE-2 registry was assessed using the C-ACS risk score. The score ranged from 0 to 4, with 1 point assigned for the presence of each of the following variables: age ≥75 years, Killip class >1, systolic blood pressure <100 mm Hg, and heart rate >100 bpm. The discriminative ability and calibration of the score were assessed using C statistics and goodness-of-fit tests, respectively. The C-ACS score demonstrated good predictive values for hospital mortality in all ACS patients with a C statistic of 0.77 (95% confidence interval [CI]: 0.74-0.80) and in ST-segment elevation myocardial infarction and non-ST-segment elevation acute coronary syndrome patients (C statistic: 0.76, 95% CI: 0.73-0.79; and C statistic: 0.80, 95% CI: 0.75-0.84, respectively). The discriminative ability of the score was moderate regardless of age category, nationality, and diabetic status. Overall, calibration was optimal in all subgroups. The new C-ACS score performed well in predicting hospital mortality in a contemporary ACS population outside North America. © 2015 Wiley Periodicals, Inc.
The Probability of Exceedance as a Nonparametric Person-Fit Statistic for Tests of Moderate Length
ERIC Educational Resources Information Center
Tendeiro, Jorge N.; Meijer, Rob R.
2013-01-01
To classify an item score pattern as not fitting a nonparametric item response theory (NIRT) model, the probability of exceedance (PE) of an observed response vector x can be determined as the sum of the probabilities of all response vectors that are, at most, as likely as x, conditional on the test's total score. Vector x is to be considered…
Haile, Demewoz; Nigatu, Dabere; Gashaw, Ketema; Demelash, Habtamu
2016-01-01
Academic achievement of school age children can be affected by several factors such as nutritional status, demographics, and socioeconomic factors. Though evidence about the magnitude of malnutrition is well established in Ethiopia, there is a paucity of evidence about the association of nutritional status with academic performance among the nation's school age children. Hence, this study aimed to determine how nutritional status and cognitive function are associated with academic performance of school children in Goba town, South East Ethiopia. An institution based cross-sectional study was conducted among 131 school age students from primary schools in Goba town enrolled during the 2013/2014 academic year. The nutritional status of students was assessed by anthropometric measurement, while the cognitive assessment was measured by the Kaufman Assessment Battery for Children (KABC-II) and Ravens colored progressive matrices (Raven's CPM) tests. The academic performance of the school children was measured by collecting the preceding semester academic result from the school record. Descriptive statistics, bivariate and multivariable linear regression were used in the statistical analysis. This study found a statistically significant positive association between all cognitive test scores and average academic performance except for number recall (p = 0.12) and hand movements (p = 0.08). The correlation between all cognitive test scores and mathematics score was found positive and statistically significant (p < 0.05). In the multivariable linear regression model, better wealth index was significantly associated with higher mathematics score (ß = 0.63; 95 % CI: 0.12-0.74). Similarly a unit change in height for age z score resulted in 2.11 unit change in mathematics score (ß = 2.11; 95 % CI: 0.002-4.21). A single unit change of wealth index resulted 0.53 unit changes in average score of all academic subjects among school age children (ß = 0.53; 95 % CI: 0.11-0.95). A single unit change of age resulted 3.23 unit change in average score of all academic subjects among school age children (ß = 3.23; 95 % CI: 1.20-5.27). Nutritional status (height for age Z score) and wealth could be modifiable factors to improve academic performance of school age children. Moreover, interventions to improve nutrition for mothers and children may be an important contributor to academic success and national economic growth in Ethiopia. Further study with strong design and large sample size is needed.
Serin, Gürdeniz; Karabulut, Gonca; Kabasakal, Yasemin; Kandiloğlu, Gülşen; Akalin, Taner
2016-01-01
Minor salivary gland biopsy is one of the objective tests used in the diagnosis of Sjögren syndrome. The aim of our study was to compare the clinical and laboratory data of primary and secondary Sjögren syndrome cases with a lymphocyte score 3 and 4 in the minor salivary gland biopsy. Data from a total of 2346 consecutive minor salivary gland biopsies were retrospectively evaluated in this study. Clinical and autoantibody characteristics of 367 cases with lymphocyte score 3 or 4 and diagnosed with primary or secondary Sjögren syndrome were compared. There was no difference between lymphocyte score 3 and 4 primary Sjögren syndrome patients in terms of dry mouth, dry eye symptoms and Schirmer test results but Anti-Ro and Antinuclear Antibody positivity was statistically significantly higher in cases with lymphocyte score 4 (p= 0.025, p= 0.001). Anti-Ro test results were also found to be statistically significantly higher in secondary Sjögren syndrome patients with lymphocyte score 4 (p= 0.048). In this study, the high proportion of cases with negative autoantibody but positive lymphocyte score is significant in terms of showing the contribution of minor salivary gland biopsy to Sjögren syndrome diagnosis. Lymphocyte score 3 and 4 cases were found to have similar clinical findings but a difference regarding antibody positivity in primary Sjögren syndrome. We believe that cases with lymphocyte score 4 may be Sjögren syndrome cases whose clinical manifestations are relatively established and higher autoantibody levels are therefore found.
Huynh-Thu, Vân Anh; Saeys, Yvan; Wehenkel, Louis; Geurts, Pierre
2012-07-01
Univariate statistical tests are widely used for biomarker discovery in bioinformatics. These procedures are simple, fast and their output is easily interpretable by biologists but they can only identify variables that provide a significant amount of information in isolation from the other variables. As biological processes are expected to involve complex interactions between variables, univariate methods thus potentially miss some informative biomarkers. Variable relevance scores provided by machine learning techniques, however, are potentially able to highlight multivariate interacting effects, but unlike the p-values returned by univariate tests, these relevance scores are usually not statistically interpretable. This lack of interpretability hampers the determination of a relevance threshold for extracting a feature subset from the rankings and also prevents the wide adoption of these methods by practicians. We evaluated several, existing and novel, procedures that extract relevant features from rankings derived from machine learning approaches. These procedures replace the relevance scores with measures that can be interpreted in a statistical way, such as p-values, false discovery rates, or family wise error rates, for which it is easier to determine a significance level. Experiments were performed on several artificial problems as well as on real microarray datasets. Although the methods differ in terms of computing times and the tradeoff, they achieve in terms of false positives and false negatives, some of them greatly help in the extraction of truly relevant biomarkers and should thus be of great practical interest for biologists and physicians. As a side conclusion, our experiments also clearly highlight that using model performance as a criterion for feature selection is often counter-productive. Python source codes of all tested methods, as well as the MATLAB scripts used for data simulation, can be found in the Supplementary Material.
NASA Astrophysics Data System (ADS)
Sukji, Paweena; Wichaidit, Pacharee Rompayom; Wichaidit, Sittichai
2018-01-01
The objectives of this study were to: 1) compare learning achievement and analytical thinking ability of Mathayomsuksa 3 students before and after learning through inquiry-based learning activities integrated with the local learning resource, and 2) compare average post-test score of learning achievement and analytical thinking ability to its cutting score. The target of this study was 23 Mathayomsuksa 3 students who were studying in the second semester of 2016 academic year from Banchatfang School, Chainat Province. Research instruments composed of: 1) 6 lesson plans of Environment and Natural Resources, 2) the learning achievement test, and 3) analytical thinking ability test. The results showed that 1) student' learning achievement and analytical thinking ability after learning were higher than that of before at the level of .05 statistical significance, and 2) average posttest score of student' learning achievement and analytical thinking ability were higher than its cutting score at the level of .05 statistical significance. The implication of this research is for science teachers and curriculum developers to design inquiry activities that relate to student's context.
Wooley, Dennis S; Kinner, Tracy J
2016-11-01
The purpose was to compare perceived self-management practices of adult type 2 diabetic patients after completing an American Diabetes Association (ADA) certified diabetes self-management education (DSME) program with unstructured individualized nurse practitioner led DSME. Demographic questions and the Self-Care Inventory-Revised (SCIR) were given to two convenience sample patient groups comprising a formal DSME program group and a group within a clinical setting who received informal and unstructured individual education during patient encounters. A t-test was executed between the formal ADA certified education sample and the informal sample's SCI-R individual scores. A second t-test was performed between the two samples' SCI-R mean scores. A t-test determined no statistically significant difference between the formal ADA structured education and informal education samples' SCI-R individual scores. There was not a statistically significant difference between the samples' SCI-R mean scores. The study results suggest that there are not superior DSME settings and instructional approaches. Copyright © 2016 Elsevier Inc. All rights reserved.
Prabhakaran, Shyam; Jovin, Tudor G.; Tayal, Ashis H.; Hussain, Muhammad S.; Nguyen, Thanh N.; Sheth, Kevin N.; Terry, John B.; Nogueira, Raul G.; Horev, Anat; Gandhi, Dheeraj; Wisco, Dolora; Glenn, Brenda A.; Ludwig, Bryan; Clemmons, Paul F.; Cronin, Carolyn A.; Tian, Melissa; Liebeskind, David; Zaidat, Osama O.; Castonguay, Alicia C.; Martin, Coleman; Mueller-Kronast, Nils; English, Joey D.; Linfante, Italo; Malisch, Timothy W.; Gupta, Rishi
2014-01-01
Background There are multiple clinical and radiographic factors that influence outcomes after endovascular reperfusion therapy (ERT) in acute ischemic stroke (AIS). We sought to derive and validate an outcome prediction score for AIS patients undergoing ERT based on readily available pretreatment and posttreatment factors. Methods The derivation cohort included 511 patients with anterior circulation AIS treated with ERT at 10 centers between September 2009 and July 2011. The prospective validation cohort included 223 patients with anterior circulation AIS treated in the North American Solitaire Acute Stroke registry. Multivariable logistic regression identified predictors of good outcome (modified Rankin score ≤2 at 3 months) in the derivation cohort; model β coefficients were used to assign points and calculate a risk score. Discrimination was tested using C statistics with 95% confidence intervals (CIs) in the derivation and validation cohorts. Calibration was assessed using the Hosmer-Lemeshow test and plots of observed to expected outcomes. We assessed the net reclassification improvement for the derived score compared to the Totaled Health Risks in Vascular Events (THRIVE) score. Subgroup analysis in patients with pretreatment Alberta Stroke Program Early CT Score (ASPECTS) and posttreatment final infarct volume measurements was also performed to identify whether these radiographic predictors improved the model compared to simpler models. Results Good outcome was noted in 186 (36.4%) and 100 patients (44.8%) in the derivation and validation cohorts, respectively. Combining readily available pretreatment and posttreatment variables, we created a score (acronym: SNARL) based on the following parameters: symptomatic hemorrhage [2 points: none, hemorrhagic infarction (HI)1–2 or parenchymal hematoma (PH) type 1; 0 points: PH2], baseline National Institutes of Health Stroke Scale score (3 points: 0–10; 1 point: 11–20; 0 points: >20), age (2 points: <60 years; 1 point: 60–79 years; 0 points: >79 years), reperfusion (3 points: Thrombolysis In Cerebral Ischemia score 2b or 3) and location of clot (1 point: M2; 0 points: M1 or internal carotid artery). The SNARL score demonstrated good discrimination in the derivation (C statistic 0.79, 95% CI 0.75–0.83) and validation cohorts (C statistic 0.74, 95% CI 0.68–0.81) and was superior to the THRIVE score (derivation cohort: C statistic 0.65, 95% CI 0.60–0.70; validation cohort: C-statistic 0.59, 95% CI 0.52–0.67; p < 0.01 in both cohorts) but was inferior to a score that included age, ASPECTS, reperfusion status and final infarct volume (C statistic 0.86, 95% CI 0.82–0.91; p = 0.04). Compared with the THRIVE score, the SNARL score resulted in a net reclassification improvement of 34.8%. Conclusions Among AIS patients treated with ERT, pretreatment scores such as the THRIVE score provide only fair prognostic information. Inclusion of posttreatment variables such as reperfusion and symptomatic hemorrhage greatly influences outcome and results in improved outcome prediction. PMID:24942008
Mendell, M J; Eliseeva, E A; Davies, M M; Lobscheid, A
2016-08-01
Limited evidence has associated lower ventilation rates (VRs) in schools with reduced student learning or achievement. We analyzed longitudinal data collected over two school years from 150 classrooms in 28 schools within three California school districts. We estimated daily classroom VRs from real-time indoor carbon dioxide measured by web-connected sensors. School districts provided individual-level scores on standard tests in Math and English, and classroom-level demographic data. Analyses assessing learning effects used two VR metrics: average VRs for 30 days prior to tests, and proportion of prior daily VRs above specified thresholds during the year. We estimated relationships between scores and VR metrics in multivariate models with generalized estimating equations. All school districts had median school-year VRs below the California VR standard. Most models showed some positive associations of VRs with test scores; however, estimates varied in magnitude and few 95% confidence intervals excluded the null. Combined-district models estimated statistically significant increases of 0.6 points (P = 0.01) on English tests for each 10% increase in prior 30-day VRs. Estimated increases in Math were of similar magnitude but not statistically significant. Findings suggest potential small positive associations between classroom VRs and learning. Published 2015. This article is a U.S. Government work and is in the public domain in the USA.
Saadati, Farzaneh; Ahmad Tarmizi, Rohani; Mohd Ayub, Ahmad Fauzi; Abu Bakar, Kamariah
2015-01-01
Because students' ability to use statistics, which is mathematical in nature, is one of the concerns of educators, embedding within an e-learning system the pedagogical characteristics of learning is 'value added' because it facilitates the conventional method of learning mathematics. Many researchers emphasize the effectiveness of cognitive apprenticeship in learning and problem solving in the workplace. In a cognitive apprenticeship learning model, skills are learned within a community of practitioners through observation of modelling and then practice plus coaching. This study utilized an internet-based Cognitive Apprenticeship Model (i-CAM) in three phases and evaluated its effectiveness for improving statistics problem-solving performance among postgraduate students. The results showed that, when compared to the conventional mathematics learning model, the i-CAM could significantly promote students' problem-solving performance at the end of each phase. In addition, the combination of the differences in students' test scores were considered to be statistically significant after controlling for the pre-test scores. The findings conveyed in this paper confirmed the considerable value of i-CAM in the improvement of statistics learning for non-specialized postgraduate students.
Monte Carlo Approach for Reliability Estimations in Generalizability Studies.
ERIC Educational Resources Information Center
Dimitrov, Dimiter M.
A Monte Carlo approach is proposed, using the Statistical Analysis System (SAS) programming language, for estimating reliability coefficients in generalizability theory studies. Test scores are generated by a probabilistic model that considers the probability for a person with a given ability score to answer an item with a given difficulty…
A sup-score test for the cure fraction in mixture models for long-term survivors.
Hsu, Wei-Wen; Todem, David; Kim, KyungMann
2016-12-01
The evaluation of cure fractions in oncology research under the well known cure rate model has attracted considerable attention in the literature, but most of the existing testing procedures have relied on restrictive assumptions. A common assumption has been to restrict the cure fraction to a constant under alternatives to homogeneity, thereby neglecting any information from covariates. This article extends the literature by developing a score-based statistic that incorporates covariate information to detect cure fractions, with the existing testing procedure serving as a special case. A complication of this extension, however, is that the implied hypotheses are not typical and standard regularity conditions to conduct the test may not even hold. Using empirical processes arguments, we construct a sup-score test statistic for cure fractions and establish its limiting null distribution as a functional of mixtures of chi-square processes. In practice, we suggest a simple resampling procedure to approximate this limiting distribution. Our simulation results show that the proposed test can greatly improve efficiency over tests that neglect the heterogeneity of the cure fraction under the alternative. The practical utility of the methodology is illustrated using ovarian cancer survival data with long-term follow-up from the surveillance, epidemiology, and end results registry. © 2016, The International Biometric Society.
Ng'andu, N H
1997-03-30
In the analysis of survival data using the Cox proportional hazard (PH) model, it is important to verify that the explanatory variables analysed satisfy the proportional hazard assumption of the model. This paper presents results of a simulation study that compares five test statistics to check the proportional hazard assumption of Cox's model. The test statistics were evaluated under proportional hazards and the following types of departures from the proportional hazard assumption: increasing relative hazards; decreasing relative hazards; crossing hazards; diverging hazards, and non-monotonic hazards. The test statistics compared include those based on partitioning of failure time and those that do not require partitioning of failure time. The simulation results demonstrate that the time-dependent covariate test, the weighted residuals score test and the linear correlation test have equally good power for detection of non-proportionality in the varieties of non-proportional hazards studied. Using illustrative data from the literature, these test statistics performed similarly.
Quality of life and self-esteem of persons with paraplegia living in São Paulo, Brazil.
Blanes, Leila; Carmagnani, Maria Isabel S; Ferreira, Lydia M
2009-02-01
To evaluate the quality of life (QoL) and self-esteem of paraplegic persons. The sample consisted of 60 outpatients with traumatic paraplegia living in São Paulo, Brazil, from whom clinical and demographic data were obtained. QoL was assessed by the 36-item Short-Form (SF-36) health survey questionnaire, and self-esteem was measured by Rosenberg's Self-Esteem (RSE) scale. Statistical analysis was performed using Student's t-test, analysis of variance and Fisher's least significant difference (LSD) test at a significance level of 5%. Participants were predominately men (86.7%) with a mean age of 32.9 (standard deviation [SD] = 9.47) years, low education level and low income. The SF-36 dimensions that received the lowest scores were physical functioning, role physical and role emotional. Cronbach's alpha for the SF-36 questionnaire was 0.80. A significant statistical difference was found between the presence of pressure ulcers and low scores on mental health (P = 0.001), as determined by Student's t-test. The mean self-esteem score was 8.35 and there was a significant statistical difference between low self-esteem scores and occupation (P = 0.008). Participants reported low QoL and self-esteem. The results provide background information that may be useful in the development of strategies to reduce the impact of spinal cord injury (SCI) on the life and health of persons with SCI, improving their QoL.
Peer Teaching to Foster Learning in Physiology.
Srivastava, Tripti K; Waghmare, Lalitbhushan S; Mishra, Ved Prakash; Rawekar, Alka T; Quazi, Nazli; Jagzape, Arunita T
2015-08-01
Peer teaching is an effective tool to promote learning and retention of knowledge. By preparing to teach, students are encouraged to construct their own learning program, so that they can explain effectively to fellow learners. Peer teaching is introduced in present study to foster learning and pedagogical skills amongst first year medical under-graduates in physiology with a Hypothesis that teaching is linked to learning on part of the teacher. Non-randomized, Interventional study, with mixed methods design. Cases experienced peer teaching whereas controls underwent tutorials for four consecutive classes. Quantitative Evaluation was done through pre/post test score analysis for Class average normalized gain and tests of significance, difference in average score in surprise class test after one month and percentage of responses in closed ended items of feedback questionnaire. Qualitative Evaluation was done through categorization of open ended items and coding of reflective statements. The average pre and post test score was statistically significant within cases (p = 0.01) and controls (p = 0.023). The average post test scores was more for cases though not statistically significant. The class average normalized gain (g) for Tutorials was 49% and for peer teaching 53%. Surprise test had average scoring of 36 marks (out of 50) for controls and 41 marks for cases. Analysed section wise, the average score was better for Long answer question (LAQ) in cases. Section wise analysis suggested that through peer teaching, retention was better for descriptive answers as LAQ has better average score in cases. Feedback responses were predominantly positive for efficacy of peer teaching as a learning method. The reflective statements were sorted into reflection in action, reflection on action, claiming evidence, describing experience, and recognizing discrepancies. Teaching can stimulate further learning as it involves interplay of three processes: metacognitive awareness; deliberate practice, and self-explanation. Coupled with immediate feedback and reflective exercises, learning can be measurably enhanced along with improved teaching skills.
Peer Teaching to Foster Learning in Physiology
Srivastava, Tripti K; Waghmare, Lalitbhushan S.; Mishra, Ved Prakash; Rawekar, Alka T; Quazi, Nazli; Jagzape, Arunita T
2015-01-01
Introduction Peer teaching is an effective tool to promote learning and retention of knowledge. By preparing to teach, students are encouraged to construct their own learning program, so that they can explain effectively to fellow learners. Peer teaching is introduced in present study to foster learning and pedagogical skills amongst first year medical under-graduates in physiology with a Hypothesis that teaching is linked to learning on part of the teacher. Materials and Methods Non-randomized, Interventional study, with mixed methods design. Cases experienced peer teaching whereas controls underwent tutorials for four consecutive classes. Quantitative Evaluation was done through pre/post test score analysis for Class average normalized gain and tests of significance, difference in average score in surprise class test after one month and percentage of responses in closed ended items of feedback questionnaire. Qualitative Evaluation was done through categorization of open ended items and coding of reflective statements. Results The average pre and post test score was statistically significant within cases (p = 0.01) and controls (p = 0.023). The average post test scores was more for cases though not statistically significant. The class average normalized gain (g) for Tutorials was 49% and for peer teaching 53%. Surprise test had average scoring of 36 marks (out of 50) for controls and 41 marks for cases. Analysed section wise, the average score was better for Long answer question (LAQ) in cases. Section wise analysis suggested that through peer teaching, retention was better for descriptive answers as LAQ has better average score in cases. Feedback responses were predominantly positive for efficacy of peer teaching as a learning method. The reflective statements were sorted into reflection in action, reflection on action, claiming evidence, describing experience, and recognizing discrepancies. Conclusion Teaching can stimulate further learning as it involves interplay of three processes: metacognitive awareness; deliberate practice, and self-explanation. Coupled with immediate feedback and reflective exercises, learning can be measurably enhanced along with improved teaching skills. PMID:26435969
Computation of the Molenaar Sijtsma Statistic
NASA Astrophysics Data System (ADS)
Andries van der Ark, L.
The Molenaar Sijtsma statistic is an estimate of the reliability of a test score. In some special cases, computation of the Molenaar Sijtsma statistic requires provisional measures. These provisional measures have not been fully described in the literature, and we show that they have not been implemented in the software. We describe the required provisional measures as to allow the computation of the Molenaar Sijtsma statistic for all data sets.
Maserejian, Nancy N.; Trachtenberg, Felicia L.; Hauser, Russ; McKinlay, Sonja; Shrader, Peter; Bellinger, David C.
2012-01-01
Background Resin-based dental restorations may intra-orally release their components and bisphenol A. Gestational bisphenol A exposure has been associated with poorer executive functioning in children. Objectives To examine whether exposure to resin-based composite restorations is associated with neuropsychological development in children. Methods Secondary analysis of treatment level data from the New England Children’s Amalgam Trial, a 2-group randomized safety trial conducted from 1997–2006. Children (N=534) aged 6–10 y with >2 posterior tooth caries were randomized to treatment with amalgam or resin-based composites (bisphenol-A-diglycidyl-dimethacrylate-composite for permanent teeth; urethane dimethacrylate-based polyacid-modified compomer for primary teeth). Neuropsychological function at 4- and 5-year follow-up (N=444) was measured by a battery of tests of executive function, intelligence, memory, visual-spatial skills, verbal fluency, and problem-solving. Multivariable generalized linear regression models were used to examine the association between composite exposure levels and changes in neuropsychological test scores from baseline to follow-up. For comparison, data on children randomized to amalgam treatment were similarly analyzed. Results With greater exposure to either dental composite material, results were generally consistent in the direction of slightly poorer changes in tests of intelligence, achievement or memory, but there were no statistically significant associations. For the four primary measures of executive function, scores were slightly worse with greater total composite exposure, but statistically significant only for the test of Letter Fluency (10-surface-years β= −0.8, SE=0.4, P=0.035), and the subtest of color naming (β= −1.5, SE=0.5, P=0.004) in the Stroop Color-Word Interference Test. Multivariate analysis of variance confirmed that the negative associations between composite level and executive function were not statistically significant (MANOVA P=0.18). Results for greater amalgam exposure were mostly nonsignificant in the opposite direction of slightly improved scores over follow-up. Conclusions Dental composite restorations had statistically insignificant associations of small magnitude with impairments in neuropsychological test change scores over 4- or 5-years of follow-up in this trial. PMID:22906860
Kim, Yun Hak; Jeong, Dae Cheon; Pak, Kyoungjune; Goh, Tae Sik; Lee, Chi-Seung; Han, Myoung-Eun; Kim, Ji-Young; Liangwen, Liu; Kim, Chi Dae; Jang, Jeon Yeob; Cha, Wonjae; Oh, Sae-Ock
2017-09-29
Accurate prediction of prognosis is critical for therapeutic decisions regarding cancer patients. Many previously developed prognostic scoring systems have limitations in reflecting recent progress in the field of cancer biology such as microarray, next-generation sequencing, and signaling pathways. To develop a new prognostic scoring system for cancer patients, we used mRNA expression and clinical data in various independent breast cancer cohorts (n=1214) from the Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) and Gene Expression Omnibus (GEO). A new prognostic score that reflects gene network inherent in genomic big data was calculated using Network-Regularized high-dimensional Cox-regression (Net-score). We compared its discriminatory power with those of two previously used statistical methods: stepwise variable selection via univariate Cox regression (Uni-score) and Cox regression via Elastic net (Enet-score). The Net scoring system showed better discriminatory power in prediction of disease-specific survival (DSS) than other statistical methods (p=0 in METABRIC training cohort, p=0.000331, 4.58e-06 in two METABRIC validation cohorts) when accuracy was examined by log-rank test. Notably, comparison of C-index and AUC values in receiver operating characteristic analysis at 5 years showed fewer differences between training and validation cohorts with the Net scoring system than other statistical methods, suggesting minimal overfitting. The Net-based scoring system also successfully predicted prognosis in various independent GEO cohorts with high discriminatory power. In conclusion, the Net-based scoring system showed better discriminative power than previous statistical methods in prognostic prediction for breast cancer patients. This new system will mark a new era in prognosis prediction for cancer patients.
Kim, Yun Hak; Jeong, Dae Cheon; Pak, Kyoungjune; Goh, Tae Sik; Lee, Chi-Seung; Han, Myoung-Eun; Kim, Ji-Young; Liangwen, Liu; Kim, Chi Dae; Jang, Jeon Yeob; Cha, Wonjae; Oh, Sae-Ock
2017-01-01
Accurate prediction of prognosis is critical for therapeutic decisions regarding cancer patients. Many previously developed prognostic scoring systems have limitations in reflecting recent progress in the field of cancer biology such as microarray, next-generation sequencing, and signaling pathways. To develop a new prognostic scoring system for cancer patients, we used mRNA expression and clinical data in various independent breast cancer cohorts (n=1214) from the Molecular Taxonomy of Breast Cancer International Consortium (METABRIC) and Gene Expression Omnibus (GEO). A new prognostic score that reflects gene network inherent in genomic big data was calculated using Network-Regularized high-dimensional Cox-regression (Net-score). We compared its discriminatory power with those of two previously used statistical methods: stepwise variable selection via univariate Cox regression (Uni-score) and Cox regression via Elastic net (Enet-score). The Net scoring system showed better discriminatory power in prediction of disease-specific survival (DSS) than other statistical methods (p=0 in METABRIC training cohort, p=0.000331, 4.58e-06 in two METABRIC validation cohorts) when accuracy was examined by log-rank test. Notably, comparison of C-index and AUC values in receiver operating characteristic analysis at 5 years showed fewer differences between training and validation cohorts with the Net scoring system than other statistical methods, suggesting minimal overfitting. The Net-based scoring system also successfully predicted prognosis in various independent GEO cohorts with high discriminatory power. In conclusion, the Net-based scoring system showed better discriminative power than previous statistical methods in prognostic prediction for breast cancer patients. This new system will mark a new era in prognosis prediction for cancer patients. PMID:29100405
The Third U.S.A. Mathematical Olympiad
ERIC Educational Resources Information Center
Greitzer, Samuel L.
1975-01-01
The 1974 Third United States of America Mathematical Olympiad for secondary school students is described. Included are five test problems with solutions, a brief statistical analysis of test scores, and a list of the eight finalists. (CR)
Using Multilevel Modeling in Language Assessment Research: A Conceptual Introduction
ERIC Educational Resources Information Center
Barkaoui, Khaled
2013-01-01
This article critiques traditional single-level statistical approaches (e.g., multiple regression analysis) to examining relationships between language test scores and variables in the assessment setting. It highlights the conceptual, methodological, and statistical problems associated with these techniques in dealing with multilevel or nested…
Zhao, Yue
2017-03-01
In patient-reported outcome research that utilizes item response theory (IRT), using statistical significance tests to detect misfit is usually the focus of IRT model-data fit evaluations. However, such evaluations rarely address the impact/consequence of using misfitting items on the intended clinical applications. This study was designed to evaluate the impact of IRT item misfit on score estimates and severity classifications and to demonstrate a recommended process of model-fit evaluation. Using secondary data sources collected from the Patient-Reported Outcome Measurement Information System (PROMIS) wave 1 testing phase, analyses were conducted based on PROMIS depression (28 items; 782 cases) and pain interference (41 items; 845 cases) item banks. The identification of misfitting items was assessed using Orlando and Thissen's summed-score item-fit statistics and graphical displays. The impact of misfit was evaluated according to the agreement of both IRT-derived T-scores and severity classifications between inclusion and exclusion of misfitting items. The examination of the presence and impact of misfit suggested that item misfit had a negligible impact on the T-score estimates and severity classifications with the general population sample in the PROMIS depression and pain interference item banks, implying that the impact of item misfit was insignificant. Findings support the T-score estimates in the two item banks as robust against item misfit at both the group and individual levels and add confidence to the use of T-scores for severity diagnosis in the studied sample. Recommendations on approaches for identifying item misfit (statistical significance) and assessing the misfit impact (practical significance) are given.
Validity and reliability of Abbreviated Mental Test Score (AMTS) among older Iranian.
Foroughan, Mahshid; Wahlund, Lars-Olof; Jafari, Zahra; Rahgozar, Mehdi; Farahani, Ida G; Rashedi, Vahid
2017-11-01
Cognitive impairment is common among older people and is associated with increased morbidity and mortality. The main aim of this study was to evaluate the validity of the Persian version of the Abbreviated Mental Test Score (AMTS) as a screening tool for dementia. Data were obtained from a cross-sectional study. One hundred and one older adults who were members of Iranian Alzheimer Association and 101 of their siblings were entered into this study by convenient sampling. The Diagnostic and Statistical Manual of Mental Disorders, 4th edition, criteria for diagnosing dementia and the Mini-Mental State Examination were used as the study tools. The gathered data were analyzed by the Mann-Whitney U-test, the Kruskal-Wallis test, Spearman's rank correlation coefficient, and the receiver-operating characteristic. The AMTS could successfully differentiate the dementia group from the non-dementia group. Scores were significantly correlated with Diagnostic and Statistical Manual of Mental Disorders diagnosis for dementia and Mini-Mental State Examination scores (P < 0.001). Educational level (P < 0.001) and male sex (P = 0.015) were positively associated with AMTS, whereas (P < 0.001) was negatively associated with AMTS. Total Cronbach's α coefficient was 0.90. The scores 6 and 7 showed the optimum balance between sensitivity (99% and 94%, respectively) and specificity (85% and 86%, respectively). The Persian version of the AMTS is a valid cognitive assessment tool for older Iranian adults and can be used for dementia screening in Iran. © 2017 Japanese Psychogeriatric Society.
The potential of composite cognitive scores for tracking progression in Huntington's disease.
Jones, Rebecca; Stout, Julie C; Labuschagne, Izelle; Say, Miranda; Justo, Damian; Coleman, Allison; Dumas, Eve M; Hart, Ellen; Owen, Gail; Durr, Alexandra; Leavitt, Blair R; Roos, Raymund; O'Regan, Alison; Langbehn, Doug; Tabrizi, Sarah J; Frost, Chris
2014-01-01
Composite scores derived from joint statistical modelling of individual risk factors are widely used to identify individuals who are at increased risk of developing disease or of faster disease progression. We investigated the ability of composite measures developed using statistical models to differentiate progressive cognitive deterioration in Huntington's disease (HD) from natural decline in healthy controls. Using longitudinal data from TRACK-HD, the optimal combinations of quantitative cognitive measures to differentiate premanifest and early stage HD individuals respectively from controls was determined using logistic regression. Composite scores were calculated from the parameters of each statistical model. Linear regression models were used to calculate effect sizes (ES) quantifying the difference in longitudinal change over 24 months between premanifest and early stage HD groups respectively and controls. ES for the composites were compared with ES for individual cognitive outcomes and other measures used in HD research. The 0.632 bootstrap was used to eliminate biases which result from developing and testing models in the same sample. In early HD, the composite score from the HD change prediction model produced an ES for difference in rate of 24-month change relative to controls of 1.14 (95% CI: 0.90 to 1.39), larger than the ES for any individual cognitive outcome and UHDRS Total Motor Score and Total Functional Capacity. In addition, this composite gave a statistically significant difference in rate of change in premanifest HD compared to controls over 24-months (ES: 0.24; 95% CI: 0.04 to 0.44), even though none of the individual cognitive outcomes produced statistically significant ES over this period. Composite scores developed using appropriate statistical modelling techniques have the potential to materially reduce required sample sizes for randomised controlled trials.
Testing non-inferiority of a new treatment in three-arm clinical trials with binary endpoints.
Tang, Nian-Sheng; Yu, Bin; Tang, Man-Lai
2014-12-18
A two-arm non-inferiority trial without a placebo is usually adopted to demonstrate that an experimental treatment is not worse than a reference treatment by a small pre-specified non-inferiority margin due to ethical concerns. Selection of the non-inferiority margin and establishment of assay sensitivity are two major issues in the design, analysis and interpretation for two-arm non-inferiority trials. Alternatively, a three-arm non-inferiority clinical trial including a placebo is usually conducted to assess the assay sensitivity and internal validity of a trial. Recently, some large-sample approaches have been developed to assess the non-inferiority of a new treatment based on the three-arm trial design. However, these methods behave badly with small sample sizes in the three arms. This manuscript aims to develop some reliable small-sample methods to test three-arm non-inferiority. Saddlepoint approximation, exact and approximate unconditional, and bootstrap-resampling methods are developed to calculate p-values of the Wald-type, score and likelihood ratio tests. Simulation studies are conducted to evaluate their performance in terms of type I error rate and power. Our empirical results show that the saddlepoint approximation method generally behaves better than the asymptotic method based on the Wald-type test statistic. For small sample sizes, approximate unconditional and bootstrap-resampling methods based on the score test statistic perform better in the sense that their corresponding type I error rates are generally closer to the prespecified nominal level than those of other test procedures. Both approximate unconditional and bootstrap-resampling test procedures based on the score test statistic are generally recommended for three-arm non-inferiority trials with binary outcomes.
Karr, Justin E; Garcia-Barrera, Mauricio A; Holdnack, James A; Iverson, Grant L
2018-01-01
Multivariate base rates allow for the simultaneous statistical interpretation of multiple test scores, quantifying the normal frequency of low scores on a test battery. This study provides multivariate base rates for the Delis-Kaplan Executive Function System (D-KEFS). The D-KEFS consists of 9 tests with 16 Total Achievement scores (i.e. primary indicators of executive function ability). Stratified by education and intelligence, multivariate base rates were derived for the full D-KEFS and an abbreviated four-test battery (i.e. Trail Making, Color-Word Interference, Verbal Fluency, and Tower Test) using the adult portion of the normative sample (ages 16-89). Multivariate base rates are provided for the full and four-test D-KEFS batteries, calculated using five low score cutoffs (i.e. ≤25th, 16th, 9th, 5th, and 2nd percentiles). Low scores occurred commonly among the D-KEFS normative sample, with 82.6 and 71.8% of participants obtaining at least one score ≤16th percentile for the full and four-test batteries, respectively. Intelligence and education were inversely related to low score frequency. The base rates provided herein allow clinicians to interpret multiple D-KEFS scores simultaneously for the full D-KEFS and an abbreviated battery of commonly administered tests. The use of these base rates will support clinicians when differentiating between normal variations in cognitive performance and true executive function deficits.
A seven-year follow-up of intelligence test scores of foster grandparents.
Troll, L E; Saltz, R; Dunin-Markiewicz, A
1976-09-01
After 7 years, a group of originally nonemployed poverty-level older people (over 60) who had been employed as foster grandparents were retested with the WAIS. Four WAIS subtests - Vocabulary Similarities, Digit Span, and Block Design - were employed. Of the original group of 39, complete data were available for 28; 18 of these were still working on the project, and the other 10 had dropped out. Dropouts as a group tested lower originally and also showed more deterioration in functional health ratings over time. For the total group of 32 foster grandparents, three subtest scores showed stability over the 7 years. Only Digit Span showed a statistically significant drop. Neither age nor the initial level of health or WAIS scores was related to test-score changes over time.
New statistical potential for quality assessment of protein models and a survey of energy functions
2010-01-01
Background Scoring functions, such as molecular mechanic forcefields and statistical potentials are fundamentally important tools in protein structure modeling and quality assessment. Results The performances of a number of publicly available scoring functions are compared with a statistical rigor, with an emphasis on knowledge-based potentials. We explored the effect on accuracy of alternative choices for representing interaction center types and other features of scoring functions, such as using information on solvent accessibility, on torsion angles, accounting for secondary structure preferences and side chain orientation. Partially based on the observations made, we present a novel residue based statistical potential, which employs a shuffled reference state definition and takes into account the mutual orientation of residue side chains. Atom- and residue-level statistical potentials and Linux executables to calculate the energy of a given protein proposed in this work can be downloaded from http://www.fiserlab.org/potentials. Conclusions Among the most influential terms we observed a critical role of a proper reference state definition and the benefits of including information about the microenvironment of interaction centers. Molecular mechanical potentials were also tested and found to be over-sensitive to small local imperfections in a structure, requiring unfeasible long energy relaxation before energy scores started to correlate with model quality. PMID:20226048
Turc, Guillaume; Aguettaz, Pierre; Ponchelle-Dequatre, Nelly; Hénon, Hilde; Naggara, Olivier; Leclerc, Xavier; Cordonnier, Charlotte; Leys, Didier; Mas, Jean-Louis; Oppenheim, Catherine
2014-01-01
The aim of our study was to validate in an independent cohort the MRI-DRAGON score, an adaptation of the (CT-) DRAGON score to predict 3-month outcome in acute ischemic stroke patients undergoing MRI before intravenous thrombolysis (IV-tPA). We reviewed consecutive (2009-2013) anterior circulation stroke patients treated within 4.5 hours by IV-tPA in the Lille stroke unit (France), where MRI is the first-line pretherapeutic work-up. We assessed the discrimination and calibration of the MRI-DRAGON score to predict poor 3-month outcome, defined as modified Rankin Score >2, using c-statistic and the Hosmer-Lemeshow test, respectively. We included 230 patients (mean ±SD age 70.4±16.0 years, median [IQR] baseline NIHSS 8 [5]-[14]; poor outcome in 78(34%) patients). The c-statistic was 0.81 (95%CI 0.75-0.87), and the Hosmer-Lemeshow test was not significant (p = 0.54). The MRI-DRAGON score showed good prognostic performance in the external validation cohort. It could therefore be used to inform the patient's relatives about long-term prognosis and help to identify poor responders to IV-tPA alone, who may be candidates for additional therapeutic strategies, if they are otherwise eligible for such procedures based on the institutional criteria.
Does gender and experience influence shade matching quality?
Haddad, Helene J; Jakstat, Holger A; Arnetzl, Gerwin; Borbely, Judit; Vichi, Alessandro; Dumfahrt, Herbert; Renault, Patrick; Corcodel, Nicoleta; Pohlen, Bostjan; Marada, Gyula; de Parga, Juan A Martinez Vazquez; Reshad, Mamaly; Klinke, Thomas U; Hannak, Wolfgang B; Paravina, Rade D
2009-01-01
To evaluate the influence of gender and level of experience on shade matching quality. A study was simultaneously performed at 15 universities located in 9 countries. A total of 614 color normal participants completed all phases of the experiment. Among them, there were 305 females and 309 males, 319 dental students and 295 dental professionals. A lecture on color matching in dentistry was given to all participants. Initial training was performed using Toothguide Trainer software (TT), while Toothguide Training Box (TTB) was used for both training and testing of participants' shade matching results. The test task was to successively match 15 shade guide tabs with the corresponding shade guide. The shade matching score for each participant was computed as a sum of color differences (SigmaDeltaE(ab)(*) score) between target tabs and selected tabs. Lower scores corresponded to better shade matching results and vice versa. Means and standard deviations were calculated. Mann-Whitney U test was used for statistical analysis of the data (alpha=0.05). The mean shade matching score (S.D.) for all participants was 41 (21). The score for female and male participants was 38 (20) and 44 (21), respectfully (p<0.001). The difference in scores between dental students, 42 (20), and dental professionals, 39 (21), was not statistically significant. Within the limitations of this study, females achieved significantly better shade matching results than males, indicating that gender plays an important role in shade matching. The level of experience was not found to be significant factor in shade matching.
Statistical Summary of Missouri Higher Education, 1999-2000.
ERIC Educational Resources Information Center
Missouri State Coordinating Board for Higher Education, Jefferson City.
This report provides a statistical summary of higher education in Missouri for the 1999-2000 academic year. More than 74 tables provide data on: advanced placement enrollment in secondary schools, American College Testing program scores by institutional sector, high school rankings by institutional sector, the Missouri Coordinating Board for…
Assessment of numeracy in sports and exercise science students at an Australian university
NASA Astrophysics Data System (ADS)
Green, Simon; McGlynn, Susan; Stuart, Deidre; Fahey, Paul; Pettigrew, Jim; Clothier, Peter
2018-05-01
The effect of high school study of mathematics on numeracy performance of sports and exercise science (SES) students is not clear. To investigate this further, we tested the numeracy skills of 401 students enrolled in a Bachelor of Health Sciences degree in SES using a multiple-choice survey consisting of four background questions and 39 numeracy test questions. Background questions (5-point scale) focused on highest level of mathematics studied at high school, self-perception of mathematics proficiency, perceived importance of mathematics to SES and likelihood of seeking help with mathematics. Numeracy questions focused on rational number, ratios and rates, basic algebra and graph interpretation. Numeracy performance was based on answers to these questions (1 mark each) and represented by the total score (maximum = 39). Students from first (n = 212), second (n = 78) and third (n = 111) years of the SES degree completed the test. The distribution of numeracy test scores for the entire cohort was negatively skewed with a median (IQR) score of 27(11). We observed statistically significant associations between test scores and the highest level of mathematics studied (P < 0.05), being lowest in students who studied Year 10 Mathematics (20 (9)), intermediate in students who studied Year 12 General Mathematics (26 (8)) and highest in two groups of students who studied higher-level Year 12 Mathematics (31 (9), 31 (6)). There were statistically significant associations between test scores and level of self-perception of mathematics proficiency and also likelihood of seeking help with mathematics (P < 0.05) but not with perceived importance of mathematics to SES. These findings reveal that the level of mathematics studied in high school is a critical factor determining the level of numeracy performance in SES students.
An Analytical Evaluation of Two Common-Odds Ratios as Population Indicators of DIF.
ERIC Educational Resources Information Center
Pommerich, Mary; And Others
The Mantel-Haenszel (MH) statistic for identifying differential item functioning (DIF) commonly conditions on the observed test score as a surrogate for conditioning on latent ability. When the comparison group distributions are not completely overlapping (i.e., are incongruent), the observed score represents different levels of latent ability…
Healthcare teams as complex adaptive systems: Focus on interpersonal interaction.
Pype, Peter; Krystallidou, Demi; Deveugele, Myriam; Mertens, Fien; Rubinelli, Sara; Devisch, Ignaas
2017-11-01
The aim of this study is to test the feasibility of a tool to objectify the functioning of healthcare teams operating in the complexity zone, and to evaluate its usefulness in identifying areas for team quality improvement. We distributed The Complex Adaptive Leadership (CAL™) Organisational Capability Questionnaire (OCQ) to all members of one palliative care team (n=15) and to palliative care physicians in Flanders, Belgium (n=15). Group discussions were held on feasibility aspects and on the low scoring topics. Data was analysed calculating descriptive statistics (sum score, mean and standard deviation). The one sample T-Test was used to detect differences within each group. Both groups of participants reached mean scores ranging from good to excellent. The one sample T test showed statistically significant differences between participants' sum scores within each group (p<0,001). Group discussion led to suggestions for quality improvement e.g. enhanced feedback strategies between team members. The questionnaire used in our study shows to be a feasible and useful instrument for the evaluation of the palliative care teams' day-to-day operations and to identify areas for quality improvement. The CAL™OCQ is a promising instrument to evaluate any healthcare team functioning. A group discussion on the questionnaire scores can serve as a starting point to identify targets for quality improvement initiatives. Copyright © 2017 Elsevier B.V. All rights reserved.
Schiff, Thomas; Delgado, Evaristo; Zhang, Yun Po; Cummins, Diane; DeVizio, William; Mateo, Luis R
2009-03-01
To determine the efficacy of an in-office desensitizing paste containing 8% arginine and calcium carbonate relative to that of a commercially-available pumice prophylaxis paste in reducing dentin hypersensitivity instantly after a single application following a dental scaling procedure and to establish the duration of sensitivity relief over a period of 4 weeks and 12 weeks. This was a single-center, parallel group, double-blind, stratified clinical study conducted in San Francisco, California, USA. Qualifying adult male and female subjects who presented two hypersensitive teeth with a tactile hypersensitivity score (Yeaple Probe) between 10-50 grams of force and an air blast hypersensitivity score of 2 or 3 (Schiff Cold Air Sensitivity Scale) were stratified according to their baseline hypersensitivity scores and randomly assigned within strata to one of two treatment groups: (1) A Test Paste, a desensitizing paste containing 8% arginine and calcium carbonate (Colgate-Palmolive Co); and (2) A Control Paste, Nupro pumice prophylaxis paste (Dentsply Professional). Subjects received a professionally-administered scaling procedure, after which they were re-examined for tactile and air blast dentin hypersensitivity (Post-Scaling Examinations). The assigned pastes were then applied as the final step to the professional dental cleaning procedure. Tactile and air blast dentin hypersensitivity examinations were again performed immediately after paste application. Subjects were provided with a commercially-available non-desensitizing dentifrice containing 0.243% sodium fluoride (Crest Cavity Protection, Procter & Gamble Co.) and an adult soft-bristled toothbrush and were instructed to brush their teeth for 1 minute, twice daily at home using only the toothbrush and dentifrice provided, for the next 12 weeks. Subjects returned to the testing facility 4 and 12 weeks after the single application of Test or Control paste, having refrained from all oral hygiene procedures and chewing gum for 8 hours and from eating and drinking for 4 hours, prior to each follow-up visit. Assessments of tactile and air blast hypersensitivity, and examinations of oral soft and hard tissue were repeated at these 4- and 12-week examinations. 68 subjects completed the 12-week study. No statistically significant differences from baseline scores were indicated at the Post-Scaling Examinations for either the Test Paste or Control Paste groups. Immediately following product application and 4 weeks after product application, subjects assigned to the Test Paste group exhibited statistically significant improvements from baseline with respect to baseline-adjusted mean air blast (44.1% and 45.9% respectively) and mean tactile hypersensitivity scores (156.2% and 170.3% respectively). At the same time points, subjects assigned to the Control Paste group exhibited statistically significant improvements from baseline with respect to baseline-adjusted mean air blast (15.1% and 8.9% respectively) and mean tactile hypersensitivity scores (43.1% and 8.3% respectively). Immediately following application of the assigned paste and 4 weeks later, the Test Paste group demonstrated statistically significant reductions in dentin hypersensitivity with respect to baseline-adjusted mean air blast (34.1% and 40.6% respectively) and mean tactile hypersensitivity scores (79.0% and 149.6% respectively), compared to the Control Paste group. No statistically significant differences were exhibited between paste groups at the Post-Scaling and 12-week examinations with respect to mean tactile and baseline-adjusted mean air blast hypersensitivity scores.
Shahraki, Mohammad Reza; Ahmadimoghadm, Mahdieh; Shahraki, Ahmad Reza
2015-10-01
Borago officinalis flower (borage) is a known sedative in herbal medicine; the aim of the present study was to evaluate the antinociceptive effect of borage hydroalcoholic extract in formalin test male rats. Fifty-six adult male albino Wistar rats were randomly divided into seven groups: Control groups of A (intact), B (saline), and C (Positive control) plus test groups of D, E, F, and G (n=8). The groups D, E, and F received 6.25, 12.5, and 25 mg/kg, Borago officinalis flower hydroalcholic extract before the test, respectively but group G received 25 mg/kg borage extract and aspirin before the test. A biphasic pain was induced by injection of formalin 1%. The obtained data were analyzed by SPSS software ver. 17 employing statistical tests of Kruskal-Wallis and Mann-Whitney. The results were expressed as mean±SD. Statistical differences were considered significant at P<0.05. The results revealed that the acute and chronic pain behavior score in test groups of D, E, F, and G significantly decreased compared to groups A and B, but this score did not show any difference compared to group C. Moreover, chronic pain behavior score in group G was significantly lower than all other groups. The results indicated that Borago officinalis hydroalcoholic extract affects the acute and chronic pain behavior response in formaline test male rats.
Application of Bayesian Methods for Detecting Fraudulent Behavior on Tests
ERIC Educational Resources Information Center
Sinharay, Sandip
2018-01-01
Producers and consumers of test scores are increasingly concerned about fraudulent behavior before and during the test. There exist several statistical or psychometric methods for detecting fraudulent behavior on tests. This paper provides a review of the Bayesian approaches among them. Four hitherto-unpublished real data examples are provided to…
Dyer, Jeannie; Thomas, Karen; Sandsund, Cathy; Shaw, Clare
2013-08-01
To test whether reflexology was inferior to aromatherapy massage for ameliorating self-selected problems or concerns. Non-blinded, randomised study with a 1:1 allocation. Adult outpatients recruited from a UK cancer centre, randomised by the minimisation method to either four aromatherapy massage or four reflexology sessions. MYCaW scores at baseline and completion; VAS (relaxation) pre and post-sessions. Unpaired t-test for the primary outcome; analysis of variance tests for repeated measures for VAS (relaxation); descriptive statistics (means and 95% confidence intervals) and content analysis for patient comments. 115 subjects (58 aromatherapy massage, 57 reflexology) recruited. Reflexology was found to be no less effective than aromatherapy massage for MYCaW first concerns (p = 0.046). There was no statistical difference between groups for MYCaW second concerns or overall well-being scores, proportions of patients gaining clinical benefit, VAS scores over time (p = 0.489) or between groups (p = 0.408) or in the written responses. Copyright © 2013 Elsevier Ltd. All rights reserved.
Wang, Chih-Wei; Liu, Yi-Jui; Lee, Yi-Hsiung; Hueng, Dueng-Yuan; Fan, Hueng-Chuen; Yang, Fu-Chi; Hsueh, Chun-Jen; Kao, Hung-Wen; Juan, Chun-Jung; Hsu, Hsian-He
2014-01-01
Purpose To investigate the performance of hematoma shape, hematoma size, Glasgow coma scale (GCS) score, and intracerebral hematoma (ICH) score in predicting the 30-day mortality for ICH patients. To examine the influence of the estimation error of hematoma size on the prediction of 30-day mortality. Materials and Methods This retrospective study, approved by a local institutional review board with written informed consent waived, recruited 106 patients diagnosed as ICH by non-enhanced computed tomography study. The hemorrhagic shape, hematoma size measured by computer-assisted volumetric analysis (CAVA) and estimated by ABC/2 formula, ICH score and GCS score was examined. The predicting performance of 30-day mortality of the aforementioned variables was evaluated. Statistical analysis was performed using Kolmogorov-Smirnov tests, paired t test, nonparametric test, linear regression analysis, and binary logistic regression. The receiver operating characteristics curves were plotted and areas under curve (AUC) were calculated for 30-day mortality. A P value less than 0.05 was considered as statistically significant. Results The overall 30-day mortality rate was 15.1% of ICH patients. The hematoma shape, hematoma size, ICH score, and GCS score all significantly predict the 30-day mortality for ICH patients, with an AUC of 0.692 (P = 0.0018), 0.715 (P = 0.0008) (by ABC/2) to 0.738 (P = 0.0002) (by CAVA), 0.877 (P<0.0001) (by ABC/2) to 0.882 (P<0.0001) (by CAVA), and 0.912 (P<0.0001), respectively. Conclusion Our study shows that hematoma shape, hematoma size, ICH scores and GCS score all significantly predict the 30-day mortality in an increasing order of AUC. The effect of overestimation of hematoma size by ABC/2 formula in predicting the 30-day mortality could be remedied by using ICH score. PMID:25029592
Rescore protein-protein docked ensembles with an interface contact statistics.
Mezei, Mihaly
2017-02-01
The recently developed statistical measure for the type of residue-residue contact at protein complex interfaces, based on a parameter-free definition of contact, has been used to define a contact score that is correlated with the likelihood of correctness of a proposed complex structure. Comparing the proposed contact scores on the native structure and on a set of model structures the proposed measure was shown to generally favor the native structure but in itself was not able to reliably score the native structure to be the best. Adjusting the scores of redocking experiments with the contact score showed that the adjusted score was able to move up the ranking of the native-like structure among the proposed complexes when the native-like was not ranked the best by the respective program. Tests on docking of unbound proteins compared the contact scores of the complexes with the contact score of the crystal structure again showing the tendency of the contact score to favor native-like conformations. The possibility of using the contact score to improve the determination of biological dimers in a crystal structure was also explored. Proteins 2017; 85:235-241. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Task-based learning versus problem-oriented lecture in neurology continuing medical education.
Vakani, Farhan; Jafri, Wasim; Ahmad, Amina; Sonawalla, Aziz; Sheerani, Mughis
2014-01-01
To determine whether general practitioners learned better with task-based learning or problem-oriented lecture in a Continuing Medical Education (CME) set-up. Quasi-experimental study. The Aga Khan University, Karachi campus, from April to June 2012. Fifty-nine physicians were given a choice to opt for either Task-based Learning (TBL) or Problem Oriented Lecture (PBL) in a continuing medical education set-up about headaches. The TBL group had 30 participants divided into 10 small groups, and were assigned case-based tasks. The lecture group had 29 participants. Both groups were given a pre and a post-test. Pre/post assessment was done using one-best MCQs. The reliability coefficient of scores for both the groups was estimated through Cronbach's alpha. An item analysis for difficulty and discriminatory indices was calculated for both the groups. Paired t-test was used to determine the difference between pre- and post-test scores of both groups. Independent t-test was used to compare the impact of the two teaching methods in terms of learning through scores produced by MCQ test. Cronbach's alpha was 0.672 for the lecture group and 0.881 for TBL group. Item analysis for difficulty (p) and discriminatory indexes (d) was obtained for both groups. The results for the lecture group showed pre-test (p) = 42% vs. post-test (p) = 43%; pre- test (d) = 0.60 vs. post-test (d) = 0.40. The TBL group showed pre -test (p) = 48% vs. post-test (p) = 70%; pre-test (d) = 0.69 vs. post-test (d) = 0.73. Lecture group pre-/post-test mean scores were (8.52 ± 2.95 vs. 12.41 ± 2.65; p < 0.001), where TBL group showed (9.70 ± 3.65 vs. 14 ± 3.99; p < 0.001). Independent t-test exhibited an insignificant difference at baseline (lecture 8.52 ± 2.95 vs. TBL 9.70 ± 3.65; p = 0.177). The post-scores were not statistically different lecture 12.41 ± 2.65 vs. TBL 14 ± 3.99; p = 0.07). Both delivery methods were found to be equally effective, showing statistically insignificant differences. However, TBL groups' post-test higher mean scores and radical increase in the post-test difficulty index demonstrated improved learning through TBL delivery and calls for further exploration of longitudinal studies in the context of CME.
Empirical methods for assessing meaningful neuropsychological change following epilepsy surgery.
Sawrie, S M; Chelune, G J; Naugle, R I; Lüders, H O
1996-11-01
Traditional methods for assessing the neurocognitive effects of epilepsy surgery are confounded by practice effects, test-retest reliability issues, and regression to the mean. This study employs 2 methods for assessing individual change that allow direct comparison of changes across both individuals and test measures. Fifty-one medically intractable epilepsy patients completed a comprehensive neuropsychological battery twice, approximately 8 months apart, prior to any invasive monitoring or surgical intervention. First, a Reliable Change (RC) index score was computed for each test score to take into account the reliability of that measure, and a cutoff score was empirically derived to establish the limits of statistically reliable change. These indices were subsequently adjusted for expected practice effects. The second approach used a regression technique to establish "change norms" along a common metric that models both expected practice effects and regression to the mean. The RC index scores provide the clinician with a statistical means of determining whether a patient's retest performance is "significantly" changed from baseline. The regression norms for change allow the clinician to evaluate the magnitude of a given patient's change on 1 or more variables along a common metric that takes into account the reliability and stability of each test measure. Case data illustrate how these methods provide an empirically grounded means for evaluating neurocognitive outcomes following medical interventions such as epilepsy surgery.
Rasch analysis of three dry eye questionnaires and correlates with objective clinical tests.
McAlinden, Colm; Gao, Rongrong; Wang, Qinmei; Zhu, Senmiao; Yang, Jing; Yu, Ayong; Bron, Anthony J; Huang, Jinhai
2017-04-01
To assess the psychometric properties of Chinese versions of the Ocular Comfort Index (OCI), Ocular Surface Disease Index (OSDI) and McMonnies questionnaires. Further, to assess the correlation between questionnaire scores and objective dry eye disease (DED) clinical tests. Translated versions of the OCI, OSDI and McMonnies questionnaires were completed in a random order by 238 participants with DED. Objective clinical tests included visual acuity (VA), fluorescein tear film break-up time (TBUT), corneal fluorescein staining, Schirmer I testing and meibomian gland grading. Rasch analysis was used to assess questionnaire psychometrics and spearman rank for correlations. For the OCI, the person separation was 2.31, item infit and outfit statistics ranged from 0.74-1.14 and 0.75-1.32, respectively, and targeting 1.54 logits. For the OSDI, person separation was 0.94. None of the three subscales provided valid measurements based on Rasch analysis. For the McMonnies questionnaire, person separation was 1.17, item infit and outfit statistics ranged from 0.7 to 1.21 and 0.51-3.49, respectively. There were weak correlations between questionnaire scores and clinical tests. There were weak correlations between OSDI scores and VA, fluorescein TBUT, Schirmer I testing and corneal fluorescein staining. There were weak correlations between McMonnies scores and VA, fluorescein TBUT, Schirmer I testing, and corneal fluorescein staining and meibomian gland grading. The OCI questionnaire was the only questionnaire that provided valid measurement on the basis of Rasch analysis, although slight multidimensionality was found. There were weak correlations between OCI scores and fluorescein TBUT, Schirmer I testing, and corneal fluorescein staining. Due to this paradoxical disconnect between symptoms and signs and the repeatability of tests, the use of both subjective and objective markers in the clinical management of patients or as endpoints in clinical trials would appear prudent. Copyright © 2017 Elsevier Inc. All rights reserved.
Mathematical problem solving ability of sport students in the statistical study
NASA Astrophysics Data System (ADS)
Sari, E. F. P.; Zulkardi; Putri, R. I. I.
2017-12-01
This study aims to determine the problem-solving ability of sport students of PGRI Palembang semester V in the statistics course. Subjects in this study were sport students of PGRI Palembang semester V which amounted to 31 people. The research method used is quasi experiment type one case shoot study. Data collection techniques in this study use the test and data analysis used is quantitative descriptive statistics. The conclusion of this study shown that the mathematical problem solving ability of PGRI Palembang sport students of V semester in the statistical course is categorized well with the average of the final test score of 80.3.
ERIC Educational Resources Information Center
Duong, Minh Quang
2011-01-01
Testing programs often use multiple test forms of the same test to control item exposure and to ensure test security. Although test forms are constructed to be as similar as possible, they often differ. Test equating techniques are those statistical methods used to adjust scores obtained on different test forms of the same test so that they are…
Rostami, Reza; Sadeghi, Vahid; Zarei, Jamileh; Haddadi, Parvaneh; Mohazzab-Torabi, Saman; Salamati, Payman
2013-04-01
The aim of this study was to compare the Persian version of the wechsler intelligence scale for children - fourth edition (WISC-IV) and cognitive assessment system (CAS) tests, to determine the correlation between their scales and to evaluate the probable concurrent validity of these tests in patients with learning disorders. One-hundered-sixty-two children with learning disorder who were presented at Atieh Comprehensive Psychiatry Center were selected in a consecutive non-randomized order. All of the patients were assessed based on WISC-IV and CAS scores questionnaires. Pearson correlation coefficient was used to analyze the correlation between the data and to assess the concurrent validity of the two tests. Linear regression was used for statistical modeling. The type one error was considered 5% in maximum. There was a strong correlation between total score of WISC-IV test and total score of CAS test in the patients (r=0.75, P<0.001). The correlations among the other scales were mostly high and all of them were statistically significant (P<0.001). A linear regression model was obtained (α = 0.51, β = 0.81 and P<0.001). There is an acceptable correlation between the WISC-IV scales and CAS test in children with learning disorders. A concurrent validity is established between the two tests and their scales.
Rostami, Reza; Sadeghi, Vahid; Zarei, Jamileh; Haddadi, Parvaneh; Mohazzab-Torabi, Saman; Salamati, Payman
2013-01-01
Objective The aim of this study was to compare the Persian version of the wechsler intelligence scale for children - fourth edition (WISC-IV) and cognitive assessment system (CAS) tests, to determine the correlation between their scales and to evaluate the probable concurrent validity of these tests in patients with learning disorders. Methods One-hundered-sixty-two children with learning disorder who were presented at Atieh Comprehensive Psychiatry Center were selected in a consecutive non-randomized order. All of the patients were assessed based on WISC-IV and CAS scores questionnaires. Pearson correlation coefficient was used to analyze the correlation between the data and to assess the concurrent validity of the two tests. Linear regression was used for statistical modeling. The type one error was considered 5% in maximum. Findings There was a strong correlation between total score of WISC-IV test and total score of CAS test in the patients (r=0.75, P<0.001). The correlations among the other scales were mostly high and all of them were statistically significant (P<0.001). A linear regression model was obtained (α = 0.51, β = 0.81 and P<0.001). Conclusion There is an acceptable correlation between the WISC-IV scales and CAS test in children with learning disorders. A concurrent validity is established between the two tests and their scales. PMID:23724180
Padmavathi, P
2014-01-01
Premenstrual syndrome is the most common of gynaecologic complaints. It affects half of all female adolescents today and represents the leading cause of college/school absenteeism among that population. It was sought to assess the effectiveness of acupressure Vs reflexology on premenstrual syndrome among adolescents. Two-group pre-test and post-test true experimental design was adopted for the study. Forty adolescent girls from Government Girls Secondary School, Erode with pre- menstrual syndrome fulfilling the inclusion criteria were selected by simple random sampling. A pre-test was conducted by using premenstrual symptoms assessment scale. Immediately after pre-test acupressure Vs reflexology was given once a week for 6 weeks and again post-test was conducted to assess the effectiveness of treatment. Collected data was analysed by using descriptive and inferential statistics. In post-test, the mean score of the experimental group I sample was 97.3 (SD = 2.5) and the group II mean score was 70:8 (SD = 10.71) with paired 't' value of 19.2 and 31.9. This showed that the reflexology was more effective than acupressure in enhancing the practice of the sample regarding pre-menstrual syndrome. Statistically no significant association was found between the post-test scores of the sample with their demographic variables. The findings imply the need for educating adolescent girls on effective management of pre-menstrual syndrome.
Clock face drawing test performance in children with ADHD.
Ghanizadeh, Ahmad; Safavi, Salar; Berk, Michael
2013-01-01
The utility and discriminatory pattern of the clock face drawing test in ADHD is unclear. This study therefore compared Clock Face Drawing test performance in children with ADHD and controls. 95 school children with ADHD and 191 other children were matched for gender ratio and age. ADHD symptoms severities were assessed using DSM-IV ADHD checklist and their intellectual functioning was assessed. The participants completed three clock-drawing tasks, and the following four functions were assessed: Contour score, Numbers score, Hands setting score, and Center score. All the subscales scores of the three clock drawing tests of the ADHD group were lower than that of the control group. In ADHD children, inattention and hyperactivity/ impulsivity scores were not related to free drawn clock test scores. When pre-drawn contour test was performed, inattentiveness score was statistically associated with Number score while none of the other variables of age, gender, intellectual functioning, and hand use preference were associated with that kind of score. In pre-drawn clock, no association of ADHD symptoms with any CDT subscales found significant. In addition, more errors are observed with free drawn clock and Pre-drawn contour than pre-drawn clock. Putting Numbers and Hands setting are more sensitive measures to screen ADHD than Contour and Center drawing. Test performance, except Hands setting, may have already reached a developmental plateau. It is probable that Hand setting deficit in children with ADHD may not decrease from age 8 to 14 years. Performance of children with ADHD is associated with complexity of CDT.
Pretest online discussion groups to augment teaching and learning.
Kuhn, Jonathan; Hasbargen, Barbara; Miziniak, Halina
2010-01-01
Tests and final examination scores of three semesters of control students in a nursing foundation course were compared with tests and final examination scores of three semesters of participating students. Participating students were offered access to an asynchronous pretest online discussion activity with a faculty e-moderator. While the simplified Bloom's revised taxonomy assisted in creating appropriate preparatory test and final examination questions for pretest online discussion, Salmon's five-stage online method provided direction to the e-moderator on how to encourage students to achieve Bloom's higher-order thinking skills during the pretest online discussions. Statistical analysis showed the pretest online discussion activity had a generally positive impact on tests and final examination scores, when controlling for a number of possible confounding variables, including instructor, cumulative grade point average, age, and credit hours.
Renal dysfunction in liver cirrhosis and its correlation with Child-Pugh score and MELD score
NASA Astrophysics Data System (ADS)
Siregar, G. A.; Gurning, M.
2018-03-01
Renal dysfunction (RD) is a serious and common complication in a patient with liver cirrhosis. It provides a poor prognosis. The aim of our study was to evaluate the renal function in liver cirrhosis, also to determine the correlation with the graduation of liver disease assessed by Child-Pugh Score (CPS) and MELD score. This was a cross-sectional study included patients with liver cirrhosis admitted to Adam Malik Hospital Medan in June - August 2016. We divided them into two groups as not having renal dysfunction (serum creatinine < 1.5 mg/dL) and having renal dysfunction (serum creatinine ≤ 1.5 mg/dL). For the processing of data, SPSS 22.0 was used. Statistical methods used: Chi-square, Fisher exact, one way ANOVA, Kruskal Wallis test and Pearson coefficient of correlation. The level of significance was p<0.05. 55 patients with presented with renal dysfunction were 16 (29.1 %). There was statistically significant inverse correlation between GFR and CPS (r = -0.308), GFR and MELD score (r = -0.278). There was a statistically significant correlation between creatinine and MELD score (r = 0.359), creatinine and CPS (r = 0.382). The increase of the degree of liver damage is related to the increase of renal dysfunction.
Better prognostic marker in ICU - APACHE II, SOFA or SAP II!
Naqvi, Iftikhar Haider; Mahmood, Khalid; Ziaullaha, Syed; Kashif, Syed Mohammad; Sharif, Asim
2016-01-01
This study was designed to determine the comparative efficacy of different scoring system in assessing the prognosis of critically ill patients. This was a retrospective study conducted in medical intensive care unit (MICU) and high dependency unit (HDU) Medical Unit III, Civil Hospital, from April 2012 to August 2012. All patients over age 16 years old who have fulfilled the criteria for MICU admission were included. Predictive mortality of APACHE II, SAP II and SOFA were calculated. Calibration and discrimination were used for validity of each scoring model. A total of 96 patients with equal gender distribution were enrolled. The average APACHE II score in non-survivors (27.97+8.53) was higher than survivors (15.82+8.79) with statistically significant p value (<0.001). The average SOFA score in non-survivors (9.68+4.88) was higher than survivors (5.63+3.63) with statistically significant p value (<0.001). SAP II average score in non-survivors (53.71+19.05) was higher than survivors (30.18+16.24) with statistically significant p value (<0.001). All three tested scoring models (APACHE II, SAP II and SOFA) would be accurate enough for a general description of our ICU patients. APACHE II has showed better calibration and discrimination power than SAP II and SOFA.
Physicians’ attitudes toward pharmacogenetic testing before and after pharmacogenetic education
Luzum, Jasmine A; Luzum, Matthew J
2016-01-01
Aim: Our aim was to evaluate physicians’ attitudes toward pharmacogenetic testing before and after pharmacogenetic education. Methods: In total, 12 physicians (˜40% response rate) completed a survey with eight questions on 10-point scales on their attitudes toward pharmacogenetic testing before and after a 1-h grand rounds presentation on pharmacogenetics. Differences in question scores overall, among training levels (resident/fellow/attending), and specific drugs (clopidogrel/simvastatin/warfarin) were assessed using Wilcoxon signed-rank and exact Kruskal–Wallis tests. Results & conclusion: The scores for all eight questions increased, with statistically significant (p < 0.05) increases for four out of eight questions. The scores were similar among training levels, but the postscores for clopidogrel were significantly higher than for simvastatin and warfarin. In conclusion, brief pharmacogenetic education can significantly affect physicians’ attitudes toward pharmacogenetic testing. PMID:29749904
Alshahrani, Mohammed S
2017-01-01
To assess the effect of the mode of transportation of trauma patients (emergency medical service [EMS] vs. non-EMS) on their final clinical outcome in terms of mortality and length of hospital stay. A retrospective study included all patients who were involved in motor vehicle crashes, and who were transferred immediately to an emergency department of a trauma care center from December 2008 to December 2012. Patients were classified into two groups: those brought through EMS and those brought by non-EMS (private transport). Information on demographic characteristics including age and gender was recorded and medical data such as blood pressure, pulse, oxygen saturation, temperature, initial Glasgow Coma Score (GCS), saturation, temperature, initial Glasgow Coma Score (GCS), injury severity score (ISS), and final outcome (discharged or expired) were obtained. Descriptive statistics, mean and standard deviation (SD) were computed for continuous variables and statistical significance was tested by t -test or Mann-Whitney U-test. Categorical variables were described by frequency distribution and percentages; Chi-square or Fisher's exact test as appropriate were employed to test for statistical significance. Logistics regression was performed with mortality as dependent variable and mode of transport and all demographic and prehospital variables as independent variables. A general linear model analysis was performed to test whether the mode of transport was significant to length of hospital stay in EMS and non-EMS clients. Out of 308 patients identified during the study period, 232 were transported through EMS and 76 through non-EMS. The two groups were similar with regard to mortality and length of stay. The crude mortality rate was 30.6% (95% confidence interval [CI]: 24.64-36.53) in the EMS group and 28.9% (95% CI: 18.44-38.76) in the non-EMS group ( p = 0.785). The average length of hospital stay was 9 days (interquartile range [IQR] = 8, 95% CI: 7.3-10.1) for the EMS group and 8 days (IQR = 9.5, 95% CI: 6.7-10.9) for the non-EMS group ( p = 0.803). Multivariate analysis showed that of the study variables, only the injury severity score (ISS) and Glasgow coma score (GCS) were significant to mortality ( p < 0.01), and GCS was more significant to the length of hospital stay ( p < 0.01). There was no significant difference between the EMS and non-EMS groups as they relate to mortality and length of stay in hospital. However, the mortality and length of hospital stay was statistically significant to ISS and GCS.
Saadati, Farzaneh; Ahmad Tarmizi, Rohani
2015-01-01
Because students’ ability to use statistics, which is mathematical in nature, is one of the concerns of educators, embedding within an e-learning system the pedagogical characteristics of learning is ‘value added’ because it facilitates the conventional method of learning mathematics. Many researchers emphasize the effectiveness of cognitive apprenticeship in learning and problem solving in the workplace. In a cognitive apprenticeship learning model, skills are learned within a community of practitioners through observation of modelling and then practice plus coaching. This study utilized an internet-based Cognitive Apprenticeship Model (i-CAM) in three phases and evaluated its effectiveness for improving statistics problem-solving performance among postgraduate students. The results showed that, when compared to the conventional mathematics learning model, the i-CAM could significantly promote students’ problem-solving performance at the end of each phase. In addition, the combination of the differences in students' test scores were considered to be statistically significant after controlling for the pre-test scores. The findings conveyed in this paper confirmed the considerable value of i-CAM in the improvement of statistics learning for non-specialized postgraduate students. PMID:26132553
Results of the Intelligence Test for Visually Impaired Children (ITVIC).
ERIC Educational Resources Information Center
Dekker, R.; And Others
1991-01-01
Statistical analyses of scores on subtests of the Intelligence Test for Visually Impaired Children were done for two groups of children, either with or without usable vision. Results suggest that the battery has differential factorial and predictive validity. (Author/DB)
Knowledge-Sharing Intention among Information Professionals in Nigeria: A Statistical Analysis
ERIC Educational Resources Information Center
Tella, Adeyinka
2016-01-01
In this study, the researcher administered a survey and developed and tested a statistical model to examine the factors that determine the intention of information professionals in Nigeria to share knowledge with their colleagues. The result revealed correlations between the overall score for intending to share knowledge and other…
ERIC Educational Resources Information Center
Harder, Valerie S.; Stuart, Elizabeth A.; Anthony, James C.
2010-01-01
There is considerable interest in using propensity score (PS) statistical techniques to address questions of causal inference in psychological research. Many PS techniques exist, yet few guidelines are available to aid applied researchers in their understanding, use, and evaluation. In this study, the authors give an overview of available…
ERIC Educational Resources Information Center
Koch, Bevan; Slate, John R.; Moore, George W.
2016-01-01
We compared the performance of Hispanic students from California, Texas, and Arizona on the two Advanced Placement (AP) English exams (i.e., English Language and Composition and English Literature and Composition) using archival data from the College Board from 1997 through 2012. Pearson chi-square tests yielded statistically significant…
Wang, Yunpeng; Thompson, Wesley K.; Schork, Andrew J.; Holland, Dominic; Chen, Chi-Hua; Bettella, Francesco; Desikan, Rahul S.; Li, Wen; Witoelar, Aree; Zuber, Verena; Devor, Anna; Nöthen, Markus M.; Rietschel, Marcella; Chen, Qiang; Werge, Thomas; Cichon, Sven; Weinberger, Daniel R.; Djurovic, Srdjan; O’Donovan, Michael; Visscher, Peter M.; Andreassen, Ole A.; Dale, Anders M.
2016-01-01
Most of the genetic architecture of schizophrenia (SCZ) has not yet been identified. Here, we apply a novel statistical algorithm called Covariate-Modulated Mixture Modeling (CM3), which incorporates auxiliary information (heterozygosity, total linkage disequilibrium, genomic annotations, pleiotropy) for each single nucleotide polymorphism (SNP) to enable more accurate estimation of replication probabilities, conditional on the observed test statistic (“z-score”) of the SNP. We use a multiple logistic regression on z-scores to combine information from auxiliary information to derive a “relative enrichment score” for each SNP. For each stratum of these relative enrichment scores, we obtain nonparametric estimates of posterior expected test statistics and replication probabilities as a function of discovery z-scores, using a resampling-based approach that repeatedly and randomly partitions meta-analysis sub-studies into training and replication samples. We fit a scale mixture of two Gaussians model to each stratum, obtaining parameter estimates that minimize the sum of squared differences of the scale-mixture model with the stratified nonparametric estimates. We apply this approach to the recent genome-wide association study (GWAS) of SCZ (n = 82,315), obtaining a good fit between the model-based and observed effect sizes and replication probabilities. We observed that SNPs with low enrichment scores replicate with a lower probability than SNPs with high enrichment scores even when both they are genome-wide significant (p < 5x10-8). There were 693 and 219 independent loci with model-based replication rates ≥80% and ≥90%, respectively. Compared to analyses not incorporating relative enrichment scores, CM3 increased out-of-sample yield for SNPs that replicate at a given rate. This demonstrates that replication probabilities can be more accurately estimated using prior enrichment information with CM3. PMID:26808560
Paige, John T; Garbee, Deborah D; Kozmenko, Valeriy; Yu, Qingzhao; Kozmenko, Lyubov; Yang, Tong; Bonanno, Laura; Swartz, William
2014-01-01
Effective teamwork in the operating room (OR) is often undermined by the "silo mentality" of the differing professions. Such thinking is formed early in one's professional experience and is fostered by undergraduate medical and nursing curricula lacking interprofessional education. We investigated the immediate impact of conducting interprofessional student OR team training using high-fidelity simulation (HFS) on students' team-related attitudes and behaviors. Ten HFS OR interprofessional student team training sessions were conducted involving 2 standardized HFS scenarios, each of which was followed by a structured debriefing that targeted team-based competencies. Pre- and post-session mean scores were calculated and analyzed for 15 Likert-type items measuring self-efficacy in teamwork competencies using the t-test. Additionally, mean scores of observer ratings of team performance after each scenario and participant ratings after the second scenario for an 11-item Likert-type teamwork scale were calculated and analyzed using one-way ANOVA and t-test. Eighteen nursing students, 20 nurse anesthetist students, and 28 medical students participated in the training. Statistically significant gains from mean pre- to post-training scores occurred on 11 of the 15 self-efficacy items. Statistically significant gains in mean observer performance scores were present on all 3 subscales of the teamwork scale from the first scenario to the second. A statistically significant difference was found in comparisons of mean observer scores with mean participant scores for the team-based behaviors subscale. High-fidelity simulation OR interprofessional student team training improves students' team-based attitudes and behaviors. Students tend to overestimate their team-based behaviors. Copyright © 2014 American College of Surgeons. Published by Elsevier Inc. All rights reserved.
Østergaard, Mia L; Nielsen, Kristina R; Albrecht-Beste, Elisabeth; Konge, Lars; Nielsen, Michael B
2018-01-01
This study aimed to develop a test with validity evidence for abdominal diagnostic ultrasound with a pass/fail-standard to facilitate mastery learning. The simulator had 150 real-life patient abdominal scans of which 15 cases with 44 findings were selected, representing level 1 from The European Federation of Societies for Ultrasound in Medicine and Biology. Four groups of experience levels were constructed: Novices (medical students), trainees (first-year radiology residents), intermediates (third- to fourth-year radiology residents) and advanced (physicians with ultrasound fellowship). Participants were tested in a standardized setup and scored by two blinded reviewers prior to an item analysis. The item analysis excluded 14 diagnoses. Both internal consistency (Cronbach's alpha 0.96) and inter-rater reliability (0.99) were good and there were statistically significant differences (p < 0.001) between all four groups, except the intermediate and advanced groups (p = 1.0). There was a statistically significant correlation between experience and test scores (Pearson's r = 0.82, p < 0.001). The pass/fail-standard failed all novices (no false positives) and passed all advanced (no false negatives). All intermediate participants and six out of 14 trainees passed. We developed a test for diagnostic abdominal ultrasound with solid validity evidence and a pass/fail-standard without any false-positive or false-negative scores. • Ultrasound training can benefit from competency-based education based on reliable tests. • This simulation-based test can differentiate between competency levels of ultrasound examiners. • This test is suitable for competency-based education, e.g. mastery learning. • We provide a pass/fail standard without false-negative or false-positive scores.
The effect of adhesive dressing edges on cutaneous irritancy and skin barrier function.
Dykes, P J
2007-03-01
To assess the effect of repeated application and removal of adhesive edges from wound-care products on cutaneous irritancy and barrier function in normal volunteer subjects. This was a study using a 'repeat-insult patch test'. Adhesive edges from six commonly used wound-care products were applied continuously to the same site (six applications over a 14-day period) in 30 normal volunteer subjects. The test sites were assessed clinically before product reapplication using established ranking scales for cutaneous erythema. The cumulative irritancy score (CIS) for each test site was determined by adding the erythema scores at days 3, 5, 8, 10, 12 and 15. At the study end the barrier function of each test site was assessed by measuring transepidermal water loss (TEWL). The CIS showed that the products fall into two distinct groups, with Mepilex, Tielle and Allevyn giving low scores and Biatain, Comfeel and DuoDERM higher scores. Statistical analysis indicated significant differences (p < 0.05) between Mepilex and Biatain, Mepilex and Comfeel, Mepilex and DuoDERM, Tielle and Biatain, Allevyn and Biatain. The mean TEWL values also indicated that the products fall into two distinct groups: Mepilex, Tielle and Allevyn with low mean values close to that of normal adjacent back skin and Biatain, Comfeel and DuoDERM with much higher mean values. Statistical analysis indicated that Mepilex, Tielle and Allevyn were not significantly different from normal skin (p < 0.05), whereas Biatain, Comfeel and DuoDERM were significantly higher than normal skin and the other products tested. The results show clear differences between products; the clinical scores and TEWL measurements indicate that the products fall into two distinct groups. This novel approach seems able to discriminate between adhesive borders and may be useful during product development and in selecting products for clinical trials.
Yang, Xin-Wei; Wang, Zhi-Ming; Jin, Tai-Yi
2006-05-01
This study was conducted to assess occupational stress in different gender, age, work duration, educational level and marital status group. A test of occupational stress in different gender, age, work duration, educational level and marital status group, was carried out with revised occupational stress inventory (OSI-R) for 4278 participants. The results of gender show that there are heavier occupational role, stronger interpersonal and physical strain in male than that in female, and the differences are statistically significant (P < 0.01). The score of recreation in the male is higher than that in female, but the score of self-care in the female is higher than that in male, and the differences are statistically significant (P < 0.01). Difference in the scores of occupational role, personal resource among various age groups is significant (P < 0.01). Vocational, interpersonal strain scores among various age groups is significant (P < 0.05). The results of educational level analyses suggest that the difference in the scores of occupational stress and strain among various educational levels show statistically significant (P < 0.05), whereas there are no statistic significance of coping resources among the groups (P > 0.05). The occupational stress so as to improve the work ability of different groups. Different measure should be taken to reduce the occupational stress so as to improve the work ability of different groups.
[Propensity score matching in SPSS].
Huang, Fuqiang; DU, Chunlin; Sun, Menghui; Ning, Bing; Luo, Ying; An, Shengli
2015-11-01
To realize propensity score matching in PS Matching module of SPSS and interpret the analysis results. The R software and plug-in that could link with the corresponding versions of SPSS and propensity score matching package were installed. A PS matching module was added in the SPSS interface, and its use was demonstrated with test data. Score estimation and nearest neighbor matching was achieved with the PS matching module, and the results of qualitative and quantitative statistical description and evaluation were presented in the form of a graph matching. Propensity score matching can be accomplished conveniently using SPSS software.
Dividing the Force Concept Inventory into two equivalent half-length tests
NASA Astrophysics Data System (ADS)
Han, Jing; Bao, Lei; Chen, Li; Cai, Tianfang; Pi, Yuan; Zhou, Shaona; Tu, Yan; Koenig, Kathleen
2015-06-01
The Force Concept Inventory (FCI) is a 30-question multiple-choice assessment that has been a building block for much of the physics education research done today. In practice, there are often concerns regarding the length of the test and possible test-retest effects. Since many studies in the literature use the mean score of the FCI as the primary variable, it would be useful then to have different shorter tests that can produce FCI-equivalent scores while providing the benefits of being quicker to administer and overcoming the test-retest effects. In this study, we divide the 1995 version of the FCI into two half-length tests; each contains a different subset of the original FCI questions. The two new tests are shorter, still cover the same set of concepts, and produce mean scores equivalent to those of the FCI. Using a large quantitative data set collected at a large midwestern university, we statistically compare the assessment features of the two half-length tests and the full-length FCI. The results show that the mean error of equivalent scores between any two of the three tests is within 3%. Scores from all tests are well correlated. Based on the analysis, it appears that the two half-length tests can be a viable option for score based assessment that need to administer tests quickly or need to measure short-term gains where using identical pre- and post-test questions is a concern.
Grigoriadis, Themos; Giannoulis, George; Zacharakis, Dimitris; Protopapas, Athanasios; Cardozo, Linda; Athanasiou, Stavros
2016-03-01
The purpose of the study was to examine whether a test performed during urodynamics, the "1-3-5 cough test", could determine the severity of urodynamic stress incontinence (USI). We included women referred for urodynamics who were diagnosed with USI. The "1-3-5 cough test" was performed to grade the severity of USI at the completion of filling cystometry. A diagnosis of "severe", "moderate" or "mild" USI was given if urine leakage was observed after one, three or five consecutive coughs respectively. We examined the associations between grades of USI severity and measures of subjective perception of stress urinary incontinence (SUI): International Consultation of Incontinence Modular Questionnaire-Female Lower Urinary Tract Symptom (ICIQ-FLUTS), King's Health Questionnaire (KHQ), Urinary Distress Inventory-6 (UDI-6), Urinary Impact Questionnaire-7 (UIQ-7). A total of 1,181 patients completed the ICIQ-FLUTS and KHQ and 612 completed the UDI-6 and UIQ-7 questionnaires. There was a statistically significant association of higher grades of USI severity with higher scores of the incontinence domain of the ICIQ-FLUTS. The scores of the UDI-6, UIQ-7 and of all KHQ domains (with the exception of general health perception and personal relationships) had statistically significant larger mean values for higher USI severity grade. Groups of higher USI severity had statistically significant associations with higher scores of most of the subjective measures of SUI. Severity of USI, as defined by the "1-3-5 cough test", was associated with the severity of subjective measures of SUI. This test may be a useful tool for the objective interpretation of patients with SUI who undergo urodynamics.
Pharmacy students' test-taking motivation-effort on a low-stakes standardized test.
Waskiewicz, Rhonda A
2011-04-11
To measure third-year pharmacy students' level of motivation while completing the Pharmacy Curriculum Outcomes Assessment (PCOA) administered as a low-stakes test to better understand use of the PCOA as a measure of student content knowledge. Student motivation was manipulated through an incentive (ie, personal letter from the dean) and a process of statistical motivation filtering. Data were analyzed to determine any differences between the experimental and control groups in PCOA test performance, motivation to perform well, and test performance after filtering for low motivation-effort. Incentivizing students diminished the need for filtering PCOA scores for low effort. Where filtering was used, performance scores improved, providing a more realistic measure of aggregate student performance. To ensure that PCOA scores are an accurate reflection of student knowledge, incentivizing and/or filtering for low motivation-effort among pharmacy students should be considered fundamental best practice when the PCOA is administered as a low-stakes test.
ERIC Educational Resources Information Center
Noser, Thomas C.; Tanner, John R.; Shah, Situl
2008-01-01
The purpose of this study was to measure the comprehension of basic mathematical skills of students enrolled in statistics classes at a large regional university, and to determine if the scores earned on a basic math skills test are useful in forecasting student performance in these statistics classes, and to determine if students' basic math…
ERIC Educational Resources Information Center
Kadel, Robert
2004-01-01
To her surprise, Ms. Logan had just conducted a statistical analysis of her 10th grade biology students' quiz scores. The results indicated that she needed to reinforce mitosis before the students took the high-school proficiency test in three weeks, as required by the state. "Oh! That's easy!" She exclaimed. Teachers like Ms. Logan are…
Abstracts of ARI Research Publications, FY 1978
1980-09-01
initial item pool, 49 items were identified as having signifi- cant item-to-total-score correlations and were statistically determined to address a...failing. Differences among the three groups on main gun performance measures and the previous experience of gun- ners were not statistically significant...forms of the noncognitive cod- ing speed test; and (d) a second field administration to derive norms and other statistical characteristics of the new
Posada, David
2006-01-01
ModelTest server is a web-based application for the selection of models of nucleotide substitution using the program ModelTest. The server takes as input a text file with likelihood scores for the set of candidate models. Models can be selected with hierarchical likelihood ratio tests, or with the Akaike or Bayesian information criteria. The output includes several statistics for the assessment of model selection uncertainty, for model averaging or to estimate the relative importance of model parameters. The server can be accessed at . PMID:16845102
Poole, Kerry; Mason, Howard
2007-03-15
To establish the relationship between quantitative tests of hand function and upper limb disability, as measured by the Disability of the Arm, Shoulder and Hand (DASH) questionnaire, in hand-arm vibration syndrome (HAVS). A total of 228 individuals with HAVS were included in this study. Each had undergone a full HAVS assessment by an experienced physician, including quantitative tests of vibrotactile and thermal perception thresholds, maximal hand-grip strength (HG) and the Purdue pegboard (PP) test. Individuals were also asked to complete a DASH questionnaire. PP and HG of the quantitative tests gave the best and statistically significant individual correlations with the DASH disability score (r2 = 0.168 and 0.096). Stepwise linear regression analysis revealed that only PP and HG measurements were statistically significant predictors of upper limb disability (r2 = 0.178). Overall a combination of the PP and HG measurements, rather than each alone, gave slightly better discrimination, although not statistically significant, between normal and abnormal DASH scores with a sensitivity of 73.1% and specificity of 64.3%. Measurements of manual dexterity and hand-grip strength using PP and HG may be useful in helping to confirm lack of upper limb function and 'perceived' disability in HAVS.
Hostility and Learning: A Follow-Up Note
ERIC Educational Resources Information Center
Costin, Frank
1971-01-01
Hostility scores, as measured by the Scrambled Sentence Test, were found to be statistically significant negative predictors of course achievement in a meteorology course at a military installation. (MS)
Depression and Self-Esteem in Early Adolescence.
Tripković, Ingrid; Roje, Romilda; Krnić, Silvana; Nazor, Mirjana; Karin, Željka; Čapkun, Vesna
2015-06-01
Depression prevalence has increased in the last few decades, affecting younger age groups. The aim of this research was to determine the range of depression and low self-esteem in elementary school children in the city of Split. Testing was carried out at school and the sample comprised 1,549 children (714 boys and 832 girls, aged 13). Two psychological instruments were used: the Coopersmith Self-Esteem Inventory (SEI) and the Children and Adolescent Depression Scale (SDD). The average value of scores obtained by SEI test was 17.8 for all tested children. No statistically significant difference was found be-tween boys and girls. It was found that 11.9% of children showed signs of clinically significant depression, and 16.2% showed signs of depression. Statistically significant association between low self-esteem and clinically significant depression was found. No statistically significant difference among boys and girls according to dimension of cognitive depression was found, whereas statistically significant level of emotional depression was higher in girls than boys. It was found that both dimensions of depression decreased proportionally with the increase of SEI test score values: cognitive and emotional dimension of depression. The results of this study show that it is necessary to provide early detection of emotional difficulties in order to prevent serious mental disorders. Copyright© by the National Institute of Public Health, Prague 2015.
A Nonparametric K-Sample Test for Equality of Slopes.
ERIC Educational Resources Information Center
Penfield, Douglas A.; Koffler, Stephen L.
1986-01-01
The development of a nonparametric K-sample test for equality of slopes using Puri's generalized L statistic is presented. The test is recommended when the assumptions underlying the parametric model are violated. This procedure replaces original data with either ranks (for data with heavy tails) or normal scores (for data with light tails).…
Outlier Detection in High-Stakes Certification Testing.
ERIC Educational Resources Information Center
Meijer, Rob R.
2002-01-01
Used empirical data from a certification test to study methods from statistical process control that have been proposed to classify an item score pattern as fitting or misfitting the underlying item response theory model in computerized adaptive testing. Results for 1,392 examinees show that different types of misfit can be distinguished. (SLD)
Is Cognitive Test-Taking Anxiety Associated With Academic Performance Among Nursing Students?
Duty, Susan M; Christian, Ladonna; Loftus, Jocelyn; Zappi, Victoria
2016-01-01
The cognitive component of test anxiety was correlated with academic performance among nursing students. Modest but statistically significant lower examination grade T scores were observed for students with high compared with low levels of cognitive test anxiety (CTA). High levels of CTA were associated with reduced academic performance.
Detection of Item Preknowledge Using Likelihood Ratio Test and Score Test
ERIC Educational Resources Information Center
Sinharay, Sandip
2017-01-01
An increasing concern of producers of educational assessments is fraudulent behavior during the assessment (van der Linden, 2009). Benefiting from item preknowledge (e.g., Eckerly, 2017; McLeod, Lewis, & Thissen, 2003) is one type of fraudulent behavior. This article suggests two new test statistics for detecting individuals who may have…
Lods, wrods, and mods: the interpretation of lod scores calculated under different models.
Hodge, S E; Elston, R C
1994-01-01
In this paper we examine the relationships among classical lod scores, "wrod" scores (lod scores calculated under the wrong genetic model), and "mod" scores (lod scores maximized over genetic model parameters). We compare the behavior of these scores when the state of nature is linkage to their behavior when the state of nature is no linkage. We describe sufficient conditions for mod scores to be valid and discuss their use to determine the correct genetic model. We show that lod scores represent a likelihood-ratio test for independence. We explain the "ascertainment-assumption-free" aspect of using mod scores to determine mode of inheritance and we set this aspect into a well-established statistical framework. Finally, we summarize practical guidelines for the use of mod scores.
Self, D J; Schrader, D E; Baldwin, D C; Wolinsky, F D
1993-01-01
Medicine endorses a code of ethics and encourages a high moral character among doctors. This study examines the influence of medical education on the moral reasoning and development of medical students. Kohlberg's Moral Judgment Interview was given to a sample of 20 medical students (41.7% of students in that class). The students were tested at the beginning and at the end of their medical course to determine whether their moral reasoning scores had increased to the same extent as other people who extend their formal education. It was found that normally expected increases in moral reasoning scores did not occur over the 4 years of medical education for these students, suggesting that their educational experience somehow inhibited their moral reasoning ability rather than facilitating it. With a range of moral reasoning scores between 315 and 482, the finding of a mean increase from first year to fourth year of 18.5 points was not statistically significant at the P < or = 0.05 level. Statistical analysis revealed no significant correlations at the P < or = 0.05 level between the moral reasoning scores and age, gender, Medical College Admission Test scores, or grade point average scores. Along with a brief description of Kohlberg's cognitive moral development theory, some interpretations and explanations are given for the findings of the study.
Sachan, D; Gupta, N; Agarwal, P; Chaudhary, R
2011-08-01
Heparin-induced thrombocytopenia (HIT) should be diagnosed clinically as well as by laboratory assays for timely recognition, prevention and management of complications. To evaluate the clinical utility of pre-test clinical scoring system in combination with two immunoassays for the diagnosis of HIT in cardiac surgery patients. A total of 100 consecutive patients undergoing cardiac surgery were studied. Pre-test clinical scoring was carried out in patients with thrombocytopenia and further tested by two immunoassays, i.e., Heparin platelet factor 4 (H-PF4) enzyme-linked immunosorbent assay (ELISA) and particle gel immunoassay (PaGIA). Of the 100 patients studied, 42 patients developed thrombocytopenia post-operatively. On pre-test clinical scoring, low T-score was observed in 6 patients, intermediate in 28 and high score in 8 patients, whereas 19 patients (45.2%) were positive by H-PF4 ELISA and 10 (23.8%) by PaGIA for H-PF4 antibody. The difference in the incidence of clinically significant HIT antibodies in the three categories was statistically significant. A good correlation was also observed with ELISA optical density, T-scoring and PaGIA. Pre-test clinical scoring correlates well with the development of H-PF4 antibodies which are incriminated in the causation of thrombotic complications in patients with HIT. We also propose a protocol for diagnosing patients with clinical suspicion of HIT using pre-test clinical scoring and immunoassay. © 2011 The Authors. Transfusion Medicine © 2011 British Blood Transfusion Society.
Lachin, John M
2011-11-10
The power of a chi-square test, and thus the required sample size, are a function of the noncentrality parameter that can be obtained as the limiting expectation of the test statistic under an alternative hypothesis specification. Herein, we apply this principle to derive simple expressions for two tests that are commonly applied to discrete ordinal data. The Wilcoxon rank sum test for the equality of distributions in two groups is algebraically equivalent to the Mann-Whitney test. The Kruskal-Wallis test applies to multiple groups. These tests are equivalent to a Cochran-Mantel-Haenszel mean score test using rank scores for a set of C-discrete categories. Although various authors have assessed the power function of the Wilcoxon and Mann-Whitney tests, herein it is shown that the power of these tests with discrete observations, that is, with tied ranks, is readily provided by the power function of the corresponding Cochran-Mantel-Haenszel mean scores test for two and R > 2 groups. These expressions yield results virtually identical to those derived previously for rank scores and also apply to other score functions. The Cochran-Armitage test for trend assesses whether there is an monotonically increasing or decreasing trend in the proportions with a positive outcome or response over the C-ordered categories of an ordinal independent variable, for example, dose. Herein, it is shown that the power of the test is a function of the slope of the response probabilities over the ordinal scores assigned to the groups that yields simple expressions for the power of the test. Copyright © 2011 John Wiley & Sons, Ltd.
A comparison of microscopic ink characteristics of 35 commercially available surgical margin inks.
Milovancev, Milan; Löhr, Christiane V; Bildfell, Robert J; Gelberg, Howard B; Heidel, Jerry R; Valentine, Beth A
2013-11-01
To compare microscopic characteristics of commercially available surgical margin inks used for surgical pathology specimens. Prospective in vitro study. Thirty-five different surgical margin inks (black, blue, green, orange, red, violet, and yellow from 5 different manufacturers). Inks were applied to uniform, single-source, canine cadaveric full-thickness ventral abdominal tissue blocks. Tissue blocks and ink manufacturers were randomly paired and each color was applied to a length of the cut tissue margin. After drying, tissues were fixed in formalin, and 3 radial slices were obtained from each color section and processed for routine histologic evaluation, yielding 105 randomly numbered slides with each manufacturer's color represented in triplicate. Slides were evaluated by 5 blinded, board-certified veterinary anatomic pathologists using a standardized scoring scheme. Statistical analyses were performed to evaluate for ink manufacturer effects on scores, correlation among different subjective variables, and pathologist agreement. Black and blue had the most consistently high scores whereas red and violet had the most consistently low overall scores, across all manufacturers. All colors tested, except yellow, had statistically significant differences in overall scores among individual manufacturers. Overall score was significantly correlated to all other subjective microscopic scores evaluated. The average Spearman correlation coefficient among the 10 pairwise pathologists overall ink scores was 0.60. There are statistically significant differences in microscopic ink characteristics among manufacturers, with a notable degree of inter-pathologist agreement. © Copyright 2013 by The American College of Veterinary Surgeons.
Jamil, Muhammad Nasir; Farooq, Umer; Sultan, Babar; Khan, Raza Muhammad
2016-01-01
Uncomplicated urinary tract infections (UTIs) are the most common bacterial infections among women presenting to primary care causing rapidly increasing strains of resistant bacteria to the growing antibiotic industry. Restricting antibiotics to necessary indications is the only solution. The objectives of the study were to compare the efficacy of symptomatic treatment vs antibiotic in patients with uncomplicated UTI, in terms of individual symptom score, i.e., frequency, urgency, dysuria, supra pubic pain scores and total symptoms scores. A randomized control trial (RCT) in 100 women (15-50 years) with symptoms of urinary frequency, urgency, dysuria and pain supra pubic region, associated with uncomplicated UTI, at Urology department, AMI, Abbottabad. Two treatment strategies were compared in uncomplicated UTI patient). Patients were randomized to antibiotic or symptomatic treatment groups on consecutive non-probability basis (50 in each group) given for 05 days. Efficacy of medications was assessed by comparing pre and post treatment symptom scores along with the post treatment scores of both groups compared to see statistical significance of difference by independent samples t-test. There was a statistically significant difference in symptoms improvement in both treatment arms of all scores, i.e., p-value=0.000. Whereas only dysuria score was able to show a statistically significance of difference in post Rx scores comparison of both groups, p-value=0.004. Symptomatic treatment is not inferior to antibiotic treatment when proper patient selection is undertaken, resulting in decreased need for unnecessary antibiotics use.
Translation and Validation of the Knee Society Score - KSS for Brazilian Portuguese
Silva, Adriana Lucia Pastore e; Demange, Marco Kawamura; Gobbi, Riccardo Gomes; da Silva, Tânia Fernanda Cardoso; Pécora, José Ricardo; Croci, Alberto Tesconi
2012-01-01
Objective To translate, culturally adapt and validate the "Knee Society Score"(KSS) for the Portuguese language and determine its measurement properties, reproducibility and validity. Methods We analyzed 70 patients of both sexes, aged between 55 and 85 years, in a cross-sectional clinical trial, with diagnosis of primary osteoarthritis ,undergoing total knee arthroplasty surgery. We assessed the patients with the English version of the KSS questionnaire and after 30 minutes with the Portuguese version of the KSS questionnaire, done by a different evaluator. All the patients were assessed preoperatively, and again at three, and six months postoperatively. Results There was no statistical difference, using Cronbach's alpha index and the Bland-Altman graphical analysis, for the knees core during the preoperative period (p =1), and at three months (p =0.991) and six months postoperatively (p =0.985). There was no statistical difference for knee function score for all three periods (p =1.0). Conclusion The Brazilian version of the Knee Society Score is easy to apply, as well providing as a valid and reliable instrument for measuring the knee score and function of Brazilian patients undergoing TKA. Level of Evidence: Level I - Diagnostic Studies- Investigating a Diagnostic Test- Testing of previously developed diagnostic criteria on consecutive patients (with universally applied 'gold' reference standard). PMID:24453576
Strom, Suzanne L; Anderson, Craig L; Yang, Luanna; Canales, Cecilia; Amin, Alpesh; Lotfipour, Shahram; McCoy, C Eric; Osborn, Megan Boysen; Langdorf, Mark I
2015-11-01
Traditional Advanced Cardiac Life Support (ACLS) courses are evaluated using written multiple-choice tests. High-fidelity simulation is a widely used adjunct to didactic content, and has been used in many specialties as a training resource as well as an evaluative tool. There are no data to our knowledge that compare simulation examination scores with written test scores for ACLS courses. To compare and correlate a novel high-fidelity simulation-based evaluation with traditional written testing for senior medical students in an ACLS course. We performed a prospective cohort study to determine the correlation between simulation-based evaluation and traditional written testing in a medical school simulation center. Students were tested on a standard acute coronary syndrome/ventricular fibrillation cardiac arrest scenario. Our primary outcome measure was correlation of exam results for 19 volunteer fourth-year medical students after a 32-hour ACLS-based Resuscitation Boot Camp course. Our secondary outcome was comparison of simulation-based vs. written outcome scores. The composite average score on the written evaluation was substantially higher (93.6%) than the simulation performance score (81.3%, absolute difference 12.3%, 95% CI [10.6-14.0%], p<0.00005). We found a statistically significant moderate correlation between simulation scenario test performance and traditional written testing (Pearson r=0.48, p=0.04), validating the new evaluation method. Simulation-based ACLS evaluation methods correlate with traditional written testing and demonstrate resuscitation knowledge and skills. Simulation may be a more discriminating and challenging testing method, as students scored higher on written evaluation methods compared to simulation.
Aşkar, Petek; Altun, Arif; Cangöz, Banu; Cevik, Vildan; Kaya, Galip; Türksoy, Hasan
2012-04-01
The purpose of this study was to assess whether a computerized battery of neuropsychological tests could produce similar results as the conventional forms. Comparisons on 77 volunteer undergraduates were carried out with two neuropsychological tests: Line Orientation Test and Enhanced Cued Recall Test. Firstly, students were assigned randomly across the test medium (paper-and-pencil versus computerized). Secondly, the groups were given the same test in the other medium after a 30-day interval between tests. Results showed that the Enhanced Cued Recall Test-Computer-based did not correlate with the Enhanced Cued Recall Test-Paper-and-pencil results. Line Orientation Test-Computer-based scores, on the other hand, did correlate significantly with the Line Orientation Test-Paper-and-pencil version. In both tests, scores were higher on paper-and-pencil tests compared to computer-based tests. Total score difference between modalities was statistically significant for both Enhanced Cued Recall Tests and for the Line Orientation Test. In both computer-based tests, it took less time for participants to complete the tests.
Hsieh, Cheng-Yang; Lee, Cheng-Han; Wu, Darren Philbert; Sung, Sheng-Feng
2018-05-01
Early detection of atrial fibrillation after stroke is important for secondary prevention in stroke patients without known atrial fibrillation (AF). We aimed to compare the performance of CHADS 2 , CHA 2 DS 2 -VASc and HATCH scores in predicting AF detected after stroke (AFDAS) and to test whether adding stroke severity to the risk scores improves predictive performance. Adult patients with first ischemic stroke event but without a prior history of AF were retrieved from a nationwide population-based database. We compared C-statistics of CHADS 2 , CHA 2 DS 2 -VASc and HATCH scores for predicting the occurrence of AFDAS during stroke admission (cohort I) and during follow-up after hospital discharge (cohort II). The added value of stroke severity to prediction models was evaluated using C-statistics, net reclassification improvement, and integrated discrimination improvement. Cohort I comprised 13,878 patients and cohort II comprised 12,567 patients. Among them, 806 (5.8%) and 657 (5.2%) were diagnosed with AF, respectively. The CHADS 2 score had the lowest C-statistics (0.558 in cohort I and 0.597 in cohort II), whereas the CHA 2 DS 2 -VASc score had comparable C-statistics (0.603 and 0.644) to the HATCH score (0.612 and 0.653) in predicting AFDAS. Adding stroke severity to each of the three risk scores significantly increased the model performance. In stroke patients without known AF, all three risk scores predicted AFDAS during admission and follow-up, but with suboptimal discrimination. Adding stroke severity improved their predictive abilities. These risk scores, when combined with stroke severity, may help prioritize patients for continuous cardiac monitoring in daily practice. Copyright © 2018 Elsevier B.V. All rights reserved.
Bağci Bosi, A Tülay; Camur, Derya; Güler, Cağatay
2007-11-01
This study has been carried out to "identify highly sensitive behavior on healthy nutrition (orthorexia nervosa-ON)" in residence medical doctors (MD) in the Faculty of Medicine. Diagnoses of ON was based on the presence of a disorder with obsessive-compulsive personality. The study is a cross-sectional research, which reached out to the entire 318 MD. The ORTO-15 test was used to propose a diagnostic proceeding and to try verify the prevalence of ON. Those subjects who were classified below 40 from the ORTO-15 test are accepted to have ON. Chi-square test, ANOVA (univariate) analysis and logistic regression were used for analyses of the data. Mean score of the participants from the ORTO-15 test is 39.8+/-0.22, and there is no statistical difference between women and men. A total of 45.5% of the residence MD involved in the research scored below 40 in the ORTO-15 test. Those who do their food shopping themselves, skip a meal with a salad/fruit, care about the quality of the things they eat, think that eating outside is healthy, look at the content of what they eat and the content of food is important in selection of a product score lower in their average marks in ORTO-15 and the difference among the groups is statistically significant. Food selection of 20.1% of the male participants and 38.9% of the female participants among the residence MD is influenced by the programs on nutrition/health in mass-media. The difference between the groups is statistically significant (p<0.05). Female medical doctors are more careful than men of their physical appearance and weight control and consume less caloric food, which is statistically significant. Since those who exhibit "healthy fanatic" eating habits may have a risk of ON in the future, it would be useful to conduct studies that identify the prevalence of ON in the public.
Ortuño-Sierra, Javier; Aritio-Solana, Rebeca; Inchausti, Félix; Chocarro de Luis, Edurne; Lucas Molina, Beatriz; Pérez de Albéniz, Alicia; Fonseca-Pedrero, Eduardo
2017-01-01
The main purpose of the present study was to assess the depressive symptomatology and to gather new validity evidences of the Reynolds Depression Scale-Short form (RADS-SF) in a representative sample of youths. The sample consisted of 2914 adolescents with a mean age of 15.85 years (SD = 1.68). We calculated the descriptive statistics and internal consistency of the RADS-SF scores. Also, confirmatory factor analyses (CFAs) at the item level and successive multigroup CFAs to test measurement invariance, were conducted. Latent mean differences across gender and educational level groups were estimated, and finally, we studied the sources of validity evidences with other external variables. The level of internal consistency of the RADS-SF Total score by means of Ordinal alpha was .89. Results from CFAs showed that the one-dimensional model displayed appropriate goodness of-fit indices with CFI value over .95, and RMSEA value under .08. In addition, the results support the strong measurement invariance of the RADS-SF scores across gender and age. When latent means were compared, statistically significant differences were found by gender and age. Females scored 0.347 over than males in Depression latent variable, whereas older adolescents scored 0.111 higher than the younger group. In addition, the RADS-SF score was associated with the RADS scores. The results suggest that the RADS-SF could be used as an efficient screening test to assess self-reported depressive symptoms in adolescents from the general population.
Pollock, Benjamin D; Hu, Tian; Chen, Wei; Harville, Emily W; Li, Shengxu; Webber, Larry S; Fonseca, Vivian; Bazzano, Lydia A
2017-01-01
To evaluate several adult diabetes risk calculation tools for predicting the development of incident diabetes and pre-diabetes in a bi-racial, young adult population. Surveys beginning in young adulthood (baseline age ≥18) and continuing across multiple decades for 2122 participants of the Bogalusa Heart Study were used to test the associations of five well-known adult diabetes risk scores with incident diabetes and pre-diabetes using separate Cox models for each risk score. Racial differences were tested within each model. Predictive utility and discrimination were determined for each risk score using the Net Reclassification Index (NRI) and Harrell's c-statistic. All risk scores were strongly associated (p<.0001) with incident diabetes and pre-diabetes. The Wilson model indicated greater risk of diabetes for blacks versus whites with equivalent risk scores (HR=1.59; 95% CI 1.11-2.28; p=.01). C-statistics for the diabetes risk models ranged from 0.79 to 0.83. Non-event NRIs indicated high specificity (non-event NRIs: 76%-88%), but poor sensitivity (event NRIs: -23% to -3%). Five diabetes risk scores established in middle-aged, racially homogenous adult populations are generally applicable to younger adults with good specificity but poor sensitivity. The addition of race to these models did not result in greater predictive capabilities. A more sensitive risk score to predict diabetes in younger adults is needed. Copyright © 2017 Elsevier Inc. All rights reserved.
Albanese, Mark A; Farrell, Philip; Dottl, Susan L
2005-01-01
Using Medical College Admission Test-grade point average (MCAT-GPA) scores as a threshold has the potential to address issues raised in recent Supreme Court cases, but it introduces complicated methodological issues for medical school admissions. To assess various statistical indexes to determine optimally discriminating thresholds for MCAT-GPA scores. Entering classes from 1992 through 1998 (N = 752) are used to develop guidelines for cut scores that optimize discrimination between students who pass and do not pass the United States Medical Licensing Examination (USMLE) Step 1 on the first attempt. Risk differences, odds ratios, sensitivity, and specificity discriminated best for setting thresholds. Compensatory versus noncompensatory procedures both accounted for 54% of Step 1 failures, but demanded different performance requirements (noncompensatory MCAT-biological sciences = 8, physical sciences = 7, verbal reasoning = 7--sum of scores = 22; compensatory MCAT total = 24). Rational and defensible intellectual achievement thresholds that are likely to comply with recent Supreme Court decisions can be set from MCAT scores and GPAs.
Toffan, Adam; Alexander, Marion J L; Peeler, Jason
2017-07-28
The purpose of the study was to compare the most effective joint movements, segment velocities and body positions to perform the fastest and most accurate pass of high school and university football quarterbacks. Secondary purposes were to develop a quarterback throwing test to assess skill level, to determine which kinematic variables were different between high school and university athletes as well as to determine which variables were significant predictors of quarterback throwing test performance. Ten high school and ten university athletes were filmed for the study, performing nine passes at a target and two passes for maximum distance. Thirty variables were measured using Dartfish Team Pro 4.5.2 video analysis system, and Microsoft Excel was used for statistical analysis. University athletes scored slightly higher than the high school athletes on the throwing test, however this result was not statistically significant. Correlation analysis and forward stepwise multiple regression analysis was performed on both the high school players and the university players in order to determine which variables were significant predictors of throwing test score. Ball velocity was determined to have the strongest predictive effect on throwing test score (r = 0.900) for the high school athletes, however, position of the back foot at release was also determined to be important (r = 0.661) for the university group. Several significant differences in throwing technique between groups were noted during the pass, however, body position at release showed the greatest differences between the two groups. High school players could benefit from more complete weight transfer and decreased throw time to increase throwing test score. University athletes could benefit from increased throw time and greater range of motion in external shoulder rotation and trunk rotation to increase their throwing test score. Coaches and practitioners will be able to use the findings of this research to help improve these and related throwing variables in their high school and university quarterbacks.
Allele-sharing models: LOD scores and accurate linkage tests.
Kong, A; Cox, N J
1997-11-01
Starting with a test statistic for linkage analysis based on allele sharing, we propose an associated one-parameter model. Under general missing-data patterns, this model allows exact calculation of likelihood ratios and LOD scores and has been implemented by a simple modification of existing software. Most important, accurate linkage tests can be performed. Using an example, we show that some previously suggested approaches to handling less than perfectly informative data can be unacceptably conservative. Situations in which this model may not perform well are discussed, and an alternative model that requires additional computations is suggested.
Allele-sharing models: LOD scores and accurate linkage tests.
Kong, A; Cox, N J
1997-01-01
Starting with a test statistic for linkage analysis based on allele sharing, we propose an associated one-parameter model. Under general missing-data patterns, this model allows exact calculation of likelihood ratios and LOD scores and has been implemented by a simple modification of existing software. Most important, accurate linkage tests can be performed. Using an example, we show that some previously suggested approaches to handling less than perfectly informative data can be unacceptably conservative. Situations in which this model may not perform well are discussed, and an alternative model that requires additional computations is suggested. PMID:9345087
Hoenigl, Martin; Weibel, Nadir; Mehta, Sanjay R; Anderson, Christy M; Jenks, Jeffrey; Green, Nella; Gianella, Sara; Smith, Davey M; Little, Susan J
2015-08-01
Although men who have sex with men (MSM) represent a dominant risk group for human immunodeficiency virus (HIV), the risk of HIV infection within this population is not uniform. The objective of this study was to develop and validate a score to estimate incident HIV infection risk. Adult MSM who were tested for acute and early HIV (AEH) between 2008 and 2014 were retrospectively randomized 2:1 to a derivation and validation dataset, respectively. Using the derivation dataset, each predictor associated with an AEH outcome in the multivariate prediction model was assigned a point value that corresponded to its odds ratio. The score was validated on the validation dataset using C-statistics. Data collected at a single HIV testing encounter from 8326 unique MSM were analyzed, including 200 with AEH (2.4%). Four risk behavior variables were significantly associated with an AEH diagnosis (ie, incident infection) in multivariable analysis and were used to derive the San Diego Early Test (SDET) score: condomless receptive anal intercourse (CRAI) with an HIV-positive MSM (3 points), the combination of CRAI plus ≥5 male partners (3 points), ≥10 male partners (2 points), and diagnosis of bacterial sexually transmitted infection (2 points)-all as reported for the prior 12 months. The C-statistic for this risk score was >0.7 in both data sets. The SDET risk score may help to prioritize resources and target interventions, such as preexposure prophylaxis, to MSM at greatest risk of acquiring HIV infection. The SDET risk score is deployed as a freely available tool at http://sdet.ucsd.edu. © The Author 2015. Published by Oxford University Press on behalf of the Infectious Diseases Society of America. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Ballesteros-Peña, Sendoa; Vallejo-De la Hoz, Gorka; Fernández-Aedo, Irrintzi
2017-12-23
To analyse vein catheterisation and blood gas test-related pain among adult patients in the emergency department and to explore pain score-related factors. An observational and multicentre research study was performed. Patients undergoing vein catheterisation or arterial puncture for gas test were included consecutively. After each procedure, patients scored the pain experienced using the NRS-11. 780 vein catheterisations and 101 blood gas tests were analysed. Venipuncture was scored with an average score of 2.8 (95% CI: 2.6-3), and arterial puncture with 3.6 (95%CI 3.1-4). Iatrogenic pain scores were associated with moderate - high difficulty procedures (P<.001); with the choice of the humeral rather than the radial artery (P=.02) in the gas test and correlated to baseline pain in venipunctures (P<.001). Pain scores related to other variables such as sex, place of origin or needle gauge did not present statistically significant differences. Vein catheterisation and blood gas test-related pain can be considered mild to moderately and moderately painful procedures, respectively. The pain score is associated with certain variables such as the difficulty of the procedure, the anatomic area of the puncture or baseline pain. A better understanding of painful effects related to emergency nursing procedures and the factors associated with pain self-perception could help to determine when and how to act to mitigate this undesired effect. Copyright © 2017 Elsevier España, S.L.U. All rights reserved.
Khan, Asaduzzaman; Chien, Chi-Wen; Bagraith, Karl S
2015-04-01
To investigate whether using a parametric statistic in comparing groups leads to different conclusions when using summative scores from rating scales compared with using their corresponding Rasch-based measures. A Monte Carlo simulation study was designed to examine between-group differences in the change scores derived from summative scores from rating scales, and those derived from their corresponding Rasch-based measures, using 1-way analysis of variance. The degree of inconsistency between the 2 scoring approaches (i.e. summative and Rasch-based) was examined, using varying sample sizes, scale difficulties and person ability conditions. This simulation study revealed scaling artefacts that could arise from using summative scores rather than Rasch-based measures for determining the changes between groups. The group differences in the change scores were statistically significant for summative scores under all test conditions and sample size scenarios. However, none of the group differences in the change scores were significant when using the corresponding Rasch-based measures. This study raises questions about the validity of the inference on group differences of summative score changes in parametric analyses. Moreover, it provides a rationale for the use of Rasch-based measures, which can allow valid parametric analyses of rating scale data.
ERIC Educational Resources Information Center
Olsen, Marilyn
A study (conducted in suburban central New Jersey using 218 second graders' California Achievement Test (CAT) scores from 1986-1988 compared the effectiveness of two well-known reading programs. Results indicated that although there was no statistically significant difference in the scores, the mean difference suggested that children who were…
Ido, Yoshikazu; Uchiyama, Shigeharu; Nakamura, Koichi; Itsubo, Toshiro; Hayashi, Masanori; Hata, Yukihiko; Imaeda, Toshihiko; Kato, Hiroyuki
2016-06-06
We investigated a recovery pattern in subjective and objective measures among 52 patients with cubital tunnel syndrome after anterior subcutaneous transposition of the ulnar nerve. Disabilities of the Arm, Shoulder and Hand (DASH) score (primary outcome), numbness score, grip and pinch strength, Semmes-Weinstein (SW) score, static 2-point discrimination (2PD) score, and motor conduction velocity (MCV) stage were examined preoperatively and 1, 3, 6, 12, and ≥24 months postoperatively. Statistical analyses were conducted to evaluate how each variable improved after surgery. A linear mixed-effects model was used for continuous variables (DASH score, numbness, grip and pinch strength), and a proportional odds model was used for categorical variables (SW and 2PD tests and MCV stages). DASH score significantly improved by 6 months. Significant recovery in numbness and SW test scores occurred at 1 month. Grip and pinch strength, 2PD test scores, and MCV stage improved by 3 months. DASH scores and numbness recovered regardless of age, sex, or disease severity. It was still unclear if both subjective and objective measures improved beyond 1-year postoperatively. These data are helpful for predicting postoperative recovery patterns and tend to be most important for patients prior to surgery.
Umphress, Thomas B
2008-06-01
Twenty people with suspected intellectual disability took the Reynolds Intellectual Assessment Scales (RIAS; C. R. Reynolds & R. W. Kamphaus, 1998) and the Wechsler Adult Intelligence Scale-3rd Edition (WAIS-III; D. Wechsler, 1997) to see if the 2 IQ tests produced comparable results. A t test showed that the RIAS Composite Intelligence Index scores were significantly higher than WAIS-III Full Scale IQ scores at the alpha level of .01. There was a significant difference between the RIAS Nonverbal Intelligence and WAIS-III Performance Scale, but there was no significant difference between the RIAS Verbal Intelligence Index and the WAIS-III Verbal Scale IQ. The results raise questions concerning test selection for diagnosing intellectual disability and the use of the correlation statistic for comparing intelligence tests.
Integration of a Community Pharmacy Simulation Program into a Therapeutics Course.
Shin, Jaekyu; Tabatabai, Daryush; Boscardin, Christy; Ferrone, Marcus; Brock, Tina
2018-02-01
Objective. To demonstrate the feasibility of integrating the computer simulation, MyDispense, into a therapeutics course and to measure its effects on student perception and learning. Methods. We conducted a prospective study with an experimental phase and an implementation phase. In the first phase, students were randomized to complete a therapeutics case using MyDispense or traditional paper methods in class. In the second phase, all students completed two therapeutic cases using MyDispense in class with the option to complete four additional outside-of-class cases using MyDispense. Students completed pre- and post-tests in class and three surveys. Results. In the experimental phase, mean test scores increased from pre- to post-test for both MyDispense and traditional paper groups, but the difference between the groups was not statistically significant. Students in the traditional paper group reported statistically significant gains in confidence compared to the MyDispense group. In the implementation phase, mean test scores again increased, however, student perception of the use of MyDispense for therapeutics was negative. Completing the optional outside-of-class cases, however, was positively and significantly correlated with the midterm and final examination scores. Conclusion. Implementation of MyDispense in therapeutics may be feasible and has positive effects (eg, correlation with exam scores, capacity for immediate feedback, and potential for effective self-study). With short-term use and in the absence of assessment methods that also require seeking information from patients, students prefer to learn via traditional paper cases.
Bull, Leona
2007-02-01
The aim of the study was to determine the clinical and perceived effectiveness of the Sunflower therapy in the treatment of childhood dyslexia. The Sunflower therapy includes applied kinesiology, physical manipulation, massage, homeopathy, herbal remedies and neuro-linguistic programming. A multi-centred, randomised controlled trial was undertaken with 70 dyslexic children aged 6-13 years. The research study aimed to test the research hypothesis that dyslexic children 'feel better' and 'perform better' as a result of treatment by the Sunflower therapy. Children in the treatment group and the control group were assessed using a battery of standardised cognitive, Literacy and self-esteem tests before and after the intervention. Parents of children in the treatment group gave feedback on their experience of the Sunflower therapy. Test scores were compared using the Mann Whitney, and Wilcoxon statistical tests. While both groups of children improved in some of their test scores over time, there were no statistically significant improvements in cognitive or Literacy test performance associated with the treatment. However, there were statistically significant improvements in academic self-esteem, and reading self-esteem, for the treatment group. The majority of parents (57.13%) felt that the Sunflower therapy was effective in the treatment of learning difficulties. Further research is required to verify these findings, and should include a control group receiving a dummy treatment to exclude placebo effects.
Intelligence--Group Administered, Grades 4-6. Collection. Annotated Bibliography of Tests.
ERIC Educational Resources Information Center
Educational Testing Service, Princeton, NJ. Test Collection.
Most of the 28 tests included in this bibliography assess intelligence and provide an actual I.Q. score or one that is statistically similar. (A complete list of mental or cognitive ability tests is available separately.) Although all ages are represented, the majority of tests are targeted to grades 4 through 6. This document is one in a series…
Meenakshi, S; Gujjari, Anil Kumar; Thippeswamy, H N; Raghunath, N
2014-12-01
Stereognosis has been defined as the appreciation of the form of objects by palpation. Whilst this definition holds good for the manual exploration of objects, it is possible for the shape of objects to be explored intra orally referred to as oral stereognosis. To better understand patients' relative satisfaction with complete dentures, differences in oral stereognostic perception, based on the identification of 6 edible objects was analyzed in a group of 30 edentulous individuals at 3 stages, namely, just before (pre-treatment), 30 min after (30 min post-treatment) and 1 month after (1 month post-treatment) the insertion of new dentures. The time required to identify each object was recorded and the correctness of identification of each object was scored using oral stereognostic score. Descriptive statistics, Wilcoxon signed rank test, Spearman's rank correlation test, Pearson Chi square test was used to statistically analyze the data obtained. OSA scores was significantly increased 1 month post-treatment compared to 30 min post-treatment (p < 0.05). It was found that Oral stereognostic test is reliable for measuring patients' oral stereognostic perception and may be used as one of the clinical aids in appreciating the functional limitations imposed by the prostheses.
Prognostic Value of Metabolic Liver Function Tests: a Study on 711 Cirrhotic Patients.
Lebossé, Fanny; Guillaud, Olivier; Forestier, Julien; Ecochard, Marie; Boillot, Olivier; Roman, Sabine; Mion, François; Dumortier, Jérôme
2016-09-01
The prognosis of cirrhotic patients is usually assessed by Child-Pugh and MELD scores. Metabolic liver function tests such as aminopyrine breath test (ABT) and indocyanine green clearance (IGC) have been shown to reveal hepatocellular dysfunction. The aim of this retrospective study was to compare the prognostic value of the MELD score, Child-Pugh score, ABT and IGC in a large cohort of cirrhotic patients. Between January 1996 and June 2008, 711 cirrhotic patients were included and the primary endpoint was survival without LT. The ROC curves with c-statistics, correlation coefficient and survival were calculated. Metabolic function tests and scores were strongly correlated. At the time of evaluation, 111 patients had died and 520 had received a transplant. Prognostic ability (estimated by the AUROC curve) to predict survival without LT at 6 months was 0.662, 0.691, 0.738 and 0.715 for ABT, IGC, Child-Pugh score and MELD score, respectively. Similarly, at 1 year, AUROC was 0.738 for Child-Pugh score, 0.716 for MELD score, 0.693 for IGC clearance and 0.651 for ABT. Our results strongly confirm that IGC and ABT have a high prognostic value in cirrhotic patients, similar to Child-Pugh and MELD scores. They could be developed to routinely evaluate the prognosis of patients in addition to clinical and biochemical data.
Evaluation of creative thinking in children with idiopathic epilepsy (absence epilepsy).
Di Filippo, T; Parisi, L; Roccella, M
2012-02-01
Creativity represents the silent character of human behaviour. In children with epilepsy, cognitive performance of has mainly been investigated under the assumption that the disorder represents a risk factor for the development of intellectual function. In subjects with different forms of epilepsy, neuropsychologic disorders have been detected even when cognitive-global functioning is unimpaired. The cognitive functions of subjects with epilepsy have been widely studied, but their creativity has been never evaluated to date. The aim of this study was to describe the development of creative thinking in a group of children with absence epilepsy. The test battery included: the Torrance Test of Creative Thinking (TTCT), the Wechsler Intelligence Scale for Children-revised (WISC-R) and the Goodenough Human Figure Drawing Test. Statistical analysis (Mann-Whitney test) showed a statistically significant difference (P <0.05) in test scores between two groups of subjects (children with epilesy vs control group), with higher scores for figure originality, figure fluidity and figure elaboration in the control group. There was a significant correlation (Spearman's rho) between verbal IQ and verbal fluidity and verbal flexibility subscale scores and between performance IQ and figure elaboration, between total IQ and verbal fluidity and verbal flexibility subscales (P <0.05; r >0.30). Low scores on the figure originality subscales seem to confirm the hypothesis that adverse psychodynamic and relational factors impoverish autonomy, flexibility and manipulator interests. The communication channels between subjects with epilepsy and their family members were affected by the disorder, as were the type of emotional dynamics and affective flux.
Negative Marking and the Student Physician–-A Descriptive Study of Nigerian Medical Schools
Ndu, Ikenna Kingsley; Ekwochi, Uchenna; Di Osuorah, Chidiebere; Asinobi, Isaac Nwabueze; Nwaneri, Michael Osita; Uwaezuoke, Samuel Nkachukwu; Amadi, Ogechukwu Franscesca; Okeke, Ifeyinwa Bernadette; Chinawa, Josephat Maduabuchi; Orjioke, Casmir James Ginikanwa
2016-01-01
Background There is considerable debate about the two most commonly used scoring methods, namely, the formula scoring (popularly referred to as negative marking method in our environment) and number right scoring methods. Although the negative marking scoring system attempts to discourage students from guessing in order to increase test reliability and validity, there is the view that it is an excessive and unfair penalty that also increases anxiety. Feedback from students is part of the education process; thus, this study assessed the perception of medical students about negative marking method for multiple choice question (MCQ) examination formats and also the effect of gender and risk-taking behavior on scores obtained with this assessment method. Methods This was a prospective multicenter survey carried out among fifth year medical students in Enugu State University and the University of Nigeria. A structured questionnaire was administered to 175 medical students from the two schools, while a class test was administered to medical students from Enugu State University. Qualitative statistical methods including frequencies, percentages, and chi square were used to analyze categorical variables. Quantitative statistics using analysis of variance was used to analyze continuous variables. Results Inquiry into assessment format revealed that most of the respondents preferred MCQs (65.9%). One hundred and thirty students (74.3%) had an unfavorable perception of negative marking. Thirty-nine students (22.3%) agreed that negative marking reduces the tendency to guess and increases the validity of MCQs examination format in testing knowledge content of a subject compared to 108 (61.3%) who disagreed with this assertion (χ2 = 23.0, df = 1, P = 0.000). The median score of the students who were not graded with negative marking was significantly higher than the score of the students graded with negative marking (P = 0.001). There was no statistically significant difference in the risk-taking behavior between male and female students in their MCQ answering patterns with negative marking method (P = 0.618). Conclusions In the assessment of students, it is more desirable to adopt fair penalties for discouraging guessing rather than excessive penalties for incorrect answers, which could intimidate students in negative marking schemes. There is no consensus on the penalty for an incorrect answer. Thus, there is a need for continued research into an effective and objective assessment tool that will ensure that the students’ final score in a test truly represents their level of knowledge. PMID:29349304
ERIC Educational Resources Information Center
Ladyshewsky, Richard K.
2015-01-01
This research explores differences in multiple choice test (MCT) scores in a cohort of post-graduate students enrolled in a management and leadership course. A total of 250 students completed the MCT in either a supervised in-class paper and pencil test or an unsupervised online test. The only statistically significant difference between the nine…
Mohamadirizi, Soheila; Fahami, Fariba; Bahadoran, Parvin; Ehsanpour, Soheila
2015-01-01
Background: An active teaching method has been used widely in medical education. The aim of this study was to determine the effectiveness of the four-phase teaching method on midwifery students’ emotional intelligence (EQ) in managing the childbirth. Materials and Methods: This was an experimental study that performed in 2013 in Isfahan University of Medical Sciences. Thirty midwifery students were involved in this study and selected through a random sampling method. The EQ questionnaire (43Q) was completed by both the groups, before and after the education. The collected data were analyzed using SPSS 14, the independent t-test, and the paired t-test. The statistically significant level was considered to be <0.05. Results: The findings of the independent t-test did not show any significant difference between EQ scores of the experimental and the control group before the intervention, whereas a statistically significant difference was observed after the intervention between the scores of two groups (P = 0.009). The paired t-test showed a statistically significant difference in EQ scores in the two groups after the intervention in the four-phase and the control group, respectively, as P = 0.005 and P = 0.018. Furthermore, the rate of self-efficiency has increased in the experimental group and control group as 66% and 13% (P = 0.024), respectively. Conclusion: The four-phase teaching method can increase the EQ levels of midwifery students. Therefore, the conduction of this educational model is recommended as an effective learning method. PMID:26097861
Long-term occlusal changes assessed by the American Board of Orthodontics' model grading system.
Aszkler, Robert M; Preston, Charles B; Saltaji, Humam; Tabbaa, Sawsan
2014-02-01
The purpose of this study was to assess the long-term posttreatment changes in all criteria of the American Board of Orthodontics' (ABO) model grading system. We used plaster models from patients' final and posttreatment records. Thirty patients treated by 1 orthodontist using 1 bracket prescription were selected. An initial discrepancy index for each subject was performed to determine the complexity of each case. The final models were then graded using the ABO's model grading system immediately at posttreatment and postretention. Statistical analysis was performed on the 8 criteria of the model grading system, including paired t tests and Pearson correlations. An alpha of 0.05 was considered statistically significant. The average length of time between the posttreatment and postretention records was 12.7 ± 4.4 years. It was shown that alignment and rotations worsened by postretention (P = 0.014), and a weak statistically significant correlation at posttreatment and postretention was found (0.44; P = 0.016). Both marginal ridges and occlusal contacts scored less well at posttreatment. These criteria showed a significant decrease in scores between posttreatment and postretention (P <0.001), but the correlations were not statistically significant. The average total score showed a significant decrease between posttreatment and postretention (P <0.001), partly because of the large decrease in the previous 2 criteria. Higher scores for occlusal contacts and marginal ridges were found at the end of treatment; however, those scores and the overall scores for the 30 subjects improved in the postretention phase. Copyright © 2014. Published by Mosby, Inc.
Husbands, Adrian; Mathieson, Alistair; Dowell, Jonathan; Cleland, Jennifer; MacKenzie, Rhoda
2014-04-23
The UK Clinical Aptitude Test (UKCAT) was designed to address issues identified with traditional methods of selection. This study aims to examine the predictive validity of the UKCAT and compare this to traditional selection methods in the senior years of medical school. This was a follow-up study of two cohorts of students from two medical schools who had previously taken part in a study examining the predictive validity of the UKCAT in first year. The sample consisted of 4th and 5th Year students who commenced their studies at the University of Aberdeen or University of Dundee medical schools in 2007. Data collected were: demographics (gender and age group), UKCAT scores; Universities and Colleges Admissions Service (UCAS) form scores; admission interview scores; Year 4 and 5 degree examination scores. Pearson's correlations were used to examine the relationships between admissions variables, examination scores, gender and age group, and to select variables for multiple linear regression analysis to predict examination scores. Ninety-nine and 89 students at Aberdeen medical school from Years 4 and 5 respectively, and 51 Year 4 students in Dundee, were included in the analysis. Neither UCAS form nor interview scores were statistically significant predictors of examination performance. Conversely, the UKCAT yielded statistically significant validity coefficients between .24 and .36 in four of five assessments investigated. Multiple regression analysis showed the UKCAT made a statistically significant unique contribution to variance in examination performance in the senior years. Results suggest the UKCAT appears to predict performance better in the later years of medical school compared to earlier years and provides modest supportive evidence for the UKCAT's role in student selection within these institutions. Further research is needed to assess the predictive validity of the UKCAT against professional and behavioural outcomes as the cohort commences working life.
2014-01-01
Background The UK Clinical Aptitude Test (UKCAT) was designed to address issues identified with traditional methods of selection. This study aims to examine the predictive validity of the UKCAT and compare this to traditional selection methods in the senior years of medical school. This was a follow-up study of two cohorts of students from two medical schools who had previously taken part in a study examining the predictive validity of the UKCAT in first year. Methods The sample consisted of 4th and 5th Year students who commenced their studies at the University of Aberdeen or University of Dundee medical schools in 2007. Data collected were: demographics (gender and age group), UKCAT scores; Universities and Colleges Admissions Service (UCAS) form scores; admission interview scores; Year 4 and 5 degree examination scores. Pearson’s correlations were used to examine the relationships between admissions variables, examination scores, gender and age group, and to select variables for multiple linear regression analysis to predict examination scores. Results Ninety-nine and 89 students at Aberdeen medical school from Years 4 and 5 respectively, and 51 Year 4 students in Dundee, were included in the analysis. Neither UCAS form nor interview scores were statistically significant predictors of examination performance. Conversely, the UKCAT yielded statistically significant validity coefficients between .24 and .36 in four of five assessments investigated. Multiple regression analysis showed the UKCAT made a statistically significant unique contribution to variance in examination performance in the senior years. Conclusions Results suggest the UKCAT appears to predict performance better in the later years of medical school compared to earlier years and provides modest supportive evidence for the UKCAT’s role in student selection within these institutions. Further research is needed to assess the predictive validity of the UKCAT against professional and behavioural outcomes as the cohort commences working life. PMID:24762134
Radiology resident dictation instruction: effectiveness of the didactic lecture.
Woodfield, Courtney A; Mainiero, Martha B
2008-07-01
The study's purpose was to determine the effectiveness of a didactic lecture for teaching and evaluating radiology resident dictation skills. A 23-question test was created to assess resident knowledge of the American College of Radiology practice guidelines for reporting and our institution-specific requirements for communication of diagnostic imaging results. The test was administered to 23 residents before and after a 40-minute didactic lecture covering the structure of radiology reports and requirements for communication of imaging findings. The pre- and postlecture tests were graded on the basis of the number of correct answers. Data were analyzed using the mixed linear model for repeated measures and the Holm test for group comparisons. Mean pre- and postlecture test scores were 74.6% +/- 2.73% and 94.6% +/- 5.94% for postgraduate year (PGY) 2, 88.1% +/- 5.55% and 95.6% +/- 4.50% for PGY 3, 94.8% +/- 2.5% and 100% +/- 0% for PGY 4, and 96.8% +/- 1.79% and 98.4% +/- 2.19% for PGY 5, respectively. The increase of pre- to postlecture test scores was statistically significant for PGY 2, PGY 3, and PGY 4 residents (P < .005). Pre- to postlecture test improvement was greatest for PGY 2 residents. Test performance of PGY 2 residents compared with PGY 5 residents was statistically different. Test scores for PGY 2 to PGY 4 residents significantly increased after didactic instruction on the reporting and communication of diagnostic imaging results. These findings suggest that a lecture and test format can be used to teach and assess radiology resident reporting and communication skills.
Development and standardization of Arabic words in noise test in Egyptian children.
Abdel Rahman, Tayseer Taha
2018-05-01
To develop and establish norms of Arabic Words in Noise test in Egyptian children. Total number of participants was 152 with normal hearing and ranging in age from 5 to 12 years. They are subdivided into two main groups (standardization group) which comprised 120 children with normal scholastic achievement and (application group) which comprised 32 children with different types of central auditory processing disorders. Arabic version of both Speech perception in noise (SPIN) and Words in Noise (WIN) tests were presented in each ear at zero signal to-noise ratio (SNR) using ipsilateral Cafeteria noise fixed at 50 dB sensation level (dBSL). The least performance in WIN test occurred between 5 and 7 years and highest scores from 9 to 12 years. However, no statistically significant difference was found among the three standardization age groups. Moreover, no statistically significant difference was found between the right and left ears scores or among the three lists. When the WIN test was compared to SPIN test in children with and without abnormal SPIN scores it showed highly consistent results except in children suffering from memory deficit reflecting that WIN test is more accurate than SPIN in this group of children. The Arabic WIN test can be used in children as young as 5 years. Also, it can be a good cross check test with SPIN test or used to follow up children after rehabilitation program in hearing impaired children or follow up after central auditory remediation of children with selective auditory attention deficit. Copyright © 2017. Published by Elsevier B.V.
Serizawa, Toru; Higuchi, Yoshinori; Nagano, Osamu; Hirai, Tatsuo; Ono, Junichi; Saeki, Naokatsu; Miyakawa, Akifumi
2012-12-01
The authors conducted validity testing of the 5 major reported indices for radiosurgically treated brain metastases- the original Radiation Therapy Oncology Group's Recursive Partitioning Analysis (RPA), the Score Index for Radiosurgery in Brain Metastases (SIR), the Basic Score for Brain Metastases (BSBM), the Graded Prognostic Assessment (GPA), and the subclassification of RPA Class II proposed by Yamamoto-in nearly 2500 cases treated with Gamma Knife surgery (GKS), focusing on the preservation of neurological function as well as the traditional endpoint of overall survival. The authors analyzed data from 2445 cases treated with GKS by the first author (T.S.), the primary surgeon. The patient group consisted of 1716 patients treated between January 1998 and March 2008 (the Chiba series) and 729 patients treated between April 2008 and December 2011 (the Tokyo series). The interval from the date of GKS until the date of the patient's death (overall survival) and impaired activities of daily living (qualitative survival) were calculated using the Kaplan-Meier method, while the absolute risk for two adjacent classes of each grading system and both hazard ratios and 95% confidence intervals were estimated using the Cox proportional hazards model. For overall survival, there were highly statistically significant differences between each two adjacent patient groups characterized by class or score (all p values < 0.001), except for GPA Scores 3.5-4.0 and 3.0. The SIR showed the best statistical results for predicting preservation of neurological function. Although no other grading systems yielded statistically significant differences in qualitative survival, the BSBM and the modified RPA appeared to be better than the original RPA and GPA. The modified RPA subclassification, proposed by Yamamoto, is well balanced in scoring simplicity with respect to case number distribution and statistical results for overall survival. However, a new or revised grading system is necessary for predicting qualitative survival and for selecting the optimal treatment for patients with brain metastasis treated by GKS.
Radha, G; Swathi, V; Jha, Abhishek
2016-01-01
This study explores the association of disabilities and oral health. The aim of the study was to assess the salivary and plaque pH and oral health status of children with and without disabilities. A total of 100 schoolchildren (50 with disabilities and 50 without disabilities) were examined from 9 to 15 years age group. Saliva and plaque pH analysis were done to both the groups. Clinical data were collected on periodontal status, dental caries using WHO criteria. pH values of different groups, difference between the means were calculated using independent t-test, and frequency distribution was analyzed using Chi-square test. Statistical significance, P value was set at 0.05. Mean plaque and salivary pH scores were lesser (5.73 and 5.67) in children with intellectual disabilities (IDs) (P< 0.001). Subjects with disabilities had also statistically significant higher CPI scores and decayed, missing, and filled scores than their healthy counterparts (P< 0.001). There is a statistically significant difference in plaque and salivary pH among children with and without ID with lower plaque and salivary pH among children with ID. In addition to this, the oral health was also more compromised in children with ID, which confirms a need for preventive treatment for these children.
Using a portable sulfide monitor as a motivational tool: a clinical study.
Uppal, Ranjit Singh; Malhotra, Ranjan; Grover, Vishakha; Grover, Deepak
2012-01-01
Bad breath has a significant impact on daily life of those who suffer from it. Oral malodor may rank only behind dental caries and periodontal disease as the cause of patient's visit to dentist. An aim of this study was to use a portable sulfide monitor as a motivational tool for encouraging the patients towards the better oral hygiene by correlating the plaque scores with sulfide monitor scores, and comparing the sulfide monitor scores before and after complete prophylaxis and 3 months after patient motivation. 30 patients with chronic periodontitis, having chief complaint of oral malodor participated in this study. At first visit, the plaque scores (P1) and sulfide monitor scores before (BCR1) and after complete oral prophylaxis (BCR2) were taken. Then the patients were motivated towards the better oral hygiene. After 3 months, plaque scores (P2) and sulfide monitor scores (BCR3) were recorded again. It was done using SPSS (student package software for statistical analysis). Paired sample test was performed. Statistically significant reduction in sulfide monitor scores was reported after the complete oral prophylaxis and 3 months after patient motivation. Plaque scores were significantly reduced after a period of 3 months. Plaque scores and breathchecker scores were positively correlated. An intensity of the oral malodor was positively correlated with the plaque scores. The portable sulfide monitor was efficacious in motivating the patients towards the better oral hygiene.
A Statistical Analysis of Data Used in Critical Decision Making by Secondary School Personnel.
ERIC Educational Resources Information Center
Dunn, Charleta J.; Kowitz, Gerald T.
Guidance decisions depend on the validity of standardized tests and teacher judgment records as measures of student achievement. To test this validity, a sample of 400 high school juniors, randomly selected from two large Gulf Coas t area schools, were administered the Iowa Tests of Educational Development. The nine subtest scores and each…
Using Linguistic Knowledge in Statistical Machine Translation
2010-09-01
on newswire test data . . . . . . . . . . . . . . . . . . . . . 65 3.4 Arabic to English MT results for Arabic morphological segmentation, measured on...web test data. . . . . . . . . . . . . . . . . . . . . . . . 65 3.5 Recombination Results. Percentage of sentences with mis-combined words...scores for syntactic reordering of the Spoken Language Domain. 90 5.1 Normalized likelihood of the test set alignments without decision trees, and then
ERIC Educational Resources Information Center
Gadalla, Tahany M.
The equivalence of multiple-choice (MC) and constructed response (discrete) (CR-D) response formats as applied to mathematics computation at grade levels two to six was tested. The difference between total scores from the two response formats was tested for statistical significance, and the factor structure of items in both response formats was…
Rasch Based Analysis of Oral Proficiency Test Data.
ERIC Educational Resources Information Center
Nakamura, Yuji
2001-01-01
This paper examines the rating scale data of oral proficiency tests analyzed by a Rasch Analysis focusing on an item map and factor analysis. In discussing the item map, the difficulty order of six items and students' answering patterns are analyzed using descriptive statistics and measures of central tendency of test scores. The data ranks the…
ERIC Educational Resources Information Center
Walker, A. Adrienne; Jennings, Jeremy Kyle; Engelhard, George, Jr.
2018-01-01
Individual person fit analyses provide important information regarding the validity of test score inferences for an "individual" test taker. In this study, we use data from an undergraduate statistics test (N = 1135) to illustrate a two-step method that researchers and practitioners can use to examine individual person fit. First, person…
Prediction of true test scores from observed item scores and ancillary data.
Haberman, Shelby J; Yao, Lili; Sinharay, Sandip
2015-05-01
In many educational tests which involve constructed responses, a traditional test score is obtained by adding together item scores obtained through holistic scoring by trained human raters. For example, this practice was used until 2008 in the case of GRE(®) General Analytical Writing and until 2009 in the case of TOEFL(®) iBT Writing. With use of natural language processing, it is possible to obtain additional information concerning item responses from computer programs such as e-rater(®). In addition, available information relevant to examinee performance may include scores on related tests. We suggest application of standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. In performing such analysis, we require estimation of variances and covariances of measurement errors, a task which can be quite difficult in the case of tests with limited numbers of items and with multiple measurements per item. As a consequence, a new estimation method is suggested based on samples of examinees who have taken an assessment more than once. Such samples are typically not random samples of the general population of examinees, so that we apply statistical adjustment methods to obtain the needed estimated variances and covariances of measurement errors. To examine practical implications of the suggested methods of analysis, applications are made to GRE General Analytical Writing and TOEFL iBT Writing. Results obtained indicate that substantial improvements are possible both in terms of reliability of scoring and in terms of assessment reliability. © 2015 The British Psychological Society.
Bishop, Somer L.; Farmer, Cristan; Thurm, Audrey
2014-01-01
Nonverbal IQ (NVIQ) was examined in 84 individuals with ASD followed from age 2 to 19. Most adults who scored in the range of ID also received scores below 70 as children, and the majority of adults with scores in the average range had scored in this range by age 3. However, within the lower ranges of ability, actual scores declined from age 2 to 19, likely due in part to limitations of appropriate tests. Use of Vineland-II DLS scores in place of NVIQ did not statistically improve the correspondence between age 2 and age 19 scores. Clinicians and researchers should use caution when making comparisons based on exact scores or specific ability ranges within or across individuals with ASD of different ages. PMID:25239176
Zarshenas, Ladan; Keshavarz, Tala; Momennasab, Marzieh; Zarifsanaiey, Nahid
2017-08-01
Given the limitations of traditional teaching methods in the learning process of adolescents, this study was designed to investigate the effects of osteoporosis prevention training through interactive multimedia method on the degree of knowledge and self-efficacy of female high school students. In this interventional study which was conducted in 2016 in Fars province, Iran, 120 high school students were selected through proportional stratified sampling from schools and different classes at first, second, third, and pre-university grades. The participants were randomly divided into two groups, each containing 60 students. Educational interventions for the test group included an interactive multimedia CD, and for the control group was an educational booklet. Before and one month after the intervention the students' level of knowledge and self-efficacy was measured. The spss 19 statistical software was used, and descriptive and analytical tests were performed to analyze the data. Results showed a significant difference in self-efficacy scores after the intervention (P=0.012) with the test group obtained a higher self-efficacy score than the control group. Also, a significant increase was observed in the knowledge score of both groups after the training (P<0.001), but the knowledge score between the two groups was not statistically significant (P=0.38) after the intervention. The use of new training methods like interactive multimedia CD for public education, particular adolescents about health and hygiene is recommended.
Sainz de Baranda, Pilar; Rodríguez-Iniesta, María; Ayala, Francisco; Santonja, Fernando; Cejudo, Antonio
2014-07-01
To examine the criterion-related validity of the horizontal hip joint angle (H-HJA) test and vertical hip joint angle (V-HJA) test for estimating hamstring flexibility measured through the passive straight-leg raise (PSLR) test using contemporary statistical measures. Validity study. Controlled laboratory environment. One hundred thirty-eight professional trampoline gymnasts (61 women and 77 men). Hamstring flexibility. Each participant performed 2 trials of H-HJA, V-HJA, and PSLR tests in a randomized order. The criterion-related validity of H-HJA and V-HJA tests was measured through the estimation equation, typical error of the estimate (TEEST), validity correlation (β), and their respective confidence limits. The findings from this study suggest that although H-HJA and V-HJA tests showed moderate to high validity scores for estimating hamstring flexibility (standardized TEEST = 0.63; β = 0.80), the TEEST statistic reported for both tests was not narrow enough for clinical purposes (H-HJA = 10.3 degrees; V-HJA = 9.5 degrees). Subsequently, the predicted likely thresholds for the true values that were generated were too wide (H-HJA = predicted value ± 13.2 degrees; V-HJA = predicted value ± 12.2 degrees). The results suggest that although the HJA test showed moderate to high validity scores for estimating hamstring flexibility, the prediction intervals between the HJA and PSLR tests are not strong enough to suggest that clinicians and sport medicine practitioners should use the HJA and PSLR tests interchangeably as gold standard measurement tools to evaluate and detect short hamstring muscle flexibility.
DIFAS: Differential Item Functioning Analysis System. Computer Program Exchange
ERIC Educational Resources Information Center
Penfield, Randall D.
2005-01-01
Differential item functioning (DIF) is an important consideration in assessing the validity of test scores (Camilli & Shepard, 1994). A variety of statistical procedures have been developed to assess DIF in tests of dichotomous (Hills, 1989; Millsap & Everson, 1993) and polytomous (Penfield & Lam, 2000; Potenza & Dorans, 1995) items. Some of these…
Observed-Score Equating with a Heterogeneous Target Population
ERIC Educational Resources Information Center
Duong, Minh Q.; von Davier, Alina A.
2012-01-01
Test equating is a statistical procedure for adjusting for test form differences in difficulty in a standardized assessment. Equating results are supposed to hold for a specified target population (Kolen & Brennan, 2004; von Davier, Holland, & Thayer, 2004) and to be (relatively) independent of the subpopulations from the target population (see…
[Description and evaluation of creative thinking in preterm low birth weight infants].
Parisi, L; Di Filippo, T; Firrigno, L; La Grutta, S; Testa, D; Roccella, M
2007-04-01
Since the 1950s, the problem of how to evaluate creativity has been addressed in studies on the definition of measurement criteria and on the relationship between intelligence and creative thinking. Many revealed cognitive and relational disorders in preterm infants, particularly in preterm very low birth weight infants (birth weight <1500 g) and in infants with serious complications. This study describes the development of creative thinking in a group of children born preterm. The study sample was 43 children (21 males, 22 females; age range 6-11 years), regularly attending school, born with low birth weight (1050-2450 g) at 29-32 weeks gestational age, and compared with a control group with birth weight >2500 g. The test battery included: Torrance Test of Creative Thinking (TCTT); WISC-R intelligence test; Goodenough Human Figure Drawing Test. Statistical analysis (Mann-Whitney U test) showed a statistically significant difference (P>0.05) between the 2 groups; scores for figure originality, figure fluidity and figure elaboration were consistently higher in the control group. Within the low birth weight group, there was a significant correlation (Spearman r) between verbal IQ and verbal fluidity and verbal flexibility subscale scores and between IQ performance and figure elaboration. Scores on the figure drawing tests showed higher creative ability in the control group. In children born preterm with low birth weight, emotive dynamics and flow of affection may influence the channels of communication between child and family. The low figure originality subscale scores support the hypothesis that psychodynamic and relational factors (worry about the preterm condition, overprotective behaviour by parents and others) could lead to diminished autonomy, flexibility and manipulatory interest in the child.
Ferreira, António Miguel; Marques, Hugo; Tralhão, António; Santos, Miguel Borges; Santos, Ana Rita; Cardoso, Gonçalo; Dores, Hélder; Carvalho, Maria Salomé; Madeira, Sérgio; Machado, Francisco Pereira; Cardim, Nuno; de Araújo Gonçalves, Pedro
2016-11-01
Current guidelines recommend the use of the Modified Diamond-Forrester (MDF) method to assess the pre-test likelihood of obstructive coronary artery disease (CAD). We aimed to compare the performance of the MDF method with two contemporary algorithms derived from multicenter trials that additionally incorporate cardiovascular risk factors: the calculator-based 'CAD Consortium 2' method, and the integer-based CONFIRM score. We assessed 1069 consecutive patients without known CAD undergoing coronary CT angiography (CCTA) for stable chest pain. Obstructive CAD was defined as the presence of coronary stenosis ≥50% on 64-slice dual-source CT. The three methods were assessed for calibration, discrimination, net reclassification, and changes in proposed downstream testing based upon calculated pre-test likelihoods. The observed prevalence of obstructive CAD was 13.8% (n=147). Overestimations of the likelihood of obstructive CAD were 140.1%, 9.8%, and 18.8%, respectively, for the MDF, CAD Consortium 2 and CONFIRM methods. The CAD Consortium 2 showed greater discriminative power than the MDF method, with a C-statistic of 0.73 vs. 0.70 (p<0.001), while the CONFIRM score did not (C-statistic 0.71, p=0.492). Reclassification of pre-test likelihood using the 'CAD Consortium 2' or CONFIRM scores resulted in a net reclassification improvement of 0.19 and 0.18, respectively, which would change the diagnostic strategy in approximately half of the patients. Newer risk factor-encompassing models allow for a more precise estimation of pre-test probabilities of obstructive CAD than the guideline-recommended MDF method. Adoption of these scores may improve disease prediction and change the diagnostic pathway in a significant proportion of patients. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Formal testing and utilization of streaming media to improve flight crew safety knowledge.
Bellazzini, Marc A; Rankin, Peter M; Quisling, Jason; Gangnon, Ronald; Kohrs, Mike
2008-01-01
Increased concerns over the safety of air medical transport have prompted development of novel ways to increase safety. The objective of our study was to determine if an Internet streaming media safety video increased crew safety knowledge. 23 out of 40 crew members took an online safety pre-test, watched a safety video specific to our program and completed immediate and long-term post-testing 6 months later. Mean pre-test, post-test and 6 month follow up test scores were 84.9%, 92.3% and 88.4% respectively. There was a statistically significant difference in all scores (p
NASA Astrophysics Data System (ADS)
Huerta, Margarita
This quantitative study explored the impact of literacy integration in a science inquiry classroom involving the use of science notebooks on the academic language development and conceptual understanding of students from diverse (i.e., English Language Learners, or ELLs) and low socio-economic status (low-SES) backgrounds. The study derived from a randomized, longitudinal, field-based NSF funded research project (NSF Award No. DRL - 0822343) targeting ELL and non-ELL students from low-SES backgrounds in a large urban school district in Southeast Texas. The study used a scoring rubric (modified and tested for validity and reliability) to analyze fifth-grade school students' science notebook entries. Scores for academic language quality (or, for brevity, language ) were used to compare language growth over time across three time points (i.e., beginning, middle, and end of the school year) and to compare students across categories (ELL, former ELL, non-ELL, and gender) using descriptive statistics and mixed between-within subjects analysis of variance (ANOVA). Scores for conceptual understanding (or, for brevity, concept) were used to compare students across categories (ELL, former ELL, non-ELL, and gender) in three domains using descriptive statistics and ANOVA. A correlational analysis was conducted to explore the relationship, if any, between language scores and concept scores for each group. Students demonstrated statistically significant growth over time in their academic language as reflected by science notebook scores. While ELL students scored lower than former ELL and non-ELL students at the first two time points, they caught up to their peers by the third time point. Similarly, females outperformed males in language scores in the first two time points, but males caught up to females in the third time point. In analyzing conceptual scores, ELLs had statistically significant lower scores than former-ELL and non-ELL students, and females outperformed males in the first two domains. These differences, however, were not statistically significant in the last domain. Last, correlations between language and concept scores were overall, positive, large, and significant across domains and groups. The study presents a rubric useful for quantifying diverse students' science notebook entries, and findings add to the sparse research on the impact of writing in diverse students' language development and conceptual understanding in science.
A novel examination of atypical major depressive disorder based on attachment theory.
Levitan, Robert D; Atkinson, Leslie; Pedersen, Rebecca; Buis, Tom; Kennedy, Sidney H; Chopra, Kevin; Leung, Eman M; Segal, Zindel V
2009-06-01
While a large body of descriptive work has thoroughly investigated the clinical correlates of atypical depression, little is known about its fundamental origins. This study examined atypical depression from an attachment theory framework. Our hypothesis was that, compared to adults with melancholic depression, those with atypical depression would report more anxious-ambivalent attachment and less secure attachment. As gender has been an important consideration in prior work on atypical depression, this same hypothesis was further tested in female subjects only. One hundred ninety-nine consecutive adults presenting to a tertiary mood disorders clinic with major depressive disorder with either atypical or melancholic features according to the Structured Clinical Interview for DSM-IV Axis-I Disorders were administered a self-report adult attachment questionnaire to assess the core dimensions of secure, anxious-ambivalent, and avoidant attachment. Attachment scores were compared across the 2 depressed groups defined by atypical and melancholic features using multivariate analysis of variance. The study was conducted between 1999 and 2004. When men and women were considered together, the multivariate test comparing attachment scores by depressive group was statistically significant at p < .05. Between-subjects testing indicated that atypical depression was associated with significantly lower secure attachment scores, with a trend toward higher anxious-ambivalent attachment scores, than was melancholia. When women were analyzed separately, the multivariate test was statistically significant at p < .01, with both secure and anxious-ambivalent attachment scores differing significantly across depressive groups. These preliminary findings suggest that attachment theory, and insecure and anxious-ambivalent attachment in particular, may be a useful framework from which to study the origins, clinical correlates, and treatment of atypical depression. Gender may be an important consideration when considering atypical depression from an attachment perspective. Copyright 2009 Physicians Postgraduate Press, Inc.
Assertiveness and problem solving in midwives.
Yurtsal, Zeliha Burcu; Özdemir, Levent
2015-01-01
Midwifery profession is required to bring solutions to problems and a midwife is expected to be an assertive person and to develop midwifery care. This study was planned to examine the relationship between assertiveness and problem-solving skills of midwives. This cross-sectional study was conducted with 201 midwives between July 2008 and February 2009 in the city center of Sivas. The Rathus Assertiveness Schedule (RAS) and Problem Solving Inventory (PSI) were used to determine the level of assertiveness and problem-solving skills of midwives. Statistical methods were used as mean, standard deviation, percentage, Student's T, ANOVA and Tukey HSD, Kruskal Wallis, Fisher Exact, Pearson Correlation and Chi-square tests and P < 0.05. The RAS mean scores and the PSI mean scores showed statistically significant differences in terms of a midwife's considering herself as a member of the health team, expressing herself within the health care team, being able to say "no" when necessary, cooperating with her colleagues, taking part in problem-solving skills training. A statistically significant negative correlation was found between the RAS and PSI scores. The RAS scores decreased while the problem-solving scores increased (r: -0451, P < 0.01). There were significant statistical differences between assertiveness levels and problem solving skills of midwives, and midwives who were assertive solved their problems better than did others. Assertiveness and problem-solving skills training will contribute to the success of the midwifery profession. Midwives able to solve problems, and display assertive behaviors will contribute to the development of midwifery profession.
Relationship between orofacial function, dentofacial morphology, and bite force in young subjects.
Marquezin, M C S; Gavião, M B D; Alonso, M B C C; Ramirez-Sotelo, L R; Haiter-Neto, F; Castelo, P M
2014-09-01
The aim was to evaluate the relationship between orofacial function, dentofacial morphology, and bite force in young subjects. Three hundred and sixteen subjects were divided according to dentition stage (early, intermediate, and late mixed and permanent dentition). Orofacial function was screened using the Nordic Orofacial Test-Screening (NOT-S). Orthodontic treatment need, bite force, lateral and frontal craniofacial dimensions and presence of sleep bruxism were also assessed. The results were submitted to descriptive statistics, normality and correlation tests, analysis of variance, and multiple linear regression to test the relationship between NOT-S scores and the studied independent variables. The variance of NOT-S scores between groups was not significant. The evaluation of the variables that significantly contributed to NOT-S scores variation showed that age and presence of bruxism related to higher NOT-S total scores, while the increase in overbite measurement and presence of closed lip posture related to lower scores. Bite force did not show a significant relationship with scores of orofacial dysfunction. No significant correlations between craniofacial dimensions and NOT-S scores were observed. Age and sleep bruxism were related to higher NOT-S scores, while the increase in overbite measurement and closed lip posture contributed to lower scores of orofacial dysfunction. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Prevalence of depressive symptoms among college students and the influence of sport activity.
Uglesić, Boran; Lasić, Davor; Zuljan-Cvitanović, Marija; Buković, Damir; Karelović, Deni; Delić-Brkljacić, Diana; Buković, Nevia; Radan, Mirjana
2014-03-01
The present study asses the prevalence of depressive symptoms among college students in Split, Croatia, and positive influence of sport activity on decreasing the depression symptoms. Authors screened all 664 college students of the first year of study. All of them were over the 18 years and the mean age was 19.4 +/- 1.2 years. There were 466 females (70.2%) and 178 (26.8%) males. They answered The Beck Depression Inventory (BDI) and questionnaire about their sport activity (no sport activity, recreational and active in sports). For the purpose of the analysis depressive symptoms were defined as a score of > 11. Chi-square and Mann-Whitney test were used for data analysis. 9.4% of the students had significant depression symptoms (score > 11). No one student had score > 26 (symptoms of major depression). Statistically significant lower score on BDI have students who are active in sports (score median = 3) compared to group of recreational (score median = 4) and in correlation to group who are not active in sports (score median = 5) (Kruskal-Wallis: p < 0.001). In the group of active in sports (N = 254) there are only 5.5% with depressions symptoms, while in the group of non active in sports (N = 60) are 18 depressive (chi2-test: p = 0,005). Females are statistically more depressed than males (chi2-test: p = 0.01). In the female group 49 (10.5%) are depressed, and in the male group are 9 (5%). Compared to gender in separate analysis we did not find correlation of decreasing depression symptoms and sport activity among males (chi2-test: p = 0.47), while in females we find that sport activity has significant effect (chi2-test: p = 0.026). Our results shoved moderate values of depression symptoms among college population in Split, Croatia. More females than males experienced depressive symptoms. While sport activity did not have significant influence on the depression in male population, it has significant influence in reducing the depression symptoms among females.
Measuring change in critical thinking skills of dental students educated in a PBL curriculum.
Pardamean, Bens
2012-04-01
This study measured the change in critical thinking skills of dental students educated in a problem-based learning (PBL) pedagogical method. The quantitative analysis was focused on measuring students' critical thinking skills achievement from their first through third years of dental education at the University of Southern California. This non-experimental evaluation was based on a volunteer sample of ninety-eight dental students who completed a demographics/academic questionnaire and a psychometric assessment known as the Health Sciences Reasoning Test (HSRT). The HSRT produced the overall critical thinking skills score. Additionally, the HSRT generated five subscale scores: analysis, inference, evaluation, deductive reasoning, and inductive reasoning. The results of this study concluded that the students showed no continuous and significant incremental improvement in their overall critical thinking skills score achievement during their PBL-based dental education. Except for the inductive reasoning score, this result was very consistent with the four subscale scores. Moreover, after performing the statistical adjustment on total score and subscale scores, no significant statistical differences were found among the three student groups. However, the results of this study found some aspects of critical thinking achievements that differed by categories of gender, race, English as first language, and education level.
Suicidal behaviour of Indian patients with obsessive compulsive disorder.
Dhyani, Mohan; Trivedi, Jitendra Kumar; Nischal, Anil; Sinha, Pramod Kumar; Verma, Subham
2013-04-01
The chronicity, distress, high rates of comorbidity and varying degree of non response to treatment in Obsessive Compulsive Disorder (OCD) may contribute to suicidal behavior. There is relatively little information on suicidal behavior in OCD subjects. Our study design is Single point non-invasive, cross sectional, clinical study of new and follow up cases. Assessment of Suicidal Behavior in patients of OCD attending the adult Psychiatry O.P.D. of Chatrapati Shahuji Maharaj Medical University (CSMMU) U.P. Lucknow using (DSM-IV) criteria for diagnosis of Obsessive Compulsive Disorder, Structured Clinical Interview for DSM-IV Axis-I disorders, Yale Brown Obsessive Compulsive Rating Scale, Scale for Suicidal Ideation (SSI), Beck's Hopelessness Scale (BHS). Mean standard deviation and t test for independent samples, Pearson's correlation coefficient. Statistically significant differences were seen in the SSI score between the "Clinical" and "Sub-Clinical" cases with Clinical group having higher scores. Value of correlation coefficient between YBOCS score and SSI and BHS score is positive and statistically significant (P<0.01). "Clinical" group of patients had significantly higher scores of suicidal ideation measured by Scale of Suicidal Ideation (SSI). There was a significantly positive correlation between disease severity (YBOCS Score) and degree of suicidal ideation (SIS Score).
Sun, Benjamin C; Laurie, Amber; Fu, Rongwei; Ferencik, Maros; Shapiro, Michael; Lindsell, Christopher J; Diercks, Deborah; Hoekstra, James W; Hollander, Judd E; Kirk, J Douglas; Peacock, W Frank; Anantharaman, Venkataraman; Pollack, Charles V
2016-03-01
The emergency department evaluation for suspected acute coronary syndrome (ACS) is common, costly, and challenging. Risk scores may help standardize clinical care and screening for research studies. The Thrombolysis in Myocardial Infarction (TIMI) and HEART are two commonly cited risk scores. We tested the null hypothesis that the TIMI and HEART risk scores have equivalent test characteristics. We analyzed data from the Internet Tracking Registry of Acute Coronary Syndromes (i*trACS) from 9 EDs on patients with suspected ACS, 1999-2001. We excluded patients with an emergency department diagnosis consistent with ACS, or without sufficient data to calculate TIMI and HEART scores. The primary outcome was 30-day major adverse cardiovascular events, including all-cause death, acute myocardial infarction, and urgent revascularization. We describe test characteristics of the TIMI and HEART risk scores. The study cohort included 8255 patients with 508 (6.2%) 30-day major adverse cardiovascular events. Receiver operating curve and reclassification analyses favored HEART [c statistic: 0.753, 95% confidence interval (CI): 0.733-0.773; continuous net reclassification improvement: 0.608, 95% CI: 0.527-0.689] over TIMI (c statistic: 0.678, 95% CI: 0.655-0.702). A HEART score 0-3 [negative predictive value (NPV) 0.982, 95% CI: 0.978-0.986; positive predictive value (PPV) 0.103, 95% CI: 0.094-0.113; likelihood ratio (LR) positive 1.76; LR negative 0.28] demonstrates similar or superior NPV/PPV/LR compared with TIMI = 0 (NPV 0.978, 95% CI: 0.971-0.983; PPV 0.077, 95% CI: 0.071-0.084; LR positive 1.28; LR negative 0.35) and TIMI = 0-1 (NPV 0.963, 95% CI: 0.958-0.968; PPV 0.102, 95% CI: 0.092-0.113; LR positive 1.73; LR negative 0.58). The HEART score has better discrimination than TIMI and outperforms TIMI within previously published "low-risk" categories.
Bennett, Rebecca J; Taljaard, Dunay S; Olaithe, Michelle; Brennan-Jones, Chris; Eikelboom, Robert H
2017-09-18
The purpose of this study is to raise awareness of interobserver concordance and the differences between interobserver reliability and agreement when evaluating the responsiveness of a clinician-administered survey and, specifically, to demonstrate the clinical implications of data types (nominal/categorical, ordinal, interval, or ratio) and statistical index selection (for example, Cohen's kappa, Krippendorff's alpha, or interclass correlation). In this prospective cohort study, 3 clinical audiologists, who were masked to each other's scores, administered the Practical Hearing Aid Skills Test-Revised to 18 adult owners of hearing aids. Interobserver concordance was examined using a range of reliability and agreement statistical indices. The importance of selecting statistical measures of concordance was demonstrated with a worked example, wherein the level of interobserver concordance achieved varied from "no agreement" to "almost perfect agreement" depending on data types and statistical index selected. This study demonstrates that the methodology used to evaluate survey score concordance can influence the statistical results obtained and thus affect clinical interpretations.
Statistical innovations in the medical device world sparked by the FDA.
Campbell, Gregory; Yue, Lilly Q
2016-01-01
The world of medical devices while highly diverse is extremely innovative, and this facilitates the adoption of innovative statistical techniques. Statisticians in the Center for Devices and Radiological Health (CDRH) at the Food and Drug Administration (FDA) have provided leadership in implementing statistical innovations. The innovations discussed include: the incorporation of Bayesian methods in clinical trials, adaptive designs, the use and development of propensity score methodology in the design and analysis of non-randomized observational studies, the use of tipping-point analysis for missing data, techniques for diagnostic test evaluation, bridging studies for companion diagnostic tests, quantitative benefit-risk decisions, and patient preference studies.
Training improves laparoscopic tasks performance and decreases operator workload.
Hu, Jesse S L; Lu, Jirong; Tan, Wee Boon; Lomanto, Davide
2016-05-01
It has been postulated that increased operator workload during task performance may increase fatigue and surgical errors. The National Aeronautics and Space Administration-Task Load Index (NASA-TLX) is a validated tool for self-assessment for workload. Our study aims to assess the relationship of workload and performance of novices in simulated laparoscopic tasks of different complexity levels before and after training. Forty-seven novices without prior laparoscopic experience were recruited in a trial to investigate whether training improves task performance as well as mental workload. The participants were tested on three standard tasks (ring transfer, precision cutting and intracorporeal suturing) in increasing complexity based on the Fundamentals of Laparoscopic Surgery (FLS) curriculum. Following a period of training and rest, participants were tested again. Test scores were computed from time taken and time penalties for precision errors. Test scores and NASA-TLX scores were recorded pre- and post-training and analysed using paired t tests. One-way repeated measures ANOVA was used to analyse differences in NASA-TLX scores between the three tasks. NASA-TLX score was lowest with ring transfer and highest with intracorporeal suturing. This was statistically significant in both pre-training (p < 0.001) and post-training (p < 0.001). NASA-TLX scores mirror the changes in test scores for the three tasks. Workload scores decreased significantly after training for all three tasks (ring transfer = 2.93, p < 0.001, precision cutting = 3.74, p < 0.001, intracorporeal suturing = 2.98, p < 0.001). NASA-TLX score is an accurate reflection of the complexity of simulated laparoscopic tasks in the FLS curriculum. This also correlates with the relationship of test scores between the three tasks. Simulation training improves both performance score and workload score across the tasks.
Can patients interpret health information? An assessment of the medical data interpretation test.
Schwartz, Lisa M; Woloshin, Steven; Welch, H Gilbert
2005-01-01
To establish the reliability/validity of an 18-item test of patients' medical data interpretation skills. Survey with retest after 2 weeks. Subjects. 178 people recruited from advertisements in local newspapers, an outpatient clinic, and a hospital open house. The percentage of correct answers to individual items ranged from 20% to 87%, and medical data interpretation test scores (on a 0- 100 scale) were normally distributed (median 61.1, mean 61.0, range 6-94). Reliability was good (test-retest correlation=0.67, Cronbach's alpha=0.71). Construct validity was supported in several ways. Higher scores were found among people with highest versus lowest numeracy (71 v. 36, P<0.001), highest quantitative literacy (65 v. 28, P<0.001), and highest education (69 v. 42, P=0.004). Scores for 15 physician experts also completing the survey were significantly higher than participants with other postgraduate degrees (mean score 89 v. 69, P<0.001). The medical data interpretation test is a reliable and valid measure of the ability to interpret medical statistics.
ERIC Educational Resources Information Center
Molenaar, Dylan; Dolan, Conor V.; de Boeck, Paul
2012-01-01
The Graded Response Model (GRM; Samejima, "Estimation of ability using a response pattern of graded scores," Psychometric Monograph No. 17, Richmond, VA: The Psychometric Society, 1969) can be derived by assuming a linear regression of a continuous variable, Z, on the trait, [theta], to underlie the ordinal item scores (Takane & de Leeuw in…
Empathy and burnout: an analytic cross-sectional study among nurses and nursing students.
Ferri, Paola; Guerra, Eleonora; Marcheselli, Luigi; Cunico, Laura; Di Lorenzo, Rosaria
2015-09-09
Empathy is an essential element of good nursing care associated with increased patient satisfaction. Burnout represents chronic occupational stress which diminishes interest in work and reduces patient safety and satisfaction. The purpose of this study was to evaluate the correlation between empathy and burnout in nursing students and nurses. This cross-sectional research was conducted in a sample of 298 nurses and 115 nursing students. Socio-demographic and career information was collected. Balanced Emotional Empathy Scale (BEES) and Maslach Burnout Inventory (MBI) were administered. Data were statistically analysed. 63% of our sample answered questionnaires (54% of nurses and 84% of students). The BEES global mean score was slightly inferior to empathy cut-off of 32. In the student group, two BEES dimension scores were statistically significantly higher than nurses (p=0.011 and p=0.007 respectively, t-test). Empathy was negatively related to age (p=0.001, ANOVA). Emotional exhaustion (EE) scores of MBI reported statistically significantly lower levels for students (p<0.0001, t-test). EE was negatively related to BEES mean total score in students (r=-0.307, p<0.002) and nurses (r=-0.245, p<0.002), personal accomplishment of MBI presented positive correlation with BEES mean total scores in students (r=0.319, p<0.002) and nurses (r=0.266, p<0.001, Pearson's correlation). Female students showed superior empathy capacity in comparison to male students in all 5 dimensions of BEES (p<0.001), whereas females nurses in only one dimension (p<0.001). Our data suggest empathy declines with age and career. High levels of empathy can be protective against burnout development, which, when presents, reduces empathy.
Comparative Evaluation of Neem Mouthwash on Plaque and Gingivitis: A Double-blind Crossover Study.
Jalaluddin, Md; Rajasekaran, U B; Paul, Sam; Dhanya, R S; Sudeep, C B; Adarsh, V J
2017-07-01
The present study aimed at evaluating the impact of neem-containing mouthwash on plaque and gingivitis. This randomized, double-blinded, crossover clinical trial included 40 participants aged 18 to 35 years with washout period of 1 week between the crossover phases. A total of 20 participants, each randomly allocated into groups I and II, wherein in the first phase, group I was provided with 0.2% chlorhexidine gluconate and group II with 2% neem mouthwash. After the scores were recorded, 1-week time period was given to the participants to carry over the effects of the mouthwashes and then the second phase of the test was performed. The participants were instructed to use the other mouthwash through the second test phase. There was a slight reduction of plaque level in the first phase as well as in the second phase. When comparison was made between the groups, no statistically significant difference was seen. Both the groups showed reduction in the gingival index (GI) scores in the first phase, and there was a statistically significant difference in both groups at baseline and after intervention (0.005 and 0.01 respectively). In the second phase, GI scores were reduced in both groups, but there was a statistically significant difference between the groups only at baseline scores (0.01). In the present study, it has been concluded that neem mouthwash can be used as an alternative to chlorhexidine mouthwash based on the reduced scores in both the groups. Using neem mouthwash in maintaining oral hygiene might have a better impact in prevention as well as pervasiveness of oral diseases as it is cost-effective and easily available.
Wahner-Roedler, Dietlind L.; Thompson, Jeffrey M.; Luedtke, Connie A.; King, Susan M.; Cha, Stephen S.; Elkin, Peter L.; Bruce, Barbara K.; Townsend, Cynthia O.; Bergeson, Jody R.; Eickhoff, Andrea L.; Loehrer, Laura L.; Sood, Amit; Bauer, Brent A.
2011-01-01
Most patients with fibromyalgia use complementary and alternative medicine (CAM). Properly designed controlled trials are necessary to assess the effectiveness of these practices. This study was a randomized, double-blind, placebo-controlled, early phase trial. Fifty patients seen at a fibromyalgia outpatient treatment program were randomly assigned to a daily soy or placebo (casein) shake. Outcome measures were scores of the Fibromyalgia Impact Questionnaire (FIQ) and the Center for Epidemiologic Studies Depression Scale (CES-D) at baseline and after 6 weeks of intervention. Analysis was with standard statistics based on the null hypothesis, and separation test for early phase CAM comparative trials. Twenty-eight patients completed the study. Use of standard statistics with intent-to-treat analysis showed that total FIQ scores decreased by 14% in the soy group (P = .02) and by 18% in the placebo group (P < .001). The difference in change in scores between the groups was not significant (P = .16). With the same analysis, CES-D scores decreased in the soy group by 16% (P = .004) and in the placebo group by 15% (P = .05). The change in scores was similar in the groups (P = .83). Results of statistical analysis using the separation test and intent-to-treat analysis revealed no benefit of soy compared with placebo. Shakes that contain soy and shakes that contain casein, when combined with a multidisciplinary fibromyalgia treatment program, provide a decrease in fibromyalgia symptoms. Separation between the effects of soy and casein (control) shakes did not favor the intervention. Therefore, large-sample studies using soy for patients with fibromyalgia are probably not indicated. PMID:18990724
Statistical power as a function of Cronbach alpha of instrument questionnaire items.
Heo, Moonseong; Kim, Namhee; Faith, Myles S
2015-10-14
In countless number of clinical trials, measurements of outcomes rely on instrument questionnaire items which however often suffer measurement error problems which in turn affect statistical power of study designs. The Cronbach alpha or coefficient alpha, here denoted by C(α), can be used as a measure of internal consistency of parallel instrument items that are developed to measure a target unidimensional outcome construct. Scale score for the target construct is often represented by the sum of the item scores. However, power functions based on C(α) have been lacking for various study designs. We formulate a statistical model for parallel items to derive power functions as a function of C(α) under several study designs. To this end, we assume fixed true score variance assumption as opposed to usual fixed total variance assumption. That assumption is critical and practically relevant to show that smaller measurement errors are inversely associated with higher inter-item correlations, and thus that greater C(α) is associated with greater statistical power. We compare the derived theoretical statistical power with empirical power obtained through Monte Carlo simulations for the following comparisons: one-sample comparison of pre- and post-treatment mean differences, two-sample comparison of pre-post mean differences between groups, and two-sample comparison of mean differences between groups. It is shown that C(α) is the same as a test-retest correlation of the scale scores of parallel items, which enables testing significance of C(α). Closed-form power functions and samples size determination formulas are derived in terms of C(α), for all of the aforementioned comparisons. Power functions are shown to be an increasing function of C(α), regardless of comparison of interest. The derived power functions are well validated by simulation studies that show that the magnitudes of theoretical power are virtually identical to those of the empirical power. Regardless of research designs or settings, in order to increase statistical power, development and use of instruments with greater C(α), or equivalently with greater inter-item correlations, is crucial for trials that intend to use questionnaire items for measuring research outcomes. Further development of the power functions for binary or ordinal item scores and under more general item correlation strutures reflecting more real world situations would be a valuable future study.
Agranovich, Anna V; Panter, A T; Puente, Antonio E; Touradji, Pegah
2011-07-01
Cultural differences in time attitudes and their effect on timed neuropsychological test performance were examined in matched non-clinical samples of 100 Russian and American adult volunteers using 8 tests that were previously reported to be relatively free of cultural bias: Color Trails Test (CTT); Ruff Figural Fluency Test (RFFT); Symbol Digit Modalities Test (SDMT); and Tower of London-Drexel Edition (ToL(Dx)). A measure of time attitudes, the Culture of Time Inventory (COTI-33) was used to assess time attitudes potentially affecting time-limited testing. Americans significantly outscored Russians on CTT, SDMT, and ToL(Dx) (p,.05) while differences in RFFT scores only approached statistical significance. Group differences also emerged in COTI-33 factor scores, which partially mediated differences in performance on CTT-1, SDMT, and ToL(Dx) initiation time, but did not account for the effect of culture on CTT-2. Significant effect of culture was revealed in ratings of familiarity with testing procedures that was negatively related to CTT, ToL(Dx), and SDMT scores. Current findings indicated that attitudes toward time may influence results of time limited testing and suggested that individuals who lack familiarity with timed testing procedures tend to obtain lower scores on timed tests.
Lifestyle in Visually Impaired or Blind Massage Therapists: A Preliminary Study.
Hung, Shu-Ling; Chen, Mei-Fang; Lin, Yu-Hua; Kao, Chia-Chan; Chang, Ya-Wen; Chan, Hui-Shan
2017-11-09
Lifestyle is among the most important factors affecting individual health status. Limited access to health information may limit the ability of people with visual impairment or blindness to practice healthy lifestyles. However, no studies have investigated how lifestyle practices affect health specifically in visually impaired and blind populations. The aim of this study was to investigate the lifestyle behaviors of visually impaired and blind massage therapists (VIBMTs) in Taiwan. This exploratory study used a purposive sampling technique to recruit 50 VIBMTs who were employed at massage stations in southern Taiwan. All of the participants completed the Health-Promoting Lifestyle Profile II (HPLP-II) and a survey of demographic characteristics. Descriptive and inferential statistical tests, including the Mann-Whitney U test and the Kruskal-Wallis H test, were used. Statistical significance was defined as p < .05 in two-tailed tests. Fifty participants completed both the HPLP-II and the demographic survey. The mean subscale score for the HPLP-II was 2.52 ± 0.37. The lowest scores were on the physical activity (2.09 ± 0.67) and nutrition (2.35 ± 0.39) subscales, and the highest scores were on the spiritual growth (2.89 ± 0.56) and interpersonal relations (2.79 ± 0.46) subscales. Scores on the stress management and physical activity subscales were significantly higher in men than in women (p < .05). In addition, mean HPLP-II scores were significantly higher in VIBMTs who exercised regularly compared with those who did not (p < .05). Compared with nonsmokers, current smokers had significantly higher scores on the stress management subscale (p < .05). The low physical activity scores in this population may be improved by developing physical activity programs for the home and workplace and by establishing community recreational and exercise facilities for visually impaired populations. The low scores for nutrition may be improved by establishing nutrition education programs that are designed specifically for VIBMTs to increase their consumption of fresh produce and other healthy foods and by requiring food manufacturers to use labels that may be easily read or understood by visually impaired populations.
Clauson, Kevin A; Polen, Hyla H; Peak, Amy S; Marsh, Wallace A; DiScala, Sandra L
2008-11-01
Clinical decision support tools (CDSTs) on personal digital assistants (PDAs) and online databases assist healthcare practitioners who make decisions about dietary supplements. To assess and compare the content of PDA dietary supplement databases and their online counterparts used as CDSTs. A total of 102 question-and-answer pairs were developed within 10 weighted categories of the most clinically relevant aspects of dietary supplement therapy. PDA versions of AltMedDex, Lexi-Natural, Natural Medicines Comprehensive Database, and Natural Standard and their online counterparts were assessed by scope (percent of correct answers present), completeness (3-point scale), ease of use, and a composite score integrating all 3 criteria. Descriptive statistics and inferential statistics, including a chi(2) test, Scheffé's multiple comparison test, McNemar's test, and the Wilcoxon signed rank test were used to analyze data. The scope scores for PDA databases were: Natural Medicines Comprehensive Database 84.3%, Natural Standard 58.8%, Lexi-Natural 50.0%, and AltMedDex 36.3%, with Natural Medicines Comprehensive Database statistically superior (p < 0.01). Completeness scores were: Natural Medicines Comprehensive Database 78.4%, Natural Standard 51.0%, Lexi-Natural 43.5%, and AltMedDex 29.7%. Lexi-Natural was superior in ease of use (p < 0.01). Composite scores for PDA databases were: Natural Medicines Comprehensive Database 79.3, Natural Standard 53.0, Lexi-Natural 48.0, and AltMedDex 32.5, with Natural Medicines Comprehensive Database superior (p < 0.01). There was no difference between the scope for PDA and online database pairs with Lexi-Natural (50.0% and 53.9%, respectively) or Natural Medicines Comprehensive Database (84.3% and 84.3%, respectively) (p > 0.05), whereas differences existed for AltMedDex (36.3% vs 74.5%, respectively) and Natural Standard (58.8% vs 80.4%, respectively) (p < 0.01). For composite scores, AltMedDex and Natural Standard online were better than their PDA counterparts (p < 0.01). Natural Medicines Comprehensive Database achieved significantly higher scope, completeness, and composite scores compared with other dietary supplement PDA CDSTs in this study. There was no difference between the PDA and online databases for Lexi-Natural and Natural Medicines Comprehensive Database, whereas online versions of AltMedDex and Natural Standard were significantly better than their PDA counterparts.
ERIC Educational Resources Information Center
New South Wales Dept. of Education, Sydney (Australia).
Continuing a series of short tests aimed at measuring student mastery of specific skills in the natural sciences, this supplementary volume includes teachers' notes, a users' guide and inspection copies of test items 27 to 50. Answer keys and test scoring statistics are provided. The items are designed for grades 7 through 10, and a list of the…
Spreckelsen, C; Juenger, J
2017-09-26
Adequate estimation and communication of risks is a critical competence of physicians. Due to an evident lack of these competences, effective training addressing risk competence during medical education is needed. Test-enhanced learning has been shown to produce marked effects on achievements. This study aimed to investigate the effect of repeated tests implemented on top of a blended learning program for risk competence. We introduced a blended-learning curriculum for risk estimation and risk communication based on a set of operationalized learning objectives, which was integrated into a mandatory course "Evidence-based Medicine" for third-year students. A randomized controlled trial addressed the effect of repeated testing on achievement as measured by the students' pre- and post-training score (nine multiple-choice items). Basic numeracy and statistical literacy were assessed at baseline. Analysis relied on descriptive statistics (histograms, box plots, scatter plots, and summary of descriptive measures), bootstrapped confidence intervals, analysis of covariance (ANCOVA), and effect sizes (Cohen's d, r) based on adjusted means and standard deviations. All of the 114 students enrolled in the course consented to take part in the study and were assigned to either the intervention or control group (both: n = 57) by balanced randomization. Five participants dropped out due to non-compliance (control: 4, intervention: 1). Both groups profited considerably from the program in general (Cohen's d for overall pre vs. post scores: 2.61). Repeated testing yielded an additional positive effect: while the covariate (baseline score) exhibits no relation to the post-intervention score, F(1, 106) = 2.88, p > .05, there was a significant effect of the intervention (repeated tests scenario) on learning achievement, F(1106) = 12.72, p < .05, d = .94, r = .42 (95% CI: [.26, .57]). However, in the subgroup of participants with a high initial numeracy score no similar effect could be observed. Dedicated training can improve relevant components of risk competence of medical students. An already promising overall effect of the blended learning approach can be improved significantly by implementing a test-enhanced learning design, namely repeated testing. As students with a high initial numeracy score did not profit equally from repeated testing, target-group specific opt-out may be offered.
NASA Astrophysics Data System (ADS)
Keown, Sandra L.
This study was devised to determine effects of the use of interactive thematic organizers and concept maps in middle school science classes during a unit study on minerals. The design, a pretest-posttest control group, consisted of matched groups (three experimental groups and one comparison group). It also included a student survey assessing qualitative aspects of the investigation. The 67 6th-grade students and one science teacher who participated in the study were from an independent K-12 school. Students represented a normal, well-distributed range of abilities. Group I (control) proceeded with their usual method of studying a unit---reading aloud the text and answering workbook questions. Group II worked with interactive thematic organizers, designed to activate prior knowledge and help students make inferences about target concepts in three treatments. Group III created three interactive concept maps, which represented both understandings and misconceptions. Concept maps were reviewed and repaired as students completed each treatment. Group IV participated in both thematic organizer and concept map treatments. Statistical analyses were determined through a pretest and a delayed recall posttest essay for all four groups. Two scores were assigned---one quantitative raw score of correct explicit answers and one rubric score based on the quality of interpretive responses. Group II also received scores for thematic organizer responses. Group III received rubric scores for concept maps. Group IV received all possible scores. Paired t-tests reported comparisons of scores across the treatment groups. A linear regression indicated whether or not concept map misconceptions affected posttest scores. Finally, an ANCOVA reported statistical significance across the four treatment groups. Findings of data analysis indicated statistically significant improvement in posttest scores among students in the three experimental groups. Students who participated in both treatments represented the highest scores among the four groups. Results of the ANCOVA indicated there was statistically significant difference in scores among the four treatments. Recommendations were made to further investigate development of interactive thematic organizers with student-chosen hyperlinks to concepts, as well as a recommendation that researchers investigate teacher understandings of interpretive purpose and form in the creation of thematic organizers.
Dixon, Donna
2015-04-01
Previous studies by the author showed differences in preadmission variables and Comprehensive Osteopathic Medical Licensing Examination-USA (COMLEX-USA) scores between women and men at the New York Institute of Technology College of Osteopathic Medicine (NYIT-COM). It is pertinent to reexamine the preadmission variables, medical school performance, and COMLEX-USA scores of women and men to determine whether these differences still exist. To examine the relationship between student sex and performance on COMLEX-USA Level 1 and Level 2-Cognitive Evaluation (CE), performance during medical school, and preadmission academic variables at NYIT-COM. Scores on COMLEX-USA Level 1 and COMLEX-USA Level 2-CE, grades in all courses taken during the first 2 years of medical school, the National Board of Osteopathic Medical Examiners' clinical science subject examination scores, Medical College Admission Test (MCAT) scores, and undergraduate grade point averages (GPAs) were compared between women and men in the classes graduating between 2009 and 2012. Data from 748 students were analyzed. Men had statistically significantly higher scores than women on COMLEX-USA Level 1 in 2009 (540 vs 500; P<.001) and 2010 (537 vs 496; P<.001). No statistically significant difference in COMLEX-USA Level 2-CE scores was found between women and men. The performance of women and men was comparable during the first 2 years of medical school and on clinical science subject examinations in years 3 and 4. Men had statistically significantly higher MCAT scores than women, but no statistically significant differences were found between women's and men's undergraduate GPAs. Men were found to have higher scores than women on COMLEX-USA Level 1 and the MCAT. However, the reasons behind these data have yet to be elucidated. Although a stronger background in basic science could explain the discrepancy in scores between women and men, women were found to have equally high science GPAs and performed comparably to men in osteopathic medical school. The results were in agreement with previous studies at NYIT-COM.
FLiGS Score: A New Method of Outcome Assessment for Lip Carcinoma–Treated Patients
Grassi, Rita; Toia, Francesca; Di Rosa, Luigi; Cordova, Adriana
2015-01-01
Background: Lip cancer and its treatment have considerable functional and cosmetic effects with resultant nutritional and physical detriments. As we continue to investigate new treatment regimens, we are simultaneously required to assess postoperative outcomes to design interventions that lessen the adverse impact of this disease process. We wish to introduce Functional Lip Glasgow Scale (FLiGS) score as a new method of outcome assessment to measure the effect of lip cancer and its treatment on patients’ daily functioning. Methods: Fifty patients affected by lip squamous cell carcinoma were recruited between 2009 and 2013. Patients were asked to fill the FLiGS questionnaire before surgery, 1 month, 6 months, and 1 year after surgery. The subscores were used to calculate a total FLiGS score of global oral disability. Statistical analysis was performed to test validity and reliability. Results: FLiGS scores improved significantly from preoperative to 12 months postoperative values (P = 0.000). Statistical evidence of validity was provided through rs (Spearman correlation coefficient) that resulted >0.30 for all surveys and for which P < 0.001. FLiGS score reliability was shown through examination of internal consistency and test-retest reliability. Conclusions: FLiGS score is a simple way of assessing functional impairment related to lip cancer before and after surgery; it is sensitive, valid, reliable, and clinically relevant: it provides useful information to orient the physician in the postoperative management and in the rehabilitation program. PMID:26034652
Validation of the Narrowing Beam Walking Test in Lower Limb Prosthesis Users.
Sawers, Andrew; Hafner, Brian
2018-04-11
To evaluate the content, construct, and discriminant validity of the Narrowing Beam Walking Test (NBWT), a performance-based balance test for lower limb prosthesis users. Cross-sectional study. Research laboratory and prosthetics clinic. Unilateral transtibial and transfemoral prosthesis users (N=40). Not applicable. Content validity was examined by quantifying the percentage of participants receiving maximum or minimum scores (ie, ceiling and floor effects). Convergent construct validity was examined using correlations between participants' NBWT scores and scores or times on existing clinical balance tests regularly administered to lower limb prosthesis users. Known-groups construct validity was examined by comparing NBWT scores between groups of participants with different fall histories, amputation levels, amputation etiologies, and functional levels. Discriminant validity was evaluated by analyzing the area under each test's receiver operating characteristic (ROC) curve. No minimum or maximum scores were recorded on the NBWT. NBWT scores demonstrated strong correlations (ρ=.70‒.85) with scores/times on performance-based balance tests (timed Up and Go test, Four Square Step Test, and Berg Balance Scale) and a moderate correlation (ρ=.49) with the self-report Activities-specific Balance Confidence scale. NBWT performance was significantly lower among participants with a history of falls (P=.003), transfemoral amputation (P=.011), and a lower mobility level (P<.001). The NBWT also had the largest area under the ROC curve (.81) and was the only test to exhibit an area that was statistically significantly >.50 (ie, chance). The results provide strong evidence of content, construct, and discriminant validity for the NBWT as a performance-based test of balance ability. The evidence supports its use to assess balance impairments and fall risk in unilateral transtibial and transfemoral prosthesis users. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Kisala, Pamela A; Tulsky, David S; Kalpakjian, Claire Z; Heinemann, Allen W; Pohlig, Ryan T; Carle, Adam; Choi, Seung W
2015-05-01
To develop a calibrated item bank and computer adaptive test to assess anxiety symptoms in individuals with spinal cord injury (SCI), transform scores to the Patient Reported Outcomes Measurement Information System (PROMIS) metric, and create a statistical linkage with the Generalized Anxiety Disorder (GAD)-7, a widely used anxiety measure. Grounded-theory based qualitative item development methods; large-scale item calibration field testing; confirmatory factor analysis; graded response model item response theory analyses; statistical linking techniques to transform scores to a PROMIS metric; and linkage with the GAD-7. Setting Five SCI Model System centers and one Department of Veterans Affairs medical center in the United States. Participants Adults with traumatic SCI. Spinal Cord Injury-Quality of Life (SCI-QOL) Anxiety Item Bank Seven hundred sixteen individuals with traumatic SCI completed 38 items assessing anxiety, 17 of which were PROMIS items. After 13 items (including 2 PROMIS items) were removed, factor analyses confirmed unidimensionality. Item response theory analyses were used to estimate slopes and thresholds for the final 25 items (15 from PROMIS). The observed Pearson correlation between the SCI-QOL Anxiety and GAD-7 scores was 0.67. The SCI-QOL Anxiety item bank demonstrates excellent psychometric properties and is available as a computer adaptive test or short form for research and clinical applications. SCI-QOL Anxiety scores have been transformed to the PROMIS metric and we provide a method to link SCI-QOL Anxiety scores with those of the GAD-7.
Crnošija, Luka; Krbot Skorić, Magdalena; Gabelić, Tereza; Adamec, Ivan; Brinar, Vesna; Habek, Mario
2015-12-15
To investigate the correlation of the vestibular evoked myogenic potential (VEMP) score with Timed 25-Foot Walk (T25FW), 9-Hole Peg Test (9HPT), Paced Auditory Serial Addition Test (PASAT) and EDSS in patients with multiple sclerosis (MS). This prospective, cross sectional study included 52 patients with clinically isolated syndrome (CIS). Cervical VEMP (cVEMP) and ocular VEMP (oVEMP), analyzed in the form of the cVEMP, oVEMP and VEMP scores, T25FW, 9HPT, PASAT and Expanded Disability Status Scale (EDSS) were performed. The only predictor of walking impairment in this study was general disability as measured by the EDSS, after controlling for age, gender, PASAT and EDSS the effect of VEMP score was non-significant (p=0.419). 9HPT of the dominant hand did not correlate with the oVEMP score (rs=0.258, p=0.065), however after controlling for age, gender, PASAT and EDSS, the effect of the oVEMP score on 9HPT of the dominant hand was statistically significant (p=0.017). After controlling for age, gender and oVEMP score, the effect of the PASAT on 9HPT variable for the non-dominant hand was statistically significant (p=0.001). We found possible effects of brainstem dysfunction on walking impairment, however they were not seen after correction for EDSS and cognitive dysfunction. On the other hand, dominant hand function seems to be influenced by upper brainstem dysfunction measured with oVEMP, while cognitive dysfunction is related to non-dominant hand function. Copyright © 2015 Elsevier B.V. All rights reserved.
Pharmacy Students' Test-Taking Motivation-Effort on a Low-Stakes Standardized Test
2011-01-01
Objective To measure third-year pharmacy students' level of motivation while completing the Pharmacy Curriculum Outcomes Assessment (PCOA) administered as a low-stakes test to better understand use of the PCOA as a measure of student content knowledge. Methods Student motivation was manipulated through an incentive (ie, personal letter from the dean) and a process of statistical motivation filtering. Data were analyzed to determine any differences between the experimental and control groups in PCOA test performance, motivation to perform well, and test performance after filtering for low motivation-effort. Results Incentivizing students diminished the need for filtering PCOA scores for low effort. Where filtering was used, performance scores improved, providing a more realistic measure of aggregate student performance. Conclusions To ensure that PCOA scores are an accurate reflection of student knowledge, incentivizing and/or filtering for low motivation-effort among pharmacy students should be considered fundamental best practice when the PCOA is administered as a low-stakes test PMID:21655395
Impact of a weekly reading program on orthopedic surgery residents' in-training examination.
Weglein, Daniel G; Gugala, Zbigniew; Simpson, Suzanne; Lindsey, Ronald W
2015-05-01
In response to a decline in individual residents' performance and overall program performance on the Orthopaedic In-Training Examination (OITE), the authors' department initiated a daily literature reading program coupled with weekly tests on the assigned material. The goal of this study was to assess the effect of the reading program on individual residents' scores and the training program's OITE scores. The reading program consisted of daily review articles from the Journal of the American Academy of Orthopaedic Surgeons, followed by a weekly written examination consisting of multiple-choice or fill-in-the-blank questions. All articles were selected and all questions were written by the departmental chair. A questionnaire was given to assess residents' perceptions of the weekly tests. As a result of implementing the reading program for a 10-month period, residents' subsequent performance on the OITE significantly improved (mean score increase, 4, P<.0001; percentile score increase, 11, P=.0007). The difference in mean score was significant for residents in postgraduate years 3, 4, and 5. A statistically significant correlation was found between weekly test scores and performance on the OITE, with a significant correlation between weekly test scores and OITE percentile ranking. The study results also showed a positive correlation between reading test attendance and weekly test scores. Residents' anonymous questionnaire responses also demonstrated the reading program to be a valuable addition to the residency training curriculum. In conclusion, the study strongly supports the benefits of a weekly reading and examination program in enhancing the core knowledge of orthopedic surgery residents. Copyright 2015, SLACK Incorporated.
Daylighting Makes a Difference.
ERIC Educational Resources Information Center
Heschong, Lisa; Knecht, Carey
2002-01-01
Examined the role of daylight in student achievement in three schools and found a uniformly positive and statistically significant correlation between the presence of more daylight and better student test scores. Offers guidelines on designing daylit classrooms. (EV)
Bodenburg, Sebastian; Dopslaff, Nina
2008-01-01
The Dysexecutive Questionnaire (DEX, , Behavioral assessment of the dysexecutive syndrome, 1996) is a standardized instrument to measure possible behavioral changes as a result of the dysexecutive syndrome. Although initially intended only as a qualitative instrument, the DEX has also been used increasingly to address quantitative problems. Until now there have not been more fundamental statistical analyses of the questionnaire's testing quality. The present study is based on an unselected sample of 191 patients with acquired brain injury and reports on the data relating to the quality of the items, the reliability and the factorial structure of the DEX. Item 3 displayed too great an item difficulty, whereas item 11 was not sufficiently discriminating. The DEX's reliability in self-rating is r = 0.85. In addition to presenting the statistical values of the tests, a clinical severity classification of the overall scores of the 4 found factors and of the questionnaire as a whole is carried out on the basis of quartile standards.
Avşar, Fatma; Ayaz Alkaya, Sultan
The aim of this study was to determine the effectiveness of an assertive training for school-aged children on peer bullying and assertiveness. A quasi-experimental design using pre- and post-testing was conducted. Data were collected using a demographic questionnaire, an assertiveness scale, and the peer victimization scale. The training program was comprised of eight sessions which were implemented to intervention group. Descriptive characteristics were not statistically different between the groups (p>0.05). The peer victimization victim dimension results show that post-test mean scores of the students in the intervention group were lower than the pre-test mean scores (p<0.05). For the control group, no significant change was found in the pre-test and post-test mean scores (p>0.05). A comparison of the mean pre-test/post-test scores of peer-victimization bully dimension of the students' intervention and control groups revealed that the mean post-test scores of the students in the each group decreased (p>0.05). An assertiveness training program increased the assertiveness level and reduced the state of being victims, but did not affect the state of being bullies. The results of this study can help children acquire assertive behaviors instead of negative behaviors such as aggression and shyness, and help them to build effective social communication. Copyright © 2017 Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Ojerinde, Dibu; Popoola, Omokunmi; Onyeneho, Patrick; Egberongbe, Aminat
2016-01-01
Statistical procedure used in adjusting test score difficulties on test forms is known as "equating". Equating makes it possible for various test forms to be used interchangeably. In terms of where the equating method fits in the assessment cycle, there are pre-equating and post-equating methods. The major benefits of pre-equating, when…
ERIC Educational Resources Information Center
Airola, Denise Tobin
2011-01-01
Changes to state tests impact the ability of State Education Agencies (SEAs) to monitor change in performance over time. The purpose of this study was to evaluate the Standardized Performance Growth Index (PGIz), a proposed statistical model for measuring change in student and school performance, across transitions in tests. The PGIz is a…
ERIC Educational Resources Information Center
Meloy, Linda L.; Deville, Craig; Frisbie, David A.
2002-01-01
A study examined the effect of a read aloud testing accommodation on 260 middle school students with and without learning disabilities in reading. Students with learning disabilities in reading, as well as those without, exhibited statistically significant gains with the read aloud test administration. Interaction effects were not significant.…
Implications for social policy of variability in racial groups.
Helms, Janet E
2008-11-01
Social policy and federal and state legislation require the use of single cut scores when tests of cognitive ability, knowledge, or skills (CAKS) are used to make high-stakes assessment decisions, such as whether students or employees may be promoted. Rationales offered for the requirement are that cut scores provide objective standards and are fairer than using subjective criteria, such as racial group membership. It is argued that failure to consider threats to statistical conclusion validity, such as differences in variability between groups, obscures the differential impact of using a common cut score as the basis for highstakes decisions. Analyses of 40 Black and White samples revealed that (a) Whites might be considerably advantaged and Blacks might be considerably disadvantaged by the same cut score and (b) depending on where the cut score is set, decisions based on ratios of numbers of Whites numbers of Blacks might be fairer than use of CAKS test cut scores. Implications for assessment practice and social policy are discussed.
The relationship between twelve-month home stimulation and school achievement.
van Doorninck, W J; Caldwell, B M; Wright, C; Frankenburg, W K
1981-09-01
Home Observation for Measurement of the Environment (HOME) was designed to reflect parental support of early cognitive and socioemotional development. 12-month HOME scores were correlated with elementary school achievement, 5--9 years later. 50 low-income children were rank ordered by a weighted average of centile estimates of achievement test scores, letter grades, and curriculum levels in reading and math. 24 children were classified as having significant school achievement problems. The HOME total score correlated significantly, r = .37, with school centile scores among the low-income families. The statistically more appropriate contingency table analysis revealed a 68% correct classification rate and a significantly reduced error rate over random or blanket prediction. The results supported the predictive value of the 12-month HOME for school achievement among low-income families. In an additional sample of 21 middle-income families, there was insufficient variability among HOME scores to allow prediction. The HOME total scores were highly correlated, r = .86, among siblings tested at least 10 months apart.
Pobocik, Tamara
2015-01-01
This quantitative research study used a pretest/posttest design and reviewed how an educational electronic documentation system helped nursing students to identify the accurate "related to" statement of the nursing diagnosis for the patient in the case study. Students in the sample population were senior nursing students in a bachelor of science nursing program in the northeastern United States. Two distinct groups were used for a control and intervention group. The intervention group used the educational electronic documentation system for three class assignments. Both groups were given a pretest and posttest case study. The Accuracy Tool was used to score the students' responses to the related to statement of a nursing diagnosis given at the end of the case study. The scores of the Accuracy Tool were analyzed, and then the numeric scores were placed in SPSS, and the paired t test scores were analyzed for statistical significance. The intervention group's scores were statistically different from the pretest scores to posttest scores, while the control group's scores remained the same from pretest to posttest. The recommendation to nursing education is to use the educational electronic documentation system as a teaching pedagogy to help nursing students prepare for nursing practice. © 2014 NANDA International, Inc.
Mahantesha, Taranatha; Nara, Asha; Kumari, Parveen Reddy; Halemani, Praveen Kumar Nugadoni; Buddiga, Vinutna; Mythri, Sarpangala
2015-12-01
The aim of this study is to compare the oral hygiene status among institutionalized visually impaired children of age between 6 and 20 years given with Braille and audio instructions in Raichur city of Karnataka. A total of 50 children aged between 6 to 20 years were included in this study from a residential school for visually impaired children. These children were randomly divided into two equal groups. One group was given oral hygiene instructions by audio recordings and another written in Braille and were instructed to practice the same. After three months time the oral hygiene status and dental caries experience was recorded and compared using patient performance index. Statistical analysis was done by student paired t test and multiple comparison by Tukey's HSD (honest significant difference) test. The mean PHP (Patient Hygiene Performance) score of group A at baseline was 3.88 compared to 3.90 of group B. At 7 days PHP score of group A and group B was 3.42 and 3.45 respectively. At 3 month PHP score of group A and group B was 2.47 and 2.86 respectively. Even though over a period of time the mean score of PHP index reduced the score comparison between the 2 groups were statistically non significant. In group A the mean difference of PHP score between baseline and 7 days was 0.46, between baseline and 3 months it was 1.40. The PHP score between 7 days and 3 months was 0.94. All the above values were statistically significant. Effective dental health education method has to be instituted for visually impaired children. The present study shows improvement of oral health status in both the study population by decrease in the mean plaque score. Hence continuous motivation and reinforcement in the form of Braille and audio instruction is beneficial to achieve good oral hygiene levels in visually impaired children.
ERIC Educational Resources Information Center
Sklar, Jeffrey C.; Zwick, Rebecca
2009-01-01
Proper interpretation of standardized test scores is a crucial skill for K-12 teachers and school personnel; however, many do not have sufficient knowledge of measurement concepts to appropriately interpret and communicate test results. In a recent four-year project funded by the National Science Foundation, three web-based instructional…
Outlier Detection in High-Stakes Certification Testing. Research Report.
ERIC Educational Resources Information Center
Meijer, Rob R.
Recent developments of person-fit analysis in computerized adaptive testing (CAT) are discussed. Methods from statistical process control are presented that have been proposed to classify an item score pattern as fitting or misfitting the underlying item response theory (IRT) model in a CAT. Most person-fit research in CAT is restricted to…
Story Based Activities Enhance Literacy Skills in Preschool Children
ERIC Educational Resources Information Center
Yazici, Elçin; Bolay, Hayrunnisa
2017-01-01
We investigated the impact of story-based activities on literacy skills in pre-school children. The efficacy of story-based activities program were tested by literacy skills survey test. Results showed that, the scores of overall literacy skills and all subsets skills in the study group (n = 45) were statistically significantly higher than the…
The Influence of Ability Grouping on Math Achievement in a Rural Middle School
ERIC Educational Resources Information Center
Pritchard, Robert R.
2012-01-01
The researcher examined the academic performance of low-tracked students (n = 156) using standardized math test scores to determine whether there is a statistically significant difference in achievement depending on academic environment, tracked or nontracked. An analysis of variance (ANOVA) was calculated, using a paired samples t-test for a…
Biofeedback-assisted relaxation training to decrease test anxiety in nursing students.
Prato, Catherine A; Yucha, Carolyn B
2013-01-01
Nursing students experiencing debilitating test anxiety may be unable to demonstrate their knowledge and have potential for poor academic performance. A biofeedback-assisted relaxation training program was created to reduce test anxiety. Anxiety was measured using Spielberger's Test Anxiety Inventory and monitoring peripheral skin temperature, pulse, and respiration rates during the training. Participants were introduced to diaphragmatic breathing, progressive muscle relaxation, and autogenic training. Statistically significant changes occurred in respiratory rates and skin temperatures during the diaphragmatic breathing session; respiratory rates and peripheral skin temperatures during progressive muscle relaxation session; respiratory and pulse rates, and peripheral skin temperatures during the autogenic sessions. No statistically significant difference was noted between the first and second TAI. Subjective test anxiety scores of the students did not decrease by the end of training. Autogenic training session was most effective in showing a statistically significant change in decreased respiratory and pulse rates and increased peripheral skin temperature.
Ignjatović, Aleksandra; Stojanović, Miodrag; Milošević, Zoran; Anđelković Apostolović, Marija
2017-12-02
The interest in developing risk models in medicine not only is appealing, but also associated with many obstacles in different aspects of predictive model development. Initially, the association of biomarkers or the association of more markers with the specific outcome was proven by statistical significance, but novel and demanding questions required the development of new and more complex statistical techniques. Progress of statistical analysis in biomedical research can be observed the best through the history of the Framingham study and development of the Framingham score. Evaluation of predictive models comes from a combination of the facts which are results of several metrics. Using logistic regression and Cox proportional hazards regression analysis, the calibration test, and the ROC curve analysis should be mandatory and eliminatory, and the central place should be taken by some new statistical techniques. In order to obtain complete information related to the new marker in the model, recently, there is a recommendation to use the reclassification tables by calculating the net reclassification index and the integrated discrimination improvement. Decision curve analysis is a novel method for evaluating the clinical usefulness of a predictive model. It may be noted that customizing and fine-tuning of the Framingham risk score initiated the development of statistical analysis. Clinically applicable predictive model should be a trade-off between all abovementioned statistical metrics, a trade-off between calibration and discrimination, accuracy and decision-making, costs and benefits, and quality and quantity of patient's life.
Testing independence of bivariate interval-censored data using modified Kendall's tau statistic.
Kim, Yuneung; Lim, Johan; Park, DoHwan
2015-11-01
In this paper, we study a nonparametric procedure to test independence of bivariate interval censored data; for both current status data (case 1 interval-censored data) and case 2 interval-censored data. To do it, we propose a score-based modification of the Kendall's tau statistic for bivariate interval-censored data. Our modification defines the Kendall's tau statistic with expected numbers of concordant and disconcordant pairs of data. The performance of the modified approach is illustrated by simulation studies and application to the AIDS study. We compare our method to alternative approaches such as the two-stage estimation method by Sun et al. (Scandinavian Journal of Statistics, 2006) and the multiple imputation method by Betensky and Finkelstein (Statistics in Medicine, 1999b). © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Training Effectiveness Assessment. Volume II. Problems, Concepts, and Evaluation Alternatives.
1976-12-01
i nforma ti on abou t areas where course impr ov emer t might be indicated . Percentiles , pretest and posttest scores , or other measures of amount...statistical sophisti- cation. Interpretation of gain scores derived from pretests - posttests of trainees and other forms of trend analysis requires...CPM ), computer - managed testing (CMI). time-series analysi s, pretest / posttest design , and secondary anal ysis. Criterion -referenced measurement is
Traditional Nurse Triage vs. Physician Tele-Presence in a Pediatric Emergency Department
Marconi, Greg P.; Chang, Todd; Pham, Phung K.; Grajower, Daniel N.; Nager, Alan L.
2014-01-01
Objectives To compare traditional nurse triage (TNT) in a Pediatric Emergency Department (PED) to physician tele-presence (PTP). Methods Prospective, 2×2 crossover study with random assignment using a sample of walk-in patients seeking care in a PED at a large, tertiary care children’s hospital, from May 2012 to January 2013. Outcomes of triage times, documentation errors, triage scores, and survey responses were compared between TNT and PTP. Comparison between PTP to actual treating PED physicians regarding the accuracy of ordering blood and urine tests, throat cultures, and radiologic imaging was also studied. Results Paired samples t-tests showed a statistically significant difference in triage time between TNT and PTP (p=0.03), but no significant difference in documentation errors (p=0.10). Triage scores of TNT were 71% accurate, compared to PTP, which were 95% accurate. Both parents and children had favorable scores regarding PTP and the majority indicated they would prefer PTP again at their next PED visit. PTP diagnostic ordering was comparable to the actual PED physician ordering, showing no statistical differences. Conclusions Utilizing physician tele-presence technology to remotely perform triage is a feasible alternative to traditional nurse triage, with no clinically significant differences in time, triage scores, errors and patient and parent satisfaction. PMID:24445223
Kahraman, Serif Samil; Arli, Cengiz; Copoglu, Umit Sertan; Kokacya, Mehmet Hanifi; Colak, Sait
2017-05-01
Patients with BPPV experienced short but intense anxiety and/or panic disorder, especially at the initial visit, but most patients recovered without medication with successful treatment. Recent studies have shown that people with dizziness report some psychological problems such as panic and agoraphobia and anxiety. The aim of this study was to evaluate anxiety and panic agorophobia levels in patients with benign paroxysmal positional vertigo on initial presentation and at the follow-up visit and compare the scores with the control group. All the 32 patients in the study had a diagnosis of BPPV confirmed by their history, typical subjective symptom reports, and characteristic positional nystagmus during the Dix-Hallpike test and/or Roll test. The patients were instructed to complete the standard forms of Beck anxiety inventory and panic agoraphobia scale questionnaire before and at 7 and 14 days after the canalith repositioning treatment. The validity scores of panic agoraphobia were statistically significantly higher in patients with BPPV than in the control group in each period (p < .001) and the validity scores of the Beck anxiety inventory were statistically significantly higher in patients with BPPV than in the control group at the first and second evaluation (p < .001).
NASA Astrophysics Data System (ADS)
Smith, Kimberly A.
The research study investigates the effectiveness of an integrated high school science curriculum on student achievement, knowledge retention and science attitudes using quantitative and qualitative research. Data was collected from tenth grade students, in a small urban high school in Kansas City, Missouri, who were enrolled in a traditional Biology course or an integrated Environmental Science course. Quantitative data was collected in Phase 1 of the study. Data collected for academic achievement included pretest and posttest scores on the CTBS MATN exam. Data collected for knowledge retention included post-posttest scores on the CTBS MATN exam. Data collected for science attitudes were scores on a pretest and posttest using the TOSRA. SPSS was used to analyze the data using independent samples t-tests, one-way ANCOVA's and paired samples statistics. Qualitative data was collected in Phase 2 of the study. Data included responses to open-ended interview questions using three focus groups. Data was analyzed for common themes. Data analysis revealed the integrated Environmental Science course had a statistically significant impact on academic achievement, knowledge retention and positive science attitudes. Gender and socioeconomic status did not influence results. The study also determined that the CTBS MATN exam was not an accurate predictor of scores on state testing as was previously thought.
Significance of specificity of Tinetti B-POMA test and fall risk factor in third age of life.
Avdić, Dijana; Pecar, Dzemal
2006-02-01
As for the third age, psychophysical abilities of humans gradually decrease, while the ability of adaptation to endogenous and exogenous burdens is going down. In 1987, "Harada" et al. (1) have found out that 9.5 million persons in USA have difficulties running daily activities, while 59% of them (which is 5.6 million) are older than 65 years in age. The study has encompassed 77 questioned persons of both sexes with their average age 71.73 +/- 5.63 (scope of 65-90 years in age), chosen by random sampling. Each patient has been questioned in his/her own home and familiar to great extent with the methodology and aims of the questionnaire. Percentage of questioned women was 64.94% (50 patients) while the percentage for men was 35.06% (27 patients). As for the value of risk factor score achieved conducting the questionnaire and B-POMA test, there are statistically significant differences between men and women, as well as between patients who fell and those who never did. As for the way of life (alone or in the community), there are no significant statistical differences. Average results gained through B-POMA test in this study are statistically significantly higher in men and patients who did not provide data about falling, while there was no statistically significant difference in the way of life. In relation to the percentage of maximum number of positive answers to particular questions, regarding gender, way of life and the data about falling, there were no statistically significant differences between the value of B-POMA test and the risk factor score (the questionnaire).
ERIC Educational Resources Information Center
Wilkins, M. Elaine
2012-01-01
In 2001, No Child Left Behind introduced the highly qualified status for k-12 teachers, which mandated the successful scores on a series of high-stakes test; within this series is the Pre-Professional Skills Test (PPST) or PRAXIS I. The PPST measures basic k-12 skills for reading, writing, and mathematics. The mathematics sub-test is a national…
Gandhi, Sailaxmi; Thomas, Linsu; Desai, Geetha
2017-08-01
Post partum psychiatric illnesses are quiet common nowadays, which can interfere with postnatal care of both mother and infant. The present study was a one group pre-test - post-test design, adopted with an aim to enhance the knowledge on mother infant health among primary caregivers of mothers with postpartum psychiatric illnesses conducted in the mother-baby unit, NIMHANS, Bengaluru. Twenty five subjects who met the inclusion criteria were recruited through convenience sampling. After the pilot study, data was collected with a researcher developed tool. The Video Assisted Psycho-Education [VAPE] consisted of three sessions lasting for thirty minutes, taken over three consecutive days following the pre-test. Post-test was done immediately after the last session. Effectiveness of the intervention was established by McNemar test, Paired t-test and Wilcoxon Sign Ranks test. Analysis revealed statistically significant (p<0.001) increase in the post-test mean knowledge scores following the VAPE sessions. There was no statistically significant association between the pre-intervention knowledge score and the socio-demographic variables of the study subjects. The study findings revealed that the VAPE programme was effective in increasing the knowledge of the primary caregivers on mother infant health. Copyright © 2017 Elsevier B.V. All rights reserved.
Boccio, Cashen M; Beaver, Kevin M
2017-08-01
There is conflicting evidence regarding the association between adolescent marijuana use and adult intelligence, with some studies suggesting adolescent marijuana use can lead to declines in intelligence. The purpose of this study is to shed additional light on the potential link between marijuana use and changes in intelligence. We employed change scores and ordinary least squares (OLS) analysis to test for associations between marijuana use and changes in intelligence scores from adolescence (ages 12-21) to adulthood (ages 18-26) using data drawn from the National Longitudinal Study of Adolescent to Adult Health. The findings revealed that while a binary measure of marijuana use (ever/never) maintains a statistically significant association with changes in intelligence scores, the effect sizes are relatively small (β=0.043-0.051). Additionally, our findings did not reveal a significant association between cumulative marijuana use and changes in intelligence scores. Taken together, the results suggest that while the binary measure of marijuana use (ever/never) has a statistically significant association with changes in intelligence scores, the binary measure accounts for at most a 1-2 point change in intelligence scores. Copyright © 2017 Elsevier B.V. All rights reserved.
No better moment to score a goal than just before half time? A soccer myth statistically tested
Amez, Simon
2018-01-01
We test the soccer myth suggesting that a particularly good moment to score a goal is just before half time. To this end, rich data on 1,179 games played in the UEFA Champions League and UEFA Europa League are analysed. In contrast to the myth, we find that, conditional on the goal difference and other game characteristics at half time, the final goal difference at the advantage of the home team is 0.520 goals lower in case of a goal just before half time by this team. We show that this finding relates to this team’s lower probability of scoring a goal during the second half. PMID:29518165
Self-esteem and its associated factors among secondary school students in Klang District, Selangor.
Sherina, M S; Rampal, L; Loh, J W; Chan, C L; Teh, P C; Tan, P O
2008-03-01
Self-esteem is an important determinant of psychological well-being that is particularly problematic during adolescent life stage. There is a correlation between low self-esteem and other social problems among today's adolescents. This study was conducted to determine the mean self-esteem score, and to determine the association between self-esteem and age, sex, race, religion, number of siblings, ranking among siblings, family function, parental marital status and smoking among adolescents aged 12 to 20-years-old. A cross sectional study design using random cluster sampling method was done. Four out of a total of 35 secondary schools in Klang District, Selangor were selected. Respondents consisted of individual students in selected classes from the four selected schools. Data was collected using a self-administered, structured, pre-tested questionnaire and was analyzed using the SPSS version 12.0. Out of 1089 respondents, 793 completed the questionnaire (response rate 73.82%). The overall mean self-esteem score was 27.65. The mean self-esteem score for males (27.99) was slightly higher than females (27.31). The differences in the mean scores by race were statistically significant. There was a statistically significant relationship between mean self-esteem scores and sex, age, race, religion, number of siblings, smoking and family function. There was no statistically significant difference between mean self-esteem score with parental marital status and with ranking among siblings. The overall mean self-esteem score was 27.65. Self-esteem was associated with sex, age, race, religion, number of siblings, smoking and family function.
Assertiveness and problem solving in midwives
Yurtsal, Zeliha Burcu; Özdemir, Levent
2015-01-01
Background: Midwifery profession is required to bring solutions to problems and a midwife is expected to be an assertive person and to develop midwifery care. This study was planned to examine the relationship between assertiveness and problem-solving skills of midwives. Materials and Methods: This cross-sectional study was conducted with 201 midwives between July 2008 and February 2009 in the city center of Sivas. The Rathus Assertiveness Schedule (RAS) and Problem Solving Inventory (PSI) were used to determine the level of assertiveness and problem-solving skills of midwives. Statistical methods were used as mean, standard deviation, percentage, Student's T, ANOVA and Tukey HSD, Kruskal Wallis, Fisher Exact, Pearson Correlation and Chi-square tests and P < 0.05. Results: The RAS mean scores and the PSI mean scores showed statistically significant differences in terms of a midwife's considering herself as a member of the health team, expressing herself within the health care team, being able to say “no” when necessary, cooperating with her colleagues, taking part in problem-solving skills training. A statistically significant negative correlation was found between the RAS and PSI scores. The RAS scores decreased while the problem-solving scores increased (r: -0451, P < 0.01). Conclusions: There were significant statistical differences between assertiveness levels and problem solving skills of midwives, and midwives who were assertive solved their problems better than did others. Assertiveness and problem-solving skills training will contribute to the success of the midwifery profession. Midwives able to solve problems, and display assertive behaviors will contribute to the development of midwifery profession. PMID:26793247
Eating Attitudes and Related Factors in Turkish Nursing Students
Celik, Sevim; Ugur, Bayram Ali; Aykurt, Fethi Ahmet; Bektas, Muammer
2015-01-01
Background: Changing eating behaviors might trigger obesity, deficiency, anorexia nervosa, bulimia nervosa, and reactive eating disorders. Objectives: This study aimed to determine eating attitudes of nursing students in the western Black-Sea region of Turkey as well as to examine the effects of demographic features, self-esteem, body image, income level, and family structure on their eating attitudes. Materials and Methods: This cross-sectional study was conducted on 310 nursing students between January and February 2014. Data were collected using the personal information form, Eating Attitudes Test (EAT), Rosenberg Self-Esteem Scale (RSES), Beck Depression Scale (BDS), Body-Cathexis Scale (BCS), and Body Mass Index (BMI). Data were evaluated by descriptive statistics, independent samples t-test, one-way ANOVA, Kruskal-Wallis test, and Pearson correlation analysis. Results: About 30.0% of Turkish nursing students had negative eating attitudes. There was a significant positive correlation between the BDS and EAT scores (P < 0.001). There was a significant negative correlation between RSES scores and EAT scores of nursing students (P < 0.001). A statistically significant difference was found between the father’s occupation (P < 0.05) and mother’s working condition (P < 0.05), and the students’ eating attitudes. Conclusions: Psychological status, self-esteem, economic level, and place of residence of nursing students may be the potential factors for eating disorders. PMID:26339662
Klegeris, Andis; Bahniwal, Manpreet; Hurren, Heather
2013-01-01
Problem-based learning (PBL) was originally introduced in medical education programs as a form of small-group learning, but its use has now spread to large undergraduate classrooms in various other disciplines. Introduction of new teaching techniques, including PBL-based methods, needs to be justified by demonstrating the benefits of such techniques over classical teaching styles. Previously, we demonstrated that introduction of tutor-less PBL in a large third-year biochemistry undergraduate class increased student satisfaction and attendance. The current study assessed the generic problem-solving abilities of students from the same class at the beginning and end of the term, and compared student scores with similar data obtained in three classes not using PBL. Two generic problem-solving tests of equal difficulty were administered such that students took different tests at the beginning and the end of the term. Blinded marking showed a statistically significant 13% increase in the test scores of the biochemistry students exposed to PBL, while no trend toward significant change in scores was observed in any of the control groups not using PBL. Our study is among the first to demonstrate that use of tutor-less PBL in a large classroom leads to statistically significant improvement in generic problem-solving skills of students. PMID:23463230
Zhang, Li-juan; Zhu, Rong; Xu, Jing; Li, Shi-zhu
2015-06-01
To understand the knowledge level on schistosomiasis prevention and treatment among professionals of schistosomiasis endemic counties in Hunan and Hubei provinces, so as to provide the basis for the ability construction of schistosomiasis control institution. The theoretical test was applied to investigate the mastering situation on schistosomiasis prevention and control among professionals of 12 selected schistosomiasis endemic counties in Hunan and Hubei provinces, and the results were analyzed statistically. Ninety-six professionals were surveyed. The average score was 66.94 ± 11.53, in the range of 34-91, and the pass rate was 75.00%. The scoring rates of the knowledge points of the test and treatment of schistosomiasis, snail survey and killing as well as basic knowledge and laws and regulations about schistosome were 68.69%, 70.54% and 73.19%, respectively. On the knowledge points of the test and treatment of schistosomiasis and basic knowledge and laws and regulations about schistosome, the differences among different education backgrounds were significant (F = 3.337, 4.793, both P < 0.05), and the scores were higher in professionals with higher diploma. In the scores, there were no statistical differences between or among different genders, age groups, professional titles or specialties (all P > 0.05). The overall knowledge level on schistosomiasis prevention and treatment of the professionals from 12 schistosomiasis endemic counties in Hunan and Hubei provinces is low. Therefore, the learning of relative knowledge should be strengthened.
Pittman, Joyce; Beeson, Terrie; Terry, Colin; Dillon, Jill; Hampton, Charity; Kerley, Denise; Mosier, Judith; Gumiela, Ellen; Tucker, Jessica
2016-01-01
Despite prevention strategies, hospital-acquired pressure ulcers (HAPUs) continue to occur in the acute care setting. The purpose of this study was to develop an operational definition of and an instrument for identifying avoidable/unavoidable HAPUs in the acute care setting. The Indiana University Health Pressure Ulcer Prevention Inventory (PUPI) was developed and psychometric testing was performed. A retrospective pilot study of 31 adult hospitalized patients with an HAPU was conducted using the PUPI. Overall content validity index of 0.99 and individual item content validity index scores (0.9-1.0) demonstrated excellent content validity. Acceptable PUPI criterion validity was demonstrated with no statistically significant differences between wound specialists' and other panel experts' scoring. Construct validity findings were acceptable with no statistically significant differences among avoidable or unavoidable HAPU patients and their Braden Scale total scores. Interrater reliability was acceptable with perfect agreement on the total PUPI score between raters (κ = 1.0; P = .025). Raters were in total agreement 93% (242/260) of the time on all 12 individual PUPI items. No risk factors were found to be significantly associated with unavoidable HAPUs. An operational definition of and an instrument for identifying avoidable/unavoidable HAPUs in the acute care setting were developed and tested. The instrument provides an objective and structured method for identifying avoidable/unavoidable HAPUs. The PUPI provides an additional method that could be used in root-cause analyses and when reporting adverse pressure ulcer events.
Ferguson, John; Wheeler, William; Fu, YiPing; Prokunina-Olsson, Ludmila; Zhao, Hongyu; Sampson, Joshua
2013-01-01
With recent advances in sequencing, genotyping arrays, and imputation, GWAS now aim to identify associations with rare and uncommon genetic variants. Here, we describe and evaluate a class of statistics, generalized score statistics (GSS), that can test for an association between a group of genetic variants and a phenotype. GSS are a simple weighted sum of single-variant statistics and their cross-products. We show that the majority of statistics currently used to detect associations with rare variants are equivalent to choosing a specific set of weights within this framework. We then evaluate the power of various weighting schemes as a function of variant characteristics, such as MAF, the proportion associated with the phenotype, and the direction of effect. Ultimately, we find that two classical tests are robust and powerful, but details are provided as to when other GSS may perform favorably. The software package CRaVe is available at our website (http://dceg.cancer.gov/bb/tools/crave). PMID:23092956
Libon, David J.; Bondi, Mark W.; Price, Catherine C.; Lamar, Melissa; Eppig, Joel; Wambach, Denene M.; Nieves, Christine; Delano-Wood, Lisa; Giovannetti, Tania; Lippa, Carol; Kabasakalian, Anahid; Cosentino, Stephanie; Swenson, Rod; Penney, Dana L.
2012-01-01
Using cluster analysis Libon et al. (2010) found three verbal serial list-learning profiles involving delay memory test performance in patients with mild cognitive impairment (MCI). Amnesic MCI (aMCI) patients presented with low scores on delay free recall and recognition tests; mixed MCI (mxMCI) patients scored higher on recognition compared to delay free recall tests; and dysexecutive MCI (dMCI) patients generated relatively intact scores on both delay test conditions. The aim of the current research was to further characterize memory impairment in MCI by examining forgetting/savings, interference from a competing word list, intrusion errors/perseverations, intrusion word frequency, and recognition foils in these three statistically determined MCI groups compared to normal control (NC) participants. The aMCI patients exhibited little savings, generated more highly prototypic intrusion errors, and displayed indiscriminate responding to delayed recognition foils. The mxMCI patients exhibited higher saving scores, fewer and less prototypic intrusion errors, and selectively endorsed recognition foils from the interference list. dMCI patients also selectively endorsed recognition foils from the interference list but performed similarly compared to NC participants. These data suggest the existence of distinct memory impairments in MCI and caution against the routine use of a single memory test score to operationally define MCI. PMID:21880171
Value of EZSCAN parameters for diabetes screening in Chinese.
Lin, Yanhui; Chen, Zhiheng; Guo, Xu; Deng, Yulin
2017-05-23
To study the parameters of EZSCAN as a screening tool for diabetes in Chinese. A total of 6,270 subjects participated in the study. All subjects underwent tests of EZSCAN, fasting plasma glucose (FPG), oral glucose tolerance test and HbA 1c . 1. All subjects were divided into 4 groups: the normal group, sugar metabolic abnormalities as low-risk group, middle-risk group and high-risk group. The difference of diabetes incidence among the 4 groups was statistically significant. With the increase of EZSCAN score, the prevalence of diabetes increased significantly. But there is no statistically difference between the low-risk group and the middle-risk group. 2. After adjustment for other variables, there is significantly positive relationship among EZSCAN risk score and the risk of diabetes. Meanwhile there is no statistically difference between the low-risk group and the middle-risk group. 3. The cut-off point of EZSCAN for diabetes was 44.5% with the sensitivity was 73.2% which was higher than of FPG and HbA 1c . As EZSCAN-diabetes risk score increases, the risk of diabetes increases. EZSCAN can be used as a tool for screening for diabetes. At the best screening diabetes cut-off point value 44.5%, the sensitivity is higher than traditional method of FPG and HbA 1c . Copyright © 2017 Elsevier España, S.L.U. All rights reserved.
Test/score/report: Simulation techniques for automating the test process
NASA Technical Reports Server (NTRS)
Hageman, Barbara H.; Sigman, Clayton B.; Koslosky, John T.
1994-01-01
A Test/Score/Report capability is currently being developed for the Transportable Payload Operations Control Center (TPOCC) Advanced Spacecraft Simulator (TASS) system which will automate testing of the Goddard Space Flight Center (GSFC) Payload Operations Control Center (POCC) and Mission Operations Center (MOC) software in three areas: telemetry decommutation, spacecraft command processing, and spacecraft memory load and dump processing. Automated computer control of the acceptance test process is one of the primary goals of a test team. With the proper simulation tools and user interface, the task of acceptance testing, regression testing, and repeatability of specific test procedures of a ground data system can be a simpler task. Ideally, the goal for complete automation would be to plug the operational deliverable into the simulator, press the start button, execute the test procedure, accumulate and analyze the data, score the results, and report the results to the test team along with a go/no recommendation to the test team. In practice, this may not be possible because of inadequate test tools, pressures of schedules, limited resources, etc. Most tests are accomplished using a certain degree of automation and test procedures that are labor intensive. This paper discusses some simulation techniques that can improve the automation of the test process. The TASS system tests the POCC/MOC software and provides a score based on the test results. The TASS system displays statistics on the success of the POCC/MOC system processing in each of the three areas as well as event messages pertaining to the Test/Score/Report processing. The TASS system also provides formatted reports documenting each step performed during the tests and the results of each step. A prototype of the Test/Score/Report capability is available and currently being used to test some POCC/MOC software deliveries. When this capability is fully operational it should greatly reduce the time necessary to test a POCC/MOC software delivery, as well as improve the quality of the test process.
Abdelbary, B E; Garcia-Viveros, M; Ramirez-Oropesa, H; Rahbar, M H; Restrepo, B I
2017-10-01
The purpose of this study was to develop a method for identifying newly diagnosed tuberculosis (TB) patients at risk for TB adverse events in Tamaulipas, Mexico. Surveillance data between 2006 and 2013 (8431 subjects) was used to develop risk scores based on predictive modelling. The final models revealed that TB patients failing their treatment regimen were more likely to have at most a primary school education, multi-drug resistance (MDR)-TB, and few to moderate bacilli on acid-fast bacilli smear. TB patients who died were more likely to be older males with MDR-TB, HIV, malnutrition, and reporting excessive alcohol use. Modified risk scores were developed with strong predictability for treatment failure and death (c-statistic 0·65 and 0·70, respectively), and moderate predictability for drug resistance (c-statistic 0·57). Among TB patients with diabetes, risk scores showed moderate predictability for death (c-statistic 0·68). Our findings suggest that in the clinical setting, the use of our risk scores for TB treatment failure or death will help identify these individuals for tailored management to prevent these adverse events. In contrast, the available variables in the TB surveillance dataset are not robust predictors of drug resistance, indicating the need for prompt testing at time of diagnosis.
NASA Astrophysics Data System (ADS)
Mulkerrin, Elizabeth A.
The purpose of this study was to determine the effect of an 11th-grade and 12th-grade zoo-based academic high school experiential science program compared to a same school-district school-based academic high school experiential science program on students' pretest and posttest science, math, and reading achievement, and student perceptions of program relevance, rigor, and relationships. Science coursework delivery site served as the study's independent variable for the two naturally formed groups representing students (n = 18) who completed a zoo-based experiential academic high school science program and students (n = 18) who completed a school-based experiential academic high school science program. Students in the first group, a zoo-based experiential academic high school science program, completed real world, hands-on projects at the zoo while students in the second group, those students who completed a school-based experiential academic high school science program, completed real world, simulated projects in the classroom. These groups comprised the two research arms of the study. Both groups of students were selected from the same school district. The study's two dependent variables were achievement and school climate. Achievement was analyzed using norm-referenced 11th-grade pretest PLAN and 12th-grade posttest ACT test composite scores. Null hypotheses were rejected in the direction of improved test scores for both science program groups---students who completed the zoo-based experiential academic high school science program (p < .001) and students who completed the school-based experiential academic high school science program (p < .001). The posttest-posttest ACT test composite score comparison was not statistically different ( p = .93) indicating program equipoise for students enrolled in both science programs. No overall weighted grade point average score improvement was observed for students in either science group, however, null hypotheses were rejected in the direction of improved science grade point average scores for 11th-grade (p < .01) and 12th-grade (p = .01) students who completed the zoo-based experiential academic high school science program. Null hypotheses were not rejected for between group posttest science grade point average scores and school district criterion reference math and reading test scores. Finally, students who completed the zoo-based experiential academic high school science program had statistically improved pretest-posttest perceptions of program relationship scores (p < .05) and compared to students who completed the school-based experiential academic high school science program had statistically greater posttest perceptions of program relevance (p < .001), perceptions of program rigor (p < .001), and perceptions of program relationships (p < .001).
Oak, Sameer R; O'Rourke, Colin; Strnad, Greg; Andrish, Jack T; Parker, Richard D; Saluan, Paul; Jones, Morgan H; Stegmeier, Nicole A; Spindler, Kurt P
2015-09-01
The International Knee Documentation Committee (IKDC) Subjective Knee Evaluation Form is a patient-reported outcome with adult (1998) and pediatric (2011) versions validated at different ages. Prior longitudinal studies of patients aged 13 to 17 years who tore their anterior cruciate ligament (ACL) have used the only available adult IKDC, whereas currently the pediatric IKDC is the accepted form of choice. This study compared the adult and pediatric IKDC forms and tested whether the differences were clinically significant. The hypothesis was that the pediatric and adult IKDC questionnaires would show no clinically significant differences in score when completed by patients aged 13 to 17 years. Cohort study (diagnosis); Level of evidence, 2. A total of 100 participants aged 13 to 17 years with knee injuries were split into 2 groups by use of simple randomization. One group answered the adult IKDC form first and then the pediatric form. The second group answered the pediatric IKDC form first and then the adult form. A 10-minute break was given between form administrations to prevent rote repetition of answers. Study design was based on established methods to compare 2 forms of patient-reported outcomes. A 5-point threshold for clinical significance was set below previously published minimum clinically important differences for the adult IKDC. Paired t tests were used to test both differences and equivalence between scores. By ordinary least-squares models, scores were modeled to predict adult scores given certain pediatric scores and vice versa. Comparison between adult and pediatric IKDC scores showed a statistically significant difference of 1.5 points; however, the 95% CI (0.3-2.6) fell below the threshold of 5 points set for clinical significance. Further equivalence testing showed the 95% CI (0.5-2.4) between adult and pediatric scores being within the defined 5-point equivalence region. The scores were highly correlated, with a linear relationship (R(2) = 92%). There was no clinically significant difference between the pediatric and adult IKDC form scores in adolescents aged 13 to 17 years. This result allows use of whichever form is most practical for long-term tracking of patients. A simple linear equation can convert one form into the other. If the adult questionnaire is used at this age, it can be consistently used during follow-up. © 2015 The Author(s).
Atayero, Aderemi A; Popoola, Segun I; Egeonu, Jesse; Oludayo, Olumuyiwa
2018-08-01
Citation is one of the important metrics that are used in measuring the relevance and the impact of research publications. The potentials of citation analytics may be exploited to understand the gains of publishing scholarly peer-reviewed research outputs in either Open Access (OA) sources or Subscription-Based (SB) sources in the bid to increase citation impact. However, relevant data required for such comparative analysis must be freely accessible for evidence-based findings and conclusions. In this data article, citation scores ( CiteScores ) of 2542 OA sources and 15,040 SB sources indexed in Scopus from 2014 to 2016 were presented and analyzed based on a set of five inclusion criteria. A robust dataset, which contains the CiteScores of OA and SB publication sources included, is attached as supplementary material to this data article to facilitate further reuse. Descriptive statistics and frequency distributions of OA CiteScores and SB CiteScores are presented in tables. Boxplot representations and scatter plots are provided to show the statistical distributions of OA CiteScores and SB CiteScores across the three sub-categories (Book Series, Journal, and Trade Journal). Correlation coefficient and p-value matrices are made available within the data article. In addition, Probability Density Functions (PDFs) and Cumulative Distribution Functions (CDFs) of OA CiteScores and SB CiteScores are computed and the results are presented using tables and graphs. Furthermore, Analysis of Variance (ANOVA) and multiple comparison post-hoc tests are conducted to understand the statistical difference (and its significance, if any) in the citation impact of OA publication sources and SB publication source based on CiteScore . In the long run, the data provided in this article will help policy makers and researchers in Higher Education Institutions (HEIs) to identify the appropriate publication source type and category for dissemination of scholarly research findings with maximum citation impact.
Amesse, Lawrence S; Callendar, Ealena; Pfaff-Amesse, Teresa; Duke, Janice; Herbert, William N P
2008-09-24
To evaluate whether computer-based learning (CBL) improves newly acquired knowledge and is an effective strategy for teaching prenatal ultrasound diagnostic skills to third-year medical students when compared with instruction by traditional paper-based methods (PBM). We conducted a randomized, prospective study involving volunteer junior (3(rd) year) medical students consecutively rotating through the Obstetrics and Gynecology clerkship during six months of the 2005-2006 academic year. The students were randomly assigned to permuted blocks and divided into two groups. Half of the participants received instruction in prenatal ultrasound diagnostics using an interactive CBL program; the other half received instruction using equivalent material by the traditional PBM. Outcomes were evaluated by comparing changes in pre-tutorial and post instruction examination scores. All 36 potential participants (100%) completed the study curriculum. Students were divided equally between the CBL (n = 18) and PBM (n = 18) groups. Pre-tutorial exam scores (mean+/-s.d.) were 44%+/-11.1% for the CBL group and 44%+/-10.8% for the PBL cohort, indicating no statistically significant differences (p>0.05) between the two groups. After instruction, post-tutorial exam scores (mean+/-s.d.) were increased from the pre-tutorial scores, 74%+/-11% and 67%+/-12%, for students in the CBL and the PBM groups, respectively. The improvement in post-tutorial exam scores from the pre-test scores was considered significant (p<0.05). When post-test scores for the tutorial groups were compared, the CBL subjects achieved a score that was, on average, 7 percentage points higher than their PBM counterparts, a statistically significant difference (p < 0.05). Instruction by either CBL or PBM strategies is associated with improvements in newly acquired knowledge as reflected by increased post-tutorial examination scores. Students that received CBL had significantlyhigher post-tutorial exam scores than those in the PBM group, indicating that CBL is an effective instruction strategy in this setting.
Distribution of Model-based Multipoint Heterogeneity Lod Scores
Xing, Chao; Morris, Nathan; Xing, Guan
2011-01-01
The distribution of two-point heterogeneity lod scores (HLOD) has been intensively investigated because the conventional χ2 approximation to the likelihood ratio test is not directly applicable. However, there was no study investigating the distribution of the multipoint HLOD despite its wide application. Here we want to point out that, compared with the two-point HLOD, the multipoint HLOD essentially tests for homogeneity given linkage and follows a relatively simple limiting distribution 12χ02+12χ12, which can be obtained by established statistical theory. We further examine the theoretical result by simulation studies. PMID:21104892
Everard, Eoin; Lyons, Mark; Harrison, Andrew J
2018-06-01
To examine the association of injury with the Functional Movement Screen (FMS) and Landing Error Scoring System (LESS) in military recruits undergoing an intensive 16-week training block. Prospective cohort study. One hundred and thirty-two entry-level male soldiers (18-25years) were tested using the FMS and LESS. The participants underwent an intensive 16-week training program with injury data recorded daily. Chi-squared statistics were used to examine associations between injury risk and (1) poor LESS scores, (2) any score of 1 on the FMS and (3) composite FMS score of ≤14. A composite FMS score of ≤14 was not a significant predictor of injury. LESS scores of >5 and having a score of 1 on any FMS test were significantly associated with injury. LESS scores had greater relative risk, sensitivity and specificity (2.2 (95% CI=1.48-3.34); 71% and 87% respectively) than scores of 1 on the FMS (relative risk=1.32 (95% CI=1.0-1.7); sensitivity=50% and specificity=76%). There was no association between composite FMS score and injury but LESS scores and scores of 1 in the FMS test were significantly associated with injury in varying degrees. LESS scores had a much better association with injury than both any scores of 1 on the FMS and a combination of LESS scores and scores of 1 on the FMS. Furthermore, the LESS provides comparable information related to injury risk as other well-established markers associated with injury such as age, muscular strength and previous injury. Copyright © 2017. Published by Elsevier Ltd.
Saleh, George M; Lamparter, Julia; Sullivan, Paul M; O'Sullivan, Fiona; Hussain, Badrul; Athanasiadis, Ioannis; Litwin, Andre S; Gillan, Stewart N
2013-06-01
To investigate the effect of a structured, supervised, cataract simulation programme on ophthalmic surgeons in their first year of training, and to evaluate the level of skill transfer. Trainees with minimal intraocular and simulator experience in their first year of ophthalmology undertook a structured, sequential, customised, virtual reality (VR) cataract training programme developed through the International Forum of Ophthalmic Simulation. A set of one-handed, bimanual, static and dynamic tasks were evaluated before and after the course and scores obtained. Statistical significance was evaluated with the Wilcoxon sign-rank test. The median precourse score of 101.50/400 (IQR 58.75-145.75) was significantly improved after completing the training programme ((postcourse score: 302/400, range: 266.25-343), p<0.001). While improvement was evident and found to be statistically significant in all parameters, greatest improvements were found for capsulorhexis and antitremor training ((Capsulorhexis: precourse score=0/100, range 0-4.5; postcourse score=81/100, range 13-87.75; p=0.002), (antitremor training: precourse score=0/100, range 0-0; postcourse score=80/100, range 60.25-91.50; p=0.001)). Structured and supervised VR training can offer a significant level of skills transfer to novice ophthalmic surgeons. VR training at the earliest stage of ophthalmic surgical training may, therefore, be of benefit.
Haran, F Jay; Dretsch, Michael N; Slaboda, Jill C; Johnson, Dagny E; Adam, Octavian R; Tsao, Jack W
2016-01-01
To examine differences between the baseline-referenced and norm-referenced approaches for determining decrements in Automated Neuropsychological Assessment Metrics Version 4 TBI-MIL (ANAM) performance following mild traumatic brain injury (mTBI). ANAM data were reviewed for 616 US Service members, with 528 of this sample having experienced an mTBI and 88 were controls. Post-injury change scores were calculated for each sub-test: (1) normative change score = in-theater score - normative mean and (2) baseline change score = in-theater score - pre-deployment baseline. Reliable change cut-scores were applied to the change and the resulting frequency distributions were compared using McNemar tests. Receiver operator curves (ROC) using both samples (i.e. mTBI and control) were calculated for the change scores for each approach to determine the discriminate ability of the ANAM. There were no statistical differences, p < 0.05 (Bonferonni-Holm corrected), between the approaches. When the area under the curve for the ROCs were averaged across sub-tests, there were no significant differences between either the norm-referenced (0.65) or baseline-referenced (0.66) approaches, p > 0.05. Overall, the findings suggest there is no clear advantage of using the baseline-referenced approach over norm-referenced approach.
The Effect of Pretest Exercise on Baseline Computerized Neurocognitive Test Scores.
Pawlukiewicz, Alec; Yengo-Kahn, Aaron M; Solomon, Gary
2017-10-01
Baseline neurocognitive assessment plays a critical role in return-to-play decision making following sport-related concussions. Prior studies have assessed the effect of a variety of modifying factors on neurocognitive baseline test scores. However, relatively little investigation has been conducted regarding the effect of pretest exercise on baseline testing. The aim of our investigation was to determine the effect of pretest exercise on baseline Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) scores in adolescent and young adult athletes. We hypothesized that athletes undergoing self-reported strenuous exercise within 3 hours of baseline testing would perform more poorly on neurocognitive metrics and would report a greater number of symptoms than those who had not completed such exercise. Cross-sectional study; Level of evidence, 3. The ImPACT records of 18,245 adolescent and young adult athletes were retrospectively analyzed. After application of inclusion and exclusion criteria, participants were dichotomized into groups based on a positive (n = 664) or negative (n = 6609) self-reported history of strenuous exercise within 3 hours of the baseline test. Participants with a positive history of exercise were then randomly matched, based on age, sex, education level, concussion history, and hours of sleep prior to testing, on a 1:2 basis with individuals who had reported no pretest exercise. The baseline ImPACT composite scores of the 2 groups were then compared. Significant differences were observed for the ImPACT composite scores of verbal memory, visual memory, reaction time, and impulse control as well as for the total symptom score. No significant between-group difference was detected for the visual motor composite score. Furthermore, pretest exercise was associated with a significant increase in the overall frequency of invalid test results. Our results suggest a statistically significant difference in ImPACT composite scores between individuals who report strenuous exercise prior to baseline testing compared with those who do not. Since return-to-play decision making often involves documentation of return to neurocognitive baseline, the baseline test scores must be valid and accurate. As a result, we recommend standardization of baseline testing such that no strenuous exercise takes place 3 hours prior to test administration.
Karimi, Zahra; Dehkordi, Mahnaz Aliakbari; Alipour, Ahmad; Mohtashami, Tayebeh
2018-03-01
Premenstrual syndrome (PMS) consists of repetitious physical and psychological symptoms. The symptoms occur during the luteal phase of the menstrual period and cease when the menstrual period starts. This study included pre-test and post-test experiments between a control group and a test group. The statistical population involved 40 females, chosen based on multistage cluster sampling. The participants were then divided into four groups to undergo treatment with calcium supplement plus vitamin D together with cognitive behavioral therapy (CBT), and were screened with the Premenstrual Syndrome Screening Test (PSST). The pre-test and post-test scores in the PSST, the General Health Questionnaire (GHQ-28), and Bell's Adjustment Inventory (BAI) were used as assessment tools (p < .05). According to the parameters of PMS symptoms, when evaluating the pre-test and post-test scores, the overall score of each individual in the experimental group was improved and a significant effect for the combination of calcium supplement plus vitamin D together with CBT was observed in comparison to the post-test control group. A comparison of multivariate analysis of covariance (MANCOVA) results collected from the pre-test and post-test scores revealed that the method of treatment was beneficial for PMS, adjustment, and general health. © 2018 The Institute of Psychology, Chinese Academy of Sciences and John Wiley & Sons Australia, Ltd.
Maurício, Sílvia Fernandes; da Silva, Jacqueline Braga; Bering, Tatiana; Correia, Maria Isabel Toulson Davisson
2013-04-01
The association between nutritional status and inflammation was assessed in patients with colorectal cancer and to verify their association with complications during anticancer treatment. The agreement between the Subjective Global Assessment (SGA) and different nutritional assessment methods was also evaluated. A cross-sectional, prospective, and descriptive study was performed. The nutritional status was defined by the SGA and the severity of inflammation was defined by the Glasgow Prognostic Score (GPS). The complications were classified using the Common Toxicity Criteria, version 3. Anthropometric measurements such as body mass index, triceps skinfold, midarm circumference, midarm muscle area, and adductor pollicis muscle thickness were also performed, as were handgrip strength and phase angle. The chi-square test, Fisher exact test, Spearman correlation coefficient, independent t test, analysis of variance, Gabriel test, and κ index were used for the statistical analysis. P < 0.05 was considered statistically significant. Seventy patients with colorectal cancer (60.4 ± 14.3 y old) were included. The nutritional status according to the SGA was associated with the GPS (P < 0.05), but the SGA and GPS were not related to the presence of complications. When comparing the different nutritional assessment methods with the SGA, there were statistically significant differences. Malnutrition is highly prevalent in patients with colorectal cancer. The nutritional status was associated with the GPS. Copyright © 2013 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Soros, P.; Ponkham, K.; Ekkapim, S.
2018-01-01
This research aimed to: 1) compare the critical think and problem solving skills before and after learning using STEM Education plan, 2) compare student achievement before and after learning about force and laws of motion using STEM Education plan, and 3) the satisfaction of learning by using STEM Education. The sample used were 37 students from grade 10 at Borabu School, Borabu District, Mahasarakham Province, semester 2, Academic year 2016. Tools used in this study consist of: 1) STEM Education plan about the force and laws of motion for grade 10 students of 1 schemes with total of 14 hours, 2) The test of critical think and problem solving skills with multiple-choice type of 5 options and 2 option of 30 items, 3) achievement test on force and laws of motion with multiple-choice of 4 options of 30 items, 4) satisfaction learning with 5 Rating Scale of 20 items. The statistics used in data analysis were percentage, mean, standard deviation, and t-test (Dependent). The results showed that 1) The student with learning using STEM Education plan have score of critical think and problem solving skills on post-test higher than pre-test with statistically significant level .01. 2) The student with learning using STEM Education plan have achievement score on post-test higher than pre-test with statistically significant level of .01. 3) The student'level of satisfaction toward the learning by using STEM Education plan was at a high level (X ¯ = 4.51, S.D=0.56).
SADEGHI, ROYA; SEDAGHAT, MOHAMMAD MEHDI; SHA AHMADI, FARAMARZ
2014-01-01
Introduction: Blended learning, a new approach in educational planning, is defined as an applying more than one method, strategy, technique or media in education. Todays, due to the development of infrastructure of Internet networks and the access of most of the students, the Internet can be utilized along with traditional and conventional methods of training. The aim of this study was to compare the students’ learning and satisfaction in combination of lecture and e-learning with conventional lecture methods. Methods: This quasi-experimental study is conducted among the sophomore students of Public Health School, Tehran University of Medical Science in 2012-2013. Four classes of the school are randomly selected and are divided into two groups. Education in two classes (45 students) was in the form of lecture method and in the other two classes (48 students) was blended method with e-Learning and lecture methods. The students’ knowledge about tuberculosis in two groups was collected and measured by using pre and post-test. This step has been done by sending self-reported electronic questionnaires to the students' email addresses through Google Document software. At the end of educational programs, students' satisfaction and comments about two methods were also collected by questionnaires. Statistical tests such as descriptive methods, paired t-test, independent t-test and ANOVA were done through the SPSS 14 software, and p≤0.05 was considered as significant difference. Results: The mean scores of the lecture and blended groups were 13.18±1.37 and 13.35±1.36, respectively; the difference between the pre-test scores of the two groups was not statistically significant (p=0.535). Knowledge scores increased in both groups after training, and the mean and standard deviation of knowledge scores of the lectures and combined groups were 16.51±0.69 and 16.18±1.06, respectively. The difference between the post-test scores of the two groups was not statistically significant (p=0.112). Students’ satisfaction in blended learning method was higher than lecture method. Conclusion: The results revealed that the blended method is effective in increasing the students' learning rate. E-learning can be used to teach some courses and might be considered as economic aspects. Since in universities of medical sciences in the country, the majority of students have access to the Internet and email address, using e-learning could be used as a supplement to traditional teaching methods or sometimes as educational alternative method because this method of teaching increases the students’ knowledge, satisfaction and attention. PMID:25512938
Sadeghi, Roya; Sedaghat, Mohammad Mehdi; Sha Ahmadi, Faramarz
2014-10-01
Blended learning, a new approach in educational planning, is defined as an applying more than one method, strategy, technique or media in education. Todays, due to the development of infrastructure of Internet networks and the access of most of the students, the Internet can be utilized along with traditional and conventional methods of training. The aim of this study was to compare the students' learning and satisfaction in combination of lecture and e-learning with conventional lecture methods. This quasi-experimental study is conducted among the sophomore students of Public Health School, Tehran University of Medical Science in 2012-2013. Four classes of the school are randomly selected and are divided into two groups. Education in two classes (45 students) was in the form of lecture method and in the other two classes (48 students) was blended method with e-Learning and lecture methods. The students' knowledge about tuberculosis in two groups was collected and measured by using pre and post-test. This step has been done by sending self-reported electronic questionnaires to the students' email addresses through Google Document software. At the end of educational programs, students' satisfaction and comments about two methods were also collected by questionnaires. Statistical tests such as descriptive methods, paired t-test, independent t-test and ANOVA were done through the SPSS 14 software, and p≤0.05 was considered as significant difference. The mean scores of the lecture and blended groups were 13.18±1.37 and 13.35±1.36, respectively; the difference between the pre-test scores of the two groups was not statistically significant (p=0.535). Knowledge scores increased in both groups after training, and the mean and standard deviation of knowledge scores of the lectures and combined groups were 16.51±0.69 and 16.18±1.06, respectively. The difference between the post-test scores of the two groups was not statistically significant (p=0.112). Students' satisfaction in blended learning method was higher than lecture method. The results revealed that the blended method is effective in increasing the students' learning rate. E-learning can be used to teach some courses and might be considered as economic aspects. Since in universities of medical sciences in the country, the majority of students have access to the Internet and email address, using e-learning could be used as a supplement to traditional teaching methods or sometimes as educational alternative method because this method of teaching increases the students' knowledge, satisfaction and attention.
Multisample adjusted U-statistics that account for confounding covariates.
Satten, Glen A; Kong, Maiying; Datta, Somnath
2018-06-19
Multisample U-statistics encompass a wide class of test statistics that allow the comparison of 2 or more distributions. U-statistics are especially powerful because they can be applied to both numeric and nonnumeric data, eg, ordinal and categorical data where a pairwise similarity or distance-like measure between categories is available. However, when comparing the distribution of a variable across 2 or more groups, observed differences may be due to confounding covariates. For example, in a case-control study, the distribution of exposure in cases may differ from that in controls entirely because of variables that are related to both exposure and case status and are distributed differently among case and control participants. We propose to use individually reweighted data (ie, using the stratification score for retrospective data or the propensity score for prospective data) to construct adjusted U-statistics that can test the equality of distributions across 2 (or more) groups in the presence of confounding covariates. Asymptotic normality of our adjusted U-statistics is established and a closed form expression of their asymptotic variance is presented. The utility of our approach is demonstrated through simulation studies, as well as in an analysis of data from a case-control study conducted among African-Americans, comparing whether the similarity in haplotypes (ie, sets of adjacent genetic loci inherited from the same parent) occurring in a case and a control participant differs from the similarity in haplotypes occurring in 2 control participants. Copyright © 2018 John Wiley & Sons, Ltd.
Scores on the 16 Personality Factor Test and Success in College Calculus 1.
ERIC Educational Resources Information Center
Shaughnessy, Michael F.; And Others
This study explored personality variables measured by the 16 Personality Factor (16PF) test and their relevance to success, as defined by the final course grade, in college calculus courses with 94 students. Two personality variables were significant predictors of success as determined by the final course grade. A Statistical Analysis System…
Appropriate statistical analyses are critical for evaluating interactions of mixtures with a common mode of action, as is often the case for cumulative risk assessments. Our objective is to develop analyses for use when a response variable is ordinal, and to test for interaction...
Kahn, David; Mittelstaedt, Daniel; Matyas, John; Qu, Xiangui; Lee, Ji Hyun; Badar, Farid; Les, Clifford; Zhuang, Zhiguo; Xia, Yang
2016-01-01
Background: The predictable outcome of the anterior cruciate ligament transection (ACLT) canine model, and the similarity to naturally occurring osteoarthritis (OA) in humans, provide a translatable method for studying OA. Still, evidence of direct meniscus-induced cartilaginous damage has not been identified, and gross-anatomical blinded scoring of early-stage OA has not been performed. Objective: A gross anatomical observation and statistical analysis of OA progression to determine meniscus induced cartilaginous damage, to measure the macroscopic progression of OA, and to address matters involving arthroscopic and surgical procedures of the knee. Method: Unblinded assessment and blinded scoring of meniscal, tibial, femoral, and patellar damage were performed for control and at four time points following unilateral ACLT: 3-week (N=4), 8-week (N=4), 12-week (N=5), and 25-week (N=4). Mixed-model statistics illustrates damage (score) progression; Wilcoxon rank-sum tests compared time-point scores; and Wilcoxon signed-rank tests compared ACLT and contralateral scores, and meniscus and tibia scores. Result: Damage was manifest first on the posterior aspect of the medial meniscus and subsequently on the tibia and femur, implying meniscal damage can precede, coincide with, and aggravate cartilage damage. Damage extent varied chronologically and was dependent upon the joint component. Meniscal damage was evident at 3 weeks and progressed through 25-weeks. Meniscal loose bodies corresponded to tibial cartilage damage location and extent through 12 weeks, followed by cartilage repair activity after complete meniscal degeneration. Conclusion: This study provides additional information for understanding OA progression, identifying OA biomarkers, and arthroscopic and meniscectomy procedures. PMID:28144379
Hermida-Ameijeiras, Á; López-Paz, J E; Riveiro-Cruz, M A; Calvo-Gómez, C
2016-01-01
Carotid intima-media thickness (cIMT) has been suggested as a further tool for risk function charts. The aim of this study was to describethe relationship between cIMT and cardiovascular risk (CVR) estimation according to Framingham-REGICOR and SCORE equations. Observational, cross-sectional cohort study from 362 hypertensive subjects. Demographic and clinical information were collected as well as laboratory, ultrasonographic and CVR estimation by the Framingham-REGICOR and SCORE functions. Statistical analysis was performed using SPSS software (version 20,0). To analyze the data, statistical tests such as Chi-square, T-test, ANOVA, and Pearson correlation coefficient were used. According to both functions, differences on mean cIMT were found between low CVR group and intermediate to high groups. No differences were found between intermediate and high risk groups (cIMT: 0,73mm low risk patients vs. 0,89 or 0,88mm respectively according to SCORE function and cIMT: 0,73 vs. 0,85 or 0,87mm respectively according to Framingham-REGICOR function). cIMT correlated positively with CVR estimation according to both SCORE (r=0,421; P<.01), and Framingham-REGICOR functions (r=0,363; P<.01). cIMT correlates positively with CVR estimated by SCORE and Framingham-REGICOR functions. cIMT in those subjects at intermediate risk is similar to those at high risk. Our findings highlight the importance of carotid ultrasound in identifying silent target-organ damage in those patients at intermediate CVR. Copyright © 2015 SEHLELHA. Published by Elsevier España, S.L.U. All rights reserved.
Gui, Jiang; Moore, Jason H.; Williams, Scott M.; Andrews, Peter; Hillege, Hans L.; van der Harst, Pim; Navis, Gerjan; Van Gilst, Wiek H.; Asselbergs, Folkert W.; Gilbert-Diamond, Diane
2013-01-01
We present an extension of the two-class multifactor dimensionality reduction (MDR) algorithm that enables detection and characterization of epistatic SNP-SNP interactions in the context of a quantitative trait. The proposed Quantitative MDR (QMDR) method handles continuous data by modifying MDR’s constructive induction algorithm to use a T-test. QMDR replaces the balanced accuracy metric with a T-test statistic as the score to determine the best interaction model. We used a simulation to identify the empirical distribution of QMDR’s testing score. We then applied QMDR to genetic data from the ongoing prospective Prevention of Renal and Vascular End-Stage Disease (PREVEND) study. PMID:23805232
Arkaravichien, Wiwat; Wongpratat, Apichaya; Lertsinudom, Sunee
2016-08-01
Background Quality indicators determine the quality of actual practice in reference to standard criteria. The Community Pharmacy Association (Thailand), with technical support from the International Pharmaceutical Federation, developed a tool for quality assessment and quality improvement at community pharmacies. This tool has passed validity and reliability tests, but has not yet had feasibility testing. Objective (1) To test whether this quality tool could be used in routine settings. (2) To compare quality scores between accredited independent and accredited chain pharmacies. Setting Accredited independent pharmacies and accredited chain pharmacies in the north eastern region of Thailand. Methods A cross sectional study was conducted in 34 accredited independent pharmacies and accredited chain pharmacies. Quality scores were assessed by observation and by interviewing the responsible pharmacists. Data were collected and analyzed by independent t-test and Mann-Whitney U test as appropriate. Results were plotted by histogram and spider chart. Main outcome measure Domain's assessable scores, possible maximum scores, mean and median of measured scores. Results Domain's assessable scores were close to domain's possible maximum scores. This meant that most indicators could be assessed in most pharmacies. The spider chart revealed that measured scores in the personnel, drug inventory and stocking, and patient satisfaction and health promotion domains of chain pharmacies were significantly higher than those of independent pharmacies (p < 0.05). There was no statistical difference between independent pharmacies and chain pharmacies in the premise and facility or dispensing and patient care domains. Conclusion Quality indicators developed by the Community Pharmacy Association (Thailand) could be used to assess quality of practice in pharmacies in routine settings. It is revealed that the quality scores of chain pharmacies were higher than those of independent pharmacies.
Park, Cheol Eon; Shin, Seung Youp; Lee, Kun Hee; Cho, Joong Saeng; Kim, Sung Wan
2012-09-01
Both allergic rhinitis (AR) and obstructive sleep apnea (OSA) are known to increase stress and fatigue, but the result of their coexistence has not been studied. The objective of this study was to evaluate the amount of stress and fatigue when AR is combined with OSA. One hundred and twelve patients diagnosed with OSA by polysomnography were enrolled. Among them, 37 patients were diagnosed with AR by a skin prick test and symptoms (OSA-AR group) and 75 patients were classified into the OSA group since they tested negative for allergies. We evaluated the Epworth sleepiness scale (ESS), stress score, fatigue score, ability to cope with stress, and rhinosinusitis quality of life questionnaire (RQLQ) with questionnaires and statistically compared the scores of both groups. There were no significant differences in BMI and sleep parameters such as LSAT, AHI, and RERA between the two groups. However, the OSA-AR group showed a significantly higher ESS score compared to the OSA group (13.7 ± 4.7 vs. 9.3 ± 4.8). Fatigue scores were also significantly higher in the OSA-AR group than in the OSA group (39.8 ± 11.0 vs. 30.6 ± 5.4). The OSA-AR group had a significantly higher stress score (60.4 ± 18.6 vs. 51.2 ± 10.4). The ability to cope with stress was higher in the OSA group, although this difference was not statistically significant. RQLQ scores were higher in the OSA-AR group (60.2 ± 16.7 compared to 25.1 ± 13.9). In conclusion, management of allergic rhinitis is very important in treating OSA patients in order to eliminate stress and fatigue and to minimize daytime sleepiness and quality of life.
Yu, Yu; Manku, Mandeep; Backman, Catherine L
2018-04-01
There is an assumption that occupational balance is integrally related to health and well-being. This study aimed to investigate test-retest reliability of the English-translated Occupational Balance Questionnaire (OBQ), its relationship to measures of health (Short Form Health Survey-36 Version 2.0 [SF-36v2]) and stress (Perceived Stress Scale-10; PSS-10), and demographic differences in OBQ scores in Canadian adults. Test-retest reliability (2 weeks) was assessed using intraclass correlation (ICC) coefficients. Online surveys from 86 adults were analyzed using descriptive, correlational, and t test statistics. OBQ test-retest reliability was ICC = 0.74 (95% CI [0.34, 0.90]; p = .003) when excluding an influential case ( n = 20). OBQ correlations with PSS-10 were r = -.72; with SF-36v2 Mental Component Score, r = .65; and with Physical Component Score, r = .31; all p < .001. Age and gender had no impact on OBQ scores. Findings help elucidate relationships among health, stress, and occupational balance; however, further psychometric testing is warranted before using OBQ for clinical purposes.
Chao, Serena H; Brett, Belle; Wiecha, John M; Norton, Lisa E; Levine, Sharon A
2012-07-01
Web-based learning methods are being used increasingly to teach core curriculum in medical school clerkships, but few studies have compared the effectiveness of online methods with that of live lectures in teaching the same topics to students. Boston University School of Medicine has implemented an online, case-based, interactive curriculum using videos and text to teach delirium to fourth-year medical students during their required 1-month Geriatrics and Home Medical Care clerkship. A control group of 56 students who received a 1-hour live delirium lecture only was compared with 111 intervention group students who completed the online delirium curriculum only. Evaluation consisted of a short-answer test with two cases given as a pre- and posttest to both groups. The total possible maximum test score was 34 points, and the lowest possible score was -8 points. Mean pre- and posttest scores were 10.5 ± 4.0 and 12.7 ± 4.4, respectively, in the intervention group and 9.9 ± 3.5 and 11.2 ± 4.5, respectively, in the control group. The intervention group had statistically significant improvement between the pre- and posttest scores (2.21-point difference; P < .001), as did the control group (1.36-point difference; P = .03); the difference in test score improvement between the two groups was not statistically significant. An interactive case-based online curriculum in delirium is as effective as a live lecture in teaching delirium, although neither of these educational methods alone produces robust increases in knowledge. © 2012, Copyright the Authors Journal compilation © 2012, The American Geriatrics Society.
Fassbinder, Tânia Regina Cavinatto; Winkelmann, Eliane Roseli; Schneider, Juliana; Wendland, Juliana; Oliveira, Olvânia Basso de
2015-01-01
Chronic kidney disease (CKD) infers directly in functional capacity, independence and therefore quality of life (QOL). To compare the physical fitness and quality of life of patients with chronic kidney disease submitted on hemodialysis (G1) and predialysis treatment (G2). A cross-sectional study, 54 patients with CKD, 27 of the G1 group (58.15 ± 10.84 years), 27 of G2 group (62.04 ± 16.56 years). There were cardiovascular risk factors, anthropometric measurements, respiratory muscle strength was measured by the inspiratory pressure (MIP) and expiratory (MEP) maximum measured in the manometer, six-minute walk (TC6'), cardiopulmonary exercise test, sit and stand one minute test (TSL1') and the Short-Form Questionary (SF-36) to assess QOL. The patients presented disease of stage between 2 and 5. It was applied the Kolmogorov-Smirnov normality test and used the t (Student) test or the U (Mann Whitney) test to compare the means of quantitative variables and the chi-square Pearson test and Fisher's exact test for qualitative variables. Pearson's or Spearman's test was used to identify correlations. No statistically significant difference was found between G1 and G2 in VO2peak (p = 0,259) in TC6' (p = 0,433) in the MIPmáx (p = 0,158) and found only in the MEPmáx (p = 0,024) to G1. The scores of the SF-36 in both groups showed a worse health status as evidenced by the low score in scores for QOL. Patients with CKD had reduced functional capacity and QOL, and hemodialysis, statistically, didn't have showed negative repercussions when compared with pre-dialysis patients.
Newton-Clarke, M J; Divers, T J; Delahunta, A; Mohammed, H O
1994-09-01
A study was conducted over a 12 month period to assess the specificity and sensitivity of the 'slap test', using endoscopic evaluation, in the detection of cervical spinal cord and caudal brainstem lesions in horses. Fifteen ataxic horses were subjected to the 'slap test' and subsequently examined post mortem. Twelve out of the 15 had histopathological lesions consistent with their clinical signs. Thirteen horses with no history of neurological dysfunction and no histopathological evidence of cervical spinal cord or brainstem disease were used as controls. The laryngeal adductory responses exhibited by all horses were filmed and later scored independently by 3 assessors. The proportion of animals diagnosed with cervical spinal cord and/or brainstem disease, defined by histopathological criteria, was found to be statistically similar to the proportion with abnormal 'slap test' responses, using the McNemar chi-Square test. Despite statistical significance between proportions, sensitivity of the 'slap test' was low, 50% for the left side on both days and 58% for the right side. Specificity was higher, 69% (Day 1) and 75% (Day 2) for the left side and 75% (Day 1) and 69% (Day 2) for the right side. In contrast to this, conventional neurological examination was found to be 100% sensitive and 81% specific in the detection of lesions of histopathological significance in the cervical spinal cord/caudal brainstem. Agreement between scores for the 'slap test' from the same assessor on different days was good, with values for kappa of 0.59 to 0.85. In contrast, agreement between assessors on the 'slap test' score was poor, with kappa 0.35.(ABSTRACT TRUNCATED AT 250 WORDS)
Academic Outcome Measures of a Dedicated Education Unit Over Time: Help or Hinder?
Smyer, Tish; Gatlin, Tricia; Tan, Rhigel; Tejada, Marianne; Feng, Du
2015-01-01
Critical thinking, nursing process, quality and safety measures, and standardized RN exit examination scores were compared between students (n = 144) placed in a dedicated education unit (DEU) and those in a traditional clinical model. Standardized test scores showed that differences between the clinical groups were not statistically significant. This study shows that the DEU model is 1 approach to clinical education that can enhance students' academic outcomes.
Kenya, Amilliah W.; Hart, John F.; Vuyiya, Charles K.
2016-01-01
Objective: This study compared National Board of Chiropractic Examiners part I test scores between students who did and did not serve as tutors on the subject matter. Methods: Students who had a prior grade point average of 3.45 or above on a 4.0 scale just before taking part I of the board exams were eligible to participate. A 2-sample t-test was used to ascertain the difference in the mean scores on part I between the tutor group (n = 28) and nontutor (n = 29) group. Results: Scores were higher in all subjects for the tutor group compared to the nontutor group and the differences were statistically significant (p < .01) with large effect sizes. Conclusion: The tutors in this study performed better on part I of the board examination compared to nontutors, suggesting that tutoring results in an academic benefit for tutors themselves. PMID:26998665
Nagar, Namit; Vaz, Anna C
2013-01-01
To compare the shear bond strength of a nano-ceramic restorative composite Ceram-X Mono(TM♦), a restorative resin with the traditional orthodontic composite Transbond XT(TM†) and to evaluate the site of bond failure using Adhesive Remnant Index. Sixty extracted human premolars were divided into two groups of 30 each. Stainless steel brackets were bonded using Transbond XT(TM†) (Group I) and Ceram-X Mono(TM♦) (Group II) according to manufacturer's protocol. Shear bond strength was measured on Universal testing machine at crosshead speed of 1 mm/minute. Adhesive Remnant Index scores were assigned to debonded brackets of each group. Data was analyzed using unpaired 't' test and Chi square test. The mean shear bond strength of Group I (Transbond XT(TM†)) was 12.89 MPa ± 2.19 and that of Group II (Ceram-X Mono(TM)) was 7.29 MPa ± 1.76. Unpaired 't' test revealed statistically significant differences amongst the shear bond strength of the samples measured. Chi-square test revealed statistically insignificant differences amongst the ARI scores of the samples measured. Ceram-X Mono(TM♦) had a lesser mean shear bond strength when compared to Transbond XT(TM†) which was statistically significant difference. However, the mean shear bond of Ceram X Mono was within the clinically acceptable range for bonding. Ceram-X Mono(TM†) and Transbond XT(TM†) showed cohesive fracture of adhesive in 72.6% and 66.6% of the specimens, respectively.
Pre-Deployment Stress, Mental Health, and Help-Seeking Behaviors Among Marines
2014-01-01
associations between two categori- cal variables, and Wald tests were conducted to compare mean scores on continuous variables across groups (e.g...Cluster- adjusted wald tests were conducted to determine whether there were significant differences by rank on the average number of potentially...deployed to Iraq or Afghanistan in 2010 or 2011 of rank O6 or lower. a Omnibus rao-Scott chi-square test or adjusted wald test is statistically
Effectiveness of the training material in drug-dose calculation skills.
Basak, Tulay; Aslan, Ozlem; Unver, Vesile; Yildiz, Dilek
2016-07-01
The aim of study was to evaluate the effectiveness of the training material based on low-level environmental fidelity simulation in drug-dose calculation skills in senior nursing students. A quasi-experimental design with one group. The sample included senior nursing students attending a nursing school in Turkey in the period December 2012-January 2013. Eighty-two senior nursing students were included in the sample. Data were obtained using a data collection form which was developed by the researchers. A paired-sample t-test was used to compare the pretest and post-test scores. The difference between the mean pretest score and the mean post-test score was statistically significant (P < 0.05). This study revealed that the training material based on low-level environmental fidelity simulation positively impacted accurate drug-dose calculation skills in senior nursing students. © 2016 Japan Academy of Nursing Science.
Analyzing force concept inventory with item response theory
NASA Astrophysics Data System (ADS)
Wang, Jing; Bao, Lei
2010-10-01
Item response theory is a popular assessment method used in education. It rests on the assumption of a probability framework that relates students' innate ability and their performance on test questions. Item response theory transforms students' raw test scores into a scaled proficiency score, which can be used to compare results obtained with different test questions. The scaled score also addresses the issues of ceiling effects and guessing, which commonly exist in quantitative assessment. We used item response theory to analyze the force concept inventory (FCI). Our results show that item response theory can be useful for analyzing physics concept surveys such as the FCI and produces results about the individual questions and student performance that are beyond the capability of classical statistics. The theory yields detailed measurement parameters regarding the difficulty, discrimination features, and probability of correct guess for each of the FCI questions.
Anger and depression levels of mothers with premature infants in the neonatal intensive care unit.
Kardaşözdemir, Funda; AKGüN Şahin, Zümrüt
2016-02-04
The aim of this study was to examine anger and depression levels of mothers who had a premature infant in the NICU, and all factors affecting the situation. This descriptive study was performed in the level I and II units of NICU at three state hospitals in Turkey. The data was collected with a demographic questionnaire, "Beck Depression Inventory" and "Anger Expression Scale". Descriptive statistics, parametric and nonparametric statistical tests and Pearson correlation were used in the data analysis. Mothers whose infants are under care in NICU have moderate depression. It has also been determined that mothers' educational level, income level and gender of infants were statistically significant (p <0.05). A positive relationship between depression and trait anger scores was found to be statistically significant. A negative relationship existed between depression and anger-control scores for the mothers, which was statistically significant (p <0.05). Due to the results of research, recommended that mothers who are at risk of depression and anger in the NICU evaluated by nurses and these nurses to develop their consulting roles.
The effect of teaching medical ethics on medical students' moral reasoning.
Self, D J; Wolinsky, F D; Baldwin, D C
1989-12-01
A study assessed the effect of incorporating medical ethics into the medical curriculum and the relative effects of two methods of implementing that curriculum, namely, lecture and case-study discussions. Results indicate a statistically significant increase (p less than or equal to .0001) in the level of moral reasoning of students exposed to the medical ethics course, regardless of format. Moreover, the unadjusted posttest scores indicated that the case-study method was significantly (p less than or equal to .03) more effective than the lecture method in increasing students' level of moral reasoning. When adjustment were made for the pretest scores, however, this difference was not statistically significant (p less than or equal to .18). Regression analysis by linear panel techniques revealed that age, gender, undergraduate grade-point average, and scores on the Medical College Admission Test were not related to the changes in moral-reasoning scores. All of the variance that could be explained was due to the students' being in one of the two experimental groups. In comparison with the control group, the change associated with each experimental format was statistically significant (lecture, p less than or equal to .004; case study, p less than or equal to .0001). Various explanations for these findings and their implications are given.
Pei, Yanbo; Tian, Guo-Liang; Tang, Man-Lai
2014-11-10
Stratified data analysis is an important research topic in many biomedical studies and clinical trials. In this article, we develop five test statistics for testing the homogeneity of proportion ratios for stratified correlated bilateral binary data based on an equal correlation model assumption. Bootstrap procedures based on these test statistics are also considered. To evaluate the performance of these statistics and procedures, we conduct Monte Carlo simulations to study their empirical sizes and powers under various scenarios. Our results suggest that the procedure based on score statistic performs well generally and is highly recommended. When the sample size is large, procedures based on the commonly used weighted least square estimate and logarithmic transformation with Mantel-Haenszel estimate are recommended as they do not involve any computation of maximum likelihood estimates requiring iterative algorithms. We also derive approximate sample size formulas based on the recommended test procedures. Finally, we apply the proposed methods to analyze a multi-center randomized clinical trial for scleroderma patients. Copyright © 2014 John Wiley & Sons, Ltd.
Turkish Version of Kolcaba's Immobilization Comfort Questionnaire: A Validity and Reliability Study.
Tosun, Betül; Aslan, Özlem; Tunay, Servet; Akyüz, Aygül; Özkan, Hüseyin; Bek, Doğan; Açıksöz, Semra
2015-12-01
The purpose of this study was to determine the validity and reliability of the Turkish version of the Immobilization Comfort Questionnaire (ICQ). The sample used in this methodological study consisted of 121 patients undergoing lower extremity arthroscopy in a training and research hospital. The validity study of the questionnaire assessed language validity, structural validity and criterion validity. Structural validity was evaluated via exploratory factor analysis. Criterion validity was evaluated by assessing the correlation between the visual analog scale (VAS) scores (i.e., the comfort and pain VAS scores) and the ICQ scores using Spearman's correlation test. The Kaiser-Meyer-Olkin coefficient and Bartlett's test of sphericity were used to determine the suitability of the data for factor analysis. Internal consistency was evaluated to determine reliability. The data were analyzed with SPSS version 15.00 for Windows. Descriptive statistics were presented as frequencies, percentages, means and standard deviations. A p value ≤ .05 was considered statistically significant. A moderate positive correlation was found between the ICQ scores and the VAS comfort scores; a moderate negative correlation was found between the ICQ and the VAS pain measures in the criterion validity analysis. Cronbach α values of .75 and .82 were found for the first and second measurements, respectively. The findings of this study reveal that the ICQ is a valid and reliable tool for assessing the comfort of patients in Turkey who are immobilized because of lower extremity orthopedic problems. Copyright © 2015. Published by Elsevier B.V.
NASA Astrophysics Data System (ADS)
Reed, Krystal Astra
The "Advancement via Individual Determination (AVID) program was designed to provide resources and strategies that enable underrepresented minority students to attend 4-year colleges" (AVID Center, 2013, p. 2). These students are characterized as the forgotten middle in that they have high test scores, average-to-low grades, minority or low socioeconomic status, and will be first-generation college students (AVID, 2011). Research indicates (Huerta, Watt, & Butcher, 2013) that strict adherence to 11 program components supports success of students enrolled in AVID, and AVID certification depends on districts following those components. Several studies (AVID Center, 2013) have investigated claims about the AVID program through qualitative analyses; however, very few have addressed this program quantitatively. This researcher sought to determine whether differences existed between student achievement and attendance rates between AVID and non-AVID middle schools. To achieve this goal, the researcher compared eighth-grade science and seventh- and eighth-grade mathematics scores from the 2007 to 2011 Texas Assessment of Knowledge and Skills (TAKS) and overall attendance rates in demographically equivalent AVID and non-AVID middle schools. Academic Excellence Indicator System (AEIS) reports from the Texas Education Agency (TEA) were used to obtain 2007 to 2011 TAKS results and attendance information for the selected schools. The results indicated a statistically significant difference between AVID demonstration students and non-AVID students in schools with similar CI. No statistically significant differences were found on any component of the TAKS for AVID economically disadvantaged students. The mean scores indicated an achievement gap between non-AVID and AVID demonstration middle schools. The findings from the other three research questions indicated no statistically significant differences between AVID and non-AVID student passing rates on the seventh- and eighth-grade TAKS math tests or on overall attendance rates. The mean scores on the eighth-grade TAKS science test revealed some positive results in the academic performance of economically disadvantaged in non-AVID demonstration middle schools. Specifically, the results indicated that the mean passing percentage of AVID demonstration was lower than that of non-AVID middle schools. The TAKS scores showed a small achievement gap between non-AVID and AVID demonstration middle schools.
The effect of nurses' ethical leadership and ethical climate perceptions on job satisfaction.
Özden, Dilek; Arslan, Gülşah Gürol; Ertuğrul, Büşra; Karakaya, Salih
2017-01-01
The development of ethical leadership approaches plays an important role in achieving better patient care. Although studies that analyze the impact of ethical leadership on ethical climate and job satisfaction have gained importance in recent years, there is no study on ethical leadership and its relation to ethical climate and job satisfaction in our country. This descriptive and cross-sectional study aimed to determine the effect of nurses' ethical leadership and ethical climate perceptions on their job satisfaction. The study sample is composed of 285 nurses who agreed to participate in this research and who work at the internal, surgical, and intensive care units of a university hospital and a training and research hospital in İzmir, Turkey. Data were collected using Ethical Leadership Scale, Hospital Ethical Climate Scale, and Minnesota Satisfaction Scale. While the independent sample t-test, analysis of variance, Mann-Whitney U test, and Kruskal-Wallis test were used to analyze the data, the correlation analysis was used to determine the relationship between the scales. Ethical considerations: The study proposal was approved by the ethics committee of the Faculty of Medicine, Dokuz Eylül University. The nurses' mean scores were 59.05 ± 14.78 for the ethical leadership, 92.62 ± 17 for the ethical climate, and 62.15 ± 13.46 for the job satisfaction. The correlation between the nurses' ethical leadership and ethical climate mean scores was moderately positive and statistically significant (r = +0.625, p = 0.000), was weak but statistically significant between their ethical leadership and job satisfaction mean scores (r = +0.461, p = 0.000), and was moderately positive and statistically significant between their ethical climate and job satisfaction mean scores (r = +0.603, p = 0.000). The nurses' ethical leadership, ethical climate, and job satisfaction levels are moderate, and there is a positive relationship between them. The nurses' perceptions of ethical leadership are influenced by their educational status, workplace, and length of service.
CranioSacral Therapy and Visceral Manipulation: A New Treatment Intervention for Concussion Recovery
Roland, Melinda; Fryer-Dietz, Sally; Dettmann-Ahern, Dee
2017-01-01
Abstract Background: Military service members and veterans face health issues related to traumatic brain injury (TBI), especially during combat, use of heavy equipment, and exposures to environmental hazards and explosives. There were 400,000 TBIs reported in deployed U.S. troops in 2012. Athletes are also subject to TBI. Studies have indicated that some manual therapies could be helpful for treating patients who have post-concussive syndrome. Objective: This case series report describes the effects of CranioSacral Therapy (CST), Visceral Manipulation (VM), and Neural Manipulation (NM) modalities for treating patients who have post-concussion syndrome. The goal of this study was to evaluate these effects on immobility, pain intensity, quality of life, sleep disorders, and cognition in these patients. Materials and Methods: This single-blinded case series was conducted at the Upledger Institute, in West Palm Beach, FL. The patients were 11 male retired professional football players from the National Football League and the Canadian Football League who had been medically diagnosed with post-concussion syndrome. Each participant received a morning and afternoon 2-hour session of these three specific manual therapies, which were capable of accessing and addressing the structural, vascular, and neurologic tissues of the cranium and brain—as well addressing far-reaching ramifications throughout the body following trauma. The main outcome measures were scores on the: Impact Neurocognitive Test; Dynavisiontm Test; Short Form–36 Quality of Life Survey, Headache Impact Test, Dizziness Handicap Inventory; a numeric pain rating scale; orthopedic range of motion tests (ROM); and vestibular testing. Hours of sleep were also checked. These outcome measures were registered at baseline, after treatment, and after a 3-month follow up. Results: Statistically significant differences were seen with a decrease in overall pain rating scale scores (P = 0.0448), and cervicogenic pain levels decreased (P = 0.0486). There were statistically significant increases in Dynavision Average Reaction Time (P = 0.0332), Memory Test (P = 0.0156) scores, and cervical ROM scores (P = 0.0377). Hours of sleep averaged 2 hours on the first day of treatment and increased to 4.0 hours at the end of treatment and were continuing to increase, as noted at a 3-month evaluation. Conclusions: Ten sessions of specific CST/VM/NM therapy resulted in statistically greater improvements in pain intensity, ROM, memory, cognition, and sleep in concussed patients. PMID:28874926
Corneal permeability changes in dry eye disease: an observational study.
Fujitani, Kenji; Gadaria, Neha; Lee, Kyu-In; Barry, Brendan; Asbell, Penny
2016-05-13
Diagnostic tests for dry eye disease (DED), including ocular surface disease index (OSDI), tear breakup time (TBUT), corneal fluorescein staining, and lissamine staining, have great deal of variability. We investigated whether fluorophotometry correlated with previously established DED diagnostic tests and whether it could serve as a novel objective metric to evaluate DED. Dry eye patients who have had established signs or symptoms for at least 6 months were included in this observational study. Normal subjects with no symptoms of dry eyes served as controls. Each eye had a baseline fluorescein scan prior to any fluorescein dye. Fluorescein dye was then placed into both eyes, rinsed with saline solution, and scanned at 5, 10, 15, and 30 min. Patients were administered the following diagnostic tests to correlate with fluorophotometry: OSDI, TBUT, fluorescein, and lissamine. Standard protocols were used. P < 0.05 was considered significant. Fifty eyes from 25 patients (DED = 22 eyes, 11 patients; Normal = 28 eyes, 14 patients) were included. Baseline scans of the dry eye and control groups did not show any statistical difference (p = 0.84). Fluorescein concentration of DED and normal patients showed statistical significance at all time intervals (p < 10(-5), 0.001, 0.002, 0.049 for 5, 10, 15, & 30 min respectively). Fluorophotometry values converged towards baseline as time elapsed, but both groups were still statistically different at 30 min (p < 0.01). We used four fluorophotometry scoring methods and correlated them with OSDI, TBUT, fluorescein, and lissamine along with adjusted and aggregate scores. The four scoring schemes did not show any significant correlations with the other tests, except for correlations seen with lissamine and 10 (p = 0.045, 0.034) and 15 min (p = 0.013, 0.012), and with aggregate scores and 15 min (p = 0.042, 0.017). Fluorophotometry generally did not correlate with any other DED tests, even though it showed capability of differentiating between DED and normal eyes up to 30 min after fluorescein dye instillation. There may be an aspect of DED that is missed in the current regimen of DED tests and only captured with fluorophotometry. Adding fluorophotometry may be useful in screening, diagnosing, and monitoring patients with DED.
Al Saffan, Abdulrahman Dahham; Baseer, Mohammad Abdul; Alshammary, Abdul Aziz; Assery, Mansour; Kamel, Ashraf; Rahman, Ghousia
2017-01-01
Aims and Objectives: To assess the early effect of oral health education on oral health knowledge of primary and intermediate school students of private schools by utilizing pre/post questionnaires data from oral health educational projects in Riyadh city, Saudi Arabia. Second, to examine topic-specific knowledge differences between genders, nationalities, and educational levels of the students. Materials and Methods: Cross-sectional oral health educational data of private school students (n = 1279) in primary and intermediate levels were extracted from the King Salman Centre for Children's Health (KSCCH) projects undertaken by Riyadh Colleges of Dentistry and Pharmacy. Student's pre- and post-test data were analyzed for changes in oral health knowledge. Overall knowledge score and topic-specific knowledge scores were calculated and the differences between gender, nationality, and educational level were examined using Mann–Whitney U-test. Pre/post change in the oral health knowledge was evaluated by Wilcoxon's sign rank test. Results: Immediately, after oral health educational session high knowledge score category showed an increase of 25.6%, medium and low knowledge score categories showed −3.2% and −22.3% decrease, and this change was statistically significant (P < 0.001). Comparison of correct responses between pre- and post-test showed statistically significant (P < 0.05) increase in all the questions except for the timing of tooth brushing. Females, non-Saudi nationals and students in primary level of education showed significantly high mean knowledge (P < 0.001) at posttest assessment. Conclusion: Primary and intermediate private school student's overall, and topic-specific oral health knowledge improved immediately after educational intervention provided by KSCCH. High knowledge gain was observed among female non-Saudi primary school students. PMID:29285475
Sleeper, Mark D; Kenyon, Lisa K; Elliott, James M; Cheng, M Samuel
2016-12-01
Despite the availability of various field-tests for many competitive sports, a reliable and valid test specifically developed for use in men's gymnastics has not yet been developed. The Men's Gymnastics Functional Measurement Tool (MGFMT) was designed to assess sport-specific physical abilities in male competitive gymnasts. The purpose of this study was to develop the MGFMT by establishing a scoring system for individual test items and to initiate the process of establishing test-retest reliability and construct validity. A total of 83 competitive male gymnasts ages 7-18 underwent testing using the MGFMT. Thirty of these subjects underwent re-testing one week later in order to assess test-retest reliability. Construct validity was assessed using a simple regression analysis between total MGFMT scores and the gymnasts' USA-Gymnastics competitive level to calculate the coefficient of determination (r 2 ). Test-retest reliability was analyzed using Model 1 Intraclass correlation coefficients (ICC). Statistical significance was set at the p<0.05 level. The relationship between total MGFMT scores and subjects' current USA-Gymnastics competitive level was found to be good (r 2 = 0.63). Reliability testing of the MGFMT composite test score showed excellent test-retest reliability over a one-week period (ICC = 0.97). Test-retest reliability of the individual component tests ranged from good to excellent (ICC = 0.75-0.97). The results of this study provide initial support for the construct validity and test-retest reliability of the MGFMT. Level 3.
Tiffin, Paul A; Mwandigha, Lazaro M; Paton, Lewis W; Hesselgreaves, H; McLachlan, John C; Finn, Gabrielle M; Kasim, Adetayo S
2016-09-26
The UK Clinical Aptitude Test (UKCAT) has been shown to have a modest but statistically significant ability to predict aspects of academic performance throughout medical school. Previously, this ability has been shown to be incremental to conventional measures of educational performance for the first year of medical school. This study evaluates whether this predictive ability extends throughout the whole of undergraduate medical study and explores the potential impact of using the test as a selection screening tool. This was an observational prospective study, linking UKCAT scores, prior educational attainment and sociodemographic variables with subsequent academic outcomes during the 5 years of UK medical undergraduate training. The participants were 6812 entrants to UK medical schools in 2007-8 using the UKCAT. The main outcome was academic performance at each year of medical school. A receiver operating characteristic (ROC) curve analysis was also conducted, treating the UKCAT as a screening test for a negative academic outcome (failing at least 1 year at first attempt). All four of the UKCAT scale scores significantly predicted performance in theory- and skills-based exams. After adjustment for prior educational achievement, the UKCAT scale scores remained significantly predictive for most years. Findings from the ROC analysis suggested that, if used as a sole screening test, with the mean applicant UKCAT score as the cut-off, the test could be used to reject candidates at high risk of failing at least 1 year at first attempt. However, the 'number needed to reject' value would be high (at 1.18), with roughly one candidate who would have been likely to pass all years at first sitting being rejected for every higher risk candidate potentially declined entry on this basis. The UKCAT scores demonstrate a statistically significant but modest degree of incremental predictive validity throughout undergraduate training. Whilst the UKCAT could be considered a fairly crude screening tool for future academic performance, it may offer added value when used in conjunction with other selection measures. Future work should focus on the optimum role of such tests within the selection process and the prediction of post-graduate performance.
Ye, Siqin; Cheng, Bin; Lip, Gregory Y. H.; Buchsbaum, Richard; Sacco, Ralph L.; Levin, Bruce; Di Tullio, Marco R.; Qian, Min; Mann, Douglas L.; Pullicino, Patrick M.; Freudenberger, Ronald S.; Teerlink, John R.; Mohr, J.P.; Graham, Susan; Labovitz, Arthur J.; Estol, Conrado J.; Lok, Dirk J.; Ponikowski, Piotr; Anker, Stefan D.; Thompson, John L.P.; Homma, Shunichi
2015-01-01
We sought to assess the performance of existing bleeding risk scores, such as HAS-BLED or OBRI, in patients with heart failure with reduced ejection fraction (HFrEF) in sinus rhythm (SR) treated with warfarin or aspirin. We calculated HAS-BLED and OBRI risk scores for 2,305 patients with HFrEF in SR enrolled in the Warfarin versus Aspirin in Reduced Cardiac Ejection Fraction (WARCEF) trial. Proportional hazards models were used to test whether each score predicted major bleeding, and comparison of different risk scores was performed using Harell’s c-statistic and net-reclassification improvement (NRI) index. For the warfarin arm, both scores predicted bleeding risk, with OBRI having significantly higher c-statistic (0.72 vs 0.61; p=0.03) compared to HAS-BLED, though the NRI for comparing OBRI to HAS-BLED was not significant (0.32, 95% CI - 0.18-0.37). Performance of the OBRI and HAS-BLED risk scores were similar for the aspirin arm. For participants with OBRI score of 0 to 1, warfarin compared with aspirin reduced ischemic stroke (HR 0.51, 95% CI 0.26-0.98, p=0.042) without significantly increasing major bleeding (HR 1.24, 95% CI 0.66-2.30, p=0.51). For those with OBRI score of ≥2, there was a trend for reduced ischemic stroke with warfarin compared to aspirin (HR 0.56, 95% CI 0.27-1.15, p=0.12), but major bleeding was increased (HR 4.04, 95% CI 1.99-8.22, p<0.001). In conclusion, existing bleeding risk scores can identify bleeding risk in HFrEF patients in SR, and could be tested for potentially identifying patients with a favorable risk / benefit profile for antithrombotic therapy with warfarin. PMID:26189039
Hiemstra, Laurie Anne; Kerslake, Sarah; Lafave, Mark R
2017-11-01
Trochlear dysplasia is a well-described risk factor for recurrent patellofemoral instability. Despite its clear association with the incidence of patellofemoral instability, it is unclear whether the presence of high-grade trochlear dysplasia influences clinical outcome after patellofemoral stabilization. The purpose of this study was to assess whether trochlear dysplasia influenced patient-reported, disease-specific outcomes in surgically treated patellar instability patients, when risk factors were addressed in accordance with the à la carte surgical approach to the treatment of patellofemoral instability. The study design is of a case series. A total of 318 patellar stabilization procedures were performed during the study period. Of these procedures, 260 had adequate lateral radiographs and complete Banff Patellar Instability Instrument (BPII) scores available for assessment. A Pearson r correlation was calculated between four characteristics of trochlear dysplasia, the BPII total and the BPII symptoms, and physical complaints scores, a mean of 24 months following patellofemoral stabilization. Independent t -tests were performed between stratified trochlear dysplasia groups (no/low grade and high grade) and all BPII measures. There was a statistically significant correlation between measures of trochlear dysplasia and quality-of-life physical symptoms scores, an average of 2 years following patellofemoral stabilization surgery. The BPII symptoms and physical complaints domain score, as well as the individual weakness and stiffness questions, correlated with the classification of trochlear dysplasia as well as the presence of a trochlear bump ( p < 0.05). Independent t -tests demonstrated statistically significant differences between the no/low-grade and high-grade dysplasia groups for the BPII stiffness ( p = 0.002), BPII weakness ( p = 0.05) and BPII symptom, and physical complaints values ( p = 0.04). Two additional measures-the 24-month postoperative total BPII score ( p = 0.11) and BPII pain score ( p = 0.07)-demonstrated trends toward statistical significance. This research has established a statistically significant correlation between trochlear dysplasia and disease-specific quality-of-life outcomes following patellofemoral stabilization surgery. There was a significant correlation between patient-reported physical symptoms after surgery and high-grade trochlear dysplasia. Thieme Medical Publishers 333 Seventh Avenue, New York, NY 10001, USA.
ERIC Educational Resources Information Center
Federal Trade Commission, Washington, DC. Bureau of Consumer Protection.
The effect of commercial coaching on Scholastic Aptitude Test (SAT) scores was analyzed, using 1974-1977 test results of 2,500 non-coached students and 1,568 enrollees in two coaching schools. (The Stanley H. Kaplan Educational Center, Inc., and the Test Preparation Center, Inc.). Multiple regression analysis was used to control for student…
Hashmi, Noreen Rahat; Khan, Shazad Ali
2018-05-31
To check if mobile health (m-Health) short message service (SMS) can improve the knowledge and practice of the American Diabetic Association preventive care guidelines (ADA guidelines) recommendations among physicians. Quasi-experimental pre-post study design with a control group. The participants of the study were 62 medical officers/medical postgraduate trainees from two hospitals in Lahore, Pakistan. Pretested questionnaire was used to collect baseline information about physicians' knowledge and adherence according to the ADA guidelines. All the respondents attended 1-day workshop about the guidelines. The intervention group received regular reminders by SMS about the ADA guidelines for the next 5 months. Postintervention knowledge and practice scores of 13 variables were checked again using the same questionnaire. Statistical analysis included χ 2 and McNemar's tests for categorical variables and t-test for continuous variables. Pearson's correlation analysis was done to check correlation between knowledge and practice scores in the intervention group. P values of <0.05 were considered statistically significant. The total number of participating physicians was 62. Fifty-three (85.5%) respondents completed the study. Composite scores within the intervention group showed statistically significant improvement in knowledge (p<0.001) and practice (p<0.001) postintervention. The overall composite scores preintervention and postintervention also showed statistically significant difference of improvement in knowledge (p=0.002) and practice (p=0.001) between non-intervention and intervention groups. Adherence to individual 13 ADA preventive care guidelines level was noted to be suboptimal at baseline. Statistically significant improvement in the intervention group was seen in the following individual variables: review of symptoms of hypoglycaemia and hyperglycaemia, eye examination, neurological examination, lipid examination, referral to ophthalmologist, and counselling about non-smoking. m-Health technology can be a useful educational tool to help with improving knowledge and practice of diabetic guidelines. Future multicentre trials will help to scale this intervention for wider use in resource-limited countries. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Lambert, Carole; Gagnon, Robert; Nguyen, David; Charlin, Bernard
2009-01-01
Background The Script Concordance test (SCT) is a reliable and valid tool to evaluate clinical reasoning in complex situations where experts' opinions may be divided. Scores reflect the degree of concordance between the performance of examinees and that of a reference panel of experienced physicians. The purpose of this study is to demonstrate SCT's usefulness in radiation oncology. Methods A 90 items radiation oncology SCT was administered to 155 participants. Three levels of experience were tested: medical students (n = 70), radiation oncology residents (n = 38) and radiation oncologists (n = 47). Statistical tests were performed to assess reliability and to document validity. Results After item optimization, the test comprised 30 cases and 70 questions. Cronbach alpha was 0.90. Mean scores were 51.62 (± 8.19) for students, 71.20 (± 9.45) for residents and 76.67 (± 6.14) for radiation oncologists. The difference between the three groups was statistically significant when compared by the Kruskall-Wallis test (p < 0.001). Conclusion The SCT is reliable and useful to discriminate among participants according to their level of experience in radiation oncology. It appears as a useful tool to document the progression of reasoning during residency training. PMID:19203358
Visuospatial Aptitude Testing Differentially Predicts Simulated Surgical Skill.
Hinchcliff, Emily; Green, Isabel; Destephano, Christopher; Cox, Mary; Smink, Douglas; Kumar, Amanika; Hokenstad, Erik; Bengtson, Joan; Cohen, Sarah
2018-02-05
To determine if visuospatial perception (VSP) testing is correlated to simulated or intraoperative surgical performance as rated by the American College of Graduate Medical Education (ACGME) milestones. Classification II-2 SETTING: Two academic training institutions PARTICIPANTS: 41 residents, including 19 Brigham and Women's Hospital and 22 Mayo Clinic residents from three different specialties (OBGYN, general surgery, urology). Participants underwent three different tests: visuospatial perception testing (VSP), Fundamentals of Laparoscopic Surgery (FLS®) peg transfer, and DaVinci robotic simulation peg transfer. Surgical grading from the ACGME milestones tool was obtained for each participant. Demographic and subject background information was also collected including specialty, year of training, prior experience with simulated skills, and surgical interest. Standard statistical analysis using Student's t test were performed, and correlations were determined using adjusted linear regression models. In univariate analysis, BWH and Mayo training programs differed in both times and overall scores for both FLS® peg transfer and DaVinci robotic simulation peg transfer (p<0.05 for all). Additionally, type of residency training impacted time and overall score on robotic peg transfer. Familiarity with tasks correlated with higher score and faster task completion (p= 0.05 for all except VSP score). There was no difference in VSP scores by program, specialty, or year of training. In adjusted linear regression modeling, VSP testing was correlated only to robotic peg transfer skills (average time p=0.006, overall score p=0.001). Milestones did not correlate to either VSP or surgical simulation testing. VSP score was correlated with robotic simulation skills but not with FLS skills or ACGME milestones. This suggests that the ability of VSP score to predict competence differs between tasks. Therefore, further investigation is required into aptitude testing, especially prior to its integration as an entry examination into a surgical subspecialty. Copyright © 2018. Published by Elsevier Inc.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zain, Zakiyah, E-mail: zac@uum.edu.my; Ahmad, Yuhaniz, E-mail: yuhaniz@uum.edu.my; Azwan, Zairul, E-mail: zairulazwan@gmail.com, E-mail: farhanaraduan@gmail.com, E-mail: drisagap@yahoo.com
Colorectal cancer is the third and the second most common cancer worldwide in men and women respectively, and the second in Malaysia for both genders. Surgery, chemotherapy and radiotherapy are among the options available for treatment of patients with colorectal cancer. In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve several responses or endpoints, and this situation complicates the analysis. In the case of colorectal cancer, sets of responses concerned with survival times include: times from tumor removal until the first, the second and the third tumor recurrences, andmore » time to death. For a patient, the time to recurrence is correlated to the overall survival. In this study, global score test methodology is used in combining the univariate score statistics for comparing treatments with respect to each survival endpoint into a single statistic. The data of tumor recurrence and overall survival of colorectal cancer patients are taken from a Malaysian hospital. The results are found to be similar to those computed using the established Wei, Lin and Weissfeld method. Key factors such as ethnic, gender, age and stage at diagnose are also reported.« less
NASA Astrophysics Data System (ADS)
Zain, Zakiyah; Aziz, Nazrina; Ahmad, Yuhaniz; Azwan, Zairul; Raduan, Farhana; Sagap, Ismail
2014-12-01
Colorectal cancer is the third and the second most common cancer worldwide in men and women respectively, and the second in Malaysia for both genders. Surgery, chemotherapy and radiotherapy are among the options available for treatment of patients with colorectal cancer. In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve several responses or endpoints, and this situation complicates the analysis. In the case of colorectal cancer, sets of responses concerned with survival times include: times from tumor removal until the first, the second and the third tumor recurrences, and time to death. For a patient, the time to recurrence is correlated to the overall survival. In this study, global score test methodology is used in combining the univariate score statistics for comparing treatments with respect to each survival endpoint into a single statistic. The data of tumor recurrence and overall survival of colorectal cancer patients are taken from a Malaysian hospital. The results are found to be similar to those computed using the established Wei, Lin and Weissfeld method. Key factors such as ethnic, gender, age and stage at diagnose are also reported.
Decreasing Wait Times and Increasing Patient Satisfaction: A Lean Six Sigma Approach.
Godley, Mary; Jenkins, Jeanne B
2018-06-08
Patient satisfaction scores in the vascular interventional radiology department were low, especially related to wait times in registration and for tests/treatments, with low scores for intentions to recommend. The purpose of our quality improvement project was to decrease wait times and improve patient satisfaction using Lean Six Sigma's define, measure, analyze, improve, and control (DMAIC) framework with a pre-/postintervention design. There was a statistically significant decrease in wait times (P < .0019) and an increase in patient satisfaction scores in 3 areas: registration wait times (from 17 to 99 percentiles), test/treatment (from 19 to 60 percentiles), and likelihood to recommend (from 6 to 97 percentiles). Lean Six Sigma was an effective framework for use in decreasing wait times and improving patient satisfaction.
A study on Korean nursing students' educational outcomes
Oh, Kasil; Lee, Hyang-Yeon; Lee, Sook-Ja; Kim, In-Ja; Choi, Kyung-Sook; Ko, Myung-Sook
2011-01-01
The purpose of this study was to describe outcome indicators of nursing education including critical thinking, professionalism, leadership, and communication and to evaluate differences among nursing programs and academic years. A descriptive research design was employed. A total of 454 students from four year baccalaureate (BS) nursing programs and two three-year associate degree (AD) programs consented to complete self-administered questionnaires. The variables were critical thinking, professionalism, leadership and communication. Descriptive statistics, χ2-test, t-tests, ANOVA, and the Tukey test were utilized for the data analysis. All the mean scores of the variables were above average for the test instruments utilized. Among the BS students, those in the upper classes tended to attain higher scores, but this tendency was not identified in AD students. There were significant differences between BS students and AD students for the mean scores of leadership and communication. These findings suggested the need for further research to define properties of nursing educational outcomes, and to develop standardized instruments for research replication and verification. PMID:21602914
Iuliano, Enzo; Fiorilli, Giovanni; Aquino, Giovanna; Di Costanzo, Alfonso; Calcagno, Giuseppe; di Cagno, Alessandra
2017-10-01
This study aimed to evaluate the effects of different types of exercise on memory performance and memory complaint after a 12-week intervention. Eighty community-dwelling volunteers, aged 66.96 ± 11.73 years, were randomly divided into four groups: resistance, cardiovascular, postural, and control groups (20 participants for each group). All participants were tested for their cognitive functions before and after their respective 12-week intervention using Rey memory words test, Prose memory test, and Memory Complaint Questionnaire (MAC-Q). Statistical analysis showed that the three experimental groups significantly improved MAC-Q scores in comparison with the control group (p < .05). The variation of MAC-Q scores and the variations of Rey and Prose memory tests scores were not correlated. These results indicate that the 12-week interventions exclusively influenced memory complaint but not memory performance. Further investigations are needed to understand the relation between memory complaint and memory performance, and the factors that can influence this relationship.
Niemeijer, Anuschka S; van Waelvelde, Hilde; Smits-Engelsman, Bouwien C M
2015-02-01
The Movement Assessment Battery for Children has been revised as the Movement ABC-2 (Henderson, Sugden, & Barnett, 2007). In Europe, the 15th percentile score on this test is recommended for one of the DSM-IV diagnostic criteria for Developmental Coordination Disorder (DCD). A representative sample of Dutch and Flemish children was tested to cross-validate the UK standard scores, including the 15th percentile score. First, the mean, SD and percentile scores of Dutch children were compared to those of UK normative samples. Item standard scores of Dutch speaking children deviated from the UK reference values suggesting necessary adjustments. Except for very young children, the Dutch-speaking samples performed better. Second, based on the mean and SD and clinical relevant cut-off scores (5th and 15th percentile), norms were adjusted for the Dutch population. For diagnostic use, researchers and clinicians should use the reference norms that are valid for the group of children they are testing. The results indicate that there possibly is an effect of testing procedure in other countries that validated the UK norms and/or cultural influence on the age norms of the Movement ABC-2. It is suggested to formulate criterion-based norms for age groups in addition to statistical norms. Copyright © 2014 Elsevier B.V. All rights reserved.
Saxena, Amrish; Prabhakar, Manish Chandra
2013-01-01
Background Dizziness/vertigo is one of the most common complaint and handicapping condition among patients aged 65 years and older (Geriatric patients). This study was conducted to assess the impact of dizziness/vertigo on the quality of life in the geriatric patients attending a geriatric outpatient clinic. Settings and Design A cross-sectional study was performed in a geriatric outpatient clinic of a rural teaching tertiary care hospital in central India. Materials and Methods In all consecutive geriatric patients with dizziness/vertigo attending geriatric outpatient clinic, DHI questionnaire was applied to assess the impact of dizziness/vertigo and dizziness associated handicap in the three areas of a patients’ life: physical, functional and emotional domain. Later, each patient was evaluated and underwent Dix-Hallpike maneuver by the physician who was blind of the DHI scoring of the patient. Statistical Analysis Used We compared means and proportions of variables across two categories of benign paroxysmal positional vertigo (BPPV) and non-BPPV. For these comparisons we used Student’s t-test to test for continuous variables, chi-square test for categorical variables and Fisher’s exact test in the case of small cell sizes (expected value<5). Results The magnitude of dizziness/vertigo was 3%. Of the 88 dizziness/vertigo patients, 19 (22%) and 69(78%) cases, respectively, were attributed to BPPV and non-BPPV group. The association of DHI score ≥50 with the BPPV was found to be statistically significant with x2 value = 58.2 at P<0.01. Conclusion DHI Score is a useful tool for the prediction of benign paroxysmal positional vertigo. Correct diagnosis of BPPV is 16 times greater if the DHI Score is greater than or equal to 50. The physical, functional and emotional investigation of dizziness, through the DHI, has demonstrated to be a valuable and useful instrument in the clinical routine. PMID:23472142
Wechsler Intelligence Scale for Children-V: Test Review.
Na, Sabrina D; Burns, Thomas G
2016-01-01
Changes from the fourth edition of the Wechsler Intelligence Scale for Children (WISC) to the fifth edition are discussed, with particular emphasis on how the electronic administration facilitated assessment. The hierarchical organization and conceptualization of primary indices have been adjusted, based on recent theory and research on the construct of intelligence. Changes also include updates to psychometric properties and consideration of cultural bias. The scoring program allows intelligence scores to be linked statistically to achievement measures to aid in diagnoses of learning disabilities. Electronic assessment was clunky at times but overall delivered on its promise of quicker and more accurate administration and scoring.
Kindergarten Predictors of Math Learning Disability
Mazzocco, Michèle M. M.; Thompson, Richard E.
2009-01-01
The aim of the present study was to address how to effectively predict mathematics learning disability (MLD). Specifically, we addressed whether cognitive data obtained during kindergarten can effectively predict which children will have MLD in third grade, whether an abbreviated test battery could be as effective as a standard psychoeducational assessment at predicting MLD, and whether the abbreviated battery corresponded to the literature on MLD characteristics. Participants were 226 children who enrolled in a 4-year prospective longitudinal study during kindergarten. We administered measures of mathematics achievement, formal and informal mathematics ability, visual-spatial reasoning, and rapid automatized naming and examined which test scores and test items from kindergarten best predicted MLD at grades 2 and 3. Statistical models using standardized scores from the entire test battery correctly classified ~80–83 percent of the participants as having, or not having, MLD. Regression models using scores from only individual test items were less predictive than models containing the standard scores, except for models using a specific subset of test items that dealt with reading numerals, number constancy, magnitude judgments of one-digit numbers, or mental addition of one-digit numbers. These models were as accurate in predicting MLD as was the model including the entire set of standard scores from the battery of tests examined. Our findings indicate that it is possible to effectively predict which kindergartners are at risk for MLD, and thus the findings have implications for early screening of MLD. PMID:20084182
Wallmann, Harvey W; Gillis, Carrie B; Alpert, Patricia T; Miller, Sally K
2009-01-01
The purpose of this pilot study is to assess the impact of a senior jazz dance class on static balance for healthy women over 50 years of age using the NeuroCom Smart Balance Master System (Balance Master). A total of 12 healthy women aged 54-88 years completed a 15-week jazz dance class which they attended 1 time per week for 90 min per class. Balance data were collected using the Sensory Organization Test (SOT) at baseline (pre), at 7 weeks (mid), and after 15 weeks (post). An equilibrium score measuring postural sway was calculated for each of six different conditions. The composite equilibrium score (all six conditions integrated to 1 score) was used as an overall measure of balance. Repeated measures analyses of variance (ANOVAs) were used to compare the means of each participant's SOT composite equilibrium score in addition to the equilibrium score for each individual condition (1-6) across the 3 time points (pre, mid, post). There was a statistically significant difference among the means, p < .0005. Pairwise (Bonferroni) post hoc analyses revealed the following statistically significant findings for SOT composite equilibrium scores for the pre (67.33 + 10.43), mid (75.25 + 6.97), and post (79.00 + 4.97) measurements: premid (p = .008); prepost (p < .0005); midpost (p = .033). In addition, correlational statistics were used to determine any relationship between SOT scores and age. Results indicated that administration of a 15-week jazz dance class 1 time per week was beneficial in improving static balance as measured by the Balance Master SOT.
Lin, Kai-Yang; Zheng, Wei-Ping; Bei, Wei-Jie; Chen, Shi-Qun; Islam, Sheikh Mohammed Shariful; Liu, Yong; Xue, Lin; Tan, Ning; Chen, Ji-Yan
2017-03-01
A few studies developed simple risk model for predicting CIN with poor prognosis after emergent PCI. The study aimed to develop and validate a novel tool for predicting the risk of contrast-induced nephropathy (CIN) in patients undergoing emergent percutaneous coronary intervention (PCI). 692 consecutive patients undergoing emergent PCI between January 2010 and December 2013 were randomly (2:1) assigned to a development dataset (n=461) and a validation dataset (n=231). Multivariate logistic regression was applied to identify independent predictors of CIN, and established CIN predicting model, whose prognostic accuracy was assessed using the c-statistic for discrimination and the Hosmere Lemeshow test for calibration. The overall incidence of CIN was 55(7.9%). A total of 11 variables were analyzed, including age >75years old, baseline serum creatinine (SCr)>1.5mg/dl, hypotension and the use of intra-aortic balloon pump(IABP), which were identified to enter risk score model (Chen). The incidence of CIN was 32(6.9%) in the development dataset (in low risk (score=0), 1.0%, moderate risk (score:1-2), 13.4%, high risk (score≥3), 90.0%). Compared to the classical Mehran's and ACEF CIN risk score models, the risk score (Chen) across the subgroup of the study population exhibited similar discrimination and predictive ability on CIN (c-statistic:0.828, 0.776, 0.853, respectively), in-hospital mortality, 2, 3-years mortality (c-statistic:0.738.0.750, 0.845, respectively) in the validation population. Our data showed that this simple risk model exhibited good discrimination and predictive ability on CIN, similar to Mehran's and ACEF score, and even on long-term mortality after emergent PCI. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
NBME subject examination in surgery scores correlate with surgery clerkship clinical experience.
Myers, Jonathan A; Vigneswaran, Yalini; Gabryszak, Beth; Fogg, Louis F; Francescatti, Amanda B; Golner, Christine; Bines, Steven D
2014-01-01
Most medical schools in the United States use the National Board of Medical Examiners Subject Examinations as a method of at least partial assessment of student performance, yet there is still uncertainty of how well these examination scores correlate with clinical proficiency. Thus, we investigated which factors in a surgery clerkship curriculum have a positive effect on academic achievement on the National Board of Medical Examiners Subject Examination in Surgery. A retrospective analysis of 83 third-year medical students at our institution with 4 unique clinical experiences on the general surgery clerkship for the 2007-2008 academic year was conducted. Records of the United States Medical Licensing Examination Step 1 scores, National Board of Medical Examiners Subject Examination in Surgery scores, and essay examination scores for the groups were compared using 1-way analysis of variance testing. Rush University Medical Center, Chicago IL, an academic institution and tertiary care center. Our data demonstrated National Board of Medical Examiners Subject Examination in Surgery scores from the group with the heavier clinical loads and least time for self-study were statistically higher than the group with lighter clinical services and higher rated self-study time (p = 0.036). However, there was no statistical difference of National Board of Medical Examiners Subject Examination in Surgery scores between the groups with equal clinical loads (p = 0.751). Students experiencing higher clinical volumes on surgical services, but less self-study time demonstrated statistically higher academic performance on objective evaluation, suggesting clinical experience may be of higher value than self-study and reading. Copyright © 2014 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
Quantitative analysis of the text and graphic content in ophthalmic slide presentations.
Ing, Edsel; Celo, Erdit; Ing, Royce; Weisbrod, Lawrence; Ing, Mercedes
2017-04-01
To determine the characteristics of ophthalmic digital slide presentations. Retrospective quantitative analysis. Slide presentations from a 2015 Canadian primary eye care conference were analyzed for their duration, character and word count, font size, words per minute (wpm), lines per slide, words per slide, slides per minute (spm), text density product (wpm × spm), proportion of graphic content, and Flesch Reading Ease (FRE) score using Microsoft PowerPoint and Word. The median audience evaluation score for the lectures was used to dichotomize the higher scoring lectures (HSL) from the lower scoring lectures (LSL). A priori we hypothesized that there would be a difference in the wpm, spm, text density product, and FRE score between HSL and LSL. Wilcoxon rank-sum tests with Bonferroni correction were utilized. The 17 lectures had medians of 2.5 spm, 20.3 words per slide, 5.0 lines per slide, 28-point sans serif font, 36% graphic content, and text density product of 136.4 words × slides/minute 2 . Although not statistically significant, the HSL had more wpm, fewer words per slide, more graphics per slide, greater text density, and higher FRE score than LSL. There was a statistically significant difference in the spm of the HSL (3.1 ± 1.0) versus the LSL (2.2 ± 1.0) at p = 0.0124. All presenters showed more than 1 slide per minute. The HSL showed more spm than the LSL. The descriptive statistics from this study may aid in the preparation of slides used for teaching and conferences. Copyright © 2017 Canadian Ophthalmological Society. Published by Elsevier Inc. All rights reserved.
Validation of the Fatigue Impact Scale in Hungarian patients with multiple sclerosis.
Losonczi, Erika; Bencsik, Krisztina; Rajda, Cecília; Lencsés, Gyula; Török, Margit; Vécsei, László
2011-03-01
Fatigue is one of the most frequent complaints of patients with multiple sclerosis (MS). The Fatigue Impact Scale (FIS), one of the 30 available fatigue questionnaires, is commonly applied because it evaluates multidimensional aspects of fatigue. The main purposes of this study were to test the validity, test-retest reliability, and internal consistency of the Hungarian version of the FIS. One hundred and eleven MS patients and 85 healthy control (HC) subjects completed the FIS and the Beck Depression Inventory, a large majority of them on two occasions, 3 months apart. The total FIS score and subscale scores differed statistically between the MS patients and the HC subjects in both FIS sessions. In the test-retest reliability assessment, statistically, the intraclass correlation coefficients were high in both the MS and HC groups. Cronbach's alpha values were also notably high. The results of this study indicate that the FIS can be regarded as a valid and reliable scale with which to improve our understanding of the impact of fatigue on the health-related quality of life in MS patients without severe disability.
The effect of digital rectal exam on the 4Kscore for aggressive prostate cancer.
Maccini, Michael A; Westfall, Nicholas J; Van Bokhoven, Adrie; Lucia, Marshall Scott; Poage, Wendy; Maroni, Paul D; Wilson, Shandra S; Glodé, Leonard Michael; Arangua, Paul; Newmark, Jay; Steiner, Mitchell; Werahera, Priya N; Crawford, Elward David
2018-05-01
The 4Kscore is a new commercially available blood-based diagnostic test which predicts risk for aggressive, clinically significant prostate cancer on prostate biopsy. The 4Kscore is currently restricted to patients who have not had a digital rectal exam (DRE) in the previous 96 h, owing to prior mixed data suggesting that prostate specific antigen (PSA) isoforms may increase by a statistically significant-if not necessarily clinically significant-amount shortly after DRE. Our primary objective was to determine if 4Kscore test results are affected by a preceding DRE. Participants at a Prostate Cancer Awareness Week screening event sponsored by the Prostate Conditions Education Council filled out clinical history questionnaires and had blood samples for 4Kscore testing drawn prior to DRE, then 15-45 min following DRE. Patients with prior cancer diagnosis, 5-alpha reductase inhibitor medication use, or lower urinary tract procedures in the prior 6 months were excluded, resulting in a population of 162 participants for analysis. Values were then compared to determine if there was a significant difference in 4Kscore following DRE. A statistically significant increase was seen in levels of 3 kallikreins measured (total PSA, free PSA, and intact PSA; median <0.03 ng/mL for all). This resulted in a small but statistically significant decrease in post-DRE 4Kscore (median absolute score decrease 0.43%). Using a 4Kscore cutoff of 7.5% resulted in reclassification of 10 patients (6.2%), nine of whom were "downgraded" from above the cutoff to below. If the blood draw for the 4 K score is performed after a screening DRE, there is a statistically significant difference in the 4 K score results, but in the vast majority of cases it would not affect clinical decision making. © 2018 Wiley Periodicals, Inc.
Sadri, Donia; Farhadi, Sareh; Shahabi, Zahra; Sarshar, Samaneh
2016-01-01
The recent scientific reports have shown that angiogenesis can affect biological behavior of pathologic lesions. Regarding unique clinical outcome of Odontogenic keratocyst (OKC), the present study was aimed to compare angiogenesis in Odontogenic keratocyst and Dentigerous cyst (DC). In this experimental study, tissue sections of 46 samples of OKC and DC were stained through immunohistochemical method using Vascular Endothelial Growth Factor (VEGF) antibody. VEGF expression was evaluated in epithelial cells, fibroblasts and endothelial cells. The average percentage of stained cells in any samples was categorized to 3 groups as follows: SCORE 0: 10% of cells or less are positive. SCORE 1: 10 to 50% of cells are positive. SCORE 2: more than 50% of cells are positive. Mann-U-Whitney, T-test and chi-square was used for statistical analysis. The average of VEGF expression in 24 samples of DC was 20.2% and in 22 samples of OKC was 52.6%, respectively. The average of VEGF expression in these two cysts had statistical significant differences. (PV= 0.045). There was significant statistical differences between two cysts in the terms of VEGF SCORE (PV= 0.000). OKC samples had significantly higher SCORE for the purpose of VEGF incidence than DC. Also, there were no differences between VEGF expression in epithelial cells of two cysts (PV= 0.268) there were significant statistical differences between two cysts in terms of endothelial cell staining. The endothelial cell staining was significantly higher in OKC than DC (PV= 0.037%). Regarding higher expression of Vascular Endothelial Growth factor in OKC than DC, it seems that angiogenesis may have great impression on clinical outcome of OKC.
ERIC Educational Resources Information Center
Ossai, Peter Agbadobi Uloku
2016-01-01
This study examined the relationship between students' scores on Research Methods and statistics, and undergraduate project at the final year. The purpose was to find out whether students matched knowledge of research with project-writing skill. The study adopted an expost facto correlational design. Scores on Research Methods and Statistics for…
Peters, L L; Boter, H; Burgerhof, J G M; Slaets, J P J; Buskens, E
2015-09-01
The primary objective of the present study was to evaluate the validity of the Groningen Frailty Indicator (GFI) in a sample of Dutch elderly persons participating in LifeLines, a large population-based cohort study. Additional aims were to assess differences between frail and non-frail elderly and examine which individual characteristics were associated with frailty. By December 2012, 5712 elderly persons were enrolled in LifeLines and complied with the inclusion criteria of the present study. Mann-Whitney U or Kruskal-Wallis tests were used to assess the variability of GFI-scores among elderly subgroups that differed in demographic characteristics, morbidity, obesity, and healthcare utilization. Within subgroups Kruskal-Wallis tests were also used to examine differences in GFI-scores across age groups. Multivariate logistic regression analyses were performed to assess associations between individual characteristics and frailty. The GFI discriminated between subgroups: statistically significantly higher GFI-median scores (interquartile range) were found in e.g. males (1 [0-2]), the oldest old (2 [1-3]), in elderly who were single (1 [0-2]), with lower socio economic status (1 [0-3]), with increasing co-morbidity (2 [1-3]), who were obese (2 [1-3]), and used more healthcare (2 [1-4]). Overall age had an independent and statistically significant association with GFI scores. Compared with the non-frail, frail elderly persons experienced statistically significantly more chronic stress and more social/psychological related problems. In the multivariate logistic regression model, psychological morbidity had the strongest association with frailty. The present study supports the construct validity of the GFI and provides an insight in the characteristics of (non)frail community-dwelling elderly persons participating in LifeLines. Copyright © 2015 Elsevier Inc. All rights reserved.
Takebayashi, Hironori; Tsuzuki, Kenzo; Oka, Hideki; Fukazawa, Keijiro; Daimon, Takashi; Sakagami, Masafumi
2011-02-01
This study demonstrated statistical correlations between a novel self-administered odor questionnaire (SAOQ) and other olfaction tests in patients with olfactory disorders, and the usefulness of this questionnaire was discussed. Between December 2004 and November 2009 (5 years), the SAOQ was completed by 405 healthy people without any nasal diseases (Group A) and 539 patients with an olfactory disorder (Group B) at the Department of Otolaryngology, Hyogo College of Medicine. This was a prospective study. The SAOQ proposed by the Japan Rhinology Society is a self-administered survey consisting of 20 smell-related items: "steamed rice, miso, seaweed, soy sauce, baked bread, butter, curry, garlic, orange, strawberry, green tea, coffee, chocolate, household gas, garbage, timber, stercus, sweat, flower, and perfume". The normal reference range of scores (%) of the SAOQ was calculated in Group A. To determine whether the results of the SAOQ were correlated with those of visual analogue scale (VAS) and T&T olfactometer, pre- and post-treatment results of the SAOQ and olfaction tests were analyzed. The questionnaire response rates were 99.5% (403/405 people) in Group A and 95.9% (517/539 patients) in Group B. The statistically normal reference level of the SAOQ was determined as more than 70%. In Group B, the mean pre-treatment SAOQ score (20.4%), VAS score (16.5%), and T&T recognition threshold (5.0) significantly improved to values of 46.7%, 41.1%, and 4.1 after treatments, respectively (n=249). Both pre- and post-treatment SAOQ scores (ΔQ) had statistically significant relationships with those of VAS and T&T (n=249). The utility of the SAOQ as an easy method of estimating olfaction was suggested. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Finn, Gabrielle M; Mwandigha, Lazaro; Paton, Lewis W; Tiffin, Paul A
2018-05-03
In addition to the evaluation of educational attainment and intellectual ability there has been interest in the potential to select medical school applicants on non-academic qualities. Consequently, a battery of self-report measures concerned with assessing 'non-cognitive' traits was piloted as part of the UK Clinical Aptitude Test (UKCAT) administration to evaluate their potential to be used in selection. The four non-cognitive instruments piloted were: 1) the Libertarian-communitarian scale, (2) The NACE (narcissism, aloofness, confidence and empathy, (3) the MEARS (Managing emotions and resilience scale; self-esteem, optimism, control, self-discipline, emotional-nondefensiveness and faking, and (4) an abridged version of instruments (1) and (2) combined. Non-cognitive scores and sociodemographic characteristics were available for 14,387 applicants. A series of univariable and multivariable analyses were conducted in order to assess the ability of the non-cognitive scores to predict knowledge and skills-based performance, as well as the odds of passing each academic year at first attempt. Non-cognitive scores and medical performance were standardised within cohorts. The scores on the non-cognitive scales showed only very small (magnitude of standardised betas< 0.2), though sometimes statistically significant (p < 0.01) univariable associations with subsequent performance on knowledge or skills-based assessments. The only statistically significant association between the non-cognitive scores and the probability of passing an academic year at first attempt was the narcissism score from one the abridged tests (OR 0.84,95% confidence intervals 0.71 to 0.97, p = 0.02). Our findings are consistent with previously published research. The tests had a very limited ability to predict undergraduate academic performance, though further research on identifying narcissism in medical students may be warranted. However, the validity of such self-report tools in high-stakes settings may be affected, making such instruments unlikely to add value within the selection process.
Rivalta, Massimo; Sighinolfi, Maria Chiara; Micali, Salvatore; De Stefani, Stefano; Bianchi, Giampaolo
2010-03-01
Urinary incontinence (UI) is a debilitating condition that can cause discomfort, embarrassment, loss of confidence; it can lead to withdrawal from social life, and adversely affects physical and mental health, sexual function and quality of life (QoL) in women. The aim is to determine the impact of combined pelvic floor rehabilitation (PFR) on UI, female sexual dysfunction, and QoL. Female Sexual Function Index questionnaire (FSFI) and King's Health Questionnaire (KHQ). Sixteen patients with UI were selected and underwent a complete PFR program (biofeedback, functional electrical stimulation, pelvic floor muscles exercises, and vaginal cones). Patient filled out the FSFI questionnaire and the KHQ at the baseline and at follow-up. After PFR none of the patients reported urine leakage during sexual activity. Resolution of incontinence was achieved in 13 (81.25%) women. Only three (18.75%) patients had positive 1-hour pad test after the treatment. There was significant difference between pad test leakage before and after the PFR (P < 0.001). The mean Stamey incontinence score was 1.37 +/- 0.5 at the baseline vs. 0.25 +/- 0.57 at the follow up (P < 0.001). Before PFR, FSFI total score ranged from 25.8 to 2 (mean 14.65 +/- 6.88), after treatment the FSFI total score ranged from 36 to 2 (mean 22.65 +/- 9.5) (P < 0.001). The improvement of the scores in the six FSFI domains, 5 months after the conclusion of PFR, was statistically significant (desire, arousal, lubrication, orgasm, satisfaction, and pain). All the nine domains in the KHQ presented a low average score after treatment and the improvements were statistically significant. PFR led to a significant difference in the daily use of pads, 1-hour pad test, and Stamey incontinence scores. The treatment caused an improvement in patient's QoL index and sexual function.
Zhang, Ying-Li; Liang, Wei; Chen, Zuo-Ming; Zhang, Hong-Mei; Zhang, Jian-Hong; Weng, Xiao-Qin; Yang, Shi-Chang; Zhang, Lei; Shen, Li-Juan; Zhang, Ya-Lin
2013-12-01
This study examined the validity and reliability of the Patient Health Questionnaire-9 (PHQ-9) and Patient Health Questionnaire-2 (PHQ-2). The optimal cutoff score when screening for depression among Chinese college students was also determined. A total of 959 participants completed the PHQ-9 and the Beck Depression Inventory (BDI) questionnaire. The Structured Clinical Interview for Diagnostic and Statistical Manual of Mental Disorders was used to diagnose depression. Statistical tests were performed to determine the reliability, validity, and receiver operating characteristic curve of the data. The concurrent validity was tested by examining associations between PHQ-9 and BDI. The sensitivity and specificity, as well as the positive and negative predictive values, were calculated for different cutoff scores of PHQ-9 and PHQ-2. The internal consistency values of PHQ-9 and PHQ-2 were 0.854 and 0.727, respectively. The test-retest reliability values of PHQ-9 and PHQ-2 were 0.873 and 0.829, respectively. The scores of PHQ-9 (r = 0.790) and PHQ-2 (r = 0.651) were significantly associated with that of BDI. PHQ-9 had an optimal cutoff score of 11, which indicated a sensitivity of 0.89 and a specificity of 0.97, with an area under the curve of 0.977 (95% confidence interval: 0.966-0.988). The PHQ-2 demonstrated satisfactory sensitivity (0.81) and specificity (0.96) at the cutoff score of 3, and its area under the curve was 0.939. The PHQ-9 and the PHQ-2 are valid and reliable tools to screen depression in Chinese college students. For screening purposes, cutoff scores of 11 and 3 are recommended for PHQ-9 and PHQ-2, respectively. Copyright © 2013 Wiley Publishing Asia Pty Ltd.
Association between osteoporosis and periodontal disease among postmenopausal Indian women.
Richa; R, Yashoda; Puranik, Manjunath P; Shrivastava, Amit
2017-08-01
The aim of the present study was to determine the association between osteoporosis and periodontal disease among postmenopausal Indian women. A cross-sectional comparative study was conducted among postmenopausal women aged 45-65 years attending various hospitals in Bangalore, India. The examination was performed using the plaque index, gingival index, modified sulcus bleeding index, and community periodontal index. The women then underwent a bone mineral density (BMD) test using an ultrasonometer. Based on the BMD scores, participants were divided into osteoporotic and non-osteoporotic groups. For the statistical analysis, χ 2 -test, Student's t-test, and multiple regression analysis were applied. The mean plaque, gingival, and bleeding scores were significantly higher among osteoporotic women (1.83 ± 0.47, 1.73 ± 0.49, 1.82 ± 0.52) compared to the non-osteoporotic women (1.31 ± 0.40, 1.09 ± 0.52, 1.25 ± 0.50). The mean number of sextants affected for codes 3 and 4 of the community periodontal index and codes 1, 2, and 3 of loss of attachment were significantly higher among osteoporotic group compared to the non-osteoporotic group. Multiple logistic regression tests confirmed the statistically-significant association between osteoporosis and menopause duration, loss of attachment, bleeding, and gingivitis scores. Skeletal BMD is related to clinical attachment loss, bleeding, and gingivitis, which suggests that there is an association between osteoporosis and periodontal diseases. © 2016 John Wiley & Sons Australia, Ltd.
ERIC Educational Resources Information Center
Barton, James M.
2016-01-01
Carnegie Learning's Cognitive Tutor®The purpose of this study is to determine whether there is a statistically significant difference between pre-test and post-test achievement scores when Compass Learning's Odyssey Math® is used together with Carnegie Learning's Math Cognitive Tutor® in a mathematics intervention program at ABC Middle School. The…
The Accuracy of Estimated Total Test Statistics. Final Report.
ERIC Educational Resources Information Center
Kleinke, David J.
In a post-mortem study of item sampling, 1,050 examinees were divided into ten groups 50 times. Each time, their papers were scored on four different sets of item samples from a 150-item test of academic aptitude. These samples were selected using (a) unstratified random sampling and stratification on (b) content, (c) difficulty, and (d) both.…
Blind image quality assessment without training on human opinion scores
NASA Astrophysics Data System (ADS)
Mittal, Anish; Soundararajan, Rajiv; Muralidhar, Gautam S.; Bovik, Alan C.; Ghosh, Joydeep
2013-03-01
We propose a family of image quality assessment (IQA) models based on natural scene statistics (NSS), that can predict the subjective quality of a distorted image without reference to a corresponding distortionless image, and without any training results on human opinion scores of distorted images. These `completely blind' models compete well with standard non-blind image quality indices in terms of subjective predictive performance when tested on the large publicly available `LIVE' Image Quality database.
Ng, M L; Warlow, R S; Chrishanthan, N; Ellis, C; Walls, R
2000-09-01
The aim of this study is to formulate criteria for the definition of allergic rhinitis. Other studies have sought to develop scoring systems to categorize the severity of allergic rhinitis symptoms but it was never used for the formulation of diagnostic criteria. These other scoring systems were arbitrarily chosen and were not derived by any statistical analysis. To date, a study of this kind has not been performed. The hypothesis of this study is that it is possible to formulate criteria for the definition of allergic rhinitis. This is the first study to systematically examine and evaluate the relative importance of symptoms, signs and investigative tests in allergic rhinitis. We sought to statistically rank, from the most to the least important, the multiplicity of symptoms, signs and test results. Forty-seven allergic rhinitis and 23 normal subjects were evaluated with a detailed questionnaire and history, physical examination, serum total immunoglobulin E, skin prick tests and serum enzyme allergosorbent tests (EAST). Statistical ranking of variables indicated rhinitis symptoms (nasal, ocular and oronasal) were the most commonly occurring, followed by a history of allergen provocation, then serum total IgE, positive skin prick tests and positive EAST's to house dust mite, perennial rye and bermuda/couch grass. Throat symptoms ranked even lower whilst EAST's to cat epithelia, plantain and cockroach were the least important. Not all symptoms, signs and tests evaluated proved to be statistically significant when compared to a control group; this included symtoms and signs which had been considered historically to be traditionally associated with allergic rhinitis, e.g. sore throat and bleeding nose. In performing statistical analyses, we were able to rank from most to least important, the multiplicity of symptoms signs and test results. The most important symptoms and signs were identified for the first time, even though some of these were not included in our original selection criteria for defining the disease cohort i.e. sniffing, postnasal drip, oedematous nasal mucosa, impaired sense of smell, mouth breathing, itchy nose and many of the specific provocation factors.
de Souza, Ana Célia Caetano; Moreira, Thereza Maria Magalhaes; de Oliveira, Edmar Souza; de Menezes, Anaíze Viana Bezerra; Loureiro, Aline Maria Oliveira; Silva, Camila Brasileiro de Araújo; Linard, Jair Gomes; de Almeida, Italo Lennon Sales; Mattos, Samuel Miranda; Borges, José Wicto Pereira
2016-01-01
The objective of this study was to test the effectiveness of an educational intervention with use of educational technology (flipchart) to promote quality of life (QOL) and treatment adherence in people with hypertension. It was an intervention study of before-and-after type conducted with 116 hypertensive people registered in Primary Health Care Units. The educational interventions were conducted using the flipchart educational technology. Quality of life was assessed through the MINICHAL (lowest score = better QOL) and the QATSH (higher score = better adherence) was used to assess the adherence to hypertension treatment. Both were measured before and after applying the intervention. In the analysis, we used the Student’s t-test for paired data. The average baseline quality of life was 11.66 ± 7.55, and 7.71 ± 5.72 two months after the intervention, showing a statistically significant reduction (p <0.001) and mean of differences of 3.95. The average baseline adherence to treatment was 98.03 ± 7.08 and 100.71 ± 6.88 two months after the intervention, which is statistically significant (p < 0.001), and mean of differences of 2.68. The conclusion was that the educational intervention using the flipchart improved the total score of quality of life in the scores of physical and mental domains, and increased adherence to hypertension treatment in people with the disease. PMID:27851752
de Souza, Ana Célia Caetano; Moreira, Thereza Maria Magalhaes; Oliveira, Edmar Souza de; Menezes, Anaíze Viana Bezerra de; Loureiro, Aline Maria Oliveira; Silva, Camila Brasileiro de Araújo; Linard, Jair Gomes; Almeida, Italo Lennon Sales de; Mattos, Samuel Miranda; Borges, José Wicto Pereira
2016-01-01
The objective of this study was to test the effectiveness of an educational intervention with use of educational technology (flipchart) to promote quality of life (QOL) and treatment adherence in people with hypertension. It was an intervention study of before-and-after type conducted with 116 hypertensive people registered in Primary Health Care Units. The educational interventions were conducted using the flipchart educational technology. Quality of life was assessed through the MINICHAL (lowest score = better QOL) and the QATSH (higher score = better adherence) was used to assess the adherence to hypertension treatment. Both were measured before and after applying the intervention. In the analysis, we used the Student's t-test for paired data. The average baseline quality of life was 11.66 ± 7.55, and 7.71 ± 5.72 two months after the intervention, showing a statistically significant reduction (p <0.001) and mean of differences of 3.95. The average baseline adherence to treatment was 98.03 ± 7.08 and 100.71 ± 6.88 two months after the intervention, which is statistically significant (p < 0.001), and mean of differences of 2.68. The conclusion was that the educational intervention using the flipchart improved the total score of quality of life in the scores of physical and mental domains, and increased adherence to hypertension treatment in people with the disease.
Kelly, Maureen E; Regan, Daniel; Dunne, Fidelma; Henn, Patrick; Newell, John; O'Flynn, Siun
2013-05-10
Internationally, tests of general mental ability are used in the selection of medical students. Examples include the Medical College Admission Test, Undergraduate Medicine and Health Sciences Admission Test and the UK Clinical Aptitude Test. The most widely used measure of their efficacy is predictive validity.A new tool, the Health Professions Admission Test- Ireland (HPAT-Ireland), was introduced in 2009. Traditionally, selection to Irish undergraduate medical schools relied on academic achievement. Since 2009, Irish and EU applicants are selected on a combination of their secondary school academic record (measured predominately by the Leaving Certificate Examination) and HPAT-Ireland score. This is the first study to report on the predictive validity of the HPAT-Ireland for early undergraduate assessments of communication and clinical skills. Students enrolled at two Irish medical schools in 2009 were followed up for two years. Data collected were gender, HPAT-Ireland total and subsection scores; Leaving Certificate Examination plus HPAT-Ireland combined score, Year 1 Objective Structured Clinical Examination (OSCE) scores (Total score, communication and clinical subtest scores), Year 1 Multiple Choice Questions and Year 2 OSCE and subset scores. We report descriptive statistics, Pearson correlation coefficients and Multiple linear regression models. Data were available for 312 students. In Year 1 none of the selection criteria were significantly related to student OSCE performance. The Leaving Certificate Examination and Leaving Certificate plus HPAT-Ireland combined scores correlated with MCQ marks.In Year 2 a series of significant correlations emerged between the HPAT-Ireland and subsections thereof with OSCE Communication Z-scores; OSCE Clinical Z-scores; and Total OSCE Z-scores. However on multiple regression only the relationship between Total OSCE Score and the Total HPAT-Ireland score remained significant; albeit the predictive power was modest. We found that none of our selection criteria strongly predict clinical and communication skills. The HPAT- Ireland appears to measures ability in domains different to those assessed by the Leaving Certificate Examination. While some significant associations did emerge in Year 2 between HPAT Ireland and total OSCE scores further evaluation is required to establish if this pattern continues during the senior years of the medical course.
2013-01-01
Background Internationally, tests of general mental ability are used in the selection of medical students. Examples include the Medical College Admission Test, Undergraduate Medicine and Health Sciences Admission Test and the UK Clinical Aptitude Test. The most widely used measure of their efficacy is predictive validity. A new tool, the Health Professions Admission Test- Ireland (HPAT-Ireland), was introduced in 2009. Traditionally, selection to Irish undergraduate medical schools relied on academic achievement. Since 2009, Irish and EU applicants are selected on a combination of their secondary school academic record (measured predominately by the Leaving Certificate Examination) and HPAT-Ireland score. This is the first study to report on the predictive validity of the HPAT-Ireland for early undergraduate assessments of communication and clinical skills. Method Students enrolled at two Irish medical schools in 2009 were followed up for two years. Data collected were gender, HPAT-Ireland total and subsection scores; Leaving Certificate Examination plus HPAT-Ireland combined score, Year 1 Objective Structured Clinical Examination (OSCE) scores (Total score, communication and clinical subtest scores), Year 1 Multiple Choice Questions and Year 2 OSCE and subset scores. We report descriptive statistics, Pearson correlation coefficients and Multiple linear regression models. Results Data were available for 312 students. In Year 1 none of the selection criteria were significantly related to student OSCE performance. The Leaving Certificate Examination and Leaving Certificate plus HPAT-Ireland combined scores correlated with MCQ marks. In Year 2 a series of significant correlations emerged between the HPAT-Ireland and subsections thereof with OSCE Communication Z-scores; OSCE Clinical Z-scores; and Total OSCE Z-scores. However on multiple regression only the relationship between Total OSCE Score and the Total HPAT-Ireland score remained significant; albeit the predictive power was modest. Conclusion We found that none of our selection criteria strongly predict clinical and communication skills. The HPAT- Ireland appears to measures ability in domains different to those assessed by the Leaving Certificate Examination. While some significant associations did emerge in Year 2 between HPAT Ireland and total OSCE scores further evaluation is required to establish if this pattern continues during the senior years of the medical course. PMID:23663266
Comparison of Risk Scores for Prediction of Complications following Aortic Valve Replacement.
Wang, Tom Kai Ming; Choi, David Hyun-Min; Haydock, David; Gamble, Greg; Stewart, Ralph; Ruygrok, Peter
2015-06-01
Risk models play an important role in stratification of patients for cardiac surgery, but their prognostic utilities for post-operative complications are rarely studied. We compared the EuroSCORE, EuroSCORE II, Society of Thoracic Surgeon's (STS) Score and an Australasian model (Aus-AVR Score) for predicting morbidities after aortic valve replacement (AVR), and also evaluated seven STS complications models in this context. We retrospectively calculated risk scores for 620 consecutive patients undergoing isolated AVR at Auckland City Hospital during 2005-2012, assessing their discrimination and calibration for post-operative complications. Amongst mortality scores, the EuroSCORE was the best at discriminating stroke (c-statistic 0.845); the EuroSCORE II at deep sternal wound infection (c=0.748); and the STS Score at composite morbidity or mortality (c=0.666), renal failure (c=0.634), ventilation>24 hours (c=0.732), return to theatre (c=0.577) and prolonged hospital stay >14 days post-operatively (c=0.707). The individual STS complications models had a marginally higher c-statistic (c=0.634-0.846) for all complications except mediastinitis, and had good calibration (Hosmer-Lemeshow test P-value 0.123-0.915) for all complications. The STS Score was best overall at discriminating post-operative complications and their composite for AVR. All STS complications models except for deep sternal wound infection had good discrimination and calibration for post-operative complications. Copyright © 2014 Australian and New Zealand Society of Cardiac and Thoracic Surgeons (ANZSCTS) and the Cardiac Society of Australia and New Zealand (CSANZ). Published by Elsevier B.V. All rights reserved.
Difference to Inference: teaching logical and statistical reasoning through on-line interactivity.
Malloy, T E
2001-05-01
Difference to Inference is an on-line JAVA program that simulates theory testing and falsification through research design and data collection in a game format. The program, based on cognitive and epistemological principles, is designed to support learning of the thinking skills underlying deductive and inductive logic and statistical reasoning. Difference to Inference has database connectivity so that game scores can be counted as part of course grades.
Effects of Classroom Ventilation Rate and Temperature on Students' Test Scores.
Haverinen-Shaughnessy, Ulla; Shaughnessy, Richard J
2015-01-01
Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students' mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9-7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12-13 points per each 1°C decrease in temperature within the observed range of 20-25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students.
Effects of Classroom Ventilation Rate and Temperature on Students’ Test Scores
2015-01-01
Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students’ mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9–7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12–13 points per each 1°C decrease in temperature within the observed range of 20–25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students. PMID:26317643
African-American adolescents’ stress responses after the 9/11/01 terrorist attacks
Barnes, Vernon A.; Treiber, Frank A.; Ludwig, David A.
2012-01-01
Purpose To examine the impact of indirect exposure to the 9/11/01 attacks upon physical and emotional stress-related responses in a community sample of African-American (AA) adolescents. Methods Three months after the 9/11/01 terrorist attacks, 406 AA adolescents (mean age [SD] of 16.1 ± 1.3 years) from an inner-city high school in Augusta, GA were evaluated with a 12-item 5-point Likert scale measuring loss of psychosocial resources (PRS) such as control, hope, optimism, and perceived support, a 17-item 5-point Likert scale measuring post-traumatic stress symptomatology (PCL), and measures of state and trait anger, anger expression, and hostility. Given the observational nature of the study, statistical differences and correlations were evaluated for effect size before statistical testing (5% minimum variance explained). Bootstrapping was used for testing mean differences and differences between correlations. Results PCL scores indicated that approximately 10% of the sample was experiencing probable clinically significant levels of post-traumatic distress (PCL score > 50). The PCL and PRS were moderately correlated with a r = .59. Gender differences for the PCL and PRS were small, accounting for 1% of the total variance. Higher PCL scores were associated with higher state anger (r = .47), as well as measures of anger-out (r = .32) and trait anger (r = .34). Higher PRS scores were associated only with higher state anger (r = .27). Scores on the two 9/11/01-related scales were not statistically associated (i.e., less than 5% of the variance explained) with traits of anger control, anger-in, or hostility. Conclusions The majority of students were not overly stressed by indirect exposure to the events of 9/11/01, perhaps owing to the temporal, social, and/or geographical distance from the event. Those who reported greater negative impact appeared to also be experiencing higher levels of current anger and exhibited a characterologic style of higher overt anger expression. PMID:15737775
Udompataikul, Montree; Limpa-o-vart, Dipenn
2012-03-01
Atopic dermatitis (AD) is a common chronic relapsing disease particularly affecting children. The emollient used for protection of skin barrier function is the standard treatment for patients with AD. Currently, there is a growing interest in the use of nonsteroidal anti-inflammatory agents such as dexpanthenol (vitamin B5) as an alternative treatment. To compare the effectiveness of 5% dexpanthenol (DT) ointment with 1% hydrocortisone (HC) ointment in childhood AD therapy. Patients were treated topically with 5% DT ointment on the right side of the body and 1% HC ointment on the other side twice daily for 4 weeks. The clinical responses were evaluated by SCORAD (Scoring Atopic Dermatitis index) with statistical analysis using paired t-test. Of the 30 children enrolled, 26 completed the protocol; mean age was 7.19 years. The average baseline SCORAD score of the DT-treated side and the HC-treated side was 30.95 and 30.54, respectively. There was no statistically significant difference in SCORAD score reduction between the 2 agents. The edematous score of the HC-treated side exhibited faster resolution than that of the DT-treated side, with a statistically significant difference at week 1 and without a statistically significant difference at weeks 2 to 4. The lichenification response rate of HC treatment was more rapid than that of DT treatment; however, there was no statistical group difference. No adverse events were observed with either agent. The effectiveness of 5% DT ointment is equal to that of 1% HC ointment. DT ointment may be used as alternative treatment in mild to moderate childhood AD therapy.
Mallett, Susan; Halligan, Steve; Collins, Gary S.; Altman, Doug G.
2014-01-01
Background Different methods of evaluating diagnostic performance when comparing diagnostic tests may lead to different results. We compared two such approaches, sensitivity and specificity with area under the Receiver Operating Characteristic Curve (ROC AUC) for the evaluation of CT colonography for the detection of polyps, either with or without computer assisted detection. Methods In a multireader multicase study of 10 readers and 107 cases we compared sensitivity and specificity, using radiological reporting of the presence or absence of polyps, to ROC AUC calculated from confidence scores concerning the presence of polyps. Both methods were assessed against a reference standard. Here we focus on five readers, selected to illustrate issues in design and analysis. We compared diagnostic measures within readers, showing that differences in results are due to statistical methods. Results Reader performance varied widely depending on whether sensitivity and specificity or ROC AUC was used. There were problems using confidence scores; in assigning scores to all cases; in use of zero scores when no polyps were identified; the bimodal non-normal distribution of scores; fitting ROC curves due to extrapolation beyond the study data; and the undue influence of a few false positive results. Variation due to use of different ROC methods exceeded differences between test results for ROC AUC. Conclusions The confidence scores recorded in our study violated many assumptions of ROC AUC methods, rendering these methods inappropriate. The problems we identified will apply to other detection studies using confidence scores. We found sensitivity and specificity were a more reliable and clinically appropriate method to compare diagnostic tests. PMID:25353643
Mallett, Susan; Halligan, Steve; Collins, Gary S; Altman, Doug G
2014-01-01
Different methods of evaluating diagnostic performance when comparing diagnostic tests may lead to different results. We compared two such approaches, sensitivity and specificity with area under the Receiver Operating Characteristic Curve (ROC AUC) for the evaluation of CT colonography for the detection of polyps, either with or without computer assisted detection. In a multireader multicase study of 10 readers and 107 cases we compared sensitivity and specificity, using radiological reporting of the presence or absence of polyps, to ROC AUC calculated from confidence scores concerning the presence of polyps. Both methods were assessed against a reference standard. Here we focus on five readers, selected to illustrate issues in design and analysis. We compared diagnostic measures within readers, showing that differences in results are due to statistical methods. Reader performance varied widely depending on whether sensitivity and specificity or ROC AUC was used. There were problems using confidence scores; in assigning scores to all cases; in use of zero scores when no polyps were identified; the bimodal non-normal distribution of scores; fitting ROC curves due to extrapolation beyond the study data; and the undue influence of a few false positive results. Variation due to use of different ROC methods exceeded differences between test results for ROC AUC. The confidence scores recorded in our study violated many assumptions of ROC AUC methods, rendering these methods inappropriate. The problems we identified will apply to other detection studies using confidence scores. We found sensitivity and specificity were a more reliable and clinically appropriate method to compare diagnostic tests.
Dixon, Donna
2012-04-01
The relationships of students' preadmission academic variables, sex, undergraduate major, and undergraduate institution to academic performance in medical school have not been thoroughly examined. To determine the ability of students' preadmission academic variables to predict osteopathic medical school performance and whether students' sex, undergraduate major, or undergraduate institution influence osteopathic medical school performance. The study followed students who graduated from New York College of Osteopathic Medicine of New York Institute of Technology in Old Westbury between 2003 and 2006. Student preadmission data were Medical College Admission Test (MCAT) scores, undergraduate grade point averages (GPAs), sex, undergraduate major, and undergraduate institutional selectivity. Medical school performance variables were GPAs, clinical performance (ie, clinical subject examinations and clerkship evaluations), and scores on the Comprehensive Osteopathic Medical Licensing Examination-USA (COMLEX-USA) Level 1 and Level 2-Clinical Evaluation (CE). Data were analyzed with Pearson product moment correlation coefficients and multivariate linear regression analyses. Differences between student groups were compared with the independent-samples, 2-tailed t test. A total of 737 students were included. All preadmission academic variables, except nonscience undergraduate GPA, were statistically significant predictors of performance on COMLEX-USA Level 1, and all preadmission academic variables were statistically significant predictors of performance on COMLEX-USA Level 2-CE. The MCAT score for biological sciences had the highest correlation among all variables with COMLEX-USA Level 1 performance (Pearson r=0.304; P<.001) and Level 2-CE performance (Pearson r=0.272; P<.001). All preadmission variables were moderately correlated with the mean clinical subject examination scores. The mean clerkship evaluation score was moderately correlated with mean clinical examination results (Pearson r=0.267; P<.001) and COMLEX-USA Level 2-CE performance (Pearson r=0.301; P<.001). Clinical subject examination scores were highly correlated with COMLEX-USA Level 2-CE scores (Pearson r=0.817; P<.001). No statistically significant difference in medical school performance was found between students with science and nonscience undergraduate majors, nor was undergraduate institutional selectivity a factor influencing performance. Students' preadmission academic variables were predictive of osteopathic medical school performance, including GPAs, clinical performance, and COMLEX-USA Level 1 and Level 2-CE results. Clinical performance was predictive of COMLEX-USA Level 2-CE performance.
Ramandeep, Gambhir; Arshdeep, Singh; Vinod, Kapoor; Parampreet, Pannu
2014-07-01
Limited health literacy among adults is one of the many barriers to better oral health outcomes. It is not uncommon to find people who consider understanding oral health information a challenge. Therefore, the present study assessed oral health literacy among clients visiting Gian Sagar Dental College and Hospital, Rajpura. A cross-sectional study was conducted on 450participants who visited the Out Patient Department (OPD) of Gian Sagar Dental College and Hospital for a period of two months (Nov-Dec, 2013). A questionnaire was given to each of the participants. Oral health literacy was graded on a 12-point Likert scale based on the total score. Oral Health Literacy of the participants was assessed as low, medium and high on the basis of responses. Statistical analysis was done using SPSS-15 statistical package. ANOVA and Student t-test were used to do comparisons between groups. Low oral health literacy scores were reported in 60.2% (271) participants. More than 60% of the study participants had knowledge about dental terms such as 'dental caries,' and 'oral cancer.' Only 22% of the graduates had a high literacy score. Mean oral health literacy score according to educational qualification was statistically significant (p<0.05), whereas there was no significant difference in terms of age and gender (p>0.05). The majority of the participants had low literacy scores. There is a need to address these problems especially among rural population by health care providers and the government.
Evaluating the efficacy of a chemistry video game
NASA Astrophysics Data System (ADS)
Shapiro, Marina
A quasi-experimental design pre-test/post-test intervention study utilizing a within group analysis was conducted with 45 undergraduate college chemistry students that investigated the effect of implementing a game-based learning environment into an undergraduate college chemistry course in order to learn if serious educational games (SEGs) can be used to achieve knowledge gains of complex chemistry concepts and to achieve increase in students' positive attitude toward chemistry. To evaluate if students learn chemistry concepts by participating in a chemistry game-based learning environment, a one-way repeated measures analysis of variance (ANOVA) was conducted across three time points (pre-test, post-test, delayed post-test which were chemistry content exams). Results showed that there was an increase in exam scores over time. The results of the ANOVA indicated a statistically significant time effect. To evaluate if students' attitude towards chemistry increased as a result of participating in a chemistry game-based learning environment a paired samples t-test was conducted using a chemistry attitudinal survey by Mahdi (2014) as the pre- and post-test. Results of the paired-samples t-test indicated that there was no significant difference in pre-attitudinal scores and post-attitudinal scores.
Sharifi, Parvane; Rahmati, Abbas; Saber, Maryam
2013-10-01
To evaluate the effect of note-taking skills training on the achievement motivation in learning. The experimental study comprised graduate students of the 2010-11 batch at Kerman's Bahonar University and Kerman's Medical Sciences University, Iran. The study sample included 110 people; 55 in the test group, and 55 in the control group. They were randomly selected and replaced through the single-stage cluster sampling. To collect the data, a questionnaire was used. Pre-test was performed before the training session in two groups. After training course, a post-test was taken. For data analysis, the independent t-test, was used. The average pre-test score of the test group was 182 +/- 34.15, while for the control group it was 191 +/- 30.37 (p < 0.089). After the training, the post-test showed statistically significant change. The test group scored 220 +/- 20.94 against the controls who scored 195 +/- 27.26 (p < 0.001). The findings showed that achievement motivation in learning increased significantly after imparting training in note-taking skills. Authorities in the educational system should invest more for promotion of such skills.
Gauging Skills of Hospital Security Personnel: a Statistically-driven, Questionnaire-based Approach.
Rinkoo, Arvind Vashishta; Mishra, Shubhra; Rahesuddin; Nabi, Tauqeer; Chandra, Vidha; Chandra, Hem
2013-01-01
This study aims to gauge the technical and soft skills of the hospital security personnel so as to enable prioritization of their training needs. A cross sectional questionnaire based study was conducted in December 2011. Two separate predesigned and pretested questionnaires were used for gauging soft skills and technical skills of the security personnel. Extensive statistical analysis, including Multivariate Analysis (Pillai-Bartlett trace along with Multi-factorial ANOVA) and Post-hoc Tests (Bonferroni Test) was applied. The 143 participants performed better on the soft skills front with an average score of 6.43 and standard deviation of 1.40. The average technical skills score was 5.09 with a standard deviation of 1.44. The study avowed a need for formal hands on training with greater emphasis on technical skills. Multivariate analysis of the available data further helped in identifying 20 security personnel who should be prioritized for soft skills training and a group of 36 security personnel who should receive maximum attention during technical skills training. This statistically driven approach can be used as a prototype by healthcare delivery institutions worldwide, after situation specific customizations, to identify the training needs of any category of healthcare staff.
Identifying and Investigating Unexpected Response to Treatment: A Diabetes Case Study.
Ozery-Flato, Michal; Ein-Dor, Liat; Parush-Shear-Yashuv, Naama; Aharonov, Ranit; Neuvirth, Hani; Kohn, Martin S; Hu, Jianying
2016-09-01
The availability of electronic health records creates fertile ground for developing computational models of various medical conditions. We present a new approach for detecting and analyzing patients with unexpected responses to treatment, building on machine learning and statistical methodology. Given a specific patient, we compute a statistical score for the deviation of the patient's response from responses observed in other patients having similar characteristics and medication regimens. These scores are used to define cohorts of patients showing deviant responses. Statistical tests are then applied to identify clinical features that correlate with these cohorts. We implement this methodology in a tool that is designed to assist researchers in the pharmaceutical field to uncover new features associated with reduced response to a treatment. It can also aid physicians by flagging patients who are not responding to treatment as expected and hence deserve more attention. The tool provides comprehensive visualizations of the analysis results and the supporting data, both at the cohort level and at the level of individual patients. We demonstrate the utility of our methodology and tool in a population of type II diabetic patients, treated with antidiabetic drugs, and monitored by the HbA1C test.
Gauging Skills of Hospital Security Personnel: a Statistically-driven, Questionnaire-based Approach
Rinkoo, Arvind Vashishta; Mishra, Shubhra; Rahesuddin; Nabi, Tauqeer; Chandra, Vidha; Chandra, Hem
2013-01-01
Objectives This study aims to gauge the technical and soft skills of the hospital security personnel so as to enable prioritization of their training needs. Methodology A cross sectional questionnaire based study was conducted in December 2011. Two separate predesigned and pretested questionnaires were used for gauging soft skills and technical skills of the security personnel. Extensive statistical analysis, including Multivariate Analysis (Pillai-Bartlett trace along with Multi-factorial ANOVA) and Post-hoc Tests (Bonferroni Test) was applied. Results The 143 participants performed better on the soft skills front with an average score of 6.43 and standard deviation of 1.40. The average technical skills score was 5.09 with a standard deviation of 1.44. The study avowed a need for formal hands on training with greater emphasis on technical skills. Multivariate analysis of the available data further helped in identifying 20 security personnel who should be prioritized for soft skills training and a group of 36 security personnel who should receive maximum attention during technical skills training. Conclusion This statistically driven approach can be used as a prototype by healthcare delivery institutions worldwide, after situation specific customizations, to identify the training needs of any category of healthcare staff. PMID:23559904
Patterson, Brendan M; Orvets, Nathan D; Aleem, Alexander W; Keener, Jay D; Calfee, Ryan P; Nixon, Devon C; Chamberlain, Aaron M
2018-06-01
The Patient-Reported Outcomes Measurement Information System (PROMIS) is being used to assess outcomes in many patient populations despite limited validation. The purpose of this study was to investigate the relationship between American Shoulder and Elbow Surgeons (ASES) and Simple Shoulder Test (SST) scores and PROMIS Physical Function (PF) and Upper Extremity (UE) function scores collected preoperatively in patients undergoing rotator cuff repair. This cross-sectional study analyzed 164 consecutive patients undergoing arthroscopic rotator cuff repair. Study inclusion required preoperative completion of the ASES and SST evaluations, as well as the PROMIS PF, UE, and Pain Interference computerized adaptive tests. Descriptive statistics were produced, and Pearson correlation coefficients were calculated between each of the outcome measures. Average PROMIS UE scores indicated greater impairment than PROMIS PF scores (34 vs 44). Three percent of patients reached the PROMIS UE ceiling score of 56. PROMIS PF scores demonstrated a weak correlation with ASES scores (r = 0.43, P < .001) and a moderate correlation with SST scores (r = 0.51, P < .001). PROMIS UE scores demonstrated a moderate correlation with both ASES scores (r = 0.59, P < .001) and SST scores (r = 0.62, P < .001). PROMIS Pain Interference scores demonstrated weak negative correlations with both ASES scores (r = -0.43, P < .001) and SST scores (r = -0.41, P < .001). Patients answered fewer questions on average using the PROMIS PF and UE instruments as compared with the ASES and SST instruments. PROMIS UE scores indicate greater impairment and demonstrate a stronger correlation with the legacy shoulder scores than PROMIS PF scores in patients with symptomatic rotator cuff tears. PROMIS computerized adaptive tests allow for more efficient patient-reported outcome data collection compared with traditional outcome scores. Copyright © 2018 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Elsevier Inc. All rights reserved.
Distribution of model-based multipoint heterogeneity lod scores.
Xing, Chao; Morris, Nathan; Xing, Guan
2010-12-01
The distribution of two-point heterogeneity lod scores (HLOD) has been intensively investigated because the conventional χ(2) approximation to the likelihood ratio test is not directly applicable. However, there was no study investigating th e distribution of the multipoint HLOD despite its wide application. Here we want to point out that, compared with the two-point HLOD, the multipoint HLOD essentially tests for homogeneity given linkage and follows a relatively simple limiting distribution ½χ²₀+ ½χ²₁, which can be obtained by established statistical theory. We further examine the theoretical result by simulation studies. © 2010 Wiley-Liss, Inc.
Clinical observation on effect of scalp electroacupuncture for mild cognitive impairment.
Zhang, Hong; Zhao, Ling; Yang, Sha; Chen, Zhigang; Li, Yingkun; Peng, Xiaohong; Yang, Yulong; Zhu, Manjia
2013-02-01
To evaluate the therapeutic effect of scalp electroacupuncture for mild cognitive impairment (MCI) in the early stage. Two hundred and thirty three MCI patients were randomly divided into three groups: the drug group, the scalp electroacupuncture group, and the syndrome differentiation group. For the scalp electroacupuncture group, the points of Baihui (DU 20), Sishecong (EX-HN1), Fengchi (GB 20), and Shenting (DU 24) were selected. For the syndrome differentiation group, specific acupoints were added on the basis of syndrome differentiation and according to the scale for the differentiation of syndromes in vascular dementia (SDSVD) beside the acupoints used in the scalp electroacupuncture group. For the drug group, nimodipine was orally administered. Each patient was treated for two courses, eight weeks. The score differences in mini-mental state examination (MMSE), picture recognition, and clock drawing test were observed before and after the treatment. After treatment, the score differences in MMSE and clock drawing test were of obvious statistical significance among three groups (P < 0.01, P < 0.05). The score differences in picture recognition were of extremely statistical significance between the scalp electroacupuncture group and the syndrome differentiation group (P < 0.01), while the difference was not found in the drug group (P > 0.05). There were statistical significant differences in therapeutic effects between the scalp electroacupuncture group and the drug group, and between the syndrome differentiation group and the drug group (P < 0.05), while no statistical difference was found between scalp electroacupuncture group and the syndrome differentiation group (P > 0.05). All the three therapies may improve the cognitive function of MCI patients. The therapeutic effects in the scalp electroacupuncture and syndrome differentiation groups were basically the same, but superior to nimodipine.
Assessment of demographic and pathoanatomic risk factors in recurrent patellofemoral instability.
Hiemstra, Laurie Anne; Kerslake, Sarah; Lafave, Mark
2017-12-01
The WARPS/STAID classification employs clinical assessment of presenting features and anatomic characteristics to identify two distinct subsets of patients within the patellofemoral instability population. The purpose of this study was to further define the specific demographics and the prevalence of risky pathoanatomies in patients classified as either WARPS or STAID presenting with recurrent patellofemoral instability. A secondary purpose was to further validate the WARPS/STAID classification with the Banff Patella Instability Instrument (BPII), the Marx activity scale and the Patellar Instability Severity Score (ISS). A convenience sample of 50 patients with recurrent patellofemoral instability, including 25 WARPS and 25 STAID subtype patients, were assessed. Clinical data were collected including assessment of demographic risk factors (sex, BMI, bilaterality of symptoms, affected limb side and age at first dislocation) and pathoanatomic risk factors (TT-TG distance, patella height, patellar tilt, grade of trochlear dysplasia, Beighton score and rotational abnormalities of the tibia or femur). Patients completed the BPII and the Marx activity scale. The ISS was calculated from the clinical assessment data. Patients were stratified into the WARPS or STAID subtypes for comparative analysis. An independent t test was used to compare demographics, the pathoanatomic risk factors and subjective measures between the groups. Convergent validity was tested with a Pearson r correlation coefficient between the WARPS/STAID and ISS scores. Demographic risk factors statistically associated with a WARPS subtype included female sex, age at first dislocation and bilaterality. Pathoanatomic risk factors statistically associated with a WARPS subtype included trochlear dysplasia, TT-TG distance, generalized ligamentous laxity, patellar tilt and rotational abnormalities. The independent t test revealed a significant difference between the ISS scores: WARPS subtype (M = 4.4, SD = 1.1) and STAID subtype (M = 2.5, SD = 1.5); t(48) = 5.2, p < 0.001. The relationship between the WARPS/STAID and the ISS scores, measured using a Pearson r correlation coefficient, demonstrated a strong relationship: r = -0.61, n = 50, p < 0.001. This study has demonstrated statistically significant evidence that certain demographics and pathoanatomies are more prevalent in each of the WARPS and STAID patellofemoral instability subtypes. There was no difference in quality-of-life or activity level between the subtypes. The WARPS/STAID score demonstrated convergent validity to the ISS and divergent validity to the BPII score and the Marx activity scale. This study has further validated both the WARPS/STAID classification and the ISS of patients that present with recurrent patellofemoral instability. III.
Bansal, Disha; Mahajan, Mrinalini
2017-01-01
The design of the class V cavity presents a clinical challenge in the field of adhesive dentistry as the margin placement is partially in enamel and partly in dentin, and the trouble associated with this design is the microleakage at the dentinal margin. When these restorations undergo microabrasion due to cosmetic reasons, this trouble aggravates to the significant levels. The aim of this study was the measurement of microleakage of class V glass ionomer restorations over two different periods of enamel microabrasion. This in vitro experimental study was conducted on 120 class V cavities which had been prepared on the buccal and lingual surfaces of 60 sound human premolars. One-half of the cavities were restored with the resin-modified glass ionomer cement (GIC) (60 cavities) and another half with the compomer (60 cavities). Finishing and polishing were performed. Then, the teeth were classified into six groups (n = 20). Microabrasion treatment was performed with Opaluster (Ultradent Product Inc., South Jordan, UT, USA) for 0 (control no treatment), 60 and 120 s. Then, teeth were thermocycled between 5°C and 55°C, immersed in rhodamine B solution (24 h), and sectioned longitudinally in buccolingual direction. Dye penetration was examined with stereomicroscope (×10). Microleakage scores were statistically analyzed. The mean occlusal margin scores and gingival margin scores were compared between all the groups using the Kruskal-Wallis test, Mann-Whitney U-test, Wilcoxon signed-rank test, and post hoc comparison. There was a significant difference between Group 1a, Group 2a, Group 1b, Group 2b, Group 1c, and Group 2c. Statistical analysis used in this study was Kruskal-Wallis test, Mann-Whitney U-test, Wilcoxon signed-rank test, and post hoc comparison. The least microleakage scores were observed in occlusal margins of control groups (without microabrasion). Moreover, in both restorations, the microleakage scores in occlusal margins were higher than gingival margins, and compoglass had less microleakage in occlusal and occlusal plus axial walls of class V cavities compared with resin-modified GIC. Whereas, the light-cured glass ionomer had less microleakage in the gingival and gingival plus axial walls of class V cavities when compared with compoglass. The least microleakage scores were observed in occlusal margins of control groups (without microabrasion). Moreover, in both restorations, the microleakage scores in occlusal margins were higher than gingival margins.
Kumar, Ramesh; Somrongthong, Ratana; Ahmed, Jamil
2016-01-01
To evaluate the sustainability and effectiveness of training as an intervention to improve the knowledge, attitude and practices of hospital workers on health care waste management. We conducted this quasi-experimental study in two tertiary care teaching hospitals in Rawalpindi in October 2013. Training, practical demonstrations and reminders on standard waste management were given to 138 hospital workers in one hospital and compared with 137 workers from the control hospital. We collected data 18 months after intervention through a structured questionnaire to assess the impact of the intervention. We used paired t-test to compare the scores on knowledge, attitude and practices at baseline and first follow up and final impact assessment. Chi square test was used to compare group variables between intervention and control groups. After 18 months since intervention the mean scores on knowledge attitude and practices differed statistically significantly since baseline and intervention group had statistically significantly better knowledge positive attitudes and good health care waste management practices (p < 0.001). Health care and sanitary workers in intervention group scored statistically significantly higher (p < 0.001). Trainings of health and sanitary workers on health care waste management guidelines were sustainable among the intervention group after 18 months which shows the positive impact of our intervention. It is recommended that the trainings as intervention be included in the overall policies of the public and private sector hospitals in Pakistan and other similar settings.
Can macrocirculation changes predict nonhealing diabetic foot ulcers?
Lee, Ye-Na; Kim, Hyon-Surk; Kang, Jeong-A; Han, Seung-Kyu
2014-01-01
Transcutaneous partial oxygen tension (TcpO2) is considered the gold standard for assessment of tissue oxygenation, which is an essential factor for wound healing. The purpose of this study was to evaluate the association between macrocirculation and TcpO2 in persons with diabetes mellitus. Ninety-eight patients with diabetic foot ulcers participated in the study (61 men and 37 women). The subjects had a mean age of 66.6 years (range, 30-83 years) and were treated at the Diabetic Wound Center of Korea University Guro Hospital, Seoul, Republic of Korea. Macrocirculation was evaluated using 2 techniques: computed tomographic angiography and Doppler ultrasound. Macrocirculation scores were based on the patency of the two tibial arteries in 98 patients. Computed tomographic angiography and Doppler ultrasound scores (0-4 points) were given according to intraluminal filling defects and arterial pulse waveform of each vessel, respectively. Tissue oxygenation was measured by TcpO2. Macrocirculation scores were statistically analyzed as a function of the TcpO2. Statistical analysis revealed no significant linear trend between the macrocirculation status and TcpO2. Biavariate analysis using the Fisher exact test, Mantel-Haenszel tests, and McNemar-Bowker tests also found no significant relationship between macrocirculation and TcpO2. Computed tomographic angiography and Doppler ultrasound are not sufficiently reliable substitutes for TcpO2 measurements in regard to determining the optimal treatment for diabetic patients.
Clinical use of the ABO-Scoring Index: reliability and subtraction frequency.
Lieber, William S; Carlson, Sean K; Baumrind, Sheldon; Poulton, Donald R
2003-10-01
This study tested the reliability and subtraction frequency of the study model-scoring system of the American Board of Orthodontists (ABO). We used a sample of 36 posttreatment study models that were selected randomly from six different orthodontic offices. Intrajudge and interjudge reliability was calculated using nonparametric statistics (Spearman rank coefficient, Wilcoxon, Kruskal-Wallis, and Mann-Whitney tests). We found differences ranging from 3 to 6 subtraction points (total score) for intrajudge scoring between two sessions. For overall total ABO score, the average correlation was .77. Intrajudge correlation was greatest for occlusal relationships and least for interproximal contacts. Interjudge correlation for ABO score averaged r = .85. Correlation was greatest for buccolingual inclination and least for overjet. The data show that some judges, on average, were much more lenient than others and that this resulted in a range of total scores between 19.7 and 27.5. Most of the deductions were found in the buccal segments and most were related to the second molars. We present these findings in the context of clinicians preparing for the ABO phase III examination and for orthodontists in their ongoing evaluation of clinical results.
NASA Astrophysics Data System (ADS)
Koparan, Timur; Güven, Bülent
2015-07-01
The point of this study is to define the effect of project-based learning approach on 8th Grade secondary-school students' statistical literacy levels for data representation. To achieve this goal, a test which consists of 12 open-ended questions in accordance with the views of experts was developed. Seventy 8th grade secondary-school students, 35 in the experimental group and 35 in the control group, took this test twice, one before the application and one after the application. All the raw scores were turned into linear points by using the Winsteps 3.72 modelling program that makes the Rasch analysis and t-tests, and an ANCOVA analysis was carried out with the linear points. Depending on the findings, it was concluded that the project-based learning approach increases students' level of statistical literacy for data representation. Students' levels of statistical literacy before and after the application were shown through the obtained person-item maps.
Scarponi, Letizia; de Felicio, Claudia Maria; Sforza, Chiarella; Pimenta Ferreira, Claudia Lucia; Ginocchio, Daniela; Pizzorni, Nicole; Barozzi, Stefania; Mozzanica, Francesco; Schindler, Antonio
2018-05-30
To evaluate the reliability, validity, and responsiveness of the Italian OMES (I-OMES). The study consisted of 3 phases: (1) internal consistency and reliability, (2) validity, and (3) responsiveness analysis. The recruited population included 27 patients with orofacial myofunctional disorders (OMD) and 174 healthy volunteers. Forty-seven subjects, 18 healthy and all recruited patients with OMD were assessed for inter-rater and test-retest reliability analysis. I-OMES and Nordic Orofacial Test - Screening (NOT-S) scores of the patients were correlated for concurrent validity analysis. I-OMES scores from 27 patients with OMD and 27 age- and gender-matched healthy subjects were compared to investigate construct validity. I-OMES scores before and after successful swallowing rehabilitation in patients were compared for responsiveness analysis. Adequate internal consistency (Cronbach α = 0.71) and strong inter-rater and test-retest reliability (intraclass coefficient correlation = 0.97 and 0.98, respectively) were found. I-OMES and NOT-S scores significantly and inversely correlated (r = -0.38). A statistical significance (p < 0.001) was found between the pathological group and the control group for the total I-OMES score. The mean I-OMES score improved from 90 (78-102) to 99 (89-103) after myofunctional rehabilitation (p < 0.001). The I-OMES is a reliable and valid tool to evaluate OMD. © 2018 S. Karger AG, Basel.
Moriyama, Yasushi; Yoshino, Aihide; Muramatsu, Taro; Mimura, Masaru
2017-05-01
The supermarket task, which is included in the Japanese version of the Rapid Dementia Screening Test, requires the quick (1 min) generation of words for things that can be bought in a supermarket. Cluster size and switches are investigated during this task. We investigated how the severity of dementia related to cluster size and switches on the supermarket task in patients with Alzheimer's disease. We administered the Japanese version of the Rapid Dementia Screening Test to 250 patients with very mild to severe Alzheimer's disease and to 49 healthy volunteers. Patients had Mini-Mental State Examination scores from 12 to 26 and Clinical Dementia Rating scale scores from 0.5 to 3. Patients were divided into four groups based on their Clinical Dementia Rating score (0.5, 1, 2, 3). We performed statistical analyses between the four groups and control subjects based on cluster size and switch scores on the supermarket task. The score for cluster size and switches deteriorated according to the severity of dementia. Moreover, for subjects with a Clinical Dementia Rating score of 0.5, cluster size was impaired, but switches were intact. Our findings indicate that the scores for cluster size and switches on the supermarket task may be useful for detecting the severity of symptoms of dementia in patients with Alzheimer's disease. © 2016 The Authors. Psychogeriatrics © 2016 Japanese Psychogeriatric Society.
Bahador, Reza; Mirbolook, Ahmadreza; Arbab, Sara; Derakhshan, Pooya; Gholizadeh, Amirmohammad; Abedi, Sadegh
2016-01-01
Background Reflex sympathetic dystrophy (RSD) syndrome is a multifactorial disorder with clinical features of neurogenic inflammation that causes hypersensitivity to pain or severe allodynia as well as blood flow problems, swelling, skin discoloration and maladaptive neuroplasticity due to vasomotor disorders. Patients with major trauma are prone to homeostasis leading to inflammatory response syndrome and multiple organ distress syndrome. Several studies have investigated the etiology of this condition, but the cause remains unknown. The role of associated factors such as the limb immobilization technique and genetics has been reported in the development of this complication, but, so far, there is no information regarding the effect of trauma severity on the risk of RSD occurrence. Objectives Given the importance of diagnosing and treating this condition, we aimed to study the effect of trauma severity on the prevalence of RSD. Patients and Methods In this cross-sectional study, we examined patients with distal tibial fracture who visited Rasht Poursina hospital from 2010 to 2013. Exclusion criteria included associated fractures, underlying musculoskeletal diseases and mental and cognitive problems. To assess the severity of the initial injury in patients, the Hannover Fracture Scale 98 (HFS98) scoring checklist was used. The diagnosis of RSD was made on the basis of the IASP criterion. Demographic data, HFS98 scores, and information regarding RSD prevalence were analyzed using SPSS version 20. The Mann Whitney U nonparametric test was used for variables that were not normally distributed; the chi-square test was used to compare the qualitative variables. Results Among the 488 patients, 292 (59.83%) were male. The mean age of the study population was 44 ± 9.82 years. During the 6-month follow-up, RSD occurred in 45 patients, of whom 28 (62.22%) were female and 17 (37.77%) were male; there was thus a significant difference in the prevalence of RSD in terms of gender (P = 0.00; chi square test). The mean HFS98 score in patients without and with RSD was 3.081 ± 4.083 and 4.080 ± 4.622, respectively, and the difference was not statistically significant (P = 0.363; Mann Whitney U test). Analyses of the eight items of HFS98 shows that local circulation in patients with RSD is significantly better than that in patients without RDS (0.683 ± 0.822 vs. 0.528 ± 0.629, respectively). Statistical analysis showed that the odds ratio for RSD for patients with HFS95 score > 0 was 1.079 (confidence interval [CI]: 0.898 - 1.333). Moreover, the odds ratio for RSD was 1.100 (CI: 795 - 1.531) in patients with an injury severity score higher than the calculated mean score in patients without RSD (> 4.083). Conclusions The results suggest no significant relationship between the severity of injury and risk of RSD occurrence, although the mean injury severity score was higher in patients with RSD than in those without RSD in this study population. The lower score of local circulation in patients with RSD than in those without RSD is a statistically significant finding and can be attributed to changes in the antioxidant levels at the injury site, which is one of the main mechanisms for the onset of RSD. Wound contamination was also justifiably higher in patients with RSD, although the difference was not statistically significant. In summary, the severity of injury alone cannot be a determining factor for predicting the probability of RSD. PMID:27626009
Bahador, Reza; Mirbolook, Ahmadreza; Arbab, Sara; Derakhshan, Pooya; Gholizadeh, Amirmohammad; Abedi, Sadegh
2016-05-01
Reflex sympathetic dystrophy (RSD) syndrome is a multifactorial disorder with clinical features of neurogenic inflammation that causes hypersensitivity to pain or severe allodynia as well as blood flow problems, swelling, skin discoloration and maladaptive neuroplasticity due to vasomotor disorders. Patients with major trauma are prone to homeostasis leading to inflammatory response syndrome and multiple organ distress syndrome. Several studies have investigated the etiology of this condition, but the cause remains unknown. The role of associated factors such as the limb immobilization technique and genetics has been reported in the development of this complication, but, so far, there is no information regarding the effect of trauma severity on the risk of RSD occurrence. Given the importance of diagnosing and treating this condition, we aimed to study the effect of trauma severity on the prevalence of RSD. In this cross-sectional study, we examined patients with distal tibial fracture who visited Rasht Poursina hospital from 2010 to 2013. Exclusion criteria included associated fractures, underlying musculoskeletal diseases and mental and cognitive problems. To assess the severity of the initial injury in patients, the Hannover Fracture Scale 98 (HFS98) scoring checklist was used. The diagnosis of RSD was made on the basis of the IASP criterion. Demographic data, HFS98 scores, and information regarding RSD prevalence were analyzed using SPSS version 20. The Mann Whitney U nonparametric test was used for variables that were not normally distributed; the chi-square test was used to compare the qualitative variables. Among the 488 patients, 292 (59.83%) were male. The mean age of the study population was 44 ± 9.82 years. During the 6-month follow-up, RSD occurred in 45 patients, of whom 28 (62.22%) were female and 17 (37.77%) were male; there was thus a significant difference in the prevalence of RSD in terms of gender (P = 0.00; chi square test). The mean HFS98 score in patients without and with RSD was 3.081 ± 4.083 and 4.080 ± 4.622, respectively, and the difference was not statistically significant (P = 0.363; Mann Whitney U test). Analyses of the eight items of HFS98 shows that local circulation in patients with RSD is significantly better than that in patients without RDS (0.683 ± 0.822 vs. 0.528 ± 0.629, respectively). Statistical analysis showed that the odds ratio for RSD for patients with HFS95 score > 0 was 1.079 (confidence interval [CI]: 0.898 - 1.333). Moreover, the odds ratio for RSD was 1.100 (CI: 795 - 1.531) in patients with an injury severity score higher than the calculated mean score in patients without RSD (> 4.083). The results suggest no significant relationship between the severity of injury and risk of RSD occurrence, although the mean injury severity score was higher in patients with RSD than in those without RSD in this study population. The lower score of local circulation in patients with RSD than in those without RSD is a statistically significant finding and can be attributed to changes in the antioxidant levels at the injury site, which is one of the main mechanisms for the onset of RSD. Wound contamination was also justifiably higher in patients with RSD, although the difference was not statistically significant. In summary, the severity of injury alone cannot be a determining factor for predicting the probability of RSD.
Testing homogeneity in Weibull-regression models.
Bolfarine, Heleno; Valença, Dione M
2005-10-01
In survival studies with families or geographical units it may be of interest testing whether such groups are homogeneous for given explanatory variables. In this paper we consider score type tests for group homogeneity based on a mixing model in which the group effect is modelled as a random variable. As opposed to hazard-based frailty models, this model presents survival times that conditioned on the random effect, has an accelerated failure time representation. The test statistics requires only estimation of the conventional regression model without the random effect and does not require specifying the distribution of the random effect. The tests are derived for a Weibull regression model and in the uncensored situation, a closed form is obtained for the test statistic. A simulation study is used for comparing the power of the tests. The proposed tests are applied to real data sets with censored data.
Kiper Unal, Hatice Demet; Comert Ozkan, Melda; Atilla, Fatos Dilan; Demirci, Zuhal; Soyer, Nur; Yildirim Simsir, Ilgin; Omur, Ozgur; Capaci, Kazim; Saydam, Guray; Sahin, Fahri
2017-01-01
Haemophilia has been associated with low bone mineral density (BMD) probably due to some predisposing factors. The aim of this study was to evaluate the relationship between BMD and potential clinical predictors in adult haemophilic patients. Fortynine patients with moderate and severe haemophilia were enrolled. BMD was measured by Dual Energy X-Ray Absorptiometry (DXA) and blood tests were performed for vitamin D, calcium, phosphore, alkaline phosphatase and parathormone levels. Functional Independence Score in Haemophilia (FISH) and Haemophilia Joint Health Score (HJHS) were used to assess musculoskeletal functions. Body mass index (BMI), Hepatitis C virus (HCV)/Human immunodeficiency virus (HIV) seropositivity and smoking status were also recorded. BMD was found lower than expected for reference age in 34.8% of patients of less than 50 years old. In patients older than 50 years, 66.6% of them had osteoporosis and 33.3% of them had normal BMD. FISH score was statistically significant correlated with BMD of total hip (TH) and femur neck (FN) but not with lumbar spine (LS). In eligible patients, there was also a statistically significant correlation between BMD of TH and HJHS. Vitamine D deficiency was common and found in 77.5% of patients, although there was no significant correlation with BMD. Also no correlation was found between BMD and blood tests, HCV/HIV status, BMI and smoking. This study confirmed that patients with haemophilia have an increased prevelance of low BMD even in younger group. Our results showed that there are significant correlations between FISH score and BMD of TH and FN and also between HJHS score and BMD of TH. Thus, using scoring systems may be beneficial as a simple predictors of BMD to reflect the severity of haemophilic arthropathy. PMID:29181264
Kiper Unal, Hatice Demet; Comert Ozkan, Melda; Atilla, Fatos Dilan; Demirci, Zuhal; Soyer, Nur; Yildirim Simsir, Ilgin; Omur, Ozgur; Capaci, Kazim; Saydam, Guray; Sahin, Fahri
2017-01-01
Haemophilia has been associated with low bone mineral density (BMD) probably due to some predisposing factors. The aim of this study was to evaluate the relationship between BMD and potential clinical predictors in adult haemophilic patients. Fortynine patients with moderate and severe haemophilia were enrolled. BMD was measured by Dual Energy X-Ray Absorptiometry (DXA) and blood tests were performed for vitamin D, calcium, phosphore, alkaline phosphatase and parathormone levels. Functional Independence Score in Haemophilia (FISH) and Haemophilia Joint Health Score (HJHS) were used to assess musculoskeletal functions. Body mass index (BMI), Hepatitis C virus (HCV)/Human immunodeficiency virus (HIV) seropositivity and smoking status were also recorded. BMD was found lower than expected for reference age in 34.8% of patients of less than 50 years old. In patients older than 50 years, 66.6% of them had osteoporosis and 33.3% of them had normal BMD. FISH score was statistically significant correlated with BMD of total hip (TH) and femur neck (FN) but not with lumbar spine (LS). In eligible patients, there was also a statistically significant correlation between BMD of TH and HJHS. Vitamine D deficiency was common and found in 77.5% of patients, although there was no significant correlation with BMD. Also no correlation was found between BMD and blood tests, HCV/HIV status, BMI and smoking. This study confirmed that patients with haemophilia have an increased prevelance of low BMD even in younger group. Our results showed that there are significant correlations between FISH score and BMD of TH and FN and also between HJHS score and BMD of TH. Thus, using scoring systems may be beneficial as a simple predictors of BMD to reflect the severity of haemophilic arthropathy.
Nurses' health promoting lifestyle behaviors in a community hospital.
Kurnat-Thoma, Emma; El-Banna, Majeda; Oakcrum, Monica; Tyroler, Jill
2017-06-01
To examine nurses' health-promoting lifestyle behaviors, describe their self-reported engagement in employee wellness program benefit options, and explore relationships between nurse demographic factors, health characteristics and lifestyle behaviors. Nurses adopting unhealthy lifestyle behaviors are at significantly higher risk for developing a number of chronic diseases and are at increased susceptibility to exhaustion, job dissatisfaction and turnover. Strengthening professional nurses' abilities to engage in healthy lifestyle behaviors could serve as a valuable tool in combating negative workplace stress, promote improved work-life balance and personal well-being, and help retain qualified health-care providers. In a 187-bed community hospital in the Washington D.C. metropolitan area, we conducted an IRB-approved exploratory descriptive study. We examined 127 nurses' demographic characteristics, self-reported employer wellness program use, and measured their healthy lifestyle behaviors using the 52-item Health-Promoting Lifestyle Profile-II (HPLP-II) survey instrument. Nurse demographic and HPLP-II scores were analyzed in SPSS v20.0. Inferential univariate statistical testing examined relationships between nurse demographic factors, health and job characteristics, and HPLP-II score outcomes. Nurses over 40years old were more likely to report participation in hospital wellness program options. Statistically significant age differences were identified in total HPLP-II score (p=0.005), and two subscale scores-spiritual growth (p=0.002) and interpersonal relations (p=0.000). Post-hoc testing identified nurse participants 40-49years old and ≥50years old experienced slightly lower total HPLP-II score, subscale scores in comparison to younger colleagues. Nurses ≥40years old may benefit from additional employer support and guidance to promote and maintain healthy lifestyles, personal well-being, and positive interpersonal relationships. Copyright © 2017 Elsevier Inc. All rights reserved.
Froelich, John; Milbrandt, Joseph C; Allan, D Gordon
2009-01-01
This study examines the impact of the 80-hour workweek on the number of surgical cases performed by PGY-2 through PGY-5 orthopedic residents. We also evaluated orthopedic in-training examination (OITE) scores during the same time period. Data were collected from the Accreditation Council for Graduate Medical Education (ACGME) national database for 3 academic years before and 5 years after July 1, 2003. CPT surgical procedure codes logged by all residents 3 years before and 5 years after implementation of the 80-hour workweek were compared. The average raw OITE scores for each class obtained during the same time period were also evaluated. Data were reported as the mean +/- standard deviation (SD), and group means were compared using independent t-tests. No statistical difference was noted in the number of surgical procedure codes logged before or after the institution of the 80-hour week during any single year of training. However, an increase in the number of CPT codes logged in the PGY-3 years after 2003 did approach significance (457.7 vs 551.9, p = 0.057). Overall, the average number of cases performed per resident increased each year after implementation of the work-hour restriction (464.4 vs 515.5 cases). No statistically significant difference was noted in the raw OITE scores before or after work-hour restrictions for our residents or nationally. We found no statistical difference for each residency class in the average number of cases performed or OITE scores, although the total number of cases performed has increased after implementation of the work-hour restrictions. We also found no statistical difference in the national OITE scores. Our data suggest that the impact of the 80-hour workweek has not had a detrimental effect on these 2 resident training measurements.
The impact of testing accommodations on MCAT scores: descriptive results.
Julian, Ellen R; Ingersoll, Deborah J; Etienne, Patricia M; Hilger, Anthony E
2004-04-01
Medical College Admission Test (MCAT) examinees with disabilities who receive accommodations receive flagged scores indicating nonstandard administration. This report compares MCAT examinees who received accommodations and their performances with standard examinees. Aggregate history records of all 1994-2000 MCAT examinees were identified as flagged (2,401) or standard (297,880), then further sorted by race/ethnicity (broadly identified as underrepresented minority and non-URM, at the time of testing) and gender. Those with flagged scores were also classified by disability (LD = learning disability, ADHD = attention deficit hyperactivity disorder, LD/ADHD = learning disability and attention deficit hyperactivity disorder, and Other = other disability) and type of accommodation. Mean MCAT scores were calculated for all groups. A group of 866 examinees took the MCAT first as a standard administration and subsequently with accommodations. In a separate analysis, their two sets of scores were compared. Less than 1% of examinees (2,401) had accommodations; of these, 55% were LD, 17% ADHD, 5% LD/ADHD, and 23% Other. Extended time was the most frequently provided accommodation. Mean flagged scores slightly exceeded mean standard scores on all MCAT sections. Examinees who retook the MCAT with accommodations after a standard administration increased their scores by six points, quadrupling the average gain Standard-Standard retest cohort from another study. The small but statistically significant different higher flagged scores may reflect either appropriate compensation or overly generous accommodations. Extended time had a positive impact on the scores of those who retested with this accommodation. The validity the flagged MCAT in predicting success in medical school is not known, and further investigation is underway.
Afsharnia, Elahe; Pakgohar, Minoo; Khosravi, Shahla; Haghani, Hamid
2018-06-01
The objective of this study was to determine the effect of the computer-based educational package on men's QoL and the severity of their hypogonadism symptoms. A quasi-experimental study was conducted on 80 male employees. The data collection tool included the 'Aging Male Symptoms' (AMS) and 'Short Form-36' (SF36) questionnaires. Four sessions were held for the intervention group over a period of 4 weeks. Two months after training, QoL and the severity of hypogonadism symptoms were measured in both the intervention and control groups. The data were analyzed with SPSS 22 software and statistical tests, such as χ 2 , independent t-test, Fisher's exact test, and paired t-tests. Significant statistical changes were observed in the intervention group before and 2 months after the training in the QoL score in the overall dimensions of physical-psychological health and all its domains except for three domains of emotional role, social function, and pain. Furthermore, the paired t-tests showed significant differences between 2 months before and after the training in all the domains and the overall hypogonadism score in the intervention group. Based on our findings, the computer-based educational package has a positive effect on QoL and reduction of hypogonadism symptoms.
Evaluating Neurotoxicity of a Mixture of Five OP Pesticides Using a Composite Score
The evaluation of the cumulative effects of neurotoxic pesticides often involves the analysis of both neurochemical and behavioral endpoints. Multiple statistical tests on many endpoints can greatly inflate Type I error rates. Multiple comparison adjustments are often overly con...
Amer, A F; Baddour, M M; Elshazly, M A; Fadally, G; Hanafi, N F; Assar, S L
2016-02-01
There is strong epidemiological evidence linking hepatitis C virus (HCV) infection and diabetes. Our aim was to evaluate the prevalence of insulin resistance in Egyptian patients with chronic HCV genotype 4 infection, to assess factors associated with insulin resistance and to test the impact of insulin resistance on outcomes of treatment with pegylated interferon/ribavirin. Insulin resistance [homeostasis model assessmentinsulin resistance (HOMA-IR) score > 3.0] was detected in 31 of 100 nondiabetic patients. The relationship between elevated HOMA-IR and baseline viral load and degree of fibrosis was statistically significant (r = 0.218 and r = 0.223). Follow-up of patients with complete early virological response until the end of treatment showed a statistically significant decrease in HOMA-IR score. Out of 29 liver tissue sections examined, 14 had a low level of expression of insulin receptor type 1 by immunohistochemical studies. This study confirms that insulin resistance affects treatment outcome, and thus HOMA-IR testing before initiation of therapy may be a cost-effective tool.
Arneja, Jugpal S; Narasimhan, Kailash; Bouwman, David; Bridge, Patrick D
2009-12-01
In-training evaluations in graduate medical education have typically been challenging. Although the majority of standardized examination delivery methods have become computer-based, in-training examinations generally remain pencil-paper-based, if they are performed at all. Audience response systems present a novel way to stimulate and evaluate the resident-learner. The purpose of this study was to assess the outcomes of audience response systems testing as compared with traditional testing in a plastic surgery residency program. A prospective 1-year pilot study of 10 plastic surgery residents was performed using audience response systems-delivered testing for the first half of the academic year and traditional pencil-paper testing for the second half. Examination content was based on monthly "Core Quest" curriculum conferences. Quantitative outcome measures included comparison of pretest and posttest and cumulative test scores of both formats. Qualitative outcomes from the individual participants were obtained by questionnaire. When using the audience response systems format, pretest and posttest mean scores were 67.5 and 82.5 percent, respectively; using traditional pencil-paper format, scores were 56.5 percent and 79.5 percent. A comparison of the cumulative mean audience response systems score (85.0 percent) and traditional pencil-paper score (75.0 percent) revealed statistically significantly higher scores with audience response systems (p = 0.01). Qualitative outcomes revealed increased conference enthusiasm, greater enjoyment of testing, and no user difficulties with the audience response systems technology. The audience response systems modality of in-training evaluation captures participant interest and reinforces material more effectively than traditional pencil-paper testing does. The advantages include a more interactive learning environment, stimulation of class participation, immediate feedback to residents, and immediate tabulation of results for the educator. Disadvantages include start-up costs and lead-time preparation.
RAId_aPS: MS/MS Analysis with Multiple Scoring Functions and Spectrum-Specific Statistics
Alves, Gelio; Ogurtsov, Aleksey Y.; Yu, Yi-Kuo
2010-01-01
Statistically meaningful comparison/combination of peptide identification results from various search methods is impeded by the lack of a universal statistical standard. Providing an -value calibration protocol, we demonstrated earlier the feasibility of translating either the score or heuristic -value reported by any method into the textbook-defined -value, which may serve as the universal statistical standard. This protocol, although robust, may lose spectrum-specific statistics and might require a new calibration when changes in experimental setup occur. To mitigate these issues, we developed a new MS/MS search tool, RAId_aPS, that is able to provide spectrum-specific -values for additive scoring functions. Given a selection of scoring functions out of RAId score, K-score, Hyperscore and XCorr, RAId_aPS generates the corresponding score histograms of all possible peptides using dynamic programming. Using these score histograms to assign -values enables a calibration-free protocol for accurate significance assignment for each scoring function. RAId_aPS features four different modes: (i) compute the total number of possible peptides for a given molecular mass range, (ii) generate the score histogram given a MS/MS spectrum and a scoring function, (iii) reassign -values for a list of candidate peptides given a MS/MS spectrum and the scoring functions chosen, and (iv) perform database searches using selected scoring functions. In modes (iii) and (iv), RAId_aPS is also capable of combining results from different scoring functions using spectrum-specific statistics. The web link is http://www.ncbi.nlm.nih.gov/CBBresearch/Yu/raid_aps/index.html. Relevant binaries for Linux, Windows, and Mac OS X are available from the same page. PMID:21103371
Santi, Luca; Farina, Gabriele; Gramenzi, Annagiulia; Trevisani, Franco; Baccini, Margherita; Bernardi, Mauro; Cavazza, Mario
2017-04-01
The HEART score is a simple scoring system, ranging from 0 to 10, specifically developed for risk stratification of patients with undifferentiated chest pain. It has been validated for the conventional troponin, but not for high-sensitive troponin. We assess a modified version of the HEART score using a single high-sensitivity troponin T dosage at presentation, regardless of symptom duration, and with different ECG criteria to evaluate if the patients with a low HEART score could be safely discharged early. The secondary aim was to confirm a statistically significant difference in each HEART score group (low 0-3, intermediate 4-6, high 7-10) in the occurrence of major adverse cardiac events at 30 and 180 days. We retrospectively analyzed the HEART score of 1597 consecutive patients admitted to the Emergency Department of our Hospital for chest pain between January 1 and June 30, 2014. Of these, 190 did not meet the inclusion criteria and 29 were lost to follow-up. None of the 512 (37.2 %) patients with a low HEART score had an event within 180 days. The difference between the cumulative incidences of events in the three HEART score groups was statistically significant (P < 0.0001). We demonstrate that it might be possible to safely discharge Emergency Department chest pain patients with a low modified HEART score after an initial determination of high-sensitive troponin T, without a prolonged observation period or an additional cardiac testing.
Jovanović, Zagorka B; Pavlović, Aleksandra M; Pekmezović, Tatjana; Mijajlović, Milija; Covicković, Nadezda Sternić
2013-07-30
There are still dilemmas about the vasodilating effect of vinpocetine, a synthetic ethyl alkaloid vincamine. The method of measuring cerebral vasomotor reactivity (VMR) by transcranial Doppler (TCD) technique before and after administration of the medication was used to estimate the degree of arterioles vasodilatation. The aim of this study was to test of the vasodilating effect of vinpocetine in patients with cerebral small vessel disease (SVD) by measuring cerebral VMR. Thirty patients with SVD were on 3-month-long oral treatment with 15 mg vinpocetine daily. Cerebral VMR was determined by breath holding test. The breath holding index (BHI) was calculated in standard manner and values > 0.69 were considered normal. At the baseline, before treatment (I), BHI, modified Rankin scale (mRS) score, Mini Mental State Examination (MMSE) score were determined. One month later (II) BHI was assessed again, while after 3 months of treatment (III) we analyzed BHI, mRS score and MMSE score. The average age of patients was 61.4 +/- 11.5 years (range 40 to 77 years), 18 (60%) female and 12 (40%) males. Values of BHIs were increased during treatment at the right MCA (I) 1.18 +/- 0.53, (II) 1.26 +/- 0.54, (III) 1.37 +/- 0.41, with statistical significance between I and III measurement (p < 0.05). An increase was noted on the left MCA (I) 1.25 +/- 0.53, (II) 1.31 +/- 0.55 and (III) 1.32 +/- 0.42, but it did not reach statistical significance (p > 0.05). Mean MMSE score significantly increased from baseline 27.4 +/- 2.3 to 28.5 +/- 2.0 after three months of treatment (p < 0.001). Functional status showed a statistically significant improvement with mRS score increasing from 2.1 +/- 1.0 to 1.1 +/- 0.6 (p < 0.001). This pilot study showed that 3-month-long oral treatment with vinpocetine 15 mg daily had tendency to increase BHI, indicating improvement of cerebral VMR. It is possible that higher doses of vinpocetine are needed to achieve substantial increase of VMR.
Gyanani, Hitesh; Chhabra, Naveen; Parmar, Ghanshyam R
2016-01-01
Study aimed to evaluate the efficacy of two different pretreatment single oral doses of betamethasone on the incidence of inter-appointment flare up and postoperative discomfort. Fifty-four patients aged 18-59 years requiring endodontic treatment were selected and randomly assigned to three groups; single pretreatment oral dose of placebo or betamethasone in two different oral doses of 0.5 mg and 1 mg, respectively. Endodontic therapy was completed in two visits using triple antibiotic paste as intracanal medicament. Patients were given a questionnaire to record their pain at 1, 2, 3, and 7 days after treatment. In the second visit, obturation was done, and the patients were again instructed to record their pain scores after treatment and discharged. The verbal rating scale was used for recording the pain scores. Statistical analysis was done using ANOVA and the Friedman test. 0.5 mg betamethasone group showed least mean pain scores among all experimental groups; however, there was no statistically significant difference between any of the groups ( P > 0.05). Pretreatment single oral dose of betamethasone is an effective in managing endodontic flare-ups; however, the results were statistically insignificant.
Prodinger, Birgit; Ballert, Carolina S; Brach, Mirjam; Brinkhof, Martin W G; Cieza, Alarcos; Hug, Kerstin; Jordan, Xavier; Post, Marcel W M; Scheel-Sailer, Anke; Schubert, Martin; Tennant, Alan; Stucki, Gerold
2016-02-01
Functioning is an important outcome to measure in cohort studies. Clear and operational outcomes are needed to judge the quality of a cohort study. This paper outlines guiding principles for reporting functioning in cohort studies and addresses some outstanding issues. Principles of how to standardize reporting of data from a cohort study on functioning, by deriving scores that are most useful for further statistical analysis and reporting, are outlined. The Swiss Spinal Cord Injury Cohort Study Community Survey serves as a case in point to provide a practical application of these principles. Development of reporting scores must be conceptually coherent and metrically sound. The International Classification of Functioning, Disability and Health (ICF) can serve as the frame of reference for this, with its categories serving as reference units for reporting. To derive a score for further statistical analysis and reporting, items measuring a single latent trait must be invariant across groups. The Rasch measurement model is well suited to test these assumptions. Our approach is a valuable guide for researchers and clinicians, as it fosters comparability of data, strengthens the comprehensiveness of scope, and provides invariant, interval-scaled data for further statistical analyses of functioning.
Proposal and validation of a clinical trunk control test in individuals with spinal cord injury.
Quinzaños, J; Villa, A R; Flores, A A; Pérez, R
2014-06-01
One of the problems that arise in spinal cord injury (SCI) is alteration in trunk control. Despite the need for standardized scales, these do not exist for evaluating trunk control in SCI. To propose and validate a trunk control test in individuals with SCI. National Institute of Rehabilitation, Mexico. The test was developed and later evaluated for reliability and criteria, content, and construct validity. We carried out 531 tests on 177 patients and found high inter- and intra-rater reliability. In terms of criterion validity, analysis of variance demonstrated a statistically significant difference in the test score of patients with adequate or inadequate trunk control according to the assessment of a group of experts. A receiver operating characteristic curve was plotted for optimizing the instrument's cutoff point, which was determined at 13 points, with a sensitivity of 98% and a specificity of 92.2%. With regard to construct validity, the correlation between the proposed test and the spinal cord independence measure (SCIM) was 0.873 (P=0.001) and that with the evolution time was 0.437 (P=0.001). For testing the hypothesis with qualitative variables, the Kruskal-Wallis test was performed, which resulted in a statistically significant difference between the scores in the proposed scale of each group defined by these variables. It was proven experimentally that the proposed trunk control test is valid and reliable. Furthermore, the test can be used for all patients with SCI despite the type and level of injury.
Lynch, Louise I.; Dauer, Jenny M.; Babchuk, Wayne A.; Heng-Moss, Tiffany
2018-01-01
A mixed methods study was used to transcend the traditional pre-, post-test approach of citizen science evaluative research by integrating adults’ test scores with their perceptions. We assessed how contributory entomology citizen science affects participants’ science self-efficacy, self-efficacy for environmental action, nature relatedness and attitude towards insects. Pre- and post-test score analyses from citizen scientists (n = 28) and a control group (n = 72) were coupled with interviews (n = 11) about science experiences and entomological interactions during participation. Considering quantitative data alone, no statistically significant changes were evident in adults following participation in citizen science when compared to the control group. Citizen scientists’ pre-test scores were significantly higher than the control group for self-efficacy for environmental action, nature relatedness and attitude towards insects. Interview data reveal a notable discrepancy between measured and perceived changes. In general, citizen scientists had an existing, long-term affinity for the natural world and perceived increases in their science self-efficacy, self-efficacy for environmental action, nature relatedness and attitude towards insects. Perceived influences may act independently of test scores. Scale instruments may not show impacts with variances in individual’s prior knowledge and experiences. The value of mixed methods on citizen science program evaluation is discussed. PMID:29415522
Cooke, Brian K; Garvan, Cynthia; Hobbs, Jacqueline A
2013-07-01
The purpose of this study was to examine trends in the Psychiatry Resident-In-Training Examination (PRITE®) scores at one institution from 2001 to 2010. The authors hypothesized that two factors, the 2003 implementation of the Accreditation Council for Graduate Medical Education (ACGME) duty-hour restrictions and the residency program's 2008 restructuring of its curriculum to a half-day per week of didactics, would lead to improved scores. Residents in the general psychiatry program at the University of Florida College of Medicine from 2001 to 2010 were included in this study. To examine the effect of the 2003 ACGME duty-hours change, the authors compared test results from 2001-2002 and 2003-2010. To examine the effect of the 2008 didactic restructuring, they compared test results from 2001-2007 and 2008-2010. There were 288 PRITE test scores from 2001 to 2010. The authors did not find a statistical difference between test results before and after the 2003 implementation of ACGME duty-hour restrictions or between test results before and after the 2008 restructuring of residency didactics. The hypothesis was rejected. The results of the literature review propose that examination scores are affected by other elements of residency training.
Lynch, Louise I; Dauer, Jenny M; Babchuk, Wayne A; Heng-Moss, Tiffany; Golick, Doug
2018-02-06
A mixed methods study was used to transcend the traditional pre-, post-test approach of citizen science evaluative research by integrating adults' test scores with their perceptions. We assessed how contributory entomology citizen science affects participants' science self-efficacy, self-efficacy for environmental action, nature relatedness and attitude towards insects. Pre- and post-test score analyses from citizen scientists ( n = 28) and a control group ( n = 72) were coupled with interviews ( n = 11) about science experiences and entomological interactions during participation. Considering quantitative data alone, no statistically significant changes were evident in adults following participation in citizen science when compared to the control group. Citizen scientists' pre-test scores were significantly higher than the control group for self-efficacy for environmental action, nature relatedness and attitude towards insects. Interview data reveal a notable discrepancy between measured and perceived changes. In general, citizen scientists had an existing, long-term affinity for the natural world and perceived increases in their science self-efficacy, self-efficacy for environmental action, nature relatedness and attitude towards insects. Perceived influences may act independently of test scores. Scale instruments may not show impacts with variances in individual's prior knowledge and experiences. The value of mixed methods on citizen science program evaluation is discussed.
Characteristics of Inpatient Units Associated With Sustained Hand Hygiene Compliance.
Wolfe, Jonathan D; Domenico, Henry J; Hickson, Gerald B; Wang, Deede; Dubree, Marilyn; Feistritzer, Nancye; Wells, Nancy; Talbot, Thomas R
2018-04-20
Following institution of a hand hygiene (HH) program at an academic medical center, HH compliance increased from 58% to 92% for 3 years. Some inpatient units modeled early, sustained increases, and others exhibited protracted improvement rates. We examined the association between patterns of HH compliance improvement and unit characteristics. Adult inpatient units (N = 35) were categorized into the following three tiers based on their pattern of HH compliance: early adopters, nonsustained and late adopters, and laggards. Unit-based culture measures were collected, including nursing practice environment scores (National Database of Nursing Quality Indicators [NDNQI]), patient rated quality and teamwork (Hospital Consumer Assessment of Healthcare Provider and Systems), patient complaint rates, case mix index, staff turnover rates, and patient volume. Associations between variables and the binary outcome of laggard (n = 18) versus nonlaggard (n = 17) were tested using a Mann-Whitney U test. Multivariate analysis was performed using an ordinal regression model. In direct comparison, laggard units had clinically relevant differences in NDNQI scores, Hospital Consumer Assessment of Healthcare Provider and Systems scores, case mix index, patient complaints, patient volume, and staff turnover. The results were not statistically significant. In the multivariate model, the predictor variables explained a significant proportion of the variability associated with laggard status, (R = 0.35, P = 0.0481) and identified NDNQI scores and patient complaints as statistically significant. Uptake of an HH program was associated with factors related to a unit's safety culture. In particular, NDNQI scores and patient complaint rates might be used to assist in identifying units that may require additional attention during implementation of an HH quality improvement program.
Fifteen-minute music intervention reduces pre-radiotherapy anxiety in oncology patients.
Chen, Lee-Chen; Wang, Tze-Fang; Shih, Yi-Nuo; Wu, Le-Jung
2013-08-01
Oncology patients may respond to radiation treatment with anxiety expressed as stress, fear, depression, and frustration. This study aimed to investigate effects of music intervention on reducing pre-radiotherapy anxiety in oncology patients. Quasi-experimental study with purposeful sampling was conducted in the Department of Radiation Oncology, at Far Eastern Memorial Hospital, Taipei, Taiwan. Subjects were assigned into a music group (n = 100) receiving 15 min of music therapy prior to radiation and a control group (n = 100) receiving 15 min rest prior to radiation. Both groups were evaluated for pre- and post-test anxiety using the State-Trait Anxiety Inventory. Physiological indicators of anxiety were measured pre- and post-test. Baseline State/Trait scores and vital signs were comparable between groups (P > 0.05). Mean change in pre- and post-test State/Trait scores showed significant decreases from baseline to post-test in both groups (all P < 0.05). A statistically significant difference was observed between music therapy and control groups in mean change of State anxiety scores (mean decreases 7.19 and 1.04, respectively; P < 0.001) and Trait anxiety scores (mean decreases 2.77 and 1.13, respectively; P = 0.036). In vital signs, both groups had significant decreases in pre- and post-test heart rate and respiration rate (P < 0.05). A statistically significant difference in mean change of systolic pressure was found between music and control groups (-5.69 ± 0.41 mmHg vs. -0.67 ± 1.29 mmHg, respectively; P = 0.009). Music therapy decreased State anxiety levels, Trait anxiety levels and systolic blood pressure in oncology patients who received the intervention prior to radiotherapy. Copyright © 2012 Elsevier Ltd. All rights reserved.
Ballistic Resistance of Armored Passenger Vehicles: Test Protocols and Quality Methods
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jeffrey M. Lacy; Robert E. Polk
2005-07-01
This guide establishes a test methodology for determining the overall ballistic resistance of the passenger compartment of assembled nontactical armored passenger vehicles (APVs). Because ballistic testing of every piece of every component of an armored vehicle is impractical, if not impossible, this guide describes a testing scheme based on statistical sampling of exposed component surface areas. Results from the test of the sampled points are combined to form a test score that reflects the probability of ballistic penetration into the passenger compartment of the vehicle.
Kenyon, Lisa K.; Elliott, James M; Cheng, M. Samuel
2016-01-01
Purpose/Background Despite the availability of various field-tests for many competitive sports, a reliable and valid test specifically developed for use in men's gymnastics has not yet been developed. The Men's Gymnastics Functional Measurement Tool (MGFMT) was designed to assess sport-specific physical abilities in male competitive gymnasts. The purpose of this study was to develop the MGFMT by establishing a scoring system for individual test items and to initiate the process of establishing test-retest reliability and construct validity. Methods A total of 83 competitive male gymnasts ages 7-18 underwent testing using the MGFMT. Thirty of these subjects underwent re-testing one week later in order to assess test-retest reliability. Construct validity was assessed using a simple regression analysis between total MGFMT scores and the gymnasts’ USA-Gymnastics competitive level to calculate the coefficient of determination (r2). Test-retest reliability was analyzed using Model 1 Intraclass correlation coefficients (ICC). Statistical significance was set at the p<0.05 level. Results The relationship between total MGFMT scores and subjects’ current USA-Gymnastics competitive level was found to be good (r2 = 0.63). Reliability testing of the MGFMT composite test score showed excellent test-retest reliability over a one-week period (ICC = 0.97). Test-retest reliability of the individual component tests ranged from good to excellent (ICC = 0.75-0.97). Conclusions The results of this study provide initial support for the construct validity and test-retest reliability of the MGFMT. Level of Evidence Level 3 PMID:27999723
Ervin, R Bethene
2011-12-13
This report provides Healthy Eating Index-2005 (HEI-2005) scores for adults aged 20 and over, by sex, age groups, race and ethnicity, and level of education in the 2003-2004 National Health and Nutrition Examination Survey (NHANES 2003-2004). The analytic sample consisted of 4,448 adults aged 20 and over from NHANES 2003-2004. The Day 1 dietary recall was used to estimate the HEI-2005 scores. Food and nutrient intakes were assessed on a density basis. The population's mean usual HEI-2005 component and total scores were calculated using a population ratio method based on programs written by the U.S. Department of Agriculture's Center for Nutrition Policy and Promotion. A two-tailed t-test was used to test significant differences between sexes, age and race, and ethnic groups and levels of education. Statistical hypotheses were tested at the p < 0.05 level of significance using a t statistic. The t-value at 0.975 with 15 degrees of freedom was 2.131. The Bonferroni method of adjustment was used to adjust the critical value for the family of pairwise comparisons for age, race and ethnicity, and education. Adults were below the maximum standard for all the HEI-2005 component scores except for total grains and meat and beans. Females and the oldest age group were more successful in meeting the Dietary Guidelines for Americans 2005 recommendations for the fruit and vegetable components and discretionary calories, and had a slightly higher overall diet quality score than their counterparts. Adults with more than a high school education more closely complied with the recommendations for many of the components compared with those with less education. No one racial and ethnic group stood out as having the highest HEI-2005 scores across most of the components. These results demonstrate that adults continue to fall short in meeting the Dietary Guidelines for Americans 2005 recommendations, and that sociodemographic characteristics influence their food choices and overall diet quality.
Ghiasvand, Arezoo Mohamadkhani; Naderi, Manijeh; Tafreshi, Mansoureh Zagheri; Ahmadi, Farzane; Hosseini, Meimanat
2017-01-01
Time management skills are essential for nursing students' success, and development of clinical competence. The purpose of this study was to determine the relationship between time management skills and anxiety and academic motivation of nursing students in Tehran medical sciences universities in 2015. This cross-sectional study was carried out on 441 nursing students in three medical universities in Tehran. Random stratified sampling was done to select the samples. Data were collected using demographic Questionnaire, Time Management Questionnaire (TMQ), Spielberger State-Trait Anxiety Inventory (STAI) and Academic Motivation Scale (AMS), which was completed t by self-report. Data were analyzed using SPSS 18 software with descriptive and analytical statistics such as ANOVA, independent t-test, Regression and Pearson Correlation Coefficient. Most participants had a moderate level of time Management skills (49%), State Anxiety (58%), Trait Anxiety (60%) and Academic Motivation (58%). The results also showed a statistically significant negative correlation between the students' TMQ scores and the state anxiety (r= -0.282, p< 0.001) and trait anxiety scores (r= -0.325, p<0.001). Moreover, there was a statistically significant positive correlation between the students' TMQ scores and AMS scores (r= 0.279, p< 0.001). Regarding the findings, it seems that it is necessary to plan for improving time management skills in order to enhance academic motivation and reduce anxiety rates among nursing students.
Improved score statistics for meta-analysis in single-variant and gene-level association studies.
Yang, Jingjing; Chen, Sai; Abecasis, Gonçalo
2018-06-01
Meta-analysis is now an essential tool for genetic association studies, allowing them to combine large studies and greatly accelerating the pace of genetic discovery. Although the standard meta-analysis methods perform equivalently as the more cumbersome joint analysis under ideal settings, they result in substantial power loss under unbalanced settings with various case-control ratios. Here, we investigate the power loss problem by the standard meta-analysis methods for unbalanced studies, and further propose novel meta-analysis methods performing equivalently to the joint analysis under both balanced and unbalanced settings. We derive improved meta-score-statistics that can accurately approximate the joint-score-statistics with combined individual-level data, for both linear and logistic regression models, with and without covariates. In addition, we propose a novel approach to adjust for population stratification by correcting for known population structures through minor allele frequencies. In the simulated gene-level association studies under unbalanced settings, our method recovered up to 85% power loss caused by the standard methods. We further showed the power gain of our methods in gene-level tests with 26 unbalanced studies of age-related macular degeneration . In addition, we took the meta-analysis of three unbalanced studies of type 2 diabetes as an example to discuss the challenges of meta-analyzing multi-ethnic samples. In summary, our improved meta-score-statistics with corrections for population stratification can be used to construct both single-variant and gene-level association studies, providing a useful framework for ensuring well-powered, convenient, cross-study analyses. © 2018 WILEY PERIODICALS, INC.
ERIC Educational Resources Information Center
DelCampo, Diana; Baca, Jacqueline S.; Jimenez, Desaree; Sanchez, Paula Roybal; DelCampo, Robert
2011-01-01
Just Be It! Healthy and Fit reduces the risk factors for childhood obesity for fifth graders using hands-on field trips, in-class lessons, and parent outreach efforts. Pre-test and post-test scores from the year-long classroom instruction showed a statistically significant increase in fruit and vegetable intake, physical activity, and nutrition…
Evaluation of Next-Generation Vision Testers for Aeromedical Certification of Aviation Personnel
2009-07-01
measure distant, intermediate, and near acuity. The slides are essentially abbreviated versions of the Early Treatment for Diabetic Retinopathy Study...over, requiring intermediate vision testing and 12 were color deficient. Analysis was designed to detect statistically significant differences between...Vertical Phoria (Right & Left Hyperphoria) Test scores from each of the vision testers were collated and analyzed. Analysis was designed to detect
ERIC Educational Resources Information Center
Bennett, Randy Elliot; Braswell, James; Oranje, Andreas; Sandene, Brent; Kaplan, Bruce; Yan, Fred
2008-01-01
This article describes selected results from the Math Online (MOL) study, one of three field investigations sponsored by the National Center for Education Statistics (NCES) to explore the use of new technology in NAEP. Of particular interest in the MOL study was the comparability of scores from paper- and computer-based tests. A nationally…
ERIC Educational Resources Information Center
Kadhi, T.; Holley, D.; Palasota, A.
2010-01-01
The following report gives descriptive and correlational statistical findings of the Grade Point Averages (GPAs) of the February 2010 and July 2009 TMSL First Time Texas Bar Test Takers to their TMSL Final GPA. Data was pre-existing and was given to the Evaluator by email from the Dean and Registrar. Statistical analyses were run using SPSS 17 to…
Ethiopian health care professionals' knowledge, attitude, and interests toward pharmacogenomics.
Abdela, Ousman Abubeker; Bhagavathula, Akshaya Srikanth; Gebreyohannes, Eyob Alemayehu; Tegegn, Henok Getachew
2017-01-01
Pharmacogenomics is a field of science which studies the impact of inheritance on individual variation in medication therapy response. We assessed healthcare professionals' knowledge, attitude, and interest toward pharmacogenomics. A cross-sectional survey was conducted using a 32-item questionnaire among physicians, nurses, and pharmacists who were working at the University of Gondar Referral and Teaching Hospital in northwest Ethiopia. Descriptive statistics was applied, and the categorical variables were summarized as frequency and percentages. An analysis of variance (ANOVA) test was performed to compare mean scores among health professionals. A p -value of <0.05 was considered as statistically significant. Of 292 health professionals who responded, the majority were male (60%) and the mean age of study participants was 27.00 (±4.85 SD) years. The mean knowledge scores of all participants, pharmacists, physicians, and nurses were 2.343±1.109, 2.671±1.059, 2.375±1.093, and 2.173±1.110, respectively. Based on the ANOVA test, a statistically significant difference was noted in mean knowledge score between pharmacists and nurses ( p =0.002). More than two-thirds (67.33%) of nurses, 42.86% of pharmacists, and 40.27% of physicians who participated did not know that genetic variations can account for as much as 95% of the variability in drug disposition and effects. The ability to accurately apply their knowledge to drug therapy selection, dosing, or monitoring parameter was reported by 35.3% of the participants. More than two-thirds (69.2%) of participants thought that pharmacogenomic testing will allow the identification of the right drug with less side effects. Most of the participants (83.2%) also requested to have training on pharmacogenomics. Participants showed limited knowledge, but they had positive attitude toward pharmacogenomics. Educational programs focusing on pharmacogenomic testing and its clinical application need to be emphasized.
Melody and pitch processing in five musical savants with congenital blindness.
Pring, Linda; Woolf, Katherine; Tadic, Valerie
2008-01-01
We examined absolute-pitch (AP) and short-term musical memory abilities of five musical savants with congenital blindness, seven musicians, and seven non-musicians with good vision and normal intelligence in two experiments. In the first, short-term memory for musical phrases was tested and the savants and musicians performed statistically indistinguishably, both significantly outperforming the non-musicians and remembering more material from the C major scale sequences than random trials. In the second experiment, participants learnt associations between four pitches and four objects using a non-verbal paradigm. This experiment approximates to testing AP ability. Low statistical power meant the savants were not statistically better than the musicians, although only the savants scored statistically higher than the non-musicians. The results are evidence for a musical module, separate from general intelligence; they also support the anecdotal reporting of AP in musical savants, which is thought to be necessary for the development of musical-savant skill.
Qu, Long; Guennel, Tobias; Marshall, Scott L
2013-12-01
Following the rapid development of genome-scale genotyping technologies, genetic association mapping has become a popular tool to detect genomic regions responsible for certain (disease) phenotypes, especially in early-phase pharmacogenomic studies with limited sample size. In response to such applications, a good association test needs to be (1) applicable to a wide range of possible genetic models, including, but not limited to, the presence of gene-by-environment or gene-by-gene interactions and non-linearity of a group of marker effects, (2) accurate in small samples, fast to compute on the genomic scale, and amenable to large scale multiple testing corrections, and (3) reasonably powerful to locate causal genomic regions. The kernel machine method represented in linear mixed models provides a viable solution by transforming the problem into testing the nullity of variance components. In this study, we consider score-based tests by choosing a statistic linear in the score function. When the model under the null hypothesis has only one error variance parameter, our test is exact in finite samples. When the null model has more than one variance parameter, we develop a new moment-based approximation that performs well in simulations. Through simulations and analysis of real data, we demonstrate that the new test possesses most of the aforementioned characteristics, especially when compared to existing quadratic score tests or restricted likelihood ratio tests. © 2013, The International Biometric Society.
RELIABILITY CONCERNS IN THE REPEATED COMPUTERIZED ASSESSMENT OF ATTENTION IN CHILDREN
Zabel, T. Andrew; von Thomsen, Christian; Cole, Carolyn; Martin, Rebecca; Mahone, E. Mark
2010-01-01
Assessment of attentional processes via computerized assessment is frequently used to quantify intra-individual cognitive improvement or decline in response to treatment. However, assessment of intra-individual change is highly dependent on sufficient test reliability. We examined the test–retest reliability of selected variables from one popular computerized continuous performance test (CPT)—i.e., the Conners’ CPT – Second Edition (CPT-II). Participants were 39 healthy children (20 girls) ages 6–18 without intellectual impairment (mean PPVT-III SS = 102.6), LD, or psychiatric disorders (DICA-IV). Test–retest reliability over the 3–8 month interval (mean = 6 months) was acceptable (Intraclass Correlations [ICC] = .82 to .92) on comparison measures (Beery Test of Visual Perception, WISC-IV Block Design, PPVT-III). In contrast, test–retest reliability was only modest for CPT-II raw scores (ICCs ranging from .62 to .82) and T-scores (ICCs ranging from .33 to .65) for variables of interest (Omissions, Commissions, Variability, Hit Reaction Time, and Attentiveness). Using test–retest reliability information published in the CPT-II manual, 90% confidence intervals based on reliable change index (RCI) methodology were constructed to examine the significance of test–retest difference/change scores. Of the participants in this sample of typically developing youth, 30% generated intra-individual changes in T-scores on the Omissions and Attentiveness variables that exceeded the 90% confidence intervals and qualified as “statistically rare” changes in score. These results suggest a considerable degree of normal variability in CPT-II test scores over extended test–retest intervals, and suggest a need for caution when interpreting test score changes in neurologically unstable clinical populations. PMID:19452302
NASA Astrophysics Data System (ADS)
Jensen-Ruopp, Helga Spitko
A comparison of hands-on inquiry instruction with lecture instruction was presented to 134 Patterns and Process Biology students. Students participated in seven biology lessons that were selected from Biology Survey of Living Things (1992). A pre and post paper and pencil assessment was used as the data collecting instrument. The treatment group was taught using hands-on inquiry strategies while the non-treatment group was taught in the lecture method of instruction. The team teaching model was used as the mode of presentation to the treatment group and the non-treatment group. Achievement levels using specific criterion; novice (0% to 50%), developing proficiency (51% to 69%), accomplished (70% to 84) and exceptional or mastery level (85% to 100%) were used as a guideline to tabulate the results of the pre and post assessment. Rubric tabulation was done to interpret the testing results. The raw data was plotted using percentage change in test score totals versus reading level score by gender as well as percentage change in test score totals versus auditory vocabulary score by gender. Box Whisker plot comparative descriptive of individual pre and post test scores for the treatment and non-treatment group was performed. Analysis of covariance (ANCOVA) using MINITAB Statistical Software version 14.11 was run on data of the seven lessons, as well as on gender (male results individual and combined, and female results individual and combined) results. Normal Probability Plots for total scores as well as individual test scores were performed. The results suggest that hands-on inquiry based instruction when presented to special needs students including; at-risk; English as a second language limited, English proficiency and special education inclusive students' learning may enhance individual student achievement.
Mapping Quantitative Traits in Unselected Families: Algorithms and Examples
Dupuis, Josée; Shi, Jianxin; Manning, Alisa K.; Benjamin, Emelia J.; Meigs, James B.; Cupples, L. Adrienne; Siegmund, David
2009-01-01
Linkage analysis has been widely used to identify from family data genetic variants influencing quantitative traits. Common approaches have both strengths and limitations. Likelihood ratio tests typically computed in variance component analysis can accommodate large families but are highly sensitive to departure from normality assumptions. Regression-based approaches are more robust but their use has primarily been restricted to nuclear families. In this paper, we develop methods for mapping quantitative traits in moderately large pedigrees. Our methods are based on the score statistic which in contrast to the likelihood ratio statistic, can use nonparametric estimators of variability to achieve robustness of the false positive rate against departures from the hypothesized phenotypic model. Because the score statistic is easier to calculate than the likelihood ratio statistic, our basic mapping methods utilize relatively simple computer code that performs statistical analysis on output from any program that computes estimates of identity-by-descent. This simplicity also permits development and evaluation of methods to deal with multivariate and ordinal phenotypes, and with gene-gene and gene-environment interaction. We demonstrate our methods on simulated data and on fasting insulin, a quantitative trait measured in the Framingham Heart Study. PMID:19278016
ERIC Educational Resources Information Center
Hanford, Terry; White, Kathleen
1991-01-01
Although numbers such as average test scores or dropout rates can capture part of a school system's success or failure, school statistics seldom tell the whole story. School board members should realize that numbers might measure compliance or process, rather than improvement. Also, improvements in numbers might reflect changes in assessment…
ERIC Educational Resources Information Center
Hartwig, Elizabeth Kjellstrand; Van Overschelde, James P.
2016-01-01
The authors investigated predictor variables for the Counselor Preparation Comprehensive Examination (CPCE) to examine whether academic variables, demographic variables, and test version were associated with graduate counseling students' CPCE scores. Multiple regression analyses revealed all 3 variables were statistically significant predictors of…
A Unified Mixed-Effects Model for Rare-Variant Association in Sequencing Studies
Sun, Jianping; Zheng, Yingye; Hsu, Li
2013-01-01
For rare-variant association analysis, due to extreme low frequencies of these variants, it is necessary to aggregate them by a prior set (e.g., genes and pathways) in order to achieve adequate power. In this paper, we consider hierarchical models to relate a set of rare variants to phenotype by modeling the effects of variants as a function of variant characteristics while allowing for variant-specific effect (heterogeneity). We derive a set of two score statistics, testing the group effect by variant characteristics and the heterogeneity effect. We make a novel modification to these score statistics so that they are independent under the null hypothesis and their asymptotic distributions can be derived. As a result, the computational burden is greatly reduced compared with permutation-based tests. Our approach provides a general testing framework for rare variants association, which includes many commonly used tests, such as the burden test [Li and Leal, 2008] and the sequence kernel association test [Wu et al., 2011], as special cases. Furthermore, in contrast to these tests, our proposed test has an added capacity to identify which components of variant characteristics and heterogeneity contribute to the association. Simulations under a wide range of scenarios show that the proposed test is valid, robust and powerful. An application to the Dallas Heart Study illustrates that apart from identifying genes with significant associations, the new method also provides additional information regarding the source of the association. Such information may be useful for generating hypothesis in future studies. PMID:23483651
Arrow, P; Klobas, E
2016-06-01
The aim of this study was to compare changes in child oral health-related quality of life (COHRQoL) after treatment for early childhood caries (ECC) using two alternative treatment approaches. A randomized control trial with random allocation of parent/child dyads with ECC to test (minimum intervention) or control (standard care). Participating parents completed the Early Childhood Oral Health Impact Scale (ECOHIS) at baseline and follow-up. Changes in ECOHIS scores and extent of COHRQoL impacts between and within groups were tested using the chi-squared statistic for groups, Wilcoxon's rank-sum test, and matched-pairs signed-rank test. Two hundred and fifty-four children were randomized (test = 127; control = 127). At baseline, mean ECOHIS score 11.1, sd 8.2; mean age = 3.8 years, sd 0.90; mean dmft = 4.9, sd 4.0; and 59% male. After a mean interval of 11.4 months, 210 children were followed-up and returned a completed questionnaire (test = 111; control = 99). There was no significant difference in COHRQoL changes between test and control. For all the children combined, there were significantly fewer impacts at follow-up in the child and family domains and the total ECOHIS, Wilcoxon signed-rank test, p < 0.05. COHRQoL improved with primary dental care for ECC, and there was no statistically significant difference between test and control in the extent of the improvement. © 2016 Australian Dental Association.
Tarescavage, Anthony M; Brewster, JoAnne; Corey, David M; Ben-Porath, Yossef S
2015-08-01
We examined associations between prehire Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF) scores and posthire performance ratings for a sample of 131 male police officers. Substantive scale scores in this sample were meaningfully lower than those obtained by the test's normative sample and substantially range restricted, but scores were consistent with those produced by members of the police candidate comparison group (Corey & Ben-Porath). After applying a statistical correction for range restriction, we found several associations between MMPI-2-RF substantive scale scores and supervisor ratings of job-related performance. Findings for scales from the emotional dysfunction and interpersonal functioning domains of the test were particularly strong. For example, scales assessing low positive emotions and social avoidance were associated with several criteria that may be affected by lack of engagement with one's environment and other people, including problems with routine task performance, decision making, assertiveness, conscientiousness, and social competence. Implications of these findings for assessment science and practice are discussed. © The Author(s) 2014.
Hebbal, M; Ankola, A V
2012-10-01
To develop a special oral health education technique and compare plaque scores before and after health education. Non-randomised before and after comparison trial without controls. The final study population comprised of 96 visually impaired children aged 6-18 years old. Silness and Loe plaque index scores were recorded at baseline. 'Audio tactile performance technique' (ATP Technique) a specially designed health education method was used to educate these children regarding oral hygiene maintenance. Periodic reinforcement of health education was performed at an interval of 9 months. Re-examination was carried out after 18 months of health education to assess plaque scores. Wilcoxon's sign rank test and paired t test was used to assess the difference between the scores before and after health education. There was increase in frequency of tooth brushing after health education. The mean plaque scores pre- and post-health education were 1.41 (+/-0.58) and 0.63 (+/-0.39) respectively. The difference was statistically significant (p<0.001). Visually impaired children could maintain an acceptable level of oral hygiene when taught using special customised methods.
Stegmeier, Nicole; Oak, Sameer R; O'Rourke, Colin; Strnad, Greg; Spindler, Kurt P; Jones, Morgan; Farrow, Lutul D; Andrish, Jack; Saluan, Paul
Two versions of the International Knee Documentation Committee (IKDC) Subjective Knee Evaluation form currently exist: the original version (1999) and a recently modified pediatric-specific version (2011). Comparison of the pediatric IKDC with the adult version in the adult population may reveal that either version could be used longitudinally. We hypothesize that the scores for the adult IKDC and pediatric IKDC will not be clinically different among adult patients aged 18 to 50 years. Randomized crossover study design. Level 2. The study consisted of 100 participants, aged 18 to 50 years, who presented to orthopaedic outpatient clinics with knee problems. All participants completed both adult and pediatric versions of the IKDC in random order with a 10-minute break in between. We used a paired t test to test for a difference between the scores and a Welch's 2-sample t test to test for equivalence. A least-squares regression model was used to model adult scores as a function of pediatric scores, and vice versa. A paired t test revealed a statistically significant 1.6-point difference between the mean adult and pediatric scores. However, the 95% confidence interval (0.54-2.66) for this difference did not exceed our a priori threshold of 5 points, indicating that this difference was not clinically important. Equivalence testing with an equivalence region of 5 points further supported this finding. The adult and pediatric scores had a linear relationship and were highly correlated with an R 2 of 92.6%. There is no clinically relevant difference between the scores of the adult and pediatric IKDC forms in adults, aged 18 to 50 years, with knee conditions. Either form, adult or pediatric, of the IKDC can be used in this population for longitudinal studies. If the pediatric version is administered in adolescence, it can be used for follow-up into adulthood.
Stegmeier, Nicole; Oak, Sameer R.; O’Rourke, Colin; Strnad, Greg; Spindler, Kurt P.; Jones, Morgan; Farrow, Lutul D.; Andrish, Jack; Saluan, Paul
2017-01-01
Background: Two versions of the International Knee Documentation Committee (IKDC) Subjective Knee Evaluation form currently exist: the original version (1999) and a recently modified pediatric-specific version (2011). Comparison of the pediatric IKDC with the adult version in the adult population may reveal that either version could be used longitudinally. Hypothesis: We hypothesize that the scores for the adult IKDC and pediatric IKDC will not be clinically different among adult patients aged 18 to 50 years. Study Design: Randomized crossover study design. Level of Evidence: Level 2. Methods: The study consisted of 100 participants, aged 18 to 50 years, who presented to orthopaedic outpatient clinics with knee problems. All participants completed both adult and pediatric versions of the IKDC in random order with a 10-minute break in between. We used a paired t test to test for a difference between the scores and a Welch’s 2-sample t test to test for equivalence. A least-squares regression model was used to model adult scores as a function of pediatric scores, and vice versa. Results: A paired t test revealed a statistically significant 1.6-point difference between the mean adult and pediatric scores. However, the 95% confidence interval (0.54-2.66) for this difference did not exceed our a priori threshold of 5 points, indicating that this difference was not clinically important. Equivalence testing with an equivalence region of 5 points further supported this finding. The adult and pediatric scores had a linear relationship and were highly correlated with an R2 of 92.6%. Conclusion: There is no clinically relevant difference between the scores of the adult and pediatric IKDC forms in adults, aged 18 to 50 years, with knee conditions. Clinical Relevance: Either form, adult or pediatric, of the IKDC can be used in this population for longitudinal studies. If the pediatric version is administered in adolescence, it can be used for follow-up into adulthood. PMID:28080306
Brown, Dorothy Cimino; Bell, Margie; Rhodes, Linda
2013-12-01
To determine the optimal method for use of the Canine Brief Pain Inventory (CBPI) to quantitate responses of dogs with osteoarthritis to treatment with carprofen or placebo. 150 dogs with osteoarthritis. Data were analyzed from 2 studies with identical protocols in which owner-completed CBPIs were used. Treatment for each dog was classified as a success or failure by comparing the pain severity score (PSS) and pain interference score (PIS) on day 0 (baseline) with those on day 14. Treatment success or failure was defined on the basis of various combinations of reduction in the 2 scores when inclusion criteria were set as a PSS and PIS ≥ 1, 2, or 3 at baseline. Statistical analyses were performed to select the definition of treatment success that had the greatest statistical power to detect differences between carprofen and placebo treatments. Defining treatment success as a reduction of ≥ 1 in PSS and ≥ 2 in PIS in each dog had consistently robust power. Power was 62.8% in the population that included only dogs with baseline scores ≥ 2 and 64.7% in the population that included only dogs with baseline scores ≥ 3. The CBPI had robust statistical power to evaluate the treatment effect of carprofen in dogs with osteoarthritis when protocol success criteria were predefined as a reduction ≥ 1 in PIS and ≥ 2 in PSS. Results indicated the CBPI can be used as an outcome measure in clinical trials to evaluate new pain treatments when it is desirable to evaluate success in individual dogs rather than overall mean or median scores in a test population.
The effect of neck dissection on quality of life after chemoradiation.
Donatelli-Lassig, Amy Anne; Duffy, Sonia A; Fowler, Karen E; Ronis, David L; Chepeha, Douglas B; Terrell, Jeffrey E
2008-10-01
To determine differences in quality of life (QOL) between patients with head and neck cancer who receive chemoradiation versus chemoradiation and neck dissection. A prospective cohort study was conducted at two tertiary otolaryngology clinics and a Veterans Administration hospital. 103 oropharyngeal patients with Stage IV squamous cell carcinoma treated via chemoradiation +/- neck dissection. self-administered health survey to collect health, demographic, and QOL information pretreatment and 1 year later. QOL via SF-36 and HNQoL. Descriptive statistics were calculated for health/clinical characteristics, demographics, and QOL scores. t tests evaluated changes in QOL over time. Sixty-five patients underwent chemoradiation and 38 patients underwent chemoradiation and neck dissection. Only the pain index of the SF-36 showed a significant difference between groups (P < 0.05) with the neck dissection group reporting greater pain. After post-treatment neck dissection, patients experience statistically significant decrement in bodily pain domain scores, but other QOL scores are similar to those of patients who underwent chemoradiation alone.
The effect of neck dissection on quality of life after chemoradiation
Lassig, Amy Anne Donatelli; Duffy, Sonia A.; Fowler, Karen E.; Ronis, David L.; Chepeha, Douglas B.; Terrell, Jeffrey E.
2010-01-01
Objective To determine differences in QOL between head and neck cancer patients receiving chemoradiation versus chemoradiation and neck dissection. Methods A prospective cohort study was conducted at 2 tertiary otolaryngology clinics and a VA. Sample: 103 oropharyngeal Stage IV SCCA patients treated via chemoradiation +/− neck dissection. Intervention: self-administered health survey collecting health, demographic, and QOL information pretreatment and 1 year later. Main outcome measures: QOL via SF-36 and HNQoL. Descriptive statistics were calculated for health / clinical characteristics, demographics, and QOL scores. T-tests evaluated changes in QOL over time. Results 65 patients received chemoradiation and 38 chemoradiation + neck dissection. Only the pain index of the SF-36 showed a significant difference between groups (p<.05) with the neck dissection group reporting greater pain. Conclusions After post-treatment neck dissection, patients experience statistically significant decrement in bodily pain domain scores, but other QOL scores are similar to those of patients undergoing chemoradiation alone. PMID:18922336
Blended Learning Versus Traditional Lecture in Introductory Nursing Pathophysiology Courses.
Blissitt, Andrea Marie
2016-04-01
Currently, many undergraduate nursing courses use blended-learning course formats with success; however, little evidence exists that supports the use of blended formats in introductory pathophysiology courses. The purpose of this study was to compare the scores on pre- and posttests and course satisfaction between traditional and blended course formats in an introductory nursing pathophysiology course. This study used a quantitative, quasi-experimental, nonrandomized control group, pretest-posttest design. Analysis of covariance compared pre- and posttest scores, and a t test for independent samples compared students' reported course satisfaction of the traditional and blended course formats. Results indicated that the differences in posttest scores were not statistically significant between groups. Students in the traditional group reported statistically significantly higher satisfaction ratings than students in the blended group. The results of this study support the need for further research of using blended learning in introductory pathophysiology courses in undergraduate baccalaureate nursing programs. Further investigation into how satisfaction is affected by course formats is needed. Copyright 2016, SLACK Incorporated.
Test-retest stability of the Task and Ego Orientation Questionnaire.
Lane, Andrew M; Nevill, Alan M; Bowes, Neal; Fox, Kenneth R
2005-09-01
Establishing stability, defined as observing minimal measurement error in a test-retest assessment, is vital to validating psychometric tools. Correlational methods, such as Pearson product-moment, intraclass, and kappa are tests of association or consistency, whereas stability or reproducibility (regarded here as synonymous) assesses the agreement between test-retest scores. Indexes of reproducibility using the Task and Ego Orientation in Sport Questionnaire (TEOSQ; Duda & Nicholls, 1992) were investigated using correlational (Pearson product-moment, intraclass, and kappa) methods, repeated measures multivariate analysis of variance, and calculating the proportion of agreement within a referent value of +/-1 as suggested by Nevill, Lane, Kilgour, Bowes, and Whyte (2001). Two hundred thirteen soccer players completed the TEOSQ on two occasions, 1 week apart. Correlation analyses indicated a stronger test-retest correlation for the Ego subscale than the Task subscale. Multivariate analysis of variance indicated stability for ego items but with significant increases in four task items. The proportion of test-retest agreement scores indicated that all ego items reported relatively poor stability statistics with test-retest scores within a range of +/-1, ranging from 82.7-86.9%. By contrast, all task items showed test-retest difference scores ranging from 92.5-99%, although further analysis indicated that four task subscale items increased significantly. Findings illustrated that correlational methods (Pearson product-moment, intraclass, and kappa) are influenced by the range in scores, and calculating the proportion of agreement of test-retest differences with a referent value of +/-1 could provide additional insight into the stability of the questionnaire. It is suggested that the item-by-item proportion of agreement method proposed by Nevill et al. (2001) should be used to supplement existing methods and could be especially helpful in identifying rogue items in the initial stages of psychometric questionnaire validation.